1
|
Giraud-Gatineau A, Ayachit G, Nieves C, Dagbo KC, Bourhy K, Pulido F, Huete SG, Benaroudj N, Picardeau M, Veyrier FJ. Inter-species Transcriptomic Analysis Reveals a Constitutive Adaptation Against Oxidative Stress for the Highly Virulent Leptospira Species. Mol Biol Evol 2024; 41:msae066. [PMID: 38573174 PMCID: PMC11021026 DOI: 10.1093/molbev/msae066] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2023] [Revised: 02/29/2024] [Accepted: 03/07/2024] [Indexed: 04/05/2024] Open
Abstract
Transcriptomic analyses across large scales of evolutionary distance have great potential to shed light on regulatory evolution but are complicated by difficulties in establishing orthology and limited availability of accessible software. We introduce here a method and a graphical user interface wrapper, called Annotator-RNAtor, for performing interspecies transcriptomic analysis and studying intragenus evolution. The pipeline uses third-party software to infer homologous genes in various species and highlight differences in the expression of the core-genes. To illustrate the methodology and demonstrate its usefulness, we focus on the emergence of the highly virulent Leptospira subclade known as P1+, which includes the causative agents of leptospirosis. Here, we expand on the genomic study through the comparison of transcriptomes between species from P1+ and their related P1- counterparts (low-virulent pathogens). In doing so, we shed light on differentially expressed pathways and focused on describing a specific example of adaptation based on a differential expression of PerRA-controlled genes. We showed that P1+ species exhibit higher expression of the katE gene, a well-known virulence determinant in pathogenic Leptospira species correlated with greater tolerance to peroxide. Switching PerRA alleles between P1+ and P1- species demonstrated that the lower repression of katE and greater tolerance to peroxide in P1+ species was solely controlled by PerRA and partly caused by a PerRA amino-acid permutation. Overall, these results demonstrate the strategic fit of the methodology and its ability to decipher adaptive transcriptomic changes, not observable by comparative genome analysis, that may have been implicated in the emergence of these pathogens.
Collapse
Affiliation(s)
- Alexandre Giraud-Gatineau
- Microbiology Department, Institut Pasteur, Université Paris Cité, Biology of Spirochetes Unit, Paris, France
| | - Garima Ayachit
- INRS-Centre Armand-Frappier Santé Biotechnologie, Bacterial Symbionts Evolution, Laval, Quebec H7V 1B7, Canada
| | - Cecilia Nieves
- INRS-Centre Armand-Frappier Santé Biotechnologie, Bacterial Symbionts Evolution, Laval, Quebec H7V 1B7, Canada
| | - Kouessi C Dagbo
- INRS-Centre Armand-Frappier Santé Biotechnologie, Bacterial Symbionts Evolution, Laval, Quebec H7V 1B7, Canada
| | - Konogan Bourhy
- INRS-Centre Armand-Frappier Santé Biotechnologie, Bacterial Symbionts Evolution, Laval, Quebec H7V 1B7, Canada
| | - Francisco Pulido
- INRS-Centre Armand-Frappier Santé Biotechnologie, Bacterial Symbionts Evolution, Laval, Quebec H7V 1B7, Canada
| | - Samuel G Huete
- Microbiology Department, Institut Pasteur, Université Paris Cité, Biology of Spirochetes Unit, Paris, France
| | - Nadia Benaroudj
- Microbiology Department, Institut Pasteur, Université Paris Cité, Biology of Spirochetes Unit, Paris, France
| | - Mathieu Picardeau
- Microbiology Department, Institut Pasteur, Université Paris Cité, Biology of Spirochetes Unit, Paris, France
| | - Frédéric J Veyrier
- INRS-Centre Armand-Frappier Santé Biotechnologie, Bacterial Symbionts Evolution, Laval, Quebec H7V 1B7, Canada
| |
Collapse
|
2
|
Jurado MR, Tombor LS, Arsalan M, Holubec T, Emrich F, Walther T, Abplanalp W, Fischer A, Zeiher AM, Schulz MH, Dimmeler S, John D. Improved integration of single-cell transcriptome data demonstrates common and unique signatures of heart failure in mice and humans. Gigascience 2024; 13:giae011. [PMID: 38573186 PMCID: PMC10993718 DOI: 10.1093/gigascience/giae011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2023] [Revised: 01/17/2024] [Accepted: 03/06/2024] [Indexed: 04/05/2024] Open
Abstract
BACKGROUND Cardiovascular research heavily relies on mouse (Mus musculus) models to study disease mechanisms and to test novel biomarkers and medications. Yet, applying these results to patients remains a major challenge and often results in noneffective drugs. Therefore, it is an open challenge of translational science to develop models with high similarities and predictive value. This requires a comparison of disease models in mice with diseased tissue derived from humans. RESULTS To compare the transcriptional signatures at single-cell resolution, we implemented an integration pipeline called OrthoIntegrate, which uniquely assigns orthologs and therewith merges single-cell RNA sequencing (scRNA-seq) RNA of different species. The pipeline has been designed to be as easy to use and is fully integrable in the standard Seurat workflow.We applied OrthoIntegrate on scRNA-seq from cardiac tissue of heart failure patients with reduced ejection fraction (HFrEF) and scRNA-seq from the mice after chronic infarction, which is a commonly used mouse model to mimic HFrEF. We discovered shared and distinct regulatory pathways between human HFrEF patients and the corresponding mouse model. Overall, 54% of genes were commonly regulated, including major changes in cardiomyocyte energy metabolism. However, several regulatory pathways (e.g., angiogenesis) were specifically regulated in humans. CONCLUSIONS The demonstration of unique pathways occurring in humans indicates limitations on the comparability between mice models and human HFrEF and shows that results from the mice model should be validated carefully. OrthoIntegrate is publicly accessible (https://github.com/MarianoRuzJurado/OrthoIntegrate) and can be used to integrate other large datasets to provide a general comparison of models with patient data.
Collapse
Affiliation(s)
- Mariano Ruz Jurado
- Institute of Cardiovascular Regeneration, Goethe University Frankfurt, Theodor-Stern-Kai 7, 60590 Frankfurt am Main, Germany
- German Centre for Cardiovascular Research (DZHK), 60590 Frankfurt am Main, Germany
- Cardio-Pulmonary Institute (CPI), Goethe University Frankfurt, Theodor-Stern-Kai 7, 60590 Frankfurt am Main, Germany
| | - Lukas S Tombor
- Institute of Cardiovascular Regeneration, Goethe University Frankfurt, Theodor-Stern-Kai 7, 60590 Frankfurt am Main, Germany
- German Centre for Cardiovascular Research (DZHK), 60590 Frankfurt am Main, Germany
| | - Mani Arsalan
- Department of Cardiovascular Surgery, Goethe University Hospital, 60590 Frankfurt am Main, Germany
| | - Tomas Holubec
- Department of Cardiovascular Surgery, Goethe University Hospital, 60590 Frankfurt am Main, Germany
| | - Fabian Emrich
- Department of Cardiovascular Surgery, Goethe University Hospital, 60590 Frankfurt am Main, Germany
| | - Thomas Walther
- German Centre for Cardiovascular Research (DZHK), 60590 Frankfurt am Main, Germany
- Cardio-Pulmonary Institute (CPI), Goethe University Frankfurt, Theodor-Stern-Kai 7, 60590 Frankfurt am Main, Germany
- Department of Cardiovascular Surgery, Goethe University Hospital, 60590 Frankfurt am Main, Germany
| | - Wesley Abplanalp
- Institute of Cardiovascular Regeneration, Goethe University Frankfurt, Theodor-Stern-Kai 7, 60590 Frankfurt am Main, Germany
- German Centre for Cardiovascular Research (DZHK), 60590 Frankfurt am Main, Germany
- Cardio-Pulmonary Institute (CPI), Goethe University Frankfurt, Theodor-Stern-Kai 7, 60590 Frankfurt am Main, Germany
| | - Ariane Fischer
- Institute of Cardiovascular Regeneration, Goethe University Frankfurt, Theodor-Stern-Kai 7, 60590 Frankfurt am Main, Germany
| | - Andreas M Zeiher
- Institute of Cardiovascular Regeneration, Goethe University Frankfurt, Theodor-Stern-Kai 7, 60590 Frankfurt am Main, Germany
- German Centre for Cardiovascular Research (DZHK), 60590 Frankfurt am Main, Germany
- Cardio-Pulmonary Institute (CPI), Goethe University Frankfurt, Theodor-Stern-Kai 7, 60590 Frankfurt am Main, Germany
| | - Marcel H Schulz
- Institute of Cardiovascular Regeneration, Goethe University Frankfurt, Theodor-Stern-Kai 7, 60590 Frankfurt am Main, Germany
- German Centre for Cardiovascular Research (DZHK), 60590 Frankfurt am Main, Germany
- Cardio-Pulmonary Institute (CPI), Goethe University Frankfurt, Theodor-Stern-Kai 7, 60590 Frankfurt am Main, Germany
| | - Stefanie Dimmeler
- Institute of Cardiovascular Regeneration, Goethe University Frankfurt, Theodor-Stern-Kai 7, 60590 Frankfurt am Main, Germany
- German Centre for Cardiovascular Research (DZHK), 60590 Frankfurt am Main, Germany
- Cardio-Pulmonary Institute (CPI), Goethe University Frankfurt, Theodor-Stern-Kai 7, 60590 Frankfurt am Main, Germany
| | - David John
- Institute of Cardiovascular Regeneration, Goethe University Frankfurt, Theodor-Stern-Kai 7, 60590 Frankfurt am Main, Germany
- German Centre for Cardiovascular Research (DZHK), 60590 Frankfurt am Main, Germany
- Cardio-Pulmonary Institute (CPI), Goethe University Frankfurt, Theodor-Stern-Kai 7, 60590 Frankfurt am Main, Germany
| |
Collapse
|
3
|
Haldar A, Oza VH, DeVoss NS, Clark AD, Lasseigne BN. CoSIA: an R Bioconductor package for CrOss Species Investigation and Analysis. Bioinformatics 2023; 39:btad759. [PMID: 38109675 PMCID: PMC10749757 DOI: 10.1093/bioinformatics/btad759] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2023] [Revised: 10/30/2023] [Accepted: 12/16/2023] [Indexed: 12/20/2023] Open
Abstract
SUMMARY High-throughput sequencing technologies have enabled cross-species comparative transcriptomic studies; however, there are numerous challenges for these studies due to biological and technical factors. We developed CoSIA (Cross-Species Investigation and Analysis), a Bioconductor R package and Shiny app that provides an alternative framework for cross-species transcriptomic comparison of non-diseased wild-type RNA sequencing gene expression data from Bgee across tissues and species (human, mouse, rat, zebrafish, fly, and nematode) through visualization of variability, diversity, and specificity metrics. AVAILABILITY AND IMPLEMENTATION https://github.com/lasseignelab/CoSIA.
Collapse
Affiliation(s)
- Anisha Haldar
- The Department of Cell, Developmental and Integrative Biology, Heersink School of Medicine, The University of Alabama at Birmingham, Birmingham, AL 35294, United States
| | - Vishal H Oza
- The Department of Cell, Developmental and Integrative Biology, Heersink School of Medicine, The University of Alabama at Birmingham, Birmingham, AL 35294, United States
| | - Nathaniel S DeVoss
- The Department of Cell, Developmental and Integrative Biology, Heersink School of Medicine, The University of Alabama at Birmingham, Birmingham, AL 35294, United States
| | - Amanda D Clark
- The Department of Cell, Developmental and Integrative Biology, Heersink School of Medicine, The University of Alabama at Birmingham, Birmingham, AL 35294, United States
| | - Brittany N Lasseigne
- The Department of Cell, Developmental and Integrative Biology, Heersink School of Medicine, The University of Alabama at Birmingham, Birmingham, AL 35294, United States
| |
Collapse
|
4
|
Haldar A, Oza VH, DeVoss NS, Clark AD, Lasseigne BN. CoSIA: an R Bioconductor package for CrOss Species Investigation and Analysis. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.04.21.537877. [PMID: 37163017 PMCID: PMC10168259 DOI: 10.1101/2023.04.21.537877] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/11/2023]
Abstract
High throughput sequencing technologies have enabled cross-species comparative transcriptomic studies; however, there are numerous challenges for these studies due to biological and technical factors. We developed CoSIA (Cross-Species Investigation and Analysis), an Bioconductor R package and Shiny app that provides an alternative framework for cross-species transcriptomic comparison of non-diseased wild-type RNA sequencing gene expression data from Bgee across tissues and species (human, mouse, rat, zebrafish, fly, and nematode) through visualization of variability, diversity, and specificity metrics.
Collapse
Affiliation(s)
- Anisha Haldar
- The Department of Cell, Developmental and Integrative Biology, Heersink School of Medicine, The University of Alabama at Birmingham, Birmingham, Alabama, USA
| | - Vishal H. Oza
- The Department of Cell, Developmental and Integrative Biology, Heersink School of Medicine, The University of Alabama at Birmingham, Birmingham, Alabama, USA
| | - Nathaniel S. DeVoss
- The Department of Cell, Developmental and Integrative Biology, Heersink School of Medicine, The University of Alabama at Birmingham, Birmingham, Alabama, USA
| | - Amanda D. Clark
- The Department of Cell, Developmental and Integrative Biology, Heersink School of Medicine, The University of Alabama at Birmingham, Birmingham, Alabama, USA
| | - Brittany N. Lasseigne
- The Department of Cell, Developmental and Integrative Biology, Heersink School of Medicine, The University of Alabama at Birmingham, Birmingham, Alabama, USA
| |
Collapse
|
5
|
Bastide P, Soneson C, Stern DB, Lespinet O, Gallopin M. A Phylogenetic Framework to Simulate Synthetic Interspecies RNA-Seq Data. Mol Biol Evol 2023; 40:msac269. [PMID: 36508357 PMCID: PMC11249980 DOI: 10.1093/molbev/msac269] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2021] [Revised: 11/14/2022] [Accepted: 12/07/2022] [Indexed: 12/14/2022] Open
Abstract
Interspecies RNA-Seq datasets are increasingly common, and have the potential to answer new questions about the evolution of gene expression. Single-species differential expression analysis is now a well-studied problem that benefits from sound statistical methods. Extensive reviews on biological or synthetic datasets have provided the community with a clear picture on the relative performances of the available methods in various settings. However, synthetic dataset simulation tools are still missing in the interspecies gene expression context. In this work, we develop and implement a new simulation framework. This tool builds on both the RNA-Seq and the phylogenetic comparative methods literatures to generate realistic count datasets, while taking into account the phylogenetic relationships between the samples. We illustrate the usefulness of this new framework through a targeted simulation study, that reproduces the features of a recently published dataset, containing gene expression data in adult eye tissue across blind and sighted freshwater crayfish species. Using our simulated datasets, we perform a fair comparison of several approaches used for differential expression analysis. This benchmark reveals some of the strengths and weaknesses of both the classical and phylogenetic approaches for interspecies differential expression analysis, and allows for a reanalysis of the crayfish dataset. The tool has been integrated in the R package compcodeR, freely available on Bioconductor.
Collapse
Affiliation(s)
- Paul Bastide
- IMAG, Université de Montpellier, CNRS, Montpellier, France
| | - Charlotte Soneson
- Friedrich Miescher Institute for Biomedical Research, 4058 Basel, Switzerland
- SIB Swiss Institute of Bioinformatics, 4058 Basel, Switzerland
| | - David B Stern
- Department of Integrative Biology, University of Wisconsin-Madison, 430 Lincoln Drive, Madison, WI 53706, USA
| | - Olivier Lespinet
- Institute for Integrative Biology of the Cell (I2BC), Université Paris-Saclay, CEA, CNRS, 91198 Gif-sur-Yvette, France
| | - Mélina Gallopin
- Institute for Integrative Biology of the Cell (I2BC), Université Paris-Saclay, CEA, CNRS, 91198 Gif-sur-Yvette, France
| |
Collapse
|
6
|
Breeschoten T, Schranz ME, Poelman EH, Simon S. Family dinner: Transcriptional plasticity of five Noctuidae (Lepidoptera) feeding on three host plant species. Ecol Evol 2022; 12:e9258. [PMID: 36091341 PMCID: PMC9448971 DOI: 10.1002/ece3.9258] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2022] [Accepted: 08/08/2022] [Indexed: 11/30/2022] Open
Abstract
Polyphagous insects often show specialization in feeding on different host plants in terms of survival and growth and, therefore, can be considered minor or major pests of particular hosts. Whether polyphagous insects employ a common transcriptional response to cope with defenses from diverse host plants is under-studied. We focused on patterns of transcriptional plasticity in polyphagous moths (Noctuidae), of which many species are notorious pests, in relation to herbivore performance on different host plants. We compared the transcriptional plasticity of five polyphagous moth species feeding and developing on three different host plant species. Using a comparative phylogenetic framework, we evaluated if successful herbivory, as measured by larval performance, is determined by a shared or lineage-specific transcriptional response. The upregulated transcriptional activity, or gene expression pattern, of larvae feeding on the different host plants and artificial control diet was highly plastic and moth species-specific. Specialization, defined as high herbivore success for specific host plants, was not generally linked to a lower number of induced genes. Moths that were more distantly related and showing high herbivore success for certain host plants showed shared expression of multiple homologous genes, indicating convergence. We further observed specific transcriptional responses within phylogenetic lineages. These expression patterns for specific host plant species are likely caused by shared evolutionary histories, for example, symplesiomorphic patterns, and could therefore not be associated with herbivore success alone. Multiple gene families, with roles in plant digestion and detoxification, were widely expressed in response to host plant feeding but again showed highly moth species-specific. Consequently, high herbivore success for specific host plants is also driven by species-specific transcriptional plasticity. Thus, potential pest moths display a complex and species-specific transcriptional plasticity.
Collapse
Affiliation(s)
- Thijmen Breeschoten
- Biosystematics GroupWageningen University & ResearchWageningenThe Netherlands
| | - M. Eric Schranz
- Biosystematics GroupWageningen University & ResearchWageningenThe Netherlands
| | - Erik H. Poelman
- Laboratory of EntomologyWageningen University & ResearchWageningenThe Netherlands
| | - Sabrina Simon
- Biosystematics GroupWageningen University & ResearchWageningenThe Netherlands
| |
Collapse
|
7
|
Liang Y, Wang M, Liu Y, Wang C, Takahashi K, Naruse K. Meta-Analysis-Assisted Detection of Gravity-Sensitive Genes in Human Vascular Endothelial Cells. Front Cell Dev Biol 2021; 9:689662. [PMID: 34422812 PMCID: PMC8371407 DOI: 10.3389/fcell.2021.689662] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2021] [Accepted: 07/14/2021] [Indexed: 12/18/2022] Open
Abstract
Gravity affects the function and maintenance of organs, such as bones, muscles, and the heart. Several studies have used DNA microarrays to identify genes with altered expressions in response to gravity. However, it is technically challenging to combine the results from various microarray datasets because of their different data structures. We hypothesized that it is possible to identify common changes in gene expression from the DNA microarray datasets obtained under various conditions and methods. In this study, we grouped homologous genes to perform a meta-analysis of multiple vascular endothelial cell and skeletal muscle datasets. According to the t-distributed stochastic neighbor embedding (t-SNE) analysis, the changes in the gene expression pattern in vascular endothelial cells formed specific clusters. We also identified candidate genes in endothelial cells that responded to gravity. Further, we exposed human umbilical vein endothelial cells (HUVEC) to simulated microgravity (SMG) using a clinostat and measured the expression levels of the candidate genes. Gene expression analysis using qRT-PCR revealed that the expression level of the prostaglandin (PG) transporter gene SLCO2A1 decreased in response to microgravity, consistent with the meta-analysis of microarray datasets. Furthermore, the direction of gravity affected the expression level of SLCO2A1, buttressing the finding that its expression was affected by gravity. These results suggest that a meta-analysis of DNA microarray datasets may help identify new target genes previously overlooked in individual microarray analyses.
Collapse
Affiliation(s)
- Yin Liang
- Department of Cardiovascular Physiology, Graduate School of Medicine, Dentistry and Pharmaceutical Sciences, Okayama University, Okayama, Japan
| | - Mengxue Wang
- Department of Cardiovascular Physiology, Graduate School of Medicine, Dentistry and Pharmaceutical Sciences, Okayama University, Okayama, Japan
| | - Yun Liu
- Department of Cardiovascular Physiology, Graduate School of Medicine, Dentistry and Pharmaceutical Sciences, Okayama University, Okayama, Japan
| | - Chen Wang
- Department of Cardiovascular Physiology, Graduate School of Medicine, Dentistry and Pharmaceutical Sciences, Okayama University, Okayama, Japan
| | - Ken Takahashi
- Department of Cardiovascular Physiology, Graduate School of Medicine, Dentistry and Pharmaceutical Sciences, Okayama University, Okayama, Japan
| | - Keiji Naruse
- Department of Cardiovascular Physiology, Graduate School of Medicine, Dentistry and Pharmaceutical Sciences, Okayama University, Okayama, Japan
| |
Collapse
|
8
|
Sun YE, Zhou HJ, Li JJ. Bipartite tight spectral clustering (BiTSC) algorithm for identifying conserved gene co-clusters in two species. Bioinformatics 2021; 37:1225-1233. [PMID: 32814973 DOI: 10.1093/bioinformatics/btaa741] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2019] [Revised: 05/20/2020] [Accepted: 08/13/2020] [Indexed: 11/12/2022] Open
Abstract
MOTIVATION Gene clustering is a widely used technique that has enabled computational prediction of unknown gene functions within a species. However, it remains a challenge to refine gene function prediction by leveraging evolutionarily conserved genes in another species. This challenge calls for a new computational algorithm to identify gene co-clusters in two species, so that genes in each co-cluster exhibit similar expression levels in each species and strong conservation between the species. RESULTS Here, we develop the bipartite tight spectral clustering (BiTSC) algorithm, which identifies gene co-clusters in two species based on gene orthology information and gene expression data. BiTSC novelly implements a formulation that encodes gene orthology as a bipartite network and gene expression data as node covariates. This formulation allows BiTSC to adopt and combine the advantages of multiple unsupervised learning techniques: kernel enhancement, bipartite spectral clustering, consensus clustering, tight clustering and hierarchical clustering. As a result, BiTSC is a flexible and robust algorithm capable of identifying informative gene co-clusters without forcing all genes into co-clusters. Another advantage of BiTSC is that it does not rely on any distributional assumptions. Beyond cross-species gene co-clustering, BiTSC also has wide applications as a general algorithm for identifying tight node co-clusters in any bipartite network with node covariates. We demonstrate the accuracy and robustness of BiTSC through comprehensive simulation studies. In a real data example, we use BiTSC to identify conserved gene co-clusters of Drosophila melanogaster and Caenorhabditis elegans, and we perform a series of downstream analysis to both validate BiTSC and verify the biological significance of the identified co-clusters. AVAILABILITY AND IMPLEMENTATION The Python package BiTSC is open-access and available at https://github.com/edensunyidan/BiTSC. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Yidan Eden Sun
- Department of Statistics, University of California, Los Angeles, CA 90095-1554, USA
| | - Heather J Zhou
- Department of Statistics, University of California, Los Angeles, CA 90095-1554, USA
| | - Jingyi Jessica Li
- Department of Statistics, University of California, Los Angeles, CA 90095-1554, USA.,Department of Human Genetics, University of California, Los Angeles, CA 90095-7088, USA.,Department of Computational Medicine, University of California, Los Angeles, CA 90095-1766, USA
| |
Collapse
|
9
|
AbdelAziz AM, Soliman T, Ghany KKA, Sewisy A. A hybrid multi-objective whale optimization algorithm for analyzing microarray data based on Apache Spark. PeerJ Comput Sci 2021; 7:e416. [PMID: 33834101 PMCID: PMC8022636 DOI: 10.7717/peerj-cs.416] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2020] [Accepted: 02/05/2021] [Indexed: 06/12/2023]
Abstract
A microarray is a revolutionary tool that generates vast volumes of data that describe the expression profiles of genes under investigation that can be qualified as Big Data. Hadoop and Spark are efficient frameworks, developed to store and analyze Big Data. Analyzing microarray data helps researchers to identify correlated genes. Clustering has been successfully applied to analyze microarray data by grouping genes with similar expression profiles into clusters. The complex nature of microarray data obligated clustering methods to employ multiple evaluation functions to ensure obtaining solutions with high quality. This transformed the clustering problem into a Multi-Objective Problem (MOP). A new and efficient hybrid Multi-Objective Whale Optimization Algorithm with Tabu Search (MOWOATS) was proposed to solve MOPs. In this article, MOWOATS is proposed to analyze massive microarray datasets. Three evaluation functions have been developed to ensure an effective assessment of solutions. MOWOATS has been adapted to run in parallel using Spark over Hadoop computing clusters. The quality of the generated solutions was evaluated based on different indices, such as Silhouette and Davies-Bouldin indices. The obtained clusters were very similar to the original classes. Regarding the scalability, the running time was inversely proportional to the number of computing nodes.
Collapse
Affiliation(s)
| | - Taysir Soliman
- Faculty of Computers and Information, Assiut University, Egypt
| | - Kareem Kamal A. Ghany
- Faculty of Computers and Artificial Intelligence, Beni-Suef University, Egypt
- College of Computing and Informatics, Saudi Electronic University, Riyadh, KSA
| | - Adel Sewisy
- Faculty of Computers and Information, Assiut University, Egypt
| |
Collapse
|
10
|
Li A, Li A, Deng Z, Guo J, Wu H. Cross-Species Annotation of Expressed Genes and Detection of Different Functional Gene Modules Between 10 Cold- and 10 Hot-Propertied Chinese Herbal Medicines. Front Genet 2020; 11:532. [PMID: 32625232 PMCID: PMC7314971 DOI: 10.3389/fgene.2020.00532] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2020] [Accepted: 05/04/2020] [Indexed: 12/11/2022] Open
Abstract
According to the traditional Chinese medicine (TCM) system, Chinese herbal medicines (HMs) can be divided into four categories: hot, warm, cold, and cool. A cool nature usually is categorized as a cold nature, and a warm nature is classified as a hot nature. However, the detectable characteristics of the gene expression profile associated with the cold and hot properties have not been studied. To address this question, a strategy for the cross-species annotation of conserved genes was established in the present study by using transcriptome data of 20 HMs with cold and hot properties. Functional enrichment analysis was performed on group-specific expressed genes inferred from the functional genome of the reference species (i.e., Arabidopsis). Results showed that metabolic pathways relevant to chrysoeriol, luteolin, paniculatin, and wogonin were enriched for cold-specific genes, and pathways of inositol, heptadecane, lauric acid, octanoic acid, hexadecanoic acid, and pentadecanoic acid were enriched for hot-specific genes. Six functional modules were identified in the HMs with the cold property: nucleotide biosynthetic process, peptidy-L-cysteine S-palmitoylation, lipid modification, base-excision repair, dipeptide transport, and response to endoplasmic reticulum stress. For the hot HMs, another six functional modules were identified: embryonic meristem development, embryonic pattern specification, axis specification, regulation of RNA polymerase II transcriptional preinitiation complex assembly, mitochondrial RNA modification, and cell redox homeostasis. The research provided a new insight into HMs’ cold and hot properties from the perspective of the gene expression profile of plants.
Collapse
Affiliation(s)
- Arong Li
- Guangzhou Hospital of Traditional Chinese Medicine, Guangzhou University of Chinese Medicine, Guangzhou, China.,Department of Pharmacy, Guangdong Hospital of Traditional Chinese Medicine, Guangzhou, China
| | - Aqian Li
- Guangdong Key Laboratory of Mental Health and Cognitive Science, Center for Studies of Psychological Application, School of Psychology, South China Normal University, Guangzhou, China
| | - Zhijun Deng
- Department of Pharmacy, Guangdong Hospital of Traditional Chinese Medicine, Guangzhou, China
| | - Jiewen Guo
- Guangzhou Hospital of Traditional Chinese Medicine, Guangzhou University of Chinese Medicine, Guangzhou, China.,Department of Pharmacy, Guangdong Hospital of Traditional Chinese Medicine, Guangzhou, China
| | - Hongkai Wu
- State Key Laboratory of Respiratory Disease, National Clinical Research Center for Respiratory Disease, Guangzhou Institute of Respiratory Health, The First Affiliated Hospital of Guangzhou Medical University, Guangzhou Medical University, Guangzhou, China
| |
Collapse
|
11
|
Ng WL, Wu W, Zou P, Zhou R. Comparative transcriptomics sheds light on differential adaptation and species diversification between two Melastoma species and their F 1 hybrid. AOB PLANTS 2019; 11:plz019. [PMID: 31037213 PMCID: PMC6481908 DOI: 10.1093/aobpla/plz019] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/16/2018] [Accepted: 03/27/2019] [Indexed: 06/09/2023]
Abstract
Variation in gene expression has been shown to promote adaptive divergence, and can lead to speciation. The plant genus Melastoma, thought to have diversified through adaptive radiation, provides an excellent model for the study of gene expressional changes during adaptive differentiation and following interspecific hybridization. In this study, we performed RNA-seq on M. candidum, M. sanguineum and their F1 hybrid, to investigate the role of gene expression in species diversification within the genus. Reference transcriptomes were assembled using combined data from both parental species, resulting in 50 519 and 48 120 transcripts for the leaf and flower petal, after removing redundancy. Differential expression analysis uncovered 3793 and 2116 differentially expressed (DE) transcripts, most of which are between M. candidum and M. sanguineum. Differential expression was observed for genes related to light responses, as well as genes that regulate the development of leaf trichomes, a trait that among others is thought to protect plants against sunlight, suggesting the differential adaptation of the species to sunlight intensity. The analysis of positively selected genes between the two species also revealed possible differential adaptation to other abiotic stresses such as drought and temperature. In the hybrid, almost all possible modes of expression were observed at the DE transcripts, although at most transcripts, the expression levels were similar to that of either parent instead of being intermediate. A small number of transgressively expressed transcripts that matched genes known to promote plant growth and adaptation to stresses in new environments were also found, possibly explaining the vigour observed in the hybrid. The findings in this study provided insights into the role of gene expression in the diversification of Melastoma, which we believe is an important example for more cross-taxa comparisons in the future.
Collapse
Affiliation(s)
- Wei Lun Ng
- State Key Laboratory of Biocontrol and Guangdong Provincial Key Laboratory of Plant Resources, School of Life Sciences, Sun Yat-sen University, Guangzhou, Guangdong, China
- China-ASEAN College of Marine Sciences, Xiamen University Malaysia, Sepang, Selangor, Malaysia
| | - Wei Wu
- State Key Laboratory of Biocontrol and Guangdong Provincial Key Laboratory of Plant Resources, School of Life Sciences, Sun Yat-sen University, Guangzhou, Guangdong, China
| | - Peishan Zou
- State Key Laboratory of Biocontrol and Guangdong Provincial Key Laboratory of Plant Resources, School of Life Sciences, Sun Yat-sen University, Guangzhou, Guangdong, China
| | - Renchao Zhou
- State Key Laboratory of Biocontrol and Guangdong Provincial Key Laboratory of Plant Resources, School of Life Sciences, Sun Yat-sen University, Guangzhou, Guangdong, China
| |
Collapse
|
12
|
A statistical normalization method and differential expression analysis for RNA-seq data between different species. BMC Bioinformatics 2019; 20:163. [PMID: 30925894 PMCID: PMC6441199 DOI: 10.1186/s12859-019-2745-1] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2018] [Accepted: 03/18/2019] [Indexed: 02/06/2023] Open
Abstract
Background High-throughput techniques bring novel tools and also statistical challenges to genomic research. Identifying genes with differential expression between different species is an effective way to discover evolutionarily conserved transcriptional responses. To remove systematic variation between different species for a fair comparison, normalization serves as a crucial pre-processing step that adjusts for the varying sample sequencing depths and other confounding technical effects. Results In this paper, we propose a scale based normalization (SCBN) method by taking into account the available knowledge of conserved orthologous genes and by using the hypothesis testing framework. Considering the different gene lengths and unmapped genes between different species, we formulate the problem from the perspective of hypothesis testing and search for the optimal scaling factor that minimizes the deviation between the empirical and nominal type I errors. Conclusions Simulation studies show that the proposed method performs significantly better than the existing competitor in a wide range of settings. An RNA-seq dataset of different species is also analyzed and it coincides with the conclusion that the proposed method outperforms the existing method. For practical applications, we have also developed an R package named “SCBN”, which is freely available at http://www.bioconductor.org/packages/devel/bioc/html/SCBN.html. Electronic supplementary material The online version of this article (10.1186/s12859-019-2745-1) contains supplementary material, which is available to authorized users.
Collapse
|
13
|
Dissecting clinical outcome of porcine circovirus type 2 with in vivo derived transcriptomic signatures of host tissue responses. BMC Genomics 2018; 19:831. [PMID: 30458705 PMCID: PMC6247532 DOI: 10.1186/s12864-018-5217-5] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2018] [Accepted: 10/31/2018] [Indexed: 12/30/2022] Open
Abstract
Background Porcine Circovirus Type 2 (PCV2) is a pathogen that has the ability to cause often devastating disease manifestations in pig populations with major economic implications. How PCV2 establishes subclinical persistence and why certain individuals progress to lethal lymphoid depletion remain to be elucidated. Results Here we present PorSignDB, a gene signature database describing in vivo porcine tissue physiology that we generated from a large compendium of in vivo transcriptional profiles and that we subsequently leveraged for deciphering the distinct physiological states underlying PCV2-affected lymph nodes. This systems genomics approach indicated that subclinical PCV2 infections suppress a myeloid leukocyte mediated immune response. However, in contrast an inflammatory myeloid cell activation is promoted in PCV2 patients with clinical manifestations. Functional genomics further uncovered STAT3 as a druggable PCV2 host factor candidate. Moreover, IL-2 supplementation of primary lymphocytes enabled ex vivo study of PCV2 replication in its target cell, the lymphoblast. Conclusion Our systematic dissection of the mechanistic basis of PCV2 reveals that subclinical and clinical PCV2 display two diametrically opposed immunotranscriptomic recalibrations that represent distinct physiological states in vivo, which suggests a paradigm shift in this field. Finally, our PorSignDB signature database is publicly available as a community resource (http://www.vetvirology.ugent.be/PorSignDB/, included in Gene Sets from Community Contributors http://software.broadinstitute.org/gsea/msigdb/contributed_genesets.jsp) and provides systems biologists with a valuable tool for catalyzing studies of human and veterinary disease. Finally, a primary porcine lymphoblast cell culture system paves the way for unraveling the impact of host genetics on PCV2 replication. Electronic supplementary material The online version of this article (10.1186/s12864-018-5217-5) contains supplementary material, which is available to authorized users.
Collapse
|
14
|
Wu HM, Tien YJ, Ho MR, Hwu HG, Lin WC, Tao MH, Chen CH. Covariate-adjusted heatmaps for visualizing biological data via correlation decomposition. Bioinformatics 2018; 34:3529-3538. [PMID: 29718246 DOI: 10.1093/bioinformatics/bty335] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2017] [Accepted: 04/25/2018] [Indexed: 11/13/2022] Open
Abstract
Motivation Heatmap is a popular visualization technique in biology and related fields. In this study, we extend heatmaps within the framework of matrix visualization (MV) by incorporating a covariate adjustment process through the estimation of conditional correlations. MV can explore the embedded information structure of high-dimensional large-scale datasets effectively without dimension reduction. The benefit of the proposed covariate-adjusted heatmap is in the exploration of conditional association structures among the subjects or variables that cannot be done with conventional MV. Results For adjustment of a discrete covariate, the conditional correlation is estimated by the within and between analysis. This procedure decomposes a correlation matrix into the within- and between-component matrices. The contribution of the covariate effects can then be assessed through the relative structure of the between-component to the original correlation matrix while the within-component acts as a residual. When a covariate is of continuous nature, the conditional correlation is equivalent to the partial correlation under the assumption of a joint normal distribution. A test is then employed to identify the variable pairs which possess the most significant differences at varying levels of correlation before and after a covariate adjustment. In addition, a z-score significance map is constructed to visualize these results. A simulation and three biological datasets are employed to illustrate the power and versatility of our proposed method. Availability and implementation GAP is available to readers and is free to non-commercial applications. The installation instructions, the user's manual, and the detailed tutorials can be found at http://gap.stat.sinica.edu.tw/Software/GAP. Supplementary information Supplementary Data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Han-Ming Wu
- Department of Statistics, National Taipei University, New Taipei City, Taiwan, R.O.C
| | - Yin-Jing Tien
- Digital Transformation Institute, Institute for Information Industry, Taipei, Taiwan, R.O.C
| | | | - Hai-Gwo Hwu
- Department of Psychiatry, National Taiwan University Hospital and College of Medicine, Taipei, Taiwan, R.O.C
| | - Wen-Chang Lin
- Institute of Biomedical Sciences, Academia Sinica, Taipei, Taiwan, R.O.C
| | - Mi-Hua Tao
- Institute of Biomedical Sciences, Academia Sinica, Taipei, Taiwan, R.O.C
| | - Chun-Houh Chen
- Institute of Statistical Science, Academia Sinica, Taipei, Taiwan, R.O.C
| |
Collapse
|
15
|
A Robust Distributed Big Data Clustering-based on Adaptive Density Partitioning using Apache Spark. Symmetry (Basel) 2018. [DOI: 10.3390/sym10080342] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022] Open
Abstract
Unsupervised machine learning and knowledge discovery from large-scale datasets have recently attracted a lot of research interest. The present paper proposes a distributed big data clustering approach-based on adaptive density estimation. The proposed method is developed-based on Apache Spark framework and tested on some of the prevalent datasets. In the first step of this algorithm, the input data is divided into partitions using a Bayesian type of Locality Sensitive Hashing (LSH). Partitioning makes the processing fully parallel and much simpler by avoiding unneeded calculations. Each of the proposed algorithm steps is completely independent of the others and no serial bottleneck exists all over the clustering procedure. Locality preservation also filters out the outliers and enhances the robustness of the proposed approach. Density is defined on the basis of Ordered Weighted Averaging (OWA) distance which makes clusters more homogenous. According to the density of each node, the local density peaks will be detected adaptively. By merging the local peaks, final cluster centers will be obtained and other data points will be a member of the cluster with the nearest center. The proposed method has been implemented and compared with similar recently published researches. Cluster validity indexes achieved from the proposed method shows its superiorities in precision and noise robustness in comparison with recent researches. Comparison with similar approaches also shows superiorities of the proposed method in scalability, high performance, and low computation cost. The proposed method is a general clustering approach and it has been used in gene expression clustering as a sample of its application.
Collapse
|
16
|
Jung J, Kim GW, Lee W, Mok C, Chung SH, Jang W. Meta- and cross-species analyses of insulin resistance based on gene expression datasets in human white adipose tissues. Sci Rep 2018; 8:3747. [PMID: 29487289 PMCID: PMC5829071 DOI: 10.1038/s41598-017-18082-7] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2017] [Accepted: 12/06/2017] [Indexed: 01/08/2023] Open
Abstract
Ample evidence indicates that insulin resistance (IR) is closely related to white adipose tissue (WAT), but the underlying mechanisms of IR pathogenesis are still unclear. Using 352 microarray datasets from seven independent studies, we identified a meta-signature which comprised of 1,413 genes. Our meta-signature was also enriched in overall WAT in in vitro and in vivo IR models. Only 12 core enrichment genes were consistently enriched across all IR models. Among the meta-signature, we identified a drug signature made up of 211 genes with expression levels that were co-regulated by thiazolidinediones and metformin using cross-species analysis. To confirm the clinical relevance of our drug signature, we found that the expression levels of 195 genes in the drug signature were significantly correlated with both homeostasis model assessment 2-IR score and body mass index. Finally, 18 genes from the drug signature were identified by protein-protein interaction network cluster. Four core enrichment genes were included in 18 genes and the expression levels of selected 8 genes were validated by quantitative PCR. These findings suggest that our signatures provide a robust set of genetic markers which can be used to provide a starting point for developing potential therapeutic targets in improving IR in WAT.
Collapse
Affiliation(s)
- Junghyun Jung
- Department of Life Science, Dongguk University, 30 Pildong ro 1-gil, 04620, Seoul, Korea
| | - Go Woon Kim
- Department of Pharmacology, College of Pharmacy, Kyung Hee University, 26 Kyungheedae-ro, 02447, Seoul, Korea
| | - Woosuk Lee
- Department of Life Science, Dongguk University, 30 Pildong ro 1-gil, 04620, Seoul, Korea
| | - Changsoo Mok
- Department of Life Science, Dongguk University, 30 Pildong ro 1-gil, 04620, Seoul, Korea
| | - Sung Hyun Chung
- Department of Pharmacology, College of Pharmacy, Kyung Hee University, 26 Kyungheedae-ro, 02447, Seoul, Korea
| | - Wonhee Jang
- Department of Life Science, Dongguk University, 30 Pildong ro 1-gil, 04620, Seoul, Korea.
| |
Collapse
|
17
|
Hoffman AM, Smith MD. Gene expression differs in codominant prairie grasses under drought. Mol Ecol Resour 2017; 18:334-346. [DOI: 10.1111/1755-0998.12733] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2017] [Revised: 10/01/2017] [Accepted: 10/17/2017] [Indexed: 11/28/2022]
Affiliation(s)
- Ava M. Hoffman
- Department of Biology and Graduate Degree Program in Ecology Colorado State University Fort Collins CO USA
| | - Melinda D. Smith
- Department of Biology and Graduate Degree Program in Ecology Colorado State University Fort Collins CO USA
| |
Collapse
|
18
|
Djordjevic D, Kusumi K, Ho JWK. XGSA: A statistical method for cross-species gene set analysis. Bioinformatics 2017; 32:i620-i628. [PMID: 27587682 DOI: 10.1093/bioinformatics/btw428] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2023] Open
Abstract
MOTIVATION Gene set analysis is a powerful tool for determining whether an experimentally derived set of genes is statistically significantly enriched for genes in other pre-defined gene sets, such as known pathways, gene ontology terms, or other experimentally derived gene sets. Current gene set analysis methods do not facilitate comparing gene sets across different organisms as they do not explicitly deal with homology mapping between species. There lacks a systematic investigation about the effect of complex gene homology on cross-species gene set analysis. RESULTS In this study, we show that not accounting for the complex homology structure when comparing gene sets in two species can lead to false positive discoveries, especially when comparing gene sets that have complex gene homology relationships. To overcome this bias, we propose a straightforward statistical approach, called XGSA, that explicitly takes the cross-species homology mapping into consideration when doing gene set analysis. Simulation experiments confirm that XGSA can avoid false positive discoveries, while maintaining good statistical power compared to other ad hoc approaches for cross-species gene set analysis. We further demonstrate the effectiveness of XGSA with two real-life case studies that aim to discover conserved or species-specific molecular pathways involved in social challenge and vertebrate appendage regeneration. AVAILABILITY AND IMPLEMENTATION The R source code for XGSA is available under a GNU General Public License at http://github.com/VCCRI/XGSA CONTACT: jho@victorchang.edu.au.
Collapse
Affiliation(s)
- Djordje Djordjevic
- Victor Chang Cardiac Research Institute, Darlinghurst, NSW 2010, Australia, St Vincent's Clinical School, University of New South Wales Australia, Darlinghurst, NSW 2010, Australia
| | - Kenro Kusumi
- School of Life Sciences, Arizona State University, Tempe, AZ 85287, USA
| | - Joshua W K Ho
- Victor Chang Cardiac Research Institute, Darlinghurst, NSW 2010, Australia, St Vincent's Clinical School, University of New South Wales Australia, Darlinghurst, NSW 2010, Australia
| |
Collapse
|
19
|
Koch C, Konieczka J, Delorey T, Lyons A, Socha A, Davis K, Knaack SA, Thompson D, O'Shea EK, Regev A, Roy S. Inference and Evolutionary Analysis of Genome-Scale Regulatory Networks in Large Phylogenies. Cell Syst 2017; 4:543-558.e8. [PMID: 28544882 PMCID: PMC5515301 DOI: 10.1016/j.cels.2017.04.010] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2016] [Revised: 02/20/2017] [Accepted: 04/26/2017] [Indexed: 11/22/2022]
Abstract
Changes in transcriptional regulatory networks can significantly contribute to species evolution and adaptation. However, identification of genome-scale regulatory networks is an open challenge, especially in non-model organisms. Here, we introduce multi-species regulatory network learning (MRTLE), a computational approach that uses phylogenetic structure, sequence-specific motifs, and transcriptomic data, to infer the regulatory networks in different species. Using simulated data from known networks and transcriptomic data from six divergent yeasts, we demonstrate that MRTLE predicts networks with greater accuracy than existing methods because it incorporates phylogenetic information. We used MRTLE to infer the structure of the transcriptional networks that control the osmotic stress responses of divergent, non-model yeast species and then validated our predictions experimentally. Interrogating these networks reveals that gene duplication promotes network divergence across evolution. Taken together, our approach facilitates study of regulatory network evolutionary dynamics across multiple poorly studied species.
Collapse
Affiliation(s)
- Christopher Koch
- Department of Computer Sciences, University of Wisconsin-Madison, Madison, Wl, USA
| | - Jay Konieczka
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA
| | - Toni Delorey
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA
| | - Ana Lyons
- Department of Biology, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA
| | - Amanda Socha
- Dartmouth College, Biology department, Hanover, NH 03755, USA
| | - Kathleen Davis
- Department of Molecular and Cell Biology, University of California, Berkeley, California 94720, USA
| | - Sara A Knaack
- Wisconsin Institute for Discovery, 330 N. Orchard Street, Madison, Wl, USA
| | - Dawn Thompson
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA
| | - Erin K O'Shea
- Department of Chemistry and Chemical Biology, Harvard University, Cambridge, Massachusetts, USA
- Howard Hughes Medical Institute, Harvard University, Northwest Laboratory, Cambridge, Massachusetts, USA
- Faculty of Arts and Sciences Center for Systems Biology, Harvard University, Northwest Laboratory, Cambridge, Massachusetts, USA
- Department of Molecular and Cellular Biology, Harvard University, Northwest Laboratory, Cambridge, Massachusetts, USA
| | - Aviv Regev
- Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142, USA
- Howard Hughes Medical Institute, Chevy Chase, Maryland, USA
| | - Sushmita Roy
- Wisconsin Institute for Discovery, 330 N. Orchard Street, Madison, Wl, USA
- Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, Wl, USA
| |
Collapse
|
20
|
Biase FH. Oocyte Developmental Competence: Insights from Cross-Species Differential Gene Expression and Human Oocyte-Specific Functional Gene Networks. ACTA ACUST UNITED AC 2017; 21:156-168. [DOI: 10.1089/omi.2016.0177] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
|
21
|
Seo M, Caetano-Anolles K, Rodriguez-Zas S, Ka S, Jeong JY, Park S, Kim MJ, Nho WG, Cho S, Kim H, Lee HJ. Comprehensive identification of sexually dimorphic genes in diverse cattle tissues using RNA-seq. BMC Genomics 2016; 17:81. [PMID: 26818975 PMCID: PMC4728830 DOI: 10.1186/s12864-016-2400-4] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2015] [Accepted: 01/18/2016] [Indexed: 01/21/2023] Open
Abstract
BACKGROUND Molecular mechanisms associated with sexual dimorphism in cattle have not been well elucidated. Furthermore, as recent studies have implied that gene expression patterns are highly tissue specific, it is essential to investigate gene expression in a variety of tissues using RNA-seq. Here, we employed and compared two statistical methods, a simple two group test and Analysis of deviance (ANODEV), in order to investigate bovine sexually dimorphic genes in 40 RNA-seq samples distributed across two factors: sex and tissue. RESULTS As a result, we detected 752 sexually dimorphic genes across tissues from two statistical approaches and identified strong tissue-specific patterns of gene expression. Additionally, significantly detected sex-related genes shared between two mammal species (cattle and rat) were identified using qRT-PCR. CONCLUSIONS Results of our analyses reveal that sexual dimorphism of metabolic tissues and pituitary gland in cattle involves various biological processes. Several differentially expressed genes between sexes in cattle and rat species are shared, but show tissue-specific patterns. Finally, we concluded that two distinct statistical approaches have their advantages and disadvantages in RNA-seq studies investigating multiple tissues.
Collapse
Affiliation(s)
- Minseok Seo
- Interdisciplinary Program in Bioinformatics, Seoul National University, Kwan-ak St. 599, Kwan-ak Gu, Seoul, South Korea, 151-741, Republic of Korea.
- CHO&KIM genomics, Main Bldg. #514, SNU Research Park, Seoul National University Mt.4-2, NakSeoungDae, Gwanakgu, Seoul, 151-919, Republic of Korea.
| | | | | | - Sojeong Ka
- Department of Agricultural Biotechnology, Animal Biotechnology Major, and Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 151-921, Republic of Korea.
| | - Jin Young Jeong
- Division of Animal Products R&D, National Institute of Animal science, #1500 Kongjwipatjwi-ro, Wansan-gu, Jeonju-si, Jeollabuk-do, 565-851, Republic of Korea.
| | - Sungkwon Park
- Department of food science and technology, Sejong University, 98 Gun-Ja-Dong, Seoul, 143-747, Republic of Korea.
| | - Min Ji Kim
- Department of food science and technology, Sejong University, 98 Gun-Ja-Dong, Seoul, 143-747, Republic of Korea.
| | - Whan-Gook Nho
- Department of Swine & Poultry Science, National College of Agriculture and Fisheries, #1515 Kongjwipatjwi-ro, Wansan-gu, Jeonju-si, Jeollabuk-do, 560-500, Republic of Korea.
| | - Seoae Cho
- CHO&KIM genomics, Main Bldg. #514, SNU Research Park, Seoul National University Mt.4-2, NakSeoungDae, Gwanakgu, Seoul, 151-919, Republic of Korea.
| | - Heebal Kim
- Interdisciplinary Program in Bioinformatics, Seoul National University, Kwan-ak St. 599, Kwan-ak Gu, Seoul, South Korea, 151-741, Republic of Korea.
- CHO&KIM genomics, Main Bldg. #514, SNU Research Park, Seoul National University Mt.4-2, NakSeoungDae, Gwanakgu, Seoul, 151-919, Republic of Korea.
- Department of Agricultural Biotechnology, Animal Biotechnology Major, and Research Institute for Agriculture and Life Sciences, Seoul National University, Seoul, 151-921, Republic of Korea.
| | - Hyun-Jeong Lee
- Interdisciplinary Program in Bioinformatics, Seoul National University, Kwan-ak St. 599, Kwan-ak Gu, Seoul, South Korea, 151-741, Republic of Korea.
- Division of Animal Products R&D, National Institute of Animal science, #1500 Kongjwipatjwi-ro, Wansan-gu, Jeonju-si, Jeollabuk-do, 565-851, Republic of Korea.
| |
Collapse
|
22
|
SanCristobal M, Rohart F, Lascor C, Bouffaud M, Trouilh L, Martin PGP, Lippi Y, Tribout T, Faraut T, Mercat MJ, Milan D, Liaubet L. Exploring transcriptomic diversity in muscle revealed that cellular signaling pathways mainly differentiate five Western porcine breeds. BMC Genomics 2015; 16:1055. [PMID: 26651482 PMCID: PMC4676870 DOI: 10.1186/s12864-015-2259-9] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2015] [Accepted: 11/30/2015] [Indexed: 12/23/2022] Open
Abstract
Background Among transcriptomic studies, those comparing species or populations can increase our understanding of the impact of the evolutionary forces on the differentiation of populations. A particular situation is the one of short evolution time with breeds of a domesticated species that underwent strong selective pressures. In this study, the gene expression diversity across five pig breeds has been explored in muscle. Samples came from: 24 Duroc, 33 Landrace, 41 Large White dam line, 10 Large White sire line and 39 Piétrain. From these animals, 147 muscle samples obtained at slaughter were analyzed using the porcine Agilent 44 K v1 microarray. Results A total of 12,358 genes were identified as expressed in muscle after normalization and 1,703 genes were declared differential for at least one breed (FDR < 0.001). The functional analysis highlighted that gene expression diversity is mainly linked to cellular signaling pathways such as the PI3K (phosphoinositide 3-kinase) pathway. The PI3K pathway is known to be involved in the control of development of the skeletal muscle mass by affecting extracellular matrix - receptor interactions, regulation of actin cytoskeleton pathways and some metabolic functions. This study also highlighted 228 spots (171 unique genes) that differentiate the breeds from each other. A common subgroup of 15 genes selected by three statistical methods was able to differentiate Duroc, Large White and Piétrain breeds. Conclusions This study on transcriptomic differentiation across Western pig breeds highlighted a global picture: mainly signaling pathways were affected. This result is consistent with the selection objective of increasing muscle mass. These transcriptional changes may indicate selection pressure or simply breed differences which may be driven by human selection. Further work aiming at comparing genetic and transcriptomic diversities would further increase our understanding of the consequences of human impact on livestock species. Electronic supplementary material The online version of this article (doi:10.1186/s12864-015-2259-9) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Magali SanCristobal
- INRA, UMR1388 Génétique, Physiologie et Systèmes d'Elevage, F-31326, Castanet-Tolosan, France. .,Physiologie et Systèmes d'Elevage, Université de Toulouse INPT ENSAT, UMR1388 Génétique, F-31326, Castanet-Tolosan, France. .,Physiologie et Systèmes d'Elevage, Université de Toulouse INPT ENVT, UMR1388 Génétique, F-31076, Toulouse, France.
| | - Florian Rohart
- INRA, UMR1388 Génétique, Physiologie et Systèmes d'Elevage, F-31326, Castanet-Tolosan, France. .,Physiologie et Systèmes d'Elevage, Université de Toulouse INPT ENSAT, UMR1388 Génétique, F-31326, Castanet-Tolosan, France. .,Physiologie et Systèmes d'Elevage, Université de Toulouse INPT ENVT, UMR1388 Génétique, F-31076, Toulouse, France. .,Australian Institute for Bioengineering and Nanotechnology (AIBN), Corner College and Cooper Rds (Bldg 75), The University of Queensland, Brisbane Qld, 4072, Australia.
| | - Christine Lascor
- INRA, UMR1388 Génétique, Physiologie et Systèmes d'Elevage, F-31326, Castanet-Tolosan, France. .,Physiologie et Systèmes d'Elevage, Université de Toulouse INPT ENSAT, UMR1388 Génétique, F-31326, Castanet-Tolosan, France. .,Physiologie et Systèmes d'Elevage, Université de Toulouse INPT ENVT, UMR1388 Génétique, F-31076, Toulouse, France.
| | | | - Lidwine Trouilh
- Plateforme Transcriptome GeT-Biopuces, Laboratoire d'Ingénierie des Systèmes Biologiques et des Procédés (LISBP), F-31077, Toulouse, France.
| | - Pascal G P Martin
- Plateau Transcriptomic impact of Xenobiotics (TRiX), ToxAlim INRA/INP, F-31027, Toulouse, France.
| | - Yannick Lippi
- Plateau Transcriptomic impact of Xenobiotics (TRiX), ToxAlim INRA/INP, F-31027, Toulouse, France.
| | | | - Thomas Faraut
- INRA, UMR1388 Génétique, Physiologie et Systèmes d'Elevage, F-31326, Castanet-Tolosan, France. .,Physiologie et Systèmes d'Elevage, Université de Toulouse INPT ENSAT, UMR1388 Génétique, F-31326, Castanet-Tolosan, France. .,Physiologie et Systèmes d'Elevage, Université de Toulouse INPT ENVT, UMR1388 Génétique, F-31076, Toulouse, France.
| | | | - Denis Milan
- INRA, UMR1388 Génétique, Physiologie et Systèmes d'Elevage, F-31326, Castanet-Tolosan, France. .,Physiologie et Systèmes d'Elevage, Université de Toulouse INPT ENSAT, UMR1388 Génétique, F-31326, Castanet-Tolosan, France. .,Physiologie et Systèmes d'Elevage, Université de Toulouse INPT ENVT, UMR1388 Génétique, F-31076, Toulouse, France.
| | - Laurence Liaubet
- INRA, UMR1388 Génétique, Physiologie et Systèmes d'Elevage, F-31326, Castanet-Tolosan, France. .,Physiologie et Systèmes d'Elevage, Université de Toulouse INPT ENSAT, UMR1388 Génétique, F-31326, Castanet-Tolosan, France. .,Physiologie et Systèmes d'Elevage, Université de Toulouse INPT ENVT, UMR1388 Génétique, F-31076, Toulouse, France.
| |
Collapse
|
23
|
LoVerso PR, Cui F. A Computational Pipeline for Cross-Species Analysis of RNA-seq Data Using R and Bioconductor. Bioinform Biol Insights 2015; 9:165-74. [PMID: 26692761 PMCID: PMC4668955 DOI: 10.4137/bbi.s30884] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2015] [Revised: 10/21/2015] [Accepted: 10/24/2015] [Indexed: 01/25/2023] Open
Abstract
RNA sequencing (RNA-seq) has revolutionized transcriptome analysis through profiling the expression of thousands of genes at the same time. Systematic analysis of orthologous transcripts across species is critical for understanding the evolution of gene expression and uncovering important information in animal models of human diseases. Several computational methods have been published for analyzing gene expression between species, but they often lack crucial details and therefore cannot serve as a practical guide. Here, we present the first step-by-step protocol for cross-species RNA-seq analysis with a concise workflow that is largely based on the free open-source R language and Bioconductor packages. This protocol covers the entire process from short-read mapping, gene expression quantification, differential expression analysis to pathway enrichment. Many useful utilities for data visualization are included. This complete and easy-to-follow protocol provides hands-on guidance for users who are new to cross-species gene expression analysis.
Collapse
Affiliation(s)
- Peter R LoVerso
- Thomas H. Gosnell School of Life Sciences, Rochester Institute of Technology, One Lomb Memorial Drive, Rochester, NY, USA
| | - Feng Cui
- Thomas H. Gosnell School of Life Sciences, Rochester Institute of Technology, One Lomb Memorial Drive, Rochester, NY, USA
| |
Collapse
|
24
|
Chaudhuri R, Khoo PS, Tonks K, Junutula JR, Kolumam G, Modrusan Z, Samocha-Bonet D, Meoli CC, Hocking S, Fazakerley DJ, Stöckli J, Hoehn KL, Greenfield JR, Yang JYH, James DE. Cross-species gene expression analysis identifies a novel set of genes implicated in human insulin sensitivity. NPJ Syst Biol Appl 2015; 1:15010. [PMID: 28725461 PMCID: PMC5516867 DOI: 10.1038/npjsba.2015.10] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2014] [Revised: 07/24/2015] [Accepted: 08/24/2015] [Indexed: 12/24/2022] Open
Abstract
OBJECTIVE Insulin resistance (IR) is one of the earliest predictors of type 2 diabetes. However, diagnosis of IR is limited. High fat fed mouse models provide key insights into IR. We hypothesized that early features of IR are associated with persistent changes in gene expression (GE) and endeavored to (a) develop novel methods for improving signal:noise in analysis of human GE using mouse models; (b) identify a GE motif that accurately diagnoses IR in humans; and (c) identify novel biology associated with IR in humans. METHODS We integrated human muscle GE data with longitudinal mouse GE data and developed an unbiased three-level cross-species analysis platform (single gene, gene set, and networks) to generate a gene expression motif (GEM) indicative of IR. A logistic regression classification model validated GEM in three independent human data sets (n=115). RESULTS This GEM of 93 genes substantially improved diagnosis of IR compared with routine clinical measures across multiple independent data sets. Individuals misclassified by GEM possessed other metabolic features raising the possibility that they represent a separate metabolic subclass. The GEM was enriched in pathways previously implicated in insulin action and revealed novel associations between β-catenin and Jak1 and IR. Functional analyses using small molecule inhibitors showed an important role for these proteins in insulin action. CONCLUSIONS This study shows that systems approaches for identifying molecular signatures provides a powerful way to stratify individuals into discrete metabolic groups. Moreover, we speculate that the β-catenin pathway may represent a novel biomarker for IR in humans that warrant future investigation.
Collapse
Affiliation(s)
- Rima Chaudhuri
- Charles Perkins Centre, School of Molecular Bioscience, The University of Sydney, Sydney, NSW, Australia.,Diabetes and Obesity Program, Garvan Institute of Medical Research, Darlinghurst, NSW, Australia
| | - Poh Sim Khoo
- Diabetes and Obesity Program, Garvan Institute of Medical Research, Darlinghurst, NSW, Australia
| | - Katherine Tonks
- Diabetes and Obesity Program, Garvan Institute of Medical Research, Darlinghurst, NSW, Australia.,Department of Endocrinology and Diabetes Centre, St Vincent's Hospital, Sydney, NSW, Australia
| | | | | | - Zora Modrusan
- Genentech Incorporated, South San Francisco, CA, USA
| | - Dorit Samocha-Bonet
- Diabetes and Obesity Program, Garvan Institute of Medical Research, Darlinghurst, NSW, Australia.,Faculty of Medicine, University of New South Wales, Sydney, NSW, Australia
| | - Christopher C Meoli
- Diabetes and Obesity Program, Garvan Institute of Medical Research, Darlinghurst, NSW, Australia
| | - Samantha Hocking
- Department of Endocrinology, Royal North Shore Hospital, Sydney, NSW, Australia.,School of Medicine, The University of Sydney, Sydney, NSW, Australia
| | - Daniel J Fazakerley
- Charles Perkins Centre, School of Molecular Bioscience, The University of Sydney, Sydney, NSW, Australia.,Diabetes and Obesity Program, Garvan Institute of Medical Research, Darlinghurst, NSW, Australia
| | - Jacqueline Stöckli
- Charles Perkins Centre, School of Molecular Bioscience, The University of Sydney, Sydney, NSW, Australia.,Diabetes and Obesity Program, Garvan Institute of Medical Research, Darlinghurst, NSW, Australia
| | - Kyle L Hoehn
- School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, NSW, Australia
| | - Jerry R Greenfield
- Diabetes and Obesity Program, Garvan Institute of Medical Research, Darlinghurst, NSW, Australia.,Department of Endocrinology and Diabetes Centre, St Vincent's Hospital, Sydney, NSW, Australia.,Faculty of Medicine, University of New South Wales, Sydney, NSW, Australia
| | - Jean Yee Hwa Yang
- School of Mathematics and Statistics, University of Sydney, Sydney, NSW, Australia
| | - David E James
- Charles Perkins Centre, School of Molecular Bioscience, The University of Sydney, Sydney, NSW, Australia.,Diabetes and Obesity Program, Garvan Institute of Medical Research, Darlinghurst, NSW, Australia.,School of Medicine, The University of Sydney, Sydney, NSW, Australia
| |
Collapse
|
25
|
Sun L, Lamont SJ, Cooksey AM, McCarthy F, Tudor CO, Vijay-Shanker K, DeRita RM, Rothschild M, Ashwell C, Persia ME, Schmidt CJ. Transcriptome response to heat stress in a chicken hepatocellular carcinoma cell line. Cell Stress Chaperones 2015; 20:939-50. [PMID: 26238561 PMCID: PMC4595433 DOI: 10.1007/s12192-015-0621-0] [Citation(s) in RCA: 40] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2015] [Revised: 06/22/2015] [Accepted: 06/30/2015] [Indexed: 12/31/2022] Open
Abstract
Heat stress triggers an evolutionarily conserved set of responses in cells. The transcriptome responds to hyperthermia by altering expression of genes to adapt the cell or organism to survive the heat challenge. RNA-seq technology allows rapid identification of environmentally responsive genes on a large scale. In this study, we have used RNA-seq to identify heat stress responsive genes in the chicken male white leghorn hepatocellular (LMH) cell line. The transcripts of 812 genes were responsive to heat stress (p < 0.01) with 235 genes upregulated and 577 downregulated following 2.5 h of heat stress. Among the upregulated were genes whose products function as chaperones, along with genes affecting collagen synthesis and deposition, transcription factors, chromatin remodelers, and genes modulating the WNT and TGF-beta pathways. Predominant among the downregulated genes were ones that affect DNA replication and repair along with chromosomal segregation. Many of the genes identified in this study have not been previously implicated in the heat stress response. These data extend our understanding of the transcriptome response to heat stress with many of the identified biological processes and pathways likely to function in adapting cells and organisms to hyperthermic stress. Furthermore, this study should provide important insight to future efforts attempting to improve species abilities to withstand heat stress through genome-wide association studies and breeding.
Collapse
Affiliation(s)
- Liang Sun
- Department of Animal and Food Sciences, University of Delaware, Newark, DE, 19716, USA
| | - Susan J Lamont
- Department of Animal Science, Iowa State University, Ames, IA, 50011, USA
| | - Amanda M Cooksey
- School of Animal and Comparative Biomedical Sciences, University of Arizona, Tucson, AZ, 85721, USA
| | - Fiona McCarthy
- School of Animal and Comparative Biomedical Sciences, University of Arizona, Tucson, AZ, 85721, USA
| | - Catalina O Tudor
- Department of Computer and Information Sciences, University of Delaware, Newark, DE, 19716, USA
| | - K Vijay-Shanker
- Department of Computer and Information Sciences, University of Delaware, Newark, DE, 19716, USA
| | - Rachael M DeRita
- Department of Animal and Food Sciences, University of Delaware, Newark, DE, 19716, USA
| | - Max Rothschild
- Department of Animal Science, Iowa State University, Ames, IA, 50011, USA
| | - Chris Ashwell
- Department of Poultry Science, North Carolina State University, Raleigh, NC, 27695, USA
| | - Michael E Persia
- Department of Animal Science, Iowa State University, Ames, IA, 50011, USA
| | - Carl J Schmidt
- Department of Animal and Food Sciences, University of Delaware, Newark, DE, 19716, USA.
| |
Collapse
|
26
|
El-Kebir M, Soueidan H, Hume T, Beisser D, Dittrich M, Müller T, Blin G, Heringa J, Nikolski M, Wessels LFA, Klau GW. xHeinz: an algorithm for mining cross-species network modules under a flexible conservation model. Bioinformatics 2015; 31:3147-55. [PMID: 26023104 DOI: 10.1093/bioinformatics/btv316] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2014] [Accepted: 05/18/2015] [Indexed: 01/18/2023] Open
Abstract
MOTIVATION Integrative network analysis methods provide robust interpretations of differential high-throughput molecular profile measurements. They are often used in a biomedical context-to generate novel hypotheses about the underlying cellular processes or to derive biomarkers for classification and subtyping. The underlying molecular profiles are frequently measured and validated on animal or cellular models. Therefore the results are not immediately transferable to human. In particular, this is also the case in a study of the recently discovered interleukin-17 producing helper T cells (Th17), which are fundamental for anti-microbial immunity but also known to contribute to autoimmune diseases. RESULTS We propose a mathematical model for finding active subnetwork modules that are conserved between two species. These are sets of genes, one for each species, which (i) induce a connected subnetwork in a species-specific interaction network, (ii) show overall differential behavior and (iii) contain a large number of orthologous genes. We propose a flexible notion of conservation, which turns out to be crucial for the quality of the resulting modules in terms of biological interpretability. We propose an algorithm that finds provably optimal or near-optimal conserved active modules in our model. We apply our algorithm to understand the mechanisms underlying Th17 T cell differentiation in both mouse and human. As a main biological result, we find that the key regulation of Th17 differentiation is conserved between human and mouse. AVAILABILITY AND IMPLEMENTATION xHeinz, an implementation of our algorithm, as well as all input data and results, are available at http://software.cwi.nl/xheinz and as a Galaxy service at http://services.cbib.u-bordeaux2.fr/galaxy in CBiB Tools. CONTACT gunnar.klau@cwi.nl SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Mohammed El-Kebir
- Life Sciences, Centrum Wiskunde & Informatica (CWI), Amsterdam, The Netherlands, Centre for Integrative Bioinformatics VU, VU University Amsterdam, The Netherlands, Center for Computational Molecular Biology, Brown University, Providence, RI, USA
| | - Hayssam Soueidan
- Computational Cancer Biology, The Netherlands Cancer Institute, Amsterdam, The Netherlands
| | - Thomas Hume
- Univ. Bordeaux, CBiB, 33000 Bordeaux, France, Univ. Bordeaux, CNRS/LaBRI, 33405 Talence, France
| | - Daniela Beisser
- Genome Informatics, Institute of Human Genetics, University Hospital Essen, University of Duisburg-Essen
| | - Marcus Dittrich
- Department of Bioinformatics, Biocenter, University of Würzburg, Würzburg, Germany, Institute of Human Genetics, University of Würzburg, Würzburg, Germany and
| | - Tobias Müller
- Department of Bioinformatics, Biocenter, University of Würzburg, Würzburg, Germany
| | | | - Jaap Heringa
- Centre for Integrative Bioinformatics VU, VU University Amsterdam, The Netherlands
| | - Macha Nikolski
- Univ. Bordeaux, CBiB, 33000 Bordeaux, France, Univ. Bordeaux, CNRS/LaBRI, 33405 Talence, France
| | - Lodewyk F A Wessels
- Computational Cancer Biology, The Netherlands Cancer Institute, Amsterdam, The Netherlands
| | - Gunnar W Klau
- Life Sciences, Centrum Wiskunde & Informatica (CWI), Amsterdam, The Netherlands, Centre for Integrative Bioinformatics VU, VU University Amsterdam, The Netherlands, Erable Team, INRIA, Lyon, France
| |
Collapse
|
27
|
Baskaran P, Rödelsperger C, Prabh N, Serobyan V, Markov GV, Hirsekorn A, Dieterich C. Ancient gene duplications have shaped developmental stage-specific expression in Pristionchus pacificus. BMC Evol Biol 2015; 15:185. [PMID: 26370559 PMCID: PMC4570658 DOI: 10.1186/s12862-015-0466-2] [Citation(s) in RCA: 35] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2015] [Accepted: 08/14/2015] [Indexed: 12/28/2022] Open
Abstract
Background The development of multicellular organisms is accompanied by gene expression changes in differentiating cells. Profiling stage-specific expression during development may reveal important insights into gene sets that contributed to the morphological diversity across the animal kingdom. Results We sequenced RNA-seq libraries throughout a developmental timecourse of the nematode Pristionchus pacificus. The transcriptomes reflect early larval stages, adult worms including late larvae, and growth-arrested dauer larvae and allowed the identification of developmentally regulated gene clusters. Our data reveals similar trends as previous transcriptome profiling of dauer worms and represents the first expression data for early larvae in P. pacificus. Gene expression clusters characterizing early larval stages show most significant enrichments of chaperones, while collagens are most significantly enriched in transcriptomes of late larvae and adult worms. By combining expression data with phylogenetic analysis, we found that developmentally regulated genes are found in paralogous clusters that have arisen through lineage-specific duplications after the split from the Caenorhabditis elegans branch. Conclusions We propose that gene duplications of developmentally regulated genes represent a plausible evolutionary mechanism to increase the dosage of stage-specific expression. Consequently, this may contribute to the substantial divergence in expression profiles that has been observed across larger evolutionary time scales. Electronic supplementary material The online version of this article (doi:10.1186/s12862-015-0466-2) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Praveen Baskaran
- Max-Planck Institute for Developmental Biology, Spemannstr. 35, Tübingen, 72076, Germany.
| | - Christian Rödelsperger
- Max-Planck Institute for Developmental Biology, Spemannstr. 35, Tübingen, 72076, Germany.
| | - Neel Prabh
- Max-Planck Institute for Developmental Biology, Spemannstr. 35, Tübingen, 72076, Germany.
| | - Vahan Serobyan
- Max-Planck Institute for Developmental Biology, Spemannstr. 35, Tübingen, 72076, Germany.
| | - Gabriel V Markov
- Max-Planck Institute for Developmental Biology, Spemannstr. 35, Tübingen, 72076, Germany.
| | - Antje Hirsekorn
- Max-Delbrück Center for Molecular Medicine, Robert-Rössle-Straße 10, Berlin-Buch, 13125, Germany.
| | - Christoph Dieterich
- Max-Planck Institute for Biology of Aging, Joseph-Stelzmann-Str. 9b, Köln, 50866, Germany.
| |
Collapse
|
28
|
Goodman JE, Petito Boyce C, Sax SN, Beyer LA, Prueitt RL. Rethinking Meta-Analysis: Applications for Air Pollution Data and Beyond. RISK ANALYSIS : AN OFFICIAL PUBLICATION OF THE SOCIETY FOR RISK ANALYSIS 2015; 35:1017-39. [PMID: 25969128 PMCID: PMC4690509 DOI: 10.1111/risa.12405] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/29/2023]
Abstract
Meta-analyses offer a rigorous and transparent systematic framework for synthesizing data that can be used for a wide range of research areas, study designs, and data types. Both the outcome of meta-analyses and the meta-analysis process itself can yield useful insights for answering scientific questions and making policy decisions. Development of the National Ambient Air Quality Standards illustrates many potential applications of meta-analysis. These applications demonstrate the strengths and limitations of meta-analysis, issues that arise in various data realms, how meta-analysis design choices can influence interpretation of results, and how meta-analysis can be used to address bias and heterogeneity. Reviewing available data from a meta-analysis perspective can provide a useful framework and impetus for identifying and refining strategies for future research. Moreover, increased pervasiveness of a meta-analysis mindset-focusing on how the pieces of the research puzzle fit together-would benefit scientific research and data syntheses regardless of whether or not a quantitative meta-analysis is undertaken. While an individual meta-analysis can only synthesize studies addressing the same research question, the results of separate meta-analyses can be combined to address a question encompassing multiple data types. This observation applies to any scientific or policy area where information from a variety of disciplines must be considered to address a broader research question.
Collapse
|
29
|
Neuromolecular responses to social challenge: common mechanisms across mouse, stickleback fish, and honey bee. Proc Natl Acad Sci U S A 2014; 111:17929-34. [PMID: 25453090 DOI: 10.1073/pnas.1420369111] [Citation(s) in RCA: 120] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023] Open
Abstract
Certain complex phenotypes appear repeatedly across diverse species due to processes of evolutionary conservation and convergence. In some contexts like developmental body patterning, there is increased appreciation that common molecular mechanisms underlie common phenotypes; these molecular mechanisms include highly conserved genes and networks that may be modified by lineage-specific mutations. However, the existence of deeply conserved mechanisms for social behaviors has not yet been demonstrated. We used a comparative genomics approach to determine whether shared neuromolecular mechanisms could underlie behavioral response to territory intrusion across species spanning a broad phylogenetic range: house mouse (Mus musculus), stickleback fish (Gasterosteus aculeatus), and honey bee (Apis mellifera). Territory intrusion modulated similar brain functional processes in each species, including those associated with hormone-mediated signal transduction and neurodevelopment. Changes in chromosome organization and energy metabolism appear to be core, conserved processes involved in the response to territory intrusion. We also found that several homologous transcription factors that are typically associated with neural development were modulated across all three species, suggesting that shared neuronal effects may involve transcriptional cascades of evolutionarily conserved genes. Furthermore, immunohistochemical analyses of a subset of these transcription factors in mouse again implicated modulation of energy metabolism in the behavioral response. These results provide support for conserved genetic "toolkits" that are used in independent evolutions of the response to social challenge in diverse taxa.
Collapse
|
30
|
Schroyen M, Tuggle CK. Current transcriptomics in pig immunity research. Mamm Genome 2014; 26:1-20. [PMID: 25398484 PMCID: PMC7087981 DOI: 10.1007/s00335-014-9549-4] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2014] [Accepted: 10/21/2014] [Indexed: 01/05/2023]
Abstract
Swine performance in the face of disease challenge is becoming progressively more important. To improve the pig’s robustness and resilience against pathogens through selection, a better understanding of the genetic and epigenetic factors in the immune response is required. This review highlights results from the most recent transcriptome research, and the meta-analyses performed, in the context of pig immunity. A technological overview is given including wholegenome microarrays, immune-specific arrays, small-scale high-throughput expression methods, high-density tiling arrays, and next generation sequencing (NGS). Although whole genome microarray techniques will remain complementary to NGS for some time in domestic species, research will transition to sequencing-based methods due to cost-effectiveness and the extra information that such methods provide. Furthermore, upcoming high-throughput epigenomic studies, which will add greatly to our knowledge concerning the impact of epigenetic modifications on pig immune response, are listed in this review. With emphasis on the insights obtained from transcriptomic analyses for porcine immunity, we also discuss the experimental design in pig immunity research and the value of the newly published porcine genome assembly in using the pig as a model for human immune response. We conclude by discussing the importance of establishing community standards to maximize the possibility of integrative computational analyses, such as was clearly beneficial for the human ENCODE project.
Collapse
Affiliation(s)
- Martine Schroyen
- Department of Animal Science, Iowa State University, 2255 Kildee Hall, Ames, IA, 50011, USA,
| | | |
Collapse
|
31
|
Okyere J, Oppon E, Dzidzienyo D, Sharma L, Ball G. Cross-species gene expression analysis of species specific differences in the preclinical assessment of pharmaceutical compounds. PLoS One 2014; 9:e96853. [PMID: 24823806 PMCID: PMC4019543 DOI: 10.1371/journal.pone.0096853] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2013] [Accepted: 04/11/2014] [Indexed: 01/11/2023] Open
Abstract
Animals are frequently used as model systems for determination of safety and efficacy in pharmaceutical research and development. However, significant quantitative and qualitative differences exist between humans and the animal models used in research. This is as a result of genetic variation between human and the laboratory animal. Therefore the development of a system that would allow the assessment of all molecular differences between species after drug exposure would have a significant impact on drug evaluation for toxicity and efficacy. Here we describe a cross-species microarray methodology that identifies and selects orthologous probes after cross-species sequence comparison to develop an orthologous cross-species gene expression analysis tool. The assumptions made by the use of this orthologous gene expression strategy for cross-species extrapolation is that; conserved changes in gene expression equate to conserved pharmacodynamic endpoints. This assumption is supported by the fact that evolution and selection have maintained the structure and function of many biochemical pathways over time, resulting in the conservation of many important processes. We demonstrate this cross-species methodology by investigating species specific differences of the peroxisome proliferator-activator receptor (PPAR) α response in rat and human.
Collapse
Affiliation(s)
- John Okyere
- CrossGen Limited, BioCity Nottingham, Pennyfoot Street, Nottingham, United Kingdom
- * E-mail:
| | - Ekow Oppon
- CrossGen Limited, BioCity Nottingham, Pennyfoot Street, Nottingham, United Kingdom
| | - Daniel Dzidzienyo
- CrossGen Limited, BioCity Nottingham, Pennyfoot Street, Nottingham, United Kingdom
| | - Lav Sharma
- CrossGen Limited, BioCity Nottingham, Pennyfoot Street, Nottingham, United Kingdom
| | - Graham Ball
- John Van Geest Cancer Research Centre, Nottingham Trent University, Clifton Campus, Clifton Lane, Nottingham, United Kingdom
| |
Collapse
|
32
|
|