1
|
Baghfalaki T, Ganjali M, Berridge D. Generalized estimating equations by considering additive terms for analyzing time-course gene sets data. J Korean Stat Soc 2018. [DOI: 10.1016/j.jkss.2018.05.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/16/2022]
|
2
|
Zhang Y, Topham DJ, Thakar J, Qiu X. FUNNEL-GSEA: FUNctioNal ELastic-net regression in time-course gene set enrichment analysis. Bioinformatics 2018; 33:1944-1952. [PMID: 28334094 PMCID: PMC5939227 DOI: 10.1093/bioinformatics/btx104] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2016] [Accepted: 02/17/2017] [Indexed: 01/26/2023] Open
Abstract
Motivation Gene set enrichment analyses (GSEAs) are widely used in genomic research to identify underlying biological mechanisms (defined by the gene sets), such as Gene Ontology terms and molecular pathways. There are two caveats in the currently available methods: (i) they are typically designed for group comparisons or regression analyses, which do not utilize temporal information efficiently in time-series of transcriptomics measurements; and (ii) genes overlapping in multiple molecular pathways are considered multiple times in hypothesis testing. Results We propose an inferential framework for GSEA based on functional data analysis, which utilizes the temporal information based on functional principal component analysis, and disentangles the effects of overlapping genes by a functional extension of the elastic-net regression. Furthermore, the hypothesis testing for the gene sets is performed by an extension of Mann-Whitney U test which is based on weighted rank sums computed from correlated observations. By using both simulated datasets and a large-scale time-course gene expression data on human influenza infection, we demonstrate that our method has uniformly better receiver operating characteristic curves, and identifies more pathways relevant to immune-response to human influenza infection than the competing approaches. Availability and Implementation The methods are implemented in R package FUNNEL, freely and publicly available at: https://github.com/yunzhang813/FUNNEL-GSEA-R-Package. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Yun Zhang
- Department of Biostatistics and Computational Biology
| | - David J Topham
- Department of Microbiology and Immunology, University of Rochester, Rochester, NY 14642, USA
| | - Juilee Thakar
- Department of Biostatistics and Computational Biology.,Department of Microbiology and Immunology, University of Rochester, Rochester, NY 14642, USA
| | - Xing Qiu
- Department of Biostatistics and Computational Biology
| |
Collapse
|
3
|
Lopez-Moya F, Kowbel D, Nueda MJ, Palma-Guerrero J, Glass NL, Lopez-Llorca LV. Neurospora crassa transcriptomics reveals oxidative stress and plasma membrane homeostasis biology genes as key targets in response to chitosan. MOLECULAR BIOSYSTEMS 2016; 12:391-403. [PMID: 26694141 PMCID: PMC4729629 DOI: 10.1039/c5mb00649j] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
Chitosan is a natural polymer with antimicrobial activity. Chitosan causes plasma membrane permeabilization and induction of intracellular reactive oxygen species (ROS) in Neurospora crassa. We have determined the transcriptional profile of N. crassa to chitosan and identified the main gene targets involved in the cellular response to this compound. Global network analyses showed membrane, transport and oxidoreductase activity as key nodes affected by chitosan. Activation of oxidative metabolism indicates the importance of ROS and cell energy together with plasma membrane homeostasis in N. crassa response to chitosan. Deletion strain analysis of chitosan susceptibility pointed NCU03639 encoding a class 3 lipase, involved in plasma membrane repair by lipid replacement, and NCU04537 a MFS monosaccharide transporter related to assimilation of simple sugars, as main gene targets of chitosan. NCU10521, a glutathione S-transferase-4 involved in the generation of reducing power for scavenging intracellular ROS is also a determinant chitosan gene target. Ca(2+) increased tolerance to chitosan in N. crassa. Growth of NCU10610 (fig 1 domain) and SYT1 (a synaptotagmin) deletion strains was significantly increased by Ca(2+) in the presence of chitosan. Both genes play a determinant role in N. crassa membrane homeostasis. Our results are of paramount importance for developing chitosan as an antifungal.
Collapse
Affiliation(s)
- Federico Lopez-Moya
- Laboratory of Plant Pathology, Multidisciplinary Institute for Environmental Studies (MIES) Ramon Margalef, Department of Marine Sciences and Applied Biology, University of Alicante, E-03080 Alicante, Spain.
| | - David Kowbel
- Department of Plant and Microbial Biology, University of California, Berkeley CA, 94720-3120 USA.
| | - Maria José Nueda
- Statistics and Operation Research Department, University of Alicante, E-03080 Alicante, Spain.
| | - Javier Palma-Guerrero
- Department of Plant and Microbial Biology, University of California, Berkeley CA, 94720-3120 USA.
| | - N Louise Glass
- Department of Plant and Microbial Biology, University of California, Berkeley CA, 94720-3120 USA.
| | - Luis Vicente Lopez-Llorca
- Laboratory of Plant Pathology, Multidisciplinary Institute for Environmental Studies (MIES) Ramon Margalef, Department of Marine Sciences and Applied Biology, University of Alicante, E-03080 Alicante, Spain.
| |
Collapse
|
4
|
Hejblum BP, Skinner J, Thiébaut R. Time-Course Gene Set Analysis for Longitudinal Gene Expression Data. PLoS Comput Biol 2015; 11:e1004310. [PMID: 26111374 PMCID: PMC4482329 DOI: 10.1371/journal.pcbi.1004310] [Citation(s) in RCA: 43] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2014] [Accepted: 04/30/2015] [Indexed: 01/13/2023] Open
Abstract
Gene set analysis methods, which consider predefined groups of genes in the analysis of genomic data, have been successfully applied for analyzing gene expression data in cross-sectional studies. The time-course gene set analysis (TcGSA) introduced here is an extension of gene set analysis to longitudinal data. The proposed method relies on random effects modeling with maximum likelihood estimates. It allows to use all available repeated measurements while dealing with unbalanced data due to missing at random (MAR) measurements. TcGSA is a hypothesis driven method that identifies a priori defined gene sets with significant expression variations over time, taking into account the potential heterogeneity of expression within gene sets. When biological conditions are compared, the method indicates if the time patterns of gene sets significantly differ according to these conditions. The interest of the method is illustrated by its application to two real life datasets: an HIV therapeutic vaccine trial (DALIA-1 trial), and data from a recent study on influenza and pneumococcal vaccines. In the DALIA-1 trial TcGSA revealed a significant change in gene expression over time within 69 gene sets during vaccination, while a standard univariate individual gene analysis corrected for multiple testing as well as a standard a Gene Set Enrichment Analysis (GSEA) for time series both failed to detect any significant pattern change over time. When applied to the second illustrative data set, TcGSA allowed the identification of 4 gene sets finally found to be linked with the influenza vaccine too although they were found to be associated to the pneumococcal vaccine only in previous analyses. In our simulation study TcGSA exhibits good statistical properties, and an increased power compared to other approaches for analyzing time-course expression patterns of gene sets. The method is made available for the community through an R package. Gene set analysis methods use prior biological knowledge to analyze gene expression data. This prior knowledge takes the form of predefined groups of genes, linked through their biological function. Gene set analysis methods have been successfully applied in transversal studies, their results being more sensitive and interpretable than those of methods investigating genomic data one gene at a time. The time-course gene set analysis (TcGSA) introduced here is an extension of such gene set analysis to longitudinal data. This method identifies a priori defined groups of genes whose expression is not stable over time, taking into account the potential heterogeneity between patients and between genes. When biological conditions are compared, it identifies the gene sets that have different expression dynamics according to these conditions. Data from 2 studies are analyzed: data from an HIV therapeutic vaccine trial, and data from a recent study on influenza and pneumococcal vaccines. In both cases, TcGSA provided new insights compared to standard approaches thanks to an increased sensitivity compared to other approaches. Those results highlight the benefits of the TcGSA method for analyzing gene expression dynamics.
Collapse
Affiliation(s)
- Boris P. Hejblum
- Univ. Bordeaux, ISPED, Centre INSERM U897-Epidemiologie-Biostatistique, F-33000 Bordeaux, France
- INSERM, ISPED, Centre INSERM U897-Epidemiologie-Biostatistique, F-33000 Bordeaux, France
- INRIA, Team SISTM, F-33000 Bordeaux, France
- Vaccine Research Institute-VRI, Hôpital Henri Mondor, Créteil, France
- Baylor Institute for Immunology Research, Dallas, Texas, United States of America
| | - Jason Skinner
- Vaccine Research Institute-VRI, Hôpital Henri Mondor, Créteil, France
- Baylor Institute for Immunology Research, Dallas, Texas, United States of America
| | - Rodolphe Thiébaut
- Univ. Bordeaux, ISPED, Centre INSERM U897-Epidemiologie-Biostatistique, F-33000 Bordeaux, France
- INSERM, ISPED, Centre INSERM U897-Epidemiologie-Biostatistique, F-33000 Bordeaux, France
- INRIA, Team SISTM, F-33000 Bordeaux, France
- Vaccine Research Institute-VRI, Hôpital Henri Mondor, Créteil, France
- Baylor Institute for Immunology Research, Dallas, Texas, United States of America
- * E-mail:
| |
Collapse
|
5
|
Nueda MJ, Tarazona S, Conesa A. Next maSigPro: updating maSigPro bioconductor package for RNA-seq time series. Bioinformatics 2014; 30:2598-602. [PMID: 24894503 PMCID: PMC4155246 DOI: 10.1093/bioinformatics/btu333] [Citation(s) in RCA: 211] [Impact Index Per Article: 21.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022] Open
Abstract
Motivation: The widespread adoption of RNA-seq to quantitatively measure gene expression has increased the scope of sequencing experimental designs to include time-course experiments. maSigPro is an R package specifically suited for the analysis of time-course gene expression data, which was developed originally for microarrays and hence was limited in its application to count data. Results: We have updated maSigPro to support RNA-seq time series analysis by introducing generalized linear models in the algorithm to support the modeling of count data while maintaining the traditional functionalities of the package. We show a good performance of the maSigPro-GLM method in several simulated time-course scenarios and in a real experimental dataset. Availability and implementation: The package is freely available under the LGPL license from the Bioconductor Web site (http://bioconductor.org). Contact:mj.nueda@ua.es or aconesa@cipf.es
Collapse
Affiliation(s)
- María José Nueda
- Statistics and Operational Research Department, University of Alicante, 03690, Alicante, Spain, Genomics of Gene Expression Laboratory, Prince Felipe Research Centre, 46012 Valencia, Spain and Applied Statistics, Operational Research and Quality Department, Polytechnic University of Valencia, 46020 Valencia, Spain
| | - Sonia Tarazona
- Statistics and Operational Research Department, University of Alicante, 03690, Alicante, Spain, Genomics of Gene Expression Laboratory, Prince Felipe Research Centre, 46012 Valencia, Spain and Applied Statistics, Operational Research and Quality Department, Polytechnic University of Valencia, 46020 Valencia, Spain Statistics and Operational Research Department, University of Alicante, 03690, Alicante, Spain, Genomics of Gene Expression Laboratory, Prince Felipe Research Centre, 46012 Valencia, Spain and Applied Statistics, Operational Research and Quality Department, Polytechnic University of Valencia, 46020 Valencia, Spain
| | - Ana Conesa
- Statistics and Operational Research Department, University of Alicante, 03690, Alicante, Spain, Genomics of Gene Expression Laboratory, Prince Felipe Research Centre, 46012 Valencia, Spain and Applied Statistics, Operational Research and Quality Department, Polytechnic University of Valencia, 46020 Valencia, Spain
| |
Collapse
|
6
|
Martini P, Sales G, Calura E, Cagnin S, Chiogna M, Romualdi C. timeClip: pathway analysis for time course data without replicates. BMC Bioinformatics 2014; 15 Suppl 5:S3. [PMID: 25077979 PMCID: PMC4095003 DOI: 10.1186/1471-2105-15-s5-s3] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/23/2023] Open
Abstract
Background Time-course gene expression experiments are useful tools for exploring biological processes. In this type of experiments, gene expression changes are monitored along time. Unfortunately, replication of time series is still costly and usually long time course do not have replicates. Many approaches have been proposed to deal with this data structure, but none of them in the field of pathway analysis. Pathway analyses have acquired great relevance for helping the interpretation of gene expression data. Several methods have been proposed to this aim: from the classical enrichment to the more complex topological analysis that gains power from the topology of the pathway. None of them were devised to identify temporal variations in time course data. Results Here we present timeClip, a topology based pathway analysis specifically tailored to long time series without replicates. timeClip combines dimension reduction techniques and graph decomposition theory to explore and identify the portion of pathways that is most time-dependent. In the first step, timeClip selects the time-dependent pathways; in the second step, the most time dependent portions of these pathways are highlighted. We used timeClip on simulated data and on a benchmark dataset regarding mouse muscle regeneration model. Our approach shows good performance on different simulated settings. On the real dataset, we identify 76 time-dependent pathways, most of which known to be involved in the regeneration process. Focusing on the 'mTOR signaling pathway' we highlight the timing of key processes of the muscle regeneration: from the early pathway activation through growth factor signals to the late burst of protein production needed for the fiber regeneration. Conclusions timeClip represents a new improvement in the field of time-dependent pathway analysis. It allows to isolate and dissect pathways characterized by time-dependent components. Furthermore, using timeClip on a mouse muscle regeneration dataset we were able to characterize the process of muscle fiber regeneration with its correct timing.
Collapse
|
7
|
Ponzoni I, Nueda M, Tarazona S, Götz S, Montaner D, Dussaut J, Dopazo J, Conesa A. Pathway network inference from gene expression data. BMC SYSTEMS BIOLOGY 2014; 8 Suppl 2:S7. [PMID: 25032889 PMCID: PMC4101702 DOI: 10.1186/1752-0509-8-s2-s7] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
|
8
|
Abstract
The rise of systems biology is intertwined with that of genomics, yet their primordial relationship to one another is ill-defined. We discuss how the growth of genomics provided a critical boost to the popularity of systems biology. We describe the parts of genomics that share common areas of interest with systems biology today in the areas of gene expression, network inference, chromatin state analysis, pathway analysis, personalized medicine, and upcoming areas of synergy as genomics continues to expand its scope across all biomedical fields.
Collapse
Affiliation(s)
- Ana Conesa
- Genomics of Gene Expression Lab, Centro de Investigaciones Príncipe Felipe, Valencia, Spain
| | - Ali Mortazavi
- Department of Developmental and Cell Biology, University of California, Irvine, CA 92697, USA
- Center for Complex Biological Systems, University of California, Irvine, CA 92697, USA
| |
Collapse
|
9
|
de Vega-Bartol JJ, Simões M, Lorenz WW, Rodrigues AS, Alba R, Dean JFD, Miguel CM. Transcriptomic analysis highlights epigenetic and transcriptional regulation during zygotic embryo development of Pinus pinaster. BMC PLANT BIOLOGY 2013; 13:123. [PMID: 23987738 PMCID: PMC3844413 DOI: 10.1186/1471-2229-13-123] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/13/2013] [Accepted: 08/24/2013] [Indexed: 05/18/2023]
Abstract
BACKGROUND It is during embryogenesis that the plant body plan is established and the meristems responsible for all post-embryonic growth are specified. The molecular mechanisms governing conifer embryogenesis are still largely unknown. Their elucidation may contribute valuable information to clarify if the distinct features of embryo development in angiosperms and gymnosperms result from differential gene regulation. To address this issue, we have performed the first transcriptomic analysis of zygotic embryo development in a conifer species (Pinus pinaster) focusing our study in particular on regulatory genes playing important roles during plant embryo development, namely epigenetic regulators and transcription factors. RESULTS Microarray analysis of P. pinaster zygotic embryogenesis was performed at five periods of embryo development from early developing to mature embryos. Our results show that most changes in transcript levels occurred in the first and the last embryo stage-to-stage transitions, namely early to pre-cotyledonary embryo and cotyledonary to mature embryo. An analysis of functional categories for genes that were differentially expressed through embryogenesis highlighted several epigenetic regulation mechanisms. While putative orthologs of transcripts associated with mechanisms that target transposable elements and repetitive sequences were strongly expressed in early embryogenesis, PRC2-mediated repression of genes seemed more relevant during late embryogenesis. On the other hand, functions related to sRNA pathways appeared differentially regulated across all stages of embryo development with a prevalence of miRNA functions in mid to late embryogenesis. Identification of putative transcription factor genes differentially regulated between consecutive embryo stages was strongly suggestive of the relevance of auxin responses and regulation of auxin carriers during early embryogenesis. Such responses could be involved in establishing embryo patterning. Later in development, transcripts with homology to genes acting on modulation of auxin flow and determination of adaxial-abaxial polarity were up-regulated, as were putative orthologs of genes required for meristem formation and function as well as establishment of organ boundaries. Comparative analysis with A. thaliana embryogenesis also highlighted genes involved in auxin-mediated responses, as well as epigenetic regulation, indicating highly correlated transcript profiles between the two species. CONCLUSIONS This is the first report of a time-course transcriptomic analysis of zygotic embryogenesis in a conifer. Taken together our results show that epigenetic regulation and transcriptional control related to auxin transport and response are critical during early to mid stages of pine embryogenesis and that important events during embryogenesis seem to be coordinated by putative orthologs of major developmental regulators in angiosperms.
Collapse
Affiliation(s)
- José J de Vega-Bartol
- iBET - Instituto de Biologia Experimental e Tecnológica, Apartado 12, 2780-901 Oeiras, Portugal
- Instituto de Tecnologia Química e Biológica, Universidade Nova de Lisboa, Av. da República, 2780-157 Oeiras, Portugal
| | - Marta Simões
- iBET - Instituto de Biologia Experimental e Tecnológica, Apartado 12, 2780-901 Oeiras, Portugal
- Instituto de Tecnologia Química e Biológica, Universidade Nova de Lisboa, Av. da República, 2780-157 Oeiras, Portugal
| | - W Walter Lorenz
- Warnell School of Forestry and Natural Resources, The University of Georgia, Athens, GA 30602, USA
| | - Andreia S Rodrigues
- iBET - Instituto de Biologia Experimental e Tecnológica, Apartado 12, 2780-901 Oeiras, Portugal
- Instituto de Tecnologia Química e Biológica, Universidade Nova de Lisboa, Av. da República, 2780-157 Oeiras, Portugal
| | - Rob Alba
- Monsanto Company, Mailstop CC4, 700 Chesterfield Parkway West, Chesterfield, MO 63017, USA
| | - Jeffrey F D Dean
- Warnell School of Forestry and Natural Resources, The University of Georgia, Athens, GA 30602, USA
| | - Célia M Miguel
- iBET - Instituto de Biologia Experimental e Tecnológica, Apartado 12, 2780-901 Oeiras, Portugal
- Instituto de Tecnologia Química e Biológica, Universidade Nova de Lisboa, Av. da República, 2780-157 Oeiras, Portugal
| |
Collapse
|
10
|
Babcock JJ, Du F, Xu K, Wheelan SJ, Li M. Integrated analysis of drug-induced gene expression profiles predicts novel hERG inhibitors. PLoS One 2013; 8:e69513. [PMID: 23936032 PMCID: PMC3720659 DOI: 10.1371/journal.pone.0069513] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2013] [Accepted: 06/07/2013] [Indexed: 11/19/2022] Open
Abstract
Growing evidence suggests that drugs interact with diverse molecular targets mediating both therapeutic and toxic effects. Prediction of these complex interactions from chemical structures alone remains challenging, as compounds with different structures may possess similar toxicity profiles. In contrast, predictions based on systems-level measurements of drug effect may reveal pharmacologic similarities not evident from structure or known therapeutic indications. Here we utilized drug-induced transcriptional responses in the Connectivity Map (CMap) to discover such similarities among diverse antagonists of the human ether-à-go-go related (hERG) potassium channel, a common target of promiscuous inhibition by small molecules. Analysis of transcriptional profiles generated in three independent cell lines revealed clusters enriched for hERG inhibitors annotated using a database of experimental measurements (hERGcentral) and clinical indications. As a validation, we experimentally identified novel hERG inhibitors among the unannotated drugs in these enriched clusters, suggesting transcriptional responses may serve as predictive surrogates of cardiotoxicity complementing existing functional assays.
Collapse
Affiliation(s)
- Joseph J. Babcock
- The Solomon H. Snyder Department of Neuroscience and High Throughput Biology Center, The Johns Hopkins University School of Medicine, Baltimore, Maryland, United States of America
| | - Fang Du
- The Solomon H. Snyder Department of Neuroscience and High Throughput Biology Center, The Johns Hopkins University School of Medicine, Baltimore, Maryland, United States of America
| | - Kaiping Xu
- Johns Hopkins Ion Channel Center (JHICC), The Johns Hopkins University School of Medicine, Baltimore, Maryland, United States of America
| | - Sarah J. Wheelan
- Department of Oncology, Division of Biostatistics and Bioinformatics, The Johns Hopkins University School of Medicine, Baltimore, Maryland, United States of America
- * E-mail: (ML); (SJW)
| | - Min Li
- The Solomon H. Snyder Department of Neuroscience and High Throughput Biology Center, The Johns Hopkins University School of Medicine, Baltimore, Maryland, United States of America
- Johns Hopkins Ion Channel Center (JHICC), The Johns Hopkins University School of Medicine, Baltimore, Maryland, United States of America
- * E-mail: (ML); (SJW)
| |
Collapse
|
11
|
Pirim H, Ekşioğlu B, Perkins A, Yüceer Ç. Clustering of High Throughput Gene Expression Data. COMPUTERS & OPERATIONS RESEARCH 2012; 39:3046-3061. [PMID: 23144527 PMCID: PMC3491664 DOI: 10.1016/j.cor.2012.03.008] [Citation(s) in RCA: 40] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/06/2023]
Abstract
High throughput biological data need to be processed, analyzed, and interpreted to address problems in life sciences. Bioinformatics, computational biology, and systems biology deal with biological problems using computational methods. Clustering is one of the methods used to gain insight into biological processes, particularly at the genomics level. Clearly, clustering can be used in many areas of biological data analysis. However, this paper presents a review of the current clustering algorithms designed especially for analyzing gene expression data. It is also intended to introduce one of the main problems in bioinformatics - clustering gene expression data - to the operations research community.
Collapse
Affiliation(s)
- Harun Pirim
- Department of Industrial and Systems Engineering, Mississippi State University, P.O. Box 9542, Mississippi State, MS 39762
- Corresponding author. Tel.:+1-662-325-4226;
| | - Burak Ekşioğlu
- Department of Industrial and Systems Engineering, Mississippi State University, P.O. Box 9542, Mississippi State, MS 39762
| | - Andy Perkins
- Department of Computer Science and Engineering, Mississippi State University
| | - Çetin Yüceer
- Department of Forestry, Mississippi State University
| |
Collapse
|
12
|
Rizza S, Conesa A, Juarez J, Catara A, Navarro L, Duran-Vila N, Ancillo G. Microarray analysis of Etrog citron (Citrus medica L.) reveals changes in chloroplast, cell wall, peroxidase and symporter activities in response to viroid infection. MOLECULAR PLANT PATHOLOGY 2012; 13:852-64. [PMID: 22420919 PMCID: PMC6638686 DOI: 10.1111/j.1364-3703.2012.00794.x] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/25/2023]
Abstract
Viroids are small (246-401 nucleotides), single-stranded, circular RNA molecules that infect several crop plants and can cause diseases of economic importance. Citrus are the hosts in which the largest number of viroids have been identified. Citrus exocortis viroid (CEVd), the causal agent of citrus exocortis disease, induces considerable losses in citrus crops. Changes in the gene expression profile during the early (pre-symptomatic) and late (post-symptomatic) stages of Etrog citron infected with CEVd were investigated using a citrus cDNA microarray. MaSigPro analysis was performed and, on the basis of gene expression profiles as a function of the time after infection, the differentially expressed genes were classified into five clusters. FatiScan analysis revealed significant enrichment of functional categories for each cluster, indicating that viroid infection triggers important changes in chloroplast, cell wall, peroxidase and symporter activities.
Collapse
Affiliation(s)
- Serena Rizza
- Department of Phytosanitary Sciences and Technologies-University of Catania, Via S. Sofia 102, 95123 Catania, Italy
| | | | | | | | | | | | | |
Collapse
|
13
|
Castro-Melchor M, Le H, Hu WS. Transcriptome data analysis for cell culture processes. ADVANCES IN BIOCHEMICAL ENGINEERING/BIOTECHNOLOGY 2012; 127:27-70. [PMID: 22194060 DOI: 10.1007/10_2011_116] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/31/2023]
Abstract
In the past decade, DNA microarrays have fundamentally changed the way we study complex biological systems. By measuring the expression levels of thousands of transcripts, the paradigm of studying organisms has shifted from focusing on the local phenomena of a few genes to surveying the whole genome. DNA microarrays are used in a variety of ways, from simple comparisons between two samples to more intricate time-series studies. With the large number of genes being studied, the dimensionality of the problem is inevitably high. The analysis of microarray data thus requires specific approaches. In the case of time-series microarray studies, data analysis is further complicated by the correlation between successive time points in a series.In this review, we survey the methodologies used in the analysis of static and time-series microarray data, covering data pre-processing, identification of differentially expressed genes, profile pattern recognition, pathway analysis, and network reconstruction. When available, examples of their use in mammalian cell cultures are presented.
Collapse
|
14
|
Khalaf AA, Gmitter FG, Conesa A, Dopazo J, Moore GA. Fortunella margarita transcriptional reprogramming triggered by Xanthomonas citri subsp. citri. BMC PLANT BIOLOGY 2011; 11:159. [PMID: 22078099 PMCID: PMC3235979 DOI: 10.1186/1471-2229-11-159] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/22/2011] [Accepted: 11/11/2011] [Indexed: 05/04/2023]
Abstract
BACKGROUND Citrus canker disease caused by the bacterial pathogen Xanthomonas citri subsp. citri (Xcc) has become endemic in areas where high temperature, rain, humidity, and windy conditions provide a favourable environment for the dissemination of the bacterium. Xcc is pathogenic on many commercial citrus varieties but appears to elicit an incompatible reaction on the citrus relative Fortunella margarita Swing (kumquat), in the form of a very distinct delayed necrotic response. We have developed subtractive libraries enriched in sequences expressed in kumquat leaves during both early and late stages of the disease. The isolated differentially expressed transcripts were subsequently sequenced. Our results demonstrate how the use of microarray expression profiling can help assign roles to previously uncharacterized genes and elucidate plant pathogenesis-response related mechanisms. This can be considered to be a case study in a citrus relative where high throughput technologies were utilized to understand defence mechanisms in Fortunella and citrus at the molecular level. RESULTS cDNAs from sequenced kumquat libraries (ESTs) made from subtracted RNA populations, healthy vs. infected, were used to make this microarray. Of 2054 selected genes on a customized array, 317 were differentially expressed (P < 0.05) in Xcc challenged kumquat plants compared to mock-inoculated ones. This study identified components of the incompatible interaction such as reactive oxygen species (ROS) and programmed cell death (PCD). Common defence mechanisms and a number of resistance genes were also identified. In addition, there were a considerable number of differentially regulated genes that had no homologues in the databases. This could be an indication of either a specialized set of genes employed by kumquat in response to canker disease or new defence mechanisms in citrus. CONCLUSION Functional categorization of kumquat Xcc-responsive genes revealed an enhanced defence-related metabolism as well as a number of resistant response-specific genes in the kumquat transcriptome in response to Xcc inoculation. Gene expression profile(s) were analyzed to assemble a comprehensive and inclusive image of the molecular interaction in the kumquat/Xcc system. This was done in order to elucidate molecular mechanisms associated with the development of the hypersensitive response phenotype in kumquat leaves. These data will be used to perform comparisons among citrus species to evaluate means to enhance the host immune responses against bacterial diseases.
Collapse
Affiliation(s)
- Abeer A Khalaf
- Plant Molecular and Cellular Biology Program (PMCB), Horticultural Sciences Department, University of Florida, Gainesville, Fl., 32611,USA
- PMCB, Citrus Research and Education Center, University of Florida, Lake Alfred, Fl., USA
| | - Frederick G Gmitter
- PMCB, Citrus Research and Education Center, University of Florida, Lake Alfred, Fl., USA
| | - Ana Conesa
- Centro de Investigación Príncipe Felipe,Valencia, SPAIN
| | | | - Gloria A Moore
- Plant Molecular and Cellular Biology Program (PMCB), Horticultural Sciences Department, University of Florida, Gainesville, Fl., 32611,USA
| |
Collapse
|
15
|
Yung S, Ledran M, Moreno-Gimeno I, Conesa A, Montaner D, Dopazo J, Dimmick I, Slater NJ, Marenah L, Real PJ, Paraskevopoulou I, Bisbal V, Burks D, Santibanez-Koref M, Moreno R, Mountford J, Menendez P, Armstrong L, Lako M. Large-scale transcriptional profiling and functional assays reveal important roles for Rho-GTPase signalling and SCL during haematopoietic differentiation of human embryonic stem cells. Hum Mol Genet 2011; 20:4932-46. [PMID: 21937587 DOI: 10.1093/hmg/ddr431] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023] Open
Abstract
Understanding the transcriptional cues that direct differentiation of human embryonic stem cells (hESCs) and human-induced pluripotent stem cells to defined and functional cell types is essential for future clinical applications. In this study, we have compared transcriptional profiles of haematopoietic progenitors derived from hESCs at various developmental stages of a feeder- and serum-free differentiation method and show that the largest transcriptional changes occur during the first 4 days of differentiation. Data mining on the basis of molecular function revealed Rho-GTPase signalling as a key regulator of differentiation. Inhibition of this pathway resulted in a significant reduction in the numbers of emerging haematopoietic progenitors throughout the differentiation window, thereby uncovering a previously unappreciated role for Rho-GTPase signalling during human haematopoietic development. Our analysis indicated that SCL was the 11th most upregulated transcript during the first 4 days of the hESC differentiation process. Overexpression of SCL in hESCs promoted differentiation to meso-endodermal lineages, the emergence of haematopoietic and erythro-megakaryocytic progenitors and accelerated erythroid differentiation. Importantly, intrasplenic transplantation of SCL-overexpressing hESC-derived haematopoietic cells enhanced recovery from induced acute anaemia without significant cell engraftment, suggesting a paracrine-mediated effect.
Collapse
Affiliation(s)
- Sun Yung
- Institute of Genetic Medicine, Newcastle University, Newcastle upon Tyne, UK
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
16
|
Salem RM, O'Connor DT, Schork NJ. Curve-based multivariate distance matrix regression analysis: application to genetic association analyses involving repeated measures. Physiol Genomics 2010; 42:236-47. [PMID: 20423962 PMCID: PMC3032281 DOI: 10.1152/physiolgenomics.00118.2009] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2009] [Accepted: 04/21/2010] [Indexed: 01/09/2023] Open
Abstract
Most, if not all, human phenotypes exhibit a temporal, dosage-dependent, or age effect. Despite this fact, it is rare that data are collected over time or in sequence in relevant studies of the determinants of these phenotypes. The costs and organizational sophistication necessary to collect repeated measurements or longitudinal data for a given phenotype are clearly impediments to this, but greater efforts in this area are needed if insights into human phenotypic expression are to be obtained. Appropriate data analysis methods for genetic association studies involving repeated or longitudinal measures are also needed. We consider the use of longitudinal profiles obtained from fitted functions on repeated data collections from a set of individuals whose similarities are contrasted between sets of individuals with different genotypes to test hypotheses about genetic influences on time-dependent phenotype expression. The proposed approach can accommodate uncertainty of the fitted functions, as well as weighting factors across the time points, and is easily extended to a wide variety of complex analysis settings. We showcase the proposed approach with data from a clinical study investigating human blood vessel response to tyramine. We also compare the proposed approach with standard analytic procedures and investigate its robustness and power via simulation studies. The proposed approach is found to be quite flexible and performs either as well or better than traditional statistical methods.
Collapse
|
17
|
Nueda MJ, Carbonell J, Medina I, Dopazo J, Conesa A. Serial Expression Analysis: a web tool for the analysis of serial gene expression data. Nucleic Acids Res 2010; 38:W239-45. [PMID: 20525784 PMCID: PMC2896172 DOI: 10.1093/nar/gkq488] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Serial transcriptomics experiments investigate the dynamics of gene expression changes associated with a quantitative variable such as time or dosage. The statistical analysis of these data implies the study of global and gene-specific expression trends, the identification of significant serial changes, the comparison of expression profiles and the assessment of transcriptional changes in terms of cellular processes. We have created the SEA (Serial Expression Analysis) suite to provide a complete web-based resource for the analysis of serial transcriptomics data. SEA offers five different algorithms based on univariate, multivariate and functional profiling strategies framed within a user-friendly interface and a project-oriented architecture to facilitate the analysis of serial gene expression data sets from different perspectives. SEA is available at sea.bioinfo.cipf.es.
Collapse
Affiliation(s)
- Maria José Nueda
- Statistics and Operation Research Department, University of Alicante, Spain
| | | | | | | | | |
Collapse
|
18
|
D'Elia D, Gisel A, Eriksson NE, Kossida S, Mattila K, Klucar L, Bongcam-Rudloff E. The 20th anniversary of EMBnet: 20 years of bioinformatics for the Life Sciences community. BMC Bioinformatics 2009; 10 Suppl 6:S1. [PMID: 19534734 PMCID: PMC2697632 DOI: 10.1186/1471-2105-10-s6-s1] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/15/2023] Open
Abstract
The EMBnet Conference 2008, focusing on 'Leading Applications and Technologies in Bioinformatics', was organized by the European Molecular Biology network (EMBnet) to celebrate its 20th anniversary. Since its foundation in 1988, EMBnet has been working to promote collaborative development of bioinformatics services and tools to serve the European community of molecular biology laboratories. This conference was the first meeting organized by the network that was open to the international scientific community outside EMBnet. The conference covered a broad range of research topics in bioinformatics with a main focus on new achievements and trends in emerging technologies supporting genomics, transcriptomics and proteomics analyses such as high-throughput sequencing and data managing, text and data-mining, ontologies and Grid technologies. Papers selected for publication, in this supplement to BMC Bioinformatics, cover a broad range of the topics treated, providing also an overview of the main bioinformatics research fields that the EMBnet community is involved in.
Collapse
Affiliation(s)
- Domenica D'Elia
- Institute for Biomedical Technologies, CNR, Via Amendola 122/D, 70126 Bari, Italy
| | - Andreas Gisel
- Institute for Biomedical Technologies, CNR, Via Amendola 122/D, 70126 Bari, Italy
| | - Nils-Einar Eriksson
- Uppsala Biomedical Centre (BMC), Computing Department, University of Uppsala, Box 570 SE-751 23 Uppsala, Sweden
| | - Sophia Kossida
- Bioinformatics & Medical Informatics Team, Biomedical Research Foundation of the Academy of Athens, 11527 Athens, Greece
| | - Kimmo Mattila
- CSC – IT Center for Science Ltd., Keilaranta 14, 02100 Espoo, Finland
| | - Lubos Klucar
- Institute of Molecular Biology, Slovak Academy of Sciences, Dubravska cesta 21, 84551 Bratislava, Slovakia
| | - Erik Bongcam-Rudloff
- Department of Animal Breeding and Genetics, Swedish University of Agricultural Sciences, 75024 Uppsala, Sweden
| |
Collapse
|