1
|
Williams JR, Yang R, Clifford JL, Watson D, Campbell R, Getnet D, Kumar R, Hammamieh R, Jett M. Functional Heatmap: an automated and interactive pattern recognition tool to integrate time with multi-omics assays. BMC Bioinformatics 2019; 20:81. [PMID: 30770734 PMCID: PMC6377781 DOI: 10.1186/s12859-019-2657-0] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2018] [Accepted: 01/28/2019] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Life science research is moving quickly towards large-scale experimental designs that are comprised of multiple tissues, time points, and samples. Omic time-series experiments offer answers to three big questions: what collective patterns do most analytes follow, which analytes follow an identical pattern or synchronize across multiple cohorts, and how do biological functions evolve over time. Existing tools fall short of robustly answering and visualizing all three questions in a unified interface. RESULTS Functional Heatmap offers time-series data visualization through a Master Panel page, and Combined page to answer each of the three time-series questions. It dissects the complex multi-omics time-series readouts into patterned clusters with associated biological functions. It allows users to identify a cascade of functional changes over a time variable. Inversely, Functional Heatmap can compare a pattern with specific biology respond to multiple experimental conditions. All analyses are interactive, searchable, and exportable in a form of heatmap, line-chart, or text, and the results are easy to share, maintain, and reproduce on the web platform. CONCLUSIONS Functional Heatmap is an automated and interactive tool that enables pattern recognition in time-series multi-omics assays. It significantly reduces the manual labour of pattern discovery and comparison by transferring statistical models into visual clues. The new pattern recognition feature will help researchers identify hidden trends driven by functional changes using multi-tissues/conditions on a time-series fashion from omic assays.
Collapse
Affiliation(s)
- Joshua R. Williams
- Advanced Biomedical Computational Science, Frederick National Laboratory for Cancer Research sponsored by the National Cancer Institute, Frederick, MD 21702-5010 USA
- Integrative Systems Biology Program, US Army Center for Environmental Health Research, Fort Detrick, Frederick, MD 21702-5010 USA
| | - Ruoting Yang
- Advanced Biomedical Computational Science, Frederick National Laboratory for Cancer Research sponsored by the National Cancer Institute, Frederick, MD 21702-5010 USA
- Integrative Systems Biology Program, US Army Center for Environmental Health Research, Fort Detrick, Frederick, MD 21702-5010 USA
| | - John L. Clifford
- Integrative Systems Biology Program, US Army Center for Environmental Health Research, Fort Detrick, Frederick, MD 21702-5010 USA
| | - Daniel Watson
- Advanced Biomedical Computational Science, Frederick National Laboratory for Cancer Research sponsored by the National Cancer Institute, Frederick, MD 21702-5010 USA
| | - Ross Campbell
- Advanced Biomedical Computational Science, Frederick National Laboratory for Cancer Research sponsored by the National Cancer Institute, Frederick, MD 21702-5010 USA
- Integrative Systems Biology Program, US Army Center for Environmental Health Research, Fort Detrick, Frederick, MD 21702-5010 USA
| | - Derese Getnet
- Integrative Systems Biology Program, US Army Center for Environmental Health Research, Fort Detrick, Frederick, MD 21702-5010 USA
| | - Raina Kumar
- Advanced Biomedical Computational Science, Frederick National Laboratory for Cancer Research sponsored by the National Cancer Institute, Frederick, MD 21702-5010 USA
- Integrative Systems Biology Program, US Army Center for Environmental Health Research, Fort Detrick, Frederick, MD 21702-5010 USA
| | - Rasha Hammamieh
- Integrative Systems Biology Program, US Army Center for Environmental Health Research, Fort Detrick, Frederick, MD 21702-5010 USA
| | - Marti Jett
- Integrative Systems Biology Program, US Army Center for Environmental Health Research, Fort Detrick, Frederick, MD 21702-5010 USA
| |
Collapse
|
2
|
Pediatric asthma and autism-genomic perspectives. Clin Transl Med 2015; 4:37. [PMID: 26668064 PMCID: PMC4678135 DOI: 10.1186/s40169-015-0078-x] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2015] [Accepted: 11/29/2015] [Indexed: 02/06/2023] Open
Abstract
High-throughput technologies, ranging from microarrays to NexGen sequencing of RNA and genomic DNA, have opened new avenues for exploration of the pathobiology of human disease. Comparisons of the architecture of the genome, identification of mutated or modified sequences, and pre-and post- transcriptional regulation of gene expression as disease specific biomarkers are revolutionizing our understanding of the causes of disease and are guiding the development of new therapies. There is enormous heterogeneity in types of genomic variation that occur in human disease. Some are inherited, while others are the result of new somatic or germline mutations or errors in chromosomal replication. In this review, we provide examples of changes that occur in the human genome in two of the most common chronic pediatric disorders, autism and asthma. The incidence and economic burden of both of these disorders are increasing worldwide. Genomic variations have the potential to serve as biomarkers for personalization of therapy and prediction of outcomes.
Collapse
|
3
|
Hejblum BP, Skinner J, Thiébaut R. Time-Course Gene Set Analysis for Longitudinal Gene Expression Data. PLoS Comput Biol 2015; 11:e1004310. [PMID: 26111374 PMCID: PMC4482329 DOI: 10.1371/journal.pcbi.1004310] [Citation(s) in RCA: 43] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2014] [Accepted: 04/30/2015] [Indexed: 01/13/2023] Open
Abstract
Gene set analysis methods, which consider predefined groups of genes in the analysis of genomic data, have been successfully applied for analyzing gene expression data in cross-sectional studies. The time-course gene set analysis (TcGSA) introduced here is an extension of gene set analysis to longitudinal data. The proposed method relies on random effects modeling with maximum likelihood estimates. It allows to use all available repeated measurements while dealing with unbalanced data due to missing at random (MAR) measurements. TcGSA is a hypothesis driven method that identifies a priori defined gene sets with significant expression variations over time, taking into account the potential heterogeneity of expression within gene sets. When biological conditions are compared, the method indicates if the time patterns of gene sets significantly differ according to these conditions. The interest of the method is illustrated by its application to two real life datasets: an HIV therapeutic vaccine trial (DALIA-1 trial), and data from a recent study on influenza and pneumococcal vaccines. In the DALIA-1 trial TcGSA revealed a significant change in gene expression over time within 69 gene sets during vaccination, while a standard univariate individual gene analysis corrected for multiple testing as well as a standard a Gene Set Enrichment Analysis (GSEA) for time series both failed to detect any significant pattern change over time. When applied to the second illustrative data set, TcGSA allowed the identification of 4 gene sets finally found to be linked with the influenza vaccine too although they were found to be associated to the pneumococcal vaccine only in previous analyses. In our simulation study TcGSA exhibits good statistical properties, and an increased power compared to other approaches for analyzing time-course expression patterns of gene sets. The method is made available for the community through an R package. Gene set analysis methods use prior biological knowledge to analyze gene expression data. This prior knowledge takes the form of predefined groups of genes, linked through their biological function. Gene set analysis methods have been successfully applied in transversal studies, their results being more sensitive and interpretable than those of methods investigating genomic data one gene at a time. The time-course gene set analysis (TcGSA) introduced here is an extension of such gene set analysis to longitudinal data. This method identifies a priori defined groups of genes whose expression is not stable over time, taking into account the potential heterogeneity between patients and between genes. When biological conditions are compared, it identifies the gene sets that have different expression dynamics according to these conditions. Data from 2 studies are analyzed: data from an HIV therapeutic vaccine trial, and data from a recent study on influenza and pneumococcal vaccines. In both cases, TcGSA provided new insights compared to standard approaches thanks to an increased sensitivity compared to other approaches. Those results highlight the benefits of the TcGSA method for analyzing gene expression dynamics.
Collapse
Affiliation(s)
- Boris P. Hejblum
- Univ. Bordeaux, ISPED, Centre INSERM U897-Epidemiologie-Biostatistique, F-33000 Bordeaux, France
- INSERM, ISPED, Centre INSERM U897-Epidemiologie-Biostatistique, F-33000 Bordeaux, France
- INRIA, Team SISTM, F-33000 Bordeaux, France
- Vaccine Research Institute-VRI, Hôpital Henri Mondor, Créteil, France
- Baylor Institute for Immunology Research, Dallas, Texas, United States of America
| | - Jason Skinner
- Vaccine Research Institute-VRI, Hôpital Henri Mondor, Créteil, France
- Baylor Institute for Immunology Research, Dallas, Texas, United States of America
| | - Rodolphe Thiébaut
- Univ. Bordeaux, ISPED, Centre INSERM U897-Epidemiologie-Biostatistique, F-33000 Bordeaux, France
- INSERM, ISPED, Centre INSERM U897-Epidemiologie-Biostatistique, F-33000 Bordeaux, France
- INRIA, Team SISTM, F-33000 Bordeaux, France
- Vaccine Research Institute-VRI, Hôpital Henri Mondor, Créteil, France
- Baylor Institute for Immunology Research, Dallas, Texas, United States of America
- * E-mail:
| |
Collapse
|
4
|
Moni MA, Liò P. Network-based analysis of comorbidities risk during an infection: SARS and HIV case studies. BMC Bioinformatics 2014; 15:333. [PMID: 25344230 PMCID: PMC4363349 DOI: 10.1186/1471-2105-15-333] [Citation(s) in RCA: 75] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2013] [Accepted: 09/19/2014] [Indexed: 01/02/2023] Open
Abstract
Background Infections are often associated to comorbidity that increases the risk of medical conditions which can lead to further morbidity and mortality. SARS is a threat which is similar to MERS virus, but the comorbidity is the key aspect to underline their different impacts. One UK doctor says "I’d rather have HIV than diabetes" as life expectancy among diabetes patients is lower than that of HIV. However, HIV has a comorbidity impact on the diabetes. Results We present a quantitative framework to compare and explore comorbidity between diseases. By using neighbourhood based benchmark and topological methods, we have built comorbidity relationships network based on the OMIM and our identified significant genes. Then based on the gene expression, PPI and signalling pathways data, we investigate the comorbidity association of these 2 infective pathologies with other 7 diseases (heart failure, kidney disorder, breast cancer, neurodegenerative disorders, bone diseases, Type 1 and Type 2 diabetes). Phenotypic association is measured by calculating both the Relative Risk as the quantified measures of comorbidity tendency of two disease pairs and the ϕ-correlation to measure the robustness of the comorbidity associations. The differential gene expression profiling strongly suggests that the response of SARS affected patients seems to be mainly an innate inflammatory response and statistically dysregulates a large number of genes, pathways and PPIs subnetworks in different pathologies such as chronic heart failure (21 genes), breast cancer (16 genes) and bone diseases (11 genes). HIV-1 induces comorbidities relationship with many other diseases, particularly strong correlation with the neurological, cancer, metabolic and immunological diseases. Similar comorbidities risk is observed from the clinical information. Moreover, SARS and HIV infections dysregulate 4 genes (ANXA3, GNS, HIST1H1C, RASA3) and 3 genes (HBA1, TFRC, GHITM) respectively that affect the ageing process. It is notable that HIV and SARS similarly dysregulated 11 genes and 3 pathways. Only 4 significantly dysregulated genes are common between SARS-CoV and MERS-CoV, including NFKBIA that is a key regulator of immune responsiveness implicated in susceptibility to infectious and inflammatory diseases. Conclusions Our method presents a ripe opportunity to use data-driven approaches for advancing our current knowledge on disease mechanism and predicting disease comorbidities in a quantitative way. Electronic supplementary material The online version of this article (doi:10.1186/1471-2105-15-333) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Mohammad Ali Moni
- Computer Laboratory, University of Cambridge, William Gates Building, 15 JJ Thomson Avenue, Cambridge CB3 0FD, UK.
| | | |
Collapse
|
5
|
Xu L, Zhao F, Ren H, Li L, Lu J, Liu J, Zhang S, Liu GE, Song J, Zhang L, Wei C, Du L. Co-expression analysis of fetal weight-related genes in ovine skeletal muscle during mid and late fetal development stages. Int J Biol Sci 2014; 10:1039-50. [PMID: 25285036 PMCID: PMC4183924 DOI: 10.7150/ijbs.9737] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2014] [Accepted: 08/16/2014] [Indexed: 11/05/2022] Open
Abstract
BACKGROUND Muscle development and lipid metabolism play important roles during fetal development stages. The commercial Texel sheep are more muscular than the indigenous Ujumqin sheep. RESULTS We performed serial transcriptomics assays and systems biology analyses to investigate the dynamics of gene expression changes associated with fetal longissimus muscles during different fetal stages in two sheep breeds. Totally, we identified 1472 differentially expressed genes during various fetal stages using time-series expression analysis. A systems biology approach, weighted gene co-expression network analysis (WGCNA), was used to detect modules of correlated genes among these 1472 genes. Dramatically different gene modules were identified in four merged datasets, corresponding to the mid fetal stage in Texel and Ujumqin sheep, the late fetal stage in Texel and Ujumqin sheep, respectively. We further detected gene modules significantly correlated with fetal weight, and constructed networks and pathways using genes with high significances. In these gene modules, we identified genes like TADA3, LMNB1, TGF-β3, EEF1A2, FGFR1, MYOZ1, and FBP2 correlated with fetal weight. CONCLUSION Our study revealed the complex network characteristics involved in muscle development and lipid metabolism during fetal development stages. Diverse patterns of the network connections observed between breeds and fetal stages could involve some hub genes, which play central roles in fetal development, correlating with fetal weight. Our findings could provide potential valuable biomarkers for selection of body weight-related traits in sheep and other livestock.
Collapse
Affiliation(s)
- Lingyang Xu
- 1. National Center for Molecular Genetics and Breeding of Animal, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, China; ; 4. Animal Genomics and Improvement Laboratory, U.S. Department of Agriculture-Agricultural Research Services, Beltsville, Maryland 20705, USA; ; 5. Department of Animal and Avian Sciences, University of Maryland, College Park, Maryland 20742, USA
| | - Fuping Zhao
- 1. National Center for Molecular Genetics and Breeding of Animal, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, China
| | - Hangxing Ren
- 1. National Center for Molecular Genetics and Breeding of Animal, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, China; ; 2. Chongqing Academy of Animal Sciences, Chongqing, 402460, China
| | - Li Li
- 1. National Center for Molecular Genetics and Breeding of Animal, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, China; ; 3. College of Animal Science and Technology, Sichuan Agricultural University, Ya'an, Sichuan, 625014, China
| | - Jian Lu
- 1. National Center for Molecular Genetics and Breeding of Animal, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, China
| | - Jiasen Liu
- 1. National Center for Molecular Genetics and Breeding of Animal, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, China
| | - Shifang Zhang
- 1. National Center for Molecular Genetics and Breeding of Animal, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, China
| | - George E Liu
- 4. Animal Genomics and Improvement Laboratory, U.S. Department of Agriculture-Agricultural Research Services, Beltsville, Maryland 20705, USA
| | - Jiuzhou Song
- 5. Department of Animal and Avian Sciences, University of Maryland, College Park, Maryland 20742, USA
| | - Li Zhang
- 1. National Center for Molecular Genetics and Breeding of Animal, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, China
| | - Caihong Wei
- 1. National Center for Molecular Genetics and Breeding of Animal, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, China
| | - Lixin Du
- 1. National Center for Molecular Genetics and Breeding of Animal, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100193, China
| |
Collapse
|
6
|
Nueda MJ, Tarazona S, Conesa A. Next maSigPro: updating maSigPro bioconductor package for RNA-seq time series. Bioinformatics 2014; 30:2598-602. [PMID: 24894503 PMCID: PMC4155246 DOI: 10.1093/bioinformatics/btu333] [Citation(s) in RCA: 211] [Impact Index Per Article: 21.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022] Open
Abstract
Motivation: The widespread adoption of RNA-seq to quantitatively measure gene expression has increased the scope of sequencing experimental designs to include time-course experiments. maSigPro is an R package specifically suited for the analysis of time-course gene expression data, which was developed originally for microarrays and hence was limited in its application to count data. Results: We have updated maSigPro to support RNA-seq time series analysis by introducing generalized linear models in the algorithm to support the modeling of count data while maintaining the traditional functionalities of the package. We show a good performance of the maSigPro-GLM method in several simulated time-course scenarios and in a real experimental dataset. Availability and implementation: The package is freely available under the LGPL license from the Bioconductor Web site (http://bioconductor.org). Contact:mj.nueda@ua.es or aconesa@cipf.es
Collapse
Affiliation(s)
- María José Nueda
- Statistics and Operational Research Department, University of Alicante, 03690, Alicante, Spain, Genomics of Gene Expression Laboratory, Prince Felipe Research Centre, 46012 Valencia, Spain and Applied Statistics, Operational Research and Quality Department, Polytechnic University of Valencia, 46020 Valencia, Spain
| | - Sonia Tarazona
- Statistics and Operational Research Department, University of Alicante, 03690, Alicante, Spain, Genomics of Gene Expression Laboratory, Prince Felipe Research Centre, 46012 Valencia, Spain and Applied Statistics, Operational Research and Quality Department, Polytechnic University of Valencia, 46020 Valencia, Spain Statistics and Operational Research Department, University of Alicante, 03690, Alicante, Spain, Genomics of Gene Expression Laboratory, Prince Felipe Research Centre, 46012 Valencia, Spain and Applied Statistics, Operational Research and Quality Department, Polytechnic University of Valencia, 46020 Valencia, Spain
| | - Ana Conesa
- Statistics and Operational Research Department, University of Alicante, 03690, Alicante, Spain, Genomics of Gene Expression Laboratory, Prince Felipe Research Centre, 46012 Valencia, Spain and Applied Statistics, Operational Research and Quality Department, Polytechnic University of Valencia, 46020 Valencia, Spain
| |
Collapse
|
7
|
Oh S, Song S, Dasgupta N, Grabowski G. The analytical landscape of static and temporal dynamics in transcriptome data. Front Genet 2014; 5:35. [PMID: 24600473 PMCID: PMC3929947 DOI: 10.3389/fgene.2014.00035] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2013] [Accepted: 01/30/2014] [Indexed: 12/16/2022] Open
Abstract
Interpreting gene expression profiles often involves statistical analysis of large numbers of differentially expressed genes, isoforms, and alternative splicing events at either static or dynamic spectrums. Reduced sequencing costs have made feasible dense time-series analysis of gene expression using RNA-seq; however, statistical methods in the context of temporal RNA-seq data are poorly developed. Here we will review current methods for identifying temporal changes in gene expression using RNA-seq, which are limited to static pairwise comparisons of time points and which fail to account for temporal dependencies in gene expression patterns. We also review recently developed very few number of temporal dynamic RNA-seq specific methods. Application and development of RNA-specific temporal dynamic methods have been continuously under the development, yet, it is still in infancy. We fully cover microarray specific temporal methods and transcriptome studies in initial digital technology (e.g., SAGE) between traditional microarray and new RNA-seq.
Collapse
Affiliation(s)
- Sunghee Oh
- Division of Human Genetics, Department of Pediatrics, Cincinnati Children's Hospital Medical Center Cincinnati, OH, USA
| | - Seongho Song
- Department of Mathematical Sciences, McMicken College of Arts and Sciences, University of Cincinnati Cincinnati, OH, USA
| | - Nupur Dasgupta
- Division of Human Genetics, Department of Pediatrics, Cincinnati Children's Hospital Medical Center Cincinnati, OH, USA
| | - Gregory Grabowski
- Division of Human Genetics, Department of Pediatrics, Cincinnati Children's Hospital Medical Center Cincinnati, OH, USA
| |
Collapse
|
8
|
Xue L, Cai JY, Ma J, Huang Z, Guo MX, Fu LZ, Shi YB, Li WX. Global expression profiling reveals genetic programs underlying the developmental divergence between mouse and human embryogenesis. BMC Genomics 2013; 14:568. [PMID: 23961710 PMCID: PMC3924405 DOI: 10.1186/1471-2164-14-568] [Citation(s) in RCA: 37] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2013] [Accepted: 07/30/2013] [Indexed: 01/18/2023] Open
Abstract
Background Mouse has served as an excellent model for studying human development and diseases due to its similarity to human. Advances in transgenic and knockout studies in mouse have dramatically strengthened the use of this model and significantly improved our understanding of gene function during development in the past few decades. More recently, global gene expression analyses have revealed novel features in early embryogenesis up to gastrulation stages and have indeed provided molecular evidence supporting the conservation in early development in human and mouse. On the other hand, little information is known about the gene regulatory networks governing the subsequent organogenesis. Importantly, mouse and human development diverges during organogenesis. For instance, the mouse embryo is born around the end of organogenesis while in human the subsequent fetal period of ongoing growth and maturation of most organs spans more than 2/3 of human embryogenesis. While two recent studies reported the gene expression profiles during human organogenesis, no global gene expression analysis had been done for mouse organogenesis. Results Here we report a detailed analysis of the global gene expression profiles from egg to the end of organogenesis in mouse. Our studies have revealed distinct temporal regulation patterns for genes belonging to different functional (Gene Ontology or GO) categories that support their roles during organogenesis. More importantly, comparative analyses identify both conserved and divergent gene regulation programs in mouse and human organogenesis, with the latter likely responsible for the developmental divergence between the two species, and further suggest a novel developmental strategy during vertebrate evolution. Conclusions We have reported here the first genome-wide gene expression analysis of the entire mouse embryogenesis and compared the transcriptome atlas during mouse and human embryogenesis. Given our earlier observation that genes function in a given process tends to be developmentally co-regulated during organogenesis, our microarray data here should help to identify genes associated with mouse development and/or infer the developmental functions of unknown genes. In addition, our study might be useful for invesgtigating the molecular basis of vertebrate evolution.
Collapse
Affiliation(s)
| | | | | | | | | | | | - Yun-Bo Shi
- College of Life Sciences, Wuhan University, Wuhan 430072, P,R China.
| | | |
Collapse
|
9
|
Time series expression analyses using RNA-seq: a statistical approach. BIOMED RESEARCH INTERNATIONAL 2013; 2013:203681. [PMID: 23586021 PMCID: PMC3622290 DOI: 10.1155/2013/203681] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/06/2012] [Revised: 01/10/2013] [Accepted: 01/15/2013] [Indexed: 11/29/2022]
Abstract
RNA-seq is becoming the de facto standard approach for transcriptome analysis with ever-reducing cost. It has considerable advantages over conventional technologies (microarrays) because it allows for direct identification and quantification of transcripts. Many time series RNA-seq datasets have been collected to study the dynamic regulations of transcripts. However, statistically rigorous and computationally efficient methods are needed to explore the time-dependent changes of gene expression in biological systems. These methods should explicitly account for the dependencies of expression patterns across time points. Here, we discuss several methods that can be applied to model timecourse RNA-seq data, including statistical evolutionary trajectory index (SETI), autoregressive time-lagged regression (AR(1)), and hidden Markov model (HMM) approaches. We use three real datasets and simulation studies to demonstrate the utility of these dynamic methods in temporal analysis.
Collapse
|