1
|
Majumder A, Sarkar M, Sharma P. A Composite Mode Differential Gene Regulatory Architecture based on Temporal Expression Profiles. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2019; 16:1785-1793. [PMID: 29993888 DOI: 10.1109/tcbb.2018.2828418] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]
Abstract
Exploring the complex interactive mechanism in a Gene Regulatory Network (GRN) developed using transcriptome data obtained from standard microarray and/or RNA-seq experiments helps us to understand the triggering factors in cancer research. The Transcription Factor (TF) genes generate protein complexes which affect the transcription of various target genes. However, considering the mode of regulation in a time frame such transcriptional activities are dependent on some specific activation time points only. It is also crucial to check whether the regulating capabilities are uniform across varied stages, especially when periodicity is a big issue. In this context, we propose an algorithm called RIFT which helps to monitor the temporal differential regulatory pattern of a Differentially Expressed (DE) target gene either by a TF gene or a group of TF genes from a large time series (TS) data. We have tested our algorithm on HeLa cell cycle data and compared the result with its most advanced state of the art counterpart proposed so far. As our algorithm yields up stringent mode and target specific significant valid TF genes for a DE gene, we can expect to have new forms of genetic interactions.
Collapse
|
2
|
Iuchi H, Sugimoto M, Tomita M. MICOP: Maximal information coefficient-based oscillation prediction to detect biological rhythms in proteomics data. BMC Bioinformatics 2018; 19:249. [PMID: 29954316 PMCID: PMC6025708 DOI: 10.1186/s12859-018-2257-4] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2017] [Accepted: 06/20/2018] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Circadian rhythms comprise oscillating molecular interactions, the disruption of the homeostasis of which would cause various disorders. To understand this phenomenon systematically, an accurate technique to identify oscillating molecules among omics datasets must be developed; however, this is still impeded by many difficulties, such as experimental noise and attenuated amplitude. RESULTS To address these issues, we developed a new algorithm named Maximal Information Coefficient-based Oscillation Prediction (MICOP), a sine curve-matching method. The performance of MICOP in labeling oscillation or non-oscillation was compared with four reported methods using Mathews correlation coefficient (MCC) values. The numerical experiments were performed with time-series data with (1) mimicking of molecular oscillation decay, (2) high noise and low sampling frequency and (3) one-cycle data. The first experiment revealed that MICOP could accurately identify the rhythmicity of decaying molecular oscillation (MCC > 0.7). The second experiment revealed that MICOP was robust against high-level noise (MCC > 0.8) even upon the use of low-sampling-frequency data. The third experiment revealed that MICOP could accurately identify the rhythmicity of noisy one-cycle data (MCC > 0.8). As an application, we utilized MICOP to analyze time-series proteome data of mouse liver. MICOP identified that novel oscillating candidates numbered 14 and 30 for C57BL/6 and C57BL/6 J, respectively. CONCLUSIONS In this paper, we presented MICOP, which is an MIC-based algorithm, for predicting periodic patterns in large-scale time-resolved protein expression profiles. The performance test using artificially generated simulation data revealed that the performance of MICOP for decaying data was superior to that of the existing widely used methods. It can reveal novel findings from time-series data and may contribute to biologically significant results. This study suggests that MICOP is an ideal approach for detecting and characterizing oscillations in time-resolved omics data sets.
Collapse
Affiliation(s)
- Hitoshi Iuchi
- Systems Biology Program, Graduate School of Media and Governance, Keio University, Fujisawa, 252-8520, Japan.,Institute for Advanced Biosciences, Keio University, Tsuruoka, 997-0052, Japan
| | - Masahiro Sugimoto
- Institute for Advanced Biosciences, Keio University, Tsuruoka, 997-0052, Japan. .,Health Promotion and Preemptive Medicine, Research and Development Center for Minimally Invasive Therapies, Tokyo Medical University, Shinjuku, Tokyo, 160-0022, Japan.
| | - Masaru Tomita
- Systems Biology Program, Graduate School of Media and Governance, Keio University, Fujisawa, 252-8520, Japan.,Institute for Advanced Biosciences, Keio University, Tsuruoka, 997-0052, Japan.,Department of Environment and Information Studies, Keio University, Fujisawa, 252-8520, Japan
| |
Collapse
|
3
|
Ostaszewski M, Eifes S, del Sol A. Evolutionary conservation and network structure characterize genes of phenotypic relevance for mitosis in human. PLoS One 2012; 7:e36488. [PMID: 22577488 PMCID: PMC3342260 DOI: 10.1371/journal.pone.0036488] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2011] [Accepted: 04/07/2012] [Indexed: 11/19/2022] Open
Abstract
The impact of gene silencing on cellular phenotypes is difficult to establish due to the complexity of interactions in the associated biological processes and pathways. A recent genome-wide RNA knock-down study both identified and phenotypically characterized a set of important genes for the cell cycle in HeLa cells. Here, we combine a molecular interaction network analysis, based on physical and functional protein interactions, in conjunction with evolutionary information, to elucidate the common biological and topological properties of these key genes. Our results show that these genes tend to be conserved with their corresponding protein interactions across several species and are key constituents of the evolutionary conserved molecular interaction network. Moreover, a group of bistable network motifs is found to be conserved within this network, which are likely to influence the network stability and therefore the robustness of cellular functioning. They form a cluster, which displays functional homogeneity and is significantly enriched in genes phenotypically relevant for mitosis. Additional results reveal a relationship between specific cellular processes and the phenotypic outcomes induced by gene silencing. This study introduces new ideas regarding the relationship between genotype and phenotype in the context of the cell cycle. We show that the analysis of molecular interaction networks can result in the identification of genes relevant to cellular processes, which is a promising avenue for future research.
Collapse
Affiliation(s)
| | - Serge Eifes
- Luxembourg Centre for Systems Biomedicine, Luxembourg, Luxembourg
| | - Antonio del Sol
- Luxembourg Centre for Systems Biomedicine, Luxembourg, Luxembourg
- * E-mail:
| |
Collapse
|
4
|
Transcriptome phase distribution analysis reveals diurnal regulated biological processes and key pathways in rice flag leaves and seedling leaves. PLoS One 2011; 6:e17613. [PMID: 21407816 PMCID: PMC3047585 DOI: 10.1371/journal.pone.0017613] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2010] [Accepted: 02/01/2011] [Indexed: 12/04/2022] Open
Abstract
Plant diurnal oscillation is a 24-hour period based variation. The correlation between diurnal genes and biological pathways was widely revealed by microarray analysis in different species. Rice (Oryza sativa) is the major food staple for about half of the world's population. The rice flag leaf is essential in providing photosynthates to the grain filling. However, there is still no comprehensive view about the diurnal transcriptome for rice leaves. In this study, we applied rice microarray to monitor the rhythmically expressed genes in rice seedling and flag leaves. We developed a new computational analysis approach and identified 6,266 (10.96%) diurnal probe sets in seedling leaves, 13,773 (24.08%) diurnal probe sets in flag leaves. About 65% of overall transcription factors were identified as flag leaf preferred. In seedling leaves, the peak of phase distribution was from 2:00am to 4:00am, whereas in flag leaves, the peak was from 8:00pm to 2:00am. The diurnal phase distribution analysis of gene ontology (GO) and cis-element enrichment indicated that, some important processes were waken by the light, such as photosynthesis and abiotic stimulus, while some genes related to the nuclear and ribosome involved processes were active mostly during the switch time of light to dark. The starch and sucrose metabolism pathway genes also showed diurnal phase. We conducted comparison analysis between Arabidopsis and rice leaf transcriptome throughout the diurnal cycle. In summary, our analysis approach is feasible for relatively unbiased identification of diurnal transcripts, efficiently detecting some special periodic patterns with non-sinusoidal periodic patterns. Compared to the rice flag leaves, the gene transcription levels of seedling leaves were relatively limited to the diurnal rhythm. Our comprehensive microarray analysis of seedling and flag leaves of rice provided an overview of the rice diurnal transcriptome and indicated some diurnal regulated biological processes and key functional pathways in rice.
Collapse
|
5
|
Yang R, Su Z. Analyzing circadian expression data by harmonic regression based on autoregressive spectral estimation. Bioinformatics 2010; 26:i168-74. [PMID: 20529902 PMCID: PMC2881374 DOI: 10.1093/bioinformatics/btq189] [Citation(s) in RCA: 134] [Impact Index Per Article: 9.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Motivation: Circadian rhythms are prevalent in most organisms. Identification of circadian-regulated genes is a crucial step in discovering underlying pathways and processes that are clock-controlled. Such genes are largely detected by searching periodic patterns in microarray data. However, temporal gene expression profiles usually have a short time-series with low sampling frequency and high levels of noise. This makes circadian rhythmic analysis of temporal microarray data very challenging. Results: We propose an algorithm named ARSER, which combines time domain and frequency domain analysis for extracting and characterizing rhythmic expression profiles from temporal microarray data. ARSER employs autoregressive spectral estimation to predict an expression profile's periodicity from the frequency spectrum and then models the rhythmic patterns by using a harmonic regression model to fit the time-series. ARSER describes the rhythmic patterns by four parameters: period, phase, amplitude and mean level, and measures the multiple testing significance by false discovery rate q-value. When tested on well defined periodic and non-periodic short time-series data, ARSER was superior to two existing and widely-used methods, COSOPT and Fisher's G-test, during identification of sinusoidal and non-sinusoidal periodic patterns in short, noisy and non-stationary time-series. Finally, analysis of Arabidopsis microarray data using ARSER led to identification of a novel set of previously undetected non-sinusoidal periodic transcripts, which may lead to new insights into molecular mechanisms of circadian rhythms. Availability: ARSER is implemented by Python and R. All source codes are available from http://bioinformatics.cau.edu.cn/ARSER Contact:zhensu@cau.edu.cn
Collapse
Affiliation(s)
- Rendong Yang
- Division of Bioinformatics, State Key Laboratory of Plant Physiology and Biochemistry, College of Biological Sciences, China Agricultural University, Beijing 100193, China
| | | |
Collapse
|
6
|
Pyne S, Gutman R, Kim CS, Futcher B. Phase Coupled Meta-analysis: sensitive detection of oscillations in cell cycle gene expression, as applied to fission yeast. BMC Genomics 2009; 10:440. [PMID: 19761608 PMCID: PMC2753555 DOI: 10.1186/1471-2164-10-440] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2009] [Accepted: 09/17/2009] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Many genes oscillate in their level of expression through the cell division cycle. Previous studies have identified such genes by applying Fourier analysis to cell cycle time course experiments. Typically, such analyses generate p-values; i.e., an oscillating gene has a small p-value, and the observed oscillation is unlikely due to chance. When multiple time course experiments are integrated, p-values from the individual experiments are combined using classical meta-analysis techniques. However, this approach sacrifices information inherent in the individual experiments, because the hypothesis that a gene is regulated according to the time in the cell cycle makes two independent predictions: first, that an oscillation in expression will be observed; and second, that gene expression will always peak in the same phase of the cell cycle, such as S-phase. Approaches that simply combine p-values ignore the second prediction. RESULTS Here, we improve the detection of cell cycle oscillating genes by systematically taking into account the phase of peak gene expression. We design a novel meta-analysis measure based on vector addition: when a gene peaks or troughs in all experiments in the same phase of the cell cycle, the representative vectors add to produce a large final vector. Conversely, when the peaks in different experiments are in various phases of the cycle, vector addition produces a small final vector. We apply the measure to ten genome-wide cell cycle time course experiments from the fission yeast Schizosaccharomyces pombe, and detect many new, weakly oscillating genes. CONCLUSION A very large fraction of all genes in S. pombe, perhaps one-quarter to one-half, show some cell cycle oscillation, although in many cases these oscillations may be incidental rather than adaptive.
Collapse
Affiliation(s)
- Saumyadipta Pyne
- Broad Institute of Massachusetts Institute of Technology and Harvard University, Cambridge, MA 02142, USA.
| | | | | | | |
Collapse
|
7
|
Duffield G, Loros JJ, Dunlap JC. Analysis of circadian output rhythms of gene expression in Neurospora and mammalian cells in culture. Methods Enzymol 2008; 393:315-41. [PMID: 15817297 DOI: 10.1016/s0076-6879(05)93014-0] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/02/2023]
Abstract
The true biology of chronobiology lies in the spectrum of processes that are controlled by the circadian clock. Although this biology plays out at the level of the whole organism, it derives, finally, from clock-driven changes in physiology, and frequently in gene expression, that occur at the level of individual cells. For this reason, analysis of gene expression rhythms measured in cell culture or in organisms that elaborate only a few cell types provides insights not possible in multicellular organisms. In this context we have used mammalian fibroblasts in culture as well as the eukaryotic filamentous fungus Neurospora crassa to study circadian output, in particular the output rhythms in gene expression that underlie so much of circadian biology. Each cell type has its own advantages: Data from mammalian cells are obviously immediately pertinent to animal cell rhythms, but the system allows little genetics and only limited amounts of material can be collected. Alternatively, Neurospora allows genetic and molecular analyses and is useful for developing concepts and models of output that can be examined in other contexts. This methods article focuses on these two systems for analysis, providing an overview of how control is presently viewed followed by current methods for its analysis.
Collapse
Affiliation(s)
- Giles Duffield
- Department of Genetics, Dartmouth Medical School, Hanover, New Hampshire 03755, USA
| | | | | |
Collapse
|
8
|
|
9
|
Gauthier NP, Larsen ME, Wernersson R, de Lichtenberg U, Jensen LJ, Brunak S, Jensen TS. Cyclebase.org--a comprehensive multi-organism online database of cell-cycle experiments. Nucleic Acids Res 2007; 36:D854-9. [PMID: 17940094 PMCID: PMC2238932 DOI: 10.1093/nar/gkm729] [Citation(s) in RCA: 51] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
The past decade has seen the publication of a large number of cell-cycle microarray studies and many more are in the pipeline. However, data from these experiments are not easy to access, combine and evaluate. We have developed a centralized database with an easy-to-use interface, Cyclebase.org, for viewing and downloading these data. The user interface facilitates searches for genes of interest as well as downloads of genome-wide results. Individual genes are displayed with graphs of expression profiles throughout the cell cycle from all available experiments. These expression profiles are normalized to a common timescale to enable inspection of the combined experimental evidence. Furthermore, state-of-the-art computational analyses provide key information on both individual experiments and combined datasets such as whether or not a gene is periodically expressed and, if so, the time of peak expression. Cyclebase is available at http://www.cyclebase.org.
Collapse
Affiliation(s)
- Nicholas Paul Gauthier
- Center for Biological Sequence Analysis, BioCentrum-DTU, Technical University of Denmark, Building 208, DK-2800 Lyngby, Denmark
| | | | | | | | | | | | | |
Collapse
|
10
|
Abstract
Grouping of gene expression patterns across biological experiments, treatments and time-series data is performed in q-intervals of measurements using phase-shifted analysis of gene expression (PAGE); a Java-based tool to find clusters of genes that share trends of expression profiles within the dataset. The patterns and genes within q-Clusters are visualized in trend plots and compared to determine biological relevance from the gene annotations.
Collapse
Affiliation(s)
- Elo Leung
- Bioinformatics and Computational Biology, School of Computational Sciences, George Mason University, Manassas, VA 20110, USA
| | | |
Collapse
|
11
|
Glynn EF, Chen J, Mushegian AR. Detecting periodic patterns in unevenly spaced gene expression time series using Lomb-Scargle periodograms. ACTA ACUST UNITED AC 2005; 22:310-6. [PMID: 16303799 DOI: 10.1093/bioinformatics/bti789] [Citation(s) in RCA: 130] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
MOTIVATION Periodic patterns in time series resulting from biological experiments are of great interest. The commonly used Fast Fourier Transform (FFT) algorithm is applicable only when data are evenly spaced and when no values are missing, which is not always the case in high-throughput measurements. The choice of statistic to evaluate the significance of the periodic patterns for unevenly spaced gene expression time series has not been well substantiated. METHODS The Lomb-Scargle periodogram approach is used to search time series of gene expression to quantify the periodic behavior of every gene represented on the DNA array. The Lomb-Scargle periodogram analysis provides a direct method to treat missing values and unevenly spaced time points. We propose the combination of a Lomb-Scargle test statistic for periodicity and a multiple hypothesis testing procedure with controlled false discovery rate to detect significant periodic gene expression patterns. RESULTS We analyzed the Plasmodium falciparum gene expression dataset. In the Quality Control Dataset of 5080 expression patterns, we found 4112 periodic probes. In addition, we identified 243 probes with periodic expression in the Complete Dataset, which could not be examined in the original study by the FFT analysis due to an excessive number of missing values. While most periodic genes had a period of 48 h, some had a period close to 24 h. Our approach should be applicable for detection and quantification of periodic patterns in any unevenly spaced gene expression time-series data.
Collapse
Affiliation(s)
- Earl F Glynn
- Stowers Institute for Medical Research, 1000 East 50th Street, Kansas City, MO 64110, USA
| | | | | |
Collapse
|