Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Zampieri M, Soranzo N, Altafini C. Discerning static and causal interactions in genome-wide reverse engineering problems. ACTA ACUST UNITED AC 2008;24:1510-5. [PMID: 18467346 DOI: 10.1093/bioinformatics/btn220] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/30/2023]

For:	Zampieri M, Soranzo N, Altafini C. Discerning static and causal interactions in genome-wide reverse engineering problems. ACTA ACUST UNITED AC 2008;24:1510-5. [PMID: 18467346 DOI: 10.1093/bioinformatics/btn220] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/30/2023]

Number

Cited by Other Article(s)

Zenere A, Rundquist O, Gustafsson M, Altafini C. Multi-omics protein-coding units as massively parallel Bayesian networks: empirical validation of causality structure. iScience 2022;25:104048. [PMID: 35355520 PMCID: PMC8958332 DOI: 10.1016/j.isci.2022.104048] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2021] [Revised: 01/17/2022] [Accepted: 03/08/2022] [Indexed: 11/29/2022] Open

Zenere A, Rundquist O, Gustafsson M, Altafini C. Using high-throughput multi-omics data to investigate structural balance in elementary gene regulatory network motifs. Bioinformatics 2021;38:173-178. [PMID: 34383882 PMCID: PMC8696094 DOI: 10.1093/bioinformatics/btab577] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2021] [Revised: 07/04/2021] [Accepted: 08/10/2021] [Indexed: 02/03/2023] Open

Iliopoulos A, Beis G, Apostolou P, Papasotiriou I. Complex Networks, Gene Expression and Cancer Complexity: A Brief Review of Methodology and Applications. Curr Bioinform 2020. [DOI: 10.2174/1574893614666191017093504] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]

Chowdhury HA, Bhattacharyya DK, Kalita JK. (Differential) Co-Expression Analysis of Gene Expression: A Survey of Best Practices. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2020;17:1154-1173. [PMID: 30668502 DOI: 10.1109/tcbb.2019.2893170] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/09/2023]

Rago A, Werren JH, Colbourne JK. Sex biased expression and co-expression networks in development, using the hymenopteran Nasonia vitripennis. PLoS Genet 2020;16:e1008518. [PMID: 31986136 PMCID: PMC7004391 DOI: 10.1371/journal.pgen.1008518] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2019] [Revised: 02/06/2020] [Accepted: 11/13/2019] [Indexed: 12/17/2022] Open

Mercatelli D, Scalambra L, Triboli L, Ray F, Giorgi FM. Gene regulatory network inference resources: A practical overview. BIOCHIMICA ET BIOPHYSICA ACTA-GENE REGULATORY MECHANISMS 2019;1863:194430. [PMID: 31678629 DOI: 10.1016/j.bbagrm.2019.194430] [Citation(s) in RCA: 58] [Impact Index Per Article: 11.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/31/2019] [Revised: 09/06/2019] [Accepted: 09/09/2019] [Indexed: 02/08/2023]

Abstract

Transcriptional regulation is a fundamental molecular mechanism involved in almost every aspect of life, from homeostasis to development, from metabolism to behavior, from reaction to stimuli to disease progression. In recent years, the concept of Gene Regulatory Networks (GRNs) has grown popular as an effective applied biology approach for describing the complex and highly dynamic set of transcriptional interactions, due to its easy-to-interpret features. Since cataloguing, predicting and understanding every GRN connection in all species and cellular contexts remains a great challenge for biology, researchers have developed numerous tools and methods to infer regulatory processes. In this review, we catalogue these methods in six major areas, based on the dominant underlying information leveraged to infer GRNs: Coexpression, Sequence Motifs, Chromatin Immunoprecipitation (ChIP), Orthology, Literature and Protein-Protein Interaction (PPI) specifically focused on transcriptional complexes. The methods described here cover a wide range of user-friendliness: from web tools that require no prior computational expertise to command line programs and algorithms for large scale GRN inferences. Each method for GRN inference described herein effectively illustrates a type of transcriptional relationship, with many methods being complementary to others. While a truly holistic approach for inferring and displaying GRNs remains one of the greatest challenges in the field of systems biology, we believe that the integration of multiple methods described herein provides an effective means with which experimental and computational biologists alike may obtain the most complete pictures of transcriptional relationships. This article is part of a Special Issue entitled: Transcriptional Profiles and Regulatory Gene Networks edited by Dr. Federico Manuel Giorgi and Dr. Shaun Mahony.

Collapse

Oliveira GB, Regitano LCA, Cesar ASM, Reecy JM, Degaki KY, Poleti MD, Felício AM, Koltes JE, Coutinho LL. Integrative analysis of microRNAs and mRNAs revealed regulation of composition and metabolism in Nelore cattle. BMC Genomics 2018;19:126. [PMID: 29415651 PMCID: PMC5804041 DOI: 10.1186/s12864-018-4514-3] [Citation(s) in RCA: 28] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2016] [Accepted: 01/31/2018] [Indexed: 01/01/2023] Open

Abstract

BACKGROUND

The amount of intramuscular fat can influence the sensory characteristics and nutritional value of beef, thus the selection of animals with adequate fat deposition is important to the consumer. There is growing knowledge about the genes and pathways that control the biological processes involved in fat deposition in muscle. MicroRNAs (miRNAs) belong to a well-conserved class of non-coding small RNAs that modulate gene expression across a range of biological functions in animal development and physiology. The aim of this study was to identify differentially expressed (DE) miRNAs, regulatory candidate genes and co-expression networks related to intramuscular fat (IMF) deposition. To achieve this, we used mRNA and miRNA expression data from the Longissimus dorsi muscle of 30 Nelore steers with high (H) and low (L) genomic estimated breeding values (GEBV) for IMF deposition.

RESULTS

Differential miRNA expression analysis between animals with extreme GEBV values for IMF identified six DE miRNAs (FDR 10%). Functional annotation of the target genes for these microRNAs indicated that the PPARs signaling pathway is involved with IMF deposition. Candidate regulatory genes such as SDHAF4, FBXO17, ALDOA and PKM were identified by partial correlation with information theory (PCIT), phenotypic impact factor (PIF) and regulatory impact factor (RIF) co-expression approaches from integrated miRNA-mRNA expression data. Two DE miRNAs (FDR 10%), bta-miR-143 and bta-miR-146b, which were upregulated in the Low IMF group, were correlated with regulatory candidate genes, which were functionally enriched for fatty acid oxidation GO terms. Co-expression patterns obtained by weighted correlation network analysis (WGCNA), which showed possible interaction and regulation between mRNAs and miRNAs, identified several modules related to immune system function, protein metabolism, energy metabolism and glucose catabolism according to in silico analysis performed herein.

CONCLUSION

In this study, several genes and miRNAs were identified as candidate regulators of IMF by analyzing DE miRNAs using two different miRNA-mRNA co-expression network methods. This study contributes to the understanding of potential regulatory mechanisms of gene signaling networks involved in fat deposition processes measured in muscle. Glucose metabolism and inflammation processes were the main pathways found in silico to influence intramuscular fat deposition in beef cattle in the integrative mRNA-miRNA co-expression analysis.

Collapse

Bottje W, Kong BW, Reverter A, Waardenberg AJ, Lassiter K, Hudson NJ. Progesterone signalling in broiler skeletal muscle is associated with divergent feed efficiency. BMC SYSTEMS BIOLOGY 2017;11:29. [PMID: 28235404 PMCID: PMC5324283 DOI: 10.1186/s12918-017-0396-2] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/12/2016] [Accepted: 01/16/2017] [Indexed: 01/08/2023]

Abstract

Background

We contrast the pectoralis muscle transcriptomes of broilers selected from within a single genetic line expressing divergent feed efficiency (FE) in an effort to improve our understanding of the mechanistic basis of FE.

Results

Application of a virtual muscle model to gene expression data pointed to a coordinated reduction in slow twitch muscle isoforms of the contractile apparatus (MYH15, TPM3, MYOZ2, TNNI1, MYL2, MYOM3, CSRP3, TNNT2), consistent with diminishment in associated slow machinery (myoglobin and phospholamban) in the high FE animals. These data are in line with the repeated transition from red slow to white fast muscle fibres observed in agricultural species selected on mass and FE. Surprisingly, we found that the expression of 699 genes encoding the broiler mitoproteome is modestly–but significantly–biased towards the high FE group, suggesting a slightly elevated mitochondrial content. This is contrary to expectation based on the slow muscle isoform data and theoretical physiological capacity arguments. Reassuringly, the extreme 40 most DE genes can successfully cluster the 12 individuals into the appropriate FE treatment group. Functional groups contained in this DE gene list include metabolic proteins (including opposing patterns of CA3 and CA4), mitochondrial proteins (CKMT1A), oxidative status (SEPP1, HIG2A) and cholesterol homeostasis (APOA1, INSIG1). We applied a differential network method (Regulatory Impact Factors) whose aim is to use patterns of differential co-expression to detect regulatory molecules transcriptionally rewired between the groups. This analysis clearly points to alterations in progesterone signalling (via the receptor PGR) as the major driver. We show the progesterone receptor localises to the mitochondria in a quail muscle cell line.

Conclusions

Progesterone is sometimes used in the cattle industry in exogenous hormone mixes that lead to a ~20% increase in FE. Because the progesterone receptor can localise to avian mitochondria, our data continue to point to muscle mitochondrial metabolism as an important component of the phenotypic expression of variation in broiler FE.

Electronic supplementary material

The online version of this article (doi:10.1186/s12918-017-0396-2) contains supplementary material, which is available to authorized users.

Collapse

Wang D, Wang J, Jiang Y, Liang Y, Xu D. BFDCA: A Comprehensive Tool of Using Bayes Factor for Differential Co-Expression Analysis. J Mol Biol 2016;429:446-453. [PMID: 27984044 DOI: 10.1016/j.jmb.2016.10.030] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2016] [Revised: 10/22/2016] [Accepted: 10/23/2016] [Indexed: 10/20/2022]

Differential Coexpression Analysis Reveals Extensive Rewiring of Arabidopsis Gene Coexpression in Response to Pseudomonas syringae Infection. Sci Rep 2016;6:35064. [PMID: 27721457 PMCID: PMC5056366 DOI: 10.1038/srep35064] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2016] [Accepted: 09/23/2016] [Indexed: 01/21/2023] Open

Giorgi FM, Lopez G, Woo JH, Bisikirska B, Califano A, Bansal M. Inferring protein modulation from gene expression data using conditional mutual information. PLoS One 2014;9:e109569. [PMID: 25314274 PMCID: PMC4196905 DOI: 10.1371/journal.pone.0109569] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2014] [Accepted: 09/12/2014] [Indexed: 01/18/2023] Open

Affiliation(s)

Federico M. Giorgi Department of Systems Biology, Columbia University, New York, New York, United States of America Center for Computational Biology and Bioinformatics, Columbia University, New York, New York, United States of America
Gonzalo Lopez Department of Systems Biology, Columbia University, New York, New York, United States of America Center for Computational Biology and Bioinformatics, Columbia University, New York, New York, United States of America
Jung H. Woo Department of Systems Biology, Columbia University, New York, New York, United States of America Center for Computational Biology and Bioinformatics, Columbia University, New York, New York, United States of America
Brygida Bisikirska Department of Systems Biology, Columbia University, New York, New York, United States of America Center for Computational Biology and Bioinformatics, Columbia University, New York, New York, United States of America
Andrea Califano Department of Systems Biology, Columbia University, New York, New York, United States of America Center for Computational Biology and Bioinformatics, Columbia University, New York, New York, United States of America Columbia Genome Center, High Throughput Screening facility, Columbia University, New York, New York, United States of America Department of Biomedical Informatics, Columbia University, New York, New York, United States of America Department of Biochemistry and Molecular Biophysics, Columbia University, New York, New York, United States of America Institute for Cancer Genetics, Columbia University, New York, New York, United States of America Herbert Irving Comprehensive Cancer Center, Columbia University, New York, New York, United States of America * E-mail: (AC); (MB)
Mukesh Bansal Department of Systems Biology, Columbia University, New York, New York, United States of America Center for Computational Biology and Bioinformatics, Columbia University, New York, New York, United States of America * E-mail: (AC); (MB)

Collapse

Wang HQ, Tsai CJ. CorSig: a general framework for estimating statistical significance of correlation and its application to gene co-expression analysis. PLoS One 2013;8:e77429. [PMID: 24194884 PMCID: PMC3806744 DOI: 10.1371/journal.pone.0077429] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2013] [Accepted: 09/02/2013] [Indexed: 11/19/2022] Open

Abstract

UNLABELLED

With the rapid increase of omics data, correlation analysis has become an indispensable tool for inferring meaningful associations from a large number of observations. Pearson correlation coefficient (PCC) and its variants are widely used for such purposes. However, it remains challenging to test whether an observed association is reliable both statistically and biologically. We present here a new method, CorSig, for statistical inference of correlation significance. CorSig is based on a biology-informed null hypothesis, i.e., testing whether the true PCC (ρ) between two variables is statistically larger than a user-specified PCC cutoff (τ), as opposed to the simple null hypothesis of ρ = 0 in existing methods, i.e., testing whether an association can be declared without a threshold. CorSig incorporates Fisher's Z transformation of the observed PCC (r), which facilitates use of standard techniques for p-value computation and multiple testing corrections. We compared CorSig against two methods: one uses a minimum PCC cutoff while the other (Zhu's procedure) controls correlation strength and statistical significance in two discrete steps. CorSig consistently outperformed these methods in various simulation data scenarios by balancing between false positives and false negatives. When tested on real-world Populus microarray data, CorSig effectively identified co-expressed genes in the flavonoid pathway, and discriminated between closely related gene family members for their differential association with flavonoid and lignin pathways. The p-values obtained by CorSig can be used as a stand-alone parameter for stratification of co-expressed genes according to their correlation strength in lieu of an arbitrary cutoff. CorSig requires one single tunable parameter, and can be readily extended to other correlation measures. Thus, CorSig should be useful for a wide range of applications, particularly for network analysis of high-dimensional genomic data.

SOFTWARE AVAILABILITY

A web server for CorSig is provided at http://202.127.200.1:8080/probeWeb. R code for CorSig is freely available for non-commercial use at http://aspendb.uga.edu/downloads.

Collapse

Giorgi FM, Del Fabbro C, Licausi F. Comparative study of RNA-seq- and microarray-derived coexpression networks in Arabidopsis thaliana. ACTA ACUST UNITED AC 2013;29:717-24. [PMID: 23376351 DOI: 10.1093/bioinformatics/btt053] [Citation(s) in RCA: 69] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]

Vasilevski A, Giorgi FM, Bertinetti L, Usadel B. LASSO modeling of the Arabidopsis thaliana seed/seedling transcriptome: a model case for detection of novel mucilage and pectin metabolism genes. MOLECULAR BIOSYSTEMS 2013;8:2566-74. [PMID: 22735692 DOI: 10.1039/c2mb25096a] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]

Yu S, Zheng L, Li Y, Li C, Ma C, Yu Y, Li X, Hao P. Causal co-expression method with module analysis to screen drugs with specific target. Gene 2012;518:145-51. [PMID: 23266800 DOI: 10.1016/j.gene.2012.11.051] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2012] [Accepted: 11/27/2012] [Indexed: 01/19/2023]

Abstract

The considerable increase of investment in research and development by the pharmaceutical industry over the past three decades has not added the number of approved new drugs. An important issue ignored by drug discovery practice is the multi-dimensional interaction network between drugs and their targets. Thus, it is essential to view drug actions through the lens of network biology. In the current study, based on the co-expression network of transcription factors and their downstream genes, we proposed a novel approach, called causal co-expression method with module analysis, to screen drugs with specific target and fewer side effects. We presented a causal co-expression method with module analysis and it could be used in analyzing the microarray data of different drug candidates. At first, the differential wiring value (DW) was calculated to find some causal transcription factors (TFs) by combining with differential expression genes in the regulated networks. After the discovery of the causal TFs, co-expression module analysis method was applied to mine molecular pharmacology pathways around these causal TFs at molecular level. We applied our methods to two drug candidates, Argyrin A and Bortezomib, both with anti-cancer activities. We first obtained some differentially expressed transcription factors of cells treated with Argyrin A or Bortezomib. Nearly all these transcription factors are associated with the tumor suppressor protein p27kip1. Furthermore, module analysis showed that Bortezomib inhibited tumor growth not specifically by cell cycle and cell proliferation pathway, but through many basic metabolic processes which result in cell toxicity. In contrast, Argyrin A had influence on cell cycle, and was involved in DNA damage repair at the same time, showing that Argyrin A was a more suitable drug for anti-cancer treatment. Our study revealed that the causal co-expression method with module analysis was effective and can be used as a tool to evaluate drug candidates.

Collapse

Hempel S, Koseska A, Nikoloski Z, Kurths J. Unraveling gene regulatory networks from time-resolved gene expression data - a measures comparison study. BMC Bioinformatics 2011;12:292. [PMID: 21771321 PMCID: PMC3161045 DOI: 10.1186/1471-2105-12-292] [Citation(s) in RCA: 37] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2011] [Accepted: 07/19/2011] [Indexed: 11/25/2022] Open

Abstract

BACKGROUND

Inferring regulatory interactions between genes from transcriptomics time-resolved data, yielding reverse engineered gene regulatory networks, is of paramount importance to systems biology and bioinformatics studies. Accurate methods to address this problem can ultimately provide a deeper insight into the complexity, behavior, and functions of the underlying biological systems. However, the large number of interacting genes coupled with short and often noisy time-resolved read-outs of the system renders the reverse engineering a challenging task. Therefore, the development and assessment of methods which are computationally efficient, robust against noise, applicable to short time series data, and preferably capable of reconstructing the directionality of the regulatory interactions remains a pressing research problem with valuable applications.

RESULTS

Here we perform the largest systematic analysis of a set of similarity measures and scoring schemes within the scope of the relevance network approach which are commonly used for gene regulatory network reconstruction from time series data. In addition, we define and analyze several novel measures and schemes which are particularly suitable for short transcriptomics time series. We also compare the considered 21 measures and 6 scoring schemes according to their ability to correctly reconstruct such networks from short time series data by calculating summary statistics based on the corresponding specificity and sensitivity. Our results demonstrate that rank and symbol based measures have the highest performance in inferring regulatory interactions. In addition, the proposed scoring scheme by asymmetric weighting has shown to be valuable in reducing the number of false positive interactions. On the other hand, Granger causality as well as information-theoretic measures, frequently used in inference of regulatory networks, show low performance on the short time series analyzed in this study.

CONCLUSIONS

Our study is intended to serve as a guide for choosing a particular combination of similarity measures and scoring schemes suitable for reconstruction of gene regulatory networks from short time series data. We show that further improvement of algorithms for reverse engineering can be obtained if one considers measures that are rooted in the study of symbolic dynamics or ranks, in contrast to the application of common similarity measures which do not consider the temporal character of the employed data. Moreover, we establish that the asymmetric weighting scoring scheme together with symbol based measures (for low noise level) and rank based measures (for high noise level) are the most suitable choices.

Collapse

Wang XD, Qi YX, Jiang ZL. Reconstruction of transcriptional network from microarray data using combined mutual information and network-assisted regression. IET Syst Biol 2011;5:95-102. [PMID: 21405197 DOI: 10.1049/iet-syb.2010.0041] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open

Tan M, Alshalalfa M, Alhajj R, Polat F. Influence of prior knowledge in constraint-based learning of gene regulatory networks. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2011;8:130-142. [PMID: 21071802 DOI: 10.1109/tcbb.2009.58] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/30/2023]

Yano K. Gene expression correlation analysis predicts involvement of high- and low-confidence risk genes in different stages of prostate carcinogenesis. Prostate 2010;70:1746-59. [PMID: 20564324 DOI: 10.1002/pros.21210] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 01/06/2023]

Ruepp A, Waegele B, Lechner M, Brauner B, Dunger-Kaltenbach I, Fobo G, Frishman G, Montrone C, Mewes HW. CORUM: the comprehensive resource of mammalian protein complexes--2009. Nucleic Acids Res 2009;38:D497-501. [PMID: 19884131 PMCID: PMC2808912 DOI: 10.1093/nar/gkp914] [Citation(s) in RCA: 509] [Impact Index Per Article: 33.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022] Open

Inferring the transcriptional landscape of bovine skeletal muscle by integrating co-expression networks. PLoS One 2009;4:e7249. [PMID: 19794913 PMCID: PMC2749936 DOI: 10.1371/journal.pone.0007249] [Citation(s) in RCA: 53] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2009] [Accepted: 08/31/2009] [Indexed: 11/19/2022] Open

Abstract

BACKGROUND

Despite modern technologies and novel computational approaches, decoding causal transcriptional regulation remains challenging. This is particularly true for less well studied organisms and when only gene expression data is available. In muscle a small number of well characterised transcription factors are proposed to regulate development. Therefore, muscle appears to be a tractable system for proposing new computational approaches.

METHODOLOGY/PRINCIPAL FINDINGS

Here we report a simple algorithm that asks "which transcriptional regulator has the highest average absolute co-expression correlation to the genes in a co-expression module?" It correctly infers a number of known causal regulators of fundamental biological processes, including cell cycle activity (E2F1), glycolysis (HLF), mitochondrial transcription (TFB2M), adipogenesis (PIAS1), neuronal development (TLX3), immune function (IRF1) and vasculogenesis (SOX17), within a skeletal muscle context. However, none of the canonical pro-myogenic transcription factors (MYOD1, MYOG, MYF5, MYF6 and MEF2C) were linked to muscle structural gene expression modules. Co-expression values were computed using developing bovine muscle from 60 days post conception (early foetal) to 30 months post natal (adulthood) for two breeds of cattle, in addition to a nutritional comparison with a third breed. A number of transcriptional landscapes were constructed and integrated into an always correlated landscape. One notable feature was a 'metabolic axis' formed from glycolysis genes at one end, nuclear-encoded mitochondrial protein genes at the other, and centrally tethered by mitochondrially-encoded mitochondrial protein genes.

CONCLUSIONS/SIGNIFICANCE

The new module-to-regulator algorithm complements our recently described Regulatory Impact Factor analysis. Together with a simple examination of a co-expression module's contents, these three gene expression approaches are starting to illuminate the in vivo transcriptional regulation of skeletal muscle development.

Collapse

He F, Balling R, Zeng AP. Reverse engineering and verification of gene networks: principles, assumptions, and limitations of present methods and future perspectives. J Biotechnol 2009;144:190-203. [PMID: 19631244 DOI: 10.1016/j.jbiotec.2009.07.013] [Citation(s) in RCA: 54] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2009] [Revised: 07/13/2009] [Accepted: 07/16/2009] [Indexed: 12/21/2022]

Michoel T, De Smet R, Joshi A, Van de Peer Y, Marchal K. Comparative analysis of module-based versus direct methods for reverse-engineering transcriptional regulatory networks. BMC SYSTEMS BIOLOGY 2009;3:49. [PMID: 19422680 PMCID: PMC2684101 DOI: 10.1186/1752-0509-3-49] [Citation(s) in RCA: 51] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/21/2008] [Accepted: 05/07/2009] [Indexed: 12/20/2022]

Abstract

BACKGROUND

A myriad of methods to reverse-engineer transcriptional regulatory networks have been developed in recent years. Direct methods directly reconstruct a network of pairwise regulatory interactions while module-based methods predict a set of regulators for modules of coexpressed genes treated as a single unit. To date, there has been no systematic comparison of the relative strengths and weaknesses of both types of methods.

RESULTS

We have compared a recently developed module-based algorithm, LeMoNe (Learning Module Networks), to a mutual information based direct algorithm, CLR (Context Likelihood of Relatedness), using benchmark expression data and databases of known transcriptional regulatory interactions for Escherichia coli and Saccharomyces cerevisiae. A global comparison using recall versus precision curves hides the topologically distinct nature of the inferred networks and is not informative about the specific subtasks for which each method is most suited. Analysis of the degree distributions and a regulator specific comparison show that CLR is 'regulator-centric', making true predictions for a higher number of regulators, while LeMoNe is 'target-centric', recovering a higher number of known targets for fewer regulators, with limited overlap in the predicted interactions between both methods. Detailed biological examples in E. coli and S. cerevisiae are used to illustrate these differences and to prove that each method is able to infer parts of the network where the other fails. Biological validation of the inferred networks cautions against over-interpreting recall and precision values computed using incomplete reference networks.

CONCLUSION

Our results indicate that module-based and direct methods retrieve largely distinct parts of the underlying transcriptional regulatory networks. The choice of algorithm should therefore be based on the particular biological problem of interest and not on global metrics which cannot be transferred between organisms. The development of sound statistical methods for integrating the predictions of different reverse-engineering strategies emerges as an important challenge for future research.

Collapse

Hudson NJ, Reverter A, Dalrymple BP. A differential wiring analysis of expression data correctly identifies the gene containing the causal mutation. PLoS Comput Biol 2009;5:e1000382. [PMID: 19412532 PMCID: PMC2671163 DOI: 10.1371/journal.pcbi.1000382] [Citation(s) in RCA: 148] [Impact Index Per Article: 9.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2008] [Accepted: 04/01/2009] [Indexed: 11/18/2022] Open

Abstract

Transcription factor (TF) regulation is often post-translational. TF modifications such as reversible phosphorylation and missense mutations, which can act independent of TF expression level, are overlooked by differential expression analysis. Using bovine Piedmontese myostatin mutants as proof-of-concept, we propose a new algorithm that correctly identifies the gene containing the causal mutation from microarray data alone. The myostatin mutation releases the brakes on Piedmontese muscle growth by translating a dysfunctional protein. Compared to a less muscular non-mutant breed we find that myostatin is not differentially expressed at any of ten developmental time points. Despite this challenge, the algorithm identifies the myostatin ‘smoking gun’ through a coordinated, simultaneous, weighted integration of three sources of microarray information: transcript abundance, differential expression, and differential wiring. By asking the novel question “which regulator is cumulatively most differentially wired to the abundant most differentially expressed genes?” it yields the correct answer, “myostatin”. Our new approach identifies causal regulatory changes by globally contrasting co-expression network dynamics. The entirely data-driven ‘weighting’ procedure emphasises regulatory movement relative to the phenotypically relevant part of the network. In contrast to other published methods that compare co-expression networks, significance testing is not used to eliminate connections.

Evolution, development, and cancer are governed by regulatory circuits where the central nodes are transcription factors. Consequently, there is great interest in methods that can identify the causal mutation/perturbation responsible for any circuit rewiring. The most widely available high-throughput technology, the microarray, assays the transcriptome. However, many regulatory perturbations are post-transcriptional. This means that they are overlooked by traditional differential gene expression analysis. We hypothesised that by viewing biological systems as networks one could identify causal mutations and perturbations by examining those regulators whose position in the network changes the most. Using muscular myostatin mutant cattle as a proof-of-concept, we propose an analysis that succeeds based solely on microarray expression data from just 27 animals. Our analysis differs from competing network approaches in that we do not use significance testing to eliminate connections. All connections are contrasted, no matter how weak. Further, the identity of target genes is maintained throughout the analysis. Finally, the analysis is ‘weighted’ such that movement relative to the phenotypically most relevant part of the network is emphasised. By identifying the question to which myostatin is the answer, we present a comparison of network connectivity that is potentially generalisable.

Collapse

Reverter A, Chan EKF. Combining partial correlation and an information theory approach to the reversed engineering of gene co-expression networks. ACTA ACUST UNITED AC 2008;24:2491-7. [PMID: 18784117 DOI: 10.1093/bioinformatics/btn482] [Citation(s) in RCA: 215] [Impact Index Per Article: 13.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022]