201
|
Transcription factors for predictive plant metabolic engineering: are we there yet? Curr Opin Biotechnol 2008; 19:138-44. [PMID: 18374558 DOI: 10.1016/j.copbio.2008.02.002] [Citation(s) in RCA: 88] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2007] [Revised: 02/06/2008] [Accepted: 02/12/2008] [Indexed: 01/21/2023]
Abstract
Transcription factors (TFs) are considered viable alternatives to 'single enzyme' approaches for the manipulation of plant metabolic pathways. Because of the ability to control multiple, if not all steps in a particular metabolic pathway, TFs provide attractive tools for overcoming flux bottlenecks involving multiple enzymatic steps, or for deploying pathway genes in specific organs, cell types or even plants where they normally do not express. The potential of a TF for the predictive manipulation of plant metabolism is intimately linked to understanding how it fits in the gene regulatory organization. The knowledge gained over the past decade on how plant pathways are controlled together with increasing efforts aimed at deciphering the overall architecture of plant gene regulatory networks are starting to realize the potential of TFs for predictive plant metabolic engineering.
Collapse
|
202
|
Yu H, Braun P, Yildirim MA, Lemmens I, Venkatesan K, Sahalie J, Hirozane-Kishikawa T, Gebreab F, Li N, Simonis N, Hao T, Rual JF, Dricot A, Vazquez A, Murray RR, Simon C, Tardivo L, Tam S, Svrzikapa N, Fan C, de Smet AS, Motyl A, Hudson ME, Park J, Xin X, Cusick ME, Moore T, Boone C, Snyder M, Roth FP, Barabási AL, Tavernier J, Hill DE, Vidal M. High-quality binary protein interaction map of the yeast interactome network. Science 2008; 322:104-10. [PMID: 18719252 DOI: 10.1126/science.1158684] [Citation(s) in RCA: 962] [Impact Index Per Article: 56.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
Current yeast interactome network maps contain several hundred molecular complexes with limited and somewhat controversial representation of direct binary interactions. We carried out a comparative quality assessment of current yeast interactome data sets, demonstrating that high-throughput yeast two-hybrid (Y2H) screening provides high-quality binary interaction information. Because a large fraction of the yeast binary interactome remains to be mapped, we developed an empirically controlled mapping framework to produce a "second-generation" high-quality, high-throughput Y2H data set covering approximately 20% of all yeast binary interactions. Both Y2H and affinity purification followed by mass spectrometry (AP/MS) data are of equally high quality but of a fundamentally different and complementary nature, resulting in networks with different topological and biological properties. Compared to co-complex interactome models, this binary map is enriched for transient signaling interactions and intercomplex connections with a highly significant clustering between essential proteins. Rather than correlating with essentiality, protein connectivity correlates with genetic pleiotropy.
Collapse
Affiliation(s)
- Haiyuan Yu
- Center for Cancer Systems Biology (CCSB), Dana-Farber Cancer Institute, Boston, MA 02115, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
203
|
Shalgi R, Lieber D, Oren M, Pilpel Y. Global and local architecture of the mammalian microRNA-transcription factor regulatory network. PLoS Comput Biol 2008; 3:e131. [PMID: 17630826 PMCID: PMC1914371 DOI: 10.1371/journal.pcbi.0030131] [Citation(s) in RCA: 385] [Impact Index Per Article: 22.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/27/2006] [Accepted: 05/22/2007] [Indexed: 01/07/2023] Open
Abstract
microRNAs (miRs) are small RNAs that regulate gene expression at the posttranscriptional level. It is anticipated that, in combination with transcription factors (TFs), they span a regulatory network that controls thousands of mammalian genes. Here we set out to uncover local and global architectural features of the mammalian miR regulatory network. Using evolutionarily conserved potential binding sites of miRs in human targets, and conserved binding sites of TFs in promoters, we uncovered two regulation networks. The first depicts combinatorial interactions between pairs of miRs with many shared targets. The network reveals several levels of hierarchy, whereby a few miRs interact with many other lowly connected miR partners. We revealed hundreds of “target hubs” genes, each potentially subject to massive regulation by dozens of miRs. Interestingly, many of these target hub genes are transcription regulators and they are often related to various developmental processes. The second network consists of miR–TF pairs that coregulate large sets of common targets. We discovered that the network consists of several recurring motifs. Most notably, in a significant fraction of the miR–TF coregulators the TF appears to regulate the miR, or to be regulated by the miR, forming a diversity of feed-forward loops. Together these findings provide new insights on the architecture of the combined transcriptional–post transcriptional regulatory network. It is becoming increasingly appreciated that a new type of gene which does not code for proteins, the regulatory RNAs, constitutes a considerable portion of mammalian genomes, and these genes serve as key players in the regulatory network of living cells. Among these regulatory RNAs are the microRNAs (miRs), small RNAs that mediate posttranscriptional gene silencing through inhibition of protein production or degradation of mRNAs. So far little is known about the extent of regulation by miRs, and their potential cooperation with other regulatory layers in the network. We investigated the potential crosstalk between the miR-mediated posttranscription layer, and the transcriptional regulation layer, whose dominant players, the transcription factors (TFs), regulate the production of protein-coding mRNAs. We found that the extent of miR regulation varies extensively among different genes, some of which, especially those who serve as regulators themselves, are subject to enhanced miR silencing. Further, we identified thousands of genes that are potentially subjected to coordinated regulation by multiple miRs and by specific combinations of TFs and miRs. The regulatory network, comprising transcriptional and posttranscriptional regulation, manifests several recurring architectures, one of which consists of a TF and a miR that together regulate a large set of common genes, and that also appear to regulate one another. Altogether this work provides new insights into the logic and evolution of a new regulatory layer of the mammalian genome, and its effect on other regulatory networks in the cell.
Collapse
Affiliation(s)
- Reut Shalgi
- Department of Molecular Genetics, Weizmann Institute of Science, Rehovot, Israel
- Department of Molecular Cell Biology, Weizmann Institute of Science, Rehovot, Israel
| | - Daniel Lieber
- Department of Molecular Genetics, Weizmann Institute of Science, Rehovot, Israel
| | - Moshe Oren
- Department of Molecular Cell Biology, Weizmann Institute of Science, Rehovot, Israel
| | - Yitzhak Pilpel
- Department of Molecular Genetics, Weizmann Institute of Science, Rehovot, Israel
- * To whom correspondence should be addressed. E-mail:
| |
Collapse
|
204
|
Karimpour-Fard A, Leach SM, Hunter LE, Gill RT. The topology of the bacterial co-conserved protein network and its implications for predicting protein function. BMC Genomics 2008; 9:313. [PMID: 18590549 PMCID: PMC2488357 DOI: 10.1186/1471-2164-9-313] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2008] [Accepted: 06/30/2008] [Indexed: 11/12/2022] Open
Abstract
Background Protein-protein interactions networks are most often generated from physical protein-protein interaction data. Co-conservation, also known as phylogenetic profiles, is an alternative source of information for generating protein interaction networks. Co-conservation methods generate interaction networks among proteins that are gained or lost together through evolution. Co-conservation is a particularly useful technique in the compact bacteria genomes. Prior studies in yeast suggest that the topology of protein-protein interaction networks generated from physical interaction assays can offer important insight into protein function. Here, we hypothesize that in bacteria, the topology of protein interaction networks derived via co-conservation information could similarly improve methods for predicting protein function. Since the topology of bacteria co-conservation protein-protein interaction networks has not previously been studied in depth, we first perform such an analysis for co-conservation networks in E. coli K12. Next, we demonstrate one way in which network connectivity measures and global and local function distribution can be exploited to predict protein function for previously uncharacterized proteins. Results Our results showed, like most biological networks, our bacteria co-conserved protein-protein interaction networks had scale-free topologies. Our results indicated that some properties of the physical yeast interaction network hold in our bacteria co-conservation networks, such as high connectivity for essential proteins. However, the high connectivity among protein complexes in the yeast physical network was not seen in the co-conservation network which uses all bacteria as the reference set. We found that the distribution of node connectivity varied by functional category and could be informative for function prediction. By integrating of functional information from different annotation sources and using the network topology, we were able to infer function for uncharacterized proteins. Conclusion Interactions networks based on co-conservation can contain information distinct from networks based on physical or other interaction types. Our study has shown co-conservation based networks to exhibit a scale free topology, as expected for biological networks. We also revealed ways that connectivity in our networks can be informative for the functional characterization of proteins.
Collapse
Affiliation(s)
- Anis Karimpour-Fard
- Center for Computational Pharmacology, University of Colorado School of Medicine, Aurora, Colorado 80045, USA.
| | | | | | | |
Collapse
|
205
|
Marr C, Geertz M, Hütt MT, Muskhelishvili G. Dissecting the logical types of network control in gene expression profiles. BMC SYSTEMS BIOLOGY 2008; 2:18. [PMID: 18284674 PMCID: PMC2263018 DOI: 10.1186/1752-0509-2-18] [Citation(s) in RCA: 75] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/25/2007] [Accepted: 02/19/2008] [Indexed: 11/19/2022]
Abstract
Background In the bacterium Escherichia coli the transcriptional regulation of gene expression involves both dedicated regulators binding specific DNA sites with high affinity and also global regulators – abundant DNA architectural proteins of the bacterial nucleoid binding multiple sites with a wide range of affinities and thus modulating the superhelical density of DNA. The first form of transcriptional regulation is predominantly pairwise and specific, representing digitial control, while the second form is (in strength and distribution) continuous, representing analog control. Results Here we look at the properties of effective networks derived from significant gene expression changes under variation of the two forms of control and find that upon limitations of one type of control (caused e.g. by mutation of a global DNA architectural factor) the other type can compensate for compromised regulation. Mutations of global regulators significantly enhance the digital control, whereas in the presence of global DNA architectural proteins regulation is mostly of the analog type, coupling spatially neighboring genomic loci. Taken together our data suggest that two logically distinct – digital and analog – types of control are balancing each other. Conclusion By revealing two distinct logical types of control, our approach provides basic insights into both the organizational principles of transcriptional regulation and the mechanisms buffering genetic flexibility. We anticipate that the general concept of distinguishing logical types of control will apply to many complex biological networks.
Collapse
Affiliation(s)
- Carsten Marr
- Computational Systems Biology Group, Jacobs University, Campus Ring 1, 28759 Bremen, Germany.
| | | | | | | |
Collapse
|
206
|
Codoñer FM, Fares MA. Why should we care about molecular coevolution? Evol Bioinform Online 2008; 4:29-38. [PMID: 19204805 PMCID: PMC2614197] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022] Open
Abstract
Non-independent evolution of amino acid sites has become a noticeable limitation of most methods aimed at identifying selective constraints at functionally important amino acid sites or protein regions. The need for a generalised framework to account for non-independence of amino acid sites has fuelled the design and development of new mathematical models and computational tools centred on resolving this problem. Molecular coevolution is one of the most active areas of research, with an increasing rate of new models and methods being developed everyday. Both parametric and non-parametric methods have been developed to account for correlated variability of amino acid sites. These methods have been utilised for detecting phylogenetic, functional and structural coevolution as well as to identify surfaces of amino acid sites involved in protein-protein interactions. Here we discuss and briefly describe these methods, and identify their advantages and limitations.
Collapse
Affiliation(s)
- Francisco M. Codoñer
- Evolutionary Genetics and Bioinformatics Laboratory, Department of Genetics, Smurfit Institute of Genetics, University of Dublin, Trinity College, Institute of Immunology, Biology Department, National University of Ireland Maynooth
| | - Mario A. Fares
- Evolutionary Genetics and Bioinformatics Laboratory, Department of Genetics, Smurfit Institute of Genetics, University of Dublin, Trinity College,Correspondence:
| |
Collapse
|
207
|
Abstract
Non-independent evolution of amino acid sites has become a noticeable limitation of most methods aimed at identifying selective constraints at functionally important amino acid sites or protein regions. The need for a generalised framework to account for non-independence of amino acid sites has fuelled the design and development of new mathematical models and computational tools centred on resolving this problem. Molecular coevolution is one of the most active areas of research, with an increasing rate of new models and methods being developed everyday. Both parametric and non-parametric methods have been developed to account for correlated variability of amino acid sites. These methods have been utilised for detecting phylogenetic, functional and structural coevolution as well as to identify surfaces of amino acid sites involved in protein-protein interactions. Here we discuss and briefly describe these methods, and identify their advantages and limitations.
Collapse
Affiliation(s)
- Francisco M. Codoñer
- Evolutionary Genetics and Bioinformatics Laboratory, Department of Genetics, Smurfit Institute of Genetics, University of Dublin, Trinity College
- Institute of Immunology, Biology Department, National University of Ireland Maynooth
| | - Mario A. Fares
- Evolutionary Genetics and Bioinformatics Laboratory, Department of Genetics, Smurfit Institute of Genetics, University of Dublin, Trinity College
| |
Collapse
|
208
|
Jeong J, Berman P. On cycles in the transcription network of Saccharomyces cerevisiae. BMC SYSTEMS BIOLOGY 2008; 2:12. [PMID: 18237406 PMCID: PMC2275720 DOI: 10.1186/1752-0509-2-12] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/10/2007] [Accepted: 01/31/2008] [Indexed: 11/10/2022]
Abstract
BACKGROUND We investigate the cycles in the transcription network of Saccharomyces cerevisiae. Unlike a similar network of Escherichia coli, it contains many cycles. We characterize properties of these cycles and their place in the regulatory mechanism of the cell. RESULTS Almost all cycles in the transcription network of Saccharomyces cerevisiae are contained in a single strongly connected component, which we call LSCC (L for "largest"), except for a single cycle of two transcription factors. The fact that LSCC includes almost all cycles is well explained by the properties of a random graph with the same in- and out-degrees of the nodes. Among different physiological conditions, cell cycle has the most significant relationship with LSCC, as the set of 64 transcription interactions that are active in all phases of the cell cycle has overlap of 27 with the interactions of LSCC (of which there are 49).Conversely, if we remove the interactions that are active in all phases of the cell cycle (25% of interactions to transcription factors), the LSCC would have only three nodes and 5 edges, many fewer than expected. This subgraph of the transcription network consists mostly of interactions that are active only in the stress response subnetwork. We also characterize the role of LSCC in the topology of the network. We show that LSCC can be used to define a natural hierarchy in the network and that in every physiological subnetwork LSCC plays a pivotal role. CONCLUSION Apart from those well-defined conditions, the transcription network of Saccharomyces cerevisiae is devoid of cycles. It was observed that two conditions that were studied and that have no cycles of their own are exogenous: diauxic shift and DNA repair, while cell cycle and sporulation are endogenous. We claim that in a certain sense (slow recovery) stress response is endogenous as well.
Collapse
Affiliation(s)
- Jieun Jeong
- Department of Computer Science and Engineering, The Pennsylvania State University, University Park, PA 16802, USA
| | - Piotr Berman
- Department of Computer Science and Engineering, The Pennsylvania State University, University Park, PA 16802, USA
| |
Collapse
|
209
|
Wang ZH, Liu QJ, Zhu YP. [Research on modular organization of gene regulatory network]. YI CHUAN = HEREDITAS 2008; 30:20-27. [PMID: 18244898 DOI: 10.3724/sp.j.1005.2008.00020] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/25/2023]
Abstract
Gene regulatory network is very important for researchers to understand biological processes and gene functions. It can deliver complex information about how could large amount of genes be regulated by transcriptional factors and translated into proteins which can carried out biological functions. Generally, knowledge of network topological structure and organization formation can be used to find the regulatory mechanism of genes in the regulatory network. It can illuminate the local characters of the network and reveal the constructing methods of regulatory network; moreover, it can also analyze regulatory pathway completely and systemically. Now, more and more researchers approbate the hierarchy structure of gene regulatory network: regulatory component, Motif, module and the whole network. Here, we discuss the middle two levels: motif and module. We compared various research results of network organization carried out in recent years, explicated their biology signification and pointed out the existing disadvantages and problems. According these problems, we also bring up some possible research trend. And at last, we discuss the prospect of gene regulatory network modular organization researching work.
Collapse
Affiliation(s)
- Zheng-Hua Wang
- National Laboratory For Parallel & Distributed Processing, National University of Deference and Technology, Changsha 410073, China.
| | | | | |
Collapse
|
210
|
Palotai R, Szalay MS, Csermely P. Chaperones as integrators of cellular networks: Changes of cellular integrity in stress and diseases. IUBMB Life 2007; 60:10-8. [DOI: 10.1002/iub.8] [Citation(s) in RCA: 49] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
|
211
|
Korcsmáros T, Kovács IA, Szalay MS, Csermely P. Molecular chaperones: the modular evolution of cellular networks. J Biosci 2007; 32:441-6. [PMID: 17536163 DOI: 10.1007/s12038-007-0043-y] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
Abstract
Molecular chaperones play a prominent role in signaling and transcriptional regulatory networks of the cell. Recent advances uncovered that chaperones act as genetic buffers stabilizing the phenotype of various cells and organisms and may serve as potential regulators of evolvability. Chaperones have weak links, connect hubs, are in the overlaps of network modules and may uncouple these modules during stress,which gives an additional protection for the cell at the network-level. Moreover,after stress chaperones are essential to re-build inter-modular contacts by their low affinity sampling of the potential interaction partners in different modules. This opens the way to the chaperone-regulated modular evolution of cellular networks,and helps us to design novel therapeutic and anti-aging strategies.
Collapse
Affiliation(s)
- Tamás Korcsmáros
- Department of Medical Chemistry, Semmelweis University, Budapest, Hungary
| | | | | | | |
Collapse
|
212
|
Irons DJ, Monk NAM. Identifying dynamical modules from genetic regulatory systems: applications to the segment polarity network. BMC Bioinformatics 2007; 8:413. [PMID: 17961242 PMCID: PMC2233651 DOI: 10.1186/1471-2105-8-413] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2007] [Accepted: 10/25/2007] [Indexed: 11/30/2022] Open
Abstract
Background It is widely accepted that genetic regulatory systems are 'modular', in that the whole system is made up of smaller 'subsystems' corresponding to specific biological functions. Most attempts to identify modules in genetic regulatory systems have relied on the topology of the underlying network. However, it is the temporal activity (dynamics) of genes and proteins that corresponds to biological functions, and hence it is dynamics that we focus on here for identifying subsystems. Results Using Boolean network models as an exemplar, we present a new technique to identify subsystems, based on their dynamical properties. The main part of the method depends only on the stable dynamics (attractors) of the system, thus requiring no prior knowledge of the underlying network. However, knowledge of the logical relationships between the network components can be used to describe how each subsystem is regulated. To demonstrate its applicability to genetic regulatory systems, we apply the method to a model of the Drosophila segment polarity network, providing a detailed breakdown of the system. Conclusion We have designed a technique for decomposing any set of discrete-state, discrete-time attractors into subsystems. Having a suitable mathematical model also allows us to describe how each subsystem is regulated and how robust each subsystem is against perturbations. However, since the subsystems are found directly from the attractors, a mathematical model or underlying network topology is not necessarily required to identify them, potentially allowing the method to be applied directly to experimental expression data.
Collapse
Affiliation(s)
- David J Irons
- Department of Computer Science, University of Sheffield, UK.
| | | |
Collapse
|
213
|
Janga SC, Collado-Vides J. Structure and evolution of gene regulatory networks in microbial genomes. Res Microbiol 2007; 158:787-94. [PMID: 17996425 DOI: 10.1016/j.resmic.2007.09.001] [Citation(s) in RCA: 35] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2007] [Revised: 08/07/2007] [Accepted: 09/17/2007] [Indexed: 12/24/2022]
Abstract
With the availability of genome sequences for hundreds of microbial genomes, it has become possible to address several questions from a comparative perspective to understand the structure and function of regulatory systems, at least in model organisms. Recent studies have focused on topological properties and the evolution of regulatory networks and their components. Our understanding of natural networks is paving the way to embedding synthetic regulatory systems into organisms, allowing us to expand the natural diversity of living systems to an extent we had never before anticipated.
Collapse
Affiliation(s)
- Sarath Chandra Janga
- Program of Computational Genomics, CCG-UNAM, Apdo Postal 565-A, Cuernavaca, Morelos, 62100 Mexico.
| | | |
Collapse
|
214
|
Janga SC, Salgado H, Martínez-Antonio A, Collado-Vides J. Coordination logic of the sensing machinery in the transcriptional regulatory network of Escherichia coli. Nucleic Acids Res 2007; 35:6963-72. [PMID: 17933780 PMCID: PMC2175315 DOI: 10.1093/nar/gkm743] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/26/2023] Open
Abstract
The active and inactive state of transcription factors in growing cells is usually directed by allosteric physicochemical signals or metabolites, which are in turn either produced in the cell or obtained from the environment by the activity of the products of effector genes. To understand the regulatory dynamics and to improve our knowledge about how transcription factors (TFs) respond to endogenous and exogenous signals in the bacterial model, Escherichia coli, we previously proposed to classify TFs into external, internal and hybrid sensing classes depending on the source of their allosteric or equivalent metabolite. Here we analyze how a cell uses its topological structures in the context of sensing machinery and show that, while feed forward loops (FFLs) tightly integrate internal and external sensing TFs connecting TFs from different layers of the hierarchical transcriptional regulatory network (TRN), bifan motifs frequently connect TFs belonging to the same sensing class and could act as a bridge between TFs originating from the same level in the hierarchy. We observe that modules identified in the regulatory network of E. coli are heterogeneous in sensing context with a clear combination of internal and external sensing categories depending on the physiological role played by the module. We also note that propensity of two-component response regulators increases at promoters, as the number of TFs regulating a target operon increases. Finally we show that evolutionary families of TFs do not show a tendency to preserve their sensing abilities. Our results provide a detailed panorama of the topological structures of E. coli TRN and the way TFs they compose off, sense their surroundings by coordinating responses.
Collapse
Affiliation(s)
- Sarath Chandra Janga
- Programa de Genómica Computacional, Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, Cuernavaca, Morelos, 62100, México.
| | | | | | | |
Collapse
|
215
|
Markowetz F, Kostka D, Troyanskaya OG, Spang R. Nested effects models for high-dimensional phenotyping screens. ACTA ACUST UNITED AC 2007; 23:i305-12. [PMID: 17646311 DOI: 10.1093/bioinformatics/btm178] [Citation(s) in RCA: 87] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
MOTIVATION In high-dimensional phenotyping screens, a large number of cellular features is observed after perturbing genes by knockouts or RNA interference. Comprehensive analysis of perturbation effects is one of the most powerful techniques for attributing functions to genes, but not much work has been done so far to adapt statistical and computational methodology to the specific needs of large-scale and high-dimensional phenotyping screens. RESULTS We introduce and compare probabilistic methods to efficiently infer a genetic hierarchy from the nested structure of observed perturbation effects. These hierarchies elucidate the structures of signaling pathways and regulatory networks. Our methods achieve two goals: (1) they reveal clusters of genes with highly similar phenotypic profiles, and (2) they order (clusters of) genes according to subset relationships between phenotypes. We evaluate our algorithms in the controlled setting of simulation studies and show their practical use in two experimental scenarios: (1) a data set investigating the response to microbial challenge in Drosophila melanogaster, and (2) a compendium of expression profiles of Saccharomyces cerevisiae knockout strains. We show that our methods identify biologically justified genetic hierarchies of perturbation effects. AVAILABILITY The software used in our analysis is freely available in the R package 'nem' from www.bioconductor.org.
Collapse
Affiliation(s)
- Florian Markowetz
- Lewis-Sigler Institute for Integrative Genomics and Department of Computer Science, Princeton University, Princeton, NJ, 08544, USA
| | | | | | | |
Collapse
|
216
|
Abstract
The execution of complex biological processes requires the precise interaction and regulation of thousands of molecules. Systematic approaches to study large numbers of proteins, metabolites, and their modification have revealed complex molecular networks. These biological networks are significantly different from random networks and often exhibit ubiquitous properties in terms of their structure and organization. Analyzing these networks provides novel insights in understanding basic mechanisms controlling normal cellular processes and disease pathologies.
Collapse
Affiliation(s)
- Xiaowei Zhu
- Department of Molecular, Cellular, and Developmental Biology, Yale University, New Haven, Connecticut 06520, USA
| | | | | |
Collapse
|
217
|
Smith JJ, Ramsey SA, Marelli M, Marzolf B, Hwang D, Saleem RA, Rachubinski RA, Aitchison JD. Transcriptional responses to fatty acid are coordinated by combinatorial control. Mol Syst Biol 2007; 3:115. [PMID: 17551510 PMCID: PMC1911199 DOI: 10.1038/msb4100157] [Citation(s) in RCA: 56] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2006] [Accepted: 04/23/2007] [Indexed: 11/25/2022] Open
Abstract
In transcriptional regulatory networks, the coincident binding of a combination of factors to regulate a gene implies the existence of complex mechanisms to control both the gene expression profile and specificity of the response. Unraveling this complexity is a major challenge to biologists. Here, a novel network topology-based clustering approach was applied to condition-specific genome-wide chromatin localization and expression data to characterize a dynamic transcriptional regulatory network responsive to the fatty acid oleate. A network of four (predicted) regulators of the response (Oaf1p, Pip2p, Adr1p and Oaf3p) was investigated. By analyzing trends in the network structure, we found that two groups of multi-input motifs form in response to oleate, each controlling distinct functional classes of genes. This functionality is contributed in part by Oaf1p, which is a component of both types of multi-input motifs and has two different regulatory activities depending on its binding context. The dynamic cooperation between Oaf1p and Pip2p appears to temporally synchronize the two different responses. Together, these data suggest a network mechanism involving dynamic combinatorial control for coordinating transcriptional responses.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | - John D Aitchison
- Institute for Systems Biology, Seattle, WA, USA
- Department of Cell Biology, University of Alberta, Edmonton, Alberta, Canada
- Institute for Systems Biology, 1441 N 34th Street, Seattle, WA 98103-8904, USA. Tel.: +1 206 732 1344; Fax: +1 206 732 1299;
| |
Collapse
|
218
|
Abstract
SUMMARYChanges in gene expression underlie phenotypic plasticity, variation within species, and phenotypic divergence between species. These expression differences arise from modulation of regulatory networks. To understand the source of expression differences, networks of interactions among genes and gene products that orchestrate gene expression must be considered. Here I review the basic structure of eukaryotic regulatory networks and discuss selected case studies that provide insight into how these networks are altered to create expression differences within and between species.
Collapse
Affiliation(s)
- Patricia J Wittkopp
- Department of Ecology and Evolutionary Biology, Department of Molecular, Cellular, and Developmental Biology, University of Michigan, Ann Arbor, MI 48109, USA.
| |
Collapse
|
219
|
Cosentino Lagomarsino M, Jona P, Bassetti B, Isambert H. Hierarchy and feedback in the evolution of the Escherichia coli transcription network. Proc Natl Acad Sci U S A 2007; 104:5516-20. [PMID: 17372223 PMCID: PMC1838485 DOI: 10.1073/pnas.0609023104] [Citation(s) in RCA: 62] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
The Escherichia coli transcription network has an essentially feedforward structure, with abundant feedback at the level of self-regulations. Here, we investigate how these properties emerged during evolution. An assessment of the role of gene duplication based on protein domain architecture shows that (i) transcriptional autoregulators have mostly arisen through duplication, whereas (ii) the expected feedback loops stemming from their initial cross-regulation are strongly selected against. This requires a divergent coevolution of the transcription factor DNA-binding sites and their respective DNA cis-regulatory regions. Moreover, we find that the network tends to grow by expansion of the existing hierarchical layers of computation, rather than by addition of new layers. We also argue that rewiring of regulatory links due to mutation/selection of novel transcription factor/DNA binding interactions appears not to significantly affect the network global hierarchy, and that horizontally transferred genes are mainly added at the bottom, as new target nodes. These findings highlight the important evolutionary roles of both duplication and selective deletion of cross-talks between autoregulators in the emergence of the hierarchical transcription network of E. coli.
Collapse
Affiliation(s)
- M. Cosentino Lagomarsino
- *Unité Mixte de Recherche 168/Institut Curie, 26 rue d'Ulm, 75005 Paris, France
- Università degli Studi di Milano, Dipartimento di Fisica, Via Celoria 16, 20133 Milano, Italy
- To whom correspondence may be addressed. E-mail: or
| | - P. Jona
- Politecnico di Milano, Dipartimento di Fisica, Pza Leonardo Da Vinci 32, 20133 Milano, Italy; and
| | - B. Bassetti
- Università degli Studi di Milano, Dipartimento di Fisica, Via Celoria 16, 20133 Milano, Italy
- Istituto Nazionale di Fisica Nucleare, 20133 Milano, Italy
| | - H. Isambert
- *Unité Mixte de Recherche 168/Institut Curie, 26 rue d'Ulm, 75005 Paris, France
- To whom correspondence may be addressed. E-mail: or
| |
Collapse
|