1
|
Beccari L, Marco-Ferreres R, Bovolenta P. The logic of gene regulatory networks in early vertebrate forebrain patterning. Mech Dev 2012; 130:95-111. [PMID: 23111324 DOI: 10.1016/j.mod.2012.10.004] [Citation(s) in RCA: 45] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2012] [Accepted: 10/09/2012] [Indexed: 01/19/2023]
Abstract
The vertebrate forebrain or prosencephalon is patterned at the beginning of neurulation into four major domains: the telencephalic, hypothalamic, retinal and diencephalic anlagen. These domains will then give rise to the majority of the brain structures involved in sensory integration and the control of higher intellectual and homeostatic functions. Understanding how forebrain pattering arises has thus attracted the interest of developmental neurobiologists for decades. As a result, most of its regulators have been identified and their hierarchical relationship is now the object of active investigation. Here, we summarize the main morphogenetic pathways and transcription factors involved in forebrain specification and propose the backbone of a possible gene regulatory network (GRN) governing its specification, taking advantage of the GRN principles elaborated by pioneer studies in simpler organisms. We will also discuss this GRN and its operational logic in the context of the remarkable morphological and functional diversification that the forebrain has undergone during evolution.
Collapse
Affiliation(s)
- Leonardo Beccari
- Centro de Biología Molecular "Severo Ochoa", CSIC-UAM, c/Nicolas Cabrera, 1, Madrid 28049, Spain
| | | | | |
Collapse
|
2
|
When needles look like hay: how to find tissue-specific enhancers in model organism genomes. Dev Biol 2010; 350:239-54. [PMID: 21130761 DOI: 10.1016/j.ydbio.2010.11.026] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2010] [Revised: 11/11/2010] [Accepted: 11/22/2010] [Indexed: 01/22/2023]
Abstract
A major prerequisite for the investigation of tissue-specific processes is the identification of cis-regulatory elements. No generally applicable technique is available to distinguish them from any other type of genomic non-coding sequence. Therefore, researchers often have to identify these elements by elaborate in vivo screens, testing individual regions until the right one is found. Here, based on many examples from the literature, we summarize how functional enhancers have been isolated from other elements in the genome and how they have been characterized in transgenic animals. Covering computational and experimental studies, we provide an overview of the global properties of cis-regulatory elements, like their specific interactions with promoters and target gene distances. We describe conserved non-coding elements (CNEs) and their internal structure, nucleotide composition, binding site clustering and overlap, with a special focus on developmental enhancers. Conflicting data and unresolved questions on the nature of these elements are highlighted. Our comprehensive overview of the experimental shortcuts that have been found in the different model organism communities and the new field of high-throughput assays should help during the preparation phase of a screen for enhancers. The review is accompanied by a list of general guidelines for such a project.
Collapse
|
3
|
He X, Sinha S. Evolution of cis-regulatory sequences in Drosophila. Methods Mol Biol 2010; 674:283-296. [PMID: 20827599 DOI: 10.1007/978-1-60761-854-6_18] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/29/2023]
Abstract
Cross-species comparison is an emerging paradigm for identifying cis-regulatory sequences and understanding their function and evolution. In this chapter, we review probabilistic models of evolution of transcription factor binding sites, which provide the theoretical basis for a number of new bioinformatics tools for comparative sequence analysis. We illustrate how important functional and evolutionary insights on binding site gain and loss can be acquired through sequence comparison. This includes the observation that binding site turnover follows a molecular clock and that its rate correlates with the strength of binding sites and the presence of other sites in the neighborhood. We also comment on emerging trends that go beyond individual binding sites to a more holistic study of regulatory evolution. We point out common technical challenges, such as reliable sequence alignment and binding site prediction, when doing comparative regulatory sequence analysis and note some potential solutions thereof.
Collapse
Affiliation(s)
- Xin He
- Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, IL, USA.
| | | |
Collapse
|
4
|
Bina M, Wyss P, Lazarus SA, Shah SR, Ren W, Szpankowski W, Crawford GE, Park SP, Song XC. Discovering sequences with potential regulatory characteristics. Genomics 2009; 93:314-22. [DOI: 10.1016/j.ygeno.2008.11.008] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2007] [Revised: 05/28/2008] [Accepted: 11/17/2008] [Indexed: 11/25/2022]
|
5
|
Liberman LM, Stathopoulos A. Design flexibility in cis-regulatory control of gene expression: synthetic and comparative evidence. Dev Biol 2008; 327:578-89. [PMID: 19135437 DOI: 10.1016/j.ydbio.2008.12.020] [Citation(s) in RCA: 65] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2008] [Revised: 12/13/2008] [Accepted: 12/16/2008] [Indexed: 11/18/2022]
Abstract
In early Drosophila embryos, the transcription factor Dorsal regulates patterns of gene expression and cell fate specification along the dorsal-ventral axis. How gene expression is produced within the broad lateral domain of the presumptive neurogenic ectoderm is not understood. To investigate transcriptional control during neurogenic ectoderm specification, we examined divergence and function of an embryonic cis-regulatory element controlling the gene short gastrulation (sog). While transcription factor binding sites are not completely conserved, we demonstrate that these sequences are bona fide regulatory elements, despite variable regulatory architecture. Mutation of conserved sequences revealed that putative transcription factor binding sites for Dorsal and Zelda, a ubiquitous maternal transcription factor, are required for proper sog expression. When Zelda and Dorsal sites are paired in a synthetic regulatory element, broad lateral expression results. However, synthetic regulatory elements that contain Dorsal and an additional activator also drive expression throughout the neurogenic ectoderm. Our results suggest that interaction between Dorsal and Zelda drives expression within the presumptive neurogenic ectoderm, but they also demonstrate that regulatory architecture directing expression in this domain is flexible. We propose a model for neurogenic ectoderm specification in which gene regulation occurs at the intersection of temporal and spatial transcription factor inputs.
Collapse
Affiliation(s)
- Louisa M Liberman
- California Institute of Technology, Division of Biology, 1200 E. California Blvd., MC 114-96, Pasadena, CA 91125, USA
| | | |
Collapse
|
6
|
Rao A, Hero AO, States DJ, Engel JD. Using directed information to build biologically relevant influence networks. J Bioinform Comput Biol 2008; 6:493-519. [PMID: 18574860 DOI: 10.1142/s0219720008003515] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2007] [Revised: 12/01/2007] [Accepted: 01/03/2008] [Indexed: 11/18/2022]
Abstract
The systematic inference of biologically relevant influence networks remains a challenging problem in computational biology. Even though the availability of high-throughput data has enabled the use of probabilistic models to infer the plausible structure of such networks, their true interpretation of the biology of the process is questionable. In this work, we propose a network inference methodology, based on the directed information (DTI) criterion, that incorporates the biology of transcription within the framework so as to enable experimentally verifiable inference. We use publicly available embryonic kidney and T-cell microarray datasets to demonstrate our results. We present two variants of network inference via DTI--supervised and unsupervised--and the inferred networks relevant to mammalian nephrogenesis and T-cell activation. Conformity of the obtained interactions with the literature as well as comparison with the coefficient of determination (CoD) method are demonstrated. Apart from network inference, the proposed framework enables the exploration of specific interactions, not just those revealed by data. To illustrate the latter point, a DTI-based framework to resolve interactions between transcription factor modules and target coregulated genes is proposed. Additionally, we show that DTI can be used in conjunction with mutual information to infer higher-order influence networks involving cooperative gene interactions.
Collapse
Affiliation(s)
- Arvind Rao
- Department of Electrical Engineering and Computer Science, University of Michigan, Ann Arbor, MI 48109, USA.
| | | | | | | |
Collapse
|
7
|
Lecci MS, Malta TM, Flausino VT, Gitaí DL, Ruiz JC, Monesi N. Functional and bioinformatics analyses reveal conservation ofcis-regulatory elements between sciaridae and drosophilidae. Genesis 2008; 46:43-51. [DOI: 10.1002/dvg.20364] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
|
8
|
Cheung TH, Barthel KKB, Kwan YL, Liu X. Identifying pattern-defined regulatory islands in mammalian genomes. Proc Natl Acad Sci U S A 2007; 104:10116-21. [PMID: 17535887 PMCID: PMC1891267 DOI: 10.1073/pnas.0704028104] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023] Open
Abstract
Identifying cis-regulatory regions in mammalian genomes is a key challenge toward understanding transcriptional regulation. However, identification and functional characterization of those regulatory elements governing differential gene expression has been hampered by the limited understanding of their organization and locations in genomes. We hypothesized that genes that are conserved across species will also display conservation at the level of their transcriptional regulation and that this will be reflected in the organization of cis-elements mediating this regulation. Using a computational approach, clusters of transcription factor binding sites that are absolutely conserved in order and in spacing across human, rat, and mouse genomes were identified. We term these regions pattern-defined regulatory islands (PRIs). We discovered that these sequences are frequently active sites of transcriptional regulation. These PRIs occur in approximately 1.1% of the half-billion base pairs covered in the search and are located mainly in noncoding regions of the genome. We show that the premise of PRIs can be used to identify previously known and novel cis-regulatory regions controlling genes regulated by myogenic differentiation. Thus, PRIs may represent a fundamental property of the architecture of cis-regulatory elements in mammalian genomes, and this feature can be exploited to pinpoint critical transcriptional regulatory elements governing cell type-specific gene expression.
Collapse
Affiliation(s)
- Tom H. Cheung
- Department of Chemistry and Biochemistry, University of Colorado, Boulder, CO 80309
| | | | - Yin Lam Kwan
- Department of Chemistry and Biochemistry, University of Colorado, Boulder, CO 80309
| | - Xuedong Liu
- Department of Chemistry and Biochemistry, University of Colorado, Boulder, CO 80309
- To whom correspondence should be addressed. E-mail:
| |
Collapse
|
9
|
Barta E. Comparative genomics-based orthologous promoter analysis using the DoOP database and the DoOPSearch web tool. Methods Mol Biol 2007; 395:319-328. [PMID: 17993683 DOI: 10.1007/978-1-59745-514-5_20] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/25/2023]
Abstract
Bioinformatic and experimental analyses of promoter regions are available for a long time. Finding of the transcription factor binding sites (TFBSs), however, by either method still faces a number of problems. For example, because of the ambiguity of binding of transcription factors, the number of false-positives and -negatives can be unexpectedly high in these sequence analyses. We can assume that evolutionary conserved motifs or regions in the promoters of the homologous genes function as TFBSs. Thus, a comparative genomic approach can provide a partial resolution for the problem previously outlined. This chapter describes application of the DoOP database and the DoOPSearch web tools for such a comparative genomic analysis. Orthologous promoter sequences and conserved motifs can be extracted from the DoOP database for further analysis. The web-based tools of the DoOPSearch webpage can be used for searching and comparing conserved motifs. Using these tools, it is possible to compare short sequences with conserved motifs, to map conserved motifs into a longer promoter region, or find sequence patterns in different sets of promoter sequences.
Collapse
Affiliation(s)
- Endre Barta
- Agricultural Biotechnology Center, Bioinformatics Group
| |
Collapse
|
10
|
Gómez-Skarmeta JL, Lenhard B, Becker TS. New technologies, new findings, and new concepts in the study of vertebrate cis-regulatory sequences. Dev Dyn 2006; 235:870-85. [PMID: 16395688 DOI: 10.1002/dvdy.20659] [Citation(s) in RCA: 35] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023] Open
Abstract
All vertebrates share a similar early embryonic body plan and use the same regulatory genes for their development. The availability of numerous sequenced vertebrate genomes and significant advances in bioinformatics have resulted in the finding that the genomic regions of many of these developmental regulatory genes also contain highly conserved noncoding sequence. In silico discovery of conserved noncoding regions and of transcription factor binding sites as well as the development of methods for high throughput transgenesis in Xenopus and zebrafish are dramatically increasing the speed with which regulatory elements can be discovered, characterized, and tested in the context of whole live embryos. We review here some of the recent technological developments that will likely lead to a surge in research on how vertebrate genomes encode regulation of transcriptional activity, how regulatory sequences constrain genomic architecture, and ultimately how vertebrate form has evolved.
Collapse
|
11
|
Papatsenko D, Kislyuk A, Levine M, Dubchak I. Conservation patterns in different functional sequence categories of divergent Drosophila species. Genomics 2006; 88:431-42. [PMID: 16697139 DOI: 10.1016/j.ygeno.2006.03.012] [Citation(s) in RCA: 17] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2006] [Revised: 03/16/2006] [Accepted: 03/21/2006] [Indexed: 01/12/2023]
Abstract
We have explored the distributions of fully conserved ungapped blocks in genome-wide pair-wise alignments of recently completed species of Drosophila: D. melanogaster, D. yakuba, D. ananassae, D. pseudoobscura, D. virilis, and D. mojavensis. Based on these distributions we have found that nearly every functional sequence category possesses its own distinctive conservation pattern, sometimes independent of the overall sequence conservation level. In the coding and regulatory regions, the ungapped blocks were longer than in introns, UTRs, and nonfunctional sequences. At the same time, the blocks in the coding regions carried a 3N + 2 signature characteristic of synonymous substitutions in the third-codon position. Larger block sizes in transcription regulatory regions can be explained by the presence of conserved arrays of binding sites for transcription factors. We also have shown that the longest ungapped blocks, or "ultraconserved" sequences, are associated with specific gene groups, including those encoding ion channels and components of the cytoskeleton. We discuss how restraining conservation patterns may help in mapping functional sequence categories and improve genome annotation.
Collapse
Affiliation(s)
- Dmitri Papatsenko
- Department of Molecular and Cell Biology, University of California at Berkeley, Berkeley, CA 94720, USA.
| | | | | | | |
Collapse
|
12
|
Abstract
The synthesis of gene expression data and cis-regulatory analysis permits the elucidation of genomic regulatory networks. These networks provide a direct visualization of the functional interconnections among the regulatory genes and signaling components leading to cell-specific patterns of gene activity. Complex developmental processes are thereby illuminated in ways not revealed by the conventional analysis of individual genes. In this review, we describe emerging networks in several different model systems, and compare them with the gene regulatory network that controls dorsoventral patterning of the Drosophila embryo.
Collapse
Affiliation(s)
- Angelike Stathopoulos
- Division of Biology, California Institute of Technology, Pasadena, California 91125, USA.
| | | |
Collapse
|
13
|
Bejerano G, Siepel AC, Kent WJ, Haussler D. Computational screening of conserved genomic DNA in search of functional noncoding elements. Nat Methods 2005; 2:535-45. [PMID: 16170870 DOI: 10.1038/nmeth0705-535] [Citation(s) in RCA: 48] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023]
Affiliation(s)
- Gill Bejerano
- Center for Biomolecular Science and Engineering, University of California Santa Cruz, Santa Cruz, California 95064, USA.
| | | | | | | |
Collapse
|