1
|
Joshi M, Kapopoulou A, Laurent S. Impact of Genetic Variation in Gene Regulatory Sequences: A Population Genomics Perspective. Front Genet 2021; 12:660899. [PMID: 34276769 PMCID: PMC8282999 DOI: 10.3389/fgene.2021.660899] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2021] [Accepted: 05/31/2021] [Indexed: 01/06/2023] Open
Abstract
The unprecedented rise of high-throughput sequencing and assay technologies has provided a detailed insight into the non-coding sequences and their potential role as gene expression regulators. These regulatory non-coding sequences are also referred to as cis-regulatory elements (CREs). Genetic variants occurring within CREs have been shown to be associated with altered gene expression and phenotypic changes. Such variants are known to occur spontaneously and ultimately get fixed, due to selection and genetic drift, in natural populations and, in some cases, pave the way for speciation. Hence, the study of genetic variation at CREs has improved our overall understanding of the processes of local adaptation and evolution. Recent advances in high-throughput sequencing and better annotations of CREs have enabled the evaluation of the impact of such variation on gene expression, phenotypic alteration and fitness. Here, we review recent research on the evolution of CREs and concentrate on studies that have investigated genetic variation occurring in these regulatory sequences within the context of population genetics.
Collapse
Affiliation(s)
- Manas Joshi
- Department of Comparative Development and Genetics, Max Planck Institute for Plant Breeding Research, Cologne, Germany
| | | | - Stefan Laurent
- Department of Comparative Development and Genetics, Max Planck Institute for Plant Breeding Research, Cologne, Germany
| |
Collapse
|
2
|
Zhao XW, Kishino H. Multiple Isolated Transcription Factors Act as Switches and Contribute to Species Uniqueness. Genes (Basel) 2020; 11:E1148. [PMID: 33003522 PMCID: PMC7600484 DOI: 10.3390/genes11101148] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2020] [Revised: 09/20/2020] [Accepted: 09/28/2020] [Indexed: 01/01/2023] Open
Abstract
Mammals have variable numbers (1300-2000) of transcription factors (TFs), but the reasons for this large variation are unclear. To investigate general TF patterns, we de novo identified 156,906 TFs from 96 mammalian species. We identified more than 500 human isolated TFs that are rarely reported in human TF-to-TF networks. Mutations in the genes of these TFs were less lethal than those of connected TFs. Consequently, these isolated TFs are more tolerant of changes and have become unique during speciation. They may also serve as a source of variation for TF evolution. Reconciliation of TF-family phylogenetic trees with a mammalian species tree revealed an average of 37.8% TF gains and 15.0% TF losses over 177 million years, which implies that isolated TFs are pervasive in mammals. Compared with non-TF interacting genes, TF-interacting genes have unique TF profiles and have higher expression levels in mice than in humans. Different expression levels of the same TF-interacting gene contribute to species-specific phenotypes. Formation and loss of isolated TFs enabling unique TF profiles may provide variable switches that adjust divergent expression profiles of target genes to generate species-specific phenotypes, thereby making species unique.
Collapse
Affiliation(s)
- Xin-Wei Zhao
- Graduate School of Agricultural and Life Sciences, The University of Tokyo, Tokyo 113-8657, Japan;
| | | |
Collapse
|
3
|
Abstract
One of the most important resources for researchers of noncoding RNAs is the information available in public databases spread over the internet. However, the effective exploration of this data can represent a daunting task, given the large amount of databases available and the variety of stored data. This chapter describes a classification of databases based on information source, type of RNA, source organisms, data formats, and the mechanisms for information retrieval, detailing the relevance of each of these classifications and its usability by researchers. This classification is used to update a 2012 review, indexing now more than 229 public databases. This review will include an assessment of the new trends for ncRNA research based on the information that is being offered by the databases. Additionally, we will expand the previous analysis focusing on the usability and application of these databases in pathogen and disease research. Finally, this chapter will analyze how currently available database schemas can help the development of new and improved web resources.
Collapse
|
4
|
Biçer A, Orlando S, Islam ABMMK, Gallastegui E, Besson A, Aligué R, Bachs O, Pujol MJ. ChIP-Seq analysis identifies p27(Kip1)-target genes involved in cell adhesion and cell signalling in mouse embryonic fibroblasts. PLoS One 2017; 12:e0187891. [PMID: 29155860 PMCID: PMC5695801 DOI: 10.1371/journal.pone.0187891] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2016] [Accepted: 10/27/2017] [Indexed: 12/25/2022] Open
Abstract
The protein p27Kip1 (p27), a member of the Cip-Kip family of cyclin-dependent kinase inhibitors, is involved in tumorigenesis and a correlation between reduced levels of this protein in human tumours and a worse prognosis has been established. Recent reports revealed that p27 also behaves as a transcriptional regulator. Thus, it has been postulated that the development of tumours with low amounts of p27 could be propitiated by deregulation of transcriptional programs under the control of p27. However, these programs still remain mostly unknown. The aim of this study has been to define the transcriptional programs regulated by p27 by first identifying the p27-binding sites (p27-BSs) on the whole chromatin of quiescent mouse embryonic fibroblasts. The chromatin regions associated to p27 have been annotated to the most proximal genes and it has been considered that the expression of these genes could by regulated by p27. The identification of the chromatin p27-BSs has been performed by Chromatin Immunoprecipitation Sequencing (ChIP-seq). Results revealed that p27 associated with 1839 sites that were annotated to 1417 different genes being 852 of them protein coding genes. Interestingly, most of the p27-BSs were in distal intergenic regions and introns whereas, in contrast, its association with promoter regions was very low. Gene ontology analysis of the protein coding genes revealed a number of relevant transcriptional programs regulated by p27 as cell adhesion, intracellular signalling and neuron differentiation among others. We validated the interaction of p27 with different chromatin regions by ChIP followed by qPCR and demonstrated that the expressions of several genes belonging to these programs are actually regulated by p27. Finally, cell adhesion assays revealed that the adhesion of p27-/- cells to the plates was much higher that controls, revealing a role of p27 in the regulation of a transcriptional program involved in cell adhesion.
Collapse
Affiliation(s)
- Atilla Biçer
- Department of Biomedical Sciences, University of Barcelona-IDIBAPS (Institut d'investigacions Biomèdiques August Pi i Sunyer), Barcelona, Spain
| | - Serena Orlando
- Department of Biomedical Sciences, University of Barcelona-IDIBAPS (Institut d'investigacions Biomèdiques August Pi i Sunyer), Barcelona, Spain
| | - Abul B M M K Islam
- Department of Genetic Engineering and Biotechnology, University of Dhaka, Dhaka, Bangladesh
| | - Edurne Gallastegui
- Department of Biomedical Sciences, University of Barcelona-IDIBAPS (Institut d'investigacions Biomèdiques August Pi i Sunyer), Barcelona, Spain
| | - Arnaud Besson
- INSERM UMR1037, Cancer Research Center of Toulouse, Toulouse, France.,Université de Toulouse, Toulouse, France.,CNRS ERL5294, Toulouse, France
| | - Rosa Aligué
- Department of Biomedical Sciences, University of Barcelona-IDIBAPS (Institut d'investigacions Biomèdiques August Pi i Sunyer), Barcelona, Spain
| | - Oriol Bachs
- Department of Biomedical Sciences, University of Barcelona-IDIBAPS (Institut d'investigacions Biomèdiques August Pi i Sunyer), Barcelona, Spain
| | - Maria Jesús Pujol
- Department of Biomedical Sciences, University of Barcelona-IDIBAPS (Institut d'investigacions Biomèdiques August Pi i Sunyer), Barcelona, Spain
| |
Collapse
|
5
|
Verma S, Gazara RK, Verma PK. Transcription Factor Repertoire of Necrotrophic Fungal Phytopathogen Ascochyta rabiei: Predominance of MYB Transcription Factors As Potential Regulators of Secretome. FRONTIERS IN PLANT SCIENCE 2017; 8:1037. [PMID: 28659964 PMCID: PMC5470089 DOI: 10.3389/fpls.2017.01037] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/31/2017] [Accepted: 05/30/2017] [Indexed: 06/02/2023]
Abstract
Transcription factors (TFs) are the key players in gene expression and their study is highly significant for shedding light on the molecular mechanisms and evolutionary history of organisms. During host-pathogen interaction, extensive reprogramming of gene expression facilitated by TFs is likely to occur in both host and pathogen. To date, the knowledge about TF repertoire in filamentous fungi is in infancy. The necrotrophic fungus Ascochyta rabiei, that causes destructive Ascochyta blight (AB) disease of chickpea (Cicer arietinum), demands more comprehensive study for better understanding of Ascochyta-legume pathosystem. In the present study, we performed the genome-wide identification and analysis of TFs in A. rabiei. Taking advantage of A. rabiei genome sequence, we used a bioinformatic approach to predict the TF repertoire of A. rabiei. For identification and classification of A. rabiei TFs, we designed a comprehensive pipeline using a combination of BLAST and InterProScan software. A total of 381 A. rabiei TFs were predicted and divided into 32 fungal specific families of TFs. The gene structure, domain organization and phylogenetic analysis of abundant families of A. rabiei TFs were also carried out. Comparative study of A. rabiei TFs with that of other necrotrophic, biotrophic, hemibiotrophic, symbiotic, and saprotrophic fungi was performed. It suggested presence of both conserved as well as unique features among them. Moreover, cis-acting elements on promoter sequences of earlier predicted A. rabiei secretome were also identified. With the help of published A. rabiei transcriptome data, the differential expression of TF and secretory protein coding genes was analyzed. Furthermore, comprehensive expression analysis of few selected A. rabiei TFs using quantitative real-time polymerase chain reaction revealed variety of expression patterns during host colonization. These genes were expressed in at least one of the time points tested post infection. Overall, this study illustrates the first genome-wide identification and analysis of TF repertoire of A. rabiei. This work would provide the basis for further studies to dissect role of TFs in the molecular mechanisms during A. rabiei-chickpea interactions.
Collapse
Affiliation(s)
| | | | - Praveen K. Verma
- Plant Immunity Laboratory, National Institute of Plant Genome ResearchNew Delhi, India
| |
Collapse
|
6
|
Henriques R, Ferreira FL, Madeira SC. BicPAMS: software for biological data analysis with pattern-based biclustering. BMC Bioinformatics 2017; 18:82. [PMID: 28153040 PMCID: PMC5290636 DOI: 10.1186/s12859-017-1493-3] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2016] [Accepted: 01/21/2017] [Indexed: 12/21/2022] Open
Abstract
BACKGROUND Biclustering has been largely applied for the unsupervised analysis of biological data, being recognised today as a key technique to discover putative modules in both expression data (subsets of genes correlated in subsets of conditions) and network data (groups of coherently interconnected biological entities). However, given its computational complexity, only recent breakthroughs on pattern-based biclustering enabled efficient searches without the restrictions that state-of-the-art biclustering algorithms place on the structure and homogeneity of biclusters. As a result, pattern-based biclustering provides the unprecedented opportunity to discover non-trivial yet meaningful biological modules with putative functions, whose coherency and tolerance to noise can be tuned and made problem-specific. METHODS To enable the effective use of pattern-based biclustering by the scientific community, we developed BicPAMS (Biclustering based on PAttern Mining Software), a software that: 1) makes available state-of-the-art pattern-based biclustering algorithms (BicPAM (Henriques and Madeira, Alg Mol Biol 9:27, 2014), BicNET (Henriques and Madeira, Alg Mol Biol 11:23, 2016), BicSPAM (Henriques and Madeira, BMC Bioinforma 15:130, 2014), BiC2PAM (Henriques and Madeira, Alg Mol Biol 11:1-30, 2016), BiP (Henriques and Madeira, IEEE/ACM Trans Comput Biol Bioinforma, 2015), DeBi (Serin and Vingron, AMB 6:1-12, 2011) and BiModule (Okada et al., IPSJ Trans Bioinf 48(SIG5):39-48, 2007)); 2) consistently integrates their dispersed contributions; 3) further explores additional accuracy and efficiency gains; and 4) makes available graphical and application programming interfaces. RESULTS Results on both synthetic and real data confirm the relevance of BicPAMS for biological data analysis, highlighting its essential role for the discovery of putative modules with non-trivial yet biologically significant functions from expression and network data. CONCLUSIONS BicPAMS is the first biclustering tool offering the possibility to: 1) parametrically customize the structure, coherency and quality of biclusters; 2) analyze large-scale biological networks; and 3) tackle the restrictive assumptions placed by state-of-the-art biclustering algorithms. These contributions are shown to be key for an adequate, complete and user-assisted unsupervised analysis of biological data. SOFTWARE BicPAMS and its tutorial available in http://www.bicpams.com .
Collapse
Affiliation(s)
- Rui Henriques
- INESC-ID and Instituto Superior Técnico, Universidade de Lisboa, Lisboa, Portugal
| | | | - Sara C. Madeira
- INESC-ID and Instituto Superior Técnico, Universidade de Lisboa, Lisboa, Portugal
| |
Collapse
|
7
|
Schmeier S, Alam T, Essack M, Bajic VB. TcoF-DB v2: update of the database of human and mouse transcription co-factors and transcription factor interactions. Nucleic Acids Res 2016; 45:D145-D150. [PMID: 27789689 PMCID: PMC5210517 DOI: 10.1093/nar/gkw1007] [Citation(s) in RCA: 52] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2016] [Revised: 09/29/2016] [Accepted: 10/17/2016] [Indexed: 12/13/2022] Open
Abstract
Transcription factors (TFs) play a pivotal role in transcriptional regulation, making them crucial for cell survival and important biological functions. For the regulation of transcription, interactions of different regulatory proteins known as transcription co-factors (TcoFs) and TFs are essential in forming necessary protein complexes. Although TcoFs themselves do not bind DNA directly, their influence on transcriptional regulation and initiation, although indirect, has been shown to be significant, with the functionality of TFs strongly influenced by the presence of TcoFs. In the TcoF-DB v2 database, we collect information on TcoFs. In this article, we describe updates and improvements implemented in TcoF-DB v2. TcoF-DB v2 provides several new features that enables exploration of the roles of TcoFs. The content of the database has significantly expanded, and is enriched with information from Gene Ontology, biological pathways, diseases and molecular signatures. TcoF-DB v2 now includes many more TFs; has substantially increased the number of human TcoFs to 958, and now includes information on mouse (418 new TcoFs). TcoF-DB v2 enables the exploration of information on TcoFs and allows investigations into their influence on transcriptional regulation in humans and mice. TcoF-DB v2 can be accessed at http://tcofdb.org/.
Collapse
Affiliation(s)
- Sebastian Schmeier
- Massey University Auckland, Institute of Natural and Mathematical Sciences, Auckland, New Zealand
| | - Tanvir Alam
- King Abdullah University of Science and Technology (KAUST), Computational Bioscience Research Center (CBRC), Thuwal, Kingdom of Saudi Arabia
| | - Magbubah Essack
- King Abdullah University of Science and Technology (KAUST), Computational Bioscience Research Center (CBRC), Thuwal, Kingdom of Saudi Arabia
| | - Vladimir B Bajic
- King Abdullah University of Science and Technology (KAUST), Computational Bioscience Research Center (CBRC), Thuwal, Kingdom of Saudi Arabia
| |
Collapse
|
8
|
Henriques R, Madeira SC. BicPAM: Pattern-based biclustering for biomedical data analysis. Algorithms Mol Biol 2014; 9:27. [PMID: 25649207 PMCID: PMC4302537 DOI: 10.1186/s13015-014-0027-z] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2014] [Accepted: 11/12/2014] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Biclustering, the discovery of sets of objects with a coherent pattern across a subset of conditions, is a critical task to study a wide-set of biomedical problems, where molecular units or patients are meaningfully related with a set of properties. The challenging combinatorial nature of this task led to the development of approaches with restrictions on the allowed type, number and quality of biclusters. Contrasting, recent biclustering approaches relying on pattern mining methods can exhaustively discover flexible structures of robust biclusters. However, these approaches are only prepared to discover constant biclusters and their underlying contributions remain dispersed. METHODS The proposed BicPAM biclustering approach integrates existing principles made available by state-of-the-art pattern-based approaches with two new contributions. First, BicPAM is the first efficient attempt to exhaustively mine non-constant types of biclusters, including additive and multiplicative coherencies in the presence or absence of symmetries. Second, BicPAM provides strategies to effectively compose different biclustering structures and to handle arbitrary levels of noise inherent to data and with discretization procedures. RESULTS Results show BicPAM's superiority against its peers and its ability to retrieve unique types of biclusters of interest, to efficiently deliver exhaustive solutions and to successfully recover planted biclusters in datasets with varying levels of missing values and noise. Its application over gene expression data leads to unique solutions with heightened biological relevance. CONCLUSIONS BicPAM approaches integrate existing disperse efforts towards pattern-based biclustering and provides the first critical strategies to efficiently discover exhaustive solutions of biclusters with shifting, scaling and symmetric assumptions with varying quality and underlying structures. Additionally, BicPAM dynamically adapts its behavior to mine data with different levels of missing values and noise.
Collapse
Affiliation(s)
- Rui Henriques
- INESC-ID and Instituto Superior Técnico, Universidade de Lisboa, Lisbon, Portugal
| | - Sara C Madeira
- INESC-ID and Instituto Superior Técnico, Universidade de Lisboa, Lisbon, Portugal
| |
Collapse
|
9
|
Paschoal AR, Maracaja-Coutinho V, Setubal JC, Simões ZLP, Verjovski-Almeida S, Durham AM. Non-coding transcription characterization and annotation. RNA Biol 2014; 9:274-82. [DOI: 10.4161/rna.19352] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022] Open
|
10
|
Zhang HM, Liu T, Liu CJ, Song S, Zhang X, Liu W, Jia H, Xue Y, Guo AY. AnimalTFDB 2.0: a resource for expression, prediction and functional study of animal transcription factors. Nucleic Acids Res 2014; 43:D76-81. [PMID: 25262351 PMCID: PMC4384004 DOI: 10.1093/nar/gku887] [Citation(s) in RCA: 201] [Impact Index Per Article: 20.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/30/2023] Open
Abstract
Transcription factors (TFs) are key regulators for gene expression. Here we updated the animal TF database AnimalTFDB to version 2.0 (http://bioinfo.life.hust.edu.cn/AnimalTFDB/). Using the improved prediction pipeline, we identified 72 336 TF genes, 21 053 transcription co-factor genes and 6502 chromatin remodeling factor genes from 65 species covering main animal lineages. Besides the abundant annotations (basic information, gene model, protein functional domain, gene ontology, pathway, protein interaction, ortholog and paralog, etc.) in the previous version, we made several new features and functions in the updated version. These new features are: (i) gene expression from RNA-Seq for nine model species, (ii) gene phenotype information, (iii) multiple sequence alignment of TF DNA-binding domains, and the weblogo and phylogenetic tree based on the alignment, (iv) a TF prediction server to identify new TFs from input sequences and (v) a BLAST server to search against TFs in AnimalTFDB. A new nice web interface was designed for AnimalTFDB 2.0 allowing users to browse and search all data in the database. We aim to maintain the AnimalTFDB as a solid resource for TF identification and studies of transcription regulation and comparative genomics.
Collapse
Affiliation(s)
- Hong-Mei Zhang
- Department of Biomedical Engineering, Key Laboratory of Molecular Biophysics of the Ministry of Education, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, Hubei 430074, PR China
| | - Teng Liu
- Department of Biomedical Engineering, Key Laboratory of Molecular Biophysics of the Ministry of Education, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, Hubei 430074, PR China
| | - Chun-Jie Liu
- Department of Biomedical Engineering, Key Laboratory of Molecular Biophysics of the Ministry of Education, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, Hubei 430074, PR China
| | - Shuangyang Song
- Department of Biomedical Engineering, Key Laboratory of Molecular Biophysics of the Ministry of Education, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, Hubei 430074, PR China
| | - Xiantong Zhang
- Department of Biomedical Engineering, Key Laboratory of Molecular Biophysics of the Ministry of Education, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, Hubei 430074, PR China
| | - Wei Liu
- Department of Biomedical Engineering, Key Laboratory of Molecular Biophysics of the Ministry of Education, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, Hubei 430074, PR China
| | - Haibo Jia
- Department of Biomedical Engineering, Key Laboratory of Molecular Biophysics of the Ministry of Education, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, Hubei 430074, PR China
| | - Yu Xue
- Department of Biomedical Engineering, Key Laboratory of Molecular Biophysics of the Ministry of Education, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, Hubei 430074, PR China
| | - An-Yuan Guo
- Department of Biomedical Engineering, Key Laboratory of Molecular Biophysics of the Ministry of Education, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, Hubei 430074, PR China
| |
Collapse
|
11
|
Lomonaco V, Martoglia R, Mandreoli F, Anderlucci L, Emmett W, Bicciato S, Taccioli C. UCbase 2.0: ultraconserved sequences database (2014 update). DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2014; 2014:bau062. [PMID: 24951797 PMCID: PMC4064129 DOI: 10.1093/database/bau062] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]
Abstract
UCbase 2.0 (http://ucbase.unimore.it) is an update, extension and evolution of UCbase, a Web tool dedicated to the analysis of ultraconserved sequences (UCRs). UCRs are 481 sequences >200 bases sharing 100% identity among human, mouse and rat genomes. They are frequently located in genomic regions known to be involved in cancer or differentially expressed in human leukemias and carcinomas. UCbase 2.0 is a platform-independent Web resource that includes the updated version of the human genome annotation (hg19), information linking disorders to chromosomal coordinates based on the Systematized Nomenclature of Medicine classification, a query tool to search for Single Nucleotide Polymorphisms (SNPs) and a new text box to directly interrogate the database using a MySQL interface. To facilitate the interactive visual interpretation of UCR chromosomal positioning, UCbase 2.0 now includes a graph visualization interface directly linked to UCSC genome browser. Database URL:http://ucbase.unimore.it
Collapse
Affiliation(s)
- Vincenzo Lomonaco
- Computer Engineering Department, University of Modena, Via Campi 213/b, 44100, Modena, Department of Statistical Sciences, University of Bologna, Via Belle Arti 41, 40126, Bologna, Italy, Department of Genetics, Environment and Evolution, Genetics Institute, University College London, London, WC1E 6BT, UK and Department of Life Sciences, Center for Genome Research, University of Modena and Reggio Emilia, Via G. Campi 287, 41100, Modena, Italy
| | - Riccardo Martoglia
- Computer Engineering Department, University of Modena, Via Campi 213/b, 44100, Modena, Department of Statistical Sciences, University of Bologna, Via Belle Arti 41, 40126, Bologna, Italy, Department of Genetics, Environment and Evolution, Genetics Institute, University College London, London, WC1E 6BT, UK and Department of Life Sciences, Center for Genome Research, University of Modena and Reggio Emilia, Via G. Campi 287, 41100, Modena, Italy
| | - Federica Mandreoli
- Computer Engineering Department, University of Modena, Via Campi 213/b, 44100, Modena, Department of Statistical Sciences, University of Bologna, Via Belle Arti 41, 40126, Bologna, Italy, Department of Genetics, Environment and Evolution, Genetics Institute, University College London, London, WC1E 6BT, UK and Department of Life Sciences, Center for Genome Research, University of Modena and Reggio Emilia, Via G. Campi 287, 41100, Modena, Italy
| | - Laura Anderlucci
- Computer Engineering Department, University of Modena, Via Campi 213/b, 44100, Modena, Department of Statistical Sciences, University of Bologna, Via Belle Arti 41, 40126, Bologna, Italy, Department of Genetics, Environment and Evolution, Genetics Institute, University College London, London, WC1E 6BT, UK and Department of Life Sciences, Center for Genome Research, University of Modena and Reggio Emilia, Via G. Campi 287, 41100, Modena, Italy
| | - Warren Emmett
- Computer Engineering Department, University of Modena, Via Campi 213/b, 44100, Modena, Department of Statistical Sciences, University of Bologna, Via Belle Arti 41, 40126, Bologna, Italy, Department of Genetics, Environment and Evolution, Genetics Institute, University College London, London, WC1E 6BT, UK and Department of Life Sciences, Center for Genome Research, University of Modena and Reggio Emilia, Via G. Campi 287, 41100, Modena, Italy
| | - Silvio Bicciato
- Computer Engineering Department, University of Modena, Via Campi 213/b, 44100, Modena, Department of Statistical Sciences, University of Bologna, Via Belle Arti 41, 40126, Bologna, Italy, Department of Genetics, Environment and Evolution, Genetics Institute, University College London, London, WC1E 6BT, UK and Department of Life Sciences, Center for Genome Research, University of Modena and Reggio Emilia, Via G. Campi 287, 41100, Modena, Italy
| | - Cristian Taccioli
- Computer Engineering Department, University of Modena, Via Campi 213/b, 44100, Modena, Department of Statistical Sciences, University of Bologna, Via Belle Arti 41, 40126, Bologna, Italy, Department of Genetics, Environment and Evolution, Genetics Institute, University College London, London, WC1E 6BT, UK and Department of Life Sciences, Center for Genome Research, University of Modena and Reggio Emilia, Via G. Campi 287, 41100, Modena, Italy
| |
Collapse
|
12
|
Burgess D, Freeling M. The most deeply conserved noncoding sequences in plants serve similar functions to those in vertebrates despite large differences in evolutionary rates. THE PLANT CELL 2014; 26:946-61. [PMID: 24681619 PMCID: PMC4001403 DOI: 10.1105/tpc.113.121905] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/12/2023]
Abstract
In vertebrates, conserved noncoding elements (CNEs) are functionally constrained sequences that can show striking conservation over >400 million years of evolutionary distance and frequently are located megabases away from target developmental genes. Conserved noncoding sequences (CNSs) in plants are much shorter, and it has been difficult to detect conservation among distantly related genomes. In this article, we show not only that CNS sequences can be detected throughout the eudicot clade of flowering plants, but also that a subset of 37 CNSs can be found in all flowering plants (diverging ∼170 million years ago). These CNSs are functionally similar to vertebrate CNEs, being highly associated with transcription factor and development genes and enriched in transcription factor binding sites. Some of the most highly conserved sequences occur in genes encoding RNA binding proteins, particularly the RNA splicing-associated SR genes. Differences in sequence conservation between plants and animals are likely to reflect differences in the biology of the organisms, with plants being much more able to tolerate genomic deletions and whole-genome duplication events due, in part, to their far greater fecundity compared with vertebrates.
Collapse
|
13
|
An improved SELEX-Seq strategy for characterizing DNA-binding specificity of transcription factor: NF-κB as an example. PLoS One 2013; 8:e76109. [PMID: 24130762 PMCID: PMC3794954 DOI: 10.1371/journal.pone.0076109] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2013] [Accepted: 08/20/2013] [Indexed: 12/29/2022] Open
Abstract
SELEX-Seq is now the optimal high-throughput technique for characterizing DNA-binding specificities of transcription factors. In this study, we introduced an improved EMSA-based SELEX-Seq strategy with several advantages. The improvements of this strategy included: (1) using a FAM-labeled probe to track protein-DNA complex in polyacrylamide gel for rapidly recovering the protein-bound dsDNA without relying on traditional radioactive labeling or ethidium bromide staining; (2) monitoring the specificity of SELEX selection by detecting a positive and negative sequence doped into the input DNAs used in each round with PCR amplification; (3) using nested PCR to ensure the specificity of PCR amplification of the selected DNAs after each round; (4) using the nucleotides added at the 5′ end of the nested PCR primers as the split barcode to code DNAs from various rounds for multiplexing sequencing samples. The split barcode minimized selection times and thus greatly simplified the current SELEX-Seq procedure. The reliability of the strategy was demonstrated by performing a successful SELEX-Seq of a well-known transcription factor, NF-κB. Therefore, this study provided a useful SELEX-Seq strategy for characterizing DNA-binding specificities of transcription factors.
Collapse
|
14
|
Martin A, Orgogozo V. The Loci of repeated evolution: a catalog of genetic hotspots of phenotypic variation. Evolution 2013; 67:1235-50. [PMID: 23617905 DOI: 10.1111/evo.12081] [Citation(s) in RCA: 221] [Impact Index Per Article: 20.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2012] [Accepted: 01/26/2013] [Indexed: 12/11/2022]
Abstract
What is the nature of the genetic changes underlying phenotypic evolution? We have catalogued 1008 alleles described in the literature that cause phenotypic differences among animals, plants, and yeasts. Surprisingly, evolution of similar traits in distinct lineages often involves mutations in the same gene ("gene reuse"). This compilation yields three important qualitative implications about repeated evolution. First, the apparent evolution of similar traits by gene reuse can be traced back to two alternatives, either several independent causative mutations or a single original mutational event followed by sorting processes. Second, hotspots of evolution-defined as the repeated occurrence of de novo mutations at orthologous loci and causing similar phenotypic variation-are omnipresent in the literature with more than 100 examples covering various levels of analysis, including numerous gain-of-function events. Finally, several alleles of large effect have been shown to result from the aggregation of multiple small-effect mutations at the same hotspot locus, thus reconciling micromutationist theories of adaptation with the empirical observation of large-effect variants. Although data heterogeneity and experimental biases prevented us from extracting quantitative trends, our synthesis highlights the existence of genetic paths of least resistance leading to viable evolutionary change.
Collapse
Affiliation(s)
- Arnaud Martin
- Department of Ecology and Evolutionary Biology, Cornell University, Corson Hall, 215 Tower Road, Ithaca, New York, 14853, USA.
| | | |
Collapse
|
15
|
Dimitrieva S, Bucher P. UCNEbase--a database of ultraconserved non-coding elements and genomic regulatory blocks. Nucleic Acids Res 2012. [PMID: 23193254 PMCID: PMC3531063 DOI: 10.1093/nar/gks1092] [Citation(s) in RCA: 87] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
UCNEbase (http://ccg.vital-it.ch/UCNEbase) is a free, web-accessible information resource on the evolution and genomic organization of ultra-conserved non-coding elements (UCNEs). It currently covers 4351 such elements in 18 different species. The majority of UCNEs are supposed to be transcriptional regulators of key developmental genes. As most of them occur as clusters near potential target genes, the database is organized along two hierarchical levels: individual UCNEs and ultra-conserved genomic regulatory blocks (UGRBs). UCNEbase introduces a coherent nomenclature for UCNEs reflecting their respective associations with likely target genes. Orthologous and paralogous UCNEs share components of their names and are systematically cross-linked. Detailed synteny maps between the human and other genomes are provided for all UGRBs. UCNEbase is managed by a relational database system and can be accessed by a variety of web-based query pages. As it relies on the UCSC genome browser as visualization platform, a large part of its data content is also available as browser viewable custom track files. UCNEbase is potentially useful to any computational, experimental or evolutionary biologist interested in conserved non-coding DNA elements in vertebrates.
Collapse
Affiliation(s)
- Slavica Dimitrieva
- Swiss Institute for Experimental Cancer Research (ISREC), School of Life Sciences, Swiss Federal Institute of Technology (EPFL) and Swiss Institute of Bioinformatics (SIB), CH-1015 Lausanne, Switzerland
- *To whom correspondence should be addressed. Tel: +41 21 693 0956; Fax: +41 21 693 1850;
| | - Philipp Bucher
- Swiss Institute for Experimental Cancer Research (ISREC), School of Life Sciences, Swiss Federal Institute of Technology (EPFL) and Swiss Institute of Bioinformatics (SIB), CH-1015 Lausanne, Switzerland
- Correspondence may also be addressed to Slavica Dimitrieva. Tel: +41 21 693 0958; Fax: +41 21 693 1850;
| |
Collapse
|
16
|
Transcription factors expressed in embryonic and adult olfactory bulb neural stem cells reveal distinct proliferation, differentiation and epigenetic control. Genomics 2012; 101:12-9. [PMID: 23041222 DOI: 10.1016/j.ygeno.2012.09.006] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2012] [Accepted: 09/27/2012] [Indexed: 01/19/2023]
Abstract
TF genomic markers associated with neurogenesis, proliferation, differentiation, and epigenetic control in human embryonic neural stem cells (hENSC(, and adult human olfactory bulb neural stem cells (OBNSC) were studied by immunohistochemistry (IHC) and DNA microarray. The biological impact of TF gene changes in the examined cell types was estimated using DAVID to specify a different GO class and signaling pathway based on KEGG database. Eleven, and twenty eight TF genes were up-regulated (fold change≤2-39) in OBNSC, and hENSC respectively. KEGG pathway analysis for the up-regulated TF genes revealed significant enrichments for the basal transcription factor pathway, and Notch signaling pathway in OBNSCs, and hENSCs, respectively. Immunofluorescence analysis revealed a significantly greater number of β-tubulin III (TUBB3), MAP, glial fibrillary acidic protein (GFAP), and O4 in hENSC when compared to those in OBNSC. Furthermore, the expression of epigenetic-related TF-genes SMARCC1, TAF12, and UHRF1 increased significantly in OBNSC when compared with hENSC.
Collapse
|
17
|
Wiese CB, Ireland S, Fleming NL, Yu J, Valerius MT, Georgas K, Chiu HS, Brennan J, Armstrong J, Little MH, McMahon AP, Southard-Smith EM. A genome-wide screen to identify transcription factors expressed in pelvic Ganglia of the lower urinary tract. Front Neurosci 2012; 6:130. [PMID: 22988430 PMCID: PMC3439845 DOI: 10.3389/fnins.2012.00130] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2012] [Accepted: 08/22/2012] [Indexed: 12/16/2022] Open
Abstract
Relative positions of neurons within mature murine pelvic ganglia based on expression of neurotransmitters have been described. However the spatial organization of developing innervation in the murine urogenital tract (UGT) and the gene networks that regulate specification and maturation of neurons within the pelvic ganglia of the lower urinary tract (LUT) are unknown. We used whole-mount immunohistochemistry and histochemical stains to localize neural elements in 15.5 days post coitus (dpc) fetal mice. To identify potential regulatory factors expressed in pelvic ganglia, we surveyed expression patterns for known or probable transcription factors (TF) annotated in the mouse genome by screening a whole-mount in situ hybridization library of fetal UGTs. Of the 155 genes detected in pelvic ganglia, 88 encode TFs based on the presence of predicted DNA-binding domains. Neural crest (NC)-derived progenitors within the LUT were labeled by Sox10, a well-known regulator of NC development. Genes identified were categorized based on patterns of restricted expression in pelvic ganglia, pelvic ganglia and urethral epithelium, or pelvic ganglia and urethral mesenchyme. Gene expression patterns and the distribution of Sox10+, Phox2b+, Hu+, and PGP9.5+ cells within developing ganglia suggest previously unrecognized regional segregation of Sox10+ progenitors and differentiating neurons in early development of pelvic ganglia. Reverse transcription-PCR of pelvic ganglia RNA from fetal and post-natal stages demonstrated that multiple TFs maintain post-natal expression, although Pax3 is extinguished before weaning. Our analysis identifies multiple potential regulatory genes including TFs that may participate in segregation of discrete lineages within pelvic ganglia. The genes identified here are attractive candidate disease genes that may now be further investigated for their roles in malformation syndromes or in LUT dysfunction.
Collapse
Affiliation(s)
- Carrie B Wiese
- Division of Genetic Medicine, Department of Medicine, Vanderbilt University School of Medicine Nashville, TN, USA
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
18
|
Yu J, Valerius MT, Duah M, Staser K, Hansard JK, Guo JJ, McMahon J, Vaughan J, Faria D, Georgas K, Rumballe B, Ren Q, Krautzberger AM, Junker JP, Thiagarajan RD, Machanick P, Gray PA, van Oudenaarden A, Rowitch DH, Stiles CD, Ma Q, Grimmond SM, Bailey TL, Little MH, McMahon AP. Identification of molecular compartments and genetic circuitry in the developing mammalian kidney. Development 2012; 139:1863-73. [PMID: 22510988 DOI: 10.1242/dev.074005] [Citation(s) in RCA: 43] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/28/2022]
Abstract
Lengthy developmental programs generate cell diversity within an organotypic framework, enabling the later physiological actions of each organ system. Cell identity, cell diversity and cell function are determined by cell type-specific transcriptional programs; consequently, transcriptional regulatory factors are useful markers of emerging cellular complexity, and their expression patterns provide insights into the regulatory mechanisms at play. We performed a comprehensive genome-scale in situ expression screen of 921 transcriptional regulators in the developing mammalian urogenital system. Focusing on the kidney, analysis of regional-specific expression patterns identified novel markers and cell types associated with development and patterning of the urinary system. Furthermore, promoter analysis of synexpressed genes predicts transcriptional control mechanisms that regulate cell differentiation. The annotated informational resource (www.gudmap.org) will facilitate functional analysis of the mammalian kidney and provides useful information for the generation of novel genetic tools to manipulate emerging cell populations.
Collapse
Affiliation(s)
- Jing Yu
- Department of Stem Cell and Regenerative Biology, Department of Molecular and Cellular Biology, Harvard Stem Cell Institute, Harvard University, Cambridge, MA 02138, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
19
|
Vaquerizas JM, Teichmann SA, Luscombe NM. How do you find transcription factors? Computational approaches to compile and annotate repertoires of regulators for any genome. Methods Mol Biol 2012; 786:3-19. [PMID: 21938617 DOI: 10.1007/978-1-61779-292-2_1] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023]
Abstract
Transcription factors (TFs) play an important role in regulating gene expression. The availability of complete genome sequences and associated functional genomic data offer excellent opportunities to understand the transcriptional regulatory system of an entire organism. To do so, however, it is essential to compile a reliable dataset of regulatory components. Here, we review computational methods and publicly accessible resources that help identify TF-coding genes in prokaryotic and eukaryotic genomes. Since the regulatory functions of most TFs remain unknown, we also discuss approaches for combining diverse genomic datasets that will help elucidate their chromosomal organisation, expression, and evolutionary conservation. These analysis methods provide a solid foundation for further investigations of the transcriptional regulatory system.
Collapse
|
20
|
Zhang HM, Chen H, Liu W, Liu H, Gong J, Wang H, Guo AY. AnimalTFDB: a comprehensive animal transcription factor database. Nucleic Acids Res 2011; 40:D144-9. [PMID: 22080564 PMCID: PMC3245155 DOI: 10.1093/nar/gkr965] [Citation(s) in RCA: 235] [Impact Index Per Article: 18.1] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/04/2022] Open
Abstract
Transcription factors (TFs) are proteins that bind to specific DNA sequences, thereby playing crucial roles in gene-expression regulation through controlling the transcription of genetic information from DNA to RNA. Transcription cofactors and chromatin remodeling factors are also essential in the gene transcriptional regulation. Identifying and annotating all the TFs are primary and crucial steps for illustrating their functions and understanding the transcriptional regulation. In this study, based on manual literature reviews, we collected and curated 72 TF families for animals, which is currently the most complete list of TF families in animals. Then, we systematically characterized all the TFs in 50 animal species and constructed a comprehensive animal TF database, AnimalTFDB. To better serve the community, we provided detailed annotations for each TF, including basic information, gene structure, functional domain, 3D structure hit, Gene Ontology, pathway, protein–protein interaction, paralogs, orthologs, potential TF-binding sites and targets. In addition, we collected and annotated transcription cofactors and chromatin remodeling factors. AnimalTFDB has a user-friendly web interface with multiple browse and search functions, as well as data downloading. It is freely available at http://www.bioguo.org/AnimalTFDB/.
Collapse
Affiliation(s)
- Hong-Mei Zhang
- Hubei Bioinformatics & Molecular Imaging Key Laboratory, Department of Systems Biology, College of Life Science, Huazhong University of Science and Technology Wenhua College, Wuhan 430074, China
| | | | | | | | | | | | | |
Collapse
|
21
|
Lee AP, Brenner S, Venkatesh B. Mouse transgenesis identifies conserved functional enhancers and cis-regulatory motif in the vertebrate LIM homeobox gene Lhx2 locus. PLoS One 2011; 6:e20088. [PMID: 21629789 PMCID: PMC3100342 DOI: 10.1371/journal.pone.0020088] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2011] [Accepted: 04/17/2011] [Indexed: 12/03/2022] Open
Abstract
The vertebrate Lhx2 is a member of the LIM homeobox family of transcription factors. It is essential for the normal development of the forebrain, eye, olfactory system and liver as well for the differentiation of lymphoid cells. However, despite the highly restricted spatio-temporal expression pattern of Lhx2, nothing is known about its transcriptional regulation. In mammals and chicken, Crb2, Dennd1a and Lhx2 constitute a conserved linkage block, while the intervening Dennd1a is lost in the fugu Lhx2 locus. To identify functional enhancers of Lhx2, we predicted conserved noncoding elements (CNEs) in the human, mouse and fugu Crb2-Lhx2 loci and assayed their function in transgenic mouse at E11.5. Four of the eight CNE constructs tested functioned as tissue-specific enhancers in specific regions of the central nervous system and the dorsal root ganglia (DRG), recapitulating partial and overlapping expression patterns of Lhx2 and Crb2 genes. There was considerable overlap in the expression domains of the CNEs, which suggests that the CNEs are either redundant enhancers or regulating different genes in the locus. Using a large set of CNEs (810 CNEs) associated with transcription factor-encoding genes that express predominantly in the central nervous system, we predicted four over-represented 8-mer motifs that are likely to be associated with expression in the central nervous system. Mutation of one of them in a CNE that drove reporter expression in the neural tube and DRG abolished expression in both domains indicating that this motif is essential for expression in these domains. The failure of the four functional enhancers to recapitulate the complete expression pattern of Lhx2 at E11.5 indicates that there must be other Lhx2 enhancers that are either located outside the region investigated or divergent in mammals and fishes. Other approaches such as sequence comparison between multiple mammals are required to identify and characterize such enhancers.
Collapse
Affiliation(s)
- Alison P. Lee
- Comparative Genomics Laboratory, Institute of Molecular and Cell Biology,
A*STAR (Agency for Science, Technology and Research), Singapore,
Singapore
| | - Sydney Brenner
- Comparative Genomics Laboratory, Institute of Molecular and Cell Biology,
A*STAR (Agency for Science, Technology and Research), Singapore,
Singapore
| | - Byrappa Venkatesh
- Comparative Genomics Laboratory, Institute of Molecular and Cell Biology,
A*STAR (Agency for Science, Technology and Research), Singapore,
Singapore
| |
Collapse
|
22
|
Abstract
The control of gene expression is a biological process essential to all organisms. This is accomplished through the interaction of regulatory proteins with specific DNA motifs in the control regions of the genes that they regulate. Upon binding to DNA, and through specific protein-protein interactions, these regulatory proteins convey signals to the basal transcriptional machinery, containing the respective RNA polymerases, resulting in particular rates of gene expression. In eukaryotes, in addition and complementary to the binding of regulatory proteins to DNA, chromatin structure plays a role in modulating gene expression. Small RNAs are emerging as key components in this process. This chapter provides an introduction to some of the basic players participating in these processes, the transcription factors and co-regulators, the cis-regulatory elements that often function as transcription factor docking sites, and the emerging role of small RNAs in the regulation of gene expression.
Collapse
Affiliation(s)
- Alper Yilmaz
- Plant Biotechnology Center and Department of Plant Cellular and Molecular Biology, The Ohio State University, Columbus, OH, USA.
| | | |
Collapse
|
23
|
Deplancke B. Experimental advances in the characterization of metazoan gene regulatory networks. BRIEFINGS IN FUNCTIONAL GENOMICS AND PROTEOMICS 2009; 8:12-27. [PMID: 19324929 DOI: 10.1093/bfgp/elp001] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/26/2022]
Abstract
Gene regulatory networks (GRNs) play a vital role in metazoan development and function, and deregulation of these networks is often implicated in disease. GRNs depict the dynamic interactions between genomic and regulatory state components. The genomic components comprise genes and their associated cis-regulatory elements. The regulatory state components consist primarily of transcriptional complexes that bind the latter elements. With the availability of complete genome sequences, several approaches have recently been developed which promise to significantly enhance our ability to identify either the genomic or regulatory state components, or the interactions between these two. In this review, I provide an in-depth overview of these approaches and detail how each contributes to a more comprehensive understanding of GRN composition and function.
Collapse
Affiliation(s)
- Bart Deplancke
- Ecole Polytechnique Fédérale de Lausanne, School of Life Sciences, Institute of Bioengineering, Lausanne, Switzerland.
| |
Collapse
|
24
|
Fredman D, Engstrom PG, Lenhard B. Web-based tools and approaches to study long-range gene regulation in Metazoa. BRIEFINGS IN FUNCTIONAL GENOMICS AND PROTEOMICS 2009; 8:231-42. [DOI: 10.1093/bfgp/elp023] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
|
25
|
Fulton DL, Sundararajan S, Badis G, Hughes TR, Wasserman WW, Roach JC, Sladek R. TFCat: the curated catalog of mouse and human transcription factors. Genome Biol 2009; 10:R29. [PMID: 19284633 PMCID: PMC2691000 DOI: 10.1186/gb-2009-10-3-r29] [Citation(s) in RCA: 153] [Impact Index Per Article: 10.2] [Reference Citation Analysis] [Abstract] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2008] [Revised: 02/26/2009] [Accepted: 03/12/2009] [Indexed: 11/20/2022] Open
Abstract
TFCat is a catalog of mouse and human transcription factors based on a reliable core collection of annotations obtained by expert review of the scientific literature Unravelling regulatory programs governed by transcription factors (TFs) is fundamental to understanding biological systems. TFCat is a catalog of mouse and human TFs based on a reliable core collection of annotations obtained by expert review of the scientific literature. The collection, including proven and homology-based candidate TFs, is annotated within a function-based taxonomy and DNA-binding proteins are organized within a classification system. All data and user-feedback mechanisms are available at the TFCat portal .
Collapse
Affiliation(s)
- Debra L Fulton
- Department of Medical Genetics, Centre for Molecular Medicine and Therapeutics, Child and Family Research Institute, University of British Columbia, Vancouver, Canada.
| | | | | | | | | | | | | |
Collapse
|
26
|
Xie HB, Irwin DM, Zhang YP. Evolution of conserved secondary structures and their function in transcriptional regulation networks. BMC Genomics 2008; 9:520. [PMID: 18976501 PMCID: PMC2584662 DOI: 10.1186/1471-2164-9-520] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2008] [Accepted: 11/02/2008] [Indexed: 12/12/2022] Open
Abstract
Background Many conserved secondary structures have been identified within conserved elements in the human genome, but only a small fraction of them are known to be functional RNAs. The evolutionary variations of these conserved secondary structures in human populations and their biological functions have not been fully studied. Results We searched for polymorphisms within conserved secondary structures and identified a number of SNPs within these elements even though they are highly conserved among species. The density of SNPs in conserved secondary structures is about 65% of that of their flanking, non-conserved, sequences. Classification of sites as stems or as loops/bulges revealed that the density of SNPs in stems is about 62% of that found in loops/bulges. Analysis of derived allele frequency data indicates that sites in stems are under stronger evolutionary constraint than sites in loops/bulges. Intergenic conserved secondary structures tend to associate with transcription factor-encoding genes with genetic distance being the measure of regulator-gene associations. A substantial fraction of intergenic conserved secondary structures overlap characterized binding sites for multiple transcription factors. Conclusion Strong purifying selection implies that secondary structures are probably important carriers of biological functions for conserved sequences. The overlap between intergenic conserved secondary structures and transcription factor binding sites further suggests that intergenic conserved secondary structures have essential roles in directing gene expression in transcriptional regulation networks.
Collapse
Affiliation(s)
- Hai-Bing Xie
- State Key Laboratory of Genetic Resource and Evolution, Kunming Institute of Zoology, Kunming 650223, PR China.
| | | | | |
Collapse
|
27
|
Elgar G, Vavouri T. Tuning in to the signals: noncoding sequence conservation in vertebrate genomes. Trends Genet 2008; 24:344-52. [PMID: 18514361 DOI: 10.1016/j.tig.2008.04.005] [Citation(s) in RCA: 129] [Impact Index Per Article: 8.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2008] [Revised: 04/14/2008] [Accepted: 04/14/2008] [Indexed: 01/25/2023]
|