1
|
Pavlu S, Nikumbh S, Kovacik M, An T, Lenhard B, Simkova H, Navratilova P. Core promoterome of barley embryo. Comput Struct Biotechnol J 2024; 23:264-277. [PMID: 38173877 PMCID: PMC10762323 DOI: 10.1016/j.csbj.2023.12.003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2023] [Revised: 12/01/2023] [Accepted: 12/02/2023] [Indexed: 01/05/2024] Open
Abstract
Precise localization and dissection of gene promoters are key to understanding transcriptional gene regulation and to successful bioengineering applications. The core RNA polymerase II initiation machinery is highly conserved among eukaryotes, leading to a general expectation of equivalent underlying mechanisms. Still, less is known about promoters in the plant kingdom. In this study, we employed cap analysis of gene expression (CAGE) at three embryonic developmental stages in barley to accurately map, annotate, and quantify transcription initiation events. Unsupervised discovery of de novo sequence clusters grouped promoters based on characteristic initiator and position-specific core-promoter motifs. This grouping was complemented by the annotation of transcription factor binding site (TFBS) motifs. Integration with genome-wide epigenomic data sets and gene ontology (GO) enrichment analysis further delineated the chromatin environments and functional roles of genes associated with distinct promoter categories. The TATA-box presence governs all features explored, supporting the general model of two separate genomic regulatory environments. We describe the extent and implications of alternative transcription initiation events, including those that are specific to developmental stages, which can affect the protein sequence or the presence of regions that regulate translation. The generated promoterome dataset provides a valuable genomic resource for enhancing the functional annotation of the barley genome. It also offers insights into the transcriptional regulation of individual genes and presents opportunities for the informed manipulation of promoter architecture, with the aim of enhancing traits of agronomic importance.
Collapse
Affiliation(s)
- Simon Pavlu
- Institute of Experimental Botany of the Czech Academy of Sciences, Slechtitelu 31, 77900 Olomouc, Czech Republic
- Department of Cell Biology and Genetics, Faculty of Science, Palacky University, Slechtitelu 27, 78371 Olomouc, Czech Republic
| | - Sarvesh Nikumbh
- Merck Sharp & Dohme (UK) Limited, 120 Moorgate, London EC2M 6UR, UK
| | - Martin Kovacik
- Institute of Experimental Botany of the Czech Academy of Sciences, Slechtitelu 31, 77900 Olomouc, Czech Republic
- Department of Cell Biology and Genetics, Faculty of Science, Palacky University, Slechtitelu 27, 78371 Olomouc, Czech Republic
| | - Tadaichi An
- DNAFORM Precision Gene Technologies, 230–0046 Yokohama, Kanagawa, Japan
| | - Boris Lenhard
- Computational Regulatory Genomics, MRC London Institute of Medical Sciences, London, UK
- Institute of Clinical Sciences, Faculty of Medicine, Imperial College London, Hammersmith Hospital Campus, London, UK
| | - Hana Simkova
- Institute of Experimental Botany of the Czech Academy of Sciences, Slechtitelu 31, 77900 Olomouc, Czech Republic
| | - Pavla Navratilova
- Institute of Experimental Botany of the Czech Academy of Sciences, Slechtitelu 31, 77900 Olomouc, Czech Republic
| |
Collapse
|
2
|
Peng B, Sun G, Fan Y. iProL: identifying DNA promoters from sequence information based on Longformer pre-trained model. BMC Bioinformatics 2024; 25:224. [PMID: 38918692 PMCID: PMC11201334 DOI: 10.1186/s12859-024-05849-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2024] [Accepted: 06/19/2024] [Indexed: 06/27/2024] Open
Abstract
Promoters are essential elements of DNA sequence, usually located in the immediate region of the gene transcription start sites, and play a critical role in the regulation of gene transcription. Its importance in molecular biology and genetics has attracted the research interest of researchers, and it has become a consensus to seek a computational method to efficiently identify promoters. Still, existing methods suffer from imbalanced recognition capabilities for positive and negative samples, and their recognition effect can still be further improved. We conducted research on E. coli promoters and proposed a more advanced prediction model, iProL, based on the Longformer pre-trained model in the field of natural language processing. iProL does not rely on prior biological knowledge but simply uses promoter DNA sequences as plain text to identify promoters. It also combines one-dimensional convolutional neural networks and bidirectional long short-term memory to extract both local and global features. Experimental results show that iProL has a more balanced and superior performance than currently published methods. Additionally, we constructed a novel independent test set following the previous specification and compared iProL with three existing methods on this independent test set.
Collapse
Affiliation(s)
- Binchao Peng
- School of Computer Science and Information Security, Guilin University of Electronic Technology, Guilin, 541004, China
| | - Guicong Sun
- School of Computer Science and Information Security, Guilin University of Electronic Technology, Guilin, 541004, China
| | - Yongxian Fan
- School of Computer Science and Information Security, Guilin University of Electronic Technology, Guilin, 541004, China.
| |
Collapse
|
3
|
Wudarski J, Aliabadi S, Gulia-Nuss M. Arthropod promoters for genetic control of disease vectors. Trends Parasitol 2024:S1471-4922(24)00093-X. [PMID: 38824066 DOI: 10.1016/j.pt.2024.04.011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2024] [Revised: 04/15/2024] [Accepted: 04/15/2024] [Indexed: 06/03/2024]
Abstract
Vector-borne diseases (VBDs) impose devastating effects on human health and a heavy financial burden. Malaria, Lyme disease, and dengue fever are just a few examples of VBDs that cause severe illnesses. The current strategies to control VBDs consist mainly of environmental modification and chemical use, and to a small extent, genetic approaches. The genetic approaches, including transgenesis/genome modification and gene-drive technologies, provide the basis for developing new tools for VBD prevention by suppressing vector populations or reducing their capacity to transmit pathogens. The regulatory elements such as promoters are required for a robust sex-, tissue-, and stage-specific transgene expression. As discussed in this review, information on the regulatory elements is available for mosquito vectors but is scant for other vectors.
Collapse
Affiliation(s)
- Jakub Wudarski
- Department of Biochemistry and Molecular Biology, University of Nevada, Reno, NV, USA
| | - Simindokht Aliabadi
- Department of Biochemistry and Molecular Biology, University of Nevada, Reno, NV, USA
| | - Monika Gulia-Nuss
- Department of Biochemistry and Molecular Biology, University of Nevada, Reno, NV, USA.
| |
Collapse
|
4
|
Khamwachirapithak P, Guillaume-Schoepfer D, Chansongkrow P, Teichmann SA, Wigge PA, Charoensawan V. Characterizing Different Modes of Interplay Between Rap1 and H3 Using Inducible H3-depletion Yeast. J Mol Biol 2023; 435:168355. [PMID: 37935256 DOI: 10.1016/j.jmb.2023.168355] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2023] [Revised: 10/31/2023] [Accepted: 11/01/2023] [Indexed: 11/09/2023]
Abstract
Histones and transcription factors (TFs) are two important DNA-binding proteins that interact, compete, and together regulate transcriptional processes in response to diverse internal and external stimuli. Condition-specific depletion of histones in Saccharomyces cerevisiae using a galactose-inducible H3 promoter provides a suitable framework for examining transcriptional alteration resulting from reduced nucleosome content. However, the effect on DNA binding activities of TFs is yet to be fully explored. In this work, we combine ChIP-seq of H3 with RNA-seq to elucidate the genome-scale relationships between H3 occupancy patterns and transcriptional dynamics before and after global H3 depletion. ChIP-seq of Rap1 is also conducted in the H3-depletion and control treatments, to investigate the interplay between this master regulator TF and nucleosomal H3, and to explore the impact on diverse transcriptional responses of different groups of target genes and functions. Ultimately, we propose a working model and testable hypotheses regarding the impact of global and local H3 depletion on transcriptional changes. We also demonstrate different potential modes of interaction between Rap1 and H3, which sheds light on the potential multifunctional regulatory capabilities of Rap1 and potentially other pioneer factors.
Collapse
Affiliation(s)
- Peerapat Khamwachirapithak
- Doctor of Philosophy Program in Biochemistry (International Program), Faculty of Science, Mahidol University, Bangkok, Thailand; Department of Biochemistry, Faculty of Science, Mahidol University, Bangkok, Thailand
| | | | - Pakkanan Chansongkrow
- Department of Biochemistry, Faculty of Science, Mahidol University, Bangkok, Thailand
| | - Sarah A Teichmann
- Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge, UK; Department of Physics, Cavendish Laboratory, University of Cambridge, Cambridge, UK.
| | - Philip A Wigge
- Sainsbury Laboratory, University of Cambridge, Cambridge, United Kingdom; University Potsdam, Institute for Biochemistry and Biology, Molecular Biology, Karl-Liebknecht-Str, Potsdam-Golm, Germany; Institute of Biochemistry and Biology, University of Potsdam, Potsdam-Golm, Germany.
| | - Varodom Charoensawan
- Department of Biochemistry, Faculty of Science, Mahidol University, Bangkok, Thailand; Sainsbury Laboratory, University of Cambridge, Cambridge, United Kingdom; Systems Biology of Diseases Research Unit, Faculty of Science, Mahidol University, Bangkok, Thailand; Integrative Computational BioScience (ICBS) center, Mahidol University, Nakhon Pathom, Thailand; School of Chemistry, Institute of Science, Suranaree University of Technology, Nakhon Ratchasima, Thailand.
| |
Collapse
|
5
|
Arenas-Mena C, Akin S. Widespread priming of transcriptional regulatory elements by incipient accessibility or RNA polymerase II pause in early embryos of the sea urchin Strongylocentrotus purpuratus. Genetics 2023; 225:iyad145. [PMID: 37551428 PMCID: PMC10789315 DOI: 10.1093/genetics/iyad145] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2023] [Revised: 05/17/2023] [Accepted: 07/25/2023] [Indexed: 08/09/2023] Open
Abstract
Transcriptional regulatory elements (TREs) are the primary nodes that control developmental gene regulatory networks. In embryo stages, larvae, and adult differentiated red spherule cells of the sea urchin Strongylocentrotus purpuratus, transcriptionally engaged TREs are detected by Precision Run-On Sequencing (PRO-seq), which maps genome-wide at base pair resolution the location of paused or elongating RNA polymerase II (Pol II). In parallel, TRE accessibility is estimated by the Assay for Transposase-Accessible Chromatin using Sequencing (ATAC-seq). Our analysis identifies surprisingly early and widespread TRE accessibility in 4-cell cleavage embryos that is not necessarily followed by concurrent or subsequent transcription. TRE transcriptional differences identified by PRO-seq provide more contrast among embryonic stages than ATAC-seq accessibility differences, in agreement with the apparent excess of accessible but inactive TREs during embryogenesis. Global TRE accessibility reaches a maximum around the 20-hour late blastula stage, which coincides with the consolidation of major embryo regionalizations and peak histone variant H2A.Z expression. A transcriptional potency model based on labile nucleosome TRE occupancy driven by DNA sequences and the prevalence of histone variants is proposed in order to explain the basal accessibility of transcriptionally inactive TREs during embryogenesis. However, our results would not reconcile well with labile nucleosome models based on simple A/T sequence enrichment. In addition, a large number of distal TREs become transcriptionally disengaged during developmental progression, in support of an early Pol II paused model for developmental gene regulation that eventually resolves in transcriptional activation or silencing. Thus, developmental potency in early embryos may be facilitated by incipient accessibility and transcriptional pause at TREs.
Collapse
Affiliation(s)
- Cesar Arenas-Mena
- Department of Biology, College of Staten Island, City University of New York (CUNY), 2800 Victory Boulevard, Staten Island, NY, 10314, USA
- PhD Programs in Biology and Biochemistry at the City University of New York (CUNY), Graduate Center, 365 Fifth Avenue, New York, NY, 10016, USA
| | - Serhat Akin
- Department of Biology, College of Staten Island, City University of New York (CUNY), 2800 Victory Boulevard, Staten Island, NY, 10314, USA
- PhD Program in Biology at the City University of New York (CUNY), Graduate Center, 365 Fifth Avenue, New York, NY, 10016, USA
| |
Collapse
|
6
|
Liu X, Teng L, Luo Y, Xu Y. Prediction of prokaryotic and eukaryotic promoters based on information-theoretic features. Biosystems 2023; 231:104979. [PMID: 37423595 DOI: 10.1016/j.biosystems.2023.104979] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2022] [Revised: 07/06/2023] [Accepted: 07/07/2023] [Indexed: 07/11/2023]
Abstract
Promoters are DNA regulatory elements located near the transcription start site and are responsible for regulating the transcription of genes. DNA fragments arranged in a certain order form specific functional regions with different information contents. Information theory is the science that studies the extraction, measurement and transmission of information. The genetic information contained in DNA follows the general laws of information storage. Therefore, method in information theory can be used for the analysis of promoters carrying genetic information. In this study, we introduced the concept of information theory to the study of promoter prediction. We used 107 features extracted based on information theory methods and a backpropagation neural network to build a classifier. Then, the trained classifier was applied to predict the promoters of 6 organisms. The average AUCs of the 6 organisms obtained by using hold-out validation and ten-fold cross-validation were 0.885 and 0.886, respectively. The results verified the effectiveness of information-theoretic features in promoter prediction. Considering the possible redundancy in the feature set, we performed feature selection and obtained key feature subsets related to promoter characteristics. The results indicate the potential utility of information-theoretic features in promoter prediction.
Collapse
Affiliation(s)
- Xiao Liu
- School of Microelectronics and Communication Engineering, Chongqing University, 174 ShaPingBa District, Chongqing, 400044, China.
| | - Li Teng
- School of Microelectronics and Communication Engineering, Chongqing University, 174 ShaPingBa District, Chongqing, 400044, China
| | - Yachuan Luo
- School of Microelectronics and Communication Engineering, Chongqing University, 174 ShaPingBa District, Chongqing, 400044, China
| | - Yuqiao Xu
- School of Microelectronics and Communication Engineering, Chongqing University, 174 ShaPingBa District, Chongqing, 400044, China
| |
Collapse
|
7
|
Sui Z, Zhang Y, Zhang Z, Wang C, Li X, Xing F, Chu M. Analysis of Lin28B Promoter Activity and Screening of Related Transcription Factors in Dolang Sheep. Genes (Basel) 2023; 14:genes14051049. [PMID: 37239408 DOI: 10.3390/genes14051049] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2023] [Revised: 05/01/2023] [Accepted: 05/05/2023] [Indexed: 05/28/2023] Open
Abstract
The Lin28B gene is involved in the initiation of puberty, but its regulatory mechanisms remain unclear. Therefore, in this study, we aimed to study the regulatory mechanism of the Lin28B promoter by cloning the Lin28B proximal promoter for bioinformatic analysis. Next, a series of deletion vectors were constructed based on the bioinformatic analysis results for dual-fluorescein activity detection. The transcriptional regulation mechanism of the Lin28B promoter region was analyzed by detecting mutations in transcription factor-binding sites and overexpression of transcription factors. The dual-luciferase assay showed that the Lin28B promoter region -837 to -338 bp had the highest transcriptional activity, and the transcriptional activity of the Lin28B transcriptional regulatory region decreased significantly after Egr1 and SP1 mutations. Overexpression of the Egr1 transcription factor significantly enhanced the transcription of Lin28B, and the results indicated that Egr1 and SP1 play important roles in regulating Lin28B. These results provide a theoretical basis for further research on the transcriptional regulation of sheep Lin28B during puberty initiation.
Collapse
Affiliation(s)
- Zhiyuan Sui
- Key Laboratory of Tarim Animal Husbandry Science and Technology, Xinjiang Production and Construction Group, Alar 843300, China
- College of Animal Science and Technology, Tarim University, Alar 843300, China
| | - Yongjie Zhang
- Key Laboratory of Tarim Animal Husbandry Science and Technology, Xinjiang Production and Construction Group, Alar 843300, China
- College of Animal Science and Technology, Tarim University, Alar 843300, China
| | - Zhishuai Zhang
- Key Laboratory of Tarim Animal Husbandry Science and Technology, Xinjiang Production and Construction Group, Alar 843300, China
- College of Animal Science and Technology, Tarim University, Alar 843300, China
| | - Chenguang Wang
- Key Laboratory of Tarim Animal Husbandry Science and Technology, Xinjiang Production and Construction Group, Alar 843300, China
- College of Animal Science and Technology, Tarim University, Alar 843300, China
| | - Xiaojun Li
- Key Laboratory of Tarim Animal Husbandry Science and Technology, Xinjiang Production and Construction Group, Alar 843300, China
- College of Animal Science and Technology, Tarim University, Alar 843300, China
| | - Feng Xing
- Key Laboratory of Tarim Animal Husbandry Science and Technology, Xinjiang Production and Construction Group, Alar 843300, China
- College of Animal Science and Technology, Tarim University, Alar 843300, China
| | - Mingxing Chu
- Key Laboratory of Animal Genetics, Breeding and Reproduction of Ministry of Agriculture and Rural Affairs, Institute of Animal Science, Chinese Academy of Agricultural Sciences, Beijing 100193, China
| |
Collapse
|
8
|
Identification and Functional Analysis of the Promoter of a Leucoanthocyanidin Reductase Gene from Gossypium hirsutum. Mol Biotechnol 2023; 65:645-654. [PMID: 36155889 DOI: 10.1007/s12033-022-00571-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2022] [Accepted: 09/15/2022] [Indexed: 10/14/2022]
Abstract
Leucoanthocyanidin reductase (LAR) is the critical enzyme in the synthesis pathway of proanthocyanidins, which are the primary pigments in brown cotton fibers. Our previous study has revealed significant differences in the expression levels of GhLAR1 between white and brown cotton fibers at 10 DPA. In this work, the expression pattern of the GhLAR1 gene was further studied, and the promoter of GhLAR1 (1780 bp) was isolated and characterized. Bioinformatic analysis indicated that GhLAR1 promoter contained many known light response elements and several defenses related to transcriptional factor-binding boxes, which may partially explain the response of the GhLAR1 to temperature, NaCl, and PEG treatments. Furthermore, GhLAR1 was preferentially and strongly expressed in fibers and flowers of cotton, and the expression levels in all tested tissues (especially fibers) of brown cotton were significantly higher than those in white cotton. Consistent with the expression analysis, the GhLAR1 promoter mainly drove GUS expression in epidermal trichomes and floral organs.
Collapse
|
9
|
Ji X, Cai J, Liang L, Shi T, Liu J. Gene expression variability across cells and species shapes the relationship between renal resident macrophages and infiltrated macrophages. BMC Bioinformatics 2023; 24:72. [PMID: 36858955 PMCID: PMC9976410 DOI: 10.1186/s12859-023-05198-z] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2022] [Accepted: 02/21/2023] [Indexed: 03/03/2023] Open
Abstract
BACKGROUND Two main subclasses of macrophages are found in almost all solid tissues: embryo-derived resident tissue macrophages and bone marrow-derived infiltrated macrophages. These macrophage subtypes show transcriptional and functional divergence, and the programs that have shaped the evolution of renal macrophages and related signaling pathways remain poorly understood. To clarify these processes, we performed data analysis based on single-cell transcriptional profiling of renal tissue-resident and infiltrated macrophages in human, mouse and rat. RESULTS In this study, we (i) characterized the transcriptional divergence among species and (ii) illustrated variability in expression among cells of each subtype and (iii) compared the gene regulation network and (iv) ligand-receptor pairs in human and mouse. Using single-cell transcriptomics, we mapped the promoter architecture during homeostasis. CONCLUSIONS Transcriptionally divergent genes, such as the differentially TF-encoding genes expressed in resident and infiltrated macrophages across the three species, vary among cells and include distinct promoter structures. The gene regulatory network in infiltrated macrophages shows comparatively better species-wide consistency than resident macrophages. The conserved transcriptional gene regulatory network in infiltrated macrophages among species is uniquely enriched in pathways related to kinases, and TFs associated with largely conserved regulons among species are uniquely enriched in kinase-related pathways.
Collapse
Affiliation(s)
- Xiangjun Ji
- grid.284723.80000 0000 8877 7471Guangdong Provincial Key Laboratory of Proteomics, Department of Pathophysiology, School of Basic Medical Sciences, Southern Medical University, Guangzhou, 510515 China
| | - Junwei Cai
- grid.284723.80000 0000 8877 7471Guangdong Provincial Key Laboratory of Proteomics, Department of Pathophysiology, School of Basic Medical Sciences, Southern Medical University, Guangzhou, 510515 China
| | - Lixin Liang
- grid.284723.80000 0000 8877 7471Guangdong Provincial Key Laboratory of Proteomics, Department of Pathophysiology, School of Basic Medical Sciences, Southern Medical University, Guangzhou, 510515 China
| | - Tieliu Shi
- Center for Bioinformatics and Computational Biology, The Institute of Biomedical Sciences and School of Life Sciences, East China Normal University, Shanghai, 200241, China. .,Beijing Advanced Innovation Center, for Big Data-Based Precision Medicine, Beihang University and Capital Medical University, Beijing, 100083, China.
| | - Jinghua Liu
- Guangdong Provincial Key Laboratory of Proteomics, Department of Pathophysiology, School of Basic Medical Sciences, Southern Medical University, Guangzhou, 510515, China.
| |
Collapse
|
10
|
Xiang G, Huang L, Zhang X, Wang N, Wang H, Mu Y, Li K, Liu Z. Molecular Characteristics and Promoter Analysis of Porcine COL1A1. Genes (Basel) 2022; 13:1971. [PMID: 36360208 PMCID: PMC9689670 DOI: 10.3390/genes13111971] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2022] [Revised: 10/20/2022] [Accepted: 10/24/2022] [Indexed: 03/25/2024] Open
Abstract
COL1A1 encodes the type I collagen α1 chain, which shows the highest abundance among members of the collagen family and is widely expressed in different mammalian cells and tissues. However, its molecular characteristics are not completely elucidated. In this study, the molecular profiles of COL1A1 and characteristics of the COL1A1 protein were investigated using a promoter activity assay and multiple bioinformatics tools. The results showed that the 5' flanking region of porcine COL1A1 contained two CpG islands, five core promoter sequences, and twenty-six transcription factor-binding sites. In the luciferase assay, the upstream 294 bp region of the initiation codon of COL1A1 showed the highest activity, confirming that this section is the core region of the porcine COL1A1 promoter. Bioinformatic analysis revealed that COL1A1 is a negatively charged, hydrophilic secreted protein. It does not contain a transmembrane domain and is highly conserved in humans, mice, sheep, and pigs. Protein interaction analysis demonstrated that the interaction coefficient of COL1A1 with COL1A2, COL3A1, ITGB1, and ITGA2 was greater than 0.9, suggesting that this protein plays a crucial role in collagen structure formation and cell adhesion. These results provide a theoretical basis for further investigation of the functions of porcine COL1A1.
Collapse
Affiliation(s)
- Guangming Xiang
- Key Laboratory of Animal Genetics Breeding and Reproduction of Ministry of Agriculture and Rural Affairs of China, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing 100193, China
| | - Lei Huang
- Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518124, China
| | - Xiuling Zhang
- Key Laboratory of Animal Genetics Breeding and Reproduction of Ministry of Agriculture and Rural Affairs of China, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing 100193, China
| | - Nan Wang
- Key Laboratory of Animal Genetics Breeding and Reproduction of Ministry of Agriculture and Rural Affairs of China, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing 100193, China
| | - Hui Wang
- Key Laboratory of Animal Genetics Breeding and Reproduction of Ministry of Agriculture and Rural Affairs of China, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing 100193, China
| | - Yulian Mu
- Key Laboratory of Animal Genetics Breeding and Reproduction of Ministry of Agriculture and Rural Affairs of China, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing 100193, China
| | - Kui Li
- Key Laboratory of Animal Genetics Breeding and Reproduction of Ministry of Agriculture and Rural Affairs of China, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing 100193, China
- Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518124, China
| | - Zhiguo Liu
- Key Laboratory of Animal Genetics Breeding and Reproduction of Ministry of Agriculture and Rural Affairs of China, Institute of Animal Sciences, Chinese Academy of Agricultural Sciences, Beijing 100193, China
| |
Collapse
|
11
|
Nguyen-Vo TH, Trinh QH, Nguyen L, Nguyen-Hoang PU, Rahardja S, Nguyen BP. iPromoter-Seqvec: identifying promoters using bidirectional long short-term memory and sequence-embedded features. BMC Genomics 2022; 23:681. [PMID: 36192696 PMCID: PMC9531353 DOI: 10.1186/s12864-022-08829-6] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2022] [Accepted: 08/08/2022] [Indexed: 11/30/2022] Open
Abstract
Background Promoters, non-coding DNA sequences located at upstream regions of the transcription start site of genes/gene clusters, are essential regulatory elements for the initiation and regulation of transcriptional processes. Furthermore, identifying promoters in DNA sequences and genomes significantly contributes to discovering entire structures of genes of interest. Therefore, exploration of promoter regions is one of the most imperative topics in molecular genetics and biology. Besides experimental techniques, computational methods have been developed to predict promoters. In this study, we propose iPromoter-Seqvec – an efficient computational model to predict TATA and non-TATA promoters in human and mouse genomes using bidirectional long short-term memory neural networks in combination with sequence-embedded features extracted from input sequences. The promoter and non-promoter sequences were retrieved from the Eukaryotic Promoter database and then were refined to create four benchmark datasets. Results The area under the receiver operating characteristic curve (AUCROC) and the area under the precision-recall curve (AUCPR) were used as two key metrics to evaluate model performance. Results on independent test sets showed that iPromoter-Seqvec outperformed other state-of-the-art methods with AUCROC values ranging from 0.85 to 0.99 and AUCPR values ranging from 0.86 to 0.99. Models predicting TATA promoters in both species had slightly higher predictive power compared to those predicting non-TATA promoters. With a novel idea of constructing artificial non-promoter sequences based on promoter sequences, our models were able to learn highly specific characteristics discriminating promoters from non-promoters to improve predictive efficiency. Conclusions iPromoter-Seqvec is a stable and robust model for predicting both TATA and non-TATA promoters in human and mouse genomes. Our proposed method was also deployed as an online web server with a user-friendly interface to support research communities. Links to our source codes and web server are available at https://github.com/mldlproject/2022-iPromoter-Seqvec. Supplementary Information The online version contains supplementary material available at 10.1186/s12864-022-08829-6.
Collapse
Affiliation(s)
- Thanh-Hoang Nguyen-Vo
- School of Mathematics and Statistics, Victoria University of Wellington, Gate 7, Kelburn Parade, 6140, Wellington, New Zealand
| | - Quang H Trinh
- School of Information and Communication Technology, Hanoi University of Science and Technology, 1 Dai Co Viet, 100000, Hanoi, Vietnam
| | - Loc Nguyen
- School of Mathematics and Statistics, Victoria University of Wellington, Gate 7, Kelburn Parade, 6140, Wellington, New Zealand
| | - Phuong-Uyen Nguyen-Hoang
- Computational Biology Center, International University - VNU HCMC, Quarter 6, Linh Trung Ward, Thu Duc District, 700000, Ho Chi Minh City, Vietnam
| | - Susanto Rahardja
- School of Marine Science and Technology, Northwestern Polytechnical University, 127 West Youyi Road, 710072, Xi'an, China. .,Infocomm Technology Cluster, Singapore Institute of Technology, 10 Dover Drive, 138683, Singapore, Singapore.
| | - Binh P Nguyen
- School of Mathematics and Statistics, Victoria University of Wellington, Gate 7, Kelburn Parade, 6140, Wellington, New Zealand.
| |
Collapse
|
12
|
Abstract
Enhancers control the establishment of spatiotemporal gene expression patterns throughout development. Over the past decade, the development of new technologies has improved our capacity to link enhancers with their target genes based on their colocalization within the same topological domains. However, the mechanisms that regulate how enhancers specifically activate some genes but not others within a given domain remain unclear. In this Review, we discuss recent insights into the factors controlling enhancer specificity, including the genetic composition of enhancers and promoters, the linear and 3D distance between enhancers and their target genes, and cell-type specific chromatin landscapes. We also discuss how elucidating the molecular principles of enhancer specificity might help us to better understand and predict the pathological consequences of human genetic, epigenetic and structural variants.
Collapse
Affiliation(s)
- Tomás Pachano
- Institute of Biomedicine and Biotechnology of Cantabria (IBBTEC), CSIC/Universidad de Cantabria/SODERCAN, Albert Einstein 22, 39011 Santander, Spain
| | - Endika Haro
- Institute of Biomedicine and Biotechnology of Cantabria (IBBTEC), CSIC/Universidad de Cantabria/SODERCAN, Albert Einstein 22, 39011 Santander, Spain
| | - Alvaro Rada-Iglesias
- Institute of Biomedicine and Biotechnology of Cantabria (IBBTEC), CSIC/Universidad de Cantabria/SODERCAN, Albert Einstein 22, 39011 Santander, Spain
| |
Collapse
|
13
|
Encyclopaedia of eukaryotic DNA methylation: from patterns to mechanisms and functions. Biochem Soc Trans 2022; 50:1179-1190. [PMID: 35521905 PMCID: PMC9246332 DOI: 10.1042/bst20210725] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2022] [Revised: 04/08/2022] [Accepted: 04/11/2022] [Indexed: 12/14/2022]
Abstract
DNA methylation is an epigenetic modification with a very long evolutionary history. However, DNA methylation evolves surprisingly rapidly across eukaryotes. The genome-wide distribution of methylation diversifies rapidly in different lineages, and DNA methylation is lost altogether surprisingly frequently. The growing availability of genomic and epigenomic sequencing across organisms highlights this diversity but also illuminates potential factors that could explain why both the DNA methylation machinery and its genome-wide distribution evolve so rapidly. Key to this are new discoveries about the fitness costs associated with DNA methylation, and new theories about how the fundamental biochemical mechanisms of DNA methylation introduction and maintenance could explain how new genome-wide patterns of methylation evolve.
Collapse
|
14
|
Maderazo D, Flegg JA, Algama M, Ramialison M, Keith J. Detection and identification of cis-regulatory elements using change-point and classification algorithms. BMC Genomics 2022; 23:78. [PMID: 35078412 PMCID: PMC8790847 DOI: 10.1186/s12864-021-08190-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2021] [Accepted: 11/19/2021] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Transcriptional regulation is primarily mediated by the binding of factors to non-coding regions in DNA. Identification of these binding regions enhances understanding of tissue formation and potentially facilitates the development of gene therapies. However, successful identification of binding regions is made difficult by the lack of a universal biological code for their characterisation. RESULTS We extend an alignment-based method, changept, and identify clusters of biological significance, through ontology and de novo motif analysis. Further, we apply a Bayesian method to estimate and combine binary classifiers on the clusters we identify to produce a better performing composite. CONCLUSIONS The analysis we describe provides a computational method for identification of conserved binding sites in the human genome and facilitates an alternative interrogation of combinations of existing data sets with alignment data.
Collapse
Affiliation(s)
- Dominic Maderazo
- School of Mathematics and Statistics, The University of Melbourne, Melbourne, 3010, VIC, Australia.
| | - Jennifer A Flegg
- School of Mathematics and Statistics, The University of Melbourne, Melbourne, 3010, VIC, Australia
| | - Manjula Algama
- School of Mathematics, Monash University, Melbourne, 3800, VIC, Australia
| | - Mirana Ramialison
- Australian Regenerative Medicine Institute, Monash University, Melbourne, 3800, VIC, Australia
| | - Jonathan Keith
- School of Mathematics, Monash University, Melbourne, 3800, VIC, Australia
| |
Collapse
|
15
|
Arenas-Mena C, Miljovska S, Rice EJ, Gurges J, Shashikant T, Wang Z, Ercan S, Danko CG. Identification and prediction of developmental enhancers in sea urchin embryos. BMC Genomics 2021; 22:751. [PMID: 34666684 PMCID: PMC8527612 DOI: 10.1186/s12864-021-07936-0] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2021] [Accepted: 06/28/2021] [Indexed: 11/21/2022] Open
Abstract
Background The transcription of developmental regulatory genes is often controlled by multiple cis-regulatory elements. The identification and functional characterization of distal regulatory elements remains challenging, even in tractable model organisms like sea urchins. Results We evaluate the use of chromatin accessibility, transcription and RNA Polymerase II for their ability to predict enhancer activity of genomic regions in sea urchin embryos. ATAC-seq, PRO-seq, and Pol II ChIP-seq from early and late blastula embryos are manually contrasted with experimental cis-regulatory analyses available in sea urchin embryos, with particular attention to common developmental regulatory elements known to have enhancer and silencer functions differentially deployed among embryonic territories. Using the three functional genomic data types, machine learning models are trained and tested to classify and quantitatively predict the enhancer activity of several hundred genomic regions previously validated with reporter constructs in vivo. Conclusions Overall, chromatin accessibility and transcription have substantial power for predicting enhancer activity. For promoter-overlapping cis-regulatory elements in particular, the distribution of Pol II is the best predictor of enhancer activity in blastula embryos. Furthermore, ATAC- and PRO-seq predictive value is stage dependent for the promoter-overlapping subset. This suggests that the sequence of regulatory mechanisms leading to transcriptional activation have distinct relevance at different levels of the developmental gene regulatory hierarchy deployed during embryogenesis. Supplementary Information The online version contains supplementary material available at 10.1186/s12864-021-07936-0.
Collapse
Affiliation(s)
- César Arenas-Mena
- College of Staten Island, The City University of New York (CUNY), Staten Island, NY, 10314, USA. .,Programs in Biology and Biochemistry, The Graduate Center, CUNY, New York, NY, 10016, USA.
| | - Sofija Miljovska
- Department of Biology, New York University, New York, NY, 10003, USA
| | - Edward J Rice
- Baker Institute for Animal Health, College of Veterinary Medicine, Cornell University, Ithaca, NY, 14853, USA
| | - Justin Gurges
- College of Staten Island, The City University of New York (CUNY), Staten Island, NY, 10314, USA
| | - Tanvi Shashikant
- Department of Biological Sciences, Carnegie Mellon University, Pittsburgh, PA, USA
| | - Zihe Wang
- College of Staten Island, The City University of New York (CUNY), Staten Island, NY, 10314, USA
| | - Sevinç Ercan
- Department of Biology, New York University, New York, NY, 10003, USA.,Center for Genomics and Systems Biology, New York University, New York, NY, 10003, USA
| | - Charles G Danko
- Baker Institute for Animal Health, College of Veterinary Medicine, Cornell University, Ithaca, NY, 14853, USA.,Department of Biomedical Sciences, College of Veterinary Medicine, Cornell University, Ithaca, NY, 14853, USA
| |
Collapse
|
16
|
Barnes T, Korber P. The Active Mechanism of Nucleosome Depletion by Poly(dA:dT) Tracts In Vivo. Int J Mol Sci 2021; 22:ijms22158233. [PMID: 34360997 PMCID: PMC8347975 DOI: 10.3390/ijms22158233] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2021] [Revised: 07/27/2021] [Accepted: 07/29/2021] [Indexed: 12/16/2022] Open
Abstract
Poly(dA:dT) tracts cause nucleosome depletion in many species, e.g., at promoters and replication origins. Their intrinsic biophysical sequence properties make them stiff and unfavorable for nucleosome assembly, as probed by in vitro nucleosome reconstitution. The mere correlation between nucleosome depletion over poly(dA:dT) tracts in in vitro reconstituted and in in vivo chromatin inspired an intrinsic nucleosome exclusion mechanism in vivo that is based only on DNA and histone properties. However, we compile here published and new evidence that this correlation does not reflect mechanistic causation. (1) Nucleosome depletion over poly(dA:dT) in vivo is not universal, e.g., very weak in S. pombe. (2) The energy penalty for incorporating poly(dA:dT) tracts into nucleosomes is modest (<10%) relative to ATP hydrolysis energy abundantly invested by chromatin remodelers. (3) Nucleosome depletion over poly(dA:dT) is much stronger in vivo than in vitro if monitored without MNase and (4) actively maintained in vivo. (5) S. cerevisiae promoters evolved a strand-biased poly(dA) versus poly(dT) distribution. (6) Nucleosome depletion over poly(dA) is directional in vivo. (7) The ATP dependent chromatin remodeler RSC preferentially and directionally displaces nucleosomes towards 5′ of poly(dA). Especially distribution strand bias and displacement directionality would not be expected for an intrinsic mechanism. Together, this argues for an in vivo mechanism where active and species-specific read out of intrinsic sequence properties, e.g., by remodelers, shapes nucleosome organization.
Collapse
|
17
|
Genome information processing by the INO80 chromatin remodeler positions nucleosomes. Nat Commun 2021; 12:3231. [PMID: 34050142 PMCID: PMC8163841 DOI: 10.1038/s41467-021-23016-z] [Citation(s) in RCA: 25] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2020] [Accepted: 04/07/2021] [Indexed: 11/26/2022] Open
Abstract
The fundamental molecular determinants by which ATP-dependent chromatin remodelers organize nucleosomes across eukaryotic genomes remain largely elusive. Here, chromatin reconstitutions on physiological, whole-genome templates reveal how remodelers read and translate genomic information into nucleosome positions. Using the yeast genome and the multi-subunit INO80 remodeler as a paradigm, we identify DNA shape/mechanics encoded signature motifs as sufficient for nucleosome positioning and distinct from known DNA sequence preferences of histones. INO80 processes such information through an allosteric interplay between its core- and Arp8-modules that probes mechanical properties of nucleosomal and linker DNA. At promoters, INO80 integrates this readout of DNA shape/mechanics with a readout of co-evolved sequence motifs via interaction with general regulatory factors bound to these motifs. Our findings establish a molecular mechanism for robust and yet adjustable +1 nucleosome positioning and, more generally, remodelers as information processing hubs that enable active organization and allosteric regulation of the first level of chromatin. DNA sequence preferences or statistical positioning of histones has not explained genomic patterns of nucleosome organisation in vivo. Here, the authors establish DNA shape/mechanics as key elements that have evolved together with binding sites of DNA sequence-specific barriers so that such information directs nucleosome positioning by chromatin remodelers.
Collapse
|
18
|
Serizay J, Dong Y, Jänes J, Chesney M, Cerrato C, Ahringer J. Distinctive regulatory architectures of germline-active and somatic genes in C. elegans. Genome Res 2020; 30:1752-1765. [PMID: 33093068 PMCID: PMC7706728 DOI: 10.1101/gr.265934.120] [Citation(s) in RCA: 26] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2020] [Accepted: 10/08/2020] [Indexed: 01/08/2023]
Abstract
RNA profiling has provided increasingly detailed knowledge of gene expression patterns, yet the different regulatory architectures that drive them are not well understood. To address this, we profiled and compared transcriptional and regulatory element activities across five tissues of Caenorhabditis elegans, covering ∼90% of cells. We find that the majority of promoters and enhancers have tissue-specific accessibility, and we discover regulatory grammars associated with ubiquitous, germline, and somatic tissue–specific gene expression patterns. In addition, we find that germline-active and soma-specific promoters have distinct features. Germline-active promoters have well-positioned +1 and −1 nucleosomes associated with a periodic 10-bp WW signal (W = A/T). Somatic tissue–specific promoters lack positioned nucleosomes and this signal, have wide nucleosome-depleted regions, and are more enriched for core promoter elements, which largely differ between tissues. We observe the 10-bp periodic WW signal at ubiquitous promoters in other animals, suggesting it is an ancient conserved signal. Our results show fundamental differences in regulatory architectures of germline and somatic tissue–specific genes, uncover regulatory rules for generating diverse gene expression patterns, and provide a tissue-specific resource for future studies.
Collapse
Affiliation(s)
- Jacques Serizay
- The Gurdon Institute and Department of Genetics, University of Cambridge, Cambridge CB2 1QN, United Kingdom
| | - Yan Dong
- The Gurdon Institute and Department of Genetics, University of Cambridge, Cambridge CB2 1QN, United Kingdom
| | - Jürgen Jänes
- The Gurdon Institute and Department of Genetics, University of Cambridge, Cambridge CB2 1QN, United Kingdom
| | - Michael Chesney
- The Gurdon Institute and Department of Genetics, University of Cambridge, Cambridge CB2 1QN, United Kingdom
| | - Chiara Cerrato
- The Gurdon Institute and Department of Genetics, University of Cambridge, Cambridge CB2 1QN, United Kingdom
| | - Julie Ahringer
- The Gurdon Institute and Department of Genetics, University of Cambridge, Cambridge CB2 1QN, United Kingdom
| |
Collapse
|
19
|
Lewis SH, Ross L, Bain SA, Pahita E, Smith SA, Cordaux R, Miska EA, Lenhard B, Jiggins FM, Sarkies P. ------Widespread conservation and lineage-specific diversification of genome-wide DNA methylation patterns across arthropods. PLoS Genet 2020; 16:e1008864. [PMID: 32584820 PMCID: PMC7343188 DOI: 10.1371/journal.pgen.1008864] [Citation(s) in RCA: 32] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2019] [Revised: 07/08/2020] [Accepted: 05/15/2020] [Indexed: 12/23/2022] Open
Abstract
Cytosine methylation is an ancient epigenetic modification yet its function and extent within genomes is highly variable across eukaryotes. In mammals, methylation controls transposable elements and regulates the promoters of genes. In insects, DNA methylation is generally restricted to a small subset of transcribed genes, with both intergenic regions and transposable elements (TEs) depleted of methylation. The evolutionary origin and the function of these methylation patterns are poorly understood. Here we characterise the evolution of DNA methylation across the arthropod phylum. While the common ancestor of the arthropods had low levels of TE methylation and did not methylate promoters, both of these functions have evolved independently in centipedes and mealybugs. In contrast, methylation of the exons of a subset of transcribed genes is ancestral and widely conserved across the phylum, but has been lost in specific lineages. A similar set of genes is methylated in all species that retained exon-enriched methylation. We show that these genes have characteristic patterns of expression correlating to broad transcription initiation sites and well-positioned nucleosomes, providing new insights into potential mechanisms driving methylation patterns over hundreds of millions of years.
Collapse
Affiliation(s)
- Samuel H. Lewis
- Department of Genetics, University of Cambridge, Cambridge, United Kingdom
- MRC London Institute of Medical Sciences, London, United Kingdom
- Institute of Clinical Sciences, Imperial College London, London, United Kingdom
| | - Laura Ross
- Institute of Evolutionary Biology, Edinburgh, United Kingdom
| | - Stevie A. Bain
- Institute of Evolutionary Biology, Edinburgh, United Kingdom
| | - Eleni Pahita
- MRC London Institute of Medical Sciences, London, United Kingdom
- Institute of Clinical Sciences, Imperial College London, London, United Kingdom
| | - Stephen A. Smith
- Department of Biomedical Sciences and Pathobiology, Virginia Maryland College of Veterinary Medicine, Virginia Tech, Blacksburg, Virginia, United States of America
| | - Richard Cordaux
- Laboratoire Ecologie et Biologie des Interactions Universite de Poitiers, France
| | - Eric A. Miska
- Department of Genetics, University of Cambridge, Cambridge, United Kingdom
- Wellcome Trust/Cancer Research UK Gurdon Institute, Cambridge, United Kingdom
| | - Boris Lenhard
- MRC London Institute of Medical Sciences, London, United Kingdom
- Institute of Clinical Sciences, Imperial College London, London, United Kingdom
- Sars International Centre for Marine Molecular Biology, University of Bergen, Bergen, Norway
| | - Francis M. Jiggins
- Department of Genetics, University of Cambridge, Cambridge, United Kingdom
| | - Peter Sarkies
- MRC London Institute of Medical Sciences, London, United Kingdom
- Institute of Clinical Sciences, Imperial College London, London, United Kingdom
| |
Collapse
|
20
|
Flickinger R. Polymorphism of simple sequence repeats may quantitatively regulate gene transcription. Exp Cell Res 2020; 390:111969. [PMID: 32199920 DOI: 10.1016/j.yexcr.2020.111969] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2019] [Revised: 02/15/2020] [Accepted: 03/17/2020] [Indexed: 02/07/2023]
Abstract
The degree of polymorphism, i.e., DNA sequence divergence, of short AT-rich tandemly arranged simple sequence repeats at or near promoters and 5'- untranslated regions of mRNA may quantitatively regulate transcription of tissue-specific genes. Less polymorphic repeats allow greater gene expression. Preferential binding of hypophosphorylated H1 histone to these repeats may diminish binding of transcription factors. Preferential binding of hypophosphorylated high mobility group chromatin proteins would increase this binding. Shorter simple sequence repeats have undergone fewer point mutations than longer repeats, hence they are less polymorphic and more conserved. The role of transcribed simple sequence repeats in frog embryo germ layer determination is considered.
Collapse
Affiliation(s)
- Reed Flickinger
- Department of Biological Sciences, State University of New York, Buffalo, N.Y. 14260, Mailing Address:P.O. Box 741 Captain Cook, HI, 96704, USA.
| |
Collapse
|
21
|
Seto K, Mok W, Stone J. Bridging the gap between theory and practice in elucidating modular gene regulatory sequence organisation within genomes. Genome 2020; 63:281-289. [PMID: 32114793 DOI: 10.1139/gen-2019-0150] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Changes to promoter regions probably have been responsible for many morphological evolutionary transitions, especially in animals. This idea is becoming testable, as data from genome projects amass and enable bioinformaticians to conduct comparative sequence analyses and test for correlations between genotypic similarities or differences and phenotypic likeness or disparity. Although such practical pursuits have initiated some theoretical considerations, a conceptual framework for understanding promoter region evolution, potentially effecting morphological evolution, is only starting to emerge, predominantly resulting from computational research. We contribute to this framework by specifying three big problems for promoter region research; reviewing computational research on promoter region evolution; and exemplifying a topic for future promoter region research - module evolution.
Collapse
Affiliation(s)
- Kelly Seto
- Department of Molecular & Medical Genetics, University of Toronto, Toronto, ON M5S 1A8, Canada
| | - Wendy Mok
- Department of Molecular Biology & Biophysics, University of Connecticut Health, Farmington, CT 06032, USA
| | - Jonny Stone
- Department of Biology, McMaster University, Hamilton, ON L8S 4K1, Canada; SHARCNet, McMaster University, Hamilton, ON L8S 4L8, Canada; Origins Institute, McMaster University, Hamilton, ON L8S 4M1, Canada
| |
Collapse
|
22
|
Oruba A, Saccani S, van Essen D. Role of cell-type specific nucleosome positioning in inducible activation of mammalian promoters. Nat Commun 2020; 11:1075. [PMID: 32103026 PMCID: PMC7044431 DOI: 10.1038/s41467-020-14950-5] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2019] [Accepted: 02/10/2020] [Indexed: 12/18/2022] Open
Abstract
The organization of nucleosomes across functional genomic elements represents a critical layer of control. Here, we present a strategy for high-resolution nucleosome profiling at selected genomic features, and use this to analyse dynamic nucleosome positioning at inducible and cell-type-specific mammalian promoters. We find that nucleosome patterning at inducible promoters frequently resembles that at active promoters, even before stimulus-driven activation. Accordingly, the nucleosome profile at many inactive inducible promoters is sufficient to predict cell-type-specific responsiveness. Induction of gene expression is generally not associated with major changes to nucleosome patterning, and a subset of inducible promoters can be activated without stable nucleosome depletion from their transcription start sites. These promoters are generally dependent on remodelling enzymes for their inducible activation, and exhibit transient nucleosome depletion only at alleles undergoing transcription initiation. Together, these data reveal how the responsiveness of inducible promoters to activating stimuli is linked to cell-type-specific nucleosome patterning. Nucleosome organisation plays important roles in regulating functional genomic elements. Here, the authors use high-resolution profiling to analyse dynamic nucleosome positioning at inducible and cell-type-specific promoters, providing a global view of chromatin architecture at inducible promoters.
Collapse
Affiliation(s)
- Agata Oruba
- Max Planck Institute for Immunobiology & Epigenetics, Stübeweg 51, Freiburg, D79108, Germany
| | - Simona Saccani
- Max Planck Institute for Immunobiology & Epigenetics, Stübeweg 51, Freiburg, D79108, Germany. .,Institute for Research on Cancer & Aging, Nice (IRCAN), 28 Avenue Valombrose, Nice, 06107, France.
| | - Dominic van Essen
- Max Planck Institute for Immunobiology & Epigenetics, Stübeweg 51, Freiburg, D79108, Germany. .,Institute for Research on Cancer & Aging, Nice (IRCAN), 28 Avenue Valombrose, Nice, 06107, France.
| |
Collapse
|
23
|
Erwin DH. The origin of animal body plans: a view from fossil evidence and the regulatory genome. Development 2020; 147:147/4/dev182899. [DOI: 10.1242/dev.182899] [Citation(s) in RCA: 48] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Abstract
ABSTRACT
The origins and the early evolution of multicellular animals required the exploitation of holozoan genomic regulatory elements and the acquisition of new regulatory tools. Comparative studies of metazoans and their relatives now allow reconstruction of the evolution of the metazoan regulatory genome, but the deep conservation of many genes has led to varied hypotheses about the morphology of early animals and the extent of developmental co-option. In this Review, I assess the emerging view that the early diversification of animals involved small organisms with diverse cell types, but largely lacking complex developmental patterning, which evolved independently in different bilaterian clades during the Cambrian Explosion.
Collapse
Affiliation(s)
- Douglas H. Erwin
- Department of Paleobiology, MRC-121, National Museum of Natural History, PO Box 37012, Washington, DC 20013-7012, USA
- Santa Fe Institute, 1399 Hyde Park Road, Santa Fe, NM 87501, USA
| |
Collapse
|
24
|
Lai HY, Zhang ZY, Su ZD, Su W, Ding H, Chen W, Lin H. iProEP: A Computational Predictor for Predicting Promoter. MOLECULAR THERAPY. NUCLEIC ACIDS 2019; 17:337-346. [PMID: 31299595 PMCID: PMC6616480 DOI: 10.1016/j.omtn.2019.05.028] [Citation(s) in RCA: 99] [Impact Index Per Article: 19.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/23/2019] [Revised: 05/18/2019] [Accepted: 05/19/2019] [Indexed: 11/29/2022]
Abstract
Promoter is a fundamental DNA element located around the transcription start site (TSS) and could regulate gene transcription. Promoter recognition is of great significance in determining transcription units, studying gene structure, analyzing gene regulation mechanisms, and annotating gene functional information. Many models have already been proposed to predict promoters. However, the performances of these methods still need to be improved. In this work, we combined pseudo k-tuple nucleotide composition (PseKNC) with position-correlation scoring function (PCSF) to formulate promoter sequences of Homo sapiens (H. sapiens), Drosophila melanogaster (D. melanogaster), Caenorhabditis elegans (C. elegans), Bacillus subtilis (B. subtilis), and Escherichia coli (E. coli). Minimum Redundancy Maximum Relevance (mRMR) algorithm and increment feature selection strategy were then adopted to find out optimal feature subsets. Support vector machine (SVM) was used to distinguish between promoters and non-promoters. In the 10-fold cross-validation test, accuracies of 93.3%, 93.9%, 95.7%, 95.2%, and 93.1% were obtained for H. sapiens, D. melanogaster, C. elegans, B. subtilis, and E. coli, with the areas under receiver operating curves (AUCs) of 0.974, 0.975, 0.981, 0.988, and 0.976, respectively. Comparative results demonstrated that our method outperforms existing methods for identifying promoters. An online web server was established that can be freely accessed (http://lin-group.cn/server/iProEP/).
Collapse
Affiliation(s)
- Hong-Yan Lai
- Key Laboratory for NeuroInformation of Ministry of Education, School of Life Science and Technology and Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu 610054, China
| | - Zhao-Yue Zhang
- Key Laboratory for NeuroInformation of Ministry of Education, School of Life Science and Technology and Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu 610054, China
| | - Zhen-Dong Su
- Key Laboratory for NeuroInformation of Ministry of Education, School of Life Science and Technology and Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu 610054, China
| | - Wei Su
- Key Laboratory for NeuroInformation of Ministry of Education, School of Life Science and Technology and Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu 610054, China
| | - Hui Ding
- Key Laboratory for NeuroInformation of Ministry of Education, School of Life Science and Technology and Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu 610054, China
| | - Wei Chen
- Key Laboratory for NeuroInformation of Ministry of Education, School of Life Science and Technology and Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu 610054, China; Innovative Institute of Chinese Medicine and Pharmacy, Chengdu University of Traditional Chinese Medicine, Chengdu 611730, China; Center for Genomics and Computational Biology, School of Life Sciences, North China University of Science and Technology, Tangshan 063000, China.
| | - Hao Lin
- Key Laboratory for NeuroInformation of Ministry of Education, School of Life Science and Technology and Center for Informational Biology, University of Electronic Science and Technology of China, Chengdu 610054, China.
| |
Collapse
|
25
|
Abugessaisa I, Noguchi S, Hasegawa A, Kondo A, Kawaji H, Carninci P, Kasukawa T. refTSS: A Reference Data Set for Human and Mouse Transcription Start Sites. J Mol Biol 2019; 431:2407-2422. [DOI: 10.1016/j.jmb.2019.04.045] [Citation(s) in RCA: 35] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2018] [Revised: 04/25/2019] [Accepted: 04/29/2019] [Indexed: 01/22/2023]
|
26
|
Lu Y, Li Q, Zheng K, Fu C, Jiang C, Zhou D, Xia C, Ma S. Development of a high efficient promoter finding method based on transient transfection. Gene 2019; 2:100008. [PMID: 32550544 PMCID: PMC7286058 DOI: 10.1016/j.gene.2019.100008] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2018] [Revised: 01/23/2019] [Accepted: 02/08/2019] [Indexed: 01/19/2023]
Abstract
In metazoan genome, the mechanism of gene expression regulation between transcriptional regulatory elements and their target gene is spatiotemporal. Active promoters possess many specific chromosomal features, such as hypersensitive to DNaseI and enrichment of specific histone modifications. In this article, we proposed a novel method which possesses a high efficiency to find promoters in vitro. A promoter-trap library was constructed with totally 706 random mouse genomic DNA fragment clones, and 260 promoter-active fragments of the library were screened by transient transfection into 4T1 cells. To demonstrate the accuracy of this promoter finding method, 13 fragments with promoter activities were randomly selected for published DNase-seq and ChIP-seq data analysis, downstream transcripts prediction and expression confirmation. qRT-PCR results showed that six predicted transcription units were successfully amplified in different mouse tissues/cells or in reconstituted mouse mammary tumors. Our results indicate that this promoter finding method can successfully detect the promoter-active fragments and their downstream transcripts.
Collapse
Key Words
- ATAC-seq, Assay for transposase-accessible chromatin using sequencing
- Bioinformatics
- CAGE, cap analysis of gene expression
- CMV, Cytomegalovirus
- Cancer-specific promoter
- ChIP-seq, Chromatin immunoprecipitation followed by massively parallel DNA sequencing
- Ct, threshold
- DHS, DNaseI hypersensitive sites
- DNase-seq, DNase I hypersensitive sites sequencing
- EF1a1, eukaryotic translation elongation factor 1 alpha 1
- FBS, fetal bovine serum
- GRO-seq, global run-on sequencing
- Gene expression regulation
- Gene finding
- H3K4me3, histone H3 lysine 4 trimethylation
- Itpr2, inositol 1, 4, 5-triphosphate receptor 2
- LSINCT5, long stress-induced non-coding transcript 5
- MCS, multiple cloning site
- MPRA, Massively parallel reporter assays
- Mouse breast cancer
- PBS, phosphate buffered solution
- Promoter trap
- RNA-seq, RNA sequencing
- SD, standard deviation
- STARR-seq, Self-transcribing active regulatory region sequencing
- TFs, transcription factors
- TSS, transcription start sites
- dNTPs, deoxy-ribonucleoside triphosphate
- eRNAs, enhancer RNAs
- mSEAP, mouse synthetic secreted embryonic alkaline phosphatase
- pNPP, p-nitropheny-phosate
- qRT-PCR, quantitative RT-PCR
- tpk1, thiamine pyrophosphokinase
Collapse
Affiliation(s)
- Yao Lu
- College of Bioscience and Biotechnology, Shenyang Agricultural University, Shenyang, Liaoning, PR China
| | - Qilong Li
- College of Bioscience and Biotechnology, Shenyang Agricultural University, Shenyang, Liaoning, PR China
| | - Kexin Zheng
- College of Food Science and Technology, Shenyang Agricultural University, Shenyang, Liaoning, PR China
| | - Chenghao Fu
- College of Bioscience and Biotechnology, Shenyang Agricultural University, Shenyang, Liaoning, PR China
| | - Chunying Jiang
- College of Bioscience and Biotechnology, Shenyang Agricultural University, Shenyang, Liaoning, PR China
| | - Dayu Zhou
- College of Food Science and Technology, Shenyang Agricultural University, Shenyang, Liaoning, PR China
| | - Chao Xia
- College of Bioscience and Biotechnology, Shenyang Agricultural University, Shenyang, Liaoning, PR China
| | - Shiliang Ma
- College of Bioscience and Biotechnology, Shenyang Agricultural University, Shenyang, Liaoning, PR China
| |
Collapse
|
27
|
Lim WJ, Kim KH, Kim JY, Jeong S, Kim N. Identification of DNA-Methylated CpG Islands Associated With Gene Silencing in the Adult Body Tissues of the Ogye Chicken Using RNA-Seq and Reduced Representation Bisulfite Sequencing. Front Genet 2019; 10:346. [PMID: 31040866 PMCID: PMC6476954 DOI: 10.3389/fgene.2019.00346] [Citation(s) in RCA: 27] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2018] [Accepted: 04/01/2019] [Indexed: 12/14/2022] Open
Abstract
DNA methylation is an epigenetic mark that plays an essential role in regulating gene expression. CpG islands are DNA methylations regions in promoters known to regulate gene expression through transcriptional silencing of the corresponding gene. DNA methylation at CpG islands is crucial for gene expression and tissue-specific processes. At the current time, a limited number of studies have reported on gene expression associated with DNA methylation in diverse adult tissues at the genome-wide level. Expression levels are rarely affected by DNA methylation in normal adult tissues; however, statistical differences in gene expression level correlated with DNA methylation have recently been revealed. In this study, we examined 20 pairs of DNA methylomes and transcriptomes from RNA-seq and reduced representation bisulfite sequencing (RRBS) data using adult Ogye chicken tissues. A total of 3,133 CpG islands were identified from 20 tissue data in a single chicken sample which could affect downstream genes. Analyzing these CpG island and gene pairs, 121 significant units were statistically correlated. Among them, six genes (CLDN3, DECR2, EVA1B, NME4, NTSR1, and XPNPEP2) were highly significantly changed by altered DNA methylation. Finally, our data demonstrated how DNA methylation correlated to gene expression in normal adult tissues. Our source codes can be found at https://github.com/wjlim/correlation-between-rna-seq-and-RRBS.
Collapse
Affiliation(s)
- Won-Jun Lim
- Genome Editing Research Center, Korea Research Institute of Bioscience and Biotechnology, Daejeon, South Korea.,Department of Bioinformatics, KRIBB School of Bioscience, University of Science and Technology, Daejeon, South Korea
| | - Kyoung Hyoun Kim
- Genome Editing Research Center, Korea Research Institute of Bioscience and Biotechnology, Daejeon, South Korea.,Department of Bioinformatics, KRIBB School of Bioscience, University of Science and Technology, Daejeon, South Korea
| | - Jae-Yoon Kim
- Genome Editing Research Center, Korea Research Institute of Bioscience and Biotechnology, Daejeon, South Korea.,Department of Bioinformatics, KRIBB School of Bioscience, University of Science and Technology, Daejeon, South Korea
| | - Seongmun Jeong
- Genome Editing Research Center, Korea Research Institute of Bioscience and Biotechnology, Daejeon, South Korea
| | - Namshin Kim
- Genome Editing Research Center, Korea Research Institute of Bioscience and Biotechnology, Daejeon, South Korea.,Department of Bioinformatics, KRIBB School of Bioscience, University of Science and Technology, Daejeon, South Korea
| |
Collapse
|
28
|
Cvetesic N, Leitch HG, Borkowska M, Müller F, Carninci P, Hajkova P, Lenhard B. SLIC-CAGE: high-resolution transcription start site mapping using nanogram-levels of total RNA. Genome Res 2018; 28:1943-1956. [PMID: 30404778 PMCID: PMC6280763 DOI: 10.1101/gr.235937.118] [Citation(s) in RCA: 25] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2018] [Accepted: 10/25/2018] [Indexed: 01/22/2023]
Abstract
Cap analysis of gene expression (CAGE) is a methodology for genome-wide quantitative mapping of mRNA 5′ ends to precisely capture transcription start sites at a single nucleotide resolution. In combination with high-throughput sequencing, CAGE has revolutionized our understanding of the rules of transcription initiation, led to discovery of new core promoter sequence features, and discovered transcription initiation at enhancers genome-wide. The biggest limitation of CAGE is that even the most recently improved version (nAnT-iCAGE) still requires large amounts of total cellular RNA (5 µg), preventing its application to scarce biological samples such as those from early embryonic development or rare cell types. Here, we present SLIC-CAGE, a Super-Low Input Carrier-CAGE approach to capture 5′ ends of RNA polymerase II transcripts from as little as 5–10 ng of total RNA. This dramatic increase in sensitivity is achieved by specially designed, selectively degradable carrier RNA. We demonstrate the ability of SLIC-CAGE to generate data for genome-wide promoterome with 1000-fold less material than required by existing CAGE methods, by generating a complex, high-quality library from mouse embryonic day 11.5 primordial germ cells.
Collapse
Affiliation(s)
- Nevena Cvetesic
- Institute of Clinical Sciences, Faculty of Medicine, Imperial College London, London W12 0NN, United Kingdom.,MRC London Institute of Medical Sciences, London W12 0NN, United Kingdom
| | - Harry G Leitch
- Institute of Clinical Sciences, Faculty of Medicine, Imperial College London, London W12 0NN, United Kingdom.,MRC London Institute of Medical Sciences, London W12 0NN, United Kingdom
| | - Malgorzata Borkowska
- Institute of Clinical Sciences, Faculty of Medicine, Imperial College London, London W12 0NN, United Kingdom.,MRC London Institute of Medical Sciences, London W12 0NN, United Kingdom
| | - Ferenc Müller
- Institute of Cancer and Genomic Sciences, University of Birmingham, Edgbaston B15 2TT, United Kingdom
| | - Piero Carninci
- RIKEN Center for Life Science Technologies, Division of Genomic Technologies, Yokohama City, Kanagawa 230-0045, Japan.,RIKEN Omics Science Center, Yokohama City, Kanagawa 230-0045, Japan
| | - Petra Hajkova
- Institute of Clinical Sciences, Faculty of Medicine, Imperial College London, London W12 0NN, United Kingdom.,MRC London Institute of Medical Sciences, London W12 0NN, United Kingdom
| | - Boris Lenhard
- Institute of Clinical Sciences, Faculty of Medicine, Imperial College London, London W12 0NN, United Kingdom.,MRC London Institute of Medical Sciences, London W12 0NN, United Kingdom.,Sars International Centre for Marine Molecular Biology, University of Bergen, N-5008 Bergen, Norway
| |
Collapse
|
29
|
Gene expression variability across cells and species shapes innate immunity. Nature 2018; 563:197-202. [PMID: 30356220 PMCID: PMC6347972 DOI: 10.1038/s41586-018-0657-2] [Citation(s) in RCA: 125] [Impact Index Per Article: 20.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2017] [Accepted: 08/17/2018] [Indexed: 11/09/2022]
Abstract
As the first line of defence against pathogens, cells mount an innate immune response, which varies widely from cell to cell. The response must be potent but carefully controlled to avoid self-damage. How these constraints have shaped the evolution of innate immunity remains poorly understood. Here we characterize the innate immune response's transcriptional divergence between species and variability in expression among cells. Using bulk and single-cell transcriptomics in fibroblasts and mononuclear phagocytes from different species, challenged with immune stimuli, we map the architecture of the innate immune response. Transcriptionally diverging genes, including those that encode cytokines and chemokines, vary across cells and have distinct promoter structures. Conversely, genes that are involved in the regulation of this response, such as those that encode transcription factors and kinases, are conserved between species and display low cell-to-cell variability in expression. We suggest that this expression pattern, which is observed across species and conditions, has evolved as a mechanism for fine-tuned regulation to achieve an effective but balanced response.
Collapse
|
30
|
Oliveira GP, de Aquino ILM, Luiz APMF, Abrahão JS. Putative Promoter Motif Analyses Reinforce the Evolutionary Relationships Among Faustoviruses, Kaumoebavirus, and Asfarvirus. Front Microbiol 2018; 9:1041. [PMID: 29875752 PMCID: PMC5974111 DOI: 10.3389/fmicb.2018.01041] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2018] [Accepted: 05/02/2018] [Indexed: 11/13/2022] Open
Abstract
Putative promoter motifs have been described in viruses belonging to the nucleocytoplasmic large DNA viruses (NCLDVs) group; however, few studies have been conducted to search for promoter sequences in newly discovered amoebal giant viruses. Faustovirus and kaumoebavirus are two Asfarviridae-related giant viruses belonging to the NCLDVs group. The phylogenetic relationships among these viruses led us to investigate if the promoter regions previously identified in the asfarvirus genome could be shared by its amoebal virus relatives. Previous studies demonstrated the role of A/T-rich motifs as promoters of asfarvirus. In this study, we reinforce the importance of A/T rich motifs in asfarvirus and show that the TATTT and TATATA motifs are also shared in abundance by faustovirus and kaumoebavirus. Here, we demonstrate that TATTT and TATATA are mostly present in faustovirus and kaumoebavirus genomic intergenic regions (IRs) and that they are widely distributed at 0 to -100 bp upstream to the start codons. We observed that putative promoter motifs are present as one to dozens of repetitions in IRs of faustovirus, kaumoebavirus, and asfarvirus, which is similar to that described previously for marseilleviruses. Furthermore, the motifs were found in most of the upstream regions of the core genes of faustovirus, kaumoebavirus, and asfarvirus, which suggests that the motifs could already be present in the ancestor of these viruses before the irradiation of this group. Our work provides an in-depth analysis of the putative promoter motifs present in asfarvirus, kaumoebavirus, and faustovirus, which reinforces the relationship among these viruses.
Collapse
Affiliation(s)
- Graziele P Oliveira
- Laboratório de Vírus, Departamento de Microbiologia, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Belo Horizonte, Brazil
| | - Isabella L M de Aquino
- Laboratório de Vírus, Departamento de Microbiologia, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Belo Horizonte, Brazil
| | - Ana P M F Luiz
- Laboratório de Vírus, Departamento de Microbiologia, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Belo Horizonte, Brazil
| | - Jônatas S Abrahão
- Laboratório de Vírus, Departamento de Microbiologia, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Belo Horizonte, Brazil
| |
Collapse
|
31
|
Fraile-Bethencourt E, Valenzuela-Palomo A, Díez-Gómez B, Infante M, Durán M, Marcos G, Lastra E, Gómez-Barrero S, Velasco EA. Genetic dissection of the BRCA2 promoter and transcriptional impact of DNA variants. Breast Cancer Res Treat 2018; 171:53-63. [PMID: 29766361 DOI: 10.1007/s10549-018-4826-7] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2017] [Accepted: 04/16/2018] [Indexed: 11/28/2022]
Abstract
PURPOSE Promoter mutations may affect transcription and can be associated with human diseases. However, the promoters of the breast cancer (BC) genes are not regularly screened. Our goal was to investigate the BRCA2 promoter in order to study a possible correlation between impaired transcription and disease. METHODS The proximal and core promoter of the BRCA2 gene was sequenced in 95 high-risk BC patients. A BRCA2-promoter insert [- 938 to + 312 from the transcription start site (TSS)] was generated and cloned into the firefly luciferase vector pGL4.10. Promoter variants and deletions were introduced by site-directed mutagenesis and quantified by Dual-Luciferase assays and semi-quantitative RT-PCR. RESULTS Three different variants were detected in high-risk BC patients: rs3092989, rs206118, and rs563971900. Functional mapping of 13 overlapping deletions revealed four down-regulating segments (TSS positions): -59_-10del/µdel3 (16% of activity of the wild-type construct), -104_-55del/µdel4 (62%), -239_-190del/µdel7 (39%), -464_-415/µdel12 (78%), suggesting the presence therein of putative transcriptional activator motifs. Additionally, six microdeletions rendered luciferase overexpression: +32_+81del/µdel1 (356%), -14_+36del/µdel2 (180%), -194_-145del/µdel6 (154%), -284_-235del/µdel8 (168%), -329_-280del/µdel9 (111%), and -509_-460del/µdel13 (139%), which is indicative of repressor elements. Functional assays of 15 promoter variants (including those detected in patients) showed that ten of them significantly altered expression with seven up-regulating (113-163%) and three down-regulating (rs551887850_G, rs570548398_T, rs55880202_T; 72-83%) SNPs. Eight of them were located in an ENCODE-DNase Hypersensitive Cluster (TSS - 185 to + 105) where most active transcriptional motifs are known to be placed. CONCLUSIONS BRCA2 expression is highly sensitive to promoter variations as most of them induced relevant changes. Moreover, we mapped critical regions of the BRCA2 promoter that may constitute potential targets for regulatory variants. Three SNPs moderately decreased luciferase activity, but confirmation of its potential pathogenicity requires further analysis. These data reinforce the need to screen the promoter regions of breast cancer genes with a view to discovering novel deleterious mutations.
Collapse
Affiliation(s)
- Eugenia Fraile-Bethencourt
- Grupo de Splicing y Cáncer, Instituto de Biología y Genética Molecular (IBGM), Consejo Superior de Investigaciones Científicas (CSIC-UVa), Sanz y Forés 3, 47003, Valladolid, Spain
| | - Alberto Valenzuela-Palomo
- Grupo de Splicing y Cáncer, Instituto de Biología y Genética Molecular (IBGM), Consejo Superior de Investigaciones Científicas (CSIC-UVa), Sanz y Forés 3, 47003, Valladolid, Spain
| | - Beatriz Díez-Gómez
- Grupo de Splicing y Cáncer, Instituto de Biología y Genética Molecular (IBGM), Consejo Superior de Investigaciones Científicas (CSIC-UVa), Sanz y Forés 3, 47003, Valladolid, Spain
| | - Mar Infante
- Cancer Genetics, Instituto de Biología y Genética Molecular (UVa-CSIC), Valladolid, Spain
| | - Mercedes Durán
- Cancer Genetics, Instituto de Biología y Genética Molecular (UVa-CSIC), Valladolid, Spain
| | - Germán Marcos
- Servicio de Oncología, Hospital Río Hortega, Valladolid, Spain
| | - Enrique Lastra
- Servicio de Oncología, Hospital Universitario de Burgos, Burgos, Spain
| | | | - Eladio A Velasco
- Grupo de Splicing y Cáncer, Instituto de Biología y Genética Molecular (IBGM), Consejo Superior de Investigaciones Científicas (CSIC-UVa), Sanz y Forés 3, 47003, Valladolid, Spain.
| |
Collapse
|
32
|
Wu Q, Yu H, Wei W, Cheng Y, Huang S, Shi H, Liu S, Xia J, Jia H, Hao L. Linkage disequilibrium and functional analysis of PRE1 insertion together with SNPs in the promoter region of IGFBP7 gene in different pig breeds. J Appl Genet 2018; 59:231-241. [PMID: 29574509 DOI: 10.1007/s13353-018-0430-0] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2017] [Revised: 01/10/2018] [Accepted: 01/15/2018] [Indexed: 11/24/2022]
Abstract
Polymorphisms in regions upstream of transcription initiation site may modify the transcriptional activity of target genes by changing promoter activity. This study aims to determine whether or not polymorphisms at porcine IGFBP7 promoter region affect gene expression. In this study, eight SNPs and one PRE1 insertion in this region were first confirmed. The PRE1 insertion was widespread in 20 Chinese indigenous breeds, but was not observed in three commercial breeds. A perfect linkage disequilibrium, consisting of six of those SNPs and a PRE1, was observed with two haplotypes (h1 and h2) in five pig breeds. The h1 haplotype had an overwhelming superiority distribution in Large White, Landrace, and Bama mini-pig; in turn, the h2 only existed in the PRE1 presence breeds. As the haplotypes and PRE1 were located at gene promoter regions, we further investigated the transfection of plasmids with three different fragments of IGFBP-7 promoter region (H1, H2, RF). The CMV promoter of the pEGFP-N1 was substituted by these three different fragments, respectively. Different transcriptional and translational activities of EGFP in PK-15 cells were observed in these three recombinant plasmids by quantitative real-time PCR and flow cytometric analysis. The results indicated that H1 had the higher transcriptional and translational activities of EGFP as compared to the H2 (P < 0.05, P < 0.05). As compared to the RF group, EGFP mRNA expression level was significantly higher in H1 groups (P < 0.05). The IGFBP-7 promoter polymorphisms detected in this study may be important functional variants and potential genetic markers for pig population genetic study.
Collapse
Affiliation(s)
- Qingyan Wu
- College of Animal Science, Jilin University, No. 5333 Xi'an Road, Changchun, Jilin, 130062, China
| | - Hao Yu
- College of Animal Science, Jilin University, No. 5333 Xi'an Road, Changchun, Jilin, 130062, China
| | - Wenzhen Wei
- College of Animal Science, Jilin University, No. 5333 Xi'an Road, Changchun, Jilin, 130062, China
| | - Yunyun Cheng
- College of Animal Science, Jilin University, No. 5333 Xi'an Road, Changchun, Jilin, 130062, China
| | - Shan Huang
- College of Animal Science, Jilin University, No. 5333 Xi'an Road, Changchun, Jilin, 130062, China
| | - Hongyu Shi
- College of Animal Science, Jilin University, No. 5333 Xi'an Road, Changchun, Jilin, 130062, China
| | - Songcai Liu
- College of Animal Science, Jilin University, No. 5333 Xi'an Road, Changchun, Jilin, 130062, China.,Five-Star Animal Health Pharmaceutical Factory, No. 5333 Xi'an Road, Changchun, Jilin, 130062, China
| | - Jichao Xia
- Melbourne School of Population and Global Health, The University of Melbourne, 207-221 Bouverie St, Carlton, VIC, 3053, Australia
| | - Hongyao Jia
- First hospital of Jilin University, No.71 Xinmin Street, Changchun, Jilin, 130021, China.
| | - Linlin Hao
- College of Animal Science, Jilin University, No. 5333 Xi'an Road, Changchun, Jilin, 130062, China.
| |
Collapse
|
33
|
|
34
|
Brozovic M, Dantec C, Dardaillon J, Dauga D, Faure E, Gineste M, Louis A, Naville M, Nitta KR, Piette J, Reeves W, Scornavacca C, Simion P, Vincentelli R, Bellec M, Aicha SB, Fagotto M, Guéroult-Bellone M, Haeussler M, Jacox E, Lowe EK, Mendez M, Roberge A, Stolfi A, Yokomori R, Brown C, Cambillau C, Christiaen L, Delsuc F, Douzery E, Dumollard R, Kusakabe T, Nakai K, Nishida H, Satou Y, Swalla B, Veeman M, Volff JN, Lemaire P. ANISEED 2017: extending the integrated ascidian database to the exploration and evolutionary comparison of genome-scale datasets. Nucleic Acids Res 2018; 46:D718-D725. [PMID: 29149270 PMCID: PMC5753386 DOI: 10.1093/nar/gkx1108] [Citation(s) in RCA: 74] [Impact Index Per Article: 12.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2017] [Revised: 10/22/2017] [Accepted: 11/09/2017] [Indexed: 12/14/2022] Open
Abstract
ANISEED (www.aniseed.cnrs.fr) is the main model organism database for tunicates, the sister-group of vertebrates. This release gives access to annotated genomes, gene expression patterns, and anatomical descriptions for nine ascidian species. It provides increased integration with external molecular and taxonomy databases, better support for epigenomics datasets, in particular RNA-seq, ChIP-seq and SELEX-seq, and features novel interactive interfaces for existing and novel datatypes. In particular, the cross-species navigation and comparison is enhanced through a novel taxonomy section describing each represented species and through the implementation of interactive phylogenetic gene trees for 60% of tunicate genes. The gene expression section displays the results of RNA-seq experiments for the three major model species of solitary ascidians. Gene expression is controlled by the binding of transcription factors to cis-regulatory sequences. A high-resolution description of the DNA-binding specificity for 131 Ciona robusta (formerly C. intestinalis type A) transcription factors by SELEX-seq is provided and used to map candidate binding sites across the Ciona robusta and Phallusia mammillata genomes. Finally, use of a WashU Epigenome browser enhances genome navigation, while a Genomicus server was set up to explore microsynteny relationships within tunicates and with vertebrates, Amphioxus, echinoderms and hemichordates.
Collapse
Affiliation(s)
| | | | | | - Delphine Dauga
- Bioself Communication; 28 rue de la Bibliothèque, F-13001 Marseille, France
| | - Emmanuel Faure
- Institut de Biologie Computationnelle, Université de Montpellier, Montpellier, France
- Team VORTEX, Institut de Recherche en Informatique de Toulouse, Universities Toulouse I and III, CNRS, INPT, ENSEEIHT; 2 rue Camichel, BP 7122, F-31071 Toulouse Cedex 7, France
| | | | - Alexandra Louis
- DYOGEN, IBENS, Département de Biologie, Ecole Normale Supérieure, CNRS, Inserm, PSL Research University, F-75005, Paris, France
| | - Magali Naville
- Institut de Génomique Fonctionnelle de Lyon, Université de Lyon, Ecole Normale Supérieure de Lyon, Université Claude Bernard Lyon 1, CNRS; 46 allée d’Italie, F-69364 Lyon, France
| | - Kazuhiro R Nitta
- IBDM, Aix-Marseille Université, CNRS, Campus de Luminy, Case 907; 163 Avenue de Luminy, F-13288 Marseille Cedex 9, France
| | - Jacques Piette
- CRBM, Université de Montpellier, CNRS, Montpellier, France
| | - Wendy Reeves
- Division of Biology, Kansas State University, Manhattan, Kansas
| | | | - Paul Simion
- ISEM, Université de Montpellier, CNRS, IRD, EPHE, Montpellier, France
| | - Renaud Vincentelli
- AFMB, Aix-Marseille Université, CNRS, Campus de Luminy, Case 932, 163 Avenue de Luminy, F-13288 Marseille Cedex 9, France
| | | | - Sameh Ben Aicha
- Laboratoire de Biologie du Développement de Villefranche-sur-mer (LBDV), Sorbonne Universités, Université Pierre-et-Marie-Curie, CNRS; Quai de la Darse, F-06234 Villefranche-sur-Mer Cedex, France
| | | | | | - Maximilian Haeussler
- Santa Cruz Genomics Institute, MS CBSE, University of California, 1156 High Street, Santa Cruz, CA 95064, USA
| | - Edwin Jacox
- CRBM, Université de Montpellier, CNRS, Montpellier, France
| | - Elijah K Lowe
- Department of Biology and Evolution of Marine Organisms, Stazione Zoologica Anton Dohrn, Villa Comunale, 80121 Napoli, Italy
- BEACON Center for the Study of Evolution in Action, Michigan State University, East Lansing, MI48824, USA
| | - Mickael Mendez
- IBDM, Aix-Marseille Université, CNRS, Campus de Luminy, Case 907; 163 Avenue de Luminy, F-13288 Marseille Cedex 9, France
| | | | - Alberto Stolfi
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, GA 30332, USA
| | - Rui Yokomori
- Human Genome Center, the Institute of Medical Science, the University of Tokyo, 4-6-1 Shirokanedai, Minato, Tokyo 108-8639, Japan
| | - C Titus Brown
- BEACON Center for the Study of Evolution in Action, Michigan State University, East Lansing, MI48824, USA
- Population Health and Reproduction, UC Davis, Davis, CA 95616, USA
| | - Christian Cambillau
- AFMB, Aix-Marseille Université, CNRS, Campus de Luminy, Case 932, 163 Avenue de Luminy, F-13288 Marseille Cedex 9, France
| | - Lionel Christiaen
- New York University, Center for Developmental Genetics, Department of Biology, 1009 Silver Center, 100 Washington Square East, New York City, NY10003, USA
| | - Frédéric Delsuc
- ISEM, Université de Montpellier, CNRS, IRD, EPHE, Montpellier, France
| | - Emmanuel Douzery
- ISEM, Université de Montpellier, CNRS, IRD, EPHE, Montpellier, France
| | - Rémi Dumollard
- Laboratoire de Biologie du Développement de Villefranche-sur-mer (LBDV), Sorbonne Universités, Université Pierre-et-Marie-Curie, CNRS; Quai de la Darse, F-06234 Villefranche-sur-Mer Cedex, France
| | - Takehiro Kusakabe
- Department of Biology, Faculty of Science and Engineering, Konan University, Kobe 658-8501, Japan
| | - Kenta Nakai
- Human Genome Center, the Institute of Medical Science, the University of Tokyo, 4-6-1 Shirokanedai, Minato, Tokyo 108-8639, Japan
| | - Hiroki Nishida
- Department of Biological Sciences, Graduate School of Science, Osaka University, 1-1 Machikaneyama-cho, Toyonaka, Osaka 560-0043, Japan
| | - Yutaka Satou
- Department of Zoology, Graduate School of Science, Kyoto University, Kyoto 606-8502, Japan
| | - Billie Swalla
- BEACON Center for the Study of Evolution in Action, Michigan State University, East Lansing, MI48824, USA
- Friday Harbor Laboratories, 620 University Road, Friday Harbor, WA 98250-9299, USA
| | - Michael Veeman
- Division of Biology, Kansas State University, Manhattan, Kansas
| | - Jean-Nicolas Volff
- Institut de Génomique Fonctionnelle de Lyon, Université de Lyon, Ecole Normale Supérieure de Lyon, Université Claude Bernard Lyon 1, CNRS; 46 allée d’Italie, F-69364 Lyon, France
| | - Patrick Lemaire
- CRBM, Université de Montpellier, CNRS, Montpellier, France
- Institut de Biologie Computationnelle, Université de Montpellier, Montpellier, France
| |
Collapse
|
35
|
Xiang J, Wen F, Zhang L, Zhou Y. FOXD3 inhibits SCN2A gene transcription in intractable epilepsy cell models. Exp Neurol 2017; 302:14-21. [PMID: 29288635 DOI: 10.1016/j.expneurol.2017.12.012] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2017] [Revised: 12/20/2017] [Accepted: 12/26/2017] [Indexed: 12/28/2022]
Abstract
The expression of sodium voltage-gated channel alpha subunit 2 (SCN2A) is closely related to the development of epilepsy. This study investigated regulatory element of the SCN2A gene involved in epilepsy. An intractable epilepsy cell model was constructed using hippocampal primary neurons and the SH-SY5Y cell line. SCN2A protein and gene expression in cells as well as the level of lactic acid dehydrogenase (LDH) in the cell culture supernatants was detected. Potential regulatory factors of SCN2A and its upstream regulatory elements were identified using the dual-luciferase reporter assay. Finally, the role of the hypothetical transcription factor in epilepsy was examined by using its small interfering RNA (siRNA). Results found that levels of LDH and expression of the hypothetical transcription factor, Forkhead box D3 (FOXD3), was both increased in the model cells, whereas that of SCN2A was decreased. The results of dual-luciferase reporter assays revealed that an upstream region of SCN2A gene spanning from nucleotides -1617 to -1470 was a transcription factor binding region and a trans-acting factor role of FOXD3 was identified in the core region (GGCAAAATTAT). Then the FOXD3 binding site was further verified by the chromatin immunoprecipitation (ChIP) assay and electrophoretic mobility shift assay (EMSA). After SH-SY5Y cells were transfected with FOXD3 siRNA, the release of LDH into culture supernatants and the LDH expression levels in cells were significantly decreased. SCN2A expression in model cells was increased by knockdown of FOXD3. Therefore, this study demonstrated that FOXD3 is a trans-acting factor of SCN2A, and this mechanism may play a role in cell injury after epilepsy.
Collapse
Affiliation(s)
- Jun Xiang
- Department of Neurosurgery, The Second Xiang-Ya Hospital of Central South University, Changsha 410011, Hunan Province, China.
| | - Fang Wen
- Neurology Electroencephalogram Room of Neurology Department, The Second Xiang-Ya Hospital of Central South University, Changsha 410011, Hunan Province, China
| | - Lingyun Zhang
- Department of Neurosurgery, The Second Xiang-Ya Hospital of Central South University, Changsha 410011, Hunan Province, China
| | - Yu Zhou
- Department of Neurosurgery, The Second Xiang-Ya Hospital of Central South University, Changsha 410011, Hunan Province, China
| |
Collapse
|
36
|
The Investigation of Promoter Sequences of Marseilleviruses Highlights a Remarkable Abundance of the AAATATTT Motif in Intergenic Regions. J Virol 2017; 91:JVI.01088-17. [PMID: 28794030 DOI: 10.1128/jvi.01088-17] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2017] [Accepted: 08/04/2017] [Indexed: 12/13/2022] Open
Abstract
Viruses display a wide range of genomic profiles and, consequently, a variety of gene expression strategies. Specific sequences associated with transcriptional processes have been described in viruses, and putative promoter motifs have been elucidated for some nucleocytoplasmic large DNA viruses (NCLDV). Among NCLDV, the Marseilleviridae is a well-recognized family because of its genomic mosaicism. The marseilleviruses have an ability to incorporate foreign genes, especially from sympatric organisms inhabiting Acanthamoeba, its main known host. Here, we identified for the first time an eight-nucleotide A/T-rich promoter sequence (AAATATTT) associated with 55% of marseillevirus genes that is conserved in all marseilleviruses lineages, a higher level of conservation than that of any giant virus described to date. We instigated our prediction about the promoter motif by biological assays and by evaluating how single mutations in this octamer can impact gene expression. The investigation of sequences that regulate the expression of genes relative to lateral transfer revealed that the promoter motifs do not appear to be incorporated by marseilleviruses from donor organisms. Indeed, analyses of the intergenic regions that regulate lateral gene transfer-related genes have revealed an independent origin of the marseillevirus intergenic regions that does not match gene-donor organisms. About 50% of AAATATTT motifs spread throughout intergenic regions of the marseilleviruses are present as multiple copies. We believe that such multiple motifs are associated with increased expression of a given gene or are related to incorporation of foreign genes into the mosaic genome of marseilleviruses.IMPORTANCE The marseilleviruses draw attention because of the peculiar features of their genomes; however, little is known about their gene expression patterns or the factors that regulate those expression patterns. The limited published research on the expression patterns of the marseilleviruses and their unique genomes has led us to study the promoter motif sequences in the intergenic regions of the marseilleviruses. This work is the first to analyze promoter sequences in the genomes of the marseilleviruses. We also suggest a strong capacity to acquire foreign genes and to express those genes mediated by multiple copies of the promoter motifs available in intergenic regions. These findings contribute to an understanding of genomic expansion and plasticity observed in these giant viruses.
Collapse
|
37
|
Abstract
The leap from simple unicellularity to complex multicellularity remains one of life's major enigmas. The origins of metazoan developmental gene regulatory mechanisms are sought by analyzing gene regulation in extant eumetazoans, sponges, and unicellular organisms. The main hypothesis of this manuscript is that, developmental enhancers evolved from unicellular inducible promoters that diversified the expression of regulatory genes during metazoan evolution. Promoters and enhancers are functionally similar; both can regulate the transcription of distal promoters and both direct local transcription. Additionally, enhancers have experimentally characterized structural features that reveal their origin from inducible promoters. The distal co-operative regulation among promoters identified in unicellular opisthokonts possibly represents the precursor of distal regulation of promoters by enhancers. During metazoan evolution, constitutive-type promoters of regulatory genes would have acquired novel receptivity to distal regulatory inputs from promoters of inducible genes that eventually specialized as enhancers. The novel regulatory interactions would have caused constitutively expressed genes controlling differential gene expression in unicellular organisms to become themselves differentially expressed. The consequence of the novel regulatory interactions was that regulatory pathways of unicellular organisms became interlaced and ultimately evolved into the intricate developmental gene regulatory networks (GRNs) of extant metazoans.
Collapse
Affiliation(s)
- César Arenas-Mena
- Department of Biology, College of Staten Island and Graduate Center, The City University of New York (CUNY), Staten Island, NY 10314, USA
| |
Collapse
|
38
|
Oliveira GP, Andrade ACDSP, Rodrigues RAL, Arantes TS, Boratto PVM, Silva LKDS, Dornas FP, Trindade GDS, Drumond BP, La Scola B, Kroon EG, Abrahão JS. Promoter Motifs in NCLDVs: An Evolutionary Perspective. Viruses 2017; 9:v9010016. [PMID: 28117683 PMCID: PMC5294985 DOI: 10.3390/v9010016] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2016] [Revised: 12/30/2016] [Accepted: 01/05/2017] [Indexed: 01/18/2023] Open
Abstract
For many years, gene expression in the three cellular domains has been studied in an attempt to discover sequences associated with the regulation of the transcription process. Some specific transcriptional features were described in viruses, although few studies have been devoted to understanding the evolutionary aspects related to the spread of promoter motifs through related viral families. The discovery of giant viruses and the proposition of the new viral order Megavirales that comprise a monophyletic group, named nucleo-cytoplasmic large DNA viruses (NCLDV), raised new questions in the field. Some putative promoter sequences have already been described for some NCLDV members, bringing new insights into the evolutionary history of these complex microorganisms. In this review, we summarize the main aspects of the transcription regulation process in the three domains of life, followed by a systematic description of what is currently known about promoter regions in several NCLDVs. We also discuss how the analysis of the promoter sequences could bring new ideas about the giant viruses’ evolution. Finally, considering a possible common ancestor for the NCLDV group, we discussed possible promoters’ evolutionary scenarios and propose the term “MEGA-box” to designate an ancestor promoter motif (‘TATATAAAATTGA’) that could be evolved gradually by nucleotides’ gain and loss and point mutations.
Collapse
Affiliation(s)
- Graziele Pereira Oliveira
- Laboratório de Vírus, Departamento de Microbiologia, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Belo Horizonte 31270-901, Minas Gerais, Brazil.
| | - Ana Cláudia Dos Santos Pereira Andrade
- Laboratório de Vírus, Departamento de Microbiologia, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Belo Horizonte 31270-901, Minas Gerais, Brazil.
| | - Rodrigo Araújo Lima Rodrigues
- Laboratório de Vírus, Departamento de Microbiologia, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Belo Horizonte 31270-901, Minas Gerais, Brazil.
| | - Thalita Souza Arantes
- Laboratório de Vírus, Departamento de Microbiologia, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Belo Horizonte 31270-901, Minas Gerais, Brazil.
| | - Paulo Victor Miranda Boratto
- Laboratório de Vírus, Departamento de Microbiologia, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Belo Horizonte 31270-901, Minas Gerais, Brazil.
| | - Ludmila Karen Dos Santos Silva
- Laboratório de Vírus, Departamento de Microbiologia, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Belo Horizonte 31270-901, Minas Gerais, Brazil.
| | - Fábio Pio Dornas
- Laboratório de Vírus, Departamento de Microbiologia, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Belo Horizonte 31270-901, Minas Gerais, Brazil.
| | - Giliane de Souza Trindade
- Laboratório de Vírus, Departamento de Microbiologia, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Belo Horizonte 31270-901, Minas Gerais, Brazil.
| | - Betânia Paiva Drumond
- Laboratório de Vírus, Departamento de Microbiologia, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Belo Horizonte 31270-901, Minas Gerais, Brazil.
| | - Bernard La Scola
- Unité de Recherche sur les Maladies Infectieuses et Tropicales Emergentes (URMITE) UM63 CNRS 7278 IRD 198 INSERM U1095, Aix-Marseille Université., 27 Boulevard Jean Moulin, Faculté de Médecine, 13385 Marseille Cedex 05, France.
| | - Erna Geessien Kroon
- Laboratório de Vírus, Departamento de Microbiologia, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Belo Horizonte 31270-901, Minas Gerais, Brazil.
| | - Jônatas Santos Abrahão
- Laboratório de Vírus, Departamento de Microbiologia, Instituto de Ciências Biológicas, Universidade Federal de Minas Gerais, Belo Horizonte 31270-901, Minas Gerais, Brazil.
| |
Collapse
|
39
|
Influence of Rotational Nucleosome Positioning on Transcription Start Site Selection in Animal Promoters. PLoS Comput Biol 2016; 12:e1005144. [PMID: 27716823 PMCID: PMC5055345 DOI: 10.1371/journal.pcbi.1005144] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2016] [Accepted: 09/11/2016] [Indexed: 01/20/2023] Open
Abstract
The recruitment of RNA-Pol-II to the transcription start site (TSS) is an important step in gene regulation in all organisms. Core promoter elements (CPE) are conserved sequence motifs that guide Pol-II to the TSS by interacting with specific transcription factors (TFs). However, only a minority of animal promoters contains CPEs. It is still unknown how Pol-II selects the TSS in their absence. Here we present a comparative analysis of promoters' sequence composition and chromatin architecture in five eukaryotic model organisms, which shows the presence of common and unique DNA-encoded features used to organize chromatin. Analysis of Pol-II initiation patterns uncovers that, in the absence of certain CPEs, there is a strong correlation between the spread of initiation and the intensity of the 10 bp periodic signal in the nearest downstream nucleosome. Moreover, promoters' primary and secondary initiation sites show a characteristic 10 bp periodicity in the absence of CPEs. We also show that DNA natural variants in the region immediately downstream the TSS are able to affect both the nucleosome-DNA affinity and Pol-II initiation pattern. These findings support the notion that, in addition to CPEs mediated selection, sequence-induced nucleosome positioning could be a common and conserved mechanism of TSS selection in animals.
Collapse
|
40
|
Merenciano M, Ullastres A, de Cara MAR, Barrón MG, González J. Multiple Independent Retroelement Insertions in the Promoter of a Stress Response Gene Have Variable Molecular and Functional Effects in Drosophila. PLoS Genet 2016; 12:e1006249. [PMID: 27517860 PMCID: PMC4982627 DOI: 10.1371/journal.pgen.1006249] [Citation(s) in RCA: 34] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/28/2015] [Accepted: 07/18/2016] [Indexed: 12/20/2022] Open
Abstract
Promoters are structurally and functionally diverse gene regulatory regions. The presence or absence of sequence motifs and the spacing between the motifs defines the properties of promoters. Recent alternative promoter usage analyses in Drosophila melanogaster revealed that transposable elements significantly contribute to promote diversity. In this work, we analyzed in detail one of the transposable element insertions, named FBti0019985, that has been co-opted to drive expression of CG18446, a candidate stress response gene. We analyzed strains from different natural populations and we found that besides FBti0019985, there are another eight independent transposable elements inserted in the proximal promoter region of CG18446. All nine insertions are solo-LTRs that belong to the roo family. We analyzed the sequence of the nine roo insertions and we investigated whether the different insertions were functionally equivalent by performing 5'-RACE, gene expression, and cold-stress survival experiments. We found that different insertions have different molecular and functional consequences. The exact position where the transposable elements are inserted matters, as they all showed highly conserved sequences but only two of the analyzed insertions provided alternative transcription start sites, and only the FBti0019985 insertion consistently affects CG18446 expression. The phenotypic consequences of the different insertions also vary: only FBti0019985 was associated with cold-stress tolerance. Interestingly, the only previous report of transposable elements inserting repeatedly and independently in a promoter region in D. melanogaster, were also located upstream of a stress response gene. Our results suggest that functional validation of individual structural variants is needed to resolve the complexity of insertion clusters.
Collapse
Affiliation(s)
- Miriam Merenciano
- Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), Barcelona. Spain
| | - Anna Ullastres
- Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), Barcelona. Spain
| | - M. A. R. de Cara
- Laboratoire d’Eco-anthropologie et Ethnobiologie, UMR 7206, CNRS/MNHN/Universite Paris 7, Museum National d’Histoire Naturelle, F-75116 Paris, France
| | - Maite G. Barrón
- Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), Barcelona. Spain
| | - Josefa González
- Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), Barcelona. Spain
- * E-mail:
| |
Collapse
|