1
|
Johnson MR, Mallarino R. Genome-Wide Profiling of Cis-regulatory Elements in Mammalian Skin. Methods Mol Biol 2024; 2805:127-135. [PMID: 39008178 DOI: 10.1007/978-1-0716-3854-5_8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/16/2024]
Abstract
The modulation of cis-regulatory elements (e.g., enhancers and promoters) is a major mechanism by which gene expression can be controlled in a temporal and spatially restricted manner. However, methods for both identifying these elements and inferring their activity are limited and often require a substantial investment of time, money, and resources. Here, using mammalian skin as a model, we demonstrate a streamlined protocol by which these hurdles can be overcome using a novel chromatin profiling technique (CUT&RUN) to map histone modifications genome-wide. This protocol can be used to map the location and activity of putative cis-regulatory elements, providing mechanistic insight into how differential gene expression is controlled in mammalian tissues.
Collapse
Affiliation(s)
- Matthew R Johnson
- Department of Molecular Biology, Princeton University, Princeton, NJ, USA
| | - Ricardo Mallarino
- Department of Molecular Biology, Princeton University, Princeton, NJ, USA.
| |
Collapse
|
2
|
Song C, Li W, Wang Z. The Landscape of Liver Chromatin Accessibility and Conserved Non-coding Elements in Larimichthys crocea, Nibea albiflora, and Lateolabrax maculatus. MARINE BIOTECHNOLOGY (NEW YORK, N.Y.) 2022; 24:763-775. [PMID: 35895229 DOI: 10.1007/s10126-022-10142-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/29/2022] [Accepted: 07/08/2022] [Indexed: 06/15/2023]
Abstract
Large yellow croaker (Larimichthys crocea), yellow drum (Nibea albiflora), and Chinese seabass (Lateolabrax maculatus) are important economic marine fishes in China. The conserved non-coding elements (CNEs) in the liver tissues of the three kinds of fish are directly or indirectly involved in the regulation of gene expression and affect liver functions. However, the fishes' CNEs and even chromatin accessibility landscape have not been effectively investigated. Hence, this study established the landscapes of the fishes' genome-wide chromatin accessibility and CNEs by detecting regions of the open chromatin in their livers using an assay for transposase-accessible chromatin by high-throughput sequencing (ATAC-seq) and comparative genomics approach. The results showed that Smad1, Sp1, and Foxl1 transcription factor binding motifs were considerably enriched in the chromatin accessibility landscape in the liver of the three species, and the three transcription factors (TFs) had a wide range of common targets. The hypothetical gene set was targeted by one, two, or all three TFs, which was much higher than would be expected for an accidental outcome. The gene sets near the CNEs were mainly enriched through processes such as a macromolecule metabolic process and ribonucleoprotein complex biogenesis. The active CNEs were found in the promoter regions of genes such as ap1g1, hax1, and ndufs2. And 5 CNEs were predicted to be highly conserved active enhancers. These results demonstrated that Smad1, Sp1, and Foxl1 might be related to the liver function in the three fishes. In addition, we found a series of ATAC-seq-labeled CNEs located in the gene promoter regions, and highly conserved H3k27ac + -labeled CNEs located in the liver function genes. The highly conserved nature of these regulatory elements suggests that they play important roles in the liver in fish. This study mined the landscape of chromatin accessibility and CNEs of three important economic fishes to fill the knowledge gaps in this field. Moreover, the work provides useful data for the industrial application and theoretical research of these three fish species.
Collapse
Affiliation(s)
- Chaowei Song
- Key Laboratory of Healthy Mariculture for the East China Sea, Ministry of Agriculture and Rural Affairs, Jimei University, Xiamen, China
| | - Wanbo Li
- Key Laboratory of Healthy Mariculture for the East China Sea, Ministry of Agriculture and Rural Affairs, Jimei University, Xiamen, China
| | - Zhiyong Wang
- Key Laboratory of Healthy Mariculture for the East China Sea, Ministry of Agriculture and Rural Affairs, Jimei University, Xiamen, China.
- Laboratory for Marine Fisheries Science and Food Production Processes, National Laboratory for Marine Science and Technology, Qingdao, China.
| |
Collapse
|
3
|
Wang X, Aguirre L, Rodríguez-Leal D, Hendelman A, Benoit M, Lippman ZB. Dissecting cis-regulatory control of quantitative trait variation in a plant stem cell circuit. NATURE PLANTS 2021; 7:419-427. [PMID: 33846596 DOI: 10.1038/s41477-021-00898-x] [Citation(s) in RCA: 66] [Impact Index Per Article: 22.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/20/2021] [Accepted: 03/10/2021] [Indexed: 05/22/2023]
Abstract
Cis-regulatory mutations underlie important crop domestication and improvement traits1,2. However, limited allelic diversity has hindered functional dissection of the large number of cis-regulatory elements and their potential interactions, thereby precluding a deeper understanding of how cis-regulatory variation impacts traits quantitatively. Here, we engineered over 60 promoter alleles in two tomato fruit size genes3,4 to characterize cis-regulatory sequences and study their functional relationships. We found that targeted mutations in conserved promoter sequences of SlCLV3, a repressor of stem cell proliferation5,6, have a weak impact on fruit locule number. Pairwise combinations of these mutations mildly enhance this phenotype, revealing additive and synergistic relationships between conserved regions and further suggesting even higher-order cis-regulatory interactions within the SlCLV3 promoter. In contrast, SlWUS, a positive regulator of stem cell proliferation repressed by SlCLV3 (refs. 5,6), is more tolerant to promoter perturbations. Our results show that complex interplay among cis-regulatory variants can shape quantitative variation, and suggest that empirical dissections of this hidden complexity can guide promoter engineering to predictably modify crop traits.
Collapse
Affiliation(s)
- Xingang Wang
- Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA
| | - Lyndsey Aguirre
- Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA
- School of Biological Sciences, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA
| | - Daniel Rodríguez-Leal
- Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA
- Inari Agriculture, Cambridge, MA, USA
| | - Anat Hendelman
- Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA
| | - Matthias Benoit
- Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA
- Howard Hughes Medical Institute, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA
| | - Zachary B Lippman
- Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA.
- School of Biological Sciences, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA.
- Howard Hughes Medical Institute, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA.
| |
Collapse
|
4
|
Figiel DM, Elsayed R, Nelson AC. Investigating the molecular guts of endoderm formation using zebrafish. Brief Funct Genomics 2021:elab013. [PMID: 33754635 DOI: 10.1093/bfgp/elab013] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2020] [Revised: 01/27/2021] [Accepted: 02/19/2021] [Indexed: 02/07/2023] Open
Abstract
The vertebrate endoderm makes major contributions to the respiratory and gastrointestinal tracts and all associated organs. Zebrafish and humans share a high degree of genetic homology and strikingly similar endodermal organ systems. Combined with a multitude of experimental advantages, zebrafish are an attractive model organism to study endoderm development and disease. Recent functional genomics studies have shed considerable light on the gene regulatory programs governing early zebrafish endoderm development, while advances in biological and technological approaches stand to further revolutionize our ability to investigate endoderm formation, function and disease. Here, we discuss the present understanding of endoderm specification in zebrafish compared to other vertebrates, how current and emerging methods will allow refined and enhanced analysis of endoderm formation, and how integration with human data will allow modeling of the link between non-coding sequence variants and human disease.
Collapse
Affiliation(s)
- Daniela M Figiel
- Medical Research Council Doctoral Training Partnership in Interdisciplinary Biomedical Research at Warwick Medical School
| | - Randa Elsayed
- Medical Research Council Doctoral Training Partnership in Interdisciplinary Biomedical Research at Warwick Medical School
| | | |
Collapse
|
5
|
Hendelman A, Zebell S, Rodriguez-Leal D, Dukler N, Robitaille G, Wu X, Kostyun J, Tal L, Wang P, Bartlett ME, Eshed Y, Efroni I, Lippman ZB. Conserved pleiotropy of an ancient plant homeobox gene uncovered by cis-regulatory dissection. Cell 2021; 184:1724-1739.e16. [PMID: 33667348 DOI: 10.1016/j.cell.2021.02.001] [Citation(s) in RCA: 102] [Impact Index Per Article: 34.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2020] [Revised: 01/03/2021] [Accepted: 02/01/2021] [Indexed: 01/09/2023]
Abstract
Divergence of gene function is a hallmark of evolution, but assessing functional divergence over deep time is not trivial. The few alleles available for cross-species studies often fail to expose the entire functional spectrum of genes, potentially obscuring deeply conserved pleiotropic roles. Here, we explore the functional divergence of WUSCHEL HOMEOBOX9 (WOX9), suggested to have species-specific roles in embryo and inflorescence development. Using a cis-regulatory editing drive system, we generate a comprehensive allelic series in tomato, which revealed hidden pleiotropic roles for WOX9. Analysis of accessible chromatin and conserved cis-regulatory sequences identifies the regions responsible for this pleiotropic activity, the functions of which are conserved in groundcherry, a tomato relative. Mimicking these alleles in Arabidopsis, distantly related to tomato and groundcherry, reveals new inflorescence phenotypes, exposing a deeply conserved pleiotropy. We suggest that targeted cis-regulatory mutations can uncover conserved gene functions and reduce undesirable effects in crop improvement.
Collapse
Affiliation(s)
- Anat Hendelman
- Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA
| | - Sophia Zebell
- Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA; Howard Hughes Medical Institute, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA
| | | | - Noah Dukler
- Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA
| | - Gina Robitaille
- Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA; Howard Hughes Medical Institute, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA
| | - Xuelin Wu
- The Salk Institute for Biological Research, San Diego, CA, USA
| | - Jamie Kostyun
- Biology Department, University of Massachusetts Amherst, Amherst, MA, USA
| | - Lior Tal
- Department of Plant and Environmental Sciences, Weizmann Institute of Science, Rehovot, Israel
| | - Peipei Wang
- Institute of Plant Sciences and Genetics in Agriculture, The Robert H. Smith Faculty of Agriculture, The Hebrew University, Rehovot, Israel
| | | | - Yuval Eshed
- Department of Plant and Environmental Sciences, Weizmann Institute of Science, Rehovot, Israel
| | - Idan Efroni
- Institute of Plant Sciences and Genetics in Agriculture, The Robert H. Smith Faculty of Agriculture, The Hebrew University, Rehovot, Israel.
| | - Zachary B Lippman
- Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA; Howard Hughes Medical Institute, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA.
| |
Collapse
|
6
|
Hatleberg WL, Hinman VF. Modularity and hierarchy in biological systems: Using gene regulatory networks to understand evolutionary change. Curr Top Dev Biol 2021; 141:39-73. [DOI: 10.1016/bs.ctdb.2020.11.004] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
|
7
|
Duplications involving the long range HMX1 enhancer are associated with human isolated bilateral concha-type microtia. J Transl Med 2020; 18:244. [PMID: 32552830 PMCID: PMC7302384 DOI: 10.1186/s12967-020-02409-6] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2020] [Accepted: 06/05/2020] [Indexed: 02/08/2023] Open
Abstract
Background Microtia is a congenital anomaly of ear that ranges in severity from mild structural abnormalities to complete absence of the outer ears. Concha-type microtia is considered to be a mild form. The H6 family homeobox 1 transcription factor gene (HMX1) plays an important role in craniofacial structures development. Copy number variations (CNVs) of a downstream evolutionarily conserved enhancer region (ECR) of Hmx1 associated with ear and eye abnormalities have been reported in different animals, but not yet in human. To date, no genetic defects responsible for isolated human microtia has been reported except for mutations in HOXA2. Here we recruited five Chinese families with isolated bilateral concha-type microtia, and attempt to identify the underlying genetic causes. Methods Single Nucleotide polymorphism (SNP) array was performed to map the disease locus and detect CNVs on a genome scale primarily in the largest family (F1). Whole genome sequencing was performed to screen all SNVs and CNVs in the candidate disease locus. Array comparative genomic hybridization (aCGH) was then performed to detect CNVs in the other four families, F2-F5. Quantitative real-time polymerase chain reaction (qPCR) was used to validate and determine the extent of identified CNVs containing HMX1-ECR region. Precise breakpoints in F1 and F2 were identified by gap-PCR and sanger sequencing. Dual-luciferase assays were used to detect the enhancer function. qPCR assays were also used to detect HMX1-ECR CNVs in 61 patients with other types mictrotia. Results Linkage and haplotype analysis in F1 mapped the disease locus to a 1.9 Mb interval on 4p16.1 containing HMX1 and its downstream ECR region. Whole genome sequencing detected no potential pathogenic SNVs in coding regions of HMX1 or other genes within the candidate disease locus, but it detected a 94.6 Kb duplication in an intergenic region between HMX1 and CPZ. aCGH and qPCRs also revealed co-segregated duplications in intergenic region downstream of HMX1 in the other four families. The 21.8 Kb minimal overlapping region encompassing the core sequences consensus with mouse ECR of Hmx1. Luciferase assays confirmed the enhancer function in human sequences, and proved that HOXA2 could increase its enhancer activity. No CNVs were detected in HMX1-ECR regions in 61 patients with other type of microtia. Conclusion Duplications involving long range HMX1 enhancers are associated with human isolated bilateral concha-type microtia. We add to evidences in human that copy number variations in HMX1-ECR associates with ear malformations, as in other species. This study also provides an additional example of functional conserved non-coding elements (CNEs) in humans.
Collapse
|
8
|
A Systematic Analysis Revealed the Potential Gene Regulatory Processes of ATRA-Triggered Neuroblastoma Differentiation and Identified a Novel RA Response Sequence in the NTRK2 Gene. BIOMED RESEARCH INTERNATIONAL 2020; 2020:6734048. [PMID: 32149119 PMCID: PMC7053487 DOI: 10.1155/2020/6734048] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/30/2019] [Revised: 01/03/2020] [Accepted: 01/16/2020] [Indexed: 12/14/2022]
Abstract
Retinoic acid- (RA-) triggered neuroblastoma cell lines are widely used cell modules of neuronal differentiation in neurodegenerative disease studies, but the gene regulatory mechanism underlying differentiation is unclear now. In this study, system biological analysis was performed on public microarray data from three neuroblastoma cell lines (SK-N-SH, SH-SY5Y-A, and SH-SY5Y-E) to explore the potential molecular processes of all-trans retinoic acid- (ATRA-) triggered differentiation. RT-qPCR, functional genomics analysis, western blotting, chromatin immunoprecipitation (ChIP), and homologous sequence analysis were further performed to validate the gene regulation processes and identify the RA response element in a specific gene. The potential disturbed biological pathways (111 functional GO terms in 14 interactive functional groups) and gene regulatory network (10 regulators and 71 regulated genes) in neuroblastoma differentiation were obtained. 15 of the 71 regulated genes are neuronal projection-related. Among them, NTRK2 is the only one that was dramatically upregulated in the RT-qPCR test that we performed on ATRA-treated SH-SY5Y-A cells. We further found that the overexpression of the NTRK2 gene can trigger differentiation-like changes in SH-SY5Y-A cells. Functional genomic analysis and western blotting assay suggested that, in neuroblastoma cells, ATRA may directly regulate the NTRK2 gene by activating the RA receptor (RAR) that binds in its promoter region. A novel RA response DNA element in the NTRK2 gene was then identified by bioinformatics analysis and chromatin immunoprecipitation (ChIP) assay. The novel element is sequence conservation and position variation among different species. Our study systematically provided the potential regulatory information of ATRA-triggered neuroblastoma differentiation, and in the NTRK2 gene, we identified a novel RA response DNA element, which may contribute to the differentiation in a human-specific manner.
Collapse
|
9
|
Identification and Characterization of Cis-Regulatory Elements for Photoreceptor-Type-Specific Transcription in ZebraFish. Methods Mol Biol 2020; 2092:123-145. [PMID: 31786786 DOI: 10.1007/978-1-0716-0175-4_10] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/06/2022]
Abstract
Tissue-specific or cell-type-specific transcription of protein-coding genes is controlled by both trans-regulatory elements (TREs) and cis-regulatory elements (CREs). However, it is challenging to identify TREs and CREs, which are unknown for most genes. Here, we describe a protocol for identifying two types of transcription-activating CREs-core promoters and enhancers-of zebrafish photoreceptor type-specific genes. This protocol is composed of three phases: bioinformatic prediction, experimental validation, and characterization of the CREs. To better illustrate the principles and logic of this protocol, we exemplify it with the discovery of the core promoter and enhancer of the mpp5b apical polarity gene (also known as ponli), whose red, green, and blue (RGB) cone-specific transcription requires its enhancer, a member of the rainbow enhancer family. While exemplified with an RGB-cone-specific gene, this protocol is general and can be used to identify the core promoters and enhancers of other protein-coding genes.
Collapse
|
10
|
Sharma A, Basu U, Malik N, Daware A, Thakro V, Narnoliya L, Bajaj D, Tripathi S, Hegde VS, Upadhyaya HD, Tyagi AK, Parida SK. Genome-wide cis-regulatory signatures for modulation of agronomic traits as exemplified by drought yield index (DYI) in chickpea. Funct Integr Genomics 2019; 19:973-992. [PMID: 31177403 DOI: 10.1007/s10142-019-00691-2] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/27/2018] [Revised: 05/07/2019] [Accepted: 05/10/2019] [Indexed: 12/26/2022]
Abstract
Developing functional molecular tags from the cis-regulatory sequence components of genes is vital for their deployment in efficient genetic dissection of complex quantitative traits in crop plants including chickpea. The current study identified 431,194 conserved non-coding SNP (CNSNP) from the cis-regulatory element regions of genes which were annotated on a chickpea genome. These genome-wide CNSNP marker resources are made publicly accessible through a user-friendly web-database ( http://www.cnsnpcicarbase.com ). The CNSNP-based quantitative trait loci (QTL) and expression QTL (eQTL) mapping and genome-wide association study (GWAS) were further integrated with global gene expression landscapes, molecular haplotyping, and DNA-protein interaction study in the association panel and recombinant inbred lines (RIL) mapping population to decode complex genetic architecture of one of the vital seed yield trait under drought stress, drought yield index (DYI), in chickpea. This delineated two constituted natural haplotypes and alleles from a histone H3 protein-coding gene and its transcriptional regulator NAC transcription factor (TF) harboring the major QTLs and trans-acting eQTL governing DYI in chickpea. The effect of CNSNPs in TF-binding cis-element of a histone H3 gene in altering the binding affinity and transcriptional activity of NAC TF based on chromatin immunoprecipitation-quantitative PCR (ChIP-qPCR) assay was evident. The CNSNP-led promising molecular tags scanned will essentially have functional significance to decode transcriptional gene regulatory function and thus can drive translational genomic analysis in chickpea.
Collapse
Affiliation(s)
- Akash Sharma
- Genomics-Assisted Breeding and Crop Improvement Laboratory, National Institute of Plant Genome Research (NIPGR), Aruna Asaf Ali Marg, New Delhi, 110067, India
| | - Udita Basu
- Genomics-Assisted Breeding and Crop Improvement Laboratory, National Institute of Plant Genome Research (NIPGR), Aruna Asaf Ali Marg, New Delhi, 110067, India
| | - Naveen Malik
- Genomics-Assisted Breeding and Crop Improvement Laboratory, National Institute of Plant Genome Research (NIPGR), Aruna Asaf Ali Marg, New Delhi, 110067, India
| | - Anurag Daware
- Genomics-Assisted Breeding and Crop Improvement Laboratory, National Institute of Plant Genome Research (NIPGR), Aruna Asaf Ali Marg, New Delhi, 110067, India
| | - Virevol Thakro
- Genomics-Assisted Breeding and Crop Improvement Laboratory, National Institute of Plant Genome Research (NIPGR), Aruna Asaf Ali Marg, New Delhi, 110067, India
| | - Laxmi Narnoliya
- Genomics-Assisted Breeding and Crop Improvement Laboratory, National Institute of Plant Genome Research (NIPGR), Aruna Asaf Ali Marg, New Delhi, 110067, India
| | - Deepak Bajaj
- Genomics-Assisted Breeding and Crop Improvement Laboratory, National Institute of Plant Genome Research (NIPGR), Aruna Asaf Ali Marg, New Delhi, 110067, India
| | - Shailesh Tripathi
- Division of Genetics, Indian Agricultural Research Institute (IARI), New Delhi, 110012, India
| | - V S Hegde
- Division of Genetics, Indian Agricultural Research Institute (IARI), New Delhi, 110012, India
| | - Hari D Upadhyaya
- International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Patancheru, Telangana, 502324, India
| | - Akhilesh K Tyagi
- Genomics-Assisted Breeding and Crop Improvement Laboratory, National Institute of Plant Genome Research (NIPGR), Aruna Asaf Ali Marg, New Delhi, 110067, India.,Department of Plant Molecular Biology, University of Delhi, South Campus, New Delhi, 110021, India
| | - Swarup K Parida
- Genomics-Assisted Breeding and Crop Improvement Laboratory, National Institute of Plant Genome Research (NIPGR), Aruna Asaf Ali Marg, New Delhi, 110067, India.
| |
Collapse
|
11
|
Fico A, Fiorenzano A, Pascale E, Patriarca EJ, Minchiotti G. Long non-coding RNA in stem cell pluripotency and lineage commitment: functions and evolutionary conservation. Cell Mol Life Sci 2019; 76:1459-1471. [PMID: 30607432 PMCID: PMC6439142 DOI: 10.1007/s00018-018-3000-z] [Citation(s) in RCA: 62] [Impact Index Per Article: 12.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2018] [Revised: 11/13/2018] [Accepted: 12/17/2018] [Indexed: 02/07/2023]
Abstract
LncRNAs have recently emerged as new and fundamental transcriptional and post-transcriptional regulators acting at multiple levels of gene expression. Indeed, lncRNAs participate in a wide variety of stem cell and developmental processes, acting in cis and/or in trans in the nuclear and/or in the cytoplasmic compartments, and generating an intricate network of interactions with RNAs, enhancers, and chromatin-modifier complexes. Given the versatility of these molecules to operate in different subcellular compartments, via different modes of action and with different target specificity, the interest in this research field is rapidly growing. Here, we review recent progress in defining the functional role of lncRNAs in stem cell biology with a specific focus on the underlying mechanisms. We also discuss recent findings on a new family of evolutionary conserved lncRNAs transcribed from ultraconserved elements, which show perfect conservation between human, mouse, and rat genomes, and that are emerging as new player in this complex scenario.
Collapse
Affiliation(s)
- Annalisa Fico
- Stem Cell Fate Laboratory, Institute of Genetics and Biophysics "A. Buzzati-Traverso", CNR, 80131, Naples, Italy.
- Institute of Genetics and Biophysics "A. Buzzati-Traverso", CNR, 80131, Naples, Italy.
| | - Alessandro Fiorenzano
- Stem Cell Fate Laboratory, Institute of Genetics and Biophysics "A. Buzzati-Traverso", CNR, 80131, Naples, Italy
- Institute of Genetics and Biophysics "A. Buzzati-Traverso", CNR, 80131, Naples, Italy
- Developmental and Regenerative Neurobiology, Wallenberg Neuroscience Center, and Lund Stem Cell Centre, Department of Experimental Medical Science, Lund University, 22184, Lund, Sweden
| | - Emilia Pascale
- Stem Cell Fate Laboratory, Institute of Genetics and Biophysics "A. Buzzati-Traverso", CNR, 80131, Naples, Italy
- Institute of Genetics and Biophysics "A. Buzzati-Traverso", CNR, 80131, Naples, Italy
| | - Eduardo Jorge Patriarca
- Stem Cell Fate Laboratory, Institute of Genetics and Biophysics "A. Buzzati-Traverso", CNR, 80131, Naples, Italy
- Institute of Genetics and Biophysics "A. Buzzati-Traverso", CNR, 80131, Naples, Italy
| | - Gabriella Minchiotti
- Stem Cell Fate Laboratory, Institute of Genetics and Biophysics "A. Buzzati-Traverso", CNR, 80131, Naples, Italy
- Institute of Genetics and Biophysics "A. Buzzati-Traverso", CNR, 80131, Naples, Italy
| |
Collapse
|
12
|
Alexandre CM, Urton JR, Jean-Baptiste K, Huddleston J, Dorrity MW, Cuperus JT, Sullivan AM, Bemm F, Jolic D, Arsovski AA, Thompson A, Nemhauser JL, Fields S, Weigel D, Bubb KL, Queitsch C. Complex Relationships between Chromatin Accessibility, Sequence Divergence, and Gene Expression in Arabidopsis thaliana. Mol Biol Evol 2019; 35:837-854. [PMID: 29272536 DOI: 10.1093/molbev/msx326] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open
Abstract
Variation in regulatory DNA is thought to drive phenotypic variation, evolution, and disease. Prior studies of regulatory DNA and transcription factors across animal species highlighted a fundamental conundrum: Transcription factor binding domains and cognate binding sites are conserved, while regulatory DNA sequences are not. It remains unclear how conserved transcription factors and dynamic regulatory sites produce conserved expression patterns across species. Here, we explore regulatory DNA variation and its functional consequences within Arabidopsis thaliana, using chromatin accessibility to delineate regulatory DNA genome-wide. Unlike in previous cross-species comparisons, the positional homology of regulatory DNA is maintained among A. thaliana ecotypes and less nucleotide divergence has occurred. Of the ∼50,000 regulatory sites in A. thaliana, we found that 15% varied in accessibility among ecotypes. Some of these accessibility differences were associated with extensive, previously unannotated sequence variation, encompassing many deletions and ancient hypervariable alleles. Unexpectedly, for the majority of such regulatory sites, nearby gene expression was unaffected. Nevertheless, regulatory sites with high levels of sequence variation and differential chromatin accessibility were the most likely to be associated with differential gene expression. Finally, and most surprising, we found that the vast majority of differentially accessible sites show no underlying sequence variation. We argue that these surprising results highlight the necessity to consider higher-order regulatory context in evaluating regulatory variation and predicting its phenotypic consequences.
Collapse
Affiliation(s)
| | - James R Urton
- Department of Genome Sciences, University of Washington, Seattle, WA
| | - Ken Jean-Baptiste
- Department of Genome Sciences, University of Washington, Seattle, WA
| | - John Huddleston
- Department of Genome Sciences, University of Washington, Seattle, WA.,Molecular and Cellular Biology Graduate Program, University of Washington, Seattle, WA
| | - Michael W Dorrity
- Department of Genome Sciences, University of Washington, Seattle, WA
| | - Josh T Cuperus
- Department of Genome Sciences, University of Washington, Seattle, WA
| | | | - Felix Bemm
- Department of Molecular Biology, Max Planck Institute for Developmental Biology, Tübingen, Germany
| | - Dino Jolic
- Department of Molecular Biology, Max Planck Institute for Developmental Biology, Tübingen, Germany
| | | | | | | | - Stan Fields
- Department of Genome Sciences, University of Washington, Seattle, WA.,Howard Hughes Medical Institute, University of Washington, Seattle, WA
| | - Detlef Weigel
- Department of Molecular Biology, Max Planck Institute for Developmental Biology, Tübingen, Germany
| | - Kerry L Bubb
- Department of Genome Sciences, University of Washington, Seattle, WA
| | - Christin Queitsch
- Department of Genome Sciences, University of Washington, Seattle, WA
| |
Collapse
|
13
|
Exaptation at the molecular genetic level. SCIENCE CHINA-LIFE SCIENCES 2018; 62:437-452. [PMID: 30798493 DOI: 10.1007/s11427-018-9447-8] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/10/2018] [Accepted: 12/01/2018] [Indexed: 12/22/2022]
Abstract
The realization that body parts of animals and plants can be recruited or coopted for novel functions dates back to, or even predates the observations of Darwin. S.J. Gould and E.S. Vrba recognized a mode of evolution of characters that differs from adaptation. The umbrella term aptation was supplemented with the concept of exaptation. Unlike adaptations, which are restricted to features built by selection for their current role, exaptations are features that currently enhance fitness, even though their present role was not a result of natural selection. Exaptations can also arise from nonaptations; these are characters which had previously been evolving neutrally. All nonaptations are potential exaptations. The concept of exaptation was expanded to the molecular genetic level which aided greatly in understanding the enormous potential of neutrally evolving repetitive DNA-including transposed elements, formerly considered junk DNA-for the evolution of genes and genomes. The distinction between adaptations and exaptations is outlined in this review and examples are given. Also elaborated on is the fact that such distinctions are sometimes more difficult to determine; this is a widespread phenomenon in biology, where continua abound and clear borders between states and definitions are rare.
Collapse
|
14
|
Vega WHO, Quirino CR, Bartholazzi-Junior A, Rua MAS, Serapião RV, Oliveira CS. Variants in the CYP19A1 gene can affect in vitro embryo production traits in cattle. J Assist Reprod Genet 2018; 35:2233-2241. [PMID: 30232641 DOI: 10.1007/s10815-018-1320-4] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2018] [Accepted: 09/13/2018] [Indexed: 11/30/2022] Open
Abstract
PURPOSE This study aimed to associate DNA variants in promoter and exon flanking regions of the CYP19A1 gene with in vitro embryo production traits in cattle. The role of transcription factor binding sites created or lost due to DNA sequence variation and their possible effect on gene expression was also evaluated. METHODS We collected date from Gyr dairy oocyte donor cows (Bos taurus indicus) at a commercial in vitro embryo production farm and analyzed the genotype-phenotype association with in vitro production traits. Using Sanger sequencing and web-based software, we assessed important CYP19A1 gene regions in oocyte donor cows and analyzed the effects of variants on the transcription factor binding sites. RESULTS Two SNP mutations significantly associated with oocyte production, oocyte viability, embryo development, and pregnancies were found (T > C in the untranslated exon 1 flanking region ([GenBank: AJ250379.1]: rs718446508 T > C), and a T > C in the 5'-upstream region (1.1 promoter) ([GenBank: AC_000167.1]: rs41651668 T > C). Six new transcription factor binding sites were created. A binding site for transcription factors associated with the development of the placenta and embryo implantation was eliminated due to variations in the DNA sequence identified. CONCLUSIONS The CYP19A1 gene contributes to genetic variation of in vitro embryo production traits in cattle. The complexity of the physiological phenomena related to estrogen pathways and their influence on reproduction in cattle allow indication of the mutations evaluated here as possible genetic markers for embryo production traits, which should be validated in the next steps of marker-assisted selection.
Collapse
Affiliation(s)
- Wilder Hernando Ortiz Vega
- Laboratory for Animal Breeding and Genetic Improvement, Norte Fluminense State University, Av. Alberto Lamego, 2000, Campos dos Goytacazes, Rio de Janeiro, 28013-602, Brazil.
| | - Celia Raquel Quirino
- Laboratory for Animal Breeding and Genetic Improvement, Norte Fluminense State University, Av. Alberto Lamego, 2000, Campos dos Goytacazes, Rio de Janeiro, 28013-602, Brazil
| | - Aylton Bartholazzi-Junior
- Laboratory for Animal Breeding and Genetic Improvement, Norte Fluminense State University, Av. Alberto Lamego, 2000, Campos dos Goytacazes, Rio de Janeiro, 28013-602, Brazil
| | - Miguel Alejandro Silva Rua
- Laboratory for Animal Breeding and Genetic Improvement, Norte Fluminense State University, Av. Alberto Lamego, 2000, Campos dos Goytacazes, Rio de Janeiro, 28013-602, Brazil
| | - Raquel Varella Serapião
- PESAGRO-RIO, Laboratory for Animal Reproduction, Santa Mônica Experimental Farm (CESM), Valença, Rio de Janeiro, Brazil
| | - Clara Slade Oliveira
- Embrapa Dairy Cattle Research Unit, Laboratory for Animal Reproduction, Santa Mônica Experimental Farm (CESM), Valença, Rio de Janeiro, Brazil
| |
Collapse
|
15
|
Liang P, Saqib HSA, Zhang X, Zhang L, Tang H. Single-Base Resolution Map of Evolutionary Constraints and Annotation of Conserved Elements across Major Grass Genomes. Genome Biol Evol 2018; 10:473-488. [PMID: 29378032 PMCID: PMC5798027 DOI: 10.1093/gbe/evy006] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 01/08/2018] [Indexed: 12/20/2022] Open
Abstract
Conserved noncoding sequences (CNSs) are evolutionarily conserved DNA sequences that do not encode proteins but may have potential regulatory roles in gene expression. CNS in crop genomes could be linked to many important agronomic traits and ecological adaptations. Compared with the relatively mature exon annotation protocols, efficient methods are lacking to predict the location of noncoding sequences in the plant genomes. We implemented a computational pipeline that is tailored to the comparisons of plant genomes, yielding a large number of conserved sequences using rice genome as the reference. In this study, we used 17 published grass genomes, along with five monocot genomes as well as the basal angiosperm genome of Amborella trichopoda. Genome alignments among these genomes suggest that at least 12.05% of the rice genome appears to be evolving under constraints in the Poaceae lineage, with close to half of the evolutionarily constrained sequences located outside protein-coding regions. We found evidence for purifying selection acting on the conserved sequences by analyzing segregating SNPs within the rice population. Furthermore, we found that known functional motifs were significantly enriched within CNS, with many motifs associated with the preferred binding of ubiquitous transcription factors. The conserved elements that we have curated are accessible through our public database and the JBrowse server. In-depth functional annotations and evolutionary dynamics of the identified conserved sequences provide a solid foundation for studying gene regulation, genome evolution, as well as to inform gene isolation for cereal biologists.
Collapse
Affiliation(s)
- Pingping Liang
- Key Laboratory of Genetics, Breeding and Multiple Utilization of Corps, Center for Genomics and Biotechnology, Ministry of Education; Fujian Provincial Key Laboratory of Haixia Applied Plant Systems Biology, Fujian Agriculture and Forestry University, Fuzhou, China
- Key Laboratory of the Ministry of Education for Coastal and Wetland Ecosystems, College of the Environment and Ecology, Xiamen University, China
| | - Hafiz Sohaib Ahmed Saqib
- Institute of Applied Ecology, Fujian Agriculture and Forestry University, Fuzhou, China
- State Key Laboratory of Ecological Pest Control for Fujian and Taiwan Crops, Fujian Agriculture and Forestry University, Fuzhou, China
| | - Xingtan Zhang
- Key Laboratory of Genetics, Breeding and Multiple Utilization of Corps, Center for Genomics and Biotechnology, Ministry of Education; Fujian Provincial Key Laboratory of Haixia Applied Plant Systems Biology, Fujian Agriculture and Forestry University, Fuzhou, China
| | - Liangsheng Zhang
- Key Laboratory of Genetics, Breeding and Multiple Utilization of Corps, Center for Genomics and Biotechnology, Ministry of Education; Fujian Provincial Key Laboratory of Haixia Applied Plant Systems Biology, Fujian Agriculture and Forestry University, Fuzhou, China
| | - Haibao Tang
- Key Laboratory of Genetics, Breeding and Multiple Utilization of Corps, Center for Genomics and Biotechnology, Ministry of Education; Fujian Provincial Key Laboratory of Haixia Applied Plant Systems Biology, Fujian Agriculture and Forestry University, Fuzhou, China
| |
Collapse
|
16
|
Markakis K, De Las Heras A, Elfick A. Analytical approach for the calculation of promoter activities based on fluorescent protein expression data. ENGINEERING BIOLOGY 2017. [DOI: 10.1049/enb.2017.0002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022] Open
Affiliation(s)
- Konstantinos Markakis
- School of Engineering, Institute of Bioengineering The University of Edinburgh Edinburgh EH9 3DW UK
- SynthSys – Synthetic and Systems Biology Research Centre Edinburgh EH9 3BF UK
| | - Aitor De Las Heras
- School of Engineering, Institute of Bioengineering The University of Edinburgh Edinburgh EH9 3DW UK
- SynthSys – Synthetic and Systems Biology Research Centre Edinburgh EH9 3BF UK
| | - Alistair Elfick
- School of Engineering, Institute of Bioengineering The University of Edinburgh Edinburgh EH9 3DW UK
- SynthSys – Synthetic and Systems Biology Research Centre Edinburgh EH9 3BF UK
| |
Collapse
|
17
|
Mutations in ACTRT1 and its enhancer RNA elements lead to aberrant activation of Hedgehog signaling in inherited and sporadic basal cell carcinomas. Nat Med 2017; 23:1226-1233. [PMID: 28869610 DOI: 10.1038/nm.4368] [Citation(s) in RCA: 49] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2017] [Accepted: 06/14/2017] [Indexed: 12/19/2022]
Abstract
Basal cell carcinoma (BCC), the most common human cancer, results from aberrant activation of the Hedgehog signaling pathway. Although most cases of BCC are sporadic, some forms are inherited, such as Bazex-Dupré-Christol syndrome (BDCS)-a cancer-prone genodermatosis with an X-linked, dominant inheritance pattern. We have identified mutations in the ACTRT1 gene, which encodes actin-related protein T1 (ARP-T1), in two of the six families with BDCS that were examined in this study. High-throughput sequencing in the four remaining families identified germline mutations in noncoding sequences surrounding ACTRT1. These mutations were located in transcribed sequences encoding enhancer RNAs (eRNAs) and were shown to impair enhancer activity and ACTRT1 expression. ARP-T1 was found to directly bind to the GLI1 promoter, thus inhibiting GLI1 expression, and loss of ARP-T1 led to activation of the Hedgehog pathway in individuals with BDCS. Moreover, exogenous expression of ACTRT1 reduced the in vitro and in vivo proliferation rates of cell lines with aberrant activation of the Hedgehog signaling pathway. In summary, our study identifies a disease mechanism in BCC involving mutations in regulatory noncoding elements and uncovers the tumor-suppressor properties of ACTRT1.
Collapse
|
18
|
Lickwar CR, Camp JG, Weiser M, Cocchiaro JL, Kingsley DM, Furey TS, Sheikh SZ, Rawls JF. Genomic dissection of conserved transcriptional regulation in intestinal epithelial cells. PLoS Biol 2017; 15:e2002054. [PMID: 28850571 PMCID: PMC5574553 DOI: 10.1371/journal.pbio.2002054] [Citation(s) in RCA: 62] [Impact Index Per Article: 8.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2017] [Accepted: 07/31/2017] [Indexed: 12/17/2022] Open
Abstract
The intestinal epithelium serves critical physiologic functions that are shared among all vertebrates. However, it is unknown how the transcriptional regulatory mechanisms underlying these functions have changed over the course of vertebrate evolution. We generated genome-wide mRNA and accessible chromatin data from adult intestinal epithelial cells (IECs) in zebrafish, stickleback, mouse, and human species to determine if conserved IEC functions are achieved through common transcriptional regulation. We found evidence for substantial common regulation and conservation of gene expression regionally along the length of the intestine from fish to mammals and identified a core set of genes comprising a vertebrate IEC signature. We also identified transcriptional start sites and other putative regulatory regions that are differentially accessible in IECs in all 4 species. Although these sites rarely showed sequence conservation from fish to mammals, surprisingly, they drove highly conserved IEC expression in a zebrafish reporter assay. Common putative transcription factor binding sites (TFBS) found at these sites in multiple species indicate that sequence conservation alone is insufficient to identify much of the functionally conserved IEC regulatory information. Among the rare, highly sequence-conserved, IEC-specific regulatory regions, we discovered an ancient enhancer upstream from her6/HES1 that is active in a distinct population of Notch-positive cells in the intestinal epithelium. Together, these results show how combining accessible chromatin and mRNA datasets with TFBS prediction and in vivo reporter assays can reveal tissue-specific regulatory information conserved across 420 million years of vertebrate evolution. We define an IEC transcriptional regulatory network that is shared between fish and mammals and establish an experimental platform for studying how evolutionarily distilled regulatory information commonly controls IEC development and physiology. The epithelium lining the intestine is an ancient animal tissue that serves as a primary site of nutrient absorption and interaction with microbiota. Its formation and function require complex patterns of gene transcription that vary along the intestine and in specialized intestinal epithelial cell (IEC) subtypes. However, it is unknown how the underlying transcriptional regulatory mechanisms have changed over the course of vertebrate evolution. Here, we used genome-wide profiling of mRNA levels and chromatin accessibility to identify conserved IEC genes and regulatory regions in 4 vertebrate species (zebrafish, stickleback, mouse, and human) separated from a common ancestor by 420 million years. We identified substantial similarities in genes expressed along the vertebrate intestine. These data disclosed putative conserved transcription factor binding sites (TFBS) enriched in accessible chromatin near IEC genes and in regulatory sites with accessibility restricted to IECs. Fluorescent reporter assays in transparent zebrafish showed that these regions, which frequently lacked sequence conservation, were still capable of driving conserved expression patterns. We also found a highly conserved region near mammalian and fish hes1 sufficient to drive expression in a specific population of IECs with active Notch signaling. These results establish a platform to define the conserved transcriptional networks underlying vertebrate IEC physiology.
Collapse
Affiliation(s)
- Colin R. Lickwar
- Department of Molecular Genetics and Microbiology, Center for the Genomics of Microbial Systems, Duke University, Durham, North Carolina, United States of America
- Department of Cell Biology and Physiology, Center for Gastrointestinal Biology and Disease, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
| | - J. Gray Camp
- Department of Cell Biology and Physiology, Center for Gastrointestinal Biology and Disease, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
- Department of Developmental Biology, Stanford University, Stanford, California, United States of America
| | - Matthew Weiser
- Departments of Genetics and Biology, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
| | - Jordan L. Cocchiaro
- Department of Molecular Genetics and Microbiology, Center for the Genomics of Microbial Systems, Duke University, Durham, North Carolina, United States of America
- Department of Cell Biology and Physiology, Center for Gastrointestinal Biology and Disease, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
| | - David M. Kingsley
- Department of Developmental Biology, Stanford University, Stanford, California, United States of America
| | - Terrence S. Furey
- Departments of Genetics and Biology, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
| | - Shehzad Z. Sheikh
- Department of Medicine, Center for Gastrointestinal Biology and Disease, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
| | - John F. Rawls
- Department of Molecular Genetics and Microbiology, Center for the Genomics of Microbial Systems, Duke University, Durham, North Carolina, United States of America
- Department of Cell Biology and Physiology, Center for Gastrointestinal Biology and Disease, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
- * E-mail:
| |
Collapse
|
19
|
Meyer KA, Marques-Bonet T, Sestan N. Differential Gene Expression in the Human Brain Is Associated with Conserved, but Not Accelerated, Noncoding Sequences. Mol Biol Evol 2017; 34:1217-1229. [PMID: 28204568 PMCID: PMC5400397 DOI: 10.1093/molbev/msx076] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023] Open
Abstract
Previous studies have found that genes which are differentially expressed within the developing human brain disproportionately neighbor conserved noncoding sequences (CNSs) that have an elevated substitution rate in humans and in other species. One explanation for this general association of differential expression with accelerated CNSs is that genes with pre-existing patterns of differential expression have been preferentially targeted by species-specific regulatory changes. Here we provide support for an alternative explanation: genes that neighbor a greater number of CNSs have a higher probability of differential expression and a higher probability of neighboring a CNS with lineage-specific acceleration. Thus, neighboring an accelerated element from any species signals that a gene likely neighbors many CNSs. We extend the analyses beyond the prenatal time points considered in previous studies to demonstrate that this association persists across developmental and adult periods. Examining differential expression between non-neural tissues suggests that the relationship between the number of CNSs a gene neighbors and its differential expression status may be particularly strong for expression differences among brain regions. In addition, by considering this relationship, we highlight a recently defined set of putative human-specific gain-of-function sequences that, even after adjusting for the number of CNSs neighbored by genes, shows a positive relationship with upregulation in the brain compared with other tissues examined.
Collapse
Affiliation(s)
- Kyle A. Meyer
- Department of Neuroscience and Kavli Institute for Neuroscience, Yale School of Medicine, New Haven, CT
| | - Tomas Marques-Bonet
- Institute of Evolutionary Biology (UPF-CSIC), PRBB, Barcelona, Spain
- Catalan Institution of Research and Advanced Studies (ICREA), Passeig de Lluís Companys, Barcelona, Spain
- CNAG-CRG, Centre for Genomic Regulation (CRG), Barcelona Institute of Science and Technology (BIST), Barcelona, Spain
| | - Nenad Sestan
- Department of Neuroscience and Kavli Institute for Neuroscience, Yale School of Medicine, New Haven, CT
- Departments of Genetics and Psychiatry, Section of Comparative Medicine, Program in Cellular Neuroscience, Neurodegeneration and Repair, and Yale Child Study Center, Yale School of Medicine, New Haven, CT
| |
Collapse
|
20
|
Kamstra JH, Sales LB, Aleström P, Legler J. Differential DNA methylation at conserved non-genic elements and evidence for transgenerational inheritance following developmental exposure to mono(2-ethylhexyl) phthalate and 5-azacytidine in zebrafish. Epigenetics Chromatin 2017; 10:20. [PMID: 28413451 PMCID: PMC5389146 DOI: 10.1186/s13072-017-0126-4] [Citation(s) in RCA: 39] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2016] [Accepted: 04/04/2017] [Indexed: 02/07/2023] Open
Abstract
BACKGROUND Exposure to environmental stressors during development may lead to latent and transgenerational adverse health effects. To understand the role of DNA methylation in these effects, we used zebrafish as a vertebrate model to investigate heritable changes in DNA methylation following chemical-induced stress during early development. We exposed zebrafish embryos to non-embryotoxic concentrations of the biologically active phthalate metabolite mono(2-ethylhexyl) phthalate (MEHP, 30 µM) and the DNA methyltransferase 1 inhibitor 5-azacytidine (5AC, 10 µM). Direct, latent and transgenerational effects on DNA methylation were assessed using global, genome-wide and locus-specific DNA methylation analyses. RESULTS Following direct exposure in zebrafish embryos from 0 to 6 days post-fertilization, genome-wide analysis revealed a multitude of differentially methylated regions, strongly enriched at conserved non-genic elements for both compounds. Pathways involved in adipogenesis were enriched with the putative obesogenic compound MEHP. Exposure to 5AC resulted in enrichment of pathways involved in embryonic development and transgenerational effects on larval body length. Locus-specific methylation analysis of 10 differentially methylated sites revealed six of these loci differentially methylated in sperm sampled from adult zebrafish exposed during development to 5AC, and in first and second generation larvae. With MEHP, consistent changes were found at 2 specific loci in first and second generation larvae. CONCLUSIONS Our results suggest a functional role for DNA methylation on cis-regulatory conserved elements following developmental exposure to compounds. Effects on these regions are potentially transferred to subsequent generations.
Collapse
Affiliation(s)
- Jorke H. Kamstra
- Faculty of Veterinary Medicine, Department of Basic Sciences and Aquatic Medicine, CoE CERAD, Norwegian University of Life Sciences, P.O. Box 8146 Dep., 0033 Oslo, Norway
| | - Liana Bastos Sales
- Institute for Environmental Studies, VU University Amsterdam, Amsterdam, The Netherlands
| | - Peter Aleström
- Faculty of Veterinary Medicine, Department of Basic Sciences and Aquatic Medicine, CoE CERAD, Norwegian University of Life Sciences, P.O. Box 8146 Dep., 0033 Oslo, Norway
| | - Juliette Legler
- Institute for Environmental Studies, VU University Amsterdam, Amsterdam, The Netherlands
- Institute for Environment, Health and Societies, College of Health and Life Sciences, Brunel University London, Uxbridge, UK
| |
Collapse
|
21
|
Drevinge C, Dalen KT, Mannila MN, Täng MS, Ståhlman M, Klevstig M, Lundqvist A, Mardani I, Haugen F, Fogelstrand P, Adiels M, Asin-Cayuela J, Ekestam C, Gådin JR, Lee YK, Nebb H, Svedlund S, Johansson BR, Hultén LM, Romeo S, Redfors B, Omerovic E, Levin M, Gan LM, Eriksson P, Andersson L, Ehrenborg E, Kimmel AR, Borén J, Levin MC. Perilipin 5 is protective in the ischemic heart. Int J Cardiol 2016; 219:446-54. [DOI: 10.1016/j.ijcard.2016.06.037] [Citation(s) in RCA: 25] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/18/2016] [Accepted: 06/12/2016] [Indexed: 10/21/2022]
|
22
|
Yue JX, Kozmikova I, Ono H, Nossa CW, Kozmik Z, Putnam NH, Yu JK, Holland LZ. Conserved Noncoding Elements in the Most Distant Genera of Cephalochordates: The Goldilocks Principle. Genome Biol Evol 2016; 8:2387-405. [PMID: 27412606 PMCID: PMC5010895 DOI: 10.1093/gbe/evw158] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023] Open
Abstract
Cephalochordates, the sister group of vertebrates + tunicates, are evolving particularly slowly. Therefore, genome comparisons between two congeners of Branchiostoma revealed so many conserved noncoding elements (CNEs), that it was not clear how many are functional regulatory elements. To more effectively identify CNEs with potential regulatory functions, we compared noncoding sequences of genomes of the most phylogenetically distant cephalochordate genera, Asymmetron and Branchiostoma, which diverged approximately 120-160 million years ago. We found 113,070 noncoding elements conserved between the two species, amounting to 3.3% of the genome. The genomic distribution, target gene ontology, and enriched motifs of these CNEs all suggest that many of them are probably cis-regulatory elements. More than 90% of previously verified amphioxus regulatory elements were re-captured in this study. A search of the cephalochordate CNEs around 50 developmental genes in several vertebrate genomes revealed eight CNEs conserved between cephalochordates and vertebrates, indicating sequence conservation over >500 million years of divergence. The function of five CNEs was tested in reporter assays in zebrafish, and one was also tested in amphioxus. All five CNEs proved to be tissue-specific enhancers. Taken together, these findings indicate that even though Branchiostoma and Asymmetron are distantly related, as they are evolving slowly, comparisons between them are likely optimal for identifying most of their tissue-specific cis-regulatory elements laying the foundation for functional characterizations and a better understanding of the evolution of developmental regulation in cephalochordates.
Collapse
Affiliation(s)
- Jia-Xing Yue
- Biosciences at Rice, Rice University, Houston, Texas Present address: Institute for Research on Cancer and Aging, Nice (IRCAN), CNRS UMR 7284, INSERM U1081, Nice 06107 France
| | - Iryna Kozmikova
- Department of Transcriptional Regulation, Institute of Molecular Genetics, Prague 14220, Czech Republic
| | - Hiroki Ono
- Marine Biology Research Division, Scripps Institution of Oceanography, UC San Diego, La Jolla, California
| | - Carlos W Nossa
- Biosciences at Rice, Rice University, Houston, Texas Present address: Gene by Gene Ltd., Houston, TX 77008
| | - Zbynek Kozmik
- Department of Transcriptional Regulation, Institute of Molecular Genetics, Prague 14220, Czech Republic
| | - Nicholas H Putnam
- Biosciences at Rice, Rice University, Houston, Texas Present address: Dovetail Genomics, Santa Cruz, CA 95060
| | - Jr-Kai Yu
- Institute of Cellular and Organismic Biology, Academia Sinica, Taipei, Taiwan
| | - Linda Z Holland
- Marine Biology Research Division, Scripps Institution of Oceanography, UC San Diego, La Jolla, California
| |
Collapse
|
23
|
Hoffmann RD, Palmgren M. Purifying selection acts on coding and non-coding sequences of paralogous genes in Arabidopsis thaliana. BMC Genomics 2016; 17:456. [PMID: 27296049 PMCID: PMC4906602 DOI: 10.1186/s12864-016-2803-2] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2015] [Accepted: 05/27/2016] [Indexed: 01/13/2023] Open
Abstract
Background Whole-genome duplications in the ancestors of many diverse species provided the genetic material for evolutionary novelty. Several models explain the retention of paralogous genes. However, how these models are reflected in the evolution of coding and non-coding sequences of paralogous genes is unknown. Results Here, we analyzed the coding and non-coding sequences of paralogous genes in Arabidopsis thaliana and compared these sequences with those of orthologous genes in Arabidopsis lyrata. Paralogs with lower expression than their duplicate had more nonsynonymous substitutions, were more likely to fractionate, and exhibited less similar expression patterns with their orthologs in the other species. Also, lower-expressed genes had greater tissue specificity. Orthologous conserved non-coding sequences in the promoters, introns, and 3′ untranslated regions were less abundant at lower-expressed genes compared to their higher-expressed paralogs. A gene ontology (GO) term enrichment analysis showed that paralogs with similar expression levels were enriched in GO terms related to ribosomes, whereas paralogs with different expression levels were enriched in terms associated with stress responses. Conclusions Loss of conserved non-coding sequences in one gene of a paralogous gene pair correlates with reduced expression levels that are more tissue specific. Together with increased mutation rates in the coding sequences, this suggests that similar forces of purifying selection act on coding and non-coding sequences. We propose that coding and non-coding sequences evolve concurrently following gene duplication. Electronic supplementary material The online version of this article (doi:10.1186/s12864-016-2803-2) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Robert D Hoffmann
- Center for Membrane Pumps in Cells and Disease - PUMPKIN, Danish National Research Foundation, Department of Plant and Environmental Sciences, University of Copenhagen, 1871, Frederiksberg C, Denmark.
| | - Michael Palmgren
- Center for Membrane Pumps in Cells and Disease - PUMPKIN, Danish National Research Foundation, Department of Plant and Environmental Sciences, University of Copenhagen, 1871, Frederiksberg C, Denmark
| |
Collapse
|
24
|
Sciamanna I, De Luca C, Spadafora C. The Reverse Transcriptase Encoded by LINE-1 Retrotransposons in the Genesis, Progression, and Therapy of Cancer. Front Chem 2016; 4:6. [PMID: 26904537 PMCID: PMC4749692 DOI: 10.3389/fchem.2016.00006] [Citation(s) in RCA: 33] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2015] [Accepted: 01/26/2016] [Indexed: 12/24/2022] Open
Abstract
In higher eukaryotic genomes, Long Interspersed Nuclear Element 1 (LINE-1) retrotransposons represent a large family of repeated genomic elements. They transpose using a reverse transcriptase (RT), which they encode as part of the ORF2p product. RT inhibition in cancer cells, either via RNA interference-dependent silencing of active LINE-1 elements, or using RT inhibitory drugs, reduces cancer cell proliferation, promotes their differentiation and antagonizes tumor progression in animal models. Indeed, the non-nucleoside RT inhibitor efavirenz has recently been tested in a phase II clinical trial with metastatic prostate cancer patients. An in-depth analysis of ORF2p in a mouse model of breast cancer showed ORF2p to be precociously expressed in precancerous lesions and highly abundant in advanced cancer stages, while being barely detectable in normal breast tissue, providing a rationale for the finding that RT-expressing tumors are therapeutically sensitive to RT inhibitors. We summarize mechanistic and gene profiling studies indicating that abundant LINE-1-derived RT can “sequester” RNA substrates for reverse transcription in tumor cells, entailing the formation of RNA:DNA hybrid molecules and impairing the overall production of regulatory miRNAs, with a global impact on the cell transcriptome. Based on these data, LINE-1-ORF2 encoded RT has a tumor-promoting potential that is exerted at an epigenetic level. We propose a model whereby LINE1-RT drives a previously unrecognized global regulatory process, the deregulation of which drives cell transformation and tumorigenesis with possible implications for cancer cell heterogeneity.
Collapse
Affiliation(s)
| | | | - Corrado Spadafora
- Institute of Translational Pharmacology, National Resarch Council of Italy Rome, Italy
| |
Collapse
|
25
|
Sobiak B, Graczyk‐Jarzynka A, Leśniak W. Comparison of DNA Methylation and Expression Pattern of S100 and Other Epidermal Differentiation Complex Genes in Differentiating Keratinocytes. J Cell Biochem 2015; 117:1092-8. [DOI: 10.1002/jcb.25392] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2015] [Accepted: 10/05/2015] [Indexed: 02/05/2023]
Affiliation(s)
- Barbara Sobiak
- Department of Molecular and Cellular NeurobiologyNencki Institute of Experimental Biology, 3 Pasteur StreetWarsaw02‐093Poland
| | - Agnieszka Graczyk‐Jarzynka
- Department of Molecular and Cellular NeurobiologyNencki Institute of Experimental Biology, 3 Pasteur StreetWarsaw02‐093Poland
| | - Wiesława Leśniak
- Department of Molecular and Cellular NeurobiologyNencki Institute of Experimental Biology, 3 Pasteur StreetWarsaw02‐093Poland
| |
Collapse
|
26
|
|
27
|
Martinez-Morales JR. Toward understanding the evolution of vertebrate gene regulatory networks: comparative genomics and epigenomic approaches. Brief Funct Genomics 2015; 15:315-21. [PMID: 26293604 DOI: 10.1093/bfgp/elv032] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Vertebrates, as most animal phyla, originated >500 million years ago during the Cambrian explosion, and progressively radiated into the extant classes. Inferring the evolutionary history of the group requires understanding the architecture of the developmental programs that constrain the vertebrate anatomy. Here, I review recent comparative genomic and epigenomic studies, based on ChIP-seq and chromatin accessibility, which focus on the identification of functionally equivalent cis-regulatory modules among species. This pioneer work, primarily centered in the mammalian lineage, has set the groundwork for further studies in representative vertebrate and chordate species. Mapping of active regulatory regions across lineages will shed new light on the evolutionary forces stabilizing ancestral developmental programs, as well as allowing their variation to sustain morphological adaptations on the inherited vertebrate body plan.
Collapse
|
28
|
Zaucker A, Bodur T, Roest Crollius H, Hadzhiev Y, Gehrig J, Loosli F, Watson C, Müller F. Description of embryonic development of spotted green pufferfish (Tetraodon nigroviridis). Zebrafish 2015; 11:509-17. [PMID: 25243591 DOI: 10.1089/zeb.2014.0984] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Pufferfish species of the Tetraodontidae family carry the smallest genomes among vertebrates. Their compressed genomes are thought to be enriched for functional DNA compared to larger vertebrate genomes, and they are important models for comparative genomics. The significance of pufferfish as model organisms in comparative genomics is due to the availability of two sequenced genomes, that of spotted green pufferfish (Tetraodon nigroviridis) and fugu (Takifugu rubripes). However, there is only a very limited utilization of pufferfish as an experimental model organism, due to the lack of established husbandry and developmental genetics protocols. In this study, we provide the first description of the normal embryonic development of Tetraodon nigroviridis. Embryos were obtained by in vitro fertilization of eggs, and subsequent development was monitored by brightfield microscopy at constant temperature. Tetraodon development was divided into distinct stages based on diagnostic morphological features, which were adopted from published literature on normal development of other fish species like medaka (Oryzias latipes), zebrafish (Danio rerio), and fugu. Tetraodon embryos show more similar morphologies to medaka than to zebrafish, reflecting its phylogenetic position. The early developmental stage series described in this study forms the foundation for the utilization of tetraodon as an experimental model organism for comparative developmental studies.
Collapse
Affiliation(s)
- Andreas Zaucker
- 1 School of Clinical and Experimental Medicine, College of Medical and Dental Sciences, University of Birmingham , Edgbaston, United Kingdom
| | | | | | | | | | | | | | | |
Collapse
|
29
|
Grice J, Noyvert B, Doglio L, Elgar G. A Simple Predictive Enhancer Syntax for Hindbrain Patterning Is Conserved in Vertebrate Genomes. PLoS One 2015; 10:e0130413. [PMID: 26131856 PMCID: PMC4489388 DOI: 10.1371/journal.pone.0130413] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2015] [Accepted: 05/19/2015] [Indexed: 12/17/2022] Open
Abstract
Background Determining the function of regulatory elements is fundamental for our understanding of development, disease and evolution. However, the sequence features that mediate these functions are often unclear and the prediction of tissue-specific expression patterns from sequence alone is non-trivial. Previous functional studies have demonstrated a link between PBX-HOX and MEIS/PREP binding interactions and hindbrain enhancer activity, but the defining grammar of these sites, if any exists, has remained elusive. Results Here, we identify a shared sequence signature (syntax) within a heterogeneous set of conserved vertebrate hindbrain enhancers composed of spatially co-occurring PBX-HOX and MEIS/PREP transcription factor binding motifs. We use this syntax to accurately predict hindbrain enhancers in 89% of cases (67/75 predicted elements) from a set of conserved non-coding elements (CNEs). Furthermore, mutagenesis of the sites abolishes activity or generates ectopic expression, demonstrating their requirement for segmentally restricted enhancer activity in the hindbrain. We refine and use our syntax to predict over 3,000 hindbrain enhancers across the human genome. These sequences tend to be located near developmental transcription factors and are enriched in known hindbrain activating elements, demonstrating the predictive power of this simple model. Conclusion Our findings support the theory that hundreds of CNEs, and perhaps thousands of regions across the human genome, function to coordinate gene expression in the developing hindbrain. We speculate that deeply conserved sequences of this kind contributed to the co-option of new genes into the hindbrain gene regulatory network during early vertebrate evolution by linking patterns of hox expression to downstream genes involved in segmentation and patterning, and evolutionarily newer instances may have continued to contribute to lineage-specific elaboration of the hindbrain.
Collapse
Affiliation(s)
- Joseph Grice
- The Francis Crick Institute Mill Hill Laboratory, The Ridgeway, Mill Hill, London, NW7 1AA, United Kingdom
| | - Boris Noyvert
- The Francis Crick Institute Mill Hill Laboratory, The Ridgeway, Mill Hill, London, NW7 1AA, United Kingdom
| | - Laura Doglio
- The Francis Crick Institute Mill Hill Laboratory, The Ridgeway, Mill Hill, London, NW7 1AA, United Kingdom
| | - Greg Elgar
- The Francis Crick Institute Mill Hill Laboratory, The Ridgeway, Mill Hill, London, NW7 1AA, United Kingdom
- * E-mail:
| |
Collapse
|
30
|
Schilter KF, Reis LM, Sorokina EA, Semina EV. Identification of an Alu-repeat-mediated deletion of OPTN upstream region in a patient with a complex ocular phenotype. Mol Genet Genomic Med 2015; 3:490-9. [PMID: 26740941 PMCID: PMC4694134 DOI: 10.1002/mgg3.159] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2015] [Revised: 05/05/2015] [Accepted: 05/08/2015] [Indexed: 12/14/2022] Open
Abstract
Genetic causes of ocular conditions remain largely unknown. To reveal the molecular basis for a congenital ocular phenotype associated with glaucoma we performed whole‐exome sequencing (WES) and whole‐genome copy number analyses of patient DNA. WES did not identify a causative variant. Copy number variation analysis identified a deletion of 10p13 in the patient and his unaffected father; the deletion breakpoint contained a single 37‐bp sequence that is normally present in two distinct Alu repeats separated by ~181 kb. The deletion removed part of the upstream region of optineurin (OPTN) as well as the upstream sequence and two coding exons of coiled‐coil domain containing 3 (CCDC3); analysis of the patient's second allele showed normal OPTN and CCDC3 sequences. Studies of zebrafish orthologs identified expression in the developing eye for both genes. OPTN is a known factor in dominant adult‐onset glaucoma and Amyotrophic Lateral Sclerosis (ALS). The deletion eliminates 98 kb of the OPTN upstream sequence leaving only ~1 kb of the proximal promoter region. Comparison of transcriptional activation capability of the 3 kb normal and the rearranged del(10)(p13) OPTN promoter sequences demonstrated a statistically significant decrease for the deleted allele; sequence analysis of the entire deleted region identified multiple conserved elements with possible cis‐regulatory activity. Additional screening of CCDC3 indicated that heterozygous loss‐of‐function alleles are unlikely to cause congenital ocular disease. In summary, we report the first regulatory region deletion involving OPTN, caused by Alu‐mediated nonallelic homologous recombination and possibly contributing to the patient's ocular phenotype. In addition, our data indicate that Alu‐mediated rearrangements of the OPTN upstream region may represent a new source of affected alleles in human conditions. Evaluation of the upstream OPTN sequences in additional ocular and ALS patients may help to determine the role of this region, if any, in human disease.
Collapse
Affiliation(s)
- Kala F Schilter
- Department of Pediatrics and Children's Research InstituteMedical College of WisconsinMilwaukeeWisconsin53226; Department of Cell Biology, Neurobiology and AnatomyMedical College of WisconsinMilwaukeeWisconsin53226
| | - Linda M Reis
- Department of Pediatrics and Children's Research Institute Medical College of Wisconsin Milwaukee Wisconsin 53226
| | - Elena A Sorokina
- Department of Pediatrics and Children's Research Institute Medical College of Wisconsin Milwaukee Wisconsin 53226
| | - Elena V Semina
- Department of Pediatrics and Children's Research InstituteMedical College of WisconsinMilwaukeeWisconsin53226; Department of Cell Biology, Neurobiology and AnatomyMedical College of WisconsinMilwaukeeWisconsin53226
| |
Collapse
|
31
|
Gordon KL, Arthur RK, Ruvinsky I. Phylum-Level Conservation of Regulatory Information in Nematodes despite Extensive Non-coding Sequence Divergence. PLoS Genet 2015; 11:e1005268. [PMID: 26020930 PMCID: PMC4447282 DOI: 10.1371/journal.pgen.1005268] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2014] [Accepted: 05/09/2015] [Indexed: 11/28/2022] Open
Abstract
Gene regulatory information guides development and shapes the course of evolution. To test conservation of gene regulation within the phylum Nematoda, we compared the functions of putative cis-regulatory sequences of four sets of orthologs (unc-47, unc-25, mec-3 and elt-2) from distantly-related nematode species. These species, Caenorhabditis elegans, its congeneric C. briggsae, and three parasitic species Meloidogyne hapla, Brugia malayi, and Trichinella spiralis, represent four of the five major clades in the phylum Nematoda. Despite the great phylogenetic distances sampled and the extensive sequence divergence of nematode genomes, all but one of the regulatory elements we tested are able to drive at least a subset of the expected gene expression patterns. We show that functionally conserved cis-regulatory elements have no more extended sequence similarity to their C. elegans orthologs than would be expected by chance, but they do harbor motifs that are important for proper expression of the C. elegans genes. These motifs are too short to be distinguished from the background level of sequence similarity, and while identical in sequence they are not conserved in orientation or position. Functional tests reveal that some of these motifs contribute to proper expression. Our results suggest that conserved regulatory circuitry can persist despite considerable turnover within cis elements. To explore the phylogenetic limits of conservation of cis-regulatory elements, we used transgenesis to test the functions of enhancers of four genes from several species spanning the phylum Nematoda. While we found a striking degree of functional conservation among the examined cis elements, their DNA sequences lacked apparent conservation with the C. elegans orthologs. In fact, sequence similarity between C. elegans and the distantly related nematodes was no greater than would be expected by chance. Short motifs, similar to known regulatory sequences in C. elegans, can be detected in most of the cis elements. When tested, some of these sites appear to mediate regulatory function. However, they seem to have originated through motif turnover, rather than to have been preserved from a common ancestor. Our results suggest that gene regulatory networks are broadly conserved in the phylum Nematoda, but this conservation persists despite substantial reorganization of regulatory elements and could not be detected using naïve comparisons of sequence similarity.
Collapse
Affiliation(s)
- Kacy L. Gordon
- Department of Organismal Biology and Anatomy, The University of Chicago, Chicago, Illinois, United States of America
- * E-mail: (KLG); (IR)
| | - Robert K. Arthur
- Department of Ecology and Evolution, The University of Chicago, Chicago, Illinois, United States of America
| | - Ilya Ruvinsky
- Department of Organismal Biology and Anatomy, The University of Chicago, Chicago, Illinois, United States of America
- Department of Ecology and Evolution, The University of Chicago, Chicago, Illinois, United States of America
- * E-mail: (KLG); (IR)
| |
Collapse
|
32
|
Davies KTJ, Tsagkogeorga G, Rossiter SJ. Divergent evolutionary rates in vertebrate and mammalian specific conserved non-coding elements (CNEs) in echolocating mammals. BMC Evol Biol 2014; 14:261. [PMID: 25523630 PMCID: PMC4302572 DOI: 10.1186/s12862-014-0261-5] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2014] [Accepted: 12/08/2014] [Indexed: 11/26/2022] Open
Abstract
BACKGROUND The majority of DNA contained within vertebrate genomes is non-coding, with a certain proportion of this thought to play regulatory roles during development. Conserved Non-coding Elements (CNEs) are an abundant group of putative regulatory sequences that are highly conserved across divergent groups and thus assumed to be under strong selective constraint. Many CNEs may contain regulatory factor binding sites, and their frequent spatial association with key developmental genes - such as those regulating sensory system development - suggests crucial roles in regulating gene expression and cellular patterning. Yet surprisingly little is known about the molecular evolution of CNEs across diverse mammalian taxa or their role in specific phenotypic adaptations. We examined 3,110 vertebrate-specific and ~82,000 mammalian-specific CNEs across 19 and 9 mammalian orders respectively, and tested for changes in the rate of evolution of CNEs located in the proximity of genes underlying the development or functioning of auditory systems. As we focused on CNEs putatively associated with genes underlying the development/functioning of auditory systems, we incorporated echolocating taxa in our dataset because of their highly specialised and derived auditory systems. RESULTS Phylogenetic reconstructions of concatenated CNEs broadly recovered accepted mammal relationships despite high levels of sequence conservation. We found that CNE substitution rates were highest in rodents and lowest in primates, consistent with previous findings. Comparisons of CNE substitution rates from several genomic regions containing genes linked to auditory system development and hearing revealed differences between echolocating and non-echolocating taxa. Wider taxonomic sampling of four CNEs associated with the homeobox genes Hmx2 and Hmx3 - which are required for inner ear development - revealed family-wise variation across diverse bat species. Specifically within one family of echolocating bats that utilise frequency-modulated echolocation calls varying widely in frequency and intensity high levels of sequence divergence were found. CONCLUSIONS Levels of selective constraint acting on CNEs differed both across genomic locations and taxa, with observed variation in substitution rates of CNEs among bat species. More work is needed to determine whether this variation can be linked to echolocation, and wider taxonomic sampling is necessary to fully document levels of conservation in CNEs across diverse taxa.
Collapse
Affiliation(s)
- Kalina T J Davies
- School of Biological & Chemical Sciences, Queen Mary University of London, Mile End Road, London, E1 4NS, UK.
| | - Georgia Tsagkogeorga
- School of Biological & Chemical Sciences, Queen Mary University of London, Mile End Road, London, E1 4NS, UK.
| | - Stephen J Rossiter
- School of Biological & Chemical Sciences, Queen Mary University of London, Mile End Road, London, E1 4NS, UK.
| |
Collapse
|
33
|
Gutierrez-Triana JA, Herget U, Lichtner P, Castillo-Ramírez LA, Ryu S. A vertebrate-conserved cis-regulatory module for targeted expression in the main hypothalamic regulatory region for the stress response. BMC DEVELOPMENTAL BIOLOGY 2014; 14:41. [PMID: 25427861 PMCID: PMC4248439 DOI: 10.1186/s12861-014-0041-x] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/10/2014] [Accepted: 11/11/2014] [Indexed: 01/30/2023]
Abstract
Background The homeodomain transcription factor orthopedia (Otp) is an evolutionarily conserved regulator of neuronal fates. In vertebrates, Otp is necessary for the proper development of different regions of the brain and is required in the diencephalon to specify several hypothalamic cell types, including the cells that control the stress response. To understand how this widely expressed transcription factor accomplishes hypothalamus-specific functions, we performed a comprehensive screening of otp cis-regulatory regions in zebrafish. Results Here, we report the identification of an evolutionarily conserved vertebrate enhancer module with activity in a restricted area of the forebrain, which includes the region of the hypothalamus that controls the stress response. This region includes neurosecretory cells producing Corticotropin-releasing hormone (Crh), Oxytocin (Oxt) and Arginine vasopressin (Avp), which are key components of the stress axis. Lastly, expression of the bacterial nitroreductase gene under this specific enhancer allowed pharmacological attenuation of the stress response in zebrafish larvae. Conclusion Vertebrates share many cellular and molecular components of the stress response and our work identified a striking conservation at the cis-regulatory level of a key hypothalamic developmental gene. In addition, this enhancer provides a useful tool to manipulate and visualize stress-regulatory hypothalamic cells in vivo with the long-term goal of understanding the ontogeny of the stress axis in vertebrates. Electronic supplementary material The online version of this article (doi:10.1186/s12861-014-0041-x) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Jose Arturo Gutierrez-Triana
- Developmental Genetics of the Nervous System, Max Planck Institute for Medical Research, Jahnstrasse 29, D-69120, Heidelberg, Germany. .,Current address: Centre for Organismal Studies (COS), University of Heidelberg, Im Neuenheimer Feld 230, D-69120, Heidelberg, Germany.
| | - Ulrich Herget
- Developmental Genetics of the Nervous System, Max Planck Institute for Medical Research, Jahnstrasse 29, D-69120, Heidelberg, Germany. .,The Hartmut Hoffmann-Berling International Graduate School of Molecular and Cellular Biology, University of Heidelberg, Heidelberg, Germany.
| | - Patrick Lichtner
- Developmental Genetics of the Nervous System, Max Planck Institute for Medical Research, Jahnstrasse 29, D-69120, Heidelberg, Germany.
| | - Luis A Castillo-Ramírez
- Developmental Genetics of the Nervous System, Max Planck Institute for Medical Research, Jahnstrasse 29, D-69120, Heidelberg, Germany. .,The Hartmut Hoffmann-Berling International Graduate School of Molecular and Cellular Biology, University of Heidelberg, Heidelberg, Germany.
| | - Soojin Ryu
- Developmental Genetics of the Nervous System, Max Planck Institute for Medical Research, Jahnstrasse 29, D-69120, Heidelberg, Germany.
| |
Collapse
|
34
|
Abstract
Combined with TCR stimuli, extracellular cytokine signals initiate the differentiation of naive CD4(+) T cells into specialized effector T-helper (Th) and regulatory T (Treg) cell subsets. The lineage specification and commitment process occurs through the combinatorial action of multiple transcription factors (TFs) and epigenetic mechanisms that drive lineage-specific gene expression programs. In this article, we review recent studies on the transcriptional and epigenetic regulation of distinct Th cell lineages. Moreover, we review current study linking immune disease-associated single-nucleotide polymorphisms with distal regulatory elements and their potential role in the disease etiology.
Collapse
Affiliation(s)
- Subhash K Tripathi
- Turku Centre for Biotechnology, University of Turku and
Åbo Akademi UniversityTurku, Finland
- National Doctoral Programme in Informational and
Structural BiologyTurku, Finland
- Turku Doctoral Programme of Molecular Medicine (TuDMM),
University of TurkuTurku, Finland
| | - Riitta Lahesmaa
- Turku Centre for Biotechnology, University of Turku and
Åbo Akademi UniversityTurku, Finland
| |
Collapse
|
35
|
Dippold RP, Fisher SA. A bioinformatic and computational study of myosin phosphatase subunit diversity. Am J Physiol Regul Integr Comp Physiol 2014; 307:R256-70. [PMID: 24898838 PMCID: PMC4121627 DOI: 10.1152/ajpregu.00145.2014] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2014] [Accepted: 05/25/2014] [Indexed: 01/01/2023]
Abstract
Variability in myosin phosphatase (MP) subunits may provide specificity in signaling pathways that regulate muscle tone. We utilized public databases and computational algorithms to investigate the phylogenetic diversity of MP regulatory (PPP1R12A-C) and inhibitory (PPP1R14A-D) subunits. The comparison of exonic coding sequences and expression data confirmed or refuted the existence of isoforms and their tissue-specific expression in different model organisms. The comparison of intronic and exonic sequences identified potential expressional regulatory elements. As examples, smooth muscle MP regulatory subunit (PPP1R12A) is highly conserved through evolution. Its alternative exon E24 is present in fish through mammals with two invariant features: 1) a reading frame shift generating a premature termination codon and 2) a hexanucleotide sequence adjacent to the 3' splice site hypothesized to be a novel suppressor of exon splicing. A characteristic of the striated muscle MP regulatory subunit (PPP1R12B) locus is numerous and phylogenetically variable transcriptional start sites. In fish this locus only codes for the small (M21) subunit, suggesting the primordial function of this gene. Inhibitory subunits show little intragenic variability; their diversity is thought to have arisen by expansion and tissue-specific expression of different gene family members. We demonstrate differences in the regulatory landscape between smooth muscle enriched (PPP1R14A) and more ubiquitously expressed (PPP1R14B) family members and identify deeply conserved intronic sequence and predicted transcriptional cis-regulatory elements. This bioinformatic and computational study has uncovered a number of attributes of MP subunits that supports selection of ideal model organisms and testing of hypotheses regarding their physiological significance and regulated expression.
Collapse
Affiliation(s)
- Rachael P Dippold
- Department of Medicine, Cardiology, University of Maryland Baltimore, Baltimore, Maryland
| | - Steven A Fisher
- Department of Medicine, Cardiology, University of Maryland Baltimore, Baltimore, Maryland
| |
Collapse
|
36
|
Brosius J. The persistent contributions of RNA to eukaryotic gen(om)e architecture and cellular function. Cold Spring Harb Perspect Biol 2014; 6:a016089. [PMID: 25081515 DOI: 10.1101/cshperspect.a016089] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]
Abstract
Currently, the best scenario for earliest forms of life is based on RNA molecules as they have the proven ability to catalyze enzymatic reactions and harbor genetic information. Evolutionary principles valid today become apparent in such models already. Furthermore, many features of eukaryotic genome architecture might have their origins in an RNA or RNA/protein (RNP) world, including the onset of a further transition, when DNA replaced RNA as the genetic bookkeeper of the cell. Chromosome maintenance, splicing, and regulatory function via RNA may be deeply rooted in the RNA/RNP worlds. Mostly in eukaryotes, conversion from RNA to DNA is still ongoing, which greatly impacts the plasticity of extant genomes. Raw material for novel genes encoding protein or RNA, or parts of genes including regulatory elements that selection can act on, continues to enter the evolutionary lottery.
Collapse
Affiliation(s)
- Jürgen Brosius
- Institute of Experimental Pathology (ZMBE), University of Münster, D-48149 Münster, Germany
| |
Collapse
|
37
|
Kurihara M, Shiraishi A, Satake H, Kimura AP. A conserved noncoding sequence can function as a spermatocyte-specific enhancer and a bidirectional promoter for a ubiquitously expressed gene and a testis-specific long noncoding RNA. J Mol Biol 2014; 426:3069-93. [PMID: 25020229 DOI: 10.1016/j.jmb.2014.06.018] [Citation(s) in RCA: 41] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2014] [Revised: 06/26/2014] [Accepted: 06/27/2014] [Indexed: 12/13/2022]
Abstract
Tissue-specific gene expression is tightly regulated by various elements such as promoters, enhancers, and long noncoding RNAs (lncRNAs). In the present study, we identified a conserved noncoding sequence (CNS1) as a novel enhancer for the spermatocyte-specific mouse testicular cell adhesion molecule 1 (Tcam1) gene. CNS1 was located 3.4kb upstream of the Tcam1 gene and associated with histone H3K4 mono-methylation in testicular germ cells. By the in vitro reporter gene assay, CNS1 could enhance Tcam1 promoter activity only in GC-2spd(ts) cells, which were derived from mouse spermatocytes. When we integrated the 6.9-kb 5'-flanking sequence of Tcam1 with or without a deletion of CNS1 linked to the enhanced green fluorescent protein gene into the chromatin of GC-2spd(ts) cells, CNS1 significantly enhanced Tcam1 promoter activity. These results indicate that CNS1 could function as a spermatocyte-specific enhancer. Interestingly, CNS1 also showed high bidirectional promoter activity in the reporter assay, and consistent with this, the Smarcd2 gene and lncRNA, designated lncRNA-Tcam1, were transcribed from adjacent regions of CNS1. While Smarcd2 was ubiquitously expressed, lncRNA-Tcam1 expression was restricted to testicular germ cells, although this lncRNA did not participate in Tcam1 activation. Ubiquitous Smarcd2 expression was correlated to CpG hypo-methylation of CNS1 and partially controlled by Sp1. However, for lncRNA-Tcam1 transcription, the strong association with histone acetylation and histone H3K4 tri-methylation also appeared to be required. The present data suggest that CNS1 is a spermatocyte-specific enhancer for the Tcam1 gene and a bidirectional promoter of Smarcd2 and lncRNA-Tcam1.
Collapse
Affiliation(s)
- Misuzu Kurihara
- Graduate School of Life Science, Hokkaido University, Sapporo 060-0810, Japan
| | - Akira Shiraishi
- Suntory Foundation for Life Sciences, Bioorganic Research Institute, Osaka 618-8503, Japan
| | - Honoo Satake
- Suntory Foundation for Life Sciences, Bioorganic Research Institute, Osaka 618-8503, Japan
| | - Atsushi P Kimura
- Graduate School of Life Science, Hokkaido University, Sapporo 060-0810, Japan; Department of Biological Sciences, Faculty of Science, Hokkaido University, Sapporo 060-0810, Japan.
| |
Collapse
|
38
|
Barrière A, Ruvinsky I. Pervasive divergence of transcriptional gene regulation in Caenorhabditis nematodes. PLoS Genet 2014; 10:e1004435. [PMID: 24968346 PMCID: PMC4072541 DOI: 10.1371/journal.pgen.1004435] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2013] [Accepted: 04/28/2014] [Indexed: 12/18/2022] Open
Abstract
Because there is considerable variation in gene expression even between closely related species, it is clear that gene regulatory mechanisms evolve relatively rapidly. Because primary sequence conservation is an unreliable proxy for functional conservation of cis-regulatory elements, their assessment must be carried out in vivo. We conducted a survey of cis-regulatory conservation between C. elegans and closely related species C. briggsae, C. remanei, C. brenneri, and C. japonica. We tested enhancers of eight genes from these species by introducing them into C. elegans and analyzing the expression patterns they drove. Our results support several notable conclusions. Most exogenous cis elements direct expression in the same cells as their C. elegans orthologs, confirming gross conservation of regulatory mechanisms. However, the majority of exogenous elements, when placed in C. elegans, also directed expression in cells outside endogenous patterns, suggesting functional divergence. Recurrent ectopic expression of different promoters in the same C. elegans cells may reflect biases in the directions in which expression patterns can evolve due to shared regulatory logic of coexpressed genes. The fact that, despite differences between individual genes, several patterns repeatedly emerged from our survey, encourages us to think that general rules governing regulatory evolution may exist and be discoverable.
Collapse
Affiliation(s)
- Antoine Barrière
- Department of Ecology and Evolution and Institute for Genomics and Systems Biology, The University of Chicago, Chicago, Illinois, United States of America
- * E-mail: (AB); (IR)
| | - Ilya Ruvinsky
- Department of Ecology and Evolution and Institute for Genomics and Systems Biology, The University of Chicago, Chicago, Illinois, United States of America
- Department of Organismal Biology and Anatomy, The University of Chicago, Chicago, Illinois, United States of America
- * E-mail: (AB); (IR)
| |
Collapse
|
39
|
Koufariotis L, Chen YPP, Bolormaa S, Hayes BJ. Regulatory and coding genome regions are enriched for trait associated variants in dairy and beef cattle. BMC Genomics 2014; 15:436. [PMID: 24903263 PMCID: PMC4070550 DOI: 10.1186/1471-2164-15-436] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2013] [Accepted: 05/22/2014] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND In livestock, as in humans, the number of genetic variants that can be tested for association with complex quantitative traits, or used in genomic predictions, is increasing exponentially as whole genome sequencing becomes more common. The power to identify variants associated with traits, particularly those of small effects, could be increased if certain regions of the genome were known a priori to be enriched for associations. Here, we investigate whether twelve genomic annotation classes were enriched or depleted for significant associations in genome wide association studies for complex traits in beef and dairy cattle. We also describe a variance component approach to determine the proportion of genetic variance captured by each annotation class. RESULTS P-values from large GWAS using 700K SNP in both dairy and beef cattle were available for 11 and 10 traits respectively. We found significant enrichment for trait associated variants (SNP significant in the GWAS) in the missense class along with regions 5 kilobases upstream and downstream of coding genes. We found that the non-coding conserved regions (across mammals) were not enriched for trait associated variants. The results from the enrichment or depletion analysis were not in complete agreement with the results from variance component analysis, where the missense and synonymous classes gave the greatest increase in variance explained, while the upstream and downstream classes showed a more modest increase in the variance explained. CONCLUSION Our results indicate that functional annotations could assist in prioritization of variants to a subset more likely to be associated with complex traits; including missense variants, and upstream and downstream regions. The differences in two sets of results (GWAS enrichment depletion versus variance component approaches) might be explained by the fact that the variance component approach has greater power to capture the cumulative effect of mutations of small effect, while the enrichment or depletion approach only captures the variants that are significant in GWAS, which is restricted to a limited number of common variants of moderate effects.
Collapse
Affiliation(s)
- Lambros Koufariotis
- Faculty of Science, Technology and Engineering, La Trobe University, Melbourne, Victoria 3086, Australia.
| | | | | | | |
Collapse
|
40
|
Polychronopoulos D, Sellis D, Almirantis Y. Conserved noncoding elements follow power-law-like distributions in several genomes as a result of genome dynamics. PLoS One 2014; 9:e95437. [PMID: 24787386 PMCID: PMC4008492 DOI: 10.1371/journal.pone.0095437] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2013] [Accepted: 03/26/2014] [Indexed: 12/31/2022] Open
Abstract
Conserved, ultraconserved and other classes of constrained elements (collectively referred as CNEs here), identified by comparative genomics in a wide variety of genomes, are non-randomly distributed across chromosomes. These elements are defined using various degrees of conservation between organisms and several thresholds of minimal length. We here investigate the chromosomal distribution of CNEs by studying the statistical properties of distances between consecutive CNEs. We find widespread power-law-like distributions, i.e. linearity in double logarithmic scale, in the inter-CNE distances, a feature which is connected with fractality and self-similarity. Given that CNEs are often found to be spatially associated with genes, especially with those that regulate developmental processes, we verify by appropriate gene masking that a power-law-like pattern emerges irrespectively of whether elements found close or inside genes are excluded or not. An evolutionary model is put forward for the understanding of these findings that includes segmental or whole genome duplication events and eliminations (loss) of most of the duplicated CNEs. Simulations reproduce the main features of the observed size distributions. Power-law-like patterns in the genomic distributions of CNEs are in accordance with current knowledge about their evolutionary history in several genomes.
Collapse
Affiliation(s)
- Dimitris Polychronopoulos
- Institute of Biosciences and Applications, National Center for Scientific Research “Demokritos”, Athens, Greece
- Department of Biochemistry and Molecular Biology, Faculty of Biology, National and Kapodistrian University of Athens, Athens, Greece
| | - Diamantis Sellis
- Department of Biology, Stanford University, Stanford, California, United States of America
| | - Yannis Almirantis
- Institute of Biosciences and Applications, National Center for Scientific Research “Demokritos”, Athens, Greece
- * E-mail:
| |
Collapse
|
41
|
Tena JJ, González-Aguilera C, Fernández-Miñán A, Vázquez-Marín J, Parra-Acero H, Cross JW, Rigby PWJ, Carvajal JJ, Wittbrodt J, Gómez-Skarmeta JL, Martínez-Morales JR. Comparative epigenomics in distantly related teleost species identifies conserved cis-regulatory nodes active during the vertebrate phylotypic period. Genome Res 2014; 24:1075-85. [PMID: 24709821 PMCID: PMC4079964 DOI: 10.1101/gr.163915.113] [Citation(s) in RCA: 44] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023]
Abstract
The complex relationship between ontogeny and phylogeny has been the subject of attention and controversy since von Baer’s formulations in the 19th century. The classic concept that embryogenesis progresses from clade general features to species-specific characters has often been revisited. It has become accepted that embryos from a clade show maximum morphological similarity at the so-called phylotypic period (i.e., during mid-embryogenesis). According to the hourglass model, body plan conservation would depend on constrained molecular mechanisms operating at this period. More recently, comparative transcriptomic analyses have provided conclusive evidence that such molecular constraints exist. Examining cis-regulatory architecture during the phylotypic period is essential to understand the evolutionary source of body plan stability. Here we compare transcriptomes and key epigenetic marks (H3K4me3 and H3K27ac) from medaka (Oryzias latipes) and zebrafish (Danio rerio), two distantly related teleosts separated by an evolutionary distance of 115–200 Myr. We show that comparison of transcriptome profiles correlates with anatomical similarities and heterochronies observed at the phylotypic stage. Through comparative epigenomics, we uncover a pool of conserved regulatory regions (≈700), which are active during the vertebrate phylotypic period in both species. Moreover, we show that their neighboring genes encode mainly transcription factors with fundamental roles in tissue specification. We postulate that these regulatory regions, active in both teleost genomes, represent key constrained nodes of the gene networks that sustain the vertebrate body plan.
Collapse
Affiliation(s)
- Juan J Tena
- Centro Andaluz de Biología del Desarrollo (CSIC/UPO/JA), 41013 Sevilla, Spain
| | | | - Ana Fernández-Miñán
- Centro Andaluz de Biología del Desarrollo (CSIC/UPO/JA), 41013 Sevilla, Spain
| | | | - Helena Parra-Acero
- Centro Andaluz de Biología del Desarrollo (CSIC/UPO/JA), 41013 Sevilla, Spain
| | - Joe W Cross
- Division of Cancer Biology, The Institute of Cancer Research, London SW3 6JB, United Kingdom
| | - Peter W J Rigby
- Division of Cancer Biology, The Institute of Cancer Research, London SW3 6JB, United Kingdom
| | - Jaime J Carvajal
- Centro Andaluz de Biología del Desarrollo (CSIC/UPO/JA), 41013 Sevilla, Spain; Division of Cancer Biology, The Institute of Cancer Research, London SW3 6JB, United Kingdom
| | - Joachim Wittbrodt
- Centre for Organismal Studies, COS, University of Heidelberg, 69120 Heidelberg, Germany
| | | | | |
Collapse
|
42
|
Zare H, Khodursky A, Sartorelli V. An evolutionarily biased distribution of miRNA sites toward regulatory genes with high promoter-driven intrinsic transcriptional noise. BMC Evol Biol 2014; 14:74. [PMID: 24707827 PMCID: PMC4031498 DOI: 10.1186/1471-2148-14-74] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2013] [Accepted: 03/24/2014] [Indexed: 12/21/2022] Open
Abstract
Background miRNAs are a major class of regulators of gene expression in metazoans. By targeting cognate mRNAs, miRNAs are involved in regulating most, if not all, biological processes in different cell and tissue types. To better understand how this regulatory potential is allocated among different target gene sets, we carried out a detailed and systematic analysis of miRNA target sites distribution in the mouse genome. Results We used predicted conserved and non-conserved sites for 779 miRNAs in 3′ UTR of 18440 genes downloaded from TargetScan website. Our analysis reveals that 3′ UTRs of genes encoding regulatory proteins harbor significantly greater number of miRNA sites than those of non-regulatory, housekeeping and structural, genes. Analysis of miRNA sites for orthologous 3′UTR’s in 10 other species indicates that the regulatory genes were maintaining or accruing miRNA sites while non-regulatory genes gradually shed them in the course of evolution. Furthermore, we observed that 3′ UTR of genes with higher gene expression variability driven by their promoter sequence content are targeted by many more distinct miRNAs compared to genes with low transcriptional noise. Conclusions Based on our results we envision a model, which we dubbed “selective inclusion”, whereby non-regulatory genes with low transcription noise and stable expression profile lost their sites, while regulatory genes which endure higher transcription noise retained and gained new sites. This adaptation is consistent with the requirements that regulatory genes need to be tightly controlled in order to have precise and optimum protein level to properly function.
Collapse
Affiliation(s)
- Hossein Zare
- Laboratory of Muscle Stem Cells and Gene Regulation, National Institute of Arthritis, Musculoskeletal and Skin Diseases, National Institutes of Health, 50 South Drive, Bethesda, MD 20892, USA.
| | | | | |
Collapse
|
43
|
Turner EE, Cox TC. Genetic evidence for conserved non-coding element function across species-the ears have it. Front Physiol 2014; 5:7. [PMID: 24478720 PMCID: PMC3896894 DOI: 10.3389/fphys.2014.00007] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2013] [Accepted: 01/05/2014] [Indexed: 01/08/2023] Open
Abstract
Comparison of genomic sequences from diverse vertebrate species has revealed numerous highly conserved regions that do not appear to encode proteins or functional RNAs. Often these “conserved non-coding elements,” or CNEs, can direct gene expression to specific tissues in transgenic models, demonstrating they have regulatory function. CNEs are frequently found near “developmental” genes, particularly transcription factors, implying that these elements have essential regulatory roles in development. However, actual examples demonstrating CNE regulatory functions across species have been few, and recent loss-of-function studies of several CNEs in mice have shown relatively minor effects. In this Perspectives article, we discuss new findings in “fancy” rats and Highland cattle demonstrating that function of a CNE near the Hmx1 gene is crucial for normal external ear development and when disrupted can mimic loss-of function Hmx1 coding mutations in mice and humans. These findings provide important support for conserved developmental roles of CNEs in divergent species, and reinforce the concept that CNEs should be examined systematically in the ongoing search for genetic causes of human developmental disorders in the era of genome-scale sequencing.
Collapse
Affiliation(s)
- Eric E Turner
- Center for Integrative Brain Research, Seattle Children's Research Institute Seattle, WA, USA ; Center on Human Development and Disability, University of Washington Seattle, WA, USA ; Department of Psychiatry and Behavioral Sciences, University of Washington Seattle, WA, USA
| | - Timothy C Cox
- Center on Human Development and Disability, University of Washington Seattle, WA, USA ; Department of Pediatrics (Craniofacial Medicine), University of Washington Seattle, WA, USA ; Department of Anatomy and Developmental Biology, Monash University Clayton, VIC, Australia ; Center for Developmental Biology and Regenerative Medicine, Seattle Children's Research Institute Seattle, WA, USA
| |
Collapse
|
44
|
Matsubara S, Kurihara M, Kimura AP. A long non-coding RNA transcribed from conserved non-coding sequences contributes to the mouse prolyl oligopeptidase gene activation. J Biochem 2013; 155:243-56. [PMID: 24369296 DOI: 10.1093/jb/mvt113] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023] Open
Abstract
Prolyl oligopeptidase (POP) is a multifunctional protease which is involved in many physiological events, but its gene regulatory mechanism is poorly understood. To identify novel regulatory elements of the POP gene, we compared the genomic sequences at the mouse and human POP loci and found six conserved non-coding sequences (CNSs) at adjacent intergenic regions. From these CNSs, four long non-coding RNAs (lncRNAs) were transcribed and the expression pattern of one (lncPrep+96kb) was correlated with that of POP. lncPrep+96kb was transcribed as two forms due to the different transcriptional start sites and was localized at the nucleus and cytoplasm, although more was present at the nucleus. When we knocked down lncPrep+96kb in the primary ovarian granulosa cell and a hepatic cell line, the POP expression was decreased in both cells. In contrast, overexpression of lncPrep+96kb increased the POP expression only in the granulosa cell. Because lncPrep+96kb was upregulated with the same timing as POP in the hormone-treated ovary, this lncRNA could play a role in the POP gene activation in the granulosa cell. Moreover, a downstream region of the human POP gene was also transcribed. We propose a novel mechanism for the POP gene activation.
Collapse
Affiliation(s)
- Shin Matsubara
- Graduate School of Life Science and Department of Biological Sciences, Faculty of Science, Hokkaido University, Sapporo 060-0810, Japan
| | | | | |
Collapse
|
45
|
Harmston N, Baresic A, Lenhard B. The mystery of extreme non-coding conservation. Philos Trans R Soc Lond B Biol Sci 2013; 368:20130021. [PMID: 24218634 PMCID: PMC3826495 DOI: 10.1098/rstb.2013.0021] [Citation(s) in RCA: 47] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open
Abstract
Regions of several dozen to several hundred base pairs of extreme conservation have been found in non-coding regions in all metazoan genomes. The distribution of these elements within and across genomes has suggested that many have roles as transcriptional regulatory elements in multi-cellular organization, differentiation and development. Currently, there is no known mechanism or function that would account for this level of conservation at the observed evolutionary distances. Previous studies have found that, while these regions are under strong purifying selection, and not mutational coldspots, deletion of entire regions in mice does not necessarily lead to identifiable changes in phenotype during development. These opposing findings lead to several questions regarding their functional importance and why they are under strong selection in the first place. In this perspective, we discuss the methods and techniques used in identifying and dissecting these regions, their observed patterns of conservation, and review the current hypotheses on their functional significance.
Collapse
Affiliation(s)
- Nathan Harmston
- Institute of Clinical Sciences, Faculty of Medicine, Imperial College London and MRC Clinical Sciences Centre, , Hammersmith Hospital Campus, Du Cane Road, London W12 0NN, UK
| | | | | |
Collapse
|