1
|
Massively parallel reporter assays and mouse transgenic assays provide complementary information about neuronal enhancer activity. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.04.22.590634. [PMID: 38712228 PMCID: PMC11071441 DOI: 10.1101/2024.04.22.590634] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/08/2024]
Abstract
Genetic studies find hundreds of thousands of noncoding variants associated with psychiatric disorders. Massively parallel reporter assays (MPRAs) and in vivo transgenic mouse assays can be used to assay the impact of these variants. However, the relevance of MPRAs to in vivo function is unknown and transgenic assays suffer from low throughput. Here, we studied the utility of combining the two assays to study the impact of non-coding variants. We carried out an MPRA on over 50,000 sequences derived from enhancers validated in transgenic mouse assays and from multiple fetal neuronal ATAC-seq datasets. We also tested over 20,000 variants, including synthetic mutations in highly active neuronal enhancers and 177 common variants associated with psychiatric disorders. Variants with a high impact on MPRA activity were further tested in mice. We found a strong and specific correlation between MPRA and mouse neuronal enhancer activity including changes in neuronal enhancer activity in mouse embryos for variants with strong MPRA effects. Mouse assays also revealed pleiotropic variant effects that could not be observed in MPRA. Our work provides a large catalog of functional neuronal enhancers and variant effects and highlights the effectiveness of combining MPRAs and mouse transgenic assays.
Collapse
|
2
|
Increased enhancer-promoter interactions during developmental enhancer activation in mammals. Nat Genet 2024; 56:675-685. [PMID: 38509385 DOI: 10.1038/s41588-024-01681-2] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2022] [Accepted: 02/06/2024] [Indexed: 03/22/2024]
Abstract
Remote enhancers are thought to interact with their target promoters via physical proximity, yet the importance of this proximity for enhancer function remains unclear. Here we investigate the three-dimensional (3D) conformation of enhancers during mammalian development by generating high-resolution tissue-resolved contact maps for nearly a thousand enhancers with characterized in vivo activities in ten murine embryonic tissues. Sixty-one percent of developmental enhancers bypass their neighboring genes, which are often marked by promoter CpG methylation. The majority of enhancers display tissue-specific 3D conformations, and both enhancer-promoter and enhancer-enhancer interactions are moderately but consistently increased upon enhancer activation in vivo. Less than 14% of enhancer-promoter interactions form stably across tissues; however, these invariant interactions form in the absence of the enhancer and are likely mediated by adjacent CTCF binding. Our results highlight the general importance of enhancer-promoter physical proximity for developmental gene activation in mammals.
Collapse
|
3
|
Dynamic enhancer landscapes in human craniofacial development. Nat Commun 2024; 15:2030. [PMID: 38448444 PMCID: PMC10917818 DOI: 10.1038/s41467-024-46396-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2023] [Accepted: 02/25/2024] [Indexed: 03/08/2024] Open
Abstract
The genetic basis of human facial variation and craniofacial birth defects remains poorly understood. Distant-acting transcriptional enhancers control the fine-tuned spatiotemporal expression of genes during critical stages of craniofacial development. However, a lack of accurate maps of the genomic locations and cell type-resolved activities of craniofacial enhancers prevents their systematic exploration in human genetics studies. Here, we combine histone modification, chromatin accessibility, and gene expression profiling of human craniofacial development with single-cell analyses of the developing mouse face to define the regulatory landscape of facial development at tissue- and single cell-resolution. We provide temporal activity profiles for 14,000 human developmental craniofacial enhancers. We find that 56% of human craniofacial enhancers share chromatin accessibility in the mouse and we provide cell population- and embryonic stage-resolved predictions of their in vivo activity. Taken together, our data provide an expansive resource for genetic and developmental studies of human craniofacial development.
Collapse
|
4
|
A cell type-aware framework for nominating non-coding variants in Mendelian regulatory disorders. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2023:2023.12.22.23300468. [PMID: 38234731 PMCID: PMC10793524 DOI: 10.1101/2023.12.22.23300468] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/19/2024]
Abstract
Unsolved Mendelian cases often lack obvious pathogenic coding variants, suggesting potential non-coding etiologies. Here, we present a single cell multi-omic framework integrating embryonic mouse chromatin accessibility, histone modification, and gene expression assays to discover cranial motor neuron (cMN) cis-regulatory elements and subsequently nominate candidate non-coding variants in the congenital cranial dysinnervation disorders (CCDDs), a set of Mendelian disorders altering cMN development. We generated single cell epigenomic profiles for ~86,000 cMNs and related cell types, identifying ~250,000 accessible regulatory elements with cognate gene predictions for ~145,000 putative enhancers. Seventy-five percent of elements (44 of 59) validated in an in vivo transgenic reporter assay, demonstrating that single cell accessibility is a strong predictor of enhancer activity. Applying our cMN atlas to 899 whole genome sequences from 270 genetically unsolved CCDD pedigrees, we achieved significant reduction in our variant search space and nominated candidate variants predicted to regulate known CCDD disease genes MAFB, PHOX2A, CHN1, and EBF3 - as well as new candidates in recurrently mutated enhancers through peak- and gene-centric allelic aggregation. This work provides novel non-coding variant discoveries of relevance to CCDDs and a generalizable framework for nominating non-coding variants of potentially high functional impact in other Mendelian disorders.
Collapse
|
5
|
Single-cell, whole-embryo phenotyping of mammalian developmental disorders. Nature 2023; 623:772-781. [PMID: 37968388 PMCID: PMC10665194 DOI: 10.1038/s41586-023-06548-w] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2022] [Accepted: 08/16/2023] [Indexed: 11/17/2023]
Abstract
Mouse models are a critical tool for studying human diseases, particularly developmental disorders1. However, conventional approaches for phenotyping may fail to detect subtle defects throughout the developing mouse2. Here we set out to establish single-cell RNA sequencing of the whole embryo as a scalable platform for the systematic phenotyping of mouse genetic models. We applied combinatorial indexing-based single-cell RNA sequencing3 to profile 101 embryos of 22 mutant and 4 wild-type genotypes at embryonic day 13.5, altogether profiling more than 1.6 million nuclei. The 22 mutants represent a range of anticipated phenotypic severities, from established multisystem disorders to deletions of individual regulatory regions4,5. We developed and applied several analytical frameworks for detecting differences in composition and/or gene expression across 52 cell types or trajectories. Some mutants exhibit changes in dozens of trajectories whereas others exhibit changes in only a few cell types. We also identify differences between widely used wild-type strains, compare phenotyping of gain- versus loss-of-function mutants and characterize deletions of topological associating domain boundaries. Notably, some changes are shared among mutants, suggesting that developmental pleiotropy might be 'decomposable' through further scaling of this approach. Overall, our findings show how single-cell profiling of whole embryos can enable the systematic molecular and cellular phenotypic characterization of mouse mutants with unprecedented breadth and resolution.
Collapse
|
6
|
Rare variation in noncoding regions with evolutionary signatures contributes to autism spectrum disorder risk. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2023:2023.09.19.23295780. [PMID: 37790480 PMCID: PMC10543033 DOI: 10.1101/2023.09.19.23295780] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/05/2023]
Abstract
Little is known about the role of noncoding regions in the etiology of autism spectrum disorder (ASD). We examined three classes of noncoding regions: Human Accelerated Regions (HARs), which show signatures of positive selection in humans; experimentally validated neural Vista Enhancers (VEs); and conserved regions predicted to act as neural enhancers (CNEs). Targeted and whole genome analysis of >16,600 samples and >4900 ASD probands revealed that likely recessive, rare, inherited variants in HARs, VEs, and CNEs substantially contribute to ASD risk in probands whose parents share ancestry, which enriches for recessive contributions, but modestly, if at all, in simplex family structures. We identified multiple patient variants in HARs near IL1RAPL1 and in a VE near SIM1 and showed that they change enhancer activity. Our results implicate both human-evolved and evolutionarily conserved noncoding regions in ASD risk and suggest potential mechanisms of how changes in regulatory regions can modulate social behavior.
Collapse
|
7
|
Noncoding variants alter GATA2 expression in rhombomere 4 motor neurons and cause dominant hereditary congenital facial paresis. Nat Genet 2023; 55:1149-1163. [PMID: 37386251 PMCID: PMC10335940 DOI: 10.1038/s41588-023-01424-9] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2021] [Accepted: 05/10/2023] [Indexed: 07/01/2023]
Abstract
Hereditary congenital facial paresis type 1 (HCFP1) is an autosomal dominant disorder of absent or limited facial movement that maps to chromosome 3q21-q22 and is hypothesized to result from facial branchial motor neuron (FBMN) maldevelopment. In the present study, we report that HCFP1 results from heterozygous duplications within a neuron-specific GATA2 regulatory region that includes two enhancers and one silencer, and from noncoding single-nucleotide variants (SNVs) within the silencer. Some SNVs impair binding of NR2F1 to the silencer in vitro and in vivo and attenuate in vivo enhancer reporter expression in FBMNs. Gata2 and its effector Gata3 are essential for inner-ear efferent neuron (IEE) but not FBMN development. A humanized HCFP1 mouse model extends Gata2 expression, favors the formation of IEEs over FBMNs and is rescued by conditional loss of Gata3. These findings highlight the importance of temporal gene regulation in development and of noncoding variation in rare mendelian disease.
Collapse
|
8
|
Combinatorial transcription factor binding encodes cis-regulatory wiring of forebrain GABAergic neurogenesis. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.06.28.546894. [PMID: 37425940 PMCID: PMC10327028 DOI: 10.1101/2023.06.28.546894] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/11/2023]
Abstract
Transcription factors (TFs) bind combinatorially to genomic cis-regulatory elements (cREs), orchestrating transcription programs. While studies of chromatin state and chromosomal interactions have revealed dynamic neurodevelopmental cRE landscapes, parallel understanding of the underlying TF binding lags. To elucidate the combinatorial TF-cRE interactions driving mouse basal ganglia development, we integrated ChIP-seq for twelve TFs, H3K4me3-associated enhancer-promoter interactions, chromatin and transcriptional state, and transgenic enhancer assays. We identified TF-cREs modules with distinct chromatin features and enhancer activity that have complementary roles driving GABAergic neurogenesis and suppressing other developmental fates. While the majority of distal cREs were bound by one or two TFs, a small proportion were extensively bound, and these enhancers also exhibited exceptional evolutionary conservation, motif density, and complex chromosomal interactions. Our results provide new insights into how modules of combinatorial TF-cRE interactions activate and repress developmental expression programs and demonstrate the value of TF binding data in modeling gene regulatory wiring.
Collapse
|
9
|
Cell Type- and Tissue-specific Enhancers in Craniofacial Development. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.06.26.546603. [PMID: 37425964 PMCID: PMC10327103 DOI: 10.1101/2023.06.26.546603] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/11/2023]
Abstract
The genetic basis of craniofacial birth defects and general variation in human facial shape remains poorly understood. Distant-acting transcriptional enhancers are a major category of non-coding genome function and have been shown to control the fine-tuned spatiotemporal expression of genes during critical stages of craniofacial development1-3. However, a lack of accurate maps of the genomic location and cell type-specific in vivo activities of all craniofacial enhancers prevents their systematic exploration in human genetics studies. Here, we combined histone modification and chromatin accessibility profiling from different stages of human craniofacial development with single-cell analyses of the developing mouse face to create a comprehensive catalogue of the regulatory landscape of facial development at tissue- and single cell-resolution. In total, we identified approximately 14,000 enhancers across seven developmental stages from weeks 4 through 8 of human embryonic face development. We used transgenic mouse reporter assays to determine the in vivo activity patterns of human face enhancers predicted from these data. Across 16 in vivo validated human enhancers, we observed a rich diversity of craniofacial subregions in which these enhancers are active in vivo. To annotate the cell type specificities of human-mouse conserved enhancers, we performed single-cell RNA-seq and single-nucleus ATAC-seq of mouse craniofacial tissues from embryonic days e11.5 to e15.5. By integrating these data across species, we find that the majority (56%) of human craniofacial enhancers are functionally conserved in mice, providing cell type- and embryonic stage-resolved predictions of their in vivo activity profiles. Using retrospective analysis of known craniofacial enhancers in combination with single cell-resolved transgenic reporter assays, we demonstrate the utility of these data for predicting the in vivo cell type specificity of enhancers. Taken together, our data provide an expansive resource for genetic and developmental studies of human craniofacial development.
Collapse
|
10
|
Topologically associating domain boundaries are required for normal genome function. Commun Biol 2023; 6:435. [PMID: 37081156 PMCID: PMC10119121 DOI: 10.1038/s42003-023-04819-w] [Citation(s) in RCA: 13] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2022] [Accepted: 04/06/2023] [Indexed: 04/22/2023] Open
Abstract
Topologically associating domain (TAD) boundaries partition the genome into distinct regulatory territories. Anecdotal evidence suggests that their disruption may interfere with normal gene expression and cause disease phenotypes1-3, but the overall extent to which this occurs remains unknown. Here we demonstrate that targeted deletions of TAD boundaries cause a range of disruptions to normal in vivo genome function and organismal development. We used CRISPR genome editing in mice to individually delete eight TAD boundaries (11-80 kb in size) from the genome. All deletions examined resulted in detectable molecular or organismal phenotypes, which included altered chromatin interactions or gene expression, reduced viability, and anatomical phenotypes. We observed changes in local 3D chromatin architecture in 7 of 8 (88%) cases, including the merging of TADs and altered contact frequencies within TADs adjacent to the deleted boundary. For 5 of 8 (63%) loci examined, boundary deletions were associated with increased embryonic lethality or other developmental phenotypes. For example, a TAD boundary deletion near Smad3/Smad6 caused complete embryonic lethality, while a deletion near Tbx5/Lhx5 resulted in a severe lung malformation. Our findings demonstrate the importance of TAD boundary sequences for in vivo genome function and reinforce the critical need to carefully consider the potential pathogenicity of noncoding deletions affecting TAD boundaries in clinical genetics screening.
Collapse
|
11
|
Genetic determinants of switchgrass-root-associated microbiota in field sites spanning its natural range. Curr Biol 2023; 33:1926-1938.e6. [PMID: 37080198 DOI: 10.1016/j.cub.2023.03.078] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2022] [Revised: 02/03/2023] [Accepted: 03/27/2023] [Indexed: 04/22/2023]
Abstract
A fundamental goal in plant microbiome research is to determine the relative impacts of host and environmental effects on root microbiota composition, particularly how host genotype impacts bacterial community composition. Most studies characterizing the effect of plant genotype on root microbiota undersample host genetic diversity and grow plants outside of their native ranges, making the associations between host and microbes difficult to interpret. Here, we characterized the root microbiota of a large diversity panel of switchgrass, a North American native C4 bioenergy crop, in three field locations spanning its native range. Our data, composed of 1,961 samples, suggest that field location is the primary determinant of microbiome composition; however, substantial heritable variation is widespread across bacterial taxa, especially those in the Sphingomonadaceae family. Despite diverse compositions, relatively few highly prevalent taxa make up the majority of the switchgrass root microbiota, a large fraction of which is shared across sites. Local genotypes preferentially recruit/filter for local microbes, supporting the idea of affinity between local plants and their microbiota. Using genome-wide association, we identified loci impacting the abundance of >400 microbial strains and found an enrichment of genes involved in immune responses, signaling pathways, and secondary metabolism. We found loci associated with over half of the core microbiota (i.e., microbes in >80% of samples), regardless of field location. Finally, we show a genetic relationship between a basal plant immunity pathway and relative abundances of root microbiota. This study brings us closer to harnessing and manipulating beneficial microbial associations via host genetics.
Collapse
|
12
|
Single cell evaluation of endocardial Hand2 gene regulatory networks reveals HAND2-dependent pathways that impact cardiac morphogenesis. Development 2023; 150:dev201341. [PMID: 36620995 PMCID: PMC10110492 DOI: 10.1242/dev.201341] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2022] [Accepted: 12/26/2022] [Indexed: 01/10/2023]
Abstract
The transcription factor HAND2 plays essential roles during cardiogenesis. Hand2 endocardial deletion (H2CKO) results in tricuspid atresia or double inlet left ventricle with accompanying intraventricular septum defects, hypo-trabeculated ventricles and an increased density of coronary lumens. To understand the regulatory mechanisms of these phenotypes, single cell transcriptome analysis of mouse E11.5 H2CKO hearts was performed revealing a number of disrupted endocardial regulatory pathways. Using HAND2 DNA occupancy data, we identify several HAND2-dependent enhancers, including two endothelial enhancers for the shear-stress master regulator KLF2. A 1.8 kb enhancer located 50 kb upstream of the Klf2 TSS imparts specific endothelial/endocardial expression within the vasculature and endocardium. This enhancer is HAND2-dependent for ventricular endocardium expression but HAND2-independent for Klf2 vascular and valve expression. Deletion of this Klf2 enhancer results in reduced Klf2 expression within ventricular endocardium. These data reveal that HAND2 functions within endocardial gene regulatory networks including shear-stress response.
Collapse
|
13
|
Abstract
Establishing causal links between inherited polymorphisms and cancer risk is challenging. Here, we focus on the single-nucleotide polymorphism rs55705857, which confers a sixfold greater risk of isocitrate dehydrogenase (IDH)-mutant low-grade glioma (LGG). We reveal that rs55705857 itself is the causal variant and is associated with molecular pathways that drive LGG. Mechanistically, we show that rs55705857 resides within a brain-specific enhancer, where the risk allele disrupts OCT2/4 binding, allowing increased interaction with the Myc promoter and increased Myc expression. Mutating the orthologous mouse rs55705857 locus accelerated tumor development in an Idh1R132H-driven LGG mouse model from 472 to 172 days and increased penetrance from 30% to 75%. Our work reveals mechanisms of the heritable predisposition to lethal glioma in ~40% of LGG patients.
Collapse
|
14
|
Genome-wide fetalization of enhancer architecture in heart disease. Cell Rep 2022; 40:111400. [PMID: 36130500 PMCID: PMC9534044 DOI: 10.1016/j.celrep.2022.111400] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2019] [Revised: 06/10/2022] [Accepted: 09/01/2022] [Indexed: 11/22/2022] Open
Abstract
Heart disease is associated with re-expression of key transcription factors normally active only during prenatal development of the heart. However, the impact of this reactivation on the regulatory landscape in heart disease is unclear. Here, we use RNA-seq and ChIP-seq targeting a histone modification associated with active transcriptional enhancers to generate genome-wide enhancer maps from left ventricle tissue from up to 26 healthy controls, 18 individuals with idiopathic dilated cardiomyopathy (DCM), and five fetal hearts. Healthy individuals have a highly reproducible epigenomic landscape, consisting of more than 33,000 predicted heart enhancers. In contrast, we observe reproducible disease-associated changes in activity at 6,850 predicted heart enhancers. Combined analysis of adult and fetal samples reveals that the heart disease epigenome and transcriptome both acquire fetal-like characteristics, with 3,400 individual enhancers sharing fetal regulatory properties. We also provide a comprehensive data resource (http://heart.lbl.gov) for the mechanistic exploration of DCM etiology.
Collapse
|
15
|
Author Correction: Expanded encyclopaedias of DNA elements in the human and mouse genomes. Nature 2022; 605:E3. [PMID: 35474001 PMCID: PMC9095460 DOI: 10.1038/s41586-021-04226-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
|
16
|
Abstract
Across the human genome, there are nearly 500 'ultraconserved' elements: regions of at least 200 contiguous nucleotides that are perfectly conserved in both the mouse and rat genomes. Remarkably, the majority of these sequences are non-coding, and many can function as enhancers that activate tissue-specific gene expression during embryonic development. From their first description more than 15 years ago, their extreme conservation has both fascinated and perplexed researchers in genomics and evolutionary biology. The intrigue around ultraconserved elements only grew with the observation that they are dispensable for viability. Here, we review recent progress towards understanding the general importance and the specific functions of ultraconserved sequences in mammalian development and human disease and discuss possible explanations for their extreme conservation.
Collapse
|
17
|
Abstract
Embryonic morphogenesis is strictly dependent on tight spatiotemporal control of developmental gene expression, which is typically achieved through the concerted activity of multiple enhancers driving cell type-specific expression of a target gene. Mammalian genomes are organized in topologically associated domains, providing a preferred environment and framework for interactions between transcriptional enhancers and gene promoters. While epigenomic profiling and three-dimensional chromatin conformation capture have significantly increased the accuracy of identifying enhancers, assessment of subregional enhancer activities via transgenic reporter assays in mice remains the gold standard for assigning enhancer activity in vivo. Once this activity is defined, the ideal method to explore the functional necessity of a transcriptional enhancer and its contribution to target gene dosage and morphological or physiological processes is deletion of the enhancer sequence from the mouse genome. Here we present detailed protocols for efficient introduction of enhancer-reporter transgenes and CRISPR-mediated genomic deletions into the mouse genome, including a step-by-step guide for pronuclear microinjection of fertilized mouse eggs. We provide instructions for the assembly and genomic integration of enhancer-reporter cassettes that have been used for validation of thousands of putative enhancer sequences accessible through the VISTA enhancer browser, including a recently published method for robust site-directed transgenesis at the H11 safe-harbor locus. Together, these methods enable rapid and large-scale assessment of enhancer activities and sequence variants in mice, which is essential to understand mammalian genome function and genetic diseases.
Collapse
|
18
|
Abstract
The α- and β-globin loci harbor developmentally expressed genes, which are silenced throughout post-natal life. Reactivation of these genes may offer therapeutic approaches for the hemoglobinopathies, the most common single gene disorders. Here, we address mechanisms regulating the embryonically expressed α-like globin, termed ζ-globin. We show that in embryonic erythroid cells, the ζ-gene lies within a ~65 kb sub-TAD (topologically associating domain) of open, acetylated chromatin and interacts with the α-globin super-enhancer. By contrast, in adult erythroid cells, the ζ-gene is packaged within a small (~10 kb) sub-domain of hypoacetylated, facultative heterochromatin within the acetylated sub-TAD and that it no longer interacts with its enhancers. The ζ-gene can be partially re-activated by acetylation and inhibition of histone de-acetylases. In addition to suggesting therapies for severe α-thalassemia, these findings illustrate the general principles by which reactivation of developmental genes may rescue abnormalities arising from mutations in their adult paralogues.
Collapse
|
19
|
Coding and noncoding variants in EBF3 are involved in HADDS and simplex autism. Hum Genomics 2021; 15:44. [PMID: 34256850 PMCID: PMC8278787 DOI: 10.1186/s40246-021-00342-3] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2021] [Accepted: 06/17/2021] [Indexed: 01/01/2023] Open
Abstract
BACKGROUND Previous research in autism and other neurodevelopmental disorders (NDDs) has indicated an important contribution of protein-coding (coding) de novo variants (DNVs) within specific genes. The role of de novo noncoding variation has been observable as a general increase in genetic burden but has yet to be resolved to individual functional elements. In this study, we assessed whole-genome sequencing data in 2671 families with autism (discovery cohort of 516 families, replication cohort of 2155 families). We focused on DNVs in enhancers with characterized in vivo activity in the brain and identified an excess of DNVs in an enhancer named hs737. RESULTS We adapted the fitDNM statistical model to work in noncoding regions and tested enhancers for excess of DNVs in families with autism. We found only one enhancer (hs737) with nominal significance in the discovery (p = 0.0172), replication (p = 2.5 × 10-3), and combined dataset (p = 1.1 × 10-4). Each individual with a DNV in hs737 had shared phenotypes including being male, intact cognitive function, and hypotonia or motor delay. Our in vitro assessment of the DNVs showed they all reduce enhancer activity in a neuronal cell line. By epigenomic analyses, we found that hs737 is brain-specific and targets the transcription factor gene EBF3 in human fetal brain. EBF3 is genome-wide significant for coding DNVs in NDDs (missense p = 8.12 × 10-35, loss-of-function p = 2.26 × 10-13) and is widely expressed in the body. Through characterization of promoters bound by EBF3 in neuronal cells, we saw enrichment for binding to NDD genes (p = 7.43 × 10-6, OR = 1.87) involved in gene regulation. Individuals with coding DNVs have greater phenotypic severity (hypotonia, ataxia, and delayed development syndrome [HADDS]) in comparison to individuals with noncoding DNVs that have autism and hypotonia. CONCLUSIONS In this study, we identify DNVs in the hs737 enhancer in individuals with autism. Through multiple approaches, we find hs737 targets the gene EBF3 that is genome-wide significant in NDDs. By assessment of noncoding variation and the genes they affect, we are beginning to understand their impact on gene regulatory networks in NDDs.
Collapse
|
20
|
Deletion of a non-canonical regulatory sequence causes loss of Scn1a expression and epileptic phenotypes in mice. Genome Med 2021; 13:69. [PMID: 33910599 PMCID: PMC8080386 DOI: 10.1186/s13073-021-00884-0] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2020] [Accepted: 04/06/2021] [Indexed: 12/16/2022] Open
Abstract
BACKGROUND Genes with multiple co-active promoters appear common in brain, yet little is known about functional requirements for these potentially redundant genomic regulatory elements. SCN1A, which encodes the NaV1.1 sodium channel alpha subunit, is one such gene with two co-active promoters. Mutations in SCN1A are associated with epilepsy, including Dravet syndrome (DS). The majority of DS patients harbor coding mutations causing SCN1A haploinsufficiency; however, putative causal non-coding promoter mutations have been identified. METHODS To determine the functional role of one of these potentially redundant Scn1a promoters, we focused on the non-coding Scn1a 1b regulatory region, previously described as a non-canonical alternative transcriptional start site. We generated a transgenic mouse line with deletion of the extended evolutionarily conserved 1b non-coding interval and characterized changes in gene and protein expression, and assessed seizure activity and alterations in behavior. RESULTS Mice harboring a deletion of the 1b non-coding interval exhibited surprisingly severe reductions of Scn1a and NaV1.1 expression throughout the brain. This was accompanied by electroencephalographic and thermal-evoked seizures, and behavioral deficits. CONCLUSIONS This work contributes to functional dissection of the regulatory wiring of a major epilepsy risk gene, SCN1A. We identified the 1b region as a critical disease-relevant regulatory element and provide evidence that non-canonical and seemingly redundant promoters can have essential function.
Collapse
|
21
|
Ultraconserved enhancer function does not require perfect sequence conservation. Nat Genet 2021; 53:521-528. [PMID: 33782603 PMCID: PMC8038972 DOI: 10.1038/s41588-021-00812-3] [Citation(s) in RCA: 29] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2020] [Accepted: 02/04/2021] [Indexed: 01/09/2023]
Abstract
Ultraconserved enhancer sequences show perfect conservation between human and rodent genomes, suggesting that their functions are highly sensitive to mutation. However, current models of enhancer function do not sufficiently explain this extreme evolutionary constraint. We subjected 23 ultraconserved enhancers to different levels of mutagenesis, collectively introducing 1,547 mutations, and examined their activities in transgenic mouse reporter assays. Overall, we find that the regulatory properties of ultraconserved enhancers are robust to mutation. Upon mutagenesis, nearly all (19/23, 83%) still functioned as enhancers at one developmental stage, as did most of those tested again later in development (5/9, 56%). Replacement of endogenous enhancers with mutated alleles in mice corroborated results of transgenic assays, including the functional resilience of ultraconserved enhancers to mutation. Our findings show that the currently known activities of ultraconserved enhancers do not necessarily require the perfect conservation observed in evolution and suggest that additional regulatory or other functions contribute to their sequence constraint.
Collapse
|
22
|
HAND transcription factors cooperatively specify the aorta and pulmonary trunk. Dev Biol 2021; 476:1-10. [PMID: 33757801 DOI: 10.1016/j.ydbio.2021.03.011] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2020] [Revised: 03/11/2021] [Accepted: 03/15/2021] [Indexed: 01/11/2023]
Abstract
Congenital heart defects (CHDs) affecting the cardiac outflow tract (OFT) constitute a significant cause of morbidity and mortality. The OFT develops from migratory cell populations which include the cardiac neural crest cells (cNCCs) and secondary heart field (SHF) derived myocardium and endocardium. The related transcription factors HAND1 and HAND2 have been implicated in human CHDs involving the OFT. Although Hand1 is expressed within the OFT, Hand1 NCC-specific conditional knockout mice (H1CKOs) are viable. Here we show that these H1CKOs present a low penetrance of OFT phenotypes, whereas SHF-specific Hand1 ablation does not reveal any cardiac phenotypes. Further, HAND1 and HAND2 appear functionally redundant within the cNCCs, as a reduction/ablation of Hand2 on an NCC-specific H1CKO background causes pronounced OFT defects. Double conditional Hand1 and Hand2 NCC knockouts exhibit persistent truncus arteriosus (PTA) with 100% penetrance. NCC lineage-tracing and Sema3c in situ mRNA expression reveal that Sema3c-expressing cells are mis-localized, resulting in a malformed septal bridge within the OFTs of H1CKO;H2CKO embryos. Interestingly, Hand1 and Hand2 also genetically interact within the SHF, as SHF H1CKOs on a heterozygous Hand2 background exhibit Ventricular Septal Defects (VSDs) with incomplete penetrance. Previously, we identified a BMP, HAND2, and GATA-dependent Hand1 OFT enhancer sufficient to drive reporter gene expression within the nascent OFT and aorta. Using these transcription inputs as a probe, we identify a novel Hand2 OFT enhancer, suggesting that a conserved BMP-GATA dependent mechanism transcriptionally regulates both HAND factors. These findings support the hypothesis that HAND factors interpret BMP signaling within the cNCCs to cooperatively coordinate OFT morphogenesis.
Collapse
|
23
|
Abstract
A Correction to this paper has been published: https://doi.org/10.1038/s41586-020-03089-4.
Collapse
|
24
|
Abstract
RATIONALE Cardiac pacemaker cells (PCs) in the sinoatrial node (SAN) have a distinct gene expression program that allows them to fire automatically and initiate the heartbeat. Although critical SAN transcription factors, including Isl1 (Islet-1), Tbx3 (T-box transcription factor 3), and Shox2 (short-stature homeobox protein 2), have been identified, the cis-regulatory architecture that governs PC-specific gene expression is not understood, and discrete enhancers required for gene regulation in the SAN have not been identified. OBJECTIVE To define the epigenetic profile of PCs using comparative ATAC-seq (assay for transposase-accessible chromatin with sequencing) and to identify novel enhancers involved in SAN gene regulation, development, and function. METHODS AND RESULTS We used ATAC-seq on sorted neonatal mouse SAN to compare regions of accessible chromatin in PCs and right atrial cardiomyocytes. PC-enriched assay for transposase-accessible chromatin peaks, representing candidate SAN regulatory elements, were located near established SAN genes and were enriched for distinct sets of TF (transcription factor) binding sites. Among several novel SAN enhancers that were experimentally validated using transgenic mice, we identified a 2.9-kb regulatory element at the Isl1 locus that was active specifically in the cardiac inflow at embryonic day 8.5 and throughout later SAN development and maturation. Deletion of this enhancer from the genome of mice resulted in SAN hypoplasia and sinus arrhythmias. The mouse SAN enhancer also directed reporter activity to the inflow tract in developing zebrafish hearts, demonstrating deep conservation of its upstream regulatory network. Finally, single nucleotide polymorphisms in the human genome that occur near the region syntenic to the mouse enhancer exhibit significant associations with resting heart rate in human populations. CONCLUSIONS (1) PCs have distinct regions of accessible chromatin that correlate with their gene expression profile and contain novel SAN enhancers, (2) cis-regulation of Isl1 specifically in the SAN depends upon a conserved SAN enhancer that regulates PC development and SAN function, and (3) a corresponding human ISL1 enhancer may regulate human SAN function.
Collapse
|
25
|
Author Correction: An atlas of dynamic chromatin landscapes in mouse fetal development. Nature 2020; 586:E31. [PMID: 33037424 PMCID: PMC7962567 DOI: 10.1038/s41586-020-2841-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
|
26
|
Genomic Resolution of DLX-Orchestrated Transcriptional Circuits Driving Development of Forebrain GABAergic Neurons. Cell Rep 2020; 28:2048-2063.e8. [PMID: 31433982 PMCID: PMC6750766 DOI: 10.1016/j.celrep.2019.07.022] [Citation(s) in RCA: 48] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2019] [Revised: 05/29/2019] [Accepted: 07/08/2019] [Indexed: 11/24/2022] Open
Abstract
DLX transcription factors (TFs) are master regulators of the developing vertebrate brain, driving forebrain GABAergic neuronal differentiation. Ablation of Dlx1&2 alters expression of genes that are critical for forebrain GABAergic development. We integrated epigenomic and transcriptomic analyses, complemented with in situ hybridization (ISH), and in vivo and in vitro studies of regulatory element (RE) function. This revealed the DLX-organized gene regulatory network at genomic, cellular, and spatial levels in mouse embryonic basal ganglia. DLX TFs perform dual activating and repressing functions; the consequences of their binding were determined by the sequence and genomic context of target loci. Our results reveal and, in part, explain the paradox of widespread DLX binding contrasted with a limited subset of target loci that are sensitive at the epigenomic and transcriptomic level to Dlx1&2 ablation. The regulatory properties identified here for DLX TFs suggest general mechanisms by which TFs orchestrate dynamic expression programs underlying neurodevelopment. Lindtner et al. reveal the regulatory wiring organized by DLX transcription factors in forebrain GABAergic neuronal specification, by integrating functional genomic, epigenomic, and genetic data on a transgenic mouse model. This network determines key sequence-encoded regulatory elements and implicates a combination of histone modifications and biophysical interactions.
Collapse
|
27
|
Spatiotemporal DNA methylome dynamics of the developing mouse fetus. Nature 2020; 583:752-759. [PMID: 32728242 PMCID: PMC7398276 DOI: 10.1038/s41586-020-2119-x] [Citation(s) in RCA: 67] [Impact Index Per Article: 16.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2017] [Accepted: 06/11/2019] [Indexed: 01/10/2023]
Abstract
Cytosine DNA methylation is essential for mammalian development but understanding of its spatiotemporal distribution in the developing embryo remains limited1,2. Here, as part of the mouse Encyclopedia of DNA Elements (ENCODE) project, we profiled 168 methylomes from 12 mouse tissues or organs at 9 developmental stages from embryogenesis to adulthood. We identified 1,808,810 genomic regions that showed variations in CG methylation by comparing the methylomes of different tissues or organs from different developmental stages. These DNA elements predominantly lose CG methylation during fetal development, whereas the trend is reversed after birth. During late stages of fetal development, non-CG methylation accumulated within the bodies of key developmental transcription factor genes, coinciding with their transcriptional repression. Integration of genome-wide DNA methylation, histone modification and chromatin accessibility data enabled us to predict 461,141 putative developmental tissue-specific enhancers, the human orthologues of which were enriched for disease-associated genetic variants. These spatiotemporal epigenome maps provide a resource for studies of gene regulation during tissue or organ progression, and a starting point for investigating regulatory elements that are involved in human developmental disorders. Analysis of 168 methylomes from 12 mouse tissues at 9 developmental stages sheds light on the epigenetic and regulatory landscape during mammalian fetal development.
Collapse
|
28
|
Abstract
The Encyclopedia of DNA Elements (ENCODE) project has established a genomic resource for mammalian development, profiling a diverse panel of mouse tissues at 8 developmental stages from 10.5 days after conception until birth, including transcriptomes, methylomes and chromatin states. Here we systematically examined the state and accessibility of chromatin in the developing mouse fetus. In total we performed 1,128 chromatin immunoprecipitation with sequencing (ChIP-seq) assays for histone modifications and 132 assay for transposase-accessible chromatin using sequencing (ATAC-seq) assays for chromatin accessibility across 72 distinct tissue-stages. We used integrative analysis to develop a unified set of chromatin state annotations, infer the identities of dynamic enhancers and key transcriptional regulators, and characterize the relationship between chromatin state and accessibility during developmental gene regulation. We also leveraged these data to link enhancers to putative target genes and demonstrate tissue-specific enrichments of sequence variants associated with disease in humans. The mouse ENCODE data sets provide a compendium of resources for biomedical researchers and achieve, to our knowledge, the most comprehensive view of chromatin dynamics during mammalian fetal development to date.
Collapse
|
29
|
Abstract
The human and mouse genomes contain instructions that specify RNAs and proteins and govern the timing, magnitude, and cellular context of their production. To better delineate these elements, phase III of the Encyclopedia of DNA Elements (ENCODE) Project has expanded analysis of the cell and tissue repertoires of RNA transcription, chromatin structure and modification, DNA methylation, chromatin looping, and occupancy by transcription factors and RNA-binding proteins. Here we summarize these efforts, which have produced 5,992 new experimental datasets, including systematic determinations across mouse fetal development. All data are available through the ENCODE data portal (https://www.encodeproject.org), including phase II ENCODE1 and Roadmap Epigenomics2 data. We have developed a registry of 926,535 human and 339,815 mouse candidate cis-regulatory elements, covering 7.9 and 3.4% of their respective genomes, by integrating selected datatypes associated with gene regulation, and constructed a web-based server (SCREEN; http://screen.encodeproject.org) to provide flexible, user-defined access to this resource. Collectively, the ENCODE data and registry provide an expansive resource for the scientific community to build a better understanding of the organization and function of the human and mouse genomes.
Collapse
|
30
|
The changing mouse embryo transcriptome at whole tissue and single-cell resolution. Nature 2020; 583:760-767. [PMID: 32728245 PMCID: PMC7410830 DOI: 10.1038/s41586-020-2536-x] [Citation(s) in RCA: 84] [Impact Index Per Article: 21.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2018] [Accepted: 06/22/2020] [Indexed: 02/07/2023]
Abstract
During mammalian embryogenesis, differential gene expression gradually builds the identity and complexity of each tissue and organ system1. Here we systematically quantified mouse polyA-RNA from day 10.5 of embryonic development to birth, sampling 17 tissues and organs. The resulting developmental transcriptome is globally structured by dynamic cytodifferentiation, body-axis and cell-proliferation gene sets that were further characterized by the transcription factor motif codes of their promoters. We decomposed the tissue-level transcriptome using single-cell RNA-seq (sequencing of RNA reverse transcribed into cDNA) and found that neurogenesis and haematopoiesis dominate at both the gene and cellular levels, jointly accounting for one-third of differential gene expression and more than 40% of identified cell types. By integrating promoter sequence motifs with companion ENCODE epigenomic profiles, we identified a prominent promoter de-repression mechanism in neuronal expression clusters that was attributable to known and novel repressors. Focusing on the developing limb, single-cell RNA data identified 25 candidate cell types that included progenitor and differentiating states with computationally inferred lineage relationships. We extracted cell-type transcription factor networks and complementary sets of candidate enhancer elements by using single-cell RNA-seq to decompose integrative cis-element (IDEAS) models that were derived from whole-tissue epigenome chromatin data. These ENCODE reference data, computed network components and IDEAS chromatin segmentations are companion resources to the matching epigenomic developmental matrix, and are available for researchers to further mine and integrate.
Collapse
|
31
|
Stable enhancers are active in development, and fragile enhancers are associated with evolutionary adaptation. Genome Biol 2019; 20:140. [PMID: 31307522 PMCID: PMC6631995 DOI: 10.1186/s13059-019-1750-z] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2019] [Accepted: 06/28/2019] [Indexed: 12/13/2022] Open
Abstract
Background Despite continual progress in the identification and characterization of trait- and disease-associated variants that disrupt transcription factor (TF)-DNA binding, little is known about the distribution of TF binding deactivating mutations (deMs) in enhancer sequences. Here, we focus on elucidating the mechanism underlying the different densities of deMs in human enhancers. Results We identify two classes of enhancers based on the density of nucleotides prone to deMs. Firstly, fragile enhancers with abundant deM nucleotides are associated with the immune system and regular cellular maintenance. Secondly, stable enhancers with only a few deM nucleotides are associated with the development and regulation of TFs and are evolutionarily conserved. These two classes of enhancers feature different regulatory programs: the binding sites of pioneer TFs of FOX family are specifically enriched in stable enhancers, while tissue-specific TFs are enriched in fragile enhancers. Moreover, stable enhancers are more tolerant of deMs due to their dominant employment of homotypic TF binding site (TFBS) clusters, as opposed to the larger-extent usage of heterotypic TFBS clusters in fragile enhancers. Notably, the sequence environment and chromatin context of the cognate motif, other than the motif itself, contribute more to the susceptibility to deMs of TF binding. Conclusions This dichotomy of enhancer activity is conserved across different tissues, has a specific footprint in epigenetic profiles, and argues for a bimodal evolution of gene regulatory programs in vertebrates. Specifically encoded stable enhancers are evolutionarily conserved and associated with development, while differently encoded fragile enhancers are associated with the adaptation of species. Electronic supplementary material The online version of this article (10.1186/s13059-019-1750-z) contains supplementary material, which is available to authorized users.
Collapse
|
32
|
Dynamic BAF chromatin remodeling complex subunit inclusion promotes temporally distinct gene expression programs in cardiogenesis. Development 2019; 146:dev.174086. [PMID: 30814119 PMCID: PMC6803373 DOI: 10.1242/dev.174086] [Citation(s) in RCA: 28] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2018] [Accepted: 02/19/2019] [Indexed: 01/31/2023]
Abstract
Chromatin remodeling complexes instruct cellular differentiation and lineage specific transcription. The BRG1/BRM-associated factor (BAF) complexes are important for several aspects of differentiation. We show that the catalytic subunit gene Brg1 has a specific role in cardiac precursors (CPs) to initiate cardiac gene expression programs and repress non-cardiac expression. Using immunopurification with mass spectrometry, we have determined the dynamic composition of BAF complexes during mammalian cardiac differentiation, identifying several cell-type specific subunits. We focused on the CP- and cardiomyocyte (CM)-enriched subunits BAF60c (SMARCD3) and BAF170 (SMARCC2). Baf60c and Baf170 co-regulate gene expression with Brg1 in CPs, and in CMs their loss results in broadly deregulated cardiac gene expression. BRG1, BAF60c and BAF170 modulate chromatin accessibility, to promote accessibility at activated genes while closing chromatin at repressed genes. BAF60c and BAF170 are required for proper BAF complex composition, and BAF170 loss leads to retention of BRG1 at CP-specific sites. Thus, dynamic interdependent BAF complex subunit assembly modulates chromatin states and thereby participates in directing temporal gene expression programs in cardiogenesis.
Collapse
|
33
|
Abstract
Many components of the circadian molecular clock are conserved from flies to mammals; however, the role of mammalian Timeless remains ambiguous. Here, we report a mutation in the human TIMELESS (hTIM) gene that causes familial advanced sleep phase (FASP). Tim CRISPR mutant mice exhibit FASP with altered photic entrainment but normal circadian period. We demonstrate that the mutation prevents TIM accumulation in the nucleus and has altered affinity for CRY2, leading to destabilization of PER/CRY complex and a shortened period in nonmature mouse embryonic fibroblasts (MEFs). We conclude that TIM, when excluded from the nucleus, can destabilize the negative regulators of the circadian clock, alter light entrainment, and cause FASP.
Collapse
|
34
|
Parkinson-Associated SNCA Enhancer Variants Revealed by Open Chromatin in Mouse Dopamine Neurons. Am J Hum Genet 2018; 103:874-892. [PMID: 30503521 DOI: 10.1016/j.ajhg.2018.10.018] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2018] [Accepted: 10/17/2018] [Indexed: 12/31/2022] Open
Abstract
The progressive loss of midbrain (MB) dopaminergic (DA) neurons defines the motor features of Parkinson disease (PD), and modulation of risk by common variants in PD has been well established through genome-wide association studies (GWASs). We acquired open chromatin signatures of purified embryonic mouse MB DA neurons because we anticipated that a fraction of PD-associated genetic variation might mediate the variants' effects within this neuronal population. Correlation with >2,300 putative enhancers assayed in mice revealed enrichment for MB cis-regulatory elements (CREs), and these data were reinforced by transgenic analyses of six additional sequences in zebrafish and mice. One CRE, within intron 4 of the familial PD gene SNCA, directed reporter expression in catecholaminergic neurons from transgenic mice and zebrafish. Sequencing of this CRE in 986 individuals with PD and 992 controls revealed two common variants associated with elevated PD risk. To assess potential mechanisms of action, we screened >16,000 proteins for DNA binding capacity and identified a subset whose binding is impacted by these enhancer variants. Additional genotyping across the SNCA locus identified a single PD-associated haplotype, containing the minor alleles of both of the aforementioned PD-risk variants. Our work posits a model for how common variation at SNCA might modulate PD risk and highlights the value of cell-context-dependent guided searches for functional non-coding variation.
Collapse
|
35
|
Relationship between genetic variation at PPP1R3B and levels of liver glycogen and triglyceride. Hepatology 2018; 67:2182-2195. [PMID: 29266543 PMCID: PMC5991995 DOI: 10.1002/hep.29751] [Citation(s) in RCA: 41] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/15/2017] [Revised: 10/31/2017] [Accepted: 12/18/2017] [Indexed: 12/15/2022]
Abstract
UNLABELLED Genetic variation at rs4240624 on chromosome 8 is associated with an attenuated signal on hepatic computerized tomography, which has been attributed to changes in hepatic fat. The closest coding gene to rs4240624, PPP1R3B, encodes a protein that promotes hepatic glycogen synthesis. Here, we performed studies to determine whether the x-ray attenuation associated with rs4240624 is due to differences in hepatic glycogen or hepatic triglyceride content (HTGC). A sequence variant in complete linkage disequilibrium with rs4240624, rs4841132, was genotyped in the Dallas Heart Study (DHS), the Dallas Liver Study, and the Copenhagen Cohort (n = 112,428) of whom 1,539 had nonviral liver disease. The minor A-allele of rs4841132 was associated with increased hepatic x-ray attenuation (n = 1,572; P = 4 × 10-5 ), but not with HTGC (n = 2,674; P = 0.58). Rs4841132-A was associated with modest, but significant, elevations in serum alanine aminotransferase (ALT) in the Copenhagen Cohort (P = 3 × 10-4 ) and the DHS (P = 0.004), and with odds ratios for liver disease of 1.13 (95% CI, 0.97-1.31) and 1.23 (1.01-1.51), respectively. Mice lacking protein phosphatase 1 regulatory subunit 3B (PPP1R3B) were deficient in hepatic glycogen, whereas HTGC was unchanged. Hepatic overexpression of PPP1R3B caused accumulation of hepatic glycogen and elevated plasma levels of ALT, but did not change HTGC. CONCLUSION These observations are consistent with the notion that the minor allele of rs4841132 promotes a mild form of hepatic glycogenosis that is associated with hepatic injury. (Hepatology 2018;67:2182-2195).
Collapse
|
36
|
Ultraconserved Enhancers Are Required for Normal Development. Cell 2018; 172:491-499.e15. [PMID: 29358049 DOI: 10.1016/j.cell.2017.12.017] [Citation(s) in RCA: 125] [Impact Index Per Article: 20.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2017] [Revised: 10/27/2017] [Accepted: 12/11/2017] [Indexed: 01/26/2023]
Abstract
Non-coding "ultraconserved" regions containing hundreds of consecutive bases of perfect sequence conservation across mammalian genomes can function as distant-acting enhancers. However, initial deletion studies in mice revealed that loss of such extraordinarily constrained sequences had no immediate impact on viability. Here, we show that ultraconserved enhancers are required for normal development. Focusing on some of the longest ultraconserved sites genome wide, located near the essential neuronal transcription factor Arx, we used genome editing to create an expanded series of knockout mice lacking individual or combinations of ultraconserved enhancers. Mice with single or pairwise deletions of ultraconserved enhancers were viable and fertile but in nearly all cases showed neurological or growth abnormalities, including substantial alterations of neuron populations and structural brain defects. Our results demonstrate the functional importance of ultraconserved enhancers and indicate that remarkably strong sequence conservation likely results from fitness deficits that appear subtle in a laboratory setting.
Collapse
|
37
|
Genomic Patterns of De Novo Mutation in Simplex Autism. Cell 2017; 171:710-722.e12. [PMID: 28965761 PMCID: PMC5679715 DOI: 10.1016/j.cell.2017.08.047] [Citation(s) in RCA: 224] [Impact Index Per Article: 32.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2017] [Revised: 08/03/2017] [Accepted: 08/25/2017] [Indexed: 12/22/2022]
Abstract
To further our understanding of the genetic etiology of autism, we generated and analyzed genome sequence data from 516 idiopathic autism families (2,064 individuals). This resource includes >59 million single-nucleotide variants (SNVs) and 9,212 private copy number variants (CNVs), of which 133,992 and 88 are de novo mutations (DNMs), respectively. We estimate a mutation rate of ∼1.5 × 10-8 SNVs per site per generation with a significantly higher mutation rate in repetitive DNA. Comparing probands and unaffected siblings, we observe several DNM trends. Probands carry more gene-disruptive CNVs and SNVs, resulting in severe missense mutations and mapping to predicted fetal brain promoters and embryonic stem cell enhancers. These differences become more pronounced for autism genes (p = 1.8 × 10-3, OR = 2.2). Patients are more likely to carry multiple coding and noncoding DNMs in different genes, which are enriched for expression in striatal neurons (p = 3 × 10-3), suggesting a path forward for genetically characterizing more complex cases of autism.
Collapse
|
38
|
Cooperative activation of cardiac transcription through myocardin bridging of paired MEF2 sites. Development 2017; 144:1235-1241. [PMID: 28351867 DOI: 10.1242/dev.138487] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2016] [Accepted: 01/25/2017] [Indexed: 12/17/2022]
Abstract
Enhancers frequently contain multiple binding sites for the same transcription factor. These homotypic binding sites often exhibit synergy, whereby the transcriptional output from two or more binding sites is greater than the sum of the contributions of the individual binding sites alone. Although this phenomenon is frequently observed, the mechanistic basis for homotypic binding site synergy is poorly understood. Here, we identify a bona fide cardiac-specific Prkaa2 enhancer that is synergistically activated by homotypic MEF2 binding sites. We show that two MEF2 sites in the enhancer function cooperatively due to bridging of the MEF2C-bound sites by the SAP domain-containing co-activator protein myocardin, and we show that paired sites buffer the enhancer from integration site-dependent effects on transcription in vivo Paired MEF2 sites are prevalent in cardiac enhancers, suggesting that this might be a common mechanism underlying synergy in the control of cardiac gene expression in vivo.
Collapse
|
39
|
Limb-Enhancer Genie: An accessible resource of accurate enhancer predictions in the developing limb. PLoS Comput Biol 2017; 13:e1005720. [PMID: 28827824 PMCID: PMC5578682 DOI: 10.1371/journal.pcbi.1005720] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2017] [Revised: 08/31/2017] [Accepted: 08/03/2017] [Indexed: 11/18/2022] Open
Abstract
Epigenomic mapping of enhancer-associated chromatin modifications facilitates the genome-wide discovery of tissue-specific enhancers in vivo. However, reliance on single chromatin marks leads to high rates of false-positive predictions. More sophisticated, integrative methods have been described, but commonly suffer from limited accessibility to the resulting predictions and reduced biological interpretability. Here we present the Limb-Enhancer Genie (LEG), a collection of highly accurate, genome-wide predictions of enhancers in the developing limb, available through a user-friendly online interface. We predict limb enhancers using a combination of >50 published limb-specific datasets and clusters of evolutionarily conserved transcription factor binding sites, taking advantage of the patterns observed at previously in vivo validated elements. By combining different statistical models, our approach outperforms current state-of-the-art methods and provides interpretable measures of feature importance. Our results indicate that including a previously unappreciated score that quantifies tissue-specific nuclease accessibility significantly improves prediction performance. We demonstrate the utility of our approach through in vivo validation of newly predicted elements. Moreover, we describe general features that can guide the type of datasets to include when predicting tissue-specific enhancers genome-wide, while providing an accessible resource to the general biological community and facilitating the functional interpretation of genetic studies of limb malformations.
Collapse
|
40
|
Progressive Loss of Function in a Limb Enhancer during Snake Evolution. Cell 2016; 167:633-642.e11. [PMID: 27768887 DOI: 10.1016/j.cell.2016.09.028] [Citation(s) in RCA: 190] [Impact Index Per Article: 23.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2016] [Revised: 08/07/2016] [Accepted: 09/15/2016] [Indexed: 01/08/2023]
Abstract
The evolution of body shape is thought to be tightly coupled to changes in regulatory sequences, but specific molecular events associated with major morphological transitions in vertebrates have remained elusive. We identified snake-specific sequence changes within an otherwise highly conserved long-range limb enhancer of Sonic hedgehog (Shh). Transgenic mouse reporter assays revealed that the in vivo activity pattern of the enhancer is conserved across a wide range of vertebrates, including fish, but not in snakes. Genomic substitution of the mouse enhancer with its human or fish ortholog results in normal limb development. In contrast, replacement with snake orthologs caused severe limb reduction. Synthetic restoration of a single transcription factor binding site lost in the snake lineage reinstated full in vivo function to the snake enhancer. Our results demonstrate changes in a regulatory sequence associated with a major body plan transition and highlight the role of enhancers in morphological evolution. PAPERCLIP.
Collapse
|
41
|
Genome-wide compendium and functional assessment of in vivo heart enhancers. Nat Commun 2016; 7:12923. [PMID: 27703156 PMCID: PMC5059478 DOI: 10.1038/ncomms12923] [Citation(s) in RCA: 61] [Impact Index Per Article: 7.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2016] [Accepted: 08/16/2016] [Indexed: 12/04/2022] Open
Abstract
Whole-genome sequencing is identifying growing numbers of non-coding variants in human disease studies, but the lack of accurate functional annotations prevents their interpretation. We describe the genome-wide landscape of distant-acting enhancers active in the developing and adult human heart, an organ whose impairment is a predominant cause of mortality and morbidity. Using integrative analysis of >35 epigenomic data sets from mouse and human pre- and postnatal hearts we created a comprehensive reference of >80,000 putative human heart enhancers. To illustrate the importance of enhancers in the regulation of genes involved in heart disease, we deleted the mouse orthologs of two human enhancers near cardiac myosin genes. In both cases, we observe in vivo expression changes and cardiac phenotypes consistent with human heart disease. Our study provides a comprehensive catalogue of human heart enhancers for use in clinical whole-genome sequencing studies and highlights the importance of enhancers for cardiac function. Identification of non-coding variants has outstripped our ability to annotate and interpret them. Dickel et al. present a compendium of over 80,000 putative human heart enhancers and demonstrate that two conserved enhancers are required for proper cardiac function in mice.
Collapse
|
42
|
Enhancer Variants Synergistically Drive Dysfunction of a Gene Regulatory Network In Hirschsprung Disease. Cell 2016; 167:355-368.e10. [PMID: 27693352 DOI: 10.1016/j.cell.2016.09.005] [Citation(s) in RCA: 89] [Impact Index Per Article: 11.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2016] [Revised: 08/23/2016] [Accepted: 09/02/2016] [Indexed: 12/11/2022]
Abstract
Common sequence variants in cis-regulatory elements (CREs) are suspected etiological causes of complex disorders. We previously identified an intronic enhancer variant in the RET gene disrupting SOX10 binding and increasing Hirschsprung disease (HSCR) risk 4-fold. We now show that two other functionally independent CRE variants, one binding Gata2 and the other binding Rarb, also reduce Ret expression and increase risk 2- and 1.7-fold. By studying human and mouse fetal gut tissues and cell lines, we demonstrate that reduced RET expression propagates throughout its gene regulatory network, exerting effects on both its positive and negative feedback components. We also provide evidence that the presence of a combination of CRE variants synergistically reduces RET expression and its effects throughout the GRN. These studies show how the effects of functionally independent non-coding variants in a coordinated gene regulatory network amplify their individually small effects, providing a model for complex disorders.
Collapse
|
43
|
Abstract
Familial Advanced Sleep Phase (FASP) is a heritable human sleep phenotype characterized by very early sleep and wake times. We identified a missense mutation in the human Cryptochrome 2 (CRY2) gene that co-segregates with FASP in one family. The mutation leads to replacement of an alanine residue at position 260 with a threonine (A260T). In mice, the CRY2 mutation causes a shortened circadian period and reduced phase-shift to early-night light pulse associated with phase-advanced behavioral rhythms in the light-dark cycle. The A260T mutation is located in the phosphate loop of the flavin adenine dinucleotide (FAD) binding domain of CRY2. The mutation alters the conformation of CRY2, increasing its accessibility and affinity for FBXL3 (an E3 ubiquitin ligase), thus promoting its degradation. These results demonstrate that CRY2 stability controlled by FBXL3 plays a key role in the regulation of human sleep wake behavior. DOI:http://dx.doi.org/10.7554/eLife.16695.001 Sleep is an essential process in animals. In humans, the disturbance of normal sleep-wake cycles through shift-work or long-term sleep disorders increases the risk of developing conditions including mental illness, cancer and metabolic syndromes. Understanding how sleep-wake behavior is controlled within cells may help researchers to develop effective therapies to reduce the ill effects of disturbed sleep-wakLouise cycles on health. To understand how our sleep-wake cycles are regulated in cells, researchers have been looking for genetic mutations that affect human sleep schedules. For example, some people have a ‘morning lark’ schedule that makes them prone to go to sleep early and rise early the next day. Others are prone to be ‘night owls’, staying up later at night and waking up later in the morning. By studying the mutations that underlie these behaviors, researchers hope to understand precisely how these genes regulate sleep schedules. Now, Hirano et al. have identified a particular mutation in a gene called Cryptochrome 2 (CRY2) that causes people to have shorter sleep-wake cycles so that they wake up very early in the morning and struggle to stay awake in the evening. For the experiments, mice were genetically engineered to carry the mutant human CRY2 gene, which shortened the sleep-wake cycles of the mice and their responses to light so that they both woke up earlier and went to sleep earlier. Further experiments examined what effect the mutation has on the protein that is produced by CRY2. The mutation changes the shape of the protein, which allows an enzyme called FBXL3 to bind to the mutant protein more easily and rapidly break it down. The length of sleep cycles may be determined by how long it takes FBXL3 to break down the protein produced by CRY2. The findings of Hirano et al. may help researchers to develop treatments for people with sleep problems. DOI:http://dx.doi.org/10.7554/eLife.16695.002
Collapse
|
44
|
Genetic dissection of the α-globin super-enhancer in vivo. Nat Genet 2016; 48:895-903. [PMID: 27376235 PMCID: PMC5058437 DOI: 10.1038/ng.3605] [Citation(s) in RCA: 237] [Impact Index Per Article: 29.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2016] [Accepted: 06/01/2016] [Indexed: 12/18/2022]
Abstract
Many genes determining cell identity are regulated by clusters of Mediator-bound enhancer elements collectively referred to as super-enhancers. These super-enhancers have been proposed to manifest higher-order properties important in development and disease. Here we report a comprehensive functional dissection of one of the strongest putative super-enhancers in erythroid cells. By generating a series of mouse models, deleting each of the five regulatory elements of the α-globin super-enhancer individually and in informative combinations, we demonstrate that each constituent enhancer seems to act independently and in an additive fashion with respect to hematological phenotype, gene expression, chromatin structure and chromosome conformation, without clear evidence of synergistic or higher-order effects. Our study highlights the importance of functional genetic analyses for the identification of new concepts in transcriptional regulation.
Collapse
|
45
|
Brg1 coordinates multiple processes during retinogenesis and is a tumor suppressor in retinoblastoma. Development 2016; 142:4092-106. [PMID: 26628093 PMCID: PMC4712833 DOI: 10.1242/dev.124800] [Citation(s) in RCA: 24] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Abstract
Retinal development requires precise temporal and spatial coordination of cell cycle exit, cell fate specification, cell migration and differentiation. When this process is disrupted, retinoblastoma, a developmental tumor of the retina, can form. Epigenetic modulators are central to precisely coordinating developmental events, and many epigenetic processes have been implicated in cancer. Studying epigenetic mechanisms in development is challenging because they often regulate multiple cellular processes; therefore, elucidating the primary molecular mechanisms involved can be difficult. Here we explore the role of Brg1 (Smarca4) in retinal development and retinoblastoma in mice using molecular and cellular approaches. Brg1 was found to regulate retinal size by controlling cell cycle length, cell cycle exit and cell survival during development. Brg1 was not required for cell fate specification but was required for photoreceptor differentiation and cell adhesion/polarity programs that contribute to proper retinal lamination during development. The combination of defective cell differentiation and lamination led to retinal degeneration in Brg1-deficient retinae. Despite the hypocellularity, premature cell cycle exit, increased cell death and extended cell cycle length, retinal progenitor cells persisted in Brg1-deficient retinae, making them more susceptible to retinoblastoma. ChIP-Seq analysis suggests that Brg1 might regulate gene expression through multiple mechanisms. Summary: The SWI/SNF protein Brg1 controls cell cycle length, cell cycle exit and cell survival, and is required for cell differentiation and retinal lamination, in the developing mouse retina.
Collapse
|
46
|
Abstract
DNA methylation acts in concert with restriction enzymes to protect the integrity of prokaryotic genomes. Studies in a limited number of organisms suggest that methylation also contributes to prokaryotic genome regulation, but the prevalence and properties of such non-restriction-associated methylation systems remain poorly understood. Here, we used single molecule, real-time sequencing to map DNA modifications including m6A, m4C, and m5C across the genomes of 230 diverse bacterial and archaeal species. We observed DNA methylation in nearly all (93%) organisms examined, and identified a total of 834 distinct reproducibly methylated motifs. This data enabled annotation of the DNA binding specificities of 620 DNA Methyltransferases (MTases), doubling known specificities for previously hard to study Type I, IIG and III MTases, and revealing their extraordinary diversity. Strikingly, 48% of organisms harbor active Type II MTases with no apparent cognate restriction enzyme. These active ‘orphan’ MTases are present in diverse bacterial and archaeal phyla and show motif specificities and methylation patterns consistent with functions in gene regulation and DNA replication. Our results reveal the pervasive presence of DNA methylation throughout the prokaryotic kingdoms, as well as the diversity of sequence specificities and potential functions of DNA methylation systems. DNA methylation is a chemical modification of DNA present in many prokaryotic genomes. The best-known role of DNA methylation is as a component of restriction-modification systems. In these systems, restriction enzymes target foreign DNA for cleavage, while DNA methylation protects the host genome from destruction. Studies in a handful of organisms show that DNA methylation may also act independently of restriction systems and function in genome regulation. However, a lack of technologies has limited the study of DNA methylation to a small number of organisms, and the broader patterns and functions of DNA methylation remain unknown. Here we use SMRT-sequencing to determine the genome wide DNA methylation patterns of more than 200 diverse bacteria and archaea. We show that DNA methylation is pervasive and present in more than 90% of studied organisms. Analysis of this data enabled annotation of the specific DNA binding sites of more than 600 restriction systems, revealing their extraordinary diversity. Strikingly, we observed widespread DNA methylation in the absence of restriction systems. Analyses of these patterns reveal that they are conserved through evolution, and likely function in genome regulation. Thus DNA methylation may play a far wider function in prokaryotic genome biology than was previously supposed.
Collapse
|
47
|
Activating Mutations Affecting the Dbl Homology Domain of SOS2 Cause Noonan Syndrome. Hum Mutat 2015; 36:1080-7. [PMID: 26173643 DOI: 10.1002/humu.22834] [Citation(s) in RCA: 60] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2015] [Accepted: 06/30/2015] [Indexed: 12/24/2022]
Abstract
The RASopathies constitute a family of autosomal-dominant disorders whose major features include facial dysmorphism, cardiac defects, reduced postnatal growth, variable cognitive deficits, ectodermal and skeletal anomalies, and susceptibility to certain malignancies. Noonan syndrome (NS), the commonest RASopathy, is genetically heterogeneous and caused by functional dysregulation of signal transducers and regulatory proteins with roles in the RAS/extracellular signal-regulated kinase (ERK) signal transduction pathway. Mutations in known disease genes account for approximately 80% of affected individuals. Here, we report that missense mutations altering Son of Sevenless, Drosophila, homolog 2 (SOS2), which encodes a RAS guanine nucleotide exchange factor, occur in a small percentage of subjects with NS. Four missense mutations were identified in five unrelated sporadic cases and families transmitting NS. Disease-causing mutations affected three conserved residues located in the Dbl homology (DH) domain, of which two are directly involved in the intramolecular binding network maintaining SOS2 in its autoinhibited conformation. All mutations were found to promote enhanced signaling from RAS to ERK. Similar to NS-causing SOS1 mutations, the phenotype associated with SOS2 defects is characterized by normal development and growth, as well as marked ectodermal involvement. Unlike SOS1 mutations, however, those in SOS2 are restricted to the DH domain.
Collapse
|
48
|
A large genomic deletion leads to enhancer adoption by the lamin B1 gene: a second path to autosomal dominant adult-onset demyelinating leukodystrophy (ADLD). Hum Mol Genet 2015; 24:3143-54. [PMID: 25701871 PMCID: PMC4424952 DOI: 10.1093/hmg/ddv065] [Citation(s) in RCA: 97] [Impact Index Per Article: 10.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2014] [Accepted: 02/13/2015] [Indexed: 01/23/2023] Open
Abstract
Chromosomal rearrangements with duplication of the lamin B1 (LMNB1) gene underlie autosomal dominant adult-onset demyelinating leukodystrophy (ADLD), a rare neurological disorder in which overexpression of LMNB1 causes progressive central nervous system demyelination. However, we previously reported an ADLD family (ADLD-1-TO) without evidence of duplication or other mutation in LMNB1 despite linkage to the LMNB1 locus and lamin B1 overexpression. By custom array-CGH, we further investigated this family and report here that patients carry a large (∼660 kb) heterozygous deletion that begins 66 kb upstream of the LMNB1 promoter. Lamin B1 overexpression was confirmed in further ADLD-1-TO tissues and in a postmortem brain sample, where lamin B1 was increased in the frontal lobe. Through parallel studies, we investigated both loss of genetic material and chromosomal rearrangement as possible causes of LMNB1 overexpression, and found that ADLD-1-TO plausibly results from an enhancer adoption mechanism. The deletion eliminates a genome topological domain boundary, allowing normally forbidden interactions between at least three forebrain-directed enhancers and the LMNB1 promoter, in line with the observed mainly cerebral localization of lamin B1 overexpression and myelin degeneration. This second route to LMNB1 overexpression and ADLD is a new example of the relevance of regulatory landscape modifications in determining Mendelian phenotypes.
Collapse
|
49
|
Occupancy by key transcription factors is a more accurate predictor of enhancer activity than histone modifications or chromatin accessibility. Epigenetics Chromatin 2015; 8:16. [PMID: 25984238 PMCID: PMC4432502 DOI: 10.1186/s13072-015-0009-5] [Citation(s) in RCA: 89] [Impact Index Per Article: 9.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2015] [Accepted: 04/02/2015] [Indexed: 12/12/2022] Open
Abstract
Background Regulated gene expression controls organismal development, and variation in regulatory patterns has been implicated in complex traits. Thus accurate prediction of enhancers is important for further understanding of these processes. Genome-wide measurement of epigenetic features, such as histone modifications and occupancy by transcription factors, is improving enhancer predictions, but the contribution of these features to prediction accuracy is not known. Given the importance of the hematopoietic transcription factor TAL1 for erythroid gene activation, we predicted candidate enhancers based on genomic occupancy by TAL1 and measured their activity. Contributions of multiple features to enhancer prediction were evaluated based on the results of these and other studies. Results TAL1-bound DNA segments were active enhancers at a high rate both in transient transfections of cultured cells (39 of 79, or 56%) and transgenic mice (43 of 66, or 65%). The level of binding signal for TAL1 or GATA1 did not help distinguish TAL1-bound DNA segments as active versus inactive enhancers, nor did the density of regulation-related histone modifications. A meta-analysis of results from this and other studies (273 tested predicted enhancers) showed that the presence of TAL1, GATA1, EP300, SMAD1, H3K4 methylation, H3K27ac, and CAGE tags at DNase hypersensitive sites gave the most accurate predictors of enhancer activity, with a success rate over 80% and a median threefold increase in activity. Chromatin accessibility assays and the histone modifications H3K4me1 and H3K27ac were sensitive for finding enhancers, but they have high false positive rates unless transcription factor occupancy is also included. Conclusions Occupancy by key transcription factors such as TAL1, GATA1, SMAD1, and EP300, along with evidence of transcription, improves the accuracy of enhancer predictions based on epigenetic features. Electronic supplementary material The online version of this article (doi:10.1186/s13072-015-0009-5) contains supplementary material, which is available to authorized users.
Collapse
|
50
|
Identification of novel craniofacial regulatory domains located far upstream of SOX9 and disrupted in Pierre Robin sequence. Hum Mutat 2015; 35:1011-20. [PMID: 24934569 DOI: 10.1002/humu.22606] [Citation(s) in RCA: 58] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2014] [Accepted: 05/12/2014] [Indexed: 01/08/2023]
Abstract
Mutations in the coding sequence of SOX9 cause campomelic dysplasia (CD), a disorder of skeletal development associated with 46,XY disorders of sex development (DSDs). Translocations, deletions, and duplications within a ∼2 Mb region upstream of SOX9 can recapitulate the CD-DSD phenotype fully or partially, suggesting the existence of an unusually large cis-regulatory control region. Pierre Robin sequence (PRS) is a craniofacial disorder that is frequently an endophenotype of CD and a locus for isolated PRS at ∼1.2-1.5 Mb upstream of SOX9 has been previously reported. The craniofacial regulatory potential within this locus, and within the greater genomic domain surrounding SOX9, remains poorly defined. We report two novel deletions upstream of SOX9 in families with PRS, allowing refinement of the regions harboring candidate craniofacial regulatory elements. In parallel, ChIP-Seq for p300 binding sites in mouse craniofacial tissue led to the identification of several novel craniofacial enhancers at the SOX9 locus, which were validated in transgenic reporter mice and zebrafish. Notably, some of the functionally validated elements fall within the PRS deletions. These studies suggest that multiple noncoding elements contribute to the craniofacial regulation of SOX9 expression, and that their disruption results in PRS.
Collapse
|