1
|
Fleck K, Luria V, Garag N, Karger A, Hunter T, Marten D, Phu W, Nam KM, Sestan N, O’Donnell-Luria AH, Erceg J. Functional associations of evolutionarily recent human genes exhibit sensitivity to the 3D genome landscape and disease. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.03.17.585403. [PMID: 38559085 PMCID: PMC10980080 DOI: 10.1101/2024.03.17.585403] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 04/04/2024]
Abstract
Genome organization is intricately tied to regulating genes and associated cell fate decisions. In this study, we examine the positioning and functional significance of human genes, grouped by their evolutionary age, within the 3D organization of the genome. We reveal that genes of different evolutionary origin have distinct positioning relationships with both domains and loop anchors, and remarkably consistent relationships with boundaries across cell types. While the functional associations of each group of genes are primarily cell type-specific, such associations of conserved genes maintain greater stability across 3D genomic features and disease than recently evolved genes. Furthermore, the expression of these genes across various tissues follows an evolutionary progression, such that RNA levels increase from young genes to ancient genes. Thus, the distinct relationships of gene evolutionary age, function, and positioning within 3D genomic features contribute to tissue-specific gene regulation in development and disease.
Collapse
Affiliation(s)
- Katherine Fleck
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, CT 06269
- Institute for Systems Genomics, University of Connecticut, Storrs, CT 06269
| | - Victor Luria
- Department of Neuroscience, Yale School of Medicine, New Haven, CT 06510
- Division of Genetics and Genomics, Boston Children’s Hospital, Boston, MA 02115
- Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, Cambridge, MA 02142
- Department of Systems Biology, Harvard Medical School, Boston, MA 02115
| | - Nitanta Garag
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, CT 06269
| | - Amir Karger
- IT-Research Computing, Harvard Medical School, Boston, MA 02115
| | - Trevor Hunter
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, CT 06269
| | - Daniel Marten
- Division of Genetics and Genomics, Boston Children’s Hospital, Boston, MA 02115
- Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, Cambridge, MA 02142
| | - William Phu
- Division of Genetics and Genomics, Boston Children’s Hospital, Boston, MA 02115
- Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, Cambridge, MA 02142
| | - Kee-Myoung Nam
- Department of Molecular, Cellular and Developmental Biology, Yale University, New Haven, CT 06510
| | - Nenad Sestan
- Department of Neuroscience, Yale School of Medicine, New Haven, CT 06510
| | - Anne H. O’Donnell-Luria
- Division of Genetics and Genomics, Boston Children’s Hospital, Boston, MA 02115
- Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, Cambridge, MA 02142
- Department of Pediatrics, Harvard Medical School, Boston, MA 02115
| | - Jelena Erceg
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, CT 06269
- Institute for Systems Genomics, University of Connecticut, Storrs, CT 06269
- Department of Genetics and Genome Sciences, University of Connecticut Health Center, Farmington, CT 06030
| |
Collapse
|
2
|
Khullar S, Wang D. Predicting gene regulatory networks from multi-omics to link genetic risk variants and neuroimmunology to Alzheimer's disease phenotypes. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2021. [PMID: 34189529 DOI: 10.1101/2021.06.21.449165] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
BACKGROUND Genome-wide association studies have found many genetic risk variants associated with Alzheimer's disease (AD). However, how these risk variants affect deeper phenotypes such as disease progression and immune response remains elusive. Also, our understanding of cellular and molecular mechanisms from disease risk variants to various phenotypes is still limited. To address these problems, we performed an integrative multi-omics analysis of genotype, transcriptomics, and epigenomics for revealing gene regulatory mechanisms from disease variants to AD phenotypes. METHOD First, given the population gene expression data of a cohort, we construct and cluster its gene co-expression network to identify gene co-expression modules for various AD phenotypes. Next, we predict transcription factors (TFs) regulating co-expressed genes and AD risk SNPs that interrupt TF binding sites on regulatory elements. Finally, we construct a gene regulatory network (GRN) linking SNPs, interrupted TFs, and regulatory elements to target genes and gene modules for each phenotype in the cohort. This network thus provides systematic insights into gene regulatory mechanisms from risk variants to AD phenotypes. RESULTS Our analysis predicted GRNs in three major AD-relevant regions: Hippocampus, Dorsolateral Prefrontal Cortex (DLPFC), Lateral Temporal Lobe (LTL). Comparative analyses revealed cross-region-conserved and region-specific GRNs, in which many immunological genes are present. For instance, SNPs rs13404184 and rs61068452 disrupt SPI1 binding and regulation of AD gene INPP5D in the Hippocampus and LTL. However, SNP rs117863556 interrupts bindings of REST to regulate GAB2 in DLPFC only. Driven by emerging neuroinflammation in AD, we used Covid-19 as a proxy to identify possible regulatory mechanisms for neuroimmunology in AD. To this end, we looked at the GRN subnetworks relating to genes from shared AD-Covid pathways. From those subnetworks, our machine learning analysis prioritized the AD-Covid genes for predicting Covid-19 severity. Decision Curve Analysis also validated our AD-Covid genes outperform known Covid-19 genes for classifying severe Covid-19 patients. This suggests AD-Covid genes along with linked SNPs can be potential novel biomarkers for neuroimmunology in AD. Finally, our results are open-source available as a comprehensive functional genomic map for AD, providing a deeper mechanistic understanding of the interplay among multi-omics, brain regions, gene functions like neuroimmunology, and phenotypes.
Collapse
|
3
|
McCole RB, Erceg J, Saylor W, Wu CT. Ultraconserved Elements Occupy Specific Arenas of Three-Dimensional Mammalian Genome Organization. Cell Rep 2019; 24:479-488. [PMID: 29996107 DOI: 10.1016/j.celrep.2018.06.031] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2017] [Revised: 05/09/2018] [Accepted: 06/07/2018] [Indexed: 12/23/2022] Open
Abstract
This study explores the relationship between three-dimensional genome organization and ultraconserved elements (UCEs), an enigmatic set of DNA elements that are perfectly conserved between the reference genomes of distantly related species. Examining both human and mouse genomes, we interrogate the relationship of UCEs to three features of chromosome organization derived from Hi-C studies. We find that UCEs are enriched within contact domains and, further, that the subset of UCEs within domains shared across diverse cell types are linked to kidney-related and neuronal processes. In boundaries, UCEs are generally depleted, with those that do overlap boundaries being overrepresented in exonic UCEs. Regarding loop anchors, UCEs are neither overrepresented nor underrepresented, but those present in loop anchors are enriched for splice sites. Finally, as the relationships between UCEs and human Hi-C features are conserved in mouse, our findings suggest that UCEs contribute to interspecies conservation of genome organization and, thus, genome stability.
Collapse
Affiliation(s)
- Ruth B McCole
- Department of Genetics, Harvard Medical School, Boston, MA 02115, USA
| | - Jelena Erceg
- Department of Genetics, Harvard Medical School, Boston, MA 02115, USA
| | - Wren Saylor
- Department of Genetics, Harvard Medical School, Boston, MA 02115, USA
| | - Chao-Ting Wu
- Department of Genetics, Harvard Medical School, Boston, MA 02115, USA.
| |
Collapse
|
4
|
Rödelsperger C, Menden K, Serobyan V, Witte H, Baskaran P. First insights into the nature and evolution of antisense transcription in nematodes. BMC Evol Biol 2016; 16:165. [PMID: 27549405 PMCID: PMC4994411 DOI: 10.1186/s12862-016-0740-y] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2016] [Accepted: 08/11/2016] [Indexed: 02/08/2023] Open
Abstract
BACKGROUND The development of multicellular organisms is coordinated by various gene regulatory mechanisms that ensure correct spatio-temporal patterns of gene expression. Recently, the role of antisense transcription in gene regulation has moved into focus of research. To characterize genome-wide patterns of antisense transcription and to study their evolutionary conservation, we sequenced a strand-specific RNA-seq library of the nematode Pristionchus pacificus. RESULTS We identified 1112 antisense configurations of which the largest group represents 465 antisense transcripts (ASTs) that are fully embedded in introns of their host genes. We find that most ASTs show homology to protein-coding genes and are overrepresented in proteomic data. Together with the finding, that expression levels of ASTs and host genes are uncorrelated, this indicates that most ASTs in P. pacificus do not represent non-coding RNAs and do not exhibit regulatory functions on their host genes. We studied the evolution of antisense gene pairs across 20 nematode genomes, showing that the majority of pairs is lineage-specific and even the highly conserved vps-4, ddx-27, and sel-2 loci show abundant structural changes including duplications, deletions, intron gains and loss of antisense transcription. In contrast, host genes in general, are remarkably conserved and encode exceptionally long introns leading to unusually large blocks of conserved synteny. CONCLUSIONS Our study has shown that in P. pacificus antisense transcription as such does not define non-coding RNAs but is rather a feature of highly conserved genes with long introns. We hypothesize that the presence of regulatory elements imposes evolutionary constraint on the intron length, but simultaneously, their large size makes them a likely target for translocation of genomic elements including protein-coding genes that eventually end up as ASTs.
Collapse
Affiliation(s)
- Christian Rödelsperger
- Department for Evolutionary Biology, Max Planck Institute for Developmental Biology, Spemannstr. 35, Tübingen, 72076, Germany.
| | - Kevin Menden
- Department for Evolutionary Biology, Max Planck Institute for Developmental Biology, Spemannstr. 35, Tübingen, 72076, Germany.,Eberhard Karls University, Tübingen, Germany
| | - Vahan Serobyan
- Department for Evolutionary Biology, Max Planck Institute for Developmental Biology, Spemannstr. 35, Tübingen, 72076, Germany
| | - Hanh Witte
- Department for Evolutionary Biology, Max Planck Institute for Developmental Biology, Spemannstr. 35, Tübingen, 72076, Germany
| | - Praveen Baskaran
- Department for Evolutionary Biology, Max Planck Institute for Developmental Biology, Spemannstr. 35, Tübingen, 72076, Germany
| |
Collapse
|
5
|
Guo G, Rödelsperger C, Digweed M, Robinson PN. Regulation of fibrillin-1 gene expression by Sp1. Gene 2013; 527:448-55. [PMID: 23860323 DOI: 10.1016/j.gene.2013.07.006] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2013] [Revised: 07/01/2013] [Accepted: 07/05/2013] [Indexed: 11/28/2022]
Abstract
Mutations in the fibrillin-1 gene (FBN1) cause Marfan Syndrome (MFS), a hereditary disorder of connective tissue. The transcription of FBN1 has been reported to be driven by a short ultraconserved region (SUPR) in the 5' untranslated exon A of FBN1, but the nature of other factors involved in FBN1 gene regulation has not been clarified. In this study, we characterized the transcription factors involved in FBN1 gene regulation. The results show that Sp1 protein binds to two putative binding sites in the promoter of FBN1. Overexpression of Sp1 resulted in a significant increase in both promoter activity and FBN1 mRNA level in HEK 293 cells, whereas inhibition or knockdown of Sp1 decreased FBN1 gene expression. In addition, we found that Poly [ADP-ribose] polymerase 1 (PARP1) binds to the palindromic sequence TCTCGCGAGA in the ultraconserved region of the FBN1 promoter and that the regulation of FBN1 expression by PARP1 is dependent on Sp1. These results indicate that both Sp1 and PARP1 contribute to FBN1 gene expression. These observations add to our understanding of the transcriptional regulation of FBN1 gene expression.
Collapse
Affiliation(s)
- Gao Guo
- Institute for Medical Genetics and Human Genetics, Charité-Universitätsmedizin Berlin, Augustenburger Platz 1, 13353 Berlin, Germany
| | | | | | | |
Collapse
|
6
|
Ryu T, Seridi L, Ravasi T. The evolution of ultraconserved elements with different phylogenetic origins. BMC Evol Biol 2012; 12:236. [PMID: 23217155 PMCID: PMC3556307 DOI: 10.1186/1471-2148-12-236] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2012] [Accepted: 11/09/2012] [Indexed: 11/10/2022] Open
Abstract
Background Ultraconserved elements of DNA have been identified in vertebrate and invertebrate genomes. These elements have been found to have diverse functions, including enhancer activities in developmental processes. The evolutionary origins and functional roles of these elements in cellular systems, however, have not yet been determined. Results Here, we identified a wide range of ultraconserved elements common to distant species, from primitive aquatic organisms to terrestrial species with complicated body systems, including some novel elements conserved in fruit fly and human. In addition to a well-known association with developmental genes, these DNA elements have a strong association with genes implicated in essential cell functions, such as epigenetic regulation, apoptosis, detoxification, innate immunity, and sensory reception. Interestingly, we observed that ultraconserved elements clustered by sequence similarity. Furthermore, species composition and flanking genes of clusters showed lineage-specific patterns. Ultraconserved elements are highly enriched with binding sites to developmental transcription factors regardless of how they cluster. Conclusion We identified large numbers of ultraconserved elements across distant species. Specific classes of these conserved elements seem to have been generated before the divergence of taxa and fixed during the process of evolution. Our findings indicate that these ultraconserved elements are not the exclusive property of higher modern eukaryotes, but rather transmitted from their metazoan ancestors.
Collapse
Affiliation(s)
- Taewoo Ryu
- Integrative Systems Biology Lab, Division of Biological and Environmental Sciences & Engineering, King Abdullah University of Science and Technology, Thuwal, 23955-6900, Kingdom of Saudi Arabia.
| | | | | |
Collapse
|
7
|
Rödelsperger C, Guo G, Kolanczyk M, Pletschacher A, Köhler S, Bauer S, Schulz MH, Robinson PN. Integrative analysis of genomic, functional and protein interaction data predicts long-range enhancer-target gene interactions. Nucleic Acids Res 2010; 39:2492-502. [PMID: 21109530 PMCID: PMC3074119 DOI: 10.1093/nar/gkq1081] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open
Abstract
Multicellular organismal development is controlled by a complex network of transcription factors, promoters and enhancers. Although reliable computational and experimental methods exist for enhancer detection, prediction of their target genes remains a major challenge. On the basis of available literature and ChIP-seq and ChIP-chip data for enhanceosome factor p300 and the transcriptional regulator Gli3, we found that genomic proximity and conserved synteny predict target genes with a relatively low recall of 12–27% within 2 Mb intervals centered at the enhancers. Here, we show that functional similarities between enhancer binding proteins and their transcriptional targets and proximity in the protein–protein interactome improve prediction of target genes. We used all four features to train random forest classifiers that predict target genes with a recall of 58% in 2 Mb intervals that may contain dozens of genes, representing a better than two-fold improvement over the performance of prediction based on single features alone. Genome-wide ChIP data is still relatively poorly understood, and it remains difficult to assign biological significance to binding events. Our study represents a first step in integrating various genomic features in order to elucidate the genomic network of long-range regulatory interactions.
Collapse
Affiliation(s)
- Christian Rödelsperger
- Berlin-Brandenburg Center for Regenerative Therapies, Charité-Universitätsmedizin Berlin, Max Planck Institute for Molecular Genetics, Berlin, Germany
| | | | | | | | | | | | | | | |
Collapse
|
8
|
Yang MQ, Laflamme K, Gotea V, Joiner CH, Seidel NE, Wong C, Petrykowska HM, Lichtenberg J, Lee S, Welch L, Gallagher PG, Bodine DM, Elnitski L. Genome-wide detection of a TFIID localization element from an initial human disease mutation. Nucleic Acids Res 2010; 39:2175-87. [PMID: 21071415 PMCID: PMC3064768 DOI: 10.1093/nar/gkq1035] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open
Abstract
Eukaryotic core promoters are often characterized by the presence of consensus motifs such as the TATA box or initiator elements, which attract and direct the transcriptional machinery to the transcription start site. However, many human promoters have none of the known core promoter motifs, suggesting that undiscovered promoter motifs exist in the genome. We previously identified a mutation in the human Ankyrin-1 (ANK-1) promoter that causes the disease ankyrin-deficient Hereditary Spherocytosis (HS). Although the ANK-1 promoter is CpG rich, no discernable basal promoter elements had been identified. We showed that the HS mutation disrupted the binding of the transcription factor TFIID, the major component of the pre-initiation complex. We hypothesized that the mutation identified a candidate promoter element with a more widespread role in gene regulation. We examined 17,181 human promoters for the experimentally validated binding site, called the TFIID localization sequence (DLS) and found three times as many promoters containing DLS than TATA motifs. Mutational analyses of DLS sequences confirmed their functional significance, as did the addition of a DLS site to a minimal Sp1 promoter. Our results demonstrate that novel promoter elements can be identified on a genome-wide scale through observations of regulatory disruptions that cause human disease.
Collapse
Affiliation(s)
- Mary Q Yang
- Genome Technology Branch, National Human Genome Research Institute, National Institutes of Health, Rockville, MD 20852, USA
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|