1
|
Wallace AD, Wendt GA, Barcellos LF, de Smith AJ, Walsh KM, Metayer C, Costello JF, Wiemels JL, Francis SS. To ERV Is Human: A Phenotype-Wide Scan Linking Polymorphic Human Endogenous Retrovirus-K Insertions to Complex Phenotypes. Front Genet 2018; 9:298. [PMID: 30154825 PMCID: PMC6102640 DOI: 10.3389/fgene.2018.00298] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2018] [Accepted: 07/16/2018] [Indexed: 12/13/2022] Open
Abstract
Approximately 8% of the human genome is comprised of endogenous retroviral insertions (ERVs) originating from historic retroviral integration into germ cells. The function of ERVs as regulators of gene expression is well established. Less well studied are insertional polymorphisms of ERVs and their contribution to the heritability of complex phenotypes. The most recent integration of ERV, HERV-K, is expressed in a range of complex human conditions from cancer to neurologic diseases. Using an in-house computational pipeline and whole-genome sequencing data from the diverse 1,000 Genomes Phase 3 population (n = 2,504), we identified 46 polymorphic HERV-K insertions that are tagged by adjacent single nucleotide polymorphisms (SNPs). To test the potential role of polymorphic HERV-K in the heritability of complex diseases, existing databases were queried for enrichment of established relationships between the HERV-K insertion-associated SNPs (hiSNPs), and tissue specific gene expression and disease phenotypes. Overall, hiSNPs for the 46 polymorphic HERV-K sites were statistically enriched (p < 1.0E-16) for eQTLs across 44 human tissues. Fifteen of the 46 HERV-K insertions had hiSNPs annotated in the EMBL-EBI GWAS Catalog and cumulatively associated with >100 phenotypes. Experimental factor ontology enrichment analysis suggests that polymorphic HERV-K specifically contribute to neurologic and immunologic disease phenotypes, including traits related to intra cranial volume (FDR 2.00E-09), Parkinson's disease (FDR 1.80E-09), and autoimmune diseases (FDR 1.80E-09). These results provide strong candidates for context-specific study of polymorphic HERV-K insertions in disease-related traits, serving as a roadmap for future studies of the heritability of complex disease.
Collapse
Affiliation(s)
- Amelia D Wallace
- Division of Epidemiology, School of Public Health, University of California, Berkeley, Berkeley, CA, United States
| | - George A Wendt
- Division of Epidemiology, School of Community Health Sciences, University of Nevada, Reno, NV, United States
| | - Lisa F Barcellos
- Division of Epidemiology, School of Public Health, University of California, Berkeley, Berkeley, CA, United States
| | - Adam J de Smith
- Department of Epidemiology and Biostatistics, Helen Diller Comprehensive Cancer Center, University of California, San Francisco, San Francisco, CA, United States
| | - Kyle M Walsh
- Department of Neurosurgery, Duke University, Durham, NC, United States
| | - Catherine Metayer
- Division of Epidemiology, School of Public Health, University of California, Berkeley, Berkeley, CA, United States
| | - Joseph F Costello
- Department of Neurosurgery, Helen Diller Comprehensive Cancer Center, University of California, San Francisco, San Francisco, CA, United States
| | - Joseph L Wiemels
- Department of Epidemiology and Biostatistics, Helen Diller Comprehensive Cancer Center, University of California, San Francisco, San Francisco, CA, United States.,Department of Neurosurgery, Helen Diller Comprehensive Cancer Center, University of California, San Francisco, San Francisco, CA, United States
| | - Stephen S Francis
- Division of Epidemiology, School of Community Health Sciences, University of Nevada, Reno, NV, United States.,Department of Epidemiology and Biostatistics, Helen Diller Comprehensive Cancer Center, University of California, San Francisco, San Francisco, CA, United States
| |
Collapse
|
2
|
Hu T, Zhu X, Pi W, Yu M, Shi H, Tuan D. Hypermethylated LTR retrotransposon exhibits enhancer activity. Epigenetics 2017; 12:226-237. [PMID: 28165867 DOI: 10.1080/15592294.2017.1289300] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022] Open
Abstract
LTR retrotransposons are repetitive DNA elements comprising ∼10% of the human genome. They are silenced by hypermethylation of cytosines in CpG dinucleotides and are considered parasitic DNA serving no useful function for the host genome. However, hypermethylated LTRs contain enhancer and promoter sequences and can promote tissue-specific transcription of cis-linked genes. To resolve the apparent paradox of hypermethylated LTRs possessing transcriptional activities, we studied the ERV-9 LTR retrotransposon located at the 5' border of the transcriptionally active β-globin gene locus in human erythroid progenitor and erythroleukemia K562 cells. We found that the ERV-9 LTR, containing 65 CpGs in 1.7 kb DNA, was hypermethylated (with > 90% methylated CpGs). Hypermethylated LTR possessed transcriptional enhancer activity, since in vivo deletion of the LTR by CRISPR-cas9 suppressed transcription of the globin genes by > 50%. ChIP-qPCR and ChIP-seq studies showed that the hypermethylated LTR enhancer spanning recurrent CCAATCG and GATA motifs associated respectively with key transcription factors (TFs) NF-Y and GATA-1 and -2 at reduced levels, compared with the unmethylated LTR in transfected LTR-reporter gene plasmids. Electrophoretic mobility shift assays with methylated LTR enhancer probe showed that the methylated probe bound both NF-Y and GATA-1 and -2 with lower affinities than the unmethylated enhancer probe. Thus, hypermethylation drastically reduced, but did not totally abolish, the binding affinities of the enhancer motifs to the key TFs to assemble the LTR-pol II transcription complex that activated transcription of cis-linked genes at reduced efficiency.
Collapse
Affiliation(s)
- Tianxiang Hu
- a Department of Biochemistry and Molecular Biology , Medical College of Georgia, Augusta University , Augusta , GA , USA
| | - Xingguo Zhu
- a Department of Biochemistry and Molecular Biology , Medical College of Georgia, Augusta University , Augusta , GA , USA
| | - Wenhu Pi
- a Department of Biochemistry and Molecular Biology , Medical College of Georgia, Augusta University , Augusta , GA , USA
| | - Miao Yu
- b Georgia Cancer Center , Medical College of Georgia, Augusta University , Augusta , GA , USA
| | - Huidong Shi
- b Georgia Cancer Center , Medical College of Georgia, Augusta University , Augusta , GA , USA
| | - Dorothy Tuan
- a Department of Biochemistry and Molecular Biology , Medical College of Georgia, Augusta University , Augusta , GA , USA
| |
Collapse
|
3
|
Garazha A, Ivanova A, Suntsova M, Malakhova G, Roumiantsev S, Zhavoronkov A, Buzdin A. New bioinformatic tool for quick identification of functionally relevant endogenous retroviral inserts in human genome. Cell Cycle 2016; 14:1476-84. [PMID: 25853282 PMCID: PMC4612461 DOI: 10.1080/15384101.2015.1022696] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/03/2023] Open
Abstract
Endogenous retroviruses (ERVs) and LTR retrotransposons (LRs) occupy ∼8% of human genome. Deep sequencing technologies provide clues to understanding of functional relevance of individual ERVs/LRs by enabling direct identification of transcription factor binding sites (TFBS) and other landmarks of functional genomic elements. Here, we performed the genome-wide identification of human ERVs/LRs containing TFBS according to the ENCODE project. We created the first interactive ERV/LRs database that groups the individual inserts according to their familial nomenclature, number of mapped TFBS and divergence from their consensus sequence. Information on any particular element can be easily extracted by the user. We also created a genome browser tool, which enables quick mapping of any ERV/LR insert according to genomic coordinates, known human genes and TFBS. These tools can be used to easily explore functionally relevant individual ERV/LRs, and for studying their impact on the regulation of human genes. Overall, we identified ∼110,000 ERV/LR genomic elements having TFBS. We propose a hypothesis of “domestication” of ERV/LR TFBS by the genome milieu including subsequent stages of initial epigenetic repression, partial functional release, and further mutation-driven reshaping of TFBS in tight coevolution with the enclosing genomic loci.
Collapse
Affiliation(s)
- Andrew Garazha
- a Group for Genomic Regulation of Cell Signaling Systems ; Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry ; Moscow , Russia
| | | | | | | | | | | | | |
Collapse
|
4
|
Suntsova M, Garazha A, Ivanova A, Kaminsky D, Zhavoronkov A, Buzdin A. Molecular functions of human endogenous retroviruses in health and disease. Cell Mol Life Sci 2015; 72:3653-75. [PMID: 26082181 PMCID: PMC11113533 DOI: 10.1007/s00018-015-1947-6] [Citation(s) in RCA: 63] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2015] [Revised: 05/29/2015] [Accepted: 06/03/2015] [Indexed: 12/13/2022]
Abstract
Human endogenous retroviruses (HERVs) and related genetic elements form 504 distinct families and occupy ~8% of human genome. Recent success of high-throughput experimental technologies facilitated understanding functional impact of HERVs for molecular machinery of human cells. HERVs encode active retroviral proteins, which may exert important physiological functions in the body, but also may be involved in the progression of cancer and numerous human autoimmune, neurological and infectious diseases. The spectrum of related malignancies includes, but not limits to, multiple sclerosis, psoriasis, lupus, schizophrenia, multiple cancer types and HIV. In addition, HERVs regulate expression of the neighboring host genes and modify genomic regulatory landscape, e.g., by providing regulatory modules like transcription factor binding sites (TFBS). Indeed, recent bioinformatic profiling identified ~110,000 regulatory active HERV elements, which formed at least ~320,000 human TFBS. These and other peculiarities of HERVs might have played an important role in human evolution and speciation. In this paper, we focus on the current progress in understanding of normal and pathological molecular niches of HERVs, on their implications in human evolution, normal physiology and disease. We also review the available databases dealing with various aspects of HERV genetics.
Collapse
Affiliation(s)
- Maria Suntsova
- Group for Genomic Regulation of Cell Signaling Systems, Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, Moscow, 117997, Russia.
- Laboratory of Bioinformatics, D. Rogachyov Federal Research Center of Pediatric Hematology, Oncology and Immunology, Moscow, 117198, Russia.
| | - Andrew Garazha
- Group for Genomic Regulation of Cell Signaling Systems, Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, Moscow, 117997, Russia.
- Laboratory of Bioinformatics, D. Rogachyov Federal Research Center of Pediatric Hematology, Oncology and Immunology, Moscow, 117198, Russia.
| | - Alena Ivanova
- Group for Genomic Regulation of Cell Signaling Systems, Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, Moscow, 117997, Russia.
- Pathway Pharmaceuticals, Wan Chai, Hong Kong, Hong Kong SAR.
| | - Dmitry Kaminsky
- Pathway Pharmaceuticals, Wan Chai, Hong Kong, Hong Kong SAR.
| | - Alex Zhavoronkov
- Pathway Pharmaceuticals, Wan Chai, Hong Kong, Hong Kong SAR.
- Department of Translational and Regenerative Medicine, Moscow Institute of Physics and Technology, 9 Institutskiy per., Dolgoprudny, Moscow, 141700, Russia.
| | - Anton Buzdin
- Group for Genomic Regulation of Cell Signaling Systems, Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, Moscow, 117997, Russia.
- Pathway Pharmaceuticals, Wan Chai, Hong Kong, Hong Kong SAR.
- National Research Centre "Kurchatov Institute", Centre for Convergence of Nano-, Bio-, Information and Cognitive Sciences and Technologies, 1, Akademika Kurchatova sq., Moscow, 123182, Russia.
| |
Collapse
|
5
|
St Laurent G, Shtokalo D, Dong B, Tackett MR, Fan X, Lazorthes S, Nicolas E, Sang N, Triche TJ, McCaffrey TA, Xiao W, Kapranov P. VlincRNAs controlled by retroviral elements are a hallmark of pluripotency and cancer. Genome Biol 2013; 14:R73. [PMID: 23876380 PMCID: PMC4053963 DOI: 10.1186/gb-2013-14-7-r73] [Citation(s) in RCA: 67] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2012] [Accepted: 07/22/2013] [Indexed: 12/20/2022] Open
Abstract
BACKGROUND The function of the non-coding portion of the human genome remains one of the most important questions of our time. Its vast complexity is exemplified by the recent identification of an unusual and notable component of the transcriptome - very long intergenic non-coding RNAs, termed vlincRNAs. RESULTS Here we identify 2,147 vlincRNAs covering 10 percent of our genome. We show they are present not only in cancerous cells, but also in primary cells and normal human tissues, and are controlled by canonical promoters. Furthermore, vlincRNA promoters frequently originate from within endogenous retroviral sequences. Strikingly, the number of vlincRNAs expressed from endogenous retroviral promoters strongly correlates with pluripotency or the degree of malignant transformation. These results suggest a previously unknown connection between the pluripotent state and cancer via retroviral repeat-driven expression of vlincRNAs. Finally, we show that vlincRNAs can be syntenically conserved in humans and mouse and their depletion using RNAi can cause apoptosis in cancerous cells. CONCLUSIONS These intriguing observations suggest that vlincRNAs could create a framework that combines many existing short ESTs and lincRNAs into a landscape of very long transcripts functioning in the regulation of gene expression in the nucleus. Certain types of vlincRNAs participate at specific stages of normal development and, based on analysis of a limited set of cancerous and primary cell lines, they appear to be co-opted by cancer-associated transcriptional programs. This provides additional understanding of transcriptome regulation during the malignant state, and could lead to additional targets and options for its reversal.
Collapse
Affiliation(s)
- Georges St Laurent
- St. Laurent Institute, One Kendall Square, Cambridge, MA
- Department of Molecular Biology, Cell Biology, and Biochemistry, Brown University, Providence, RI
| | - Dmitry Shtokalo
- St. Laurent Institute, One Kendall Square, Cambridge, MA
- A.P.Ershov Institute of Informatics Systems SB RAS, 6, Acad. Lavrentjev ave., Novosibirsk 630090, Russia
| | - Biao Dong
- Department of Microbiology and Immunology, Sol Sherry Thrombosis Research Center, Temple University, Philadelphia, PA
| | | | - Xiaoxuan Fan
- Department of Microbiology and Immunology, Sol Sherry Thrombosis Research Center, Temple University, Philadelphia, PA
| | - Sandra Lazorthes
- Université de Toulouse, UPS, LBCMCP, F-31062 Toulouse, France
- CNRS, LBCMCP, F-31062 Toulouse, France
| | - Estelle Nicolas
- Université de Toulouse, UPS, LBCMCP, F-31062 Toulouse, France
- CNRS, LBCMCP, F-31062 Toulouse, France
| | - Nianli Sang
- Department of Biology, Drexel University, 3245 Chestnut St, PISB 417, Philadelphia, PA
| | - Timothy J Triche
- Department of Pathology, University of Southern California, 1975 Zonal Avenue, Los Angeles, CA
| | - Timothy A McCaffrey
- The George Washington University Medical Center, Department of Medicine, Division of Genomic Medicine, 2300 I St. NW, Washington, D.C
| | - Weidong Xiao
- Department of Microbiology and Immunology, Sol Sherry Thrombosis Research Center, Temple University, Philadelphia, PA
| | | |
Collapse
|
6
|
Genes associated with the cis-regulatory functions of intragenic LINE-1 elements. BMC Genomics 2013; 14:205. [PMID: 23530910 PMCID: PMC3643820 DOI: 10.1186/1471-2164-14-205] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2012] [Accepted: 03/19/2013] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Thousands of intragenic long interspersed element 1 sequences (LINE-1 elements or L1s) reside within genes. These intragenic L1 sequences are conserved and regulate the expression of their host genes. When L1 methylation is decreased, either through chemical induction or in cancer, the intragenic L1 transcription is increased. The resulting L1 mRNAs form RISC complexes with pre-mRNA to degrade the complementary mRNA. In this study, we screened for genes that are involved in intragenic L1 regulation networks. RESULTS Genes containing L1s were obtained from L1Base (http://l1base.molgen.mpg.de). The expression profiles of 205 genes in 516 gene knockdown experiments were obtained from the Gene Expression Omnibus (GEO) (http://www.ncbi.nlm.nih.gov/geo). The expression levels of the genes with and without L1s were compared using Pearson's chi-squared test. After a permutation based statistical analysis and a multiple hypothesis testing, 73 genes were found to induce significant regulatory changes (upregulation and/or downregulation) in genes with L1s. In detail, 5 genes were found to induce both the upregulation and downregulation of genes with L1s, whereas 27 and 37 genes induced the downregulation and upregulation, respectively, of genes with L1s. These regulations sometimes differed depending on the cell type and the orientation of the intragenic L1s. Moreover, the siRNA-regulating genes containing L1s possess a variety of molecular functions, are responsible for many cellular phenotypes and are associated with a number of diseases. CONCLUSIONS Cells use intragenic L1s as cis-regulatory elements within gene bodies to modulate gene expression. There may be several mechanisms by which L1s mediate gene expression. Intragenic L1s may be involved in the regulation of several biological processes, including DNA damage and repair, inflammation, immune function, embryogenesis, cell differentiation, cellular response to external stimuli and hormonal responses. Furthermore, in addition to cancer, intragenic L1s may alter gene expression in a variety of diseases and abnormalities.
Collapse
|
7
|
Gao D, Jimenez-Lopez JC, Iwata A, Gill N, Jackson SA. Functional and structural divergence of an unusual LTR retrotransposon family in plants. PLoS One 2012; 7:e48595. [PMID: 23119066 PMCID: PMC3485330 DOI: 10.1371/journal.pone.0048595] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2012] [Accepted: 09/28/2012] [Indexed: 12/24/2022] Open
Abstract
Retrotransposons with long terminal repeats (LTRs) more than 3 kb are not frequent in most eukaryotic genomes. Rice LTR retrotransposon, Retrosat2, has LTRs greater than 3.2 kb and two open reading frames (ORF): ORF1 encodes enzymes for retrotransposition whereas no function can be assigned to ORF0 as it is not found in any other organism. A variety of experimental and in silico approaches were used to determine the origin of Retrosat2 and putative function of ORF0. Our data show that not only is Retrosat2 highly abundant in the Oryza genus, it may yet be active in rice. Homologs of Retrosat2 were identified in maize, sorghum, Arabidopsis and other plant genomes suggesting that the Retrosat2 family is of ancient origin. Several putatively cis-acting elements, some multicopy, that regulate retrotransposon replication or responsiveness to environmental factors were found in the LTRs of Retrosat2. Unlike the ORF1, the ORF0 sequences from Retrosat2 and homologs are divergent at the sequence level, 3D-structures and predicted biological functions. In contrast to other retrotransposon families, Retrosat2 and its homologs are dispersed throughout genomes and not concentrated in the specific chromosomal regions, such as centromeres. The genomic distribution of Retrosat2 homologs varies across species which likely reflects the differing evolutionary trajectories of this retrotransposon family across diverse species.
Collapse
Affiliation(s)
- Dongying Gao
- Center for Applied Genetic Technologies, University of Georgia, Athens, Georgia, United States of America
| | - Jose C. Jimenez-Lopez
- Department of Biochemistry, Cell & Molecular Biology of Plants, Estacion Experimental del Zaidin, High Council for Scientific Research, Granada, Spain
| | - Aiko Iwata
- Center for Applied Genetic Technologies, University of Georgia, Athens, Georgia, United States of America
| | - Navdeep Gill
- Center for Applied Genetic Technologies, University of Georgia, Athens, Georgia, United States of America
- Department of Botany, University of British Columbia, Vancouver, British Columbia, Canada
| | - Scott A. Jackson
- Center for Applied Genetic Technologies, University of Georgia, Athens, Georgia, United States of America
| |
Collapse
|
8
|
Gabriel U, Steidler A, Trojan L, Michel MS, Seifarth W, Fabarius A. Smoking increases transcription of human endogenous retroviruses in a newly established in vitro cell model and in normal urothelium. AIDS Res Hum Retroviruses 2010; 26:883-8. [PMID: 20666582 DOI: 10.1089/aid.2010.0014] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/27/2023] Open
Abstract
Human endogenous retroviruses (HERVs) accounting for 9% of the human genome are considered as surrogate markers for genetic instability and as a driving force of genetic variation. Moreover, they modulate regular gene activities and give rise to expression of disease-associated peptides that may serve as diagnostic markers or even targets for T cell-based immune responses. To date, no data are available on the potential link between urothelial carcinogenesis, HERV activity, and tobacco smoking, the main risk for bladder cancer. Here, we report on potential alterations in HERV transcription induced by smoking in a newly established in vitro model and in human urothelium. Normal human dermal fibroblasts were cultivated with urine from never (n = 6) and current smokers (n = 6) and transcription levels for the HERV subfamilies HERV-E 4-1, HERV-T S71-TK1, and HERV-K HML-6 were measured by quantitative real-time PCR. Tendencies toward increased mean transcript levels were detected for cells treated with urine from current smokers. Equally, activity measured in human urothelium supported an increase of HERV transcription in current smokers (n = 9) compared to never smokers (n = 4).
Collapse
Affiliation(s)
- Ute Gabriel
- Department of Urology, University of Heidelberg, Mannheim, Germany
| | - Annette Steidler
- Department of Urology, University of Heidelberg, Mannheim, Germany
| | - Lutz Trojan
- Department of Urology, University of Heidelberg, Mannheim, Germany
| | | | - Wolfgang Seifarth
- Medical Clinic III, Mannheim Medical Center, University of Heidelberg, Mannheim, Germany
| | - Alice Fabarius
- Medical Clinic III, Mannheim Medical Center, University of Heidelberg, Mannheim, Germany
| |
Collapse
|
9
|
Goto Y, Kimura H. Inactive X chromosome-specific histone H3 modifications and CpG hypomethylation flank a chromatin boundary between an X-inactivated and an escape gene. Nucleic Acids Res 2010; 37:7416-28. [PMID: 19843608 PMCID: PMC2794193 DOI: 10.1093/nar/gkp860] [Citation(s) in RCA: 38] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022] Open
Abstract
In mammals, the dosage compensation of sex chromosomes between males and females is achieved by transcriptional inactivation of one of the two X chromosomes in females. However, a number of genes escape X-inactivation in humans. It remains poorly understood how the transcriptional activity of these ‘escape genes’ is maintained despite the chromosome-wide heterochromatin formation. To address this question, we analyzed a putative chromatin boundary between the inactivated RBM10 and an escape gene, UBA1/UBE1. Chromatin immunoprecipitation revealed that trimethylated histone H3 lysine 9 and H4 lysine 20 were enriched in the last exon through the proximal downstream region of RBM10, but were remarkably diminished at ∼2 kb upstream of the UBA1 transcription start site. Whereas RNA polymerase II was not loaded onto the intergenic region, CTCF (CCCTC binding factor) was enriched around the boundary, where some CpG sites were hypomethylated specifically on inactive X. These findings suggest that local DNA hypomethylation and CTCF binding are involved in the formation of a chromatin boundary, which protects the UBA1 escape gene against the chromosome-wide transcriptional silencing.
Collapse
Affiliation(s)
- Yuji Goto
- Nuclear Function and Dynamics Unit, Horizontal Medical Research Organization, Graduate School of Medicine, Kyoto University, Sakyo-ku, Kyoto 606-8501, Japan
| | | |
Collapse
|
10
|
Unique functions of repetitive transcriptomes. INTERNATIONAL REVIEW OF CELL AND MOLECULAR BIOLOGY 2010; 285:115-88. [PMID: 21035099 DOI: 10.1016/b978-0-12-381047-2.00003-7] [Citation(s) in RCA: 42] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Abstract
Repetitive sequences occupy a huge fraction of essentially every eukaryotic genome. Repetitive sequences cover more than 50% of mammalian genomic DNAs, whereas gene exons and protein-coding sequences occupy only ~3% and 1%, respectively. Numerous genomic repeats include genes themselves. They generally encode "selfish" proteins necessary for the proliferation of transposable elements (TEs) in the host genome. The major part of evolutionary "older" TEs accumulated mutations over time and fails to encode functional proteins. However, repeats have important functions also on the RNA level. Repetitive transcripts may serve as multifunctional RNAs by participating in the antisense regulation of gene activity and by competing with the host-encoded transcripts for cellular factors. In addition, genomic repeats include regulatory sequences like promoters, enhancers, splice sites, polyadenylation signals, and insulators, which actively reshape cellular transcriptomes. TE expression is tightly controlled by the host cells, and some mechanisms of this regulation were recently decoded. Finally, capacity of TEs to proliferate in the host genome led to the development of multiple biotechnological applications.
Collapse
|
11
|
Gogvadze E, Buzdin A. Retroelements and their impact on genome evolution and functioning. Cell Mol Life Sci 2009; 66:3727-42. [PMID: 19649766 PMCID: PMC11115525 DOI: 10.1007/s00018-009-0107-2] [Citation(s) in RCA: 78] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2009] [Revised: 06/11/2009] [Accepted: 07/14/2009] [Indexed: 12/31/2022]
Abstract
Retroelements comprise a considerable fraction of eukaryotic genomes. Since their initial discovery by Barbara McClintock in maize DNA, retroelements have been found in genomes of almost all organisms. First considered as a "junk DNA" or genomic parasites, they were shown to influence genome functioning and to promote genetic innovations. For this reason, they were suggested as an important creative force in the genome evolution and adaptation of an organism to altered environmental conditions. In this review, we summarize the up-to-date knowledge of different ways of retroelement involvement in structural and functional evolution of genes and genomes, as well as the mechanisms generated by cells to control their retrotransposition.
Collapse
Affiliation(s)
- Elena Gogvadze
- Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, 16/10 Miklukho-Maklaya st, 117997 Moscow, Russia.
| | | |
Collapse
|
12
|
Zhu X, Ling J, Zhang L, Pi W, Wu M, Tuan D. A facilitated tracking and transcription mechanism of long-range enhancer function. Nucleic Acids Res 2007; 35:5532-44. [PMID: 17704132 PMCID: PMC2018613 DOI: 10.1093/nar/gkm595] [Citation(s) in RCA: 66] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
In the human ε−globin gene locus, the HS2 enhancer in the Locus Control Region regulates transcription of the embryonic ε-globin gene located over 10 kb away. The mechanism of long-range HS2 enhancer function was not fully established. Here we show that the HS2 enhancer complex containing the enhancer DNA together with RNA polymerase II (pol II) and TBP tracks along the intervening DNA, synthesizing short, polyadenylated, intergenic RNAs to ultimately loop with the ε-globin promoter. Guided by this facilitated tracking and transcription mechanism, the HS2 enhancer delivers pol II and TBP to the cis-linked globin promoter to activate mRNA synthesis from the target gene. An insulator inserted in the intervening DNA between the enhancer and the promoter traps the enhancer DNA and the associated pol II and TBP at the insulator site, blocking mid-stream the facilitated tracking and transcription mechanism of the enhancer complex, thereby blocking long-range enhancer function.
Collapse
Affiliation(s)
| | | | | | | | | | - Dorothy Tuan
- *To whom correspondence should be addressed. 706 721 0272706 721 6608
| |
Collapse
|
13
|
Papachatzopoulou A, Kourakli A, Makropoulou P, Kakagianne T, Sgourou A, Papadakis M, Athanassiadou A. Genotypic heterogeneity and correlation to intergenic haplotype within high HbF beta-thalassemia intermedia. Eur J Haematol 2006; 76:322-30. [PMID: 16519704 DOI: 10.1111/j.1600-0609.2005.00618.x] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
OBJECTIVES A molecular study was carried out of beta-thalassemia intermedia patients, compound heterozygotes for mutations usually found in beta-thalassemia major, with high levels of HbF in the absence of hereditary persistence of fetal hemoglobin (HPFH) syndrome. Our objective was to locate cis-DNA structures, DNA haplotypes, motifs, or polymorphisms that may correlate with the presence of high HbF. METHODS Allele-specific oligonucleotide (ASO) hybridization was used for the detection of mutations and restriction fragment length polymorphism (RFLP) analysis and automated sequencing for motifs, haplotypes, and polymorphisms. Southern blot was used for investigating alpha-thalassemia and/or alpha- or gamma-globin genes triplications. RNA extracted from burst forming unit-erythroid (BFU-e) colonies of peripheral blood mononuclear cell cultures was used in reverse transcriptase-polymerase chain reaction (RT-PCR) to investigate intergenic transcription. RESULTS We established that (i) the combination: T haplotype of the Agamma-delta-globin intergenic region, the motif (TA)9N10(TA)10 in the HS2 site of locus control region (LCR), and TAG pre-Ggamma haplotype is sufficient but not necessary for high HbF, (ii) the genetic determinant(s) for high HbF involves an element associated with this combination and must be present in the specific R haplotype occurring in beta-thalassemia intermedia and (iii) the genetic determinant(s) for high HbF does not involve the abolition of intergenic transcription in the Agamma-delta-globin intergenic region. CONCLUSIONS The genetic determinant(s) of high HbF in the absence of HPFH is linked to intergenic haplotype T and does not disrupt intergenic transcription.
Collapse
|
14
|
Ling J, Baibakov B, Pi W, Emerson BM, Tuan D. The HS2 enhancer of the beta-globin locus control region initiates synthesis of non-coding, polyadenylated RNAs independent of a cis-linked globin promoter. J Mol Biol 2005; 350:883-96. [PMID: 15979088 DOI: 10.1016/j.jmb.2005.05.039] [Citation(s) in RCA: 45] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2005] [Revised: 05/06/2005] [Accepted: 05/18/2005] [Indexed: 11/24/2022]
Abstract
The HS2 enhancer in the beta-globin locus control region (LCR) regulates transcription of the globin genes 10-50 kb away. Earlier studies show that a transcription mechanism initiated by the HS2 enhancer through the intervening DNA in the direction of the cis-linked promoter and gene mediates long-range enhancer function. Here, we further analyzed the enhancer-initiated RNAs and their mode of transcription from the HS2 enhancer in the endogenous genome of erythroid K562 cells, in plasmids integrated into K562 cells and in purified DNA used as template in in vitro transcription reactions. We found that the HS2 enhancer was able to initiate transcription autonomously in the absence of a cis-linked globin promoter. The enhancer-initiated, intergenic RNAs were different from the mRNA synthesized at the promoter in several aspects. The enhancer RNAs were synthesized not from a defined site but from multiple sites both within and as far as 1 kb downstream of the enhancer. The enhancer RNAs did not appear to contain a normal cap structure at the 5' ends. They were polyadenylated at multiple sites within 3 kb downstream of their initiation sites and were therefore shorter than 3 kb in lengths. The enhancer RNAs remained in discrete spots within the nucleus and were not processed into mRNA or translated into proteins. These particular features of enhancer-initiated transcription indicate that the transcriptional complex assembled by the enhancer was different from the basal transcription complex assembled at the promoter. The results suggest that in synthesizing non-coding, intergenic RNAs, the enhancer-assembled transcription complex could track through the intervening DNA to reach the basal promoter complex and activate efficient mRNA synthesis from the promoter.
Collapse
Affiliation(s)
- Jianhua Ling
- Department of Biochemistry and Molecular Biology, Medical College of Georgia, Augusta, GA 30912, USA
| | | | | | | | | |
Collapse
|
15
|
Yu X, Zhu X, Pi W, Ling J, Ko L, Takeda Y, Tuan D. The long terminal repeat (LTR) of ERV-9 human endogenous retrovirus binds to NF-Y in the assembly of an active LTR enhancer complex NF-Y/MZF1/GATA-2. J Biol Chem 2005; 280:35184-94. [PMID: 16105833 DOI: 10.1074/jbc.m508138200] [Citation(s) in RCA: 37] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022] Open
Abstract
The solitary ERV-9 long terminal repeat (LTR) located upstream of the HS5 site in the human beta-globin locus control region exhibits prominent enhancer activity in embryonic and erythroid cells. The LTR enhancer contains 14 tandemly repeated subunits with recurrent CCAAT, GTGGGGA, and GATA motifs. Here we showed that in erythroid K562 cells these DNA motifs bound the following three transcription factors: ubiquitous NF-Y and hematopoietic MZF1 and GATA-2. These factors and their target DNA motifs exhibited a hierarchy of DNA/protein and protein/protein binding affinities: NF-Y/CCAAT > NF-Y/GATA-2 > NF-Y/MZF1 > MZF1/GTGGGGA; GATA-2/GATA. Through protein/protein interactions, NF-Y bound at the CCAAT motif recruited MZF1 and GATA-2, but not Sp1 and GATA-1, and stabilized their binding to the neighboring GTGGGGA and GATA sites to assemble a novel LTR enhancer complex, NF-Y/MZF1/GATA-2. In the LTR-HS5-epsilonp-GFP plasmid integrated into K562 cells, mutation of the CCAAT motif in the LTR enhancer to abolish NF-Y binding inactivated the enhancer, closed down the chromatin structure of the epsilon-globin promoter, and silenced transcription of the green fluorescent protein gene. The results indicated that NF-Y bound at the CCAAT motifs assembled a robust LTR enhancer complex, which could act over the intervening DNA to remodel the chromatin structure and to stimulate the transcription of the downstream gene locus.
Collapse
Affiliation(s)
- Xiuping Yu
- Department of Biochemistry and Molecular Biology and Institute of Molecular Medicine and Genetics, Medical College of Georgia, Augusta, Georgia 30912, USA
| | | | | | | | | | | | | |
Collapse
|
16
|
Ling J, Ainol L, Zhang L, Yu X, Pi W, Tuan D. HS2 enhancer function is blocked by a transcriptional terminator inserted between the enhancer and the promoter. J Biol Chem 2004; 279:51704-13. [PMID: 15465832 DOI: 10.1074/jbc.m404039200] [Citation(s) in RCA: 62] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023] Open
Abstract
The HS2 enhancer in the beta-globin locus control region regulates transcription of the globin genes 10-50 kb away. How the HS2 enhancer acts over this distance is not clearly understood. Earlier studies show that in erythroid cells the HS2 enhancer initiates synthesis of intergenic RNAs from sites within and downstream of the enhancer, and the enhancer-initiated RNAs are transcribed through the intervening DNA into the cis-linked promoter and gene. To investigate the functional significance of the enhancer-initiated transcription, here we inserted the lac operator sequence in the intervening DNA between the HS2 enhancer and the epsilon-globin promoter in reporter plasmids and integrated the plasmids into erythroid K562 cells expressing the lac repressor protein. We found that the interposed lac operator/repressor complex blocked the elongation of enhancer-initiated transcription through the intervening DNA and drastically reduced HS2 enhancer function as measured by the level of mRNA synthesized from the epsilon-globin promoter. The results indicate that the tracking and transcription mechanism of the HS2 enhancer-assembled transcriptional machinery from the enhancer through the intervening DNA into the cis-linked promoter can mediate enhancer-promoter interaction over a long distance.
Collapse
Affiliation(s)
- Jianhua Ling
- Department of Biochemistry and Molecular Biology, School of Medicine, Medical College of Georgia, Augusta, Georgia 30912, USA
| | | | | | | | | | | |
Collapse
|
17
|
Ling J, Zhang L, Jin H, Pi W, Kosteas T, Anagnou NP, Goodman M, Tuan D. Dynamic retrotransposition of ERV-9 LTR and L1 in the beta-globin gene locus during primate evolution. Mol Phylogenet Evol 2004; 30:867-71. [PMID: 15012967 DOI: 10.1016/j.ympev.2003.10.004] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2003] [Revised: 09/23/2003] [Indexed: 11/25/2022]
Affiliation(s)
- Jianhua Ling
- Department of Biochemistry and Molecular Biology, School of Medicine, Medical College of Georgia, Augusta, GA 30912, USA
| | | | | | | | | | | | | | | |
Collapse
|