1
|
Haddad-Mashadrizeh A, Mirahmadi M, Taghavizadeh Yazdi ME, Gholampour-Faroji N, Bahrami A, Zomorodipour A, Moghadam Matin M, Qayoomian M, Saebnia N. Introns and Their Therapeutic Applications in Biomedical Researches. IRANIAN JOURNAL OF BIOTECHNOLOGY 2023; 21:e3316. [PMID: 38269198 PMCID: PMC10804063 DOI: 10.30498/ijb.2023.334488.3316] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/16/2022] [Accepted: 03/23/2023] [Indexed: 01/26/2024]
Abstract
Context Although for a long time, it was thought that intervening sequences (introns) were junk DNA without any function, their critical roles and the underlying molecular mechanisms in genome regulation have only recently come to light. Introns not only carry information for splicing, but they also play many supportive roles in gene regulation at different levels. They are supposed to function as useful tools in various biological processes, particularly in the diagnosis and treatment of diseases. Introns can contribute to numerous biological processes, including gene silencing, gene imprinting, transcription, mRNA metabolism, mRNA nuclear export, mRNA localization, mRNA surveillance, RNA editing, NMD, translation, protein stability, ribosome biogenesis, cell growth, embryonic development, apoptosis, molecular evolution, genome expansion, and proteome diversity through various mechanisms. Evidence Acquisition In order to fulfill the objectives of this study, the following databases were searched: Medline, Scopus, Web of Science, EBSCO, Open Access Journals, and Google Scholar. Only articles published in English were included. Results & Conclusions The intervening sequences of eukaryotic genes have critical functions in genome regulation, as well as in molecular evolution. Here, we summarize recent advances in our understanding of how introns influence genome regulation, as well as their effects on molecular evolution. Moreover, therapeutic strategies based on intron sequences are discussed. According to the obtained results, a thorough understanding of intron functional mechanisms could lead to new opportunities in disease diagnosis and therapies, as well as in biotechnology applications.
Collapse
Affiliation(s)
- Aliakbar Haddad-Mashadrizeh
- Industrial Biotechnology Research Group, Institute of Biotechnology, Ferdowsi University of Mashhad, Mashhad, Iran
| | - Mahdi Mirahmadi
- Stem Cell and Regenerative Medicine Research Group, Iranian Academic Center for Education, Culture and Research (ACECR), Khorasan Razavi Branch, Mashhad, Iran
- Biotechnology Research Center, Pharmaceutical Technology Institute, Mashhad University of Medical Sciences, Mashhad, Iran
| | | | - Nazanin Gholampour-Faroji
- Industrial Biotechnology Research Group, Institute of Biotechnology, Ferdowsi University of Mashhad, Mashhad, Iran
| | - Ahmadreza Bahrami
- Industrial Biotechnology Research Group, Institute of Biotechnology, Ferdowsi University of Mashhad, Mashhad, Iran
- Department of Biology, Faculty of Science, Ferdowsi University of Mashhad, Mashhad, Iran
| | | | - Maryam Moghadam Matin
- Department of Biology, Faculty of Science, Ferdowsi University of Mashhad, Mashhad, Iran
| | - Mohsen Qayoomian
- Applied Biomedical Research Center, Mashhad University of Medical Sciences, Mashhad, Iran
| | - Neda Saebnia
- Department of Biology, Faculty of Science, Ferdowsi University of Mashhad, Mashhad, Iran
| |
Collapse
|
2
|
van Riet J, Saha C, Strepis N, Brouwer RWW, Martens-Uzunova ES, van de Geer WS, Swagemakers SMA, Stubbs A, Halimi Y, Voogd S, Tanmoy AM, Komor MA, Hoogstrate Y, Janssen B, Fijneman RJA, Niknafs YS, Chinnaiyan AM, van IJcken WFJ, van der Spek PJ, Jenster G, Louwen R. CRISPRs in the human genome are differentially expressed between malignant and normal adjacent to tumor tissue. Commun Biol 2022; 5:338. [PMID: 35396392 PMCID: PMC8993844 DOI: 10.1038/s42003-022-03249-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2020] [Accepted: 03/09/2022] [Indexed: 11/09/2022] Open
Abstract
Clustered Regularly Interspaced Short Palindromic Repeats (CRISPRs) have been identified in bacteria, archaea and mitochondria of plants, but not in eukaryotes. Here, we report the discovery of 12,572 putative CRISPRs randomly distributed across the human chromosomes, which we termed hCRISPRs. By using available transcriptome datasets, we demonstrate that hCRISPRs are distinctively expressed as small non-coding RNAs (sncRNAs) in cell lines and human tissues. Moreover, expression patterns thereof enabled us to distinguish normal from malignant tissues. In prostate cancer, we confirmed the differential hCRISPR expression between normal adjacent and malignant primary prostate tissue by RT-qPCR and demonstrate that the SHERLOCK and DETECTR dipstick tools are suitable to detect these sncRNAs. We anticipate that the discovery of CRISPRs in the human genome can be further exploited for diagnostic purposes in cancer and other medical conditions, which certainly will lead to the development of point-of-care tests based on the differential expression of the hCRISPRs.
Collapse
Affiliation(s)
- Job van Riet
- Department of Urology, Erasmus MC Cancer Institute, University Medical Center Rotterdam, Rotterdam, Netherlands
- Cancer Computational Biology Center, Erasmus MC Cancer Institute, University Medical Center Rotterdam, Rotterdam, Netherlands
- Department of Medical Oncology, Erasmus MC Cancer Institute, University Medical Center Rotterdam, Rotterdam, Netherlands
| | - Chinmoy Saha
- Department of Medical Microbiology and Infectious Diseases, Erasmus University Medical Center Rotterdam, Rotterdam, Netherlands
| | - Nikolaos Strepis
- Department of Medical Microbiology and Infectious Diseases, Erasmus University Medical Center Rotterdam, Rotterdam, Netherlands
| | - Rutger W W Brouwer
- Center for Biomics, Erasmus University Medical Center Rotterdam, Rotterdam, Netherlands
| | - Elena S Martens-Uzunova
- Department of Urology, Erasmus MC Cancer Institute, University Medical Center Rotterdam, Rotterdam, Netherlands
| | - Wesley S van de Geer
- Cancer Computational Biology Center, Erasmus MC Cancer Institute, University Medical Center Rotterdam, Rotterdam, Netherlands
- Department of Medical Oncology, Erasmus MC Cancer Institute, University Medical Center Rotterdam, Rotterdam, Netherlands
| | - Sigrid M A Swagemakers
- Clinical Bioinformatics, Department of Pathology, Erasmus University Medical Center Rotterdam, Rotterdam, Netherlands
| | - Andrew Stubbs
- Clinical Bioinformatics, Department of Pathology, Erasmus University Medical Center Rotterdam, Rotterdam, Netherlands
| | - Yassir Halimi
- Department of Medical Microbiology and Infectious Diseases, Erasmus University Medical Center Rotterdam, Rotterdam, Netherlands
| | - Sanne Voogd
- Department of Medical Microbiology and Infectious Diseases, Erasmus University Medical Center Rotterdam, Rotterdam, Netherlands
| | - Arif Mohammad Tanmoy
- Department of Medical Microbiology and Infectious Diseases, Erasmus University Medical Center Rotterdam, Rotterdam, Netherlands
- Child Health Research Foundation, 23/2 SEL Huq Skypark, Block-B, Khilji Rd, Dhaka, 1207, Bangladesh
| | - Malgorzata A Komor
- Translational Gastrointestinal Oncology, Department of Pathology, Netherlands Cancer Institute, Amsterdam, Netherlands
- Oncoproteomics Laboratory, Department of Medical Oncology, VU University Medical Center, Amsterdam, Netherlands
| | - Youri Hoogstrate
- Department of Neurology, Erasmus University Medical Center Rotterdam, Rotterdam, Netherlands
| | | | - Remond J A Fijneman
- Translational Gastrointestinal Oncology, Department of Pathology, Netherlands Cancer Institute, Amsterdam, Netherlands
| | - Yashar S Niknafs
- Michigan Center for Translational Pathology, University of Michigan, Ann Arbor, MI, USA
| | - Arul M Chinnaiyan
- Michigan Center for Translational Pathology, University of Michigan, Ann Arbor, MI, USA
| | | | - Peter J van der Spek
- Clinical Bioinformatics, Department of Pathology, Erasmus University Medical Center Rotterdam, Rotterdam, Netherlands
| | - Guido Jenster
- Department of Urology, Erasmus MC Cancer Institute, University Medical Center Rotterdam, Rotterdam, Netherlands
| | - Rogier Louwen
- Department of Medical Microbiology and Infectious Diseases, Erasmus University Medical Center Rotterdam, Rotterdam, Netherlands.
| |
Collapse
|
3
|
Romero AA, Cobb SA, Collins JNR, Kliewer SA, Mangelsdorf DJ, Collins JJ. The Schistosoma mansoni nuclear receptor FTZ-F1 maintains esophageal gland function via transcriptional regulation of meg-8.3. PLoS Pathog 2021; 17:e1010140. [PMID: 34910770 PMCID: PMC8673669 DOI: 10.1371/journal.ppat.1010140] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2021] [Accepted: 11/23/2021] [Indexed: 11/19/2022] Open
Abstract
Schistosomes infect over 200 million of the world's poorest people, but unfortunately treatment relies on a single drug. Nuclear hormone receptors are ligand-activated transcription factors that regulate diverse processes in metazoans, yet few have been functionally characterized in schistosomes. During a systematic analysis of nuclear receptor function, we found that an FTZ-F1-like receptor was essential for parasite survival. Using a combination of transcriptional profiling and chromatin immunoprecipitation (ChIP), we discovered that the micro-exon gene meg-8.3 is a transcriptional target of SmFTZ-F1. We found that both Smftz-f1 and meg-8.3 are required for esophageal gland maintenance as well as integrity of the worm's head. Together, these studies define a new role for micro-exon gene function in the parasite and suggest that factors associated with the esophageal gland could represent viable therapeutic targets.
Collapse
Affiliation(s)
- Aracely A. Romero
- Department of Pharmacology, UT Southwestern Medical Center, Dallas, Texas, United States of America
| | - Sarah A. Cobb
- Department of Pharmacology, UT Southwestern Medical Center, Dallas, Texas, United States of America
| | - Julie N. R. Collins
- Department of Pharmacology, UT Southwestern Medical Center, Dallas, Texas, United States of America
| | - Steven A. Kliewer
- Department of Pharmacology, UT Southwestern Medical Center, Dallas, Texas, United States of America
- Department of Molecular Biology, UT Southwestern Medical Center, Dallas, Texas, United States of America
| | - David J. Mangelsdorf
- Department of Pharmacology, UT Southwestern Medical Center, Dallas, Texas, United States of America
- Howard Hughes Medical Institute, Dallas, Texas, United States of America
| | - James J. Collins
- Department of Pharmacology, UT Southwestern Medical Center, Dallas, Texas, United States of America
| |
Collapse
|
4
|
Stockwell PA, Lynch-Sutherland CF, Chatterjee A, Macaulay EC, Eccles MR. RepExpress: A Novel Pipeline for the Quantification and Characterization of Transposable Element Expression from RNA-seq Data. Curr Protoc 2021; 1:e206. [PMID: 34387946 DOI: 10.1002/cpz1.206] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
Transposable elements (TEs) are key regulators of both development and disease; however, their repetitive nature presents substantial computational challenges to their analysis. Due to a lack of computational tools and suitable analysis frameworks, TE expression is often not quantified at the locus level. Therefore, we have developed RepExpress, a novel pipeline that enables locus-level TE quantification and characterization. RepExpress enables the characterization of TE expression in a genomic context, and is the first tool focusing on the identification of tissue-specific TE-derived and TE-regulated genes. RepExpress identifies expressed TEs overlapping with annotated genomic features and enables tissue-specific profiles of TE-derived genes. TEs that are expressed with no overlap with any known genomic features are characterized by the closest downstream genomic feature enabling identification of novel TE-gene regulatory relationships. RepExpress takes standard RNA-seq data as input and performs genomic alignment optimized for TEs. Our novel pipeline quantifies expression of both TEs and genes using featureCounts and Stringtie, respectively. RepExpress then filters expressed repeats and characterizes their genomic context, enabling the identification of TEs that overlap with genes, or that may be influencing gene expression. Here, we describe RepExpress, and provide a step-by-step protocol detailing its workflow. We also discuss other TE analysis tools and their applicability to addressing different biological questions. © 2021 Wiley Periodicals LLC. Basic Protocol: RepExpress workflow.
Collapse
Affiliation(s)
- Peter A Stockwell
- Department of Pathology, Dunedin School of Medicine, University of Otago, Dunedin, New Zealand
| | | | - Aniruddha Chatterjee
- Department of Pathology, Dunedin School of Medicine, University of Otago, Dunedin, New Zealand.,Maurice Wilkins Centre for Molecular Biodiscovery, Auckland, New Zealand
| | - Erin C Macaulay
- Department of Pathology, Dunedin School of Medicine, University of Otago, Dunedin, New Zealand
| | - Michael R Eccles
- Department of Pathology, Dunedin School of Medicine, University of Otago, Dunedin, New Zealand.,Maurice Wilkins Centre for Molecular Biodiscovery, Auckland, New Zealand
| |
Collapse
|
5
|
Tseng CC, Wong MC, Liao WT, Chen CJ, Lee SC, Yen JH, Chang SJ. Genetic Variants in Transcription Factor Binding Sites in Humans: Triggered by Natural Selection and Triggers of Diseases. Int J Mol Sci 2021; 22:ijms22084187. [PMID: 33919522 PMCID: PMC8073710 DOI: 10.3390/ijms22084187] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2021] [Revised: 04/15/2021] [Accepted: 04/16/2021] [Indexed: 12/14/2022] Open
Abstract
Variants of transcription factor binding sites (TFBSs) constitute an important part of the human genome. Current evidence demonstrates close links between nucleotides within TFBSs and gene expression. There are multiple pathways through which genomic sequences located in TFBSs regulate gene expression, and recent genome-wide association studies have shown the biological significance of TFBS variation in human phenotypes. However, numerous challenges remain in the study of TFBS polymorphisms. This article aims to cover the current state of understanding as regards the genomic features of TFBSs and TFBS variants; the mechanisms through which TFBS variants regulate gene expression; the approaches to studying the effects of nucleotide changes that create or disrupt TFBSs; the challenges faced in studies of TFBS sequence variations; the effects of natural selection on collections of TFBSs; in addition to the insights gained from the study of TFBS alleles related to gout, its associated comorbidities (increased body mass index, chronic kidney disease, diabetes, dyslipidemia, coronary artery disease, ischemic heart disease, hypertension, hyperuricemia, osteoporosis, and prostate cancer), and the treatment responses of patients.
Collapse
Affiliation(s)
- Chia-Chun Tseng
- Graduate Institute of Clinical Medicine, College of Medicine, Kaohsiung Medical University, Kaohsiung 80708, Taiwan; (C.-C.T.); (J.-H.Y.)
- Division of Rheumatology, Department of Internal Medicine, Kaohsiung Medical University Hospital, Kaohsiung 80756, Taiwan
| | - Man-Chun Wong
- Department of Biotechnology, College of Life Science, Kaohsiung Medical University, Kaohsiung 80708, Taiwan;
| | - Wei-Ting Liao
- Department of Biotechnology, College of Life Science, Kaohsiung Medical University, Kaohsiung 80708, Taiwan;
- Department of Medical Research, Kaohsiung Medical University Hospital, Kaohsiung 80756, Taiwan
- Correspondence: (W.-T.L.); (S.-J.C.); Tel.: +886-7-3121101 (W.-T.L.); +886-7-5916679 (S.-J.C.); Fax:+886-7-3125339 (W.-T.L.); +886-7-5919264 (S.-J.C.)
| | - Chung-Jen Chen
- Department of Internal Medicine, Kaohsiung Municipal Ta-Tung Hospital, Kaohsiung 80145, Taiwan;
| | - Su-Chen Lee
- Laboratory Diagnosis of Medicine, College of Medicine, Kaohsiung Medical University, Kaohsiung 80708, Taiwan;
| | - Jeng-Hsien Yen
- Graduate Institute of Clinical Medicine, College of Medicine, Kaohsiung Medical University, Kaohsiung 80708, Taiwan; (C.-C.T.); (J.-H.Y.)
- Division of Rheumatology, Department of Internal Medicine, Kaohsiung Medical University Hospital, Kaohsiung 80756, Taiwan
- Institute of Biomedical Sciences, National Sun Yat-Sen University, Kaohsiung 80424, Taiwan
- Department of Biological Science and Technology, National Chiao-Tung University, Hsinchu 30010, Taiwan
| | - Shun-Jen Chang
- Department of Kinesiology, Health and Leisure Studies, National University of Kaohsiung, Kaohsiung 81148, Taiwan
- Correspondence: (W.-T.L.); (S.-J.C.); Tel.: +886-7-3121101 (W.-T.L.); +886-7-5916679 (S.-J.C.); Fax:+886-7-3125339 (W.-T.L.); +886-7-5919264 (S.-J.C.)
| |
Collapse
|
6
|
Mobility connects: transposable elements wire new transcriptional networks by transferring transcription factor binding motifs. Biochem Soc Trans 2021; 48:1005-1017. [PMID: 32573687 PMCID: PMC7329337 DOI: 10.1042/bst20190937] [Citation(s) in RCA: 23] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2020] [Revised: 06/01/2020] [Accepted: 06/03/2020] [Indexed: 12/28/2022]
Abstract
Transposable elements (TEs) constitute major fractions of plant genomes. Their potential to be mobile provides them with the capacity to cause major genome rearrangements. Those effects are potentially deleterious and enforced the evolution of epigenetic suppressive mechanisms controlling TE activity. However, beyond their deleterious effects, TE insertions can be neutral or even advantageous for the host, leading to long-term retention of TEs in the host genome. Indeed, TEs are increasingly recognized as major drivers of evolutionary novelties by regulating the expression of nearby genes. TEs frequently contain binding motifs for transcription factors and capture binding motifs during transposition, which they spread through the genome by transposition. Thus, TEs drive the evolution and diversification of gene regulatory networks by recruiting lineage-specific targets under the regulatory control of specific transcription factors. This process can explain the rapid and repeated evolution of developmental novelties, such as C4 photosynthesis and a wide spectrum of stress responses in plants. It also underpins the convergent evolution of embryo nourishing tissues, the placenta in mammals and the endosperm in flowering plants. Furthermore, the gene regulatory network underlying flower development has also been largely reshaped by TE-mediated recruitment of regulatory elements; some of them being preserved across long evolutionary timescales. In this review, we highlight the potential role of TEs as evolutionary toolkits in plants by showcasing examples of TE-mediated evolutionary novelties.
Collapse
|
7
|
Ali A, Han K, Liang P. Role of Transposable Elements in Gene Regulation in the Human Genome. Life (Basel) 2021; 11:118. [PMID: 33557056 PMCID: PMC7913837 DOI: 10.3390/life11020118] [Citation(s) in RCA: 22] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2021] [Revised: 01/28/2021] [Accepted: 02/02/2021] [Indexed: 02/07/2023] Open
Abstract
Transposable elements (TEs), also known as mobile elements (MEs), are interspersed repeats that constitute a major fraction of the genomes of higher organisms. As one of their important functional impacts on gene function and genome evolution, TEs participate in regulating the expression of genes nearby and even far away at transcriptional and post-transcriptional levels. There are two known principal ways by which TEs regulate the expression of genes. First, TEs provide cis-regulatory sequences in the genome with their intrinsic regulatory properties for their own expression, making them potential factors for regulating the expression of the host genes. TE-derived cis-regulatory sites are found in promoter and enhancer elements, providing binding sites for a wide range of trans-acting factors. Second, TEs encode for regulatory RNAs with their sequences showed to be present in a substantial fraction of miRNAs and long non-coding RNAs (lncRNAs), indicating the TE origin of these RNAs. Furthermore, TEs sequences were found to be critical for regulatory functions of these RNAs, including binding to the target mRNA. TEs thus provide crucial regulatory roles by being part of cis-regulatory and regulatory RNA sequences. Moreover, both TE-derived cis-regulatory sequences and TE-derived regulatory RNAs have been implicated in providing evolutionary novelty to gene regulation. These TE-derived regulatory mechanisms also tend to function in a tissue-specific fashion. In this review, we aim to comprehensively cover the studies regarding these two aspects of TE-mediated gene regulation, mainly focusing on the mechanisms, contribution of different types of TEs, differential roles among tissue types, and lineage-specificity, based on data mostly in humans.
Collapse
Affiliation(s)
- Arsala Ali
- Department of Biological Sciences, Brock University, St. Catharines, ON L2S 3A1, Canada;
| | - Kyudong Han
- Department of Microbiology, Dankook University, Cheonan 31116, Korea;
- Center for Bio-Medical Engineering Core Facility, Dankook University, Cheonan 31116, Korea
| | - Ping Liang
- Department of Biological Sciences, Brock University, St. Catharines, ON L2S 3A1, Canada;
- Centre of Biotechnologies, Brock University, St. Catharines, ON L2S 3A1, Canada
| |
Collapse
|
8
|
Haddad-Mashadrizeh A, Hemmat J, Aslamkhan M. Intronic regions of the human coagulation factor VIII gene harboring transcription factor binding sites with a strong bias towards the short-interspersed elements. Heliyon 2020; 6:e04727. [PMID: 32944665 PMCID: PMC7481535 DOI: 10.1016/j.heliyon.2020.e04727] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2019] [Revised: 09/03/2019] [Accepted: 08/10/2020] [Indexed: 12/12/2022] Open
Abstract
Increasing data show that intronic derived regulatory elements, such as transcription factor binding sites (TFBs), play key roles in gene regulation, and malfunction. Accordingly, characterizing the sequence context of the intronic regions of the human coagulation factor VIII (hFVIII) gene can be important. In this study, the intronic regions of the hFVIII gene were scrutinized based on in-silico methods. The results disclosed that these regions harbor a rich array of functional elements such as repetitive elements (REs), splicing sites, and transcription factor binding sites (TFBs). Among these elements, TFBs and REs showed a significant distribution and correlation to each other. This survey indicated that 31% of TFBs are localized in the intronic regions of the gene. Moreover, TFBs indicate a strong bias in the regions far from splice sites of introns with mapping to different REs. Accordingly, TFBs showed highly bias toward Short Interspersed Elements (SINEs), which in turn they covering about 12% of the total of REs. However, the distribution pattern of TFBs-REs showed different bias in the intronic regions, spatially into the Introns 13 and 25. The rich array of SINE-TFBs and CR1-TFBs were situated within 5′UTR of the gene that may be an important driving force for regulatory innovation of the hFVIII gene. Taken together, these data may lead to revealing intronic regions with the capacity to renewing gene regulatory networks of the hFVIII gene. On the other hand, these correlations might provide the novel idea for a new hypothesis of molecular evolution of the FVIII gene, and treatment of Hemophilia A which should be considered in future studies.
Collapse
Affiliation(s)
- Aliakbar Haddad-Mashadrizeh
- Recombinant Proteins Research Group, Institute of Biotechnology, Ferdowsi University of Mashhad, Mashhad, Iran
| | - Jafar Hemmat
- Biotechnology Department, Iranian Research Organization for Science and Technology (IROST), Tehran, Iran
| | - Muhammad Aslamkhan
- Human Genetics & Molecular Biology Dept., University of Health Sciences, Lahore, Pakistan.,Honorary Senior Lecturer in the School of the Medicine University of Liverpool, Liverpool, UK
| |
Collapse
|
9
|
Li L, Zhang S, Li LM. Dual Eigen-modules of Cis-Element Regulation Profiles and Selection of Cognition-Language Eigen-direction along Evolution in Hominidae. Mol Biol Evol 2020; 37:1679-1693. [PMID: 32068872 PMCID: PMC10615152 DOI: 10.1093/molbev/msaa036] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open
Abstract
To understand the genomic basis accounting for the phenotypic differences between human and apes, we compare the matrices consisting of the cis-element frequencies in the proximal regulatory regions of their genomes. One such frequency matrix is represented by a robust singular value decomposition. For each singular value, the negative and positive ends of the sorted motif eigenvector correspond to the dual ends of the sorted gene eigenvector, respectively, comprising a dual eigen-module defined by cis-regulatory element frequencies (CREF). The CREF eigen-modules at levels 1, 2, 3, and 6 are highly conserved across humans, chimpanzees, and orangutans. The key biological processes embedded in the top three CREF eigen-modules are reproduction versus embryogenesis, fetal maturation versus immune system, and stress responses versus mitosis. Although the divergence at the nucleotide level between the chimpanzee and human genome was small, their cis-element frequency matrices crossed a singularity point, at which the fourth and fifth singular values were identical. The CREF eigen-modules corresponding to the fourth and fifth singular values were reorganized along the evolution from apes to human. Interestingly, the fourth sorted gene eigenvector encodes the phenotypes unique to human such as long-term memory, language development, and social behavior. The number of motifs present on Alu elements increases substantially at the fourth level. The motif analysis together with the cases of human-specific Alu insertions suggests that mutations related to Alu elements play a critical role in the evolution of the human-phenotypic gene eigenvector.
Collapse
Affiliation(s)
- Liang Li
- National Center of Mathematics and Interdisciplinary Sciences, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing, China
- University of Chinese Academy of Sciences
| | - Sheng Zhang
- National Center of Mathematics and Interdisciplinary Sciences, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing, China
- University of Chinese Academy of Sciences
| | - Lei M Li
- National Center of Mathematics and Interdisciplinary Sciences, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, Beijing, China
- University of Chinese Academy of Sciences
- Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, Kunming, China
| |
Collapse
|
10
|
Lynch-Sutherland CF, Chatterjee A, Stockwell PA, Eccles MR, Macaulay EC. Reawakening the Developmental Origins of Cancer Through Transposable Elements. Front Oncol 2020; 10:468. [PMID: 32432029 PMCID: PMC7214541 DOI: 10.3389/fonc.2020.00468] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2019] [Accepted: 03/16/2020] [Indexed: 12/12/2022] Open
Abstract
Transposable elements (TEs) have an established role as important regulators of early human development, functioning as tissue-specific genes and regulatory elements. Functional TEs are highly active during early development, and interact with important developmental genes, some of which also function as oncogenes. Dedifferentiation is a hallmark of cancer, and is characterized by genetic and epigenetic changes that enable proliferation, self-renewal and a metabolism reminiscent of embryonic stem cells. There is also compelling evidence suggesting that the path to dedifferentiation in cancer can contribute to invasion and metastasis. TEs are frequently expressed in cancer, and recent work has identified a newly proposed mechanism involving extensive recruitment of TE-derived promoters to drive expression of oncogenes and subsequently promote oncogenesis—a process termed onco-exaptation. However, the mechanism by which this phenomenon occurs, and the extent to which it contributes to oncogenesis remains unknown. Initial hypotheses have proposed that onco-exaptation events are cancer-specific and arise randomly due to the dysregulated and hypomethylated state of cancer cells and abundance of TEs across the genome. However, we suspect that exaptation-like events may not just arise due to chance activation of novel regulatory relationships as proposed previously, but as a result of the reestablishment of early developmental regulatory relationships. Dedifferentiation in cancer is well-documented, along with expression of TEs. The known interactions between TEs and pluripotency factors such as NANOG and OCTt4 during early development, along with the expression of some placental-specific TE-derived transcripts in cancer support a possible link between TEs and dedifferentiation of tumor cells. Thus, we hypothesize that onco-exaptation events can be associated with the epigenetic reawakening of early developmental TEs to regulate expression of oncogenes and promote oncogenesis. We also suspect that activation of these early developmental regulatory TEs may promote dedifferentiation, although at this stage it is hard to predict whether TE activation is one of the initial drivers of dedifferentiation. We expect that developmental TE activation occurs as a result of the establishment of an epigenetic landscape in cancer that resembles that of early development and that developmental TE activation may also enable cancers to exploit early developmental pathways, repurposing them to promote malignancy.
Collapse
Affiliation(s)
| | - Aniruddha Chatterjee
- Department of Pathology, Dunedin School of Medicine, University of Otago, Dunedin, New Zealand.,Maurice Wilkins Centre for Molecular Biodiscovery, Auckland, New Zealand
| | - Peter A Stockwell
- Department of Pathology, Dunedin School of Medicine, University of Otago, Dunedin, New Zealand
| | - Michael R Eccles
- Department of Pathology, Dunedin School of Medicine, University of Otago, Dunedin, New Zealand.,Maurice Wilkins Centre for Molecular Biodiscovery, Auckland, New Zealand
| | - Erin C Macaulay
- Department of Pathology, Dunedin School of Medicine, University of Otago, Dunedin, New Zealand
| |
Collapse
|
11
|
Clayton EA, Rishishwar L, Huang TC, Gulati S, Ban D, McDonald JF, Jordan IK. An atlas of transposable element-derived alternative splicing in cancer. Philos Trans R Soc Lond B Biol Sci 2020; 375:20190342. [PMID: 32075558 PMCID: PMC7061986 DOI: 10.1098/rstb.2019.0342] [Citation(s) in RCA: 23] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 11/06/2019] [Indexed: 12/18/2022] Open
Abstract
Transposable element (TE)-derived sequences comprise more than half of the human genome, and their presence has been documented to alter gene expression in a number of different ways, including the generation of alternatively spliced transcript isoforms. Alternative splicing has been associated with tumorigenesis for a number of different cancers. The objective of this study was to broadly characterize the role of human TEs in generating alternatively spliced transcript isoforms in cancer. To do so, we screened for the presence of TE-derived sequences co-located with alternative splice sites that are differentially used in normal versus cancer tissues. We analysed a comprehensive set of alternative splice variants characterized for 614 matched normal-tumour tissue pairs across 13 cancer types, resulting in the discovery of 4820 TE-generated alternative splice events distributed among 723 cancer-associated genes. Short interspersed nuclear elements (Alu) and long interspersed nuclear elements (L1) were found to contribute the majority of TE-generated alternative splice sites in cancer genes. A number of cancer-associated genes, including MYH11, WHSC1 and CANT1, were shown to have overexpressed TE-derived isoforms across a range of cancer types. TE-derived isoforms were also linked to cancer-specific fusion transcripts, suggesting a novel mechanism for the generation of transcriptome diversity via trans-splicing mediated by dispersed TE repeats. This article is part of a discussion meeting issue 'Crossroads between transposons and gene regulation'.
Collapse
Affiliation(s)
- Evan A. Clayton
- Integrated Cancer Research Center, School of Biological Sciences, Georgia Institute of Technology, Atlanta, GA, USA
| | - Lavanya Rishishwar
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, GA, USA
- PanAmerican Bioinformatics Institute, Cali, Colombia
- Applied Bioinformatics Laboratory, Atlanta, GA, USA
| | - Tzu-Chuan Huang
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, GA, USA
| | - Saurabh Gulati
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, GA, USA
| | - Dongjo Ban
- Integrated Cancer Research Center, School of Biological Sciences, Georgia Institute of Technology, Atlanta, GA, USA
| | - John F. McDonald
- Integrated Cancer Research Center, School of Biological Sciences, Georgia Institute of Technology, Atlanta, GA, USA
| | - I. King Jordan
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, GA, USA
- PanAmerican Bioinformatics Institute, Cali, Colombia
- Applied Bioinformatics Laboratory, Atlanta, GA, USA
| |
Collapse
|
12
|
Zeng Y, Cao Y, Halevy RS, Nguyen P, Liu D, Zhang X, Ahituv N, Han JDJ. Characterization of functional transposable element enhancers in acute myeloid leukemia. SCIENCE CHINA-LIFE SCIENCES 2020; 63:675-687. [PMID: 32170627 DOI: 10.1007/s11427-019-1574-x] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/10/2019] [Accepted: 10/24/2019] [Indexed: 12/15/2022]
Abstract
Transposable elements (TEs) have been shown to have important gene regulatory functions and their alteration could lead to disease phenotypes. Acute myeloid leukemia (AML) develops as a consequence of a series of genetic changes in hematopoietic precursor cells, including mutations in epigenetic factors. Here, we set out to study the gene regulatory role of TEs in AML. We first explored the epigenetic landscape of TEs in AML patients using ATAC-seq data. We show that a large number of TEs in general, and more specifically mammalian-wide interspersed repeats (MIRs), are more enriched in AML cells than in normal blood cells. We obtained a similar finding when analyzing histone modification data in AML patients. Gene Ontology enrichment analysis showed that genes near MIRs in open chromatin regions are involved in leukemogenesis. To functionally validate their regulatory role, we selected 19 MIR regions in AML cells, and tested them for enhancer activity in an AML cell line (Kasumi-1) and a chronic myeloid leukemia (CML) cell line (K562); the results revealed several MIRs to be functional enhancers. Taken together, our results suggest that TEs are potentially involved in myeloid leukemogenesis and highlight these sequences as potential candidates harboring AML-associated variation.
Collapse
Affiliation(s)
- Yingying Zeng
- CAS Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Shanghai Institute of Nutrition and Health, Chinese Academy of Sciences Center for Excellence in Molecular Cell Science, Collaborative Innovation Center for Genetics and Developmental Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, 200031, China
| | - Yaqiang Cao
- CAS Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Shanghai Institute of Nutrition and Health, Chinese Academy of Sciences Center for Excellence in Molecular Cell Science, Collaborative Innovation Center for Genetics and Developmental Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, 200031, China
| | - Rivka Sukenik Halevy
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, 94158, USA.,Institute for Human Genetics, University of California San Francisco, San Francisco, 94143, USA.,Sackler School of Medicine, Tel-Aviv University, Tel Aviv, 6997801, Israel
| | - Picard Nguyen
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, 94158, USA.,Institute for Human Genetics, University of California San Francisco, San Francisco, 94143, USA
| | - Denghui Liu
- CAS Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Shanghai Institute of Nutrition and Health, Chinese Academy of Sciences Center for Excellence in Molecular Cell Science, Collaborative Innovation Center for Genetics and Developmental Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, 200031, China
| | - Xiaoli Zhang
- CAS Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Shanghai Institute of Nutrition and Health, Chinese Academy of Sciences Center for Excellence in Molecular Cell Science, Collaborative Innovation Center for Genetics and Developmental Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, 200031, China
| | - Nadav Ahituv
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, 94158, USA. .,Institute for Human Genetics, University of California San Francisco, San Francisco, 94143, USA.
| | - Jing-Dong J Han
- CAS Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Shanghai Institute of Nutrition and Health, Chinese Academy of Sciences Center for Excellence in Molecular Cell Science, Collaborative Innovation Center for Genetics and Developmental Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, 200031, China. .,Peking-Tsinghua Center for Life Sciences, Academy for Advanced Interdisciplinary Studies, Center for Quantitative Biology, Peking University, Beijing, 100871, China.
| |
Collapse
|
13
|
Mustafin RN, Khusnutdinova EK. Prospects in the Search for Peptides for Specific Regulation of Aging. ADVANCES IN GERONTOLOGY 2019. [DOI: 10.1134/s2079057019020176] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
|
14
|
Traldi JB, Lui RL, Martinez JDF, Vicari MR, Nogaroto V, Moreira Filho O, Blanco DR. Chromosomal distribution of the retroelements Rex 1, Rex 3 and Rex 6 in species of the genus Harttia and Hypostomus (Siluriformes: Loricariidae). NEOTROPICAL ICHTHYOLOGY 2019. [DOI: 10.1590/1982-0224-20190010] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
ABSTRACT The transposable elements (TE) have been widely applied as physical chromosome markers. However, in Loricariidae there are few physical mapping analyses of these elements. Considering the importance of transposable elements for chromosomal evolution and genome organization, this study conducted the physical chromosome mapping of retroelements (RTEs) Rex1, Rex3 and Rex6 in seven species of the genus Harttia and four species of the genus Hypostomus, aiming to better understand the organization and dynamics of genomes of Loricariidae species. The results showed an intense accumulation of RTEs Rex1, Rex3 and Rex6 and dispersed distribution in heterochromatic and euchromatic regions in the genomes of the species studied here. The presence of retroelements in some chromosomal regions suggests their participation in various chromosomal rearrangements. In addition, the intense accumulation of three retroelements in all species of Harttia and Hypostomus, especially in euchromatic regions, can indicate the participation of these elements in the diversification and evolution of these species through the molecular domestication by genomes of hosts, with these sequences being a co-option for new functions.
Collapse
|
15
|
Mustafin RN. The Relationship between Transposons and Transcription Factors in the Evolution of Eukaryotes. J EVOL BIOCHEM PHYS+ 2019. [DOI: 10.1134/s0022093019010022] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
16
|
Mustafin RN, Khusnutdinova EK. The Role of Transposable Elements in Emergence of Metazoa. BIOCHEMISTRY (MOSCOW) 2018; 83:185-199. [PMID: 29625540 DOI: 10.1134/s000629791803001x] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
Abstract
Systems initially emerged for protecting genomes against insertions of transposable elements and represented by mechanisms of splicing regulation, RNA-interference, and epigenetic factors have played a key role in the evolution of animals. Many studies have shown inherited transpositions of mobile elements in embryogenesis and preservation of their activities in certain tissues of adult organisms. It was supposed that on the emergence of Metazoa the self-regulation mechanisms of transposons related with the gene networks controlling their activity could be involved in intercellular cell coordination in the cascade of successive divisions with differentiated gene expression for generation of tissues and organs. It was supposed that during evolution species-specific features of transposons in the genomes of eukaryotes could form the basis for creation of dynamically related complexes of systems for epigenetic regulation of gene expression. These complexes could be produced due to the influence of noncoding transposon-derived RNAs on DNA methylation, histone modifications, and processing of alternative splicing variants, whereas the mobile elements themselves could be directly involved in the regulation of gene expression in cis and in trans. Transposons are widely distributed in the genomes of eukaryotes; therefore, their activation can change the expression of specific genes. In turn, this can play an important role in cell differentiation during ontogenesis. It is supposed that transposons can form a species-specific pattern for control of gene expression, and that some variants of this pattern can be favorable for adaptation. The presented data indicate the possible influence of transposons in karyotype formation. It is supposed that transposon localization relative to one another and to protein-coding genes can influence the species-specific epigenetic regulation of ontogenesis.
Collapse
|
17
|
Mustafin RN, Khusnutdinova EK. The Role of Transposons in Epigenetic Regulation of Ontogenesis. Russ J Dev Biol 2018. [DOI: 10.1134/s1062360418020066] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
18
|
Loreto ELS, Deprá M, Diesel JF, Panzera Y, Valente-Gaiesky VLS. Drosophila relics hobo and hobo-MITEs transposons as raw material for new regulatory networks. Genet Mol Biol 2018; 41:198-205. [PMID: 29668013 PMCID: PMC5913719 DOI: 10.1590/1678-4685-gmb-2017-0068] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2017] [Accepted: 08/01/2017] [Indexed: 12/22/2022] Open
Abstract
Hypermutable strains of Drosophila simulans have been studied
for 20 years. Several mutants were isolated and characterized, some of which had
phenotypes associated with alteration in development; for example, showing
ectopic legs with eyes being expressed in place of antennae. The causal agent of
this hypermutability is a non-autonomous hobo-related sequence
(hoboVA). Around 100 mobilizable copies of this element are
present in the D. simulans genome, and these are likely
mobilized by the autonomous and canonical hobo element. We have
shown that hoboVA has transcription factor binding sites for
the developmental genes, hunchback and
even-skipped, and that this transposon is expressed in
embryos, following the patterns of these genes. We suggest that
hobo and hobo-related elements can be
material for the emergence of new regulatory networks.
Collapse
Affiliation(s)
- Elgion L S Loreto
- Programa de Pós-Graduação em Genética e Biologia Molecular, Universidade Federal do Rio Grande do Sul (UFRGS), Porto Alegre, RS, Brazil.,Departamento de Bioquímica e Biologia Molecular (CCNE), Universidade Federal de Santa Maria (UFSM), Santa Maria, RS, Brazil
| | - Maríndia Deprá
- Departamento de Genética, Universidade Federal do Rio Grande do Sul (UFRGS), Porto Alegre, RS, Brazil
| | - José F Diesel
- Departamento de Bioquímica e Biologia Molecular (CCNE), Universidade Federal de Santa Maria (UFSM), Santa Maria, RS, Brazil
| | - Yanina Panzera
- Departamento de Genetica, Universidad de la República de Uruguay (UDELAR), Montevideo, Uruguay
| | - Vera Lucia S Valente-Gaiesky
- Programa de Pós-Graduação em Genética e Biologia Molecular, Universidade Federal do Rio Grande do Sul (UFRGS), Porto Alegre, RS, Brazil.,Departamento de Genética, Universidade Federal do Rio Grande do Sul (UFRGS), Porto Alegre, RS, Brazil
| |
Collapse
|
19
|
The Genes of Life and Death: A Potential Role for Placental-Specific Genes in Cancer. Bioessays 2017; 39. [DOI: 10.1002/bies.201700091] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2017] [Revised: 08/20/2017] [Indexed: 12/17/2022]
|
20
|
Janoušek V, Laukaitis CM, Yanchukov A, Karn RC. The Role of Retrotransposons in Gene Family Expansions in the Human and Mouse Genomes. Genome Biol Evol 2016; 8:2632-50. [PMID: 27503295 PMCID: PMC5631067 DOI: 10.1093/gbe/evw192] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022] Open
Abstract
Retrotransposons comprise a large portion of mammalian genomes. They contribute to structural changes and more importantly to gene regulation. The expansion and diversification of gene families have been implicated as sources of evolutionary novelties. Given the roles retrotransposons play in genomes, their contribution to the evolution of gene families warrants further exploration. In this study, we found a significant association between two major retrotransposon classes, LINEs and LTRs, and lineage-specific gene family expansions in both the human and mouse genomes. The distribution and diversity differ between LINEs and LTRs, suggesting that each has a distinct involvement in gene family expansion. LTRs are associated with open chromatin sites surrounding the gene families, supporting their involvement in gene regulation, whereas LINEs may play a structural role promoting gene duplication. Our findings also suggest that gene family expansions, especially in the mouse genome, undergo two phases. The first phase is characterized by elevated deposition of LTRs and their utilization in reshaping gene regulatory networks. The second phase is characterized by rapid gene family expansion due to continuous accumulation of LINEs and it appears that, in some instances at least, this could become a runaway process. We provide an example in which this has happened and we present a simulation supporting the possibility of the runaway process. Altogether we provide evidence of the contribution of retrotransposons to the expansion and evolution of gene families. Our findings emphasize the putative importance of these elements in diversification and adaptation in the human and mouse lineages.
Collapse
Affiliation(s)
- Václav Janoušek
- Department of Zoology, Faculty of Science, Charles University in Prague, Prague, Czech Republic Institute of Vertebrate Biology, ASCR, Brno, Czech Republic
| | | | - Alexey Yanchukov
- Institute of Vertebrate Biology, ASCR, Brno, Czech Republic Department of Biology, Faculty of Arts and Sciences, Bülent Ecevit University, Zonguldak, Turkey
| | - Robert C Karn
- Department of Medicine, College of Medicine, University of Arizona
| |
Collapse
|
21
|
Ellison CE, Bachtrog D. Non-allelic gene conversion enables rapid evolutionary change at multiple regulatory sites encoded by transposable elements. eLife 2015; 4. [PMID: 25688566 PMCID: PMC4384637 DOI: 10.7554/elife.05899] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2014] [Accepted: 02/16/2015] [Indexed: 12/02/2022] Open
Abstract
Transposable elements (TEs) allow rewiring of regulatory networks, and the recent amplification of the ISX element dispersed 77 functional but suboptimal binding sites for the dosage compensation complex to a newly formed X chromosome in Drosophila. Here we identify two linked refining mutations within ISX that interact epistatically to increase binding affinity to the dosage compensation complex. Selection has increased the frequency of this derived haplotype in the population, which is fixed at 30% of ISX insertions and polymorphic among another 41%. Sharing of this haplotype indicates that high levels of gene conversion among ISX elements allow them to ‘crowd-source’ refining mutations, and a refining mutation that occurs at any single ISX element can spread in two dimensions: horizontally across insertion sites by non-allelic gene conversion, and vertically through the population by natural selection. These results describe a novel route by which fully functional regulatory elements can arise rapidly from TEs and implicate non-allelic gene conversion as having an important role in accelerating the evolutionary fine-tuning of regulatory networks. DOI:http://dx.doi.org/10.7554/eLife.05899.001 Mutations change genes and provide the raw material for evolution. Genes are sections of DNA that contain the instructions for making proteins or other molecules, and so determine the physical characteristics of each organism. Genetic mutations that increase an organism's number of offspring and chances of survival are more likely to be passed on to future generations. Changes to when or where a gene is switched on (so-called regulatory mutations) can also provide fitness benefits and can therefore be selected for during evolution. Transposable elements are sequences of DNA that are also called ‘jumping genes’ because they can make copies of themselves and these copies of the transposable element can move to other locations in the genome. Some transposable elements contain sequences that switch on nearby genes. If different copies of a transposable element that contains such a regulatory sequence insert themselves in more than one place, it can result in a network of genes that can all be controlled in the same way. The regulatory sequences contained within transposable elements are not always optimal, but they can be fine-tuned through evolution. A fruit fly called Drosophila miranda has a transposable element called ISX that has, over time, placed up to 77 regulatory sequences around one of this species' sex chromosomes. Just as in humans, female flies are XX and males are XY; but having only one copy of the X chromosome means that male flies need to increase the expression of certain genes to produce a full-dose of the molecules made by the genes. This process is called dosage compensation and in 2013 the 77 ISX regulatory sequences on the fruit fly's X chromosome were shown to help recruit the molecular machinery that carries out dosage compensation to nearby genes, albeit inefficiently. Now Ellison and Bachtrog—who also conducted the 2013 study—report how these transposable elements have been fine-tuned to make them more effective for dosage compensation. Ellison and Bachtrog uncovered two mutations that make the ISX transposable element better at recruiting the dosage compensation molecular machinery. ISX spread around different locations along the fly's X chromosome before these mutations arose; this means that initially none of the 77 insertions carried the two mutations, but now 30% of the 77 elements have the mutations in all flies, and 41% have them in only some flies. The same mutations have spread between the different ISX elements because transposable elements with the mutations have been used to directly convert other ISX elements without them. These mutations have also become more common in the fruit fly population by being passed on to offspring and increasing their survival. These two routes have accelerated the fine-tuning of these transposable elements for use in gene regulation. This implies that regulatory sequences derived from transposable elements evolve in a way that is fundamentally different from those that arise by other means, as the direct conversion between these insertions allows fine-tuning mutations to spread more rapidly. DOI:http://dx.doi.org/10.7554/eLife.05899.002
Collapse
Affiliation(s)
- Christopher E Ellison
- Department of Integrative Biology, University of California, Berkeley, Berkeley, United States
| | - Doris Bachtrog
- Department of Integrative Biology, University of California, Berkeley, Berkeley, United States
| |
Collapse
|
22
|
Hellen EHB, Kern AD. The role of DNA insertions in phenotypic differentiation between humans and other primates. Genome Biol Evol 2015; 7:1168-78. [PMID: 25635043 PMCID: PMC4419785 DOI: 10.1093/gbe/evv012] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022] Open
Abstract
What makes us human is one of the most interesting and enduring questions in evolutionary biology. To assist in answering this question, we have identified insertions in the human genome which cannot be found in five comparison primate species: Chimpanzee, gorilla, orangutan, gibbon, and macaque. A total of 21,269 nonpolymorphic human-specific insertions were identified, of which only 372 were found in exons. Any function conferred by the remaining 20,897 is likely to be regulatory. Many of these insertions are likely to have been fitness neutral; however, a small number has been identified in genes showing signs of positive selection. Insertions found within positively selected genes show associations to neural phenotypes, which were also enriched in the whole data set. Other phenotypes that are found to be enriched in the data set include dental and sensory perception-related phenotypes, features which are known to differ between humans and other apes. The analysis provides several likely candidates, either genes or regulatory regions, which may be involved in the processes that differentiate humans from other apes.
Collapse
Affiliation(s)
| | - Andrew D Kern
- Department of Genetics, Nelson Biolabs, Piscataway, NJ, USA
| |
Collapse
|
23
|
Sundaram V, Cheng Y, Ma Z, Li D, Xing X, Edge P, Snyder MP, Wang T. Widespread contribution of transposable elements to the innovation of gene regulatory networks. Genome Res 2014; 24:1963-76. [PMID: 25319995 PMCID: PMC4248313 DOI: 10.1101/gr.168872.113] [Citation(s) in RCA: 300] [Impact Index Per Article: 30.0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Abstract
Transposable elements (TEs) have been shown to contain functional binding sites for certain transcription factors (TFs). However, the extent to which TEs contribute to the evolution of TF binding sites is not well known. We comprehensively mapped binding sites for 26 pairs of orthologous TFs in two pairs of human and mouse cell lines (representing two cell lineages), along with epigenomic profiles, including DNA methylation and six histone modifications. Overall, we found that 20% of binding sites were embedded within TEs. This number varied across different TFs, ranging from 2% to 40%. We further identified 710 TF–TE relationships in which genomic copies of a TE subfamily contributed a significant number of binding peaks for a TF, and we found that LTR elements dominated these relationships in human. Importantly, TE-derived binding peaks were strongly associated with open and active chromatin signatures, including reduced DNA methylation and increased enhancer-associated histone marks. On average, 66% of TE-derived binding events were cell type-specific with a cell type-specific epigenetic landscape. Most of the binding sites contributed by TEs were species-specific, but we also identified binding sites conserved between human and mouse, the functional relevance of which was supported by a signature of purifying selection on DNA sequences of these TEs. Interestingly, several TFs had significantly expanded binding site landscapes only in one species, which were linked to species-specific gene functions, suggesting that TEs are an important driving force for regulatory innovation. Taken together, our data suggest that TEs have significantly and continuously shaped gene regulatory networks during mammalian evolution.
Collapse
Affiliation(s)
- Vasavi Sundaram
- Department of Genetics, Center for Genome Sciences and Systems Biology, Washington University School of Medicine, St. Louis, Missouri 63108, USA
| | - Yong Cheng
- Department of Genetics, Stanford University, Stanford, California 94305, USA
| | - Zhihai Ma
- Department of Genetics, Stanford University, Stanford, California 94305, USA
| | - Daofeng Li
- Department of Genetics, Center for Genome Sciences and Systems Biology, Washington University School of Medicine, St. Louis, Missouri 63108, USA
| | - Xiaoyun Xing
- Department of Genetics, Center for Genome Sciences and Systems Biology, Washington University School of Medicine, St. Louis, Missouri 63108, USA
| | - Peter Edge
- Department of Genetics, Cell Biology and Development, University of Minnesota, Minneapolis, Minnesota 55455, USA
| | - Michael P Snyder
- Department of Genetics, Stanford University, Stanford, California 94305, USA;
| | - Ting Wang
- Department of Genetics, Center for Genome Sciences and Systems Biology, Washington University School of Medicine, St. Louis, Missouri 63108, USA;
| |
Collapse
|
24
|
Ezer D, Zabet NR, Adryan B. Homotypic clusters of transcription factor binding sites: A model system for understanding the physical mechanics of gene expression. Comput Struct Biotechnol J 2014; 10:63-9. [PMID: 25349675 PMCID: PMC4204428 DOI: 10.1016/j.csbj.2014.07.005] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022] Open
Abstract
The organization of binding sites in cis-regulatory elements (CREs) can influence gene expression through a combination of physical mechanisms, ranging from direct interactions between TF molecules to DNA looping and transient chromatin interactions. The study of simple and common building blocks in promoters and other CREs allows us to dissect how all of these mechanisms work together. Many adjacent TF binding sites for the same TF species form homotypic clusters, and these CRE architecture building blocks serve as a prime candidate for understanding interacting transcriptional mechanisms. Homotypic clusters are prevalent in both bacterial and eukaryotic genomes, and are present in both promoters as well as more distal enhancer/silencer elements. Here, we review previous theoretical and experimental studies that show how the complexity (number of binding sites) and spatial organization (distance between sites and overall distance from transcription start sites) of homotypic clusters influence gene expression. In particular, we describe how homotypic clusters modulate the temporal dynamics of TF binding, a mechanism that can affect gene expression, but which has not yet been sufficiently characterized. We propose further experiments on homotypic clusters that would be useful in developing mechanistic models of gene expression.
Collapse
Affiliation(s)
- Daphne Ezer
- Cambridge Systems Biology Centre, University of Cambridge, Tennis Court Road, Cambridge CB2 1QR, UK
| | - Nicolae Radu Zabet
- Cambridge Systems Biology Centre, University of Cambridge, Tennis Court Road, Cambridge CB2 1QR, UK
| | - Boris Adryan
- Cambridge Systems Biology Centre, University of Cambridge, Tennis Court Road, Cambridge CB2 1QR, UK
| |
Collapse
|
25
|
Jjingo D, Conley AB, Wang J, Mariño-Ramírez L, Lunyak VV, Jordan IK. Mammalian-wide interspersed repeat (MIR)-derived enhancers and the regulation of human gene expression. Mob DNA 2014; 5:14. [PMID: 25018785 PMCID: PMC4090950 DOI: 10.1186/1759-8753-5-14] [Citation(s) in RCA: 57] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2013] [Accepted: 04/10/2014] [Indexed: 11/26/2022] Open
Abstract
Background Mammalian-wide interspersed repeats (MIRs) are the most ancient family of transposable elements (TEs) in the human genome. The deep conservation of MIRs initially suggested the possibility that they had been exapted to play functional roles for their host genomes. MIRs also happen to be the only TEs whose presence in-and-around human genes is positively correlated to tissue-specific gene expression. Similar associations of enhancer prevalence within genes and tissue-specific expression, along with MIRs’ previous implication as providing regulatory sequences, suggested a possible link between MIRs and enhancers. Results To test the possibility that MIRs contribute functional enhancers to the human genome, we evaluated the relationship between MIRs and human tissue-specific enhancers in terms of genomic location, chromatin environment, regulatory function, and mechanistic attributes. This analysis revealed MIRs to be highly concentrated in enhancers of the K562 and HeLa human cell-types. Significantly more enhancers were found to be linked to MIRs than would be expected by chance, and putative MIR-derived enhancers are characterized by a chromatin environment highly similar to that of canonical enhancers. MIR-derived enhancers show strong associations with gene expression levels, tissue-specific gene expression and tissue-specific cellular functions, including a number of biological processes related to erythropoiesis. MIR-derived enhancers were found to be a rich source of transcription factor binding sites, underscoring one possible mechanistic route for the element sequences co-option as enhancers. There is also tentative evidence to suggest that MIR-enhancer function is related to the transcriptional activity of non-coding RNAs. Conclusions Taken together, these data reveal enhancers to be an important cis-regulatory platform from which MIRs can exercise a regulatory function in the human genome and help to resolve a long-standing conundrum as to the reason for MIRs’ deep evolutionary conservation.
Collapse
Affiliation(s)
- Daudi Jjingo
- School of Biology, Georgia Institute of Technology, Atlanta, GA, USA
| | - Andrew B Conley
- School of Biology, Georgia Institute of Technology, Atlanta, GA, USA
| | - Jianrong Wang
- School of Biology, Georgia Institute of Technology, Atlanta, GA, USA
| | - Leonardo Mariño-Ramírez
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA ; PanAmerican Bioinformatics Institute, Santa Marta, Magdalena, Colombia
| | - Victoria V Lunyak
- PanAmerican Bioinformatics Institute, Santa Marta, Magdalena, Colombia ; Buck Institute for Research on Aging, Novato, CA, USA
| | - I King Jordan
- School of Biology, Georgia Institute of Technology, Atlanta, GA, USA ; PanAmerican Bioinformatics Institute, Santa Marta, Magdalena, Colombia
| |
Collapse
|
26
|
DNA hypomethylation within specific transposable element families associates with tissue-specific enhancer landscape. Nat Genet 2013; 45:836-41. [PMID: 23708189 PMCID: PMC3695047 DOI: 10.1038/ng.2649] [Citation(s) in RCA: 162] [Impact Index Per Article: 14.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2012] [Accepted: 05/02/2013] [Indexed: 12/11/2022]
Abstract
Transposable element (TE) derived sequences comprise half of our genome and DNA methylome, and are presumed densely methylated and inactive. Examination of the genome-wide DNA methylation status within 928 TE subfamilies in human embryonic and adult tissues revealed unexpected tissue-specific and subfamily-specific hypomethylation signatures. Genes proximal to tissue-specific hypomethylated TE sequences were enriched for functions important for the tissue type and their expression correlated strongly with hypomethylation of the TEs. When hypomethylated, these TE sequences gained tissue-specific enhancer marks including H3K4me1 and occupancy by p300, and a majority exhibited enhancer activity in reporter gene assays. Many such TEs also harbored binding sites for transcription factors that are important for tissue-specific functions and exhibited evidence for evolutionary selection. These data suggest that sequences derived from TEs may be responsible for wiring tissue type-specific regulatory networks, and have acquired tissue-specific epigenetic regulation.
Collapse
|
27
|
Abstract
What good are transposable elements (TEs)? Although their activity can be harmful to host genomes and can cause disease, they nevertheless represent an important source of genetic variation that has helped shape genomes. In this review, we examine the impact of TEs, collectively referred to as the mobilome, on the transcriptome. We explore how TEs—particularly retrotransposons—contribute to transcript diversity and consider their potential significance as a source of small RNAs that regulate host gene transcription. We also discuss a critical role for the mobilome in engineering transcriptional networks, permitting coordinated gene expression, and facilitating the evolution of novel physiological processes.
Collapse
Affiliation(s)
- Michael Cowley
- Department of Medical & Molecular Genetics, King's College London, London, United Kingdom
| | - Rebecca J. Oakey
- Department of Medical & Molecular Genetics, King's College London, London, United Kingdom
- * E-mail:
| |
Collapse
|
28
|
Micale L, Loviglio MN, Manzoni M, Fusco C, Augello B, Migliavacca E, Cotugno G, Monti E, Borsani G, Reymond A, Merla G. A fish-specific transposable element shapes the repertoire of p53 target genes in zebrafish. PLoS One 2012; 7:e46642. [PMID: 23118857 PMCID: PMC3485254 DOI: 10.1371/journal.pone.0046642] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2012] [Accepted: 09/03/2012] [Indexed: 12/04/2022] Open
Abstract
Transposable elements, as major components of most eukaryotic organisms' genomes, define their structural organization and plasticity. They supply host genomes with functional elements, for example, binding sites of the pleiotropic master transcription factor p53 were identified in LINE1, Alu and LTR repeats in the human genome. Similarly, in this report we reveal the role of zebrafish (Danio rerio) EnSpmN6_DR non-autonomous DNA transposon in shaping the repertoire of the p53 target genes. The multiple copies of EnSpmN6_DR and their embedded p53 responsive elements drive in several instances p53-dependent transcriptional modulation of the adjacent gene, whose human orthologs were frequently previously annotated as p53 targets. These transposons define predominantly a set of target genes whose human orthologs contribute to neuronal morphogenesis, axonogenesis, synaptic transmission and the regulation of programmed cell death. Consistent with these biological functions the orthologs of the EnSpmN6_DR-colonized loci are enriched for genes expressed in the amygdala, the hippocampus and the brain cortex. Our data pinpoint a remarkable example of convergent evolution: the exaptation of lineage-specific transposons to shape p53-regulated neuronal morphogenesis-related pathways in both a hominid and a teleost fish.
Collapse
Affiliation(s)
- Lucia Micale
- Medical Genetics Unit, IRCCS “Casa Sollievo della Sofferenza”, San Giovanni Rotondo, Italy
| | - Maria Nicla Loviglio
- Medical Genetics Unit, IRCCS “Casa Sollievo della Sofferenza”, San Giovanni Rotondo, Italy
- Center for Integrative Genomics, University of Lausanne, Lausanne, Switzerland
| | - Marta Manzoni
- Department of Biomedical Science and Biotechnology, University of Brescia, Brescia, Italy
| | - Carmela Fusco
- Medical Genetics Unit, IRCCS “Casa Sollievo della Sofferenza”, San Giovanni Rotondo, Italy
| | - Bartolomeo Augello
- Medical Genetics Unit, IRCCS “Casa Sollievo della Sofferenza”, San Giovanni Rotondo, Italy
| | - Eugenia Migliavacca
- Center for Integrative Genomics, University of Lausanne, Lausanne, Switzerland
| | - Grazia Cotugno
- Medical Genetics Unit, IRCCS “Casa Sollievo della Sofferenza”, San Giovanni Rotondo, Italy
| | - Eugenio Monti
- Department of Biomedical Science and Biotechnology, University of Brescia, Brescia, Italy
| | - Giuseppe Borsani
- Department of Biomedical Science and Biotechnology, University of Brescia, Brescia, Italy
| | - Alexandre Reymond
- Center for Integrative Genomics, University of Lausanne, Lausanne, Switzerland
| | - Giuseppe Merla
- Medical Genetics Unit, IRCCS “Casa Sollievo della Sofferenza”, San Giovanni Rotondo, Italy
| |
Collapse
|
29
|
Rebollo R, Horard B, Begeot F, Delattre M, Gilson E, Vieira C. A snapshot of histone modifications within transposable elements in Drosophila wild type strains. PLoS One 2012; 7:e44253. [PMID: 22962605 PMCID: PMC3433462 DOI: 10.1371/journal.pone.0044253] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2012] [Accepted: 07/31/2012] [Indexed: 12/19/2022] Open
Abstract
Transposable elements (TEs) are a major source of genetic variability in genomes, creating genetic novelty and driving genome evolution. Analysis of sequenced genomes has revealed considerable diversity in TE families, copy number, and localization between different, closely related species. For instance, although the twin species Drosophila melanogaster and D. simulans share the same TE families, they display different amounts of TEs. Furthermore, previous analyses of wild type derived strains of D. simulans have revealed high polymorphism regarding TE copy number within this species. Several factors may influence the diversity and abundance of TEs in a genome, including molecular mechanisms such as epigenetic factors, which could be a source of variation in TE success. In this paper, we present the first analysis of the epigenetic status of four TE families (roo, tirant, 412 and F) in seven wild type strains of D. melanogaster and D. simulans. Our data shows intra- and inter-specific variations in the histone marks that adorn TE copies. Our results demonstrate that the chromatin state of common TEs varies among TE families, between closely related species and also between wild type strains.
Collapse
Affiliation(s)
- Rita Rebollo
- Laboratoire de Biométrie et Biologie Evolutive, UMR5558, CNRS, Université de Lyon, Université Lyon 1, Villeurbanne, France
| | - Béatrice Horard
- Institute of Research on Cancer and Aging, CNRS UMR7284/INSERM U1081/UNS Faculté de Médecine, Nice, France
| | - Flora Begeot
- Département de Génétique et Evolution, Université de Genève, Genève, Switzerland
| | - Marion Delattre
- Département de Génétique et Evolution, Université de Genève, Genève, Switzerland
| | - Eric Gilson
- Institute of Research on Cancer and Aging, CNRS UMR7284/INSERM U1081/UNS Faculté de Médecine, Nice, France
| | - Cristina Vieira
- Laboratoire de Biométrie et Biologie Evolutive, UMR5558, CNRS, Université de Lyon, Université Lyon 1, Villeurbanne, France
- * E-mail:
| |
Collapse
|
30
|
Lowe CB, Haussler D. 29 mammalian genomes reveal novel exaptations of mobile elements for likely regulatory functions in the human genome. PLoS One 2012; 7:e43128. [PMID: 22952639 PMCID: PMC3428314 DOI: 10.1371/journal.pone.0043128] [Citation(s) in RCA: 54] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2012] [Accepted: 07/17/2012] [Indexed: 11/18/2022] Open
Abstract
Recent research supports the view that changes in gene regulation, as opposed to changes in the genes themselves, play a significant role in morphological evolution. Gene regulation is largely dependent on transcription factor binding sites. Researchers are now able to use the available 29 mammalian genomes to measure selective constraint at the level of binding sites. This detailed map of constraint suggests that mammalian genomes co-opt fragments of mobile elements to act as gene regulatory sequence on a large scale. In the human genome we detect over 280,000 putative regulatory elements, totaling approximately 7 Mb of sequence, that originated as mobile element insertions. These putative regulatory regions are conserved non-exonic elements (CNEEs), which show considerable cross-species constraint and signatures of continued negative selection in humans, yet do not appear in a known mature transcript. These putative regulatory elements were co-opted from SINE, LINE, LTR and DNA transposon insertions. We demonstrate that at least 11%, and an estimated 20%, of gene regulatory sequence in the human genome showing cross-species conservation was co-opted from mobile elements. The location in the genome of CNEEs co-opted from mobile elements closely resembles that of CNEEs in general, except in the centers of the largest gene deserts where recognizable co-option events are relatively rare. We find that regions of certain mobile element insertions are more likely to be held under purifying selection than others. In particular, we show 6 examples where paralogous instances of an often co-opted mobile element region define a sequence motif that closely matches a transcription factor's binding profile.
Collapse
Affiliation(s)
- Craig B. Lowe
- Center for Biomolecular Science and Engineering, University of California Santa Cruz, Santa Cruz, California, United States of America
| | - David Haussler
- Center for Biomolecular Science and Engineering, University of California Santa Cruz, Santa Cruz, California, United States of America
- Howard Hughes Medical Institute, University of California Santa Cruz, Santa Cruz, California, United States of America
| |
Collapse
|
31
|
Fernández-Medina RD, Ribeiro JMC, Carareto CMA, Velasque L, Struchiner CJ. Losing identity: structural diversity of transposable elements belonging to different classes in the genome of Anopheles gambiae. BMC Genomics 2012; 13:272. [PMID: 22726298 PMCID: PMC3442997 DOI: 10.1186/1471-2164-13-272] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2012] [Accepted: 06/08/2012] [Indexed: 01/10/2023] Open
Abstract
Background Transposable elements (TEs), both DNA transposons and retrotransposons, are genetic elements with the main characteristic of being able to mobilize and amplify their own representation within genomes, utilizing different mechanisms of transposition. An almost universal feature of TEs in eukaryotic genomes is their inability to transpose by themselves, mainly as the result of sequence degeneration (by either mutations or deletions). Most of the elements are thus either inactive or non-autonomous. Considering that the bulk of some eukaryotic genomes derive from TEs, they have been conceived as “TE graveyards.” It has been shown that once an element has been inactivated, it progressively accumulates mutations and deletions at neutral rates until completely losing its identity or being lost from the host genome; however, it has also been shown that these “neutral sequences” might serve as raw material for domestication by host genomes. Results We have analyzed the sequence structural variations, nucleotide divergence, and pattern of insertions and deletions of several superfamilies of TEs belonging to both class I (long terminal repeats [LTRs] and non-LTRs [NLTRs]) and II in the genome of Anopheles gambiae, aiming at describing the landscape of deterioration of these elements in this particular genome. Our results describe a great diversity in patterns of deterioration, indicating lineage-specific differences including the presence of Solo-LTRs in the LTR lineage, 5′-deleted NLTRs, and several non-autonomous and MITEs in the class II families. Interestingly, we found fragments of NLTRs corresponding to the RT domain, which preserves high identity among them, suggesting a possible remaining genomic role for these domains. Conclusions We show here that the TEs in the An. gambiae genome deteriorate in different ways according to the class to which they belong. This diversity certainly has implications not only at the host genomic level but also at the amplification dynamic and evolution of the TE families themselves.
Collapse
Affiliation(s)
- Rita D Fernández-Medina
- Escola Nacional de Saúde Pública Sergio Arouca, Fundação Oswaldo Cruz, Rio de Janeiro, Brazil.
| | | | | | | | | |
Collapse
|
32
|
A model of genetic search for beneficial mutations: estimating the constructive capacities of mutagenesis. J Mol Evol 2012; 73:337-54. [PMID: 22212997 DOI: 10.1007/s00239-011-9482-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2011] [Accepted: 12/14/2011] [Indexed: 10/14/2022]
Abstract
We attempted to answer the following question: What evolutionary conditions are required to generate novel genetic modules? Our broad formulation of the problem allows us to simultaneously consider such issues as the relationship between the stage of "genetic search" and the rate of adaptive evolution; the theoretical limits to the generative capacities of spontaneous mutagenesis; and the correlation between genome organization and evolvability. We show that adaptive evolution is feasible only when the mutation rate is fine-tuned to a specific range of values and the structures of the genome and genes are optimized in a certain way. Our quantitative analysis has demonstrated that the rate of evolution of novelty depends on several parameters, such as genome size, the length of a module, the size of the adjacent nonfunctional DNA spacers, and the mutation rate at various genomic scales. We evaluated the efficiency of some mechanisms that increase evolvability: bias in the spectrum of mutation rates towards small mutations, and the availability and size of nonfunctional DNA spacers. We show that the probability of successful duplication and insertion of a copy of a functional module increases by several orders of magnitude depending on the length of the spacers flanking the module. We infer that the adaptive evolution of multicellular organisms has become feasible because of the abundance of nonfunctional DNA spacers, particularly introns, in the genome. We also discuss possible reasons underlying evolutionary retention of the mechanisms that increase evolvability.
Collapse
|
33
|
Vizirianakis IS, Tezias SS, Amanatiadou EP, Tsiftsoglou AS. Possible interaction between B1 retrotransposon-containing sequences and β majorglobin gene transcriptional activation during MEL cell erythroid differentiation. Cell Biol Int 2012; 36:47-55. [DOI: 10.1042/cbi20110236] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/30/2023]
|
34
|
Pandey R, Mandal AK, Jha V, Mukerji M. Heat shock factor binding in Alu repeats expands its involvement in stress through an antisense mechanism. Genome Biol 2011; 12:R117. [PMID: 22112862 PMCID: PMC3334603 DOI: 10.1186/gb-2011-12-11-r117] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2011] [Revised: 11/09/2011] [Accepted: 11/23/2011] [Indexed: 01/22/2023] Open
Abstract
Background Alu RNAs are present at elevated levels in stress conditions and, consequently, Alu repeats are increasingly being associated with the physiological stress response. Alu repeats are known to harbor transcription factor binding sites that modulate RNA pol II transcription and Alu RNAs act as transcriptional co-repressors through pol II binding in the promoter regions of heat shock responsive genes. An observation of a putative heat shock factor (HSF) binding site in Alu led us to explore whether, through HSF binding, these elements could further contribute to the heat shock response repertoire. Results Alu density was significantly enriched in transcripts that are down-regulated following heat shock recovery in HeLa cells. ChIP analysis confirmed HSF binding to a consensus motif exhibiting positional conservation across various Alu subfamilies, and reporter constructs demonstrated a sequence-specific two-fold induction of these sites in response to heat shock. These motifs were over-represented in the genic regions of down-regulated transcripts in antisense oriented Alus. Affymetrix Exon arrays detected antisense signals in a significant fraction of the down-regulated transcripts, 50% of which harbored HSF sites within 5 kb. siRNA knockdown of the selected antisense transcripts led to the over-expression, following heat shock, of their corresponding down-regulated transcripts. The antisense transcripts were significantly enriched in processes related to RNA pol III transcription and the TFIIIC complex. Conclusions We demonstrate a non-random presence of Alu repeats harboring HSF sites in heat shock responsive transcripts. This presence underlies an antisense-mediated mechanism that represents a novel component of Alu and HSF involvement in the heat shock response.
Collapse
Affiliation(s)
- Rajesh Pandey
- Genomics and Molecular Medicine, Institute of Genomics and Integrative Biology, Council of Scientific and Industrial Research (CSIR-IGIB), Delhi- India
| | | | | | | |
Collapse
|
35
|
Taher L, McGaughey DM, Maragh S, Aneas I, Bessling SL, Miller W, Nobrega MA, McCallion AS, Ovcharenko I. Genome-wide identification of conserved regulatory function in diverged sequences. Genome Res 2011; 21:1139-49. [PMID: 21628450 PMCID: PMC3129256 DOI: 10.1101/gr.119016.110] [Citation(s) in RCA: 59] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2010] [Accepted: 04/19/2011] [Indexed: 01/16/2023]
Abstract
Plasticity of gene regulatory encryption can permit DNA sequence divergence without loss of function. Functional information is preserved through conservation of the composition of transcription factor binding sites (TFBS) in a regulatory element. We have developed a method that can accurately identify pairs of functional noncoding orthologs at evolutionarily diverged loci by searching for conserved TFBS arrangements. With an estimated 5% false-positive rate (FPR) in approximately 3000 human and zebrafish syntenic loci, we detected approximately 300 pairs of diverged elements that are likely to share common ancestry and have similar regulatory activity. By analyzing a pool of experimentally validated human enhancers, we demonstrated that 7/8 (88%) of their predicted functional orthologs retained in vivo regulatory control. Moreover, in 5/7 (71%) of assayed enhancer pairs, we observed concordant expression patterns. We argue that TFBS composition is often necessary to retain and sufficient to predict regulatory function in the absence of overt sequence conservation, revealing an entire class of functionally conserved, evolutionarily diverged regulatory elements that we term "covert."
Collapse
Affiliation(s)
- Leila Taher
- Computational Biology Branch, National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland 20894, USA
| | - David M. McGaughey
- McKusick–Nathans Institute of Genetic Medicine, Department of Molecular and Comparative Pathobiology, The Johns Hopkins University School of Medicine, Baltimore, Maryland 21205, USA
| | - Samantha Maragh
- McKusick–Nathans Institute of Genetic Medicine, Department of Molecular and Comparative Pathobiology, The Johns Hopkins University School of Medicine, Baltimore, Maryland 21205, USA
- Biochemical Science Division, National Institute of Standards and Technology, Gaithersburg, Maryland 20899, USA
| | - Ivy Aneas
- Department of Human Genetics, University of Chicago, Chicago, Illinois 60637, USA
| | - Seneca L. Bessling
- McKusick–Nathans Institute of Genetic Medicine, Department of Molecular and Comparative Pathobiology, The Johns Hopkins University School of Medicine, Baltimore, Maryland 21205, USA
| | - Webb Miller
- Center for Comparative Genomics and Bioinformatics, Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - Marcelo A. Nobrega
- Department of Human Genetics, University of Chicago, Chicago, Illinois 60637, USA
| | - Andrew S. McCallion
- McKusick–Nathans Institute of Genetic Medicine, Department of Molecular and Comparative Pathobiology, The Johns Hopkins University School of Medicine, Baltimore, Maryland 21205, USA
| | - Ivan Ovcharenko
- Computational Biology Branch, National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland 20894, USA
| |
Collapse
|
36
|
Jjingo D, Huda A, Gundapuneni M, Mariño-Ramírez L, Jordan IK. Effect of the transposable element environment of human genes on gene length and expression. Genome Biol Evol 2011; 3:259-71. [PMID: 21362639 PMCID: PMC3070429 DOI: 10.1093/gbe/evr015] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
Abstract
Independent lines of investigation have documented effects of both transposable elements (TEs) and gene length (GL) on gene expression. However, TE gene fractions are highly correlated with GL, suggesting that they cannot be considered independently. We evaluated the TE environment of human genes and GL jointly in an attempt to tease apart their relative effects. TE gene fractions and GL were compared with the overall level of gene expression and the breadth of expression across tissues. GL is strongly correlated with overall expression level but weakly correlated with the breadth of expression, confirming the selection hypothesis that attributes the compactness of highly expressed genes to selection for economy of transcription. However, TE gene fractions overall, and for the L1 family in particular, show stronger anticorrelations with expression level than GL, indicating that GL may not be the most important target of selection for transcriptional economy. These results suggest a specific mechanism, removal of TEs, by which highly expressed genes are selectively tuned for efficiency. MIR elements are the only family of TEs with gene fractions that show a positive correlation with tissue-specific expression, suggesting that they may provide regulatory sequences that help to control human gene expression. Consistent with this notion, MIR fractions are relatively enriched close to transcription start sites and associated with coexpression in specific sets of related tissues. Our results confirm the overall relevance of the TE environment to gene expression and point to distinct mechanisms by which different TE families may contribute to gene regulation.
Collapse
Affiliation(s)
- Daudi Jjingo
- School of Biology, Georgia Institute of Technology, GA, USA
| | | | | | | | | |
Collapse
|
37
|
Védrine SM, Vourc'h P, Tabagh R, Mignon L, Höfflin S, Cherpi-Antar C, Mbarek O, Paubel A, Moraine C, Raynaud M, Andres CR. A functional tetranucleotide (AAAT) polymorphism in an Alu element in the NF1 gene is associated with mental retardation. Neurosci Lett 2011; 491:118-21. [PMID: 21236316 DOI: 10.1016/j.neulet.2011.01.019] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2010] [Revised: 12/30/2010] [Accepted: 01/06/2011] [Indexed: 11/29/2022]
Abstract
Mental retardation (MR) is frequent in neurofibromatosis type 1 (NF1). Allele 5 of a tetranucleotide polymorphism in an Alu element (GXAlu) localized in intron 27b of the NF1 gene has previously been associated with autism. We considered that the microsatellite GXAlu could also represent a risk factor in MR without autism. We developed a rapid method for genotyping by non-denaturing HPLC and assayed the allelic variation of GXAlu marker on in vitro gene expression in Cos-7 cells. A French population of 157 individuals (68 non syndromic non familial MR (NS-MR) patients diagnosed in the University Hospital of Tours; 89 controls) was tested in a case-control assay. We observed a significant association (χ(2)=7.96; p=0.005) between alu4 carriers (7 AAAT repeats) and MR (OR: 7.86; 95% C.I.: 2.13-28.9). The relative in vitro expression of a reporter gene encoding chloramphenicol acetyl transferase (CAT) was higher for alu4 and alu5, suggesting a regulation effect for these alleles on gene expression in vivo. Our results showed an association with a polymorphism regulating the NF1 gene or other genes during brain development.
Collapse
|
38
|
Huda A, Bowen NJ, Conley AB, Jordan IK. Epigenetic regulation of transposable element derived human gene promoters. Gene 2011; 475:39-48. [PMID: 21215797 DOI: 10.1016/j.gene.2010.12.010] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2010] [Accepted: 12/22/2010] [Indexed: 02/08/2023]
Abstract
It was previously thought that epigenetic histone modifications of mammalian transposable elements (TEs) serve primarily to defend the genome against deleterious effects associated with their activity. However, we recently showed that, genome-wide, human TEs can also be epigenetically modified in a manner consistent with their ability to regulate host genes. Here, we explore the ability of TE sequences to epigenetically regulate individual human genes by focusing on the histone modifications of promoter sequences derived from TEs. We found 1520 human genes that initiate transcription from within TE-derived promoter sequences. We evaluated the distributions of eight histone modifications across these TE-promoters, within and between the GM12878 and K562 cell lines, and related their modification status with the cell-type specific expression patterns of the genes that they regulate. TE-derived promoters are significantly enriched for active histone modifications, and depleted for repressive modifications, relative to the genomic background. Active histone modifications of TE-promoters peak at transcription start sites and are positively correlated with increasing expression within cell lines. Furthermore, differential modification of TE-derived promoters between cell lines is significantly correlated with differential gene expression. LTR-retrotransposon derived promoters in particular play a prominent role in mediating cell-type specific gene regulation, and a number of these LTR-promoter genes are implicated in lineage-specific cellular functions. The regulation of human genes mediated by histone modifications targeted to TE-derived promoters is consistent with the ability of TEs to contribute to the epigenomic landscape in a way that provides functional utility to the host genome.
Collapse
Affiliation(s)
- Ahsan Huda
- School of Biology, Georgia Institute of Technology, 310 Ferst Drive, Atlanta, GA 30332, USA.
| | | | | | | |
Collapse
|
39
|
Transcription factor binding sites and other features in human and Drosophila proximal promoters. Subcell Biochem 2011; 52:205-22. [PMID: 21557085 DOI: 10.1007/978-90-481-9069-0_10] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
Eukaryotic promoters determine transcription start sites (TSSs), and are often enriched for transcription factor binding sites (TFBSs), which presumably play a major role in determining the location and activity of the TSS. In mammalian systems, proximal promoters are enriched for the CpG dinucleotide. The TFBSs that are enriched in proximal promoters (-200 bps to the TSS) are CCAAT, ETS, NRF1, SP1, E-Box, CRE, BoxA, and TATA. Only TATA occurs in a DNA strand dependent manner. In Drosophila, proximal promoters are AT rich and many putative TFBSs are enriched in proximal promoters. These sequences are different from those that occur in human promoters, except for TATA and E-Box, and many occur on a single strand of DNA giving directionality to the promoter. Thus, fundamental differences have arisen as promoters evolved in metazoans.
Collapse
|
40
|
Ghedira K, Hornischer K, Konovalova T, Jenhani AZ, Benkahla A, Kel A. Identification of key mechanisms controlling gene expression in Leishmania infected macrophages using genome-wide promoter analysis. INFECTION GENETICS AND EVOLUTION 2010; 11:769-77. [PMID: 21093613 DOI: 10.1016/j.meegid.2010.10.015] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/25/2010] [Revised: 10/18/2010] [Accepted: 10/19/2010] [Indexed: 01/15/2023]
Abstract
The present study describes the in silico prediction of the regulatory network of Leishmania infected human macrophages. The construction of the gene regulatory network requires the identification of Transcription Factor Binding Sites (TFBSs) in the regulatory regions (promoters, enhancers) of genes that are regulated upon Leishmania infection. The promoters of human, mouse, rat, dog and chimpanzee genes were first identified in the whole genomes using available experimental data on full length cDNA sequences or deep CAGE tag data (DBTSS, FANTOM3, FANTOM4), mRNA models (ENSEMBL), or using hand annotated data (EPD, TRANSFAC). A phylogenetic footprinting analysis and a MATCH analysis of the promoter sequences were then performed to predict TFBS. Then, an SQL database that integrates all results of promoter analysis as well as other genome annotation information obtained from ENSEMBL, TRANSFAC, TRED (Transcription Regulatory Element Database), ORegAnno and the ENCODE project, was established. Finally publicly available expression data from human Leishmania infected macrophages were analyzed using the genome-wide information on predicted TFBS with the computer system ExPlain™. The gene regulatory network was constructed and activated signal transduction pathways were revealed. The Irak1 pathway was identified as a key pathway regulating gene expression changes in Leishmania infected macrophages.
Collapse
Affiliation(s)
- Kais Ghedira
- Laboratory of Immunology, Vaccinology, and Molecular Genetics, Institut Pasteur de Tunis, 13 place Pasteur BP 74, Tunis, Tunisia
| | | | | | | | | | | |
Collapse
|
41
|
Paquet Y, Anderson A. Sequence composition similarities with the 7SL RNA are highly predictive of functional genomic features. Nucleic Acids Res 2010; 38:4907-16. [PMID: 20392819 PMCID: PMC2926601 DOI: 10.1093/nar/gkq234] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/02/2022] Open
Abstract
Transposable elements derived from the 7SL RNA gene, such as Alu elements in primates, have had remarkable success in several mammalian lineages. The results presented here show a broad spectrum of functions for genomic segments that display sequence composition similarities with the 7SL RNA gene. Using thoroughly documented loci, we report that DNaseI-hypersensitive sites can be singled out in large genomic sequences by an assessment of sequence composition similarities with the 7SL RNA gene. We apply a root word frequency approach to illustrate a distinctive relationship between the sequence of the 7SL RNA gene and several classes of functional genomic features that are not presumed to be of transposable origin. Transposable elements that show noticeable similarities with the 7SL sequence include Alu sequences, as expected, but also long terminal repeats and the 5′-untranslated regions of long interspersed repetitive elements. In sequences masked for repeated elements, we find, when using the 7SL RNA gene as query sequence, distinctive similarities with promoters, exons and distal gene regulatory regions. The latter being the most notoriously difficult to detect, this approach may be useful for finding genomic segments that have regulatory functions and that may have escaped detection by existing methods.
Collapse
Affiliation(s)
- Yanick Paquet
- Centre de recherche en cancérologie de l’Université Laval, L’Hôtel-Dieu de Québec, Centre hospitalier universitaire de Québec, Québec G1R 2J6 and Département de biologie, Université Laval, Québec G1K 7P4, Canada
| | - Alan Anderson
- Centre de recherche en cancérologie de l’Université Laval, L’Hôtel-Dieu de Québec, Centre hospitalier universitaire de Québec, Québec G1R 2J6 and Département de biologie, Université Laval, Québec G1K 7P4, Canada
- *To whom correspondence should be addressed. Tel: + 418 691 5281; Fax: +418 691 5439;
| |
Collapse
|
42
|
Kumar RP, Senthilkumar R, Singh V, Mishra RK. Repeat performance: how do genome packaging and regulation depend on simple sequence repeats? Bioessays 2010; 32:165-74. [PMID: 20091758 DOI: 10.1002/bies.200900111] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Abstract
Non-coding DNA has consistently increased during evolution of higher eukaryotes. Since the number of genes has remained relatively static during the evolution of complex organisms, it is believed that increased degree of sophisticated regulation of genes has contributed to the increased complexity. A higher proportion of non-coding DNA, including repeats, is likely to provide more complex regulatory potential. Here, we propose that repeats play a regulatory role by contributing to the packaging of the genome during cellular differentiation. Repeats, and in particular the simple sequence repeats, are proposed to serve as landmarks that can target regulatory mechanisms to a large number of genomic sites with the help of very few factors and regulate the linked loci in a coordinated manner. Repeats may, therefore, function as common target sites for regulatory mechanisms involved in the packaging and dynamic compartmentalization of the chromatin into active and inactive regions during cellular differentiation.
Collapse
Affiliation(s)
- Ram Parikshan Kumar
- Centre for Cellular and Molecular Biology, Uppal Road, Hyderabad 500 007, India
| | | | | | | |
Collapse
|
43
|
Rebollo R, Horard B, Hubert B, Vieira C. Jumping genes and epigenetics: Towards new species. Gene 2010; 454:1-7. [DOI: 10.1016/j.gene.2010.01.003] [Citation(s) in RCA: 87] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2009] [Revised: 01/06/2010] [Accepted: 01/19/2010] [Indexed: 01/13/2023]
|
44
|
Taube JH, Allton K, Duncan SA, Shen L, Barton MC. Foxa1 functions as a pioneer transcription factor at transposable elements to activate Afp during differentiation of embryonic stem cells. J Biol Chem 2010; 285:16135-44. [PMID: 20348100 DOI: 10.1074/jbc.m109.088096] [Citation(s) in RCA: 57] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022] Open
Abstract
Epigenetic control of genes that are silent in embryonic stem cells, but destined for expression during differentiation, includes distinctive hallmarks, such as simultaneous activating/repressing (bivalent) modifications of chromatin and DNA hypomethylation at enhancers of gene expression. Although alpha-fetoprotein (Afp) falls into this class of genes, as it is silent in pluripotent stem cells and activated during differentiation of endoderm, we find that Afp chromatin lacks bivalent histone modifications. However, critical regulatory sites for Afp activation, overlapping Foxa1/p53/Smad-binding elements, are located within a 300-bp region lacking DNA methylation, due to transposed elements underrepresented in CpG sequences: a short interspersed transposable element and a medium reiterated sequence 1 element. Forkhead family member Foxa1 is activated by retinoic acid treatment of embryonic stem cells, binds its DNA consensus site within the short interspersed transposable/medium reiterated sequence 1 elements, and displaces linker histone H1 from silent Afp chromatin. Small interfering RNA depletion of Foxa1 showed that Foxa1 is essential in providing chromatin access to transforming growth factor beta-activated Smad2 and Smad4 and their subsequent DNA binding. Together these transcription factors establish highly acetylated chromatin and promote expression of Afp. Foxa1 acts as a pioneer transcription factor in de novo activation of Afp, by exploiting a lack of methylation at juxtaposed transposed elements, to bind and poise chromatin for intersection with transforming growth factor beta signaling during differentiation of embryonic stem cells.
Collapse
Affiliation(s)
- Joseph H Taube
- Department of Biochemistry and Molecular Biology, Graduate School of Biomedical Sciences, Center for Stem Cell and Developmental Biology, University of Texas MD Anderson Cancer Center, Houston, Texas 77030, USA
| | | | | | | | | |
Collapse
|
45
|
Developmental diseases and the hypothetical Master Development Program. Med Hypotheses 2010; 74:564-73. [DOI: 10.1016/j.mehy.2009.09.035] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2009] [Accepted: 09/17/2009] [Indexed: 11/24/2022]
|
46
|
Unique functions of repetitive transcriptomes. INTERNATIONAL REVIEW OF CELL AND MOLECULAR BIOLOGY 2010; 285:115-88. [PMID: 21035099 DOI: 10.1016/b978-0-12-381047-2.00003-7] [Citation(s) in RCA: 42] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Abstract
Repetitive sequences occupy a huge fraction of essentially every eukaryotic genome. Repetitive sequences cover more than 50% of mammalian genomic DNAs, whereas gene exons and protein-coding sequences occupy only ~3% and 1%, respectively. Numerous genomic repeats include genes themselves. They generally encode "selfish" proteins necessary for the proliferation of transposable elements (TEs) in the host genome. The major part of evolutionary "older" TEs accumulated mutations over time and fails to encode functional proteins. However, repeats have important functions also on the RNA level. Repetitive transcripts may serve as multifunctional RNAs by participating in the antisense regulation of gene activity and by competing with the host-encoded transcripts for cellular factors. In addition, genomic repeats include regulatory sequences like promoters, enhancers, splice sites, polyadenylation signals, and insulators, which actively reshape cellular transcriptomes. TE expression is tightly controlled by the host cells, and some mechanisms of this regulation were recently decoded. Finally, capacity of TEs to proliferate in the host genome led to the development of multiple biotechnological applications.
Collapse
|
47
|
Gogvadze E, Buzdin A. Retroelements and their impact on genome evolution and functioning. Cell Mol Life Sci 2009; 66:3727-42. [PMID: 19649766 PMCID: PMC11115525 DOI: 10.1007/s00018-009-0107-2] [Citation(s) in RCA: 78] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2009] [Revised: 06/11/2009] [Accepted: 07/14/2009] [Indexed: 12/31/2022]
Abstract
Retroelements comprise a considerable fraction of eukaryotic genomes. Since their initial discovery by Barbara McClintock in maize DNA, retroelements have been found in genomes of almost all organisms. First considered as a "junk DNA" or genomic parasites, they were shown to influence genome functioning and to promote genetic innovations. For this reason, they were suggested as an important creative force in the genome evolution and adaptation of an organism to altered environmental conditions. In this review, we summarize the up-to-date knowledge of different ways of retroelement involvement in structural and functional evolution of genes and genomes, as well as the mechanisms generated by cells to control their retrotransposition.
Collapse
Affiliation(s)
- Elena Gogvadze
- Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, 16/10 Miklukho-Maklaya st, 117997 Moscow, Russia.
| | | |
Collapse
|
48
|
Huda A, Jordan IK. Epigenetic Regulation of Mammalian Genomes by Transposable Elements. Ann N Y Acad Sci 2009; 1178:276-84. [DOI: 10.1111/j.1749-6632.2009.05007.x] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023]
|
49
|
Vorechovsky I. Transposable elements in disease-associated cryptic exons. Hum Genet 2009; 127:135-54. [PMID: 19823873 DOI: 10.1007/s00439-009-0752-4] [Citation(s) in RCA: 82] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2009] [Accepted: 09/27/2009] [Indexed: 11/28/2022]
Abstract
Transposable elements (TEs) make up a half of the human genome, but the extent of their contribution to cryptic exon activation that results in genetic disease is unknown. Here, a comprehensive survey of 78 mutation-induced cryptic exons previously identified in 51 disease genes revealed the presence of TEs in 40 cases (51%). Most TE-containing exons were derived from short interspersed nuclear elements (SINEs), with Alus and mammalian interspersed repeats (MIRs) covering >18 and >16% of the exonized sequences, respectively. The majority of SINE-derived cryptic exons had splice sites at the same positions of the Alu/MIR consensus as existing SINE exons and their inclusion in the mRNA was facilitated by phylogenetically conserved changes that improved both traditional and auxiliary splicing signals, thus marking intronic TEs amenable for pathogenic exonization. The overrepresentation of MIRs among TE exons is likely to result from their high average exon inclusion levels, which reflect their strong splice sites, a lack of splicing silencers and a high density of enhancers, particularly (G)AA(G) motifs. These elements were markedly depleted in antisense Alu exons, had the most prominent position on the exon-intron gradient scale and are proposed to promote exon definition through enhanced tertiary RNA interactions involving unpaired (di)adenosines. The identification of common mechanisms by which the most dynamic parts of the genome contribute both to new exon creation and genetic disease will facilitate detection of intronic mutations and the development of computational tools that predict TE hot-spots of cryptic exon activation.
Collapse
Affiliation(s)
- Igor Vorechovsky
- Division of Human Genetics, University of Southampton School of Medicine, MP808, Tremona Road, Southampton SO16 6YD, UK.
| |
Collapse
|
50
|
Wang J, Bowen NJ, Mariño-Ramírez L, Jordan IK. A c-Myc regulatory subnetwork from human transposable element sequences. MOLECULAR BIOSYSTEMS 2009; 5:1831-9. [PMID: 19763338 DOI: 10.1039/b908494k] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/29/2023]
Abstract
Transposable elements (TEs) can donate regulatory sequences that help to control the expression of human genes. The oncogene c-Myc is a promiscuous transcription factor that is thought to regulate the expression of hundreds of genes. We evaluated the contribution of TEs to the c-Myc regulatory network by searching for c-Myc binding sites derived from TEs and by analyzing the expression and function of target genes with nearby TE-derived c-Myc binding sites. There are thousands of TE sequences in the human genome that are bound by c-Myc. A conservative analysis indicated that 816-4564 of these TEs contain canonical c-Myc binding site motifs. c-Myc binding sites are over-represented among sequences derived from the ancient TE families L2 and MIR, consistent with their preservation by purifying selection. Genes associated with TE-derived c-Myc binding sites are co-expressed with each other and with c-Myc. A number of these putative TE-derived c-Myc target genes are differentially expressed between Burkitt's lymphoma cells versus normal B cells and encode proteins with cancer-related functions. Despite several lines of evidence pointing to their regulation by c-Myc and relevance to cancer, the set of genes identified as TE-derived c-Myc targets does not significantly overlap with two previously characterized c-Myc target gene sets. These data point to a substantial contribution of TEs to the regulation of human genes by c-Myc. Genes that are regulated by TE-derived c-Myc binding sites appear to form a distinct c-Myc regulatory subnetwork.
Collapse
Affiliation(s)
- Jianrong Wang
- School of Biology, Georgia Institute of Technology, Atlanta, GA 30332, USA.
| | | | | | | |
Collapse
|