1
|
Chakrabarty P, Sen R, Sengupta S. From parasites to partners: exploring the intricacies of host-transposon dynamics and coevolution. Funct Integr Genomics 2023; 23:278. [PMID: 37610667 DOI: 10.1007/s10142-023-01206-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2023] [Revised: 08/01/2023] [Accepted: 08/07/2023] [Indexed: 08/24/2023]
Abstract
Transposable elements, often referred to as "jumping genes," have long been recognized as genomic parasites due to their ability to integrate and disrupt normal gene function and induce extensive genomic alterations, thereby compromising the host's fitness. To counteract this, the host has evolved a plethora of mechanisms to suppress the activity of the transposons. Recent research has unveiled the host-transposon relationships to be nuanced and complex phenomena, resulting in the coevolution of both entities. Transposition increases the mutational rate in the host genome, often triggering physiological pathways such as immune and stress responses. Current gene transfer technologies utilizing transposable elements have potential drawbacks, including off-target integration, induction of mutations, and modifications of cellular machinery, which makes an in-depth understanding of the host-transposon relationship imperative. This review highlights the dynamic interplay between the host and transposable elements, encompassing various factors and components of the cellular machinery. We provide a comprehensive discussion of the strategies employed by transposable elements for their propagation, as well as the mechanisms utilized by the host to mitigate their parasitic effects. Additionally, we present an overview of recent research identifying host proteins that act as facilitators or inhibitors of transposition. We further discuss the evolutionary outcomes resulting from the genetic interactions between the host and the transposable elements. Finally, we pose open questions in this field and suggest potential avenues for future research.
Collapse
Affiliation(s)
- Prayas Chakrabarty
- Department of Life Sciences, Presidency University Kolkata, 86/1 College Street, Kolkata, 700073, India
| | - Raneet Sen
- Department of Life Sciences, Presidency University Kolkata, 86/1 College Street, Kolkata, 700073, India
- Institute of Bioorganic Chemistry, Department of RNA Metabolism, Polish Academy of Sciences, Poznan, Poland
| | - Sugopa Sengupta
- Department of Life Sciences, Presidency University Kolkata, 86/1 College Street, Kolkata, 700073, India.
| |
Collapse
|
2
|
Adel S, Carels N. Plant Tolerance to Drought Stress with Emphasis on Wheat. PLANTS (BASEL, SWITZERLAND) 2023; 12:plants12112170. [PMID: 37299149 DOI: 10.3390/plants12112170] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/02/2023] [Revised: 03/16/2023] [Accepted: 03/29/2023] [Indexed: 06/12/2023]
Abstract
Environmental stresses, such as drought, have negative effects on crop yield. Drought is a stress whose impact tends to increase in some critical regions. However, the worldwide population is continuously increasing and climate change may affect its food supply in the upcoming years. Therefore, there is an ongoing effort to understand the molecular processes that may contribute to improving drought tolerance of strategic crops. These investigations should contribute to delivering drought-tolerant cultivars by selective breeding. For this reason, it is worthwhile to review regularly the literature concerning the molecular mechanisms and technologies that could facilitate gene pyramiding for drought tolerance. This review summarizes achievements obtained using QTL mapping, genomics, synteny, epigenetics, and transgenics for the selective breeding of drought-tolerant wheat cultivars. Synthetic apomixis combined with the msh1 mutation opens the way to induce and stabilize epigenomes in crops, which offers the potential of accelerating selective breeding for drought tolerance in arid and semi-arid regions.
Collapse
Affiliation(s)
- Sarah Adel
- Genetic Department, Faculty of Agriculture, Ain Shams University, Cairo 11241, Egypt
| | - Nicolas Carels
- Laboratory of Biological System Modeling, Center of Technological Development for Health (CDTS), Oswaldo Cruz Foundation (Fiocruz), Rio de Janeiro 21040-361, Brazil
| |
Collapse
|
3
|
Riba A, Fumagalli MR, Caselle M, Osella M. A Model-Driven Quantitative Analysis of Retrotransposon Distributions in the Human Genome. Genome Biol Evol 2021; 12:2045-2059. [PMID: 32986810 PMCID: PMC7750997 DOI: 10.1093/gbe/evaa201] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/19/2020] [Indexed: 12/21/2022] Open
Abstract
Retrotransposons, DNA sequences capable of creating copies of themselves, compose about half of the human genome and played a central role in the evolution of mammals. Their current position in the host genome is the result of the retrotranscription process and of the following host genome evolution. We apply a model from statistical physics to show that the genomic distribution of the two most populated classes of retrotransposons in human deviates from random placement, and that this deviation increases with time. The time dependence suggests a major role of the host genome dynamics in shaping the current retrotransposon distributions. Focusing on a neutral scenario, we show that a simple model based on random placement followed by genome expansion and sequence duplications can reproduce the empirical retrotransposon distributions, even though more complex and possibly selective mechanisms can have contributed. Besides the inherent interest in understanding the origin of current retrotransposon distributions, this work sets a general analytical framework to analyze quantitatively the effects of genome evolutionary dynamics on the distribution of genomic elements.
Collapse
Affiliation(s)
| | - Maria Rita Fumagalli
- Institute of Biophysics - CNR, National Research Council, Genova, Italy.,Department of Environmental Science and Policy, Center for Complexity and Biosystems, University of Milan, Milano, Italy
| | - Michele Caselle
- Department of Physics and INFN, University of Torino, Torino, Italy
| | - Matteo Osella
- Department of Physics and INFN, University of Torino, Torino, Italy
| |
Collapse
|
4
|
Lu JY, Chang L, Li T, Wang T, Yin Y, Zhan G, Han X, Zhang K, Tao Y, Percharde M, Wang L, Peng Q, Yan P, Zhang H, Bi X, Shao W, Hong Y, Wu Z, Ma R, Wang P, Li W, Zhang J, Chang Z, Hou Y, Zhu B, Ramalho-Santos M, Li P, Xie W, Na J, Sun Y, Shen X. Homotypic clustering of L1 and B1/Alu repeats compartmentalizes the 3D genome. Cell Res 2021; 31:613-630. [PMID: 33514913 PMCID: PMC8169921 DOI: 10.1038/s41422-020-00466-6] [Citation(s) in RCA: 85] [Impact Index Per Article: 28.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2020] [Accepted: 12/17/2020] [Indexed: 01/30/2023] Open
Abstract
Organization of the genome into euchromatin and heterochromatin appears to be evolutionarily conserved and relatively stable during lineage differentiation. In an effort to unravel the basic principle underlying genome folding, here we focus on the genome itself and report a fundamental role for L1 (LINE1 or LINE-1) and B1/Alu retrotransposons, the most abundant subclasses of repetitive sequences, in chromatin compartmentalization. We find that homotypic clustering of L1 and B1/Alu demarcates the genome into grossly exclusive domains, and characterizes and predicts Hi-C compartments. Spatial segregation of L1-rich sequences in the nuclear and nucleolar peripheries and B1/Alu-rich sequences in the nuclear interior is conserved in mouse and human cells and occurs dynamically during the cell cycle. In addition, de novo establishment of L1 and B1 nuclear segregation is coincident with the formation of higher-order chromatin structures during early embryogenesis and appears to be critically regulated by L1 and B1 transcripts. Importantly, depletion of L1 transcripts in embryonic stem cells drastically weakens homotypic repeat contacts and compartmental strength, and disrupts the nuclear segregation of L1- or B1-rich chromosomal sequences at genome-wide and individual sites. Mechanistically, nuclear co-localization and liquid droplet formation of L1 repeat DNA and RNA with heterochromatin protein HP1α suggest a phase-separation mechanism by which L1 promotes heterochromatin compartmentalization. Taken together, we propose a genetically encoded model in which L1 and B1/Alu repeats blueprint chromatin macrostructure. Our model explains the robustness of genome folding into a common conserved core, on which dynamic gene regulation is overlaid across cells.
Collapse
Affiliation(s)
- J Yuyang Lu
- Tsinghua-Peking Joint Center for Life Sciences, School of Medicine and School of Life Sciences, Tsinghua University, Beijing, 100084, China
| | - Lei Chang
- State Key Laboratory of Membrane Biology, Biomedical Pioneering Innovation Center (BIOPIC), School of Life Sciences, and College of Future Technology, Peking University, Beijing, 100871, China
- Bioland Laboratory (Guangzhou Regenerative Medicine and Health Guangdong Laboratory), Guangzhou, Guangdong, 510005, China
| | - Tong Li
- Tsinghua-Peking Joint Center for Life Sciences, School of Medicine and School of Life Sciences, Tsinghua University, Beijing, 100084, China
| | - Ting Wang
- Tsinghua-Peking Joint Center for Life Sciences, School of Medicine and School of Life Sciences, Tsinghua University, Beijing, 100084, China
| | - Yafei Yin
- Tsinghua-Peking Joint Center for Life Sciences, School of Medicine and School of Life Sciences, Tsinghua University, Beijing, 100084, China
| | - Ge Zhan
- Tsinghua-Peking Joint Center for Life Sciences, School of Medicine and School of Life Sciences, Tsinghua University, Beijing, 100084, China
| | - Xue Han
- Tsinghua-Peking Joint Center for Life Sciences, School of Medicine and School of Life Sciences, Tsinghua University, Beijing, 100084, China
| | - Ke Zhang
- Tsinghua-Peking Joint Center for Life Sciences, School of Medicine and School of Life Sciences, Tsinghua University, Beijing, 100084, China
| | - Yibing Tao
- Tsinghua-Peking Joint Center for Life Sciences, School of Medicine and School of Life Sciences, Tsinghua University, Beijing, 100084, China
| | - Michelle Percharde
- MRC London Institute of Medical Sciences (LMS), London, W120NN, UK
- Institute of Clinical Sciences (ICS), Faculty of Medicine, Imperial College London, London, W120NN, UK
| | - Liang Wang
- Tsinghua-Peking Joint Center for Life Sciences, School of Medicine and School of Life Sciences, Tsinghua University, Beijing, 100084, China
| | - Qi Peng
- Tsinghua-Peking Joint Center for Life Sciences, School of Medicine and School of Life Sciences, Tsinghua University, Beijing, 100084, China
| | - Pixi Yan
- Tsinghua-Peking Joint Center for Life Sciences, School of Medicine and School of Life Sciences, Tsinghua University, Beijing, 100084, China
| | - Hui Zhang
- Tsinghua-Peking Joint Center for Life Sciences, School of Medicine and School of Life Sciences, Tsinghua University, Beijing, 100084, China
| | - Xianju Bi
- Tsinghua-Peking Joint Center for Life Sciences, School of Medicine and School of Life Sciences, Tsinghua University, Beijing, 100084, China
| | - Wen Shao
- Tsinghua-Peking Joint Center for Life Sciences, School of Medicine and School of Life Sciences, Tsinghua University, Beijing, 100084, China
| | - Yantao Hong
- Tsinghua-Peking Joint Center for Life Sciences, School of Medicine and School of Life Sciences, Tsinghua University, Beijing, 100084, China
| | - Zhongyang Wu
- Tsinghua-Peking Joint Center for Life Sciences, School of Medicine and School of Life Sciences, Tsinghua University, Beijing, 100084, China
| | - Runze Ma
- National Laboratory of Biomacromolecules, CAS Center for Excellence in Biomacromolecules, Institute of Biophysics, Chinese Academy of Sciences, Beijing, 100101, China
| | - Peizhe Wang
- Tsinghua-Peking Joint Center for Life Sciences, School of Medicine and School of Life Sciences, Tsinghua University, Beijing, 100084, China
| | - Wenzhi Li
- Tsinghua-Peking Joint Center for Life Sciences, School of Medicine and School of Life Sciences, Tsinghua University, Beijing, 100084, China
| | - Jing Zhang
- Tsinghua-Peking Joint Center for Life Sciences, School of Medicine and School of Life Sciences, Tsinghua University, Beijing, 100084, China
| | - Zai Chang
- Tsinghua-Peking Joint Center for Life Sciences, School of Medicine and School of Life Sciences, Tsinghua University, Beijing, 100084, China
| | - Yingping Hou
- State Key Laboratory of Membrane Biology, Biomedical Pioneering Innovation Center (BIOPIC), School of Life Sciences, and College of Future Technology, Peking University, Beijing, 100871, China
| | - Bing Zhu
- National Laboratory of Biomacromolecules, CAS Center for Excellence in Biomacromolecules, Institute of Biophysics, Chinese Academy of Sciences, Beijing, 100101, China
| | - Miguel Ramalho-Santos
- Lunenfeld-Tanenbaum Research Institute, University of Toronto, Toronto, Ontario, M5T 3H7, Canada
| | - Pilong Li
- Tsinghua-Peking Joint Center for Life Sciences, School of Medicine and School of Life Sciences, Tsinghua University, Beijing, 100084, China
| | - Wei Xie
- Tsinghua-Peking Joint Center for Life Sciences, School of Medicine and School of Life Sciences, Tsinghua University, Beijing, 100084, China
| | - Jie Na
- Tsinghua-Peking Joint Center for Life Sciences, School of Medicine and School of Life Sciences, Tsinghua University, Beijing, 100084, China
| | - Yujie Sun
- State Key Laboratory of Membrane Biology, Biomedical Pioneering Innovation Center (BIOPIC), School of Life Sciences, and College of Future Technology, Peking University, Beijing, 100871, China.
| | - Xiaohua Shen
- Tsinghua-Peking Joint Center for Life Sciences, School of Medicine and School of Life Sciences, Tsinghua University, Beijing, 100084, China.
| |
Collapse
|
5
|
Lu JY, Shao W, Chang L, Yin Y, Li T, Zhang H, Hong Y, Percharde M, Guo L, Wu Z, Liu L, Liu W, Yan P, Ramalho-Santos M, Sun Y, Shen X. Genomic Repeats Categorize Genes with Distinct Functions for Orchestrated Regulation. Cell Rep 2020; 30:3296-3311.e5. [PMID: 32160538 PMCID: PMC7195444 DOI: 10.1016/j.celrep.2020.02.048] [Citation(s) in RCA: 77] [Impact Index Per Article: 19.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2019] [Revised: 11/11/2019] [Accepted: 02/10/2020] [Indexed: 11/06/2022] Open
Abstract
Repetitive elements are abundantly distributed in mammalian genomes. Here, we reveal a striking association between repeat subtypes and gene function. SINE, L1, and low-complexity repeats demarcate distinct functional categories of genes and may dictate the time and level of gene expression by providing binding sites for different regulatory proteins. Importantly, imaging and sequencing analysis show that L1 repeats sequester a large set of genes with specialized functions in nucleolus- and lamina-associated inactive domains that are depleted of SINE repeats. In addition, L1 transcripts bind extensively to its DNA in embryonic stem cells (ESCs). Depletion of L1 RNA in ESCs leads to relocation of L1-enriched chromosomal segments from inactive domains to the nuclear interior and de-repression of L1-associated genes. These results demonstrate a role of L1 DNA and RNA in gene silencing and suggest a general theme of genomic repeats in orchestrating the function, regulation, and expression of their host genes.
Collapse
Affiliation(s)
- J Yuyang Lu
- Tsinghua Center for Life Sciences, School of Medicine and School of Life Sciences, Tsinghua University, Beijing 100084, China
| | - Wen Shao
- Tsinghua Center for Life Sciences, School of Medicine and School of Life Sciences, Tsinghua University, Beijing 100084, China
| | - Lei Chang
- State Key Laboratory of Membrane Biology, School of Life Sciences, and Biomedical Pioneering Innovation Center (BIOPIC), Peking University, Beijing 100871, China
| | - Yafei Yin
- Tsinghua Center for Life Sciences, School of Medicine and School of Life Sciences, Tsinghua University, Beijing 100084, China
| | - Tong Li
- Tsinghua Center for Life Sciences, School of Medicine and School of Life Sciences, Tsinghua University, Beijing 100084, China
| | - Hui Zhang
- Tsinghua Center for Life Sciences, School of Medicine and School of Life Sciences, Tsinghua University, Beijing 100084, China
| | - Yantao Hong
- Tsinghua Center for Life Sciences, School of Medicine and School of Life Sciences, Tsinghua University, Beijing 100084, China
| | - Michelle Percharde
- MRC London Institute of Medical Sciences (LMS), London W120NN, UK; Institute of Clinical Sciences (ICS), Faculty of Medicine, Imperial College London, London W120NN, UK
| | - Lerui Guo
- Tsinghua Center for Life Sciences, School of Medicine and School of Life Sciences, Tsinghua University, Beijing 100084, China
| | - Zhongyang Wu
- Tsinghua Center for Life Sciences, School of Medicine and School of Life Sciences, Tsinghua University, Beijing 100084, China
| | - Lichao Liu
- Tsinghua Center for Life Sciences, School of Medicine and School of Life Sciences, Tsinghua University, Beijing 100084, China
| | - Wei Liu
- Tsinghua Center for Life Sciences, School of Medicine and School of Life Sciences, Tsinghua University, Beijing 100084, China
| | - Pixi Yan
- Tsinghua Center for Life Sciences, School of Medicine and School of Life Sciences, Tsinghua University, Beijing 100084, China
| | - Miguel Ramalho-Santos
- Lunenfeld-Tanenbaum Research Institute and Department of Molecular Genetics, University of Toronto, Toronto, ON M5T 3H7, Canada
| | - Yujie Sun
- State Key Laboratory of Membrane Biology, School of Life Sciences, and Biomedical Pioneering Innovation Center (BIOPIC), Peking University, Beijing 100871, China
| | - Xiaohua Shen
- Tsinghua Center for Life Sciences, School of Medicine and School of Life Sciences, Tsinghua University, Beijing 100084, China.
| |
Collapse
|
6
|
de Mendoza A, Pflueger J, Lister R. Capture of a functionally active methyl-CpG binding domain by an arthropod retrotransposon family. Genome Res 2019; 29:1277-1286. [PMID: 31239280 PMCID: PMC6673714 DOI: 10.1101/gr.243774.118] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2018] [Accepted: 06/20/2019] [Indexed: 12/30/2022]
Abstract
The repressive capacity of cytosine DNA methylation is mediated by recruitment of silencing complexes by methyl-CpG binding domain (MBD) proteins. Despite MBD proteins being associated with silencing, we discovered that a family of arthropod Copia retrotransposons have incorporated a host-derived MBD. We functionally show how retrotransposon-encoded MBDs preferentially bind to CpG-dense methylated regions, which correspond to transposable element regions of the host genome, in the myriapod Strigamia maritima Consistently, young MBD-encoding Copia retrotransposons (CopiaMBD) accumulate in regions with higher CpG densities than other LTR-retrotransposons also present in the genome. This would suggest that retrotransposons use MBDs to integrate into heterochromatic regions in Strigamia, avoiding potentially harmful insertions into host genes. In contrast, CopiaMBD insertions in the spider Stegodyphus dumicola genome disproportionately accumulate in methylated gene bodies compared with other spider LTR-retrotransposons. Given that transposons are not actively targeted by DNA methylation in the spider genome, this distribution bias would also support a role for MBDs in the integration process. Together, these data show that retrotransposons can co-opt host-derived epigenome readers, potentially harnessing the host epigenome landscape to advantageously tune the retrotransposition process.
Collapse
Affiliation(s)
- Alex de Mendoza
- Australian Research Council Centre of Excellence in Plant Energy Biology, School of Molecular Sciences, The University of Western Australia, Perth, Western Australia, 6009, Australia.,Harry Perkins Institute of Medical Research, Perth, Western Australia, 6009, Australia
| | - Jahnvi Pflueger
- Australian Research Council Centre of Excellence in Plant Energy Biology, School of Molecular Sciences, The University of Western Australia, Perth, Western Australia, 6009, Australia.,Harry Perkins Institute of Medical Research, Perth, Western Australia, 6009, Australia
| | - Ryan Lister
- Australian Research Council Centre of Excellence in Plant Energy Biology, School of Molecular Sciences, The University of Western Australia, Perth, Western Australia, 6009, Australia.,Harry Perkins Institute of Medical Research, Perth, Western Australia, 6009, Australia
| |
Collapse
|
7
|
Sultana T, van Essen D, Siol O, Bailly-Bechet M, Philippe C, Zine El Aabidine A, Pioger L, Nigumann P, Saccani S, Andrau JC, Gilbert N, Cristofari G. The Landscape of L1 Retrotransposons in the Human Genome Is Shaped by Pre-insertion Sequence Biases and Post-insertion Selection. Mol Cell 2019; 74:555-570.e7. [PMID: 30956044 DOI: 10.1016/j.molcel.2019.02.036] [Citation(s) in RCA: 88] [Impact Index Per Article: 17.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2018] [Revised: 01/28/2019] [Accepted: 02/25/2019] [Indexed: 01/10/2023]
Abstract
L1 retrotransposons are transposable elements and major contributors of genetic variation in humans. Where L1 integrates into the genome can directly impact human evolution and disease. Here, we experimentally induced L1 retrotransposition in cells and mapped integration sites at nucleotide resolution. At local scales, L1 integration is mostly restricted by genome sequence biases and the specificity of the L1 machinery. At regional scales, L1 shows a broad capacity for integration into all chromatin states, in contrast to other known mobile genetic elements. However, integration is influenced by the replication timing of target regions, suggesting a link to host DNA replication. The distribution of new L1 integrations differs from those of preexisting L1 copies, which are significantly reshaped by natural selection. Our findings reveal that the L1 machinery has evolved to efficiently target all genomic regions and underline a predominant role for post-integrative processes on the distribution of endogenous L1 elements.
Collapse
Affiliation(s)
- Tania Sultana
- Université Côte d'Azur, Inserm, CNRS, IRCAN, Nice, France
| | | | - Oliver Siol
- Institut de Génétique Humaine, University of Montpellier, CNRS, Montpellier, France
| | | | | | - Amal Zine El Aabidine
- Institut de Génétique Moléculaire de Montpellier, University of Montpellier, CNRS, Montpellier, France
| | - Léo Pioger
- Institut de Génétique Moléculaire de Montpellier, University of Montpellier, CNRS, Montpellier, France
| | - Pilvi Nigumann
- Université Côte d'Azur, Inserm, CNRS, IRCAN, Nice, France
| | - Simona Saccani
- Université Côte d'Azur, Inserm, CNRS, IRCAN, Nice, France
| | - Jean-Christophe Andrau
- Institut de Génétique Moléculaire de Montpellier, University of Montpellier, CNRS, Montpellier, France
| | - Nicolas Gilbert
- Institut de Génétique Humaine, University of Montpellier, CNRS, Montpellier, France; Institut de Médecine Régénératrice et de Biothérapie, Inserm U1183, CHU Montpellier, Montpellier, France
| | | |
Collapse
|
8
|
Rishishwar L, Wang L, Wang J, Yi SV, Lachance J, Jordan IK. Evidence for positive selection on recent human transposable element insertions. Gene 2018; 675:69-79. [DOI: 10.1016/j.gene.2018.06.077] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2018] [Accepted: 06/24/2018] [Indexed: 11/29/2022]
|
9
|
Transposable element dysregulation in systemic lupus erythematosus and regulation by histone conformation and Hsp90. Clin Immunol 2018; 197:6-18. [PMID: 30149120 DOI: 10.1016/j.clim.2018.08.011] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2018] [Revised: 08/22/2018] [Accepted: 08/23/2018] [Indexed: 01/27/2023]
Abstract
Systemic lupus erythematosus (SLE) represents an autoimmune disease in which activation of the type I interferon pathway leads to dysregulation of tolerance and the generation of autoantibodies directed against nuclear constituents. The mechanisms driving the activation of the interferon pathway in SLE have been the subject of intense investigation but are still incompletely understood. Transposable elements represent an enormous source of RNA that could potentially stimulate the cell intrinsic RNA-recognition pathway, leading to upregulation of interferons. We used RNA-seq to define transposable element families and subfamilies in three cell types in SLE and found diverse effects on transposable element expression in the three cell types and even within a given family of transposable elements. When potential mechanisms were examined, we found that Hsp90 inhibition could drive increased expression of multiple type of transposable elements. Both direct inhibition and the delivery of a heat shock itself, which redirects heat shock regulators (including Hsp90) off of basal expression promoters and onto heat shock-responsive promoters, led to increased transposable element expression. This effect was amplified by the concurrent delivery of a histone deacetylase inhibitor. We conclude that transposable elements are dysregulated in SLE and there are tissue-specific effects and locus-specific effects. The magnitude of RNAs attributable to transposable elements makes their dysregulation of critical interest in SLE where transposable element RNA complexed with proteins has been shown to drive interferon expression.
Collapse
|
10
|
Wang L, Jordan IK. Transposable element activity, genome regulation and human health. Curr Opin Genet Dev 2018; 49:25-33. [PMID: 29505964 DOI: 10.1016/j.gde.2018.02.006] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2017] [Revised: 01/30/2018] [Accepted: 02/13/2018] [Indexed: 12/21/2022]
Abstract
A convergence of novel genome analysis technologies is enabling population genomic studies of human transposable elements (TEs). Population surveys of human genome sequences have uncovered thousands of individual TE insertions that segregate as common genetic variants, i.e. TE polymorphisms. These recent TE insertions provide an important source of naturally occurring human genetic variation. Investigators are beginning to leverage population genomic data sets to execute genome-scale association studies for assessing the phenotypic impact of human TE polymorphisms. For example, the expression quantitative trait loci (eQTL) analytical paradigm has recently been used to uncover hundreds of associations between human TE insertion variants and gene expression levels. These include population-specific gene regulatory effects as well as coordinated changes to gene regulatory networks. In addition, analyses of linkage disequilibrium patterns with previously characterized genome-wide association study (GWAS) trait variants have uncovered TE insertion polymorphisms that are likely causal variants for a variety of common complex diseases. Gene regulatory mechanisms that underlie specific disease phenotypes have been proposed for a number of these trait associated TE polymorphisms. These new population genomic approaches hold great promise for understanding how ongoing TE activity contributes to functionally relevant genetic variation within and between human populations.
Collapse
Affiliation(s)
- Lu Wang
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, GA, USA; PanAmerican Bioinformatics Institute, Cali, Colombia
| | - I King Jordan
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, GA, USA; PanAmerican Bioinformatics Institute, Cali, Colombia.
| |
Collapse
|
11
|
Wang L, Norris ET, Jordan IK. Human Retrotransposon Insertion Polymorphisms Are Associated with Health and Disease via Gene Regulatory Phenotypes. Front Microbiol 2017; 8:1418. [PMID: 28824558 PMCID: PMC5539088 DOI: 10.3389/fmicb.2017.01418] [Citation(s) in RCA: 37] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2017] [Accepted: 07/13/2017] [Indexed: 11/18/2022] Open
Abstract
The human genome hosts several active families of transposable elements (TEs), including the Alu, LINE-1, and SVA retrotransposons that are mobilized via reverse transcription of RNA intermediates. We evaluated how insertion polymorphisms generated by human retrotransposon activity may be related to common health and disease phenotypes that have been previously interrogated through genome-wide association studies (GWAS). To address this question, we performed a genome-wide screen for retrotransposon polymorphism disease associations that are linked to TE induced gene regulatory changes. Our screen first identified polymorphic retrotransposon insertions found in linkage disequilibrium (LD) with single nucleotide polymorphisms that were previously associated with common complex diseases by GWAS. We further narrowed this set of candidate disease associated retrotransposon polymorphisms by identifying insertions that are located within tissue-specific enhancer elements. We then performed expression quantitative trait loci analysis on the remaining set of candidates in order to identify polymorphic retrotransposon insertions that are associated with gene expression changes in B-cells of the human immune system. This progressive and stringent screen yielded a list of six retrotransposon insertions as the strongest candidates for TE polymorphisms that lead to disease via enhancer-mediated changes in gene regulation. For example, we found an SVA insertion within a cell-type specific enhancer located in the second intron of the B4GALT1 gene. B4GALT1 encodes a glycosyltransferase that functions in the glycosylation of the Immunoglobulin G (IgG) antibody in such a way as to convert its activity from pro- to anti-inflammatory. The disruption of the B4GALT1 enhancer by the SVA insertion is associated with down-regulation of the gene in B-cells, which would serve to keep the IgG molecule in a pro-inflammatory state. Consistent with this idea, the B4GALT1 enhancer SVA insertion is linked to a genomic region implicated by GWAS in both inflammatory conditions and autoimmune diseases, such as systemic lupus erythematosus and Crohn’s disease. We explore this example and the other cases uncovered by our genome-wide screen in an effort to illuminate how retrotransposon insertion polymorphisms can impact human health and disease by causing changes in gene expression.
Collapse
Affiliation(s)
- Lu Wang
- School of Biological Sciences, Georgia Institute of Technology, AtlantaGA, United States.,PanAmerican Bioinformatics InstituteCali, Colombia.,Applied Bioinformatics Laboratory, AtlantaGA, United States
| | - Emily T Norris
- School of Biological Sciences, Georgia Institute of Technology, AtlantaGA, United States.,PanAmerican Bioinformatics InstituteCali, Colombia.,Applied Bioinformatics Laboratory, AtlantaGA, United States
| | - I K Jordan
- School of Biological Sciences, Georgia Institute of Technology, AtlantaGA, United States.,PanAmerican Bioinformatics InstituteCali, Colombia.,Applied Bioinformatics Laboratory, AtlantaGA, United States
| |
Collapse
|
12
|
Wang L, Rishishwar L, Mariño-Ramírez L, Jordan IK. Human population-specific gene expression and transcriptional network modification with polymorphic transposable elements. Nucleic Acids Res 2017; 45:2318-2328. [PMID: 27998931 PMCID: PMC5389732 DOI: 10.1093/nar/gkw1286] [Citation(s) in RCA: 33] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2016] [Revised: 12/05/2016] [Accepted: 12/12/2016] [Indexed: 02/07/2023] Open
Abstract
Transposable element (TE) derived sequences are known to contribute to the regulation of the human genome. The majority of known TE-derived regulatory sequences correspond to relatively ancient insertions, which are fixed across human populations. The extent to which human genetic variation caused by recent TE activity leads to regulatory polymorphisms among populations has yet to be thoroughly explored. In this study, we searched for associations between polymorphic TE (polyTE) loci and human gene expression levels using an expression quantitative trait loci (eQTL) approach. We compared locus-specific polyTE insertion genotypes to B cell gene expression levels among 445 individuals from 5 human populations. Numerous human polyTE loci correspond to both cis and trans eQTL, and their regulatory effects are directly related to cell type-specific function in the immune system. PolyTE loci are associated with differences in expression between European and African population groups, and a single polyTE loci is indirectly associated with the expression of numerous genes via the regulation of the B cell-specific transcription factor PAX5. The polyTE-gene expression associations we found indicate that human TE genetic variation can have important phenotypic consequences. Our results reveal that TE-eQTL are involved in population-specific gene regulation as well as transcriptional network modification.
Collapse
Affiliation(s)
- Lu Wang
- School of Biology, Georgia Institute of Technology, Atlanta, GA 30332, USA
| | - Lavanya Rishishwar
- School of Biology, Georgia Institute of Technology, Atlanta, GA 30332, USA
- Applied Bioinformatics Laboratory, Atlanta, GA 30332, USA
- PanAmerican Bioinformatics Institute, Cali, Valle del Cauca, 760043, Colombia
- BIOS Centro de Bioinformática y Biología Computacional, Manizales, Caldas, 170002, Colombia
| | - Leonardo Mariño-Ramírez
- PanAmerican Bioinformatics Institute, Cali, Valle del Cauca, 760043, Colombia
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - I. King Jordan
- School of Biology, Georgia Institute of Technology, Atlanta, GA 30332, USA
- Applied Bioinformatics Laboratory, Atlanta, GA 30332, USA
- PanAmerican Bioinformatics Institute, Cali, Valle del Cauca, 760043, Colombia
- BIOS Centro de Bioinformática y Biología Computacional, Manizales, Caldas, 170002, Colombia
| |
Collapse
|
13
|
Rishishwar L, Wang L, Clayton EA, Mariño-Ramírez L, McDonald JF, Jordan IK. Population and clinical genetics of human transposable elements in the (post) genomic era. Mob Genet Elements 2017; 7:1-20. [PMID: 28228978 PMCID: PMC5305044 DOI: 10.1080/2159256x.2017.1280116] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2016] [Revised: 01/03/2017] [Accepted: 01/04/2017] [Indexed: 10/26/2022] Open
Abstract
Recent technological developments-in genomics, bioinformatics and high-throughput experimental techniques-are providing opportunities to study ongoing human transposable element (TE) activity at an unprecedented level of detail. It is now possible to characterize genome-wide collections of TE insertion sites for multiple human individuals, within and between populations, and for a variety of tissue types. Comparison of TE insertion site profiles between individuals captures the germline activity of TEs and reveals insertion site variants that segregate as polymorphisms among human populations, whereas comparison among tissue types ascertains somatic TE activity that generates cellular heterogeneity. In this review, we provide an overview of these new technologies and explore their implications for population and clinical genetic studies of human TEs. We cover both recent published results on human TE insertion activity as well as the prospects for future TE studies related to human evolution and health.
Collapse
Affiliation(s)
- Lavanya Rishishwar
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, GA, USA; PanAmerican Bioinformatics Institute, Cali, Colombia; Applied Bioinformatics Laboratory, Atlanta, GA, USA
| | - Lu Wang
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, GA, USA; PanAmerican Bioinformatics Institute, Cali, Colombia
| | - Evan A Clayton
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, GA, USA; Ovarian Cancer Institute, Atlanta, GA, USA
| | - Leonardo Mariño-Ramírez
- PanAmerican Bioinformatics Institute, Cali, Colombia; National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, USA
| | - John F McDonald
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, GA, USA; Ovarian Cancer Institute, Atlanta, GA, USA
| | - I King Jordan
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, GA, USA; PanAmerican Bioinformatics Institute, Cali, Colombia; Applied Bioinformatics Laboratory, Atlanta, GA, USA
| |
Collapse
|
14
|
Lamolle G, Protasio AV, Iriarte A, Jara E, Simón D, Musto H. An Isochore-Like Structure in the Genome of the Flatworm Schistosoma mansoni. Genome Biol Evol 2016; 8:2312-8. [PMID: 27435793 PMCID: PMC5010904 DOI: 10.1093/gbe/evw170] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Eukaryotic genomes are compositionally heterogeneous, that is, composed by regions that differ in guanine-cytosine (GC) content (isochores). The most well documented case is that of vertebrates (mainly mammals) although it has been also noted among unicellular eukaryotes and invertebrates. In the human genome, regarded as a typical mammal, this heterogeneity is associated with several features. Specifically, genes located in GC-richest regions are the GC3-richest, display CpG islands and have shorter introns. Furthermore, these genes are more heavily expressed and tend to be located at the extremes of the chromosomes. Although the compositional heterogeneity seems to be widespread among eukaryotes, the associated properties noted in the human genome and other mammals have not been investigated in depth in other taxa Here we provide evidence that the genome of the parasitic flatworm Schistosoma mansoni is compositionally heterogeneous and exhibits an isochore-like structure, displaying some features associated, until now, only with the human and other vertebrate genomes, with the exception of gene concentration.
Collapse
Affiliation(s)
- Guillermo Lamolle
- Laboratorio de Organización y Evolución del Genoma, Facultad de Ciencias, Udelar, Montevideo, Uruguay
| | - Anna V Protasio
- Wellcome Trust Genome Campus, Wellcome Trust Sanger Institute, Cambridge, United Kingdom
| | - Andrés Iriarte
- Laboratorio de Organización y Evolución del Genoma, Facultad de Ciencias, Udelar, Montevideo, Uruguay Dpto. de Desarrollo Biotecnológico, Facultad de Medicina, Instituto de Higiene, Udelar, Montevideo, Uruguay
| | - Eugenio Jara
- Laboratorio de Organización y Evolución del Genoma, Facultad de Ciencias, Udelar, Montevideo, Uruguay
| | - Diego Simón
- Laboratorio de Organización y Evolución del Genoma, Facultad de Ciencias, Udelar, Montevideo, Uruguay
| | - Héctor Musto
- Laboratorio de Organización y Evolución del Genoma, Facultad de Ciencias, Udelar, Montevideo, Uruguay
| |
Collapse
|
15
|
Gaudi S, Guffanti G, Fallon J, Macciardi F. Epigenetic mechanisms and associated brain circuits in the regulation of positive emotions: A role for transposable elements. J Comp Neurol 2016; 524:2944-54. [PMID: 27224878 DOI: 10.1002/cne.24046] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2015] [Revised: 07/21/2015] [Accepted: 05/23/2016] [Indexed: 01/12/2023]
Abstract
Epigenetic programming and reprogramming are at the heart of cellular differentiation and represent developmental and evolutionary mechanisms in both germline and somatic cell lines. Only about 2% of our genome is composed of protein-coding genes, while the remaining 98%, once considered "junk" DNA, codes for regulatory/epigenetic elements that control how genes are expressed in different tissues and across time from conception to death. While we already know that epigenetic mechanisms are at play in cancer development and in regulating metabolism (cellular and whole body), the role of epigenetics in the developing prenatal and postnatal brain, and in maintaining a proper brain activity throughout the various stages of life, in addition to having played a critical role in human evolution, is a relatively new domain of knowledge. Here we present the current state-of-the-art techniques and results of these studies within the domain of emotions, and then speculate on how genomic and epigenetic mechanisms can modify and potentially alter our emotional (limbic) brain and affect our social interactions. J. Comp. Neurol. 524:2944-2954, 2016. © 2016 Wiley Periodicals, Inc.
Collapse
Affiliation(s)
- Simona Gaudi
- Department of Infectious, Parasitic, and Immune-Mediated Diseases, Italian National Institute of Health, 00161, Rome, Italy
| | - Guia Guffanti
- Department of Psychiatry, McLean Hospital, Harvard Medical School, Belmont, 02478, MA
| | - James Fallon
- Department of Psychiatry and Human Behavior, University of California Irvine, Irvine, 92617, California
| | - Fabio Macciardi
- Department of Psychiatry and Human Behavior, University of California Irvine, Irvine, 92617, California.,Center for Autism Research and Treatment (CART), University of California Irvine, Irvine, 92617, California.,Center for Epigenetics and Metabolism, University of California Irvine, Irvine, 92617, California.,Department of Health Sciences, University of Milan, 20133, Milan, Italy
| |
Collapse
|
16
|
Evolutionary direction of processed pseudogenes. SCIENCE CHINA-LIFE SCIENCES 2016; 59:839-49. [PMID: 27333782 DOI: 10.1007/s11427-016-5074-x] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/16/2015] [Accepted: 03/08/2016] [Indexed: 10/21/2022]
Abstract
While some pseudogenes have been reported to play important roles in gene regulation, little is known about the possible relationship between pseudogene functions and evolutionary process of pseudogenes, or about the forces responsible for the pseudogene evolution. In this study, we characterized human processed pseudogenes in terms of evolutionary dynamics. Our results show that pseudogenes tend to evolve toward: lower GC content, strong dinucleotide bias, reduced abundance of transcription factor binding motifs and short palindromes, and decreased ability to form nucleosomes. We explored possible evolutionary forces that shaped the evolution pattern of pseudogenes, and concluded that mutations in pseudogenes are likely determined, at least partially, by neighbor-dependent mutational bias and recombination-associated selection.
Collapse
|
17
|
Knothe C, Shiratori H, Resch E, Ultsch A, Geisslinger G, Doehring A, Lötsch J. Disagreement between two common biomarkers of global DNA methylation. Clin Epigenetics 2016; 8:60. [PMID: 27222668 PMCID: PMC4877994 DOI: 10.1186/s13148-016-0227-0] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2016] [Accepted: 05/10/2016] [Indexed: 12/30/2022] Open
Abstract
BACKGROUND The quantification of global DNA methylation has been established in epigenetic screening. As more practicable alternatives to the HPLC-based gold standard, the methylation analysis of CpG islands in repeatable elements (LINE-1) and the luminometric methylation assay (LUMA) of overall 5-methylcytosine content in "CCGG" recognition sites are most widely used. Both methods are applied as virtually equivalent, despite the hints that their results only partly agree. This triggered the present agreement assessments. RESULTS Three different human cell types (cultured MCF7 and SHSY5Y cell lines treated with different chemical modulators of DNA methylation and whole blood drawn from pain patients and healthy volunteers) were submitted to the global DNA methylation assays employing LINE-1 or LUMA-based pyrosequencing measurements. The agreement between the two bioassays was assessed using generally accepted approaches to the statistics for laboratory method comparison studies. Although global DNA methylation levels measured by the two methods correlated, five different lines of statistical evidence consistently rejected the assumption of complete agreement. Specifically, a bias was observed between the two methods. In addition, both the magnitude and direction of bias were tissue-dependent. Interassay differences could be grouped based on Bayesian statistics, and these groups allowed in turn to re-identify the originating tissue. CONCLUSIONS Although providing partly correlated measurements of DNA methylation, interchangeability of the quantitative results obtained with LINE-1 and LUMA was jeopardized by a consistent bias between the results. Moreover, the present analyses strongly indicate a tissue specificity of the differences between the two methods.
Collapse
Affiliation(s)
- Claudia Knothe
- />Institute of Clinical Pharmacology, Goethe University, Theodor-Stern-Kai 7, 60590 Frankfurt am Main, Germany
| | - Hiromi Shiratori
- />Project Group Translational Medicine and Pharmacology TMP, Fraunhofer Institute for Molecular Biology and Applied Ecology IME, Theodor-Stern-Kai 7, 60590 Frankfurt am Main, Germany
| | - Eduard Resch
- />Project Group Translational Medicine and Pharmacology TMP, Fraunhofer Institute for Molecular Biology and Applied Ecology IME, Theodor-Stern-Kai 7, 60590 Frankfurt am Main, Germany
| | - Alfred Ultsch
- />DataBionics Research Group, University of Marburg, Hans-Meerwein-Straße, 35032 Marburg, Germany
| | - Gerd Geisslinger
- />Institute of Clinical Pharmacology, Goethe University, Theodor-Stern-Kai 7, 60590 Frankfurt am Main, Germany
- />Project Group Translational Medicine and Pharmacology TMP, Fraunhofer Institute for Molecular Biology and Applied Ecology IME, Theodor-Stern-Kai 7, 60590 Frankfurt am Main, Germany
| | - Alexandra Doehring
- />Institute of Clinical Pharmacology, Goethe University, Theodor-Stern-Kai 7, 60590 Frankfurt am Main, Germany
| | - Jörn Lötsch
- />Institute of Clinical Pharmacology, Goethe University, Theodor-Stern-Kai 7, 60590 Frankfurt am Main, Germany
- />Project Group Translational Medicine and Pharmacology TMP, Fraunhofer Institute for Molecular Biology and Applied Ecology IME, Theodor-Stern-Kai 7, 60590 Frankfurt am Main, Germany
| |
Collapse
|
18
|
Mapelli P, Aboagye EO, Stebbing J, Sharma R. Epigenetic changes in gastroenteropancreatic neuroendocrine tumours. Oncogene 2015; 34:4439-47. [PMID: 25435371 DOI: 10.1038/onc.2014.379] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2014] [Revised: 09/25/2014] [Accepted: 10/10/2014] [Indexed: 02/07/2023]
Abstract
An understanding of epigenetic drivers of tumorigenesis has developed rapidly during the last years. The identification of these changes including DNA methylation and histone modifications in gastroenteropancreatic neuroendocrine tumours (GEP-NETs) is a step forward in trying to define underlying biologic processes in this heterogeneous disease. The reversible nature of these changes represents a potential therapeutic target. We present an overview of the current knowledge of epigenetic alterations related to GEP-NETs, focusing on the influence and impact these changes have on pathogenesis and prognosis. The potential role of demethylating agents in the management of this patient population is discussed.
Collapse
Affiliation(s)
- P Mapelli
- Department of Surgery and Cancer, Faculty of Medicine, Imperial College London, Hammersmith Hospital, London, UK
| | - E O Aboagye
- Department of Surgery and Cancer, Faculty of Medicine, Imperial College London, Hammersmith Hospital, London, UK
| | - J Stebbing
- Department of Surgery and Cancer, Faculty of Medicine, Imperial College London, Hammersmith Hospital, London, UK
| | - R Sharma
- Department of Experimental Medicine, Faculty of Medicine, Imperial College London, Hammersmith Hospital, London, UK
| |
Collapse
|
19
|
Elhaik E, Graur D. A comparative study and a phylogenetic exploration of the compositional architectures of mammalian nuclear genomes. PLoS Comput Biol 2014; 10:e1003925. [PMID: 25375262 PMCID: PMC4222635 DOI: 10.1371/journal.pcbi.1003925] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2014] [Accepted: 09/18/2014] [Indexed: 11/18/2022] Open
Abstract
For the past four decades the compositional organization of the mammalian genome posed a formidable challenge to molecular evolutionists attempting to explain it from an evolutionary perspective. Unfortunately, most of the explanations adhered to the "isochore theory," which has long been rebutted. Recently, an alternative compositional domain model was proposed depicting the human and cow genomes as composed mostly of short compositionally homogeneous and nonhomogeneous domains and a few long ones. We test the validity of this model through a rigorous sequence-based analysis of eleven completely sequenced mammalian and avian genomes. Seven attributes of compositional domains are used in the analyses: (1) the number of compositional domains, (2) compositional domain-length distribution, (3) density of compositional domains, (4) genome coverage by the different domain types, (5) degree of fit to a power-law distribution, (6) compositional domain GC content, and (7) the joint distribution of GC content and length of the different domain types. We discuss the evolution of these attributes in light of two competing phylogenetic hypotheses that differ from each other in the validity of clade Euarchontoglires. If valid, the murid genome compositional organization would be a derived state and exhibit a high similarity to that of other mammals. If invalid, the murid genome compositional organization would be closer to an ancestral state. We demonstrate that the compositional organization of the murid genome differs from those of primates and laurasiatherians, a phenomenon previously termed the "murid shift," and in many ways resembles the genome of opossum. We find no support to the "isochore theory." Instead, our findings depict the mammalian genome as a tapestry of mostly short homogeneous and nonhomogeneous domains and few long ones thus providing strong evidence in favor of the compositional domain model and seem to invalidate clade Euarchontoglires.
Collapse
Affiliation(s)
- Eran Elhaik
- Department of Animal and Plant Sciences, University of Sheffield, Sheffield, United Kingdom
- * E-mail:
| | - Dan Graur
- Department of Biology & Biochemistry, University of Houston, Houston, Texas, United States of America
| |
Collapse
|
20
|
Dios F, Barturen G, Lebrón R, Rueda A, Hackenberg M, Oliver JL. DNA clustering and genome complexity. Comput Biol Chem 2014; 53 Pt A:71-8. [PMID: 25182383 DOI: 10.1016/j.compbiolchem.2014.08.011] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/11/2014] [Indexed: 01/08/2023]
Abstract
Early global measures of genome complexity (power spectra, the analysis of fluctuations in DNA walks or compositional segmentation) uncovered a high degree of complexity in eukaryotic genome sequences. The main evolutionary mechanisms leading to increases in genome complexity (i.e. gene duplication and transposon proliferation) can all potentially produce increases in DNA clustering. To quantify such clustering and provide a genome-wide description of the formed clusters, we developed GenomeCluster, an algorithm able to detect clusters of whatever genome element identified by chromosome coordinates. We obtained a detailed description of clusters for ten categories of human genome elements, including functional (genes, exons, introns), regulatory (CpG islands, TFBSs, enhancers), variant (SNPs) and repeat (Alus, LINE1) elements, as well as DNase hypersensitivity sites. For each category, we located their clusters in the human genome, then quantifying cluster length and composition, and estimated the clustering level as the proportion of clustered genome elements. In average, we found a 27% of elements in clusters, although a considerable variation occurs among different categories. Genes form the lowest number of clusters, but these are the longest ones, both in bp and the average number of components, while the shortest clusters are formed by SNPs. Functional and regulatory elements (genes, CpG islands, TFBSs, enhancers) show the highest clustering level, as compared to DNase sites, repeats (Alus, LINE1) or SNPs. Many of the genome elements we analyzed are known to be composed of clusters of low-level entities. In addition, we found here that the clusters generated by GenomeCluster can be in turn clustered into high-level super-clusters. The observation of 'clusters-within-clusters' parallels the 'domains within domains' phenomenon previously detected through global statistical methods in eukaryotic sequences, and reveals a complex human genome landscape dominated by hierarchical clustering.
Collapse
Affiliation(s)
- Francisco Dios
- Dpto. de Genética, Facultad de Ciencias, Universidad de Granada, 18071 Granada, Spain; Lab. de Bioinformática, Inst. de Biotecnología, Centro de Investigación Biomédica, 18100 Granada, Spain
| | - Guillermo Barturen
- Dpto. de Genética, Facultad de Ciencias, Universidad de Granada, 18071 Granada, Spain; Lab. de Bioinformática, Inst. de Biotecnología, Centro de Investigación Biomédica, 18100 Granada, Spain
| | - Ricardo Lebrón
- Dpto. de Genética, Facultad de Ciencias, Universidad de Granada, 18071 Granada, Spain; Lab. de Bioinformática, Inst. de Biotecnología, Centro de Investigación Biomédica, 18100 Granada, Spain
| | - Antonio Rueda
- Plataforma Andaluza de Genómica y Bioinformática (GBPA), Edificio INSUR, Calle Albert Einstein, 41092 Sevilla, Spain
| | - Michael Hackenberg
- Dpto. de Genética, Facultad de Ciencias, Universidad de Granada, 18071 Granada, Spain; Lab. de Bioinformática, Inst. de Biotecnología, Centro de Investigación Biomédica, 18100 Granada, Spain
| | - José L Oliver
- Dpto. de Genética, Facultad de Ciencias, Universidad de Granada, 18071 Granada, Spain; Lab. de Bioinformática, Inst. de Biotecnología, Centro de Investigación Biomédica, 18100 Granada, Spain.
| |
Collapse
|
21
|
Guffanti G, Gaudi S, Fallon JH, Sobell J, Potkin SG, Pato C, Macciardi F. Transposable elements and psychiatric disorders. Am J Med Genet B Neuropsychiatr Genet 2014; 165B:201-16. [PMID: 24585726 DOI: 10.1002/ajmg.b.32225] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/03/2013] [Accepted: 01/21/2014] [Indexed: 12/15/2022]
Abstract
Transposable Elements (TEs) or transposons are low-complexity elements (e.g., LINEs, SINEs, SVAs, and HERVs) that make up to two-thirds of the human genome. There is mounting evidence that TEs play an essential role in genomic architecture and regulation related to both normal function and disease states. Recently, the identification of active TEs in several different human brain regions suggests that TEs play a role in normal brain development and adult physiology and quite possibly in psychiatric disorders. TEs have been implicated in hemophilia, neurofibromatosis, and cancer. With the advent of next-generation whole-genome sequencing approaches, our understanding of the relationship between TEs and psychiatric disorders will greatly improve. We will review the biology of TEs and early evidence for TE involvement in psychiatric disorders.
Collapse
Affiliation(s)
- Guia Guffanti
- Department of Psychiatry, Columbia University, New York, New York
| | | | | | | | | | | | | |
Collapse
|
22
|
Breitbach S, Tug S, Helmig S, Zahn D, Kubiak T, Michal M, Gori T, Ehlert T, Beiter T, Simon P. Direct quantification of cell-free, circulating DNA from unpurified plasma. PLoS One 2014; 9:e87838. [PMID: 24595313 PMCID: PMC3940427 DOI: 10.1371/journal.pone.0087838] [Citation(s) in RCA: 99] [Impact Index Per Article: 9.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2013] [Accepted: 12/30/2013] [Indexed: 02/06/2023] Open
Abstract
Cell-free DNA (cfDNA) in body tissues or fluids is extensively investigated in clinical medicine and other research fields. In this article we provide a direct quantitative real-time PCR (qPCR) as a sensitive tool for the measurement of cfDNA from plasma without previous DNA extraction, which is known to be accompanied by a reduction of DNA yield. The primer sets were designed to amplify a 90 and 222 bp multi-locus L1PA2 sequence. In the first module, cfDNA concentrations in unpurified plasma were compared to cfDNA concentrations in the eluate and the flow-through of the QIAamp DNA Blood Mini Kit and in the eluate of a phenol-chloroform isoamyl (PCI) based DNA extraction, to elucidate the DNA losses during extraction. The analyses revealed 2.79-fold higher cfDNA concentrations in unpurified plasma compared to the eluate of the QIAamp DNA Blood Mini Kit, while 36.7% of the total cfDNA were found in the flow-through. The PCI procedure only performed well on samples with high cfDNA concentrations, showing 87.4% of the concentrations measured in plasma. The DNA integrity strongly depended on the sample treatment. Further qualitative analyses indicated differing fractions of cfDNA fragment lengths in the eluate of both extraction methods. In the second module, cfDNA concentrations in the plasma of 74 coronary heart disease patients were compared to cfDNA concentrations of 74 healthy controls, using the direct L1PA2 qPCR for cfDNA quantification. The patient collective showed significantly higher cfDNA levels (mean (SD) 20.1 (23.8) ng/ml; range 5.1–183.0 ng/ml) compared to the healthy controls (9.7 (4.2) ng/ml; range 1.6–23.7 ng/ml). With our direct qPCR, we recommend a simple, economic and sensitive procedure for the quantification of cfDNA concentrations from plasma that might find broad applicability, if cfDNA became an established marker in the assessment of pathophysiological conditions.
Collapse
Affiliation(s)
- Sarah Breitbach
- Department of Sports Medicine, Rehabilitation and Prevention, Johannes Gutenberg-University of Mainz, Mainz, Germany
| | - Suzan Tug
- Department of Sports Medicine, Rehabilitation and Prevention, Johannes Gutenberg-University of Mainz, Mainz, Germany
| | - Susanne Helmig
- Department of Sports Medicine, Rehabilitation and Prevention, Johannes Gutenberg-University of Mainz, Mainz, Germany
| | - Daniela Zahn
- Department of Health Psychology, Johannes Gutenberg-University of Mainz, Mainz, Germany
| | - Thomas Kubiak
- Department of Health Psychology, Johannes Gutenberg-University of Mainz, Mainz, Germany
| | - Matthias Michal
- Department of Psychosomatic Medicine and Psychotherapy, Johannes Gutenberg-University of Mainz, Mainz, Germany
| | - Tommaso Gori
- Department of Cardiology, Angiology and Internal Medicine, Johannes Gutenberg-University of Mainz, Mainz, Germany
| | - Tobias Ehlert
- Department of Sports Medicine, Rehabilitation and Prevention, Johannes Gutenberg-University of Mainz, Mainz, Germany
| | - Thomas Beiter
- Department of Sports Medicine, Medical Clinic, Eberhard-Karls-University of Tuebingen, Tuebingen, Germany
| | - Perikles Simon
- Department of Sports Medicine, Rehabilitation and Prevention, Johannes Gutenberg-University of Mainz, Mainz, Germany
- * E-mail:
| |
Collapse
|
23
|
Linker S, Hedges D. Linear decay of retrotransposon antisense bias across genes is contingent upon tissue specificity. PLoS One 2013; 8:e79402. [PMID: 24244495 PMCID: PMC3828378 DOI: 10.1371/journal.pone.0079402] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2013] [Accepted: 09/28/2013] [Indexed: 12/23/2022] Open
Abstract
Retrotransposons comprise approximately half of the human genome and contribute to chromatin structure, regulatory motifs, and protein-coding sequences. Since retrotransposon insertions can disrupt functional genetic elements as well as introduce new sequence motifs to a region, they have the potential to affect the function of genes that harbour insertions as well as those nearby. Partly as a result of these effects, the distribution of retrotransposons across the genome is non-uniform and there are observed imbalances in the orientation of insertions with respect to the transcriptional direction of the containing gene. Although some of the factors underlying the observed distributions are understood, much of the variability remains unexplained. Detailed characterization of retrotransposon density in genes could help inform predictions of the functional consequence of de novo as well as polymorphic insertions. In order to characterize the relationship between genes and inserted elements, we have examined the distribution of retrotransposons and their internal motifs within tissue-specific and housekeeping genes. We have identified that the previously established retrotransposon antisense bias decays at a linear rate across genes, resulting in an equal density of sense and antisense retrotransposons near the 3'-UTR. In addition, the decay of antisense bias across genes is less pronounced among tissue-specific genes. Our results provide support for the scenario in which this linear decay in antisense bias is established by natural selection shortly after retrotransposon integration, and that total antisense bias observed is above and beyond any bias introduced by the integration process itself. Finally, we provide an example of a retrotransposon acting as an eQTL on a coincident gene, highlighting one of several possible avenues through which insertions may modulate gene function.
Collapse
Affiliation(s)
- Sara Linker
- Hussman Institute for Human Genomics, Dr John T. Macdonald Foundation Department of Human Genetics, Miller School of Medicine, University of Miami, Miami, Florida, United States of America
| | - Dale Hedges
- Division of Human Genetics, Department of Internal Medicine, The Ohio State University, Columbus, Ohio, United States of America
| |
Collapse
|
24
|
Costantini M, Alvarez-Valin F, Costantini S, Cammarano R, Bernardi G. Compositional patterns in the genomes of unicellular eukaryotes. BMC Genomics 2013; 14:755. [PMID: 24188247 PMCID: PMC4007698 DOI: 10.1186/1471-2164-14-755] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2012] [Accepted: 10/31/2013] [Indexed: 11/29/2022] Open
Abstract
Background The genomes of multicellular eukaryotes are compartmentalized in mosaics of isochores, large and fairly homogeneous stretches of DNA that belong to a small number of families characterized by different average GC levels, by different gene concentration (that increase with GC), different chromatin structures, different replication timing in the cell cycle, and other different properties. A question raised by these basic results concerns how far back in evolution the compartmentalized organization of the eukaryotic genomes arose. Results In the present work we approached this problem by studying the compositional organization of the genomes from the unicellular eukaryotes for which full sequences are available, the sample used being representative. The average GC levels of the genomes from unicellular eukaryotes cover an extremely wide range (19%-60% GC) and the compositional patterns of individual genomes are extremely different but all genomes tested show a compositional compartmentalization. Conclusions The average GC range of the genomes of unicellular eukaryotes is very broad (as broad as that of prokaryotes) and individual compositional patterns cover a very broad range from very narrow to very complex. Both features are not surprising for organisms that are very far from each other both in terms of phylogenetic distances and of environmental life conditions. Most importantly, all genomes tested, a representative sample of all supergroups of unicellular eukaryotes, are compositionally compartmentalized, a major difference with prokaryotes.
Collapse
Affiliation(s)
- Maria Costantini
- Laboratory of Animal Physiology and Evolution, Stazione Zoologica Anton Dohrn, Villa Comunale, Naples 80121, Italy.
| | | | | | | | | |
Collapse
|
25
|
Belan E. LINEs of evidence: noncanonical DNA replication as an epigenetic determinant. Biol Direct 2013; 8:22. [PMID: 24034780 PMCID: PMC3868326 DOI: 10.1186/1745-6150-8-22] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2013] [Accepted: 09/06/2013] [Indexed: 12/17/2022] Open
Abstract
LINE-1 (L1) retrotransposons are repetitive elements in mammalian genomes. They are
capable of synthesizing DNA on their own RNA templates by harnessing reverse
transcriptase (RT) that they encode. Abundantly expressed full-length L1s and their
RT are found to globally influence gene expression profiles, differentiation state,
and proliferation capacity of early embryos and many types of cancer, albeit by yet
unknown mechanisms. They are essential for the progression of early development and
the establishment of a cancer-related undifferentiated state. This raises important
questions regarding the functional significance of L1 RT in these cell systems.
Massive nuclear L1-linked reverse transcription has been shown to occur in mouse
zygotes and two-cell embryos, and this phenomenon is purported to be DNA replication
independent. This review argues against this claim with the goal of understanding the
nature of this phenomenon and the role of L1 RT in early embryos and cancers.
Available L1 data are revisited and integrated with relevant findings accumulated in
the fields of replication timing, chromatin organization, and epigenetics, bringing
together evidence that strongly supports two new concepts. First, noncanonical
replication of a portion of genomic full-length L1s by means of L1 RNP-driven reverse
transcription is proposed to co-exist with DNA polymerase-dependent replication of
the rest of the genome during the same round of DNA replication in embryonic and
cancer cell systems. Second, the role of this mechanism is thought to be epigenetic;
it might promote transcriptional competence of neighboring genes linked to
undifferentiated states through the prevention of tethering of involved L1s to the
nuclear periphery. From the standpoint of these concepts, several hitherto
inexplicable phenomena can be explained. Testing methods for the model are
proposed.
Collapse
Affiliation(s)
- Ekaterina Belan
- Genetics Laboratory, Royal University Hospital, Saskatoon, SK S7N 0W8, Canada.
| |
Collapse
|
26
|
Klimopoulos A, Sellis D, Almirantis Y. Widespread occurrence of power-law distributions in inter-repeat distances shaped by genome dynamics. Gene 2012; 499:88-98. [PMID: 22370293 DOI: 10.1016/j.gene.2012.02.005] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2011] [Revised: 02/05/2012] [Accepted: 02/06/2012] [Indexed: 11/25/2022]
Abstract
Repetitive DNA sequences derived from transposable elements (TE) are distributed in a non-random way, co-clustering with other classes of repeat elements, genes and other genomic components. In a previous work we reported power-law-like size distributions (linearity in log-log scale) in the spatial arrangement of Alu and LINE1 elements in the human genome. Here we investigate the large-scale features of the spatial arrangement of all principal classes of TEs in 14 genomes from phylogenetically distant organisms by studying the size distribution of inter-repeat distances. Power-law-like size distributions are found to be widespread, extending up to several orders of magnitude. In order to understand the emergence of this distributional pattern, we introduce an evolutionary scenario, which includes (i) Insertions of DNA segments (e.g., more recent repeats) into the considered sequence and (ii) Eliminations of members of the studied TE family. In the proposed model we also incorporate the potential for transposition events (characteristic of the DNA transposons' life-cycle) and segmental duplications. Simulations reproduce the main features of the observed size distributions. Furthermore, we investigate the effects of various genomic features on the presence and extent of power-law size distributions including TE class and age, mode of parental TE transmission, GC content, deletion and recombination rates in the studied genomic region, etc. Our observations corroborate the hypothesis that insertions of genomic material and eliminations of repeats are at the basis of power-laws in inter-repeat distances. The existence of these power-laws could facilitate the formation of the recently proposed "fractal globule" for the confined chromatin organization.
Collapse
Affiliation(s)
- Alexandros Klimopoulos
- National Center for Scientific Research "Demokritos," Institute of Biology, 153 10 Athens, Greece.
| | | | | |
Collapse
|
27
|
Arhondakis S, Auletta F, Bernardi G. Isochores and the regulation of gene expression in the human genome. Genome Biol Evol 2012; 3:1080-9. [PMID: 21979159 PMCID: PMC3227402 DOI: 10.1093/gbe/evr017] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023] Open
Abstract
It is well established that changes in the phenotype depend much more on changes in gene expression than on changes in protein-coding genes, and that cis-regulatory sequences and chromatin structure are two major factors influencing gene expression. Here, we investigated these factors at the genome-wide level by focusing on the trinucleotide patterns in the 0.1- to 25-kb regions flanking the human genes that are present in the GC-poorest L1 and GC-richest H3 isochore families, the other families exhibiting intermediate patterns. We could show 1) that the trinucleotide patterns of the 25-kb gene-flanking regions are representative of the very different patterns already reported for the whole isochores from the L1 and H3 families and, expectedly, identical in upstream and downstream locations; 2) that the patterns of the 0.1- to 0.5-kb regions in the L1 and H3 isochores are remarkably more divergent and more specific when compared with those of the 25-kb regions, as well as different in the upstream and downstream locations; and 3) that these patterns fade into the 25-kb patterns around 5kb in both upstream and downstream locations. The 25-kb findings indicate differences in nucleosome positioning and density in different isochore families, those of the 0.1- to 0.5-kb sequences indicate differences in the transcription factors that bind upstream and downstream of genes. These results indicate differences in the regulation of genes located in different isochore families, a point of functional and evolutionary relevance.
Collapse
Affiliation(s)
- Stilianos Arhondakis
- Bioinformatics and Medical Informatics Team, Biomedical Research Foundation of the Academy of Athens, Athens, Greece
| | | | | |
Collapse
|
28
|
Advances in Research on Pseudogenes. PROG BIOCHEM BIOPHYS 2011. [DOI: 10.3724/sp.j.1206.2010.00215] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|
29
|
Liu G, Li H, Cai L. Processed pseudogenes are located preferentially in regions of low recombination rates in the human genome. J Evol Biol 2010; 23:1107-15. [PMID: 20345820 DOI: 10.1111/j.1420-9101.2010.01974.x] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/14/2023]
Abstract
The aim of this article is to demonstrate possible recombination-associated evolutionary forces affecting the genomic distribution of processed pseudogenes. The relationship between recombination rate and the distribution of processed pseudogenes is analysed in the human genome. The results show that processed pseudogenes preferentially accumulate in regions of low recombination rates and this correlation cannot be explained by indirect relationships with GC content and gene density. Several explanatory models for the observation are discussed. A model of selection against ectopic recombination is tested based on the difference in distribution pattern between two classes of processed pseudogenes, which differ in the possibility of stimulating ectopic recombination. Our results indicate that the correlation between processed pseudogene density and recombination rate is probably results, in part, from the selection against ectopic recombination between closely located homologous processed pseudogenes. We also found a length effect in processed pseudogene distribution, namely long processed pseudogenes are located more preferentially in regions of low recombination rates than short ones.
Collapse
Affiliation(s)
- G Liu
- School of Mathematics, Physics and Biological Engineering, Inner Mongolia University of Science and Technology, Baotou, China.
| | | | | |
Collapse
|
30
|
Proviruses selected for high and stable expression of transduced genes accumulate in broadly transcribed genome areas. J Virol 2010; 84:4204-11. [PMID: 20147411 DOI: 10.1128/jvi.02511-09] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023] Open
Abstract
Retroviruses and retrovirus-derived vectors integrate nonrandomly into the genomes of host cells with specific preferences for transcribed genes, gene-rich regions, and CpG islands. However, the genomic features that influence the transcriptional activities of integrated retroviruses or retroviral vectors are poorly understood. We report here the cloning and characterization of avian sarcoma virus integration sites from chicken tumors. Growing progressively, dependent on high and stable expression of the transduced v-src oncogene, these tumors represent clonal expansions of cells bearing transcriptionally active replication-defective proviruses. Therefore, integration sites in our study distinguished genomic loci favorable for the expression of integrated retroviruses and gene transfer vectors. Analysis of integration sites from avian sarcoma virus-induced tumors showed strikingly nonrandom distribution, with proviruses found prevalently within or close to transcription units, particularly in genes broadly expressed in multiple tissues but not in tissue-specifically expressed genes. We infer that proviruses integrated in these genomic areas efficiently avoid transcriptional silencing and remain active for a long time during the growth of tumors. Defining the differences between unselected retroviral integration sites and sites selected for long-terminal-repeat-driven gene expression is relevant for retrovirus-mediated gene transfer and has ramifications for gene therapy.
Collapse
|
31
|
Wang XF, Jin X, Wang X, Liu J, Feng J, Yang Q, Mu W, Shi X, Lu Z. Effects of L1-ORF2 fragments on green fluorescent protein gene expression. Genet Mol Biol 2009; 32:688-96. [PMID: 21637438 PMCID: PMC3036906 DOI: 10.1590/s1415-47572009005000068] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2009] [Accepted: 05/15/2009] [Indexed: 11/22/2022] Open
Abstract
The retrotransposon known as long interspersed nuclear element-1 (L1) is 6 kb long, although most L1s in mammalian and other eukaryotic cells are truncated. L1 contains two open reading frames, ORF1 and ORF2, that code for an RNA-binding protein and a protein with endonuclease and reverse transcriptase activities, respectively. In this work, we examined the effects of full length L1-ORF2 and ORF2 fragments on green fluorescent protein gene (GFP) expression when inserted into the pEGFP-C1 vector downstream of GFP. All of the ORF2 fragments in sense orientation inhibited GFP expression more than when in antisense orientation, which suggests that small ORF2 fragments contribute to the distinct inhibitory effects of this ORF on gene expression. These results provide the first evidence that different 280-bp fragments have distinct effects on the termination of gene transcription, and that when inserted in the antisense direction, fragment 280-9 (the 3' end fragment of ORF2) induces premature termination of transcription that is consistent with the effect of ORF2.
Collapse
Affiliation(s)
- Xiu-Fang Wang
- Hebei Key Lab of Laboratory Animal, Department of Genetics, Hebei Medical University, Shijiazhuang, Hebei Province China
| | | | | | | | | | | | | | | | | |
Collapse
|
32
|
Gogvadze E, Buzdin A. Retroelements and their impact on genome evolution and functioning. Cell Mol Life Sci 2009; 66:3727-42. [PMID: 19649766 PMCID: PMC11115525 DOI: 10.1007/s00018-009-0107-2] [Citation(s) in RCA: 78] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2009] [Revised: 06/11/2009] [Accepted: 07/14/2009] [Indexed: 12/31/2022]
Abstract
Retroelements comprise a considerable fraction of eukaryotic genomes. Since their initial discovery by Barbara McClintock in maize DNA, retroelements have been found in genomes of almost all organisms. First considered as a "junk DNA" or genomic parasites, they were shown to influence genome functioning and to promote genetic innovations. For this reason, they were suggested as an important creative force in the genome evolution and adaptation of an organism to altered environmental conditions. In this review, we summarize the up-to-date knowledge of different ways of retroelement involvement in structural and functional evolution of genes and genomes, as well as the mechanisms generated by cells to control their retrotransposition.
Collapse
Affiliation(s)
- Elena Gogvadze
- Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, 16/10 Miklukho-Maklaya st, 117997 Moscow, Russia.
| | | |
Collapse
|
33
|
Abstract
Background Previous investigations from our laboratory were largely focused on the genome organization of vertebrates. We showed that these genomes are mosaics of isochores, megabase-size DNA sequences that are fairly homogeneous in base composition yet belong to a small number of families that cover a wide compositional spectrum. A question raised by these results concerned how far back in evolution an isochore organization of the eukaryotic genome arose. Results The present investigation deals with the compositional patterns of the invertebrates for which full genome sequences, or at least scaffolds, are available. We found that (i) a mosaic of isochores is the long-range organization of all the genomes that we investigated; (ii) the isochore families from the invertebrate genomes matched the corresponding families of vertebrates in GC levels; (iii) the relative amounts of isochore families were remarkably different for different genomes, except for those from phylogenetically close species, such as the Drosophilids. Conclusion This work demonstrates not only that an isochore organization is present in all metazoan genomes analyzed that included Nematodes, Arthropods among Protostomia, Echinoderms and Chordates among Deuterostomia, but also that the isochore families of invertebrates share GC levels with the corresponding families of vertebrates.
Collapse
|
34
|
Melamed E, Arnold AP. The role of LINEs and CpG islands in dosage compensation on the chicken Z chromosome. Chromosome Res 2009; 17:727-36. [PMID: 19672682 PMCID: PMC2759020 DOI: 10.1007/s10577-009-9068-4] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2009] [Accepted: 07/02/2009] [Indexed: 11/28/2022]
Abstract
Most avian Z genes are expressed more highly in ZZ males than ZW females, suggesting that chromosome-wide mechanisms of dosage compensation have not evolved. Nevertheless, a small percentage of Z genes are expressed at similar levels in males and females, an indication that a yet unidentified mechanism compensates for the sex difference in copy number. Primary DNA sequences are thought to have a role in determining chromosome gene inactivation status on the mammalian X chromosome. However, it is currently unknown whether primary DNA sequences also mediate chicken Z gene compensation status. Using a combination of chicken DNA sequences and Z gene compensation profiles of 310 genes, we explored the relationship between Z gene compensation status and primary DNA sequence features. Statistical analysis of different Z chromosomal features revealed that long interspersed nuclear elements (LINEs) and CpG islands are enriched on the Z chromosome compared with 329 other DNA features. Linear support vector machine (SVM) classifiers, using primary DNA sequences, correctly predict the Z compensation status for >60% of all Z-linked genes. CpG islands appear to be the most accurate classifier and alone can correctly predict compensation of 63% of Z genes. We also show that LINE CR1 elements are enriched 2.7-fold on the chicken Z chromosome compared with autosomes and that chicken chromosomal length is highly correlated with percentage LINE content. However, the position of LINE elements is not significantly associated with dosage compensation status of Z genes. We also find a trend for a higher proportion of CpG islands in the region of the Z chromosome with the fewest dosage-compensated genes compared with the region containing the greatest concentration of compensated genes. Comparison between chicken and platypus genomes shows that LINE elements are not enriched on sex chromosomes in platypus, indicating that LINE accumulation is not a feature of all sex chromosomes. Our results suggest that CpG islands are not randomly distributed on the Z chromosome and may influence Z gene dosage compensation status.
Collapse
Affiliation(s)
- Esther Melamed
- Department of Physiological Science and Laboratory of Neuroendocrinology of the Brain Research Institute, University of California, 621 Charles Young Drive South, Los Angeles, CA 90095-1606, USA.
| | | |
Collapse
|
35
|
Power-laws in the genomic distribution of coding segments in several organisms: an evolutionary trace of segmental duplications, possible paleopolyploidy and gene loss. Gene 2009; 447:18-28. [PMID: 19591912 DOI: 10.1016/j.gene.2009.04.028] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2008] [Revised: 03/18/2009] [Accepted: 04/08/2009] [Indexed: 02/02/2023]
Abstract
Large-scale features of the spatial arrangement of protein-coding segments (PCS) are investigated by means of the inter-PCS spacers' size distributions, which have been found to follow power-laws. Linearity in double-logarithmic scale extends to several orders of magnitude in the genomes of organisms as disparate as mammals, insects and plants. This feature is also present in the most compact eukaryotic genomes and in half of the examined bacteria, despite their very limited non-coding space. We have tried to determine the sequence of events in the course of genomes' evolution which may account for the formation of the observed size distributions. The proposed mechanism essentially includes two types of events: (i) segmental duplications (and possibly paleopolyploidy), and (ii) the subsequent loss of most of the duplicated genes. It is shown by computer simulations that the formulated scenario generates power-law-like inter-PCS spacers' size distributions, which remain robust for a variety of parameter choices, even if insertion of external sequences, such as viruses or proliferating retroelements is included. Moreover, power-laws are preserved after most of the non-coding DNA has been removed, thus explaining the finding of this pattern in genomes as compact as that of Takifugu rubripes.
Collapse
|
36
|
Expansión clónica y caracterización genómica del proceso de integración del virus linfotrópico humano tipo I en la leucemia/linfoma de células T en adultos. BIOMEDICA 2009. [DOI: 10.7705/biomedica.v29i2.24] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
|
37
|
Analysis of transposon interruptions suggests selection for L1 elements on the X chromosome. PLoS Genet 2008; 4:e1000172. [PMID: 18769724 PMCID: PMC2517846 DOI: 10.1371/journal.pgen.1000172] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2008] [Accepted: 07/17/2008] [Indexed: 01/02/2023] Open
Abstract
It has been hypothesised that the massive accumulation of L1 transposable elements on the X chromosome is due to their function in X inactivation, and that the accumulation of Alu elements near genes is adaptive. We tested the possible selective advantage of these two transposable element (TE) families with a novel method, interruption analysis. In mammalian genomes, a large number of TEs interrupt other TEs due to the high overall abundance and age of repeats, and these interruptions can be used to test whether TEs are selectively neutral. Interruptions of TEs, which are beneficial for the host, are expected to be deleterious and underrepresented compared with neutral ones. We found that L1 elements in the regions of the X chromosome that contain the majority of the inactivated genes are significantly less frequently interrupted than on the autosomes, while L1s near genes that escape inactivation are interrupted with higher frequency, supporting the hypothesis that L1s on the X chromosome play a role in its inactivation. In addition, we show that TEs are less frequently interrupted in introns than in intergenic regions, probably due to selection against the expansion of introns, but the insertion pattern of Alus is comparable to other repeats. Recent experimental findings (for example the ENCODE project) show that many functional non-coding regions of genomes are not conserved across species, making the in-silico discovery of such regions challenging. Transposable elements (TEs), which represent 45 percent of the human genome and typically show no sequence conservation, are particularly intriguing from this point of view, because the highly nonrandom genomic distribution of many TE families in genomes has led to hypotheses that their presence is adaptive and have an epigenetic (regulatory) function. We use a novel approach based on the analysis of interrupted TEs to investigate if repeats are under selection that does not rely on sequence conservation. L1 elements, the most active transposable elements of the human genome, are highly overrepresented on the X-chromosome and were suggested to enhance its inactivation in mammals. We find that the interruption pattern of L1 repeats indicates a function for L1 elements in the inactivation of the mammalian X chromosome. Additionally, we show that a considerable fraction of TEs in introns are under selection for integrity, possibly due to selection on intron size or on TEs themselves.
Collapse
|
38
|
Tomilin NV. Regulation of mammalian gene expression by retroelements and non-coding tandem repeats. Bioessays 2008; 30:338-48. [PMID: 18348251 DOI: 10.1002/bies.20741] [Citation(s) in RCA: 56] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
Genomes of higher eukaryotes contain abundant non-coding repeated sequences whose overall biological impact is unclear. They comprise two categories. The first consists of retrotransposon-derived elements. These are three major families of retroelements (LINEs, SINEs and LTRs). SINEs are clustered in gene-rich regions and are found in promoters of genes while LINEs are concentrated in gene-poor regions and are depleted from promoters. The second class consists of non-coding tandem repeats (satellite DNAs and TTAGGG arrays), which are associated with mammalian centromeres, heterochromatin and telomeres. Terminal TTAGGG arrays are involved in telomere capping and satellite DNAs are located in heterochromatin, which is implicated in transcription silencing by gene repositioning (relocalization). It is unknown whether interstitial TTAGGG sequences, which are present in many vertebrates, have a function. Here, evidence will be presented that retroelements and TTAGGG arrays are involved in regulation of gene expression. Retroelements can provide binding sites for transcription factors and protect promoter CpG islands from repressive chromatin modifications, and may be also involved in nuclear compartmentalization of transcriptionally active and inactive domains. Interstitial telomere-like sequences can form dynamically maintained three-dimensional nuclear networks of transcriptionally inactive domains, which may be involved in transcription silencing like classic heterochromatin.
Collapse
Affiliation(s)
- Nikolai V Tomilin
- Institute of Cytology, Russian Academy of Sciences, 194064 St.Petersburg, Tikchoretskii Av. 4, Russia.
| |
Collapse
|
39
|
Urrutia AO, Ocaña LB, Hurst LD. Do Alu repeats drive the evolution of the primate transcriptome? Genome Biol 2008; 9:R25. [PMID: 18241332 PMCID: PMC2374697 DOI: 10.1186/gb-2008-9-2-r25] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2007] [Revised: 01/02/2008] [Accepted: 02/01/2008] [Indexed: 12/17/2022] Open
Abstract
BACKGROUND Of all repetitive elements in the human genome, Alus are unusual in being enriched near to genes that are expressed across a broad range of tissues. This has led to the proposal that Alus might be modifying the expression breadth of neighboring genes, possibly by providing CpG islands, modifying transcription factor binding, or altering chromatin structure. Here we consider whether Alus have increased expression breadth of genes in their vicinity. RESULTS Contrary to the modification hypothesis, we find that those genes that have always had broad expression are richest in Alus, whereas those that are more likely to have become more broadly expressed have lower enrichment. This finding is consistent with a model in which Alus accumulate near broadly expressed genes but do not affect their expression breadth. Furthermore, this model is consistent with the finding that expression breadth of mouse genes predicts Alu density near their human orthologs. However, Alus were found to be related to some alternative measures of transcription profile divergence, although evidence is contradictory as to whether Alus associate with lowly or highly diverged genes. If Alu have any effect it is not by provision of CpG islands, because they are especially rare near to transcriptional start sites. Previously reported Alu enrichment for genes serving certain cellular functions, suggested to be evidence of functional importance of Alus, appears to be partly a byproduct of the association with broadly expressed genes. CONCLUSION The abundance of Alu near broadly expressed genes is better explained by their preferential preservation near to housekeeping genes rather than by a modifying effect on expression of genes.
Collapse
Affiliation(s)
- Araxi O Urrutia
- Department of Biology and Biochemistry, University of Bath, Bath, BA4 7AY, UK.
| | | | | |
Collapse
|
40
|
Biased distributions and decay of long interspersed nuclear elements in the chicken genome. Genetics 2007; 178:573-81. [PMID: 17947446 DOI: 10.1534/genetics.106.061861] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open
Abstract
The genomes of birds are much smaller than mammalian genomes, and transposable elements (TEs) make up only 10% of the chicken genome, compared with the 45% of the human genome. To study the mechanisms that constrain the copy numbers of TEs, and as a consequence the genome size of birds, we analyzed the distributions of LINEs (CR1's) and SINEs (MIRs) on the chicken autosomes and Z chromosome. We show that (1) CR1 repeats are longest on the Z chromosome and their length is negatively correlated with the local GC content; (2) the decay of CR1 elements is highly biased, and the 5'-ends of the insertions are lost much faster than their 3'-ends; (3) the GC distribution of CR1 repeats shows a bimodal pattern with repeats enriched in both AT-rich and GC-rich regions of the genome, but the CR1 families show large differences in their GC distribution; and (4) the few MIRs in the chicken are most abundant in regions with intermediate GC content. Our results indicate that the primary mechanism that removes repeats from the chicken genome is ectopic exchange and that the low abundance of repeats in avian genomes is likely to be the consequence of their high recombination rates.
Collapse
|
41
|
Sellis D, Provata A, Almirantis Y. Alu and LINE1 distributions in the human chromosomes: evidence of global genomic organization expressed in the form of power laws. Mol Biol Evol 2007; 24:2385-99. [PMID: 17728280 DOI: 10.1093/molbev/msm181] [Citation(s) in RCA: 37] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022] Open
Abstract
Spatial distribution and clustering of repetitive elements are extensively studied during the last years, as well as their colocalization with other genomic components. Here we investigate the large-scale features of Alu and LINE1 spatial arrangement in the human genome by studying the size distribution of interrepeat distances. In most cases, we have found power-law size distributions extending in several orders of magnitude. We have also studied the correlations of the extent of the power law (linear region in double-logarithmic scale) and of the corresponding exponent (slope) with other genomic properties. A model has been formulated to explain the formation of the observed power laws. According to the model, 2 kinds of events occur repetitively in evolutionary time: random insertion of several types of intruding sequences and occasional loss of repeats belonging to the initial population due to "elimination" events. This simple mechanism is shown to reproduce the observed power-law size distributions and is compatible with our present knowledge on the dynamics of repeat proliferation in the genome.
Collapse
Affiliation(s)
- Diamantis Sellis
- National Center for Scientific Research Demokritos, Institute of Biology, Athens, Greece
| | | | | |
Collapse
|
42
|
Gu W, Ray DA, Walker JA, Barnes EW, Gentles AJ, Samollow PB, Jurka J, Batzer MA, Pollock DD. SINEs, evolution and genome structure in the opossum. Gene 2007; 396:46-58. [PMID: 17442506 DOI: 10.1016/j.gene.2007.02.028] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2006] [Revised: 02/15/2007] [Accepted: 02/19/2007] [Indexed: 10/23/2022]
Abstract
Short INterspersed Elements (SINEs) are non-autonomous retrotransposons, usually between 100 and 500 base pairs (bp) in length, which are ubiquitous components of eukaryotic genomes. Their activity, distribution, and evolution can be highly informative on genomic structure and evolutionary processes. To determine recent activity, we amplified more than one hundred SINE1 loci in a panel of 43 M. domestica individuals derived from five diverse geographic locations. The SINE1 family has expanded recently enough that many loci were polymorphic, and the SINE1 insertion-based genetic distances among populations reflected geographic distance. Genome-wide comparisons of SINE1 densities and GC content revealed that high SINE1 density is associated with high GC content in a few long and many short spans. Young SINE1s, whether fixed or polymorphic, showed an unbiased GC content preference for insertion, indicating that the GC preference accumulates over long time periods, possibly in periodic bursts. SINE1 evolution is thus broadly similar to human Alu evolution, although it has an independent origin. High GC content adjacent to SINE1s is strongly correlated with bias towards higher AT to GC substitutions and lower GC to AT substitutions. This is consistent with biased gene conversion, and also indicates that like chickens, but unlike eutherian mammals, GC content heterogeneity (isochore structure) is reinforced by substitution processes in the M. domestica genome. Nevertheless, both high and low GC content regions are apparently headed towards lower GC content equilibria, possibly due to a relative shift to lower recombination rates in the recent Monodelphis ancestral lineage. Like eutherians, metatherian (marsupial) mammals have evolved high CpG substitution rates, but this is apparently a convergence in process rather than a shared ancestral state.
Collapse
Affiliation(s)
- Wanjun Gu
- Department of Biochemistry and Molecular Genetics, University of Colorado School of Medicine, Aurora, CO 80045, USA
| | | | | | | | | | | | | | | | | |
Collapse
|
43
|
Analysis of the features and source gene composition of the AluYg6 subfamily of human retrotransposons. BMC Evol Biol 2007; 7:102. [PMID: 17603915 PMCID: PMC1925064 DOI: 10.1186/1471-2148-7-102] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2007] [Accepted: 07/01/2007] [Indexed: 11/19/2022] Open
Abstract
Background Alu elements are a family of SINE retrotransposons in primates. They are classified into subfamilies according to specific diagnostic mutations from the general Alu consensus. It is now believed that there may be several retrotranspositionally-competent source genes within an Alu subfamily. To investigate the evolution of young Alu elements it is critical to have access to complete subfamilies, which, following the release of the final human genome assembly, can now be obtained using in silico methods. Results 380 elements belonging to the young AluYg6 subfamily were identified in the human genome, a number significantly exceeding prior expectations. An AluYg6 element was also identified in the chimpanzee genome, indicating that the subfamily is older than previously estimated, and appears to have undergone a period of dormancy before its expansion. The relative contributions of back mutation and gene conversion to variation at the six diagnostic positions are examined, and cases of complete forward gene conversion events are reported. Two small subfamilies derived from AluYg6 have been identified, named AluYg6a2 and AluYg5b3, which contain 40 and 27 members, respectively. These small subfamilies are used to illustrate the ambiguity regarding Alu subfamily definition, and to assess the contribution of secondary source genes to the AluYg6 subfamily. Conclusion The number of elements in the AluYg6 subfamily greatly exceeds prior expectations, indicating that the current knowledge of young Alu subfamilies is incomplete, and that prior analyses that have been carried out using these data may have generated inaccurate results. A definition of primary and secondary source genes has been provided, and it has been shown that several source genes have contributed to the proliferation of the AluYg6 subfamily. Access to the sequence data for the complete AluYg6 subfamily will be invaluable in future computational analyses investigating the evolution of young Alu subfamilies.
Collapse
|
44
|
Choi IS, Estecio MRH, Nagano Y, Kim DH, White JA, Yao JC, Issa JPJ, Rashid A. Hypomethylation of LINE-1 and Alu in well-differentiated neuroendocrine tumors (pancreatic endocrine tumors and carcinoid tumors). Mod Pathol 2007; 20:802-10. [PMID: 17483816 DOI: 10.1038/modpathol.3800825] [Citation(s) in RCA: 115] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
Neuroendocrine tumors including carcinoid tumors and pancreatic endocrine tumors are uncommon, and the genetic alterations in these indolent tumors are not well characterized. We studied global hypomethylation by analyzing long interspersed nucleotide elements (LINE)-1 and Alu methylation using pyrosequencing in 35 neuroendocrine tumors and corresponding normal tissue. The tumor samples were less methylated than normal tissue at LINE-1 (P=0.04) and Alu (P=0.001). The mean relative tumor hypomethylation (difference in methylation between normal tissue and in tumor) was 11.5+/-10.0 for LINE-1 and 5.8+/-6.4 for Alu, and were correlated with each other (correlation coefficient 0.6, P=0.001). Relative tumor hypomethylation of LINE-1 was higher in ileal carcinoid tumors than in non-ileal carcinoid tumors and pancreatic endocrine tumors (P=0.047), and tumors with lymph node metastasis (P=0.02), chromosome 18 loss (P=0.001) and RAS-association domain family 1, isoform A gene methylation (P=0.02). Alu methylation in tumors was inversely correlated with methylation of O(6)-methyl-guanine methyltransferase gene (P=0.02). Our study shows that hypomethylation is more common in carcinoid tumors than in pancreatic endocrine tumors and is associated with clinicopathologic features, and genetic and epigenetic alterations in these tumors, including lymph node metastasis.
Collapse
Affiliation(s)
- In-Seon Choi
- Department of Pathology, The University of Texas MD Anderson Cancer Center, Houston, TX 77030-4095, USA
| | | | | | | | | | | | | | | |
Collapse
|
45
|
Gentles AJ, Wakefield MJ, Kohany O, Gu W, Batzer MA, Pollock DD, Jurka J. Evolutionary dynamics of transposable elements in the short-tailed opossum Monodelphis domestica. Genome Res 2007; 17:992-1004. [PMID: 17495012 PMCID: PMC1899126 DOI: 10.1101/gr.6070707] [Citation(s) in RCA: 112] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
The genome of the gray short-tailed opossum Monodelphis domestica is notable for its large size ( approximately 3.6 Gb). We characterized nearly 500 families of interspersed repeats from the Monodelphis. They cover approximately 52% of the genome, higher than in any other amniotic lineage studied to date, and may account for the unusually large genome size. In comparison to other mammals, Monodelphis is significantly rich in non-LTR retrotransposons from the LINE-1, CR1, and RTE families, with >29% of the genome sequence comprised of copies of these elements. Monodelphis has at least four families of RTE, and we report support for horizontal transfer of this non-LTR retrotransposon. In addition to short interspersed elements (SINEs) mobilized by L1, we found several families of SINEs that appear to use RTE elements for mobilization. In contrast to L1-mobilized SINEs, the RTE-mobilized SINEs in Monodelphis appear to shift from G+C-rich to G+C-low regions with time. Endogenous retroviruses have colonized approximately 10% of the opossum genome. We found that their density is enhanced in centromeric and/or telomeric regions of most Monodelphis chromosomes. We identified 83 new families of ancient repeats that are highly conserved across amniotic lineages, including 14 LINE-derived repeats; and a novel SINE element, MER131, that may have been exapted as a highly conserved functional noncoding RNA, and whose emergence dates back to approximately 300 million years ago. Many of these conserved repeats are also present in human, and are highly over-represented in predicted cis-regulatory modules. Seventy-six of the 83 families are present in chicken in addition to mammals.
Collapse
Affiliation(s)
- Andrew J. Gentles
- Department of Radiology, School of Medicine, Stanford University, Stanford, California 94305, USA
- Genetic Information Research Institute, Mountain View, California 94043, USA
- Corresponding authors.E-mail ; fax (650) 723-5795.E-mail ; fax (650) 961-4473
| | - Matthew J. Wakefield
- ARC Centre for Kangaroo Genomics, Walter and Eliza Hall Institute of Medical Research, Parkville, Victoria 3050, Australia
| | - Oleksiy Kohany
- Genetic Information Research Institute, Mountain View, California 94043, USA
| | - Wanjun Gu
- Department of Biochemistry and Molecular Genetics, University of Colorado Health Sciences Center, Aurora 80045, Colorado, USA
| | - Mark A. Batzer
- Department of Biological Sciences, Biological Computation and Visualization Center, Center for BioModular Multi-Scale Systems, Louisiana State University, Baton Rouge, Louisiana 70803, USA
| | - David D. Pollock
- Department of Biochemistry and Molecular Genetics, University of Colorado Health Sciences Center, Aurora 80045, Colorado, USA
| | - Jerzy Jurka
- Genetic Information Research Institute, Mountain View, California 94043, USA
- Corresponding authors.E-mail ; fax (650) 723-5795.E-mail ; fax (650) 961-4473
| |
Collapse
|
46
|
Costantini M, Clay O, Federico C, Saccone S, Auletta F, Bernardi G. Human chromosomal bands: nested structure, high-definition map and molecular basis. Chromosoma 2006; 116:29-40. [PMID: 17072634 DOI: 10.1007/s00412-006-0078-0] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2006] [Accepted: 08/15/2006] [Indexed: 10/24/2022]
Abstract
In this paper, we report investigations on the nested structure, the high-definition mapping, and the molecular basis of the classical Giemsa and Reverse bands in human chromosomes. We found the rules according to which the approximately 3,200 isochores of the human genome are assembled in high (850-band) resolution bands, and the latter in low (400-band) resolution bands, so forming the nested mosaic structure of chromosomes. Moreover, we identified the borders of both sets of chromosomal bands at the DNA sequence level on the basis of our recent map of isochores, which represent the highest-resolution, ultimate bands. Indeed, beyond the 100-kb resolution of the isochore map, the guanine and cytosine (GC) profile of DNA becomes turbulent owing to the contribution of specific sequences such as exons, introns, interspersed repeats, CpG islands, etc. The isochore-based level of definition (100 kb) of chromosomal bands is much higher than the cytogenetic definition level (2-3 Mb). The major conclusions of this work concern the high degree of order found in the structure of chromosomal bands, their mapping at a high definition, and the solution of the long-standing problem of the molecular basis of chromosomal bands, as these could be defined on the basis of compositional DNA properties alone.
Collapse
Affiliation(s)
- Maria Costantini
- Laboratory of Molecular Evolution, Stazione Zoologica Anton Dohrn, Naples, Italy
| | | | | | | | | | | |
Collapse
|
47
|
Eller CD, Regelson M, Merriman B, Nelson S, Horvath S, Marahrens Y. Repetitive sequence environment distinguishes housekeeping genes. Gene 2006; 390:153-65. [PMID: 17141428 PMCID: PMC1857324 DOI: 10.1016/j.gene.2006.09.018] [Citation(s) in RCA: 44] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2006] [Revised: 09/18/2006] [Accepted: 09/24/2006] [Indexed: 12/14/2022]
Abstract
Housekeeping genes are expressed across a wide variety of tissues. Since repetitive sequences have been reported to influence the expression of individual genes, we employed a novel approach to determine whether housekeeping genes can be distinguished from tissue-specific genes by their repetitive sequence context. We show that Alu elements are more highly concentrated around housekeeping genes while various longer (>400-bp) repetitive sequences ("repeats"), including Long Interspersed Nuclear Element-1 (LINE-1) elements, are excluded from these regions. We further show that isochore membership does not distinguish housekeeping genes from tissue-specific genes and that repetitive sequence environment distinguishes housekeeping genes from tissue-specific genes in every isochore. The distinct repetitive sequence environment, in combination with other previously published sequence properties of housekeeping genes, was used to develop a method of predicting housekeeping genes on the basis of DNA sequence alone. Using expression across tissue types as a measure of success, we demonstrate that repetitive sequence environment is by far the most important sequence feature identified to date for distinguishing housekeeping genes.
Collapse
Affiliation(s)
- C. Daniel Eller
- UCLA Department of Human Genetics David Geffen School of Medicine, Gonda Center, 695 E. Young Drive South, Los Angeles, California 90095-7088, USA
| | - Moira Regelson
- UCLA Department of Human Genetics David Geffen School of Medicine, Gonda Center, 695 E. Young Drive South, Los Angeles, California 90095-7088, USA
| | - Barry Merriman
- UCLA Department of Human Genetics David Geffen School of Medicine, Gonda Center, 695 E. Young Drive South, Los Angeles, California 90095-7088, USA
| | - Stan Nelson
- UCLA Department of Human Genetics David Geffen School of Medicine, Gonda Center, 695 E. Young Drive South, Los Angeles, California 90095-7088, USA
| | - Steve Horvath
- UCLA Department of Human Genetics David Geffen School of Medicine, Gonda Center, 695 E. Young Drive South, Los Angeles, California 90095-7088, USA
- UCLA Department of Biostatistics, School of Public Health, Box 951772, Los Angeles, California 90095-1772, USA
| | - York Marahrens
- UCLA Department of Human Genetics David Geffen School of Medicine, Gonda Center, 695 E. Young Drive South, Los Angeles, California 90095-7088, USA
- * to whom correspondence should be addressed: York Marahrens, UCLA Department of Human Genetics, Gonda Center, Room 4554b, 695 Charles E. Young Drive, Los Angeles, CA 90095, USA, Phone: (310) 267-2466, Fax: (310) 794-5446, E-mail:
| |
Collapse
|
48
|
Vieira A. Atypical mutational properties of human chromosome 21 suggested by comparative genome-scale analyses. Cytogenet Genome Res 2006; 115:30-4. [PMID: 16974081 DOI: 10.1159/000094798] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2006] [Accepted: 02/03/2006] [Indexed: 11/19/2022] Open
Abstract
Mutation of genetic material is a necessary component of evolutionary change. There is evidence for both intragenome and intergenome heterogeneity in terms of mutation frequencies. Reported comparisons of DNA sequence differences between human and chimpanzee (Pan troglodytes) suggest that human chromosome 21 may exhibit mutational hypervariability relative to the other autosomes. In the present study, further evidence is provided for such hypervariability based on large-scale analyses of amino acid composition of (translated) human genes and pseudogenes. A comparison of the variation in the above cases (i.e., DNA sequence differences and amino acid composition differences) yields similar ratios (1.2-1.4) for chromosome 21 relative to the other autosomes, e.g., human chromosome 22 - an autosome that is more typical in this respect and is of similar size to 21. Human chromosome 21 is also presented in this study as being atypical in terms of reported associations between mutation rates and GC content or CpG dinucleotides. In terms of GC distribution patterns, a comparison of NT_011512 and NT_011520 contigs revealed a lower heterogeneity for human chromosome 21 relative to 22. Possible hypermutability of chromosome 21 is further discussed in the context of GC patterns, reported long interspersed nuclear element content (LINE1s), and the implications of these parameters for chromatin structure.
Collapse
Affiliation(s)
- A Vieira
- Endocrine and Metabolic Research Laboratory, Faculty of Applied Sciences, Simon Fraser University, Burnaby, BC, Canada.
| |
Collapse
|
49
|
Gasior SL, Preston G, Hedges DJ, Gilbert N, Moran JV, Deininger PL. Characterization of pre-insertion loci of de novo L1 insertions. Gene 2006; 390:190-8. [PMID: 17067767 PMCID: PMC1850991 DOI: 10.1016/j.gene.2006.08.024] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2006] [Revised: 08/21/2006] [Accepted: 08/22/2006] [Indexed: 10/24/2022]
Abstract
The human Long Interspersed Element-1 (LINE-1) and the Short Interspersed Element (SINE) Alu comprise 28% of the human genome. They share the same L1-encoded endonuclease for insertion, which recognizes an A+T-rich sequence. Under a simple model of insertion distribution, this nucleotide preference would lead to the prediction that the populations of both elements would be biased towards A+T-rich regions. Genomic L1 elements do show an A+T-rich bias. In contrast, Alu is biased towards G+C-rich regions when compared to the genome average. Several analyses have demonstrated that relatively recent insertions of both elements show less G+C content bias relative to older elements. We have analyzed the repetitive element and G+C composition of more than 100 pre-insertion loci derived from de novo L1 insertions in cultured human cancer cells, which should represent an evolutionarily unbiased set of insertions. An A+T-rich bias is observed in the 50 bp flanking the endonuclease target site, consistent with the known target site for the L1 endonuclease. The L1, Alu, and G+C content of 20 kb of the de novo pre-insertion loci shows a different set of biases than that observed for fixed L1s in the human genome. In contrast to the insertion sites of genomic L1s, the de novo L1 pre-insertion loci are relatively L1-poor, Alu-rich and G+C neutral. Finally, a statistically significant cluster of de novo L1 insertions was localized in the vicinity of the c-myc gene. These results suggest that the initial insertion preference of L1, while A+T-rich in the initial vicinity of the break site, can be influenced by the broader content of the flanking genomic region and have implications for understanding the dynamics of L1 and Alu distributions in the human genome.
Collapse
Affiliation(s)
- Stephen L. Gasior
- Tulane Cancer Center and Dept. of Epidemiology, Tulane University Health Sciences Center SL-66, 1430 Tulane Ave., New Orleans, LA 70112, Phone: (504) 988-6385, Fax: (504) 988-5516,
| | - Graeme Preston
- Tulane Cancer Center and Dept. of Epidemiology, Tulane University Health Sciences Center SL-66, 1430 Tulane Ave., New Orleans, LA 70112, Phone: (504) 988-6385, Fax: (504) 988-5516,
| | - Dale J. Hedges
- Tulane Cancer Center and Dept. of Epidemiology, Tulane University Health Sciences Center SL-66, 1430 Tulane Ave., New Orleans, LA 70112, Phone: (504) 988-6385, Fax: (504) 988-5516,
| | - Nicolas Gilbert
- Institut de Génétique Humaine, CNRS, UPR 1142, 141 rue de la Cardonille, 34396 Montpellier cedex 5, France
| | - John V. Moran
- Departments of Human Genetics and Internal Medicine, 1241 E. Catherine St., University of Michigan Medical School, Ann Arbor, Michigan 48109-0618
| | - Prescott L. Deininger
- Tulane Cancer Center and Dept. of Epidemiology, Tulane University Health Sciences Center SL-66, 1430 Tulane Ave., New Orleans, LA 70112, Phone: (504) 988-6385, Fax: (504) 988-5516,
- *Address for Correspondence: Tulane Cancer Center, SL66, Tulane University Health Sciences Center, 1430 Tulane Ave., New Orleans, LA 70112, 504-988-6385,
| |
Collapse
|
50
|
Abrusán G, Krambeck HJ. The distribution of L1 and Alu retroelements in relation to GC content on human sex chromosomes is consistent with the ectopic recombination model. J Mol Evol 2006; 63:484-92. [PMID: 16955238 DOI: 10.1007/s00239-005-0275-0] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2005] [Accepted: 05/30/2006] [Indexed: 10/24/2022]
Abstract
The distribution of Alu and L1 retroelements in the human genome changes with their age. Active retroelements target AT-rich regions, but their frequency increases in GC- and gene-rich regions of the genome with increasing age of the insertions. Currently there is no consensus on the mechanism generating this pattern. In this paper we test the hypothesis that selection against deleterious deletions caused by ectopic recombination between repeats is the main cause of the inhomogeneous distribution of L1s and Alus, by means of a detailed analysis of the GC distribution of the repeats on the sex chromosomes. We show that (1) unlike on the autosomes and X chromosome, L1s do not accumulate on the Y chromosome in GC-rich regions, whereas Alus accumulate there to a minor extent; (2) on the Y chromosome Alu and L1 densities are positively correlated, unlike the negative correlation on other chromosomes; and (3) in gene-poor regions of chromosome 4 and X, the distribution of Alus and L1s does not shift toward GC-rich regions. In addition, we show that although local GC content of long L1 insertions is lower than average, their selective loss from recombining chromosomes is not the main cause of the enrichment of ancient L1s in GC-rich regions. The results support the hypothesis that ectopic recombination causes the shift of Alu and L1 distributions toward the gene-rich regions of the genome.
Collapse
Affiliation(s)
- György Abrusán
- Max Planck Institute of Limnology, Department of Ecophysiology, August Thienemann Str. 2, 24306, Plön, Germany,
| | | |
Collapse
|