1
Roberts M, Josephs EB. Previously unmeasured genetic diversity explains part of Lewontin's paradox in a k-mer-based meta-analysis of 112 plant species. bioRxiv 2024:2024.05.17.594778. [PMID: 38798362 PMCID: PMC11118579 DOI: 10.1101/2024.05.17.594778] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 05/29/2024]
Abstract
At the molecular level, most evolution is expected to be neutral. A key prediction of this expectation is that the level of genetic diversity in a population should scale with population size. However, as was noted by Richard Lewontin in 1974 and reaffirmed by later studies, the slope of the population size-diversity relationship in nature is much weaker than expected under neutral theory. We hypothesize that one contributor to this paradox is that current methods relying on single nucleotide polymorphisms (SNPs) called from aligning short reads to a reference genome underestimate levels of genetic diversity in many species. To test this idea, we calculated nucleotide diversity (π) and k-mer-based metrics of genetic diversity across 112 plant species, amounting to over 205 terabases of DNA sequencing data from 27,488 individual plants. We then compared how these different metrics correlated with proxies of population size that account for both range size and population density variation across species. We found that our population size proxies scaled anywhere from about 3 to over 20 times faster with k-mer diversity than nucleotide diversity after adjusting for evolutionary history, mating system, life cycle habit, cultivation status, and invasiveness. The relationship between k-mer diversity and population size proxies also remains significant after correcting for genome size, whereas the analogous relationship for nucleotide diversity does not. These results suggest that variation not captured by common SNP-based analyses explains part of Lewontin's paradox in plants.
2
Choudalakis M, Bashtrykov P, Jeltsch A. RepEnTools: an automated repeat enrichment analysis package for ChIP-seq data reveals hUHRF1 Tandem-Tudor domain enrichment in young repeats. Mob DNA 2024; 15:6. [PMID: 38570859 PMCID: PMC10988844 DOI: 10.1186/s13100-024-00315-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/03/2023] [Accepted: 03/05/2024] [Indexed: 04/05/2024] Open
Abstract
BACKGROUND: Repeat elements (REs) play important roles for cell function in health and disease. However, RE enrichment analysis in short-read high-throughput sequencing (HTS) data, such as ChIP-seq, is a challenging task.
RESULTS: Here, we present RepEnTools, a software package for genome-wide RE enrichment analysis of ChIP-seq and similar chromatin pulldown experiments. Our analysis package bundles together various software with carefully chosen and validated settings to provide a complete solution for RE analysis, starting from raw input files to tabular and graphical outputs. RepEnTools implementations are easily accessible even with minimal IT skills (Galaxy/UNIX). To demonstrate the performance of RepEnTools, we analysed chromatin pulldown data by the human UHRF1 TTD protein domain and discovered enrichment of TTD binding on young primate and hominid specific polymorphic repeats (SVA, L1PA1/L1HS) overlapping known enhancers and decorated with H3K4me1-K9me2/3 modifications. We corroborated these new bioinformatic findings with experimental data by qPCR assays using newly developed primate and hominid specific qPCR assays which complement similar research tools. Finally, we analysed mouse UHRF1 ChIP-seq data with RepEnTools and showed that the endogenous mUHRF1 protein colocalizes with H3K4me1-H3K9me3 on promoters of REs which were silenced by UHRF1. These new data suggest a functional role for UHRF1 in silencing of REs that is mediated by TTD binding to the H3K4me1-K9me3 double mark and conserved in two mammalian species.
CONCLUSIONS: RepEnTools improves the previously available programmes for RE enrichment analysis in chromatin pulldown studies by leveraging new tools, enhancing accessibility and adding some key functions. RepEnTools can analyse RE enrichment rapidly, efficiently, and accurately, providing the community with an up-to-date, reliable and accessible tool for this important type of analysis.
Affiliation(s)
- Michel Choudalakis
  - Department of Biochemistry, Institute of Biochemistry and Technical Biochemistry, University of Stuttgart, Allmandring 31, 70569, Stuttgart, Germany
- Pavel Bashtrykov
  - Department of Biochemistry, Institute of Biochemistry and Technical Biochemistry, University of Stuttgart, Allmandring 31, 70569, Stuttgart, Germany.
- Albert Jeltsch
  - Department of Biochemistry, Institute of Biochemistry and Technical Biochemistry, University of Stuttgart, Allmandring 31, 70569, Stuttgart, Germany.
3
Baril T, Galbraith J, Hayward A. Earl Grey: A Fully Automated User-Friendly Transposable Element Annotation and Analysis Pipeline. Mol Biol Evol 2024; 41:msae068. [PMID: 38577785 PMCID: PMC11003543 DOI: 10.1093/molbev/msae068] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/24/2023] [Revised: 02/20/2024] [Accepted: 03/22/2024] [Indexed: 04/06/2024] Open
Abstract
Transposable elements (TEs) are major components of eukaryotic genomes and are implicated in a range of evolutionary processes. Yet, TE annotation and characterization remain challenging, particularly for nonspecialists, since existing pipelines are typically complicated to install, run, and extract data from. Current methods of automated TE annotation are also subject to issues that reduce overall quality, particularly (i) fragmented and overlapping TE annotations, leading to erroneous estimates of TE count and coverage, and (ii) repeat models represented by short sections of total TE length, with poor capture of 5' and 3' ends. To address these issues, we present Earl Grey, a fully automated TE annotation pipeline designed for user-friendly curation and annotation of TEs in eukaryotic genome assemblies. Using nine simulated genomes and an annotation of Drosophila melanogaster, we show that Earl Grey outperforms current widely used TE annotation methodologies in ameliorating the issues mentioned above while scoring highly in benchmarking for TE annotation and classification and being robust across genomic contexts. Earl Grey provides a comprehensive and fully automated TE annotation toolkit that provides researchers with paper-ready summary figures and outputs in standard formats compatible with other bioinformatics tools. Earl Grey has a modular format, with great scope for the inclusion of additional modules focused on further quality control and tailored analyses in future releases.
Affiliation(s)
- Tobias Baril
  - Centre for Ecology and Conservation, University of Exeter, Penryn Campus, Cornwall TR10 9FE, UK
  - Laboratory of Evolutionary Genetics, Institute of Biology, University of Neuchâtel, 2000 Neuchâtel, Switzerland
- James Galbraith
  - Centre for Ecology and Conservation, University of Exeter, Penryn Campus, Cornwall TR10 9FE, UK
  - Institute of Evolutionary Biology, University of Edinburgh, Edinburgh EH9 3FL, UK
- Alex Hayward
  - Centre for Ecology and Conservation, University of Exeter, Penryn Campus, Cornwall TR10 9FE, UK
4
Lee M, Ahmad SF, Xu J. Regulation and function of transposable elements in cancer genomes. Cell Mol Life Sci 2024; 81:157. [PMID: 38556602 PMCID: PMC10982106 DOI: 10.1007/s00018-024-05195-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/03/2023] [Revised: 02/28/2024] [Accepted: 03/01/2024] [Indexed: 04/02/2024]
Abstract
Over half of human genomic DNA is composed of repetitive sequences generated throughout evolution by prolific mobile genetic parasites called transposable elements (TEs). Long disregarded as "junk" or "selfish" DNA, TEs are increasingly recognized as formative elements in genome evolution, wired intimately into the structure and function of the human genome. Advances in sequencing technologies and computational methods have ushered in an era of unprecedented insight into how TE activity impacts human biology in health and disease. Here we discuss the current views on how TEs have shaped the regulatory landscape of the human genome, how TE activity is implicated in human cancers, and how recent findings motivate novel strategies to leverage TE activity for improved cancer therapy. Given the crucial role of methodological advances in TE biology, we pair our conceptual discussions with an in-depth review of the inherent technical challenges in studying repeats, specifically related to structural variation, expression analyses, and chromatin regulation. Lastly, we provide a catalog of existing and emerging assays and bioinformatic software that altogether are enabling the most sophisticated and comprehensive investigations yet into the regulation and function of interspersed repeats in cancer genomes.
Affiliation(s)
- Michael Lee
  - Department of Pediatrics, Children's Medical Center Research Institute, University of Texas Southwestern Medical Center, 6000 Harry Hines Blvd., Dallas, TX, 75390, USA.
- Syed Farhan Ahmad
  - Department of Pathology, Center of Excellence for Leukemia Studies, St. Jude Children's Research Hospital, 262 Danny Thomas Place - MS 345, Memphis, TN, 38105, USA
- Jian Xu
  - Department of Pathology, Center of Excellence for Leukemia Studies, St. Jude Children's Research Hospital, 262 Danny Thomas Place - MS 345, Memphis, TN, 38105, USA.
5
Hénault M, Marsit S, Charron G, Landry CR. The genomic landscape of transposable elements in yeast hybrids is shaped by structural variation and genotype-specific modulation of transposition rate. eLife 2024; 12:RP89277. [PMID: 38411604 PMCID: PMC10911583 DOI: 10.7554/elife.89277] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 02/28/2024] Open
Abstract
Transposable elements (TEs) are major contributors to structural genomic variation by creating interspersed duplications of themselves. In return, structural variants (SVs) can affect the genomic distribution of TE copies and shape their load. One long-standing hypothesis states that hybridization could trigger TE mobilization and thus increase TE load in hybrids. We previously tested this hypothesis (Hénault et al., 2020) by performing a large-scale evolution experiment by mutation accumulation (MA) on multiple hybrid genotypes within and between wild populations of the yeasts Saccharomyces paradoxus and Saccharomyces cerevisiae. Using aggregate measures of TE load with short-read sequencing, we found no evidence for TE load increase in hybrid MA lines. Here, we resolve the genomes of the hybrid MA lines with long-read phasing and assembly to precisely characterize the role of SVs in shaping the TE landscape. Highly contiguous phased assemblies of 127 MA lines revealed that SV types like polyploidy, aneuploidy, and loss of heterozygosity have large impacts on the TE load. We characterized 18 de novo TE insertions, indicating that transposition only has a minor role in shaping the TE landscape in MA lines. Because the scarcity of TE mobilization in MA lines provided insufficient resolution to confidently dissect transposition rate variation in hybrids, we adapted an in vivo assay to measure transposition rates in various S. paradoxus hybrid backgrounds. We found that transposition rates are not increased by hybridization, but are modulated by many genotype-specific factors including initial TE load, TE sequence variants, and mitochondrial DNA inheritance. Our results show the multiple scales at which TE load is shaped in hybrid genomes, being highly impacted by SV dynamics and finely modulated by genotype-specific variation in transposition rates.
Affiliation(s)
- Mathieu Hénault
  - Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Québec, Canada
  - Département de biochimie, microbiologie et bioinformatique, Université Laval, Québec, Canada
  - Quebec Network for Research on Protein Function, Engineering, and Applications (PROTEO), Université Laval, Québec, Canada
  - Université Laval Big Data Research Center (BDRC_UL), Québec, Canada
- Souhir Marsit
  - Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Québec, Canada
  - Département de biochimie, microbiologie et bioinformatique, Université Laval, Québec, Canada
  - Quebec Network for Research on Protein Function, Engineering, and Applications (PROTEO), Université Laval, Québec, Canada
  - Université Laval Big Data Research Center (BDRC_UL), Québec, Canada
  - Département de biologie, Université Laval, Québec, Canada
- Guillaume Charron
  - Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Québec, Canada
  - Quebec Network for Research on Protein Function, Engineering, and Applications (PROTEO), Université Laval, Québec, Canada
  - Université Laval Big Data Research Center (BDRC_UL), Québec, Canada
  - Département de biologie, Université Laval, Québec, Canada
- Christian R Landry
  - Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Québec, Canada
  - Département de biochimie, microbiologie et bioinformatique, Université Laval, Québec, Canada
  - Quebec Network for Research on Protein Function, Engineering, and Applications (PROTEO), Université Laval, Québec, Canada
  - Université Laval Big Data Research Center (BDRC_UL), Québec, Canada
  - Département de biologie, Université Laval, Québec, Canada
6
Torres DE, Kramer HM, Tracanna V, Fiorin GL, Cook DE, Seidl MF, Thomma BPHJ. Implications of the three-dimensional chromatin organization for genome evolution in a fungal plant pathogen. Nat Commun 2024; 15:1701. [PMID: 38402218 PMCID: PMC10894299 DOI: 10.1038/s41467-024-45884-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/14/2023] [Accepted: 02/05/2024] [Indexed: 02/26/2024] Open
Abstract
The spatial organization of eukaryotic genomes is linked to their biological functions, although it is not clear how this impacts the overall evolution of a genome. Here, we uncover the three-dimensional (3D) genome organization of the phytopathogen Verticillium dahliae, known to possess distinct genomic regions, designated adaptive genomic regions (AGRs), enriched in transposable elements and genes that mediate host infection. Short-range DNA interactions form clear topologically associating domains (TADs) with gene-rich boundaries that show reduced levels of gene expression and reduced genomic variation. Intriguingly, TADs are less clearly insulated in AGRs than in the core genome. At a global scale, the genome contains bipartite long-range interactions, particularly enriched for AGRs and more generally containing segmental duplications. Notably, the patterns observed for V. dahliae are also present in other Verticillium species. Thus, our analysis links 3D genome organization to evolutionary features conserved throughout the Verticillium genus.
Affiliation(s)
- David E Torres
  - Laboratory of Phytopathology, Wageningen University and Research, Droevendaalsesteeg 1, 6708 PB, Wageningen, The Netherlands
  - Theoretical Biology & Bioinformatics Group, Department of Biology, Utrecht University, Utrecht, The Netherlands
- H Martin Kramer
  - Laboratory of Phytopathology, Wageningen University and Research, Droevendaalsesteeg 1, 6708 PB, Wageningen, The Netherlands
- Vittorio Tracanna
  - University of Cologne, Institute for Plant Sciences, Cluster of Excellence on Plant Sciences (CEPLAS), Cologne, Germany
- Gabriel L Fiorin
  - Laboratory of Phytopathology, Wageningen University and Research, Droevendaalsesteeg 1, 6708 PB, Wageningen, The Netherlands
- David E Cook
  - Laboratory of Phytopathology, Wageningen University and Research, Droevendaalsesteeg 1, 6708 PB, Wageningen, The Netherlands
  - Department of Plant Pathology, Kansas State University, 1712 Claflin Road, Manhattan, KS, USA
- Michael F Seidl
  - Laboratory of Phytopathology, Wageningen University and Research, Droevendaalsesteeg 1, 6708 PB, Wageningen, The Netherlands.
  - Theoretical Biology & Bioinformatics Group, Department of Biology, Utrecht University, Utrecht, The Netherlands.
- Bart P H J Thomma
  - Laboratory of Phytopathology, Wageningen University and Research, Droevendaalsesteeg 1, 6708 PB, Wageningen, The Netherlands.
  - University of Cologne, Institute for Plant Sciences, Cluster of Excellence on Plant Sciences (CEPLAS), Cologne, Germany.
7
Lanciano S, Philippe C, Sarkar A, Pratella D, Domrane C, Doucet AJ, van Essen D, Saccani S, Ferry L, Defossez PA, Cristofari G. Locus-level L1 DNA methylation profiling reveals the epigenetic and transcriptional interplay between L1s and their integration sites. Cell Genomics 2024; 4:100498. [PMID: 38309261 PMCID: PMC10879037 DOI: 10.1016/j.xgen.2024.100498] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/02/2023] [Revised: 07/20/2023] [Accepted: 01/09/2024] [Indexed: 02/05/2024]
Abstract
Long interspersed element 1 (L1) retrotransposons are implicated in human disease and evolution. Their global activity is repressed by DNA methylation, but deciphering the regulation of individual copies has been challenging. Here, we combine short- and long-read sequencing to unveil L1 methylation heterogeneity across cell types, families, and individual loci and elucidate key principles involved. We find that the youngest primate L1 families are specifically hypomethylated in pluripotent stem cells and the placenta but not in most tumors. Locally, intronic L1 methylation is intimately associated with gene transcription. Conversely, the L1 methylation state can propagate to the proximal region up to 300 bp. This phenomenon is accompanied by the binding of specific transcription factors, which drive the expression of L1 and chimeric transcripts. Finally, L1 hypomethylation alone is typically insufficient to trigger L1 expression due to redundant silencing pathways. Our results illuminate the epigenetic and transcriptional interplay between retrotransposons and their host genome.
Affiliation(s)
- Sophie Lanciano
  - University Cote d'Azur, INSERM, CNRS, Institute for Research on Cancer and Aging of Nice (IRCAN), Nice, France
- Claude Philippe
  - University Cote d'Azur, INSERM, CNRS, Institute for Research on Cancer and Aging of Nice (IRCAN), Nice, France
- Arpita Sarkar
  - University Cote d'Azur, INSERM, CNRS, Institute for Research on Cancer and Aging of Nice (IRCAN), Nice, France
- David Pratella
  - University Cote d'Azur, INSERM, CNRS, Institute for Research on Cancer and Aging of Nice (IRCAN), Nice, France
- Cécilia Domrane
  - University Paris Cité, CNRS, Epigenetics and Cell Fate, Paris, France
- Aurélien J Doucet
  - University Cote d'Azur, INSERM, CNRS, Institute for Research on Cancer and Aging of Nice (IRCAN), Nice, France
- Dominic van Essen
  - University Cote d'Azur, INSERM, CNRS, Institute for Research on Cancer and Aging of Nice (IRCAN), Nice, France
- Simona Saccani
  - University Cote d'Azur, INSERM, CNRS, Institute for Research on Cancer and Aging of Nice (IRCAN), Nice, France
- Laure Ferry
  - University Paris Cité, CNRS, Epigenetics and Cell Fate, Paris, France
- Gael Cristofari
  - University Cote d'Azur, INSERM, CNRS, Institute for Research on Cancer and Aging of Nice (IRCAN), Nice, France.
8
Matsushima W, Planet E, Trono D. Ancestral genome reconstruction enhances transposable element annotation by identifying degenerate integrants. Cell Genomics 2024; 4:100497. [PMID: 38295789 PMCID: PMC10879028 DOI: 10.1016/j.xgen.2024.100497] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/26/2023] [Revised: 08/09/2023] [Accepted: 01/06/2024] [Indexed: 02/17/2024]
Abstract
Growing evidence indicates that transposable elements (TEs) play important roles in evolution by providing genomes with coding and non-coding sequences. Identification of TE-derived functional elements, however, has relied on TE annotations in individual species, which limits its scope to relatively intact TE sequences. Here, we report a novel approach to uncover previously unannotated degenerate TEs (degTEs) by probing multiple ancestral genomes reconstructed from hundreds of species. We applied this method to the human genome and achieved a 10.8% increase in coverage over the most recent annotation. Further, we discovered that degTEs contribute to various cis-regulatory elements and transcription factor binding sites, including those of a known TE-controlling family, the KRAB zinc-finger proteins. We also report unannotated chimeric transcripts between degTEs and human genes expressed in embryos. This study provides a novel methodology and a freely available resource that will facilitate the investigation of TE co-option events on a full scale.
Affiliation(s)
- Wayo Matsushima
  - School of Life Sciences, École Polytechnique Fédérale de Lausanne (EPFL), 1015 Lausanne, Switzerland.
- Evarist Planet
  - School of Life Sciences, École Polytechnique Fédérale de Lausanne (EPFL), 1015 Lausanne, Switzerland
- Didier Trono
  - School of Life Sciences, École Polytechnique Fédérale de Lausanne (EPFL), 1015 Lausanne, Switzerland.
9
Fukuda K. The role of transposable elements in human evolution and methods for their functional analysis: current status and future perspectives. Genes Genet Syst 2024; 98:289-304. [PMID: 37866889 DOI: 10.1266/ggs.23-00140] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 10/24/2023] Open
Abstract
Transposable elements (TEs) are mobile DNA sequences that can insert themselves into various locations within the genome, causing mutations that may provide advantages or disadvantages to individuals and species. The insertion of TEs can result in genetic variation that may affect a wide range of human traits including genetic disorders. Understanding the role of TEs in human biology is crucial for both evolutionary and medical research. This review discusses the involvement of TEs in human traits and disease susceptibility, as well as methods for functional analysis of TEs.
Affiliation(s)
- Kei Fukuda
  - Integrative Genomics Unit, The University of Melbourne
10
Mandal AK. Recent insights into crosstalk between genetic parasites and their host genome. Brief Funct Genomics 2024; 23:15-23. [PMID: 36307128 PMCID: PMC10799329 DOI: 10.1093/bfgp/elac032] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/04/2022] [Revised: 09/14/2022] [Accepted: 09/21/2022] [Indexed: 01/21/2024] Open
Abstract
The bulk of higher order organismal genomes is comprised of transposable element (TE) copies, i.e. genetic parasites. The host-parasite relation is multi-faceted, varying across genomic region (genic versus intergenic), life-cycle stages, tissue-type and of course in health versus pathological state. The reach of functional genomics though, in investigating genotype-to-phenotype relations, has been limited when TEs are involved. The aim of this review is to highlight recent progress made in understanding how TE origin biochemical activity interacts with the central dogma stages of the host genome. Such interaction can also bring about modulation of the immune context and this could have important repercussions in disease state where immunity has a role to play. Thus, the review is to instigate ideas and action points around identifying evolutionary adaptations that the host genome and the genetic parasite have evolved and why they could be relevant.
Affiliation(s)
- Amit K Mandal
  - Corresponding author: A.K. Mandal, Nuffield Department of Surgical Sciences (NDS), University of Oxford, Old Road Campus Research building (ORCRB), Oxford OX3 7DQ, UK. Tel: +44 (0)1865 617123; Fax: +44 (0)1865 768876; E-mail:
11
Hall A, Middlehurst B, Cadogan MAM, Reed X, Billingsley KJ, Bubb VJ, Quinn JP. A SINE-VNTR-Alu at the LRIG2 locus is associated with proximal and distal gene expression in CRISPR and population models. Sci Rep 2024; 14:792. [PMID: 38191889 PMCID: PMC10774264 DOI: 10.1038/s41598-023-50307-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/13/2023] [Accepted: 12/18/2023] [Indexed: 01/10/2024] Open
Abstract
SINE-VNTR-Alu (SVA) retrotransposons represent mobile regulatory elements that have the potential to influence the surrounding genome when they insert into a locus. Evolutionarily recent mobilisation has resulted in loci in the human genome where a given retrotransposon might be observed to be present or absent, termed a retrotransposon insertion polymorphism (RIP). We previously observed that an SVA RIP ~2 kb upstream of LRIG2 on chromosome 1, the 'LRIG2 SVA', was associated with differences in local gene expression and methylation, and that the two were correlated. Here, we have used CRISPR-mediated deletion of the LRIG2 SVA in a cell line model to validate that presence of the retrotransposon is directly affecting local expression and provide evidence that is suggestive of a modest role for the SVA in modulating nearby methylation. Additionally, in leveraging an available Hi-C dataset we observed that the LRIG2 SVA was also involved in long-range chromatin interactions with a cluster of genes ~300 kb away, and that expression of these genes was to varying degrees associated with dosage of the SVA in both CRISPR cell line and population models. Altogether, these data support a regulatory role for SVAs in the modulation of gene expression, with the latter potentially involving chromatin looping, consistent with the model that RIPs may contribute to interpersonal differences in transcriptional networks.
Affiliation(s)
- Ashley Hall
  - Department of Pharmacology and Therapeutics, Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool, L69 7BE, UK
- Ben Middlehurst
  - Department of Pharmacology and Therapeutics, Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool, L69 7BE, UK
- Max A M Cadogan
  - Department of Pharmacology and Therapeutics, Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool, L69 7BE, UK
- Xylena Reed
  - Laboratory of Neurogenetics, National Institute on Aging, National Institutes of Health, Bethesda, MD, 20892, USA
  - Center for Alzheimer's and Related Dementias, National Institute on Aging, National Institute of Neurological Disorders and Stroke, National Institutes of Health, Bethesda, MD, 20892, USA
- Kimberley J Billingsley
  - Laboratory of Neurogenetics, National Institute on Aging, National Institutes of Health, Bethesda, MD, 20892, USA
  - Center for Alzheimer's and Related Dementias, National Institute on Aging, National Institute of Neurological Disorders and Stroke, National Institutes of Health, Bethesda, MD, 20892, USA
- Vivien J Bubb
  - Department of Pharmacology and Therapeutics, Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool, L69 7BE, UK
- John P Quinn
  - Department of Pharmacology and Therapeutics, Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool, L69 7BE, UK.
12
Müller I, Helin K. Keep quiet: the HUSH complex in transcriptional silencing and disease. Nat Struct Mol Biol 2024; 31:11-22. [PMID: 38216658 DOI: 10.1038/s41594-023-01173-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/18/2021] [Accepted: 10/23/2023] [Indexed: 01/14/2024]
Abstract
The human silencing hub (HUSH) complex is an epigenetic repressor complex whose role has emerged as an important guardian of genome integrity. It protects the genome from exogenous DNA invasion and regulates endogenous retroelements by recruiting histone methyltransferases catalyzing histone 3 lysine 9 trimethylation (H3K9me3) and additional proteins involved in chromatin compaction. In particular, its regulation of transcriptionally active LINE1 retroelements, by binding to and neutralizing LINE1 transcripts, has been well characterized. HUSH is required for mouse embryogenesis and is associated with disease, in particular cancer. Here we provide insights into the structural and biochemical features of the HUSH complex. Furthermore, we discuss the molecular mechanisms by which the HUSH complex is recruited to specific genomic regions and how it silences transcription. Finally, we discuss the role of HUSH complex members in mammalian development, antiretroviral immunity, and diseases such as cancer.
Affiliation(s)
- Iris Müller
  - Cell Biology Program and Center for Epigenetics Research, Memorial Sloan Kettering Cancer Center, New York, NY, USA
- Kristian Helin
  - Cell Biology Program and Center for Epigenetics Research, Memorial Sloan Kettering Cancer Center, New York, NY, USA.
  - The Institute of Cancer Research, London, UK.
13
Feldmeyer B, Bornberg-Bauer E, Dohmen E, Fouks B, Heckenhauer J, Huylmans AK, Jones ARC, Stolle E, Harrison MC. Comparative Evolutionary Genomics in Insects. Methods Mol Biol 2024; 2802:473-514. [PMID: 38819569 DOI: 10.1007/978-1-0716-3838-5_16] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 06/01/2024]
Abstract
Genome sequencing quality, in terms of both read length and accuracy, is constantly improving. By combining long-read sequencing technologies with various scaffolding techniques, chromosome-level genome assemblies are now achievable at an affordable price for non-model organisms. Insects represent an exciting taxon for studying the genomic underpinnings of evolutionary innovations, due to ancient origins, immense species-richness, and broad phenotypic diversity. Here we summarize some of the most important methods for carrying out a comparative genomics study on insects. We describe available tools and offer concrete tips on all stages of such an endeavor from DNA extraction through genome sequencing, annotation, and several evolutionary analyses. Along the way we describe important insect-specific aspects, such as DNA extraction difficulties or gene families that are particularly difficult to annotate, and offer solutions. We describe results from several examples of comparative genomics analyses on insects to illustrate the fascinating questions that can now be addressed in this new age of genomics research.
Affiliation(s)
- Barbara Feldmeyer
  - Senckenberg Biodiversity and Climate Research Centre (SBiK-F), Molecular Ecology, Frankfurt, Germany
- Erich Bornberg-Bauer
  - Institute for Evolution and Biodiversity, University of Münster, Münster, Germany
  - Department of Protein Evolution, Max Planck Institute for Developmental Biology, Tübingen, Germany
- Elias Dohmen
  - Institute for Evolution and Biodiversity, University of Münster, Münster, Germany
- Bertrand Fouks
  - Institute for Evolution and Biodiversity, University of Münster, Münster, Germany
- Jacqueline Heckenhauer
  - LOEWE Centre for Translational Biodiversity Genomics (LOEWE-TBG), Frankfurt, Germany
  - Department of Terrestrial Zoology, Senckenberg Research Institute and Natural History Museum Frankfurt, Frankfurt, Germany
- Ann Kathrin Huylmans
  - Institute of Organismic and Molecular Evolution, Johannes Gutenberg University, Mainz, Germany
- Alun R C Jones
  - Institute for Evolution and Biodiversity, University of Münster, Münster, Germany
- Eckart Stolle
  - Museum Koenig, Leibniz Institute for the Analysis of Biodiversity Change (LIB), Bonn, Germany
- Mark C Harrison
  - Institute for Evolution and Biodiversity, University of Münster, Münster, Germany

14
Pulver C, Grun D, Duc J, Sheppard S, Planet E, Coudray A, de Fondeville R, Pontis J, Trono D. Statistical learning quantifies transposable element-mediated cis-regulation. Genome Biol 2023; 24:258. [PMID: 37950299 PMCID: PMC10637000 DOI: 10.1186/s13059-023-03085-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/16/2022] [Accepted: 10/09/2023] [Indexed: 11/12/2023] Open
Abstract
BACKGROUND: Transposable elements (TEs) have colonized the genomes of most metazoans, and many TE-embedded sequences function as cis-regulatory elements (CREs) for genes involved in a wide range of biological processes from early embryogenesis to innate immune responses. Because of their repetitive nature, TEs have the potential to form CRE platforms enabling the coordinated and genome-wide regulation of protein-coding genes by only a handful of trans-acting transcription factors (TFs).

RESULTS: Here, we directly test this hypothesis through mathematical modeling and demonstrate that differences in expression at protein-coding genes alone are sufficient to estimate the magnitude and significance of TE-contributed cis-regulatory activities, even in contexts where TE-derived transcription fails to do so. We leverage hundreds of overexpression experiments and estimate that, overall, gene expression is influenced by TE-embedded CREs situated within approximately 500 kb of promoters. Focusing on the cis-regulatory potential of TEs within the gene regulatory network of human embryonic stem cells, we find that pluripotency-specific and evolutionarily young TE subfamilies can be reactivated by TFs involved in post-implantation embryogenesis. Finally, we show that TE subfamilies can be split into truly regulatorily active versus inactive fractions based on additional information such as matched epigenomic data, observing that TF binding may better predict TE cis-regulatory activity than differences in histone marks.

CONCLUSION: Our results suggest that TE-embedded CREs contribute to gene regulation during and beyond gastrulation. On a methodological level, we provide a statistical tool that infers TE-dependent cis-regulation from RNA-seq data alone, thus facilitating the study of TEs in the next-generation sequencing era.
Affiliation(s)
- Cyril Pulver
  - School of Life Sciences, Swiss Federal Institute of Technology Lausanne (EPFL), CH-1015, Lausanne, Switzerland
- Delphine Grun
  - School of Life Sciences, Swiss Federal Institute of Technology Lausanne (EPFL), CH-1015, Lausanne, Switzerland
- Julien Duc
  - School of Life Sciences, Swiss Federal Institute of Technology Lausanne (EPFL), CH-1015, Lausanne, Switzerland
- Shaoline Sheppard
  - School of Life Sciences, Swiss Federal Institute of Technology Lausanne (EPFL), CH-1015, Lausanne, Switzerland
- Evarist Planet
  - School of Life Sciences, Swiss Federal Institute of Technology Lausanne (EPFL), CH-1015, Lausanne, Switzerland
- Alexandre Coudray
  - School of Life Sciences, Swiss Federal Institute of Technology Lausanne (EPFL), CH-1015, Lausanne, Switzerland
- Raphaël de Fondeville
  - Swiss Data Science Center, Swiss Federal Institute of Technology Lausanne (EPFL), CH-1015, Lausanne, Switzerland
- Julien Pontis
  - School of Life Sciences, Swiss Federal Institute of Technology Lausanne (EPFL), CH-1015, Lausanne, Switzerland
  - SOPHiA GENETICS SA, La Pièce 12, CH-1180, Rolle, Switzerland
- Didier Trono
  - School of Life Sciences, Swiss Federal Institute of Technology Lausanne (EPFL), CH-1015, Lausanne, Switzerland

15
Lawson HA, Liang Y, Wang T. Transposable elements in mammalian chromatin organization. Nat Rev Genet 2023; 24:712-723. [PMID: 37286742 DOI: 10.1038/s41576-023-00609-6] [Citation(s) in RCA: 17] [Impact Index Per Article: 17.0] [Accepted: 04/24/2023] [Indexed: 06/09/2023]
Abstract
Transposable elements (TEs) are mobile DNA elements that comprise almost 50% of mammalian genomic sequence. TEs are capable of making additional copies of themselves that integrate into new positions in host genomes. This unique property has had an important impact on mammalian genome evolution and on the regulation of gene expression because TE-derived sequences can function as cis-regulatory elements such as enhancers, promoters and silencers. Now, advances in our ability to identify and characterize TEs have revealed that TE-derived sequences also regulate gene expression by both maintaining and shaping 3D genome architecture. Studies are revealing how TEs contribute raw sequence that can give rise to the structures that shape chromatin organization, and thus gene expression, allowing for species-specific genome innovation and evolutionary novelty.
Affiliation(s)
- Heather A Lawson
  - Department of Genetics, Washington University School of Medicine, Saint Louis, MO, USA
- Yonghao Liang
  - Department of Genetics, Washington University School of Medicine, Saint Louis, MO, USA
  - Center for Genome Sciences and Systems Biology, Washington University School of Medicine, Saint Louis, MO, USA
- Ting Wang
  - Department of Genetics, Washington University School of Medicine, Saint Louis, MO, USA
  - Center for Genome Sciences and Systems Biology, Washington University School of Medicine, Saint Louis, MO, USA
  - McDonnell Genome Institute, Washington University School of Medicine, Saint Louis, MO, USA

16
Sproul JS, Hotaling S, Heckenhauer J, Powell A, Marshall D, Larracuente AM, Kelley JL, Pauls SU, Frandsen PB. Analyses of 600+ insect genomes reveal repetitive element dynamics and highlight biodiversity-scale repeat annotation challenges. Genome Res 2023; 33:1708-1717. [PMID: 37739812 PMCID: PMC10691545 DOI: 10.1101/gr.277387.122] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/06/2022] [Accepted: 09/20/2023] [Indexed: 09/24/2023]
Abstract
Repetitive elements (REs) are integral to the composition, structure, and function of eukaryotic genomes, yet remain understudied in most taxonomic groups. We investigated REs across 601 insect species and report wide variation in RE dynamics across groups. Analysis of associations between REs and protein-coding genes revealed dynamic evolution at the interface between REs and coding regions across insects, including notably elevated RE-gene associations in lineages with abundant long interspersed nuclear elements (LINEs). We leveraged this large, empirical data set to quantify impacts of long-read technology on RE detection and investigate fundamental challenges to RE annotation in diverse groups. In long-read assemblies, we detected ∼36% more REs than short-read assemblies, with long terminal repeats (LTRs) showing 162% increased detection, whereas DNA transposons and LINEs showed less respective technology-related bias. In most insect lineages, 25%-85% of repetitive sequences were "unclassified" following automated annotation, compared with only ∼13% in Drosophila species. Although the diversity of available insect genomes has rapidly expanded, we show the rate of community contributions to RE databases has not kept pace, preventing efficient annotation and high-resolution study of REs in most groups. We highlight the tremendous opportunity and need for the biodiversity genomics field to embrace REs and suggest collective steps for making progress toward this goal.
Affiliation(s)
- John S Sproul
  - Department of Biology, Brigham Young University, Provo, Utah 84602, USA
  - Department of Biology, University of Nebraska Omaha, Omaha, Nebraska 68182, USA
  - Department of Biology, University of Rochester, Rochester, New York 14627, USA
- Scott Hotaling
  - School of Biological Sciences, Washington State University, Pullman, Washington 99163, USA
  - Department of Watershed Sciences, Utah State University, Logan, Utah 84322, USA
- Jacqueline Heckenhauer
  - LOEWE Center for Translational Biodiversity Genomics (LOEWE-TBG), 60325 Frankfurt, Germany
  - Senckenberg Research Institute and Natural History Museum Frankfurt, 60325 Frankfurt, Germany
- Ashlyn Powell
  - Department of Plant and Wildlife Sciences, Brigham Young University, Provo, Utah 84602, USA
- Dez Marshall
  - Department of Biology, University of Nebraska Omaha, Omaha, Nebraska 68182, USA
- Joanna L Kelley
  - School of Biological Sciences, Washington State University, Pullman, Washington 99163, USA
  - Department of Ecology and Evolutionary Biology, University of California Santa Cruz, Santa Cruz, California 95064, USA
- Steffen U Pauls
  - LOEWE Center for Translational Biodiversity Genomics (LOEWE-TBG), 60325 Frankfurt, Germany
  - Senckenberg Research Institute and Natural History Museum Frankfurt, 60325 Frankfurt, Germany
  - Department of Insect Biotechnology, Justus-Liebig-University Gießen, 35392 Gießen, Germany
- Paul B Frandsen
  - LOEWE Center for Translational Biodiversity Genomics (LOEWE-TBG), 60325 Frankfurt, Germany
  - Department of Plant and Wildlife Sciences, Brigham Young University, Provo, Utah 84602, USA
  - Data Science Lab, Smithsonian Institution, Washington, District of Columbia 20560, USA

17
Orozco-Arias S, Lopez-Murillo LH, Piña JS, Valencia-Castrillon E, Tabares-Soto R, Castillo-Ossa L, Isaza G, Guyot R. Genomic object detection: An improved approach for transposable elements detection and classification using convolutional neural networks. PLoS One 2023; 18:e0291925. [PMID: 37733731 PMCID: PMC10513252 DOI: 10.1371/journal.pone.0291925] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/26/2023] [Accepted: 09/10/2023] [Indexed: 09/23/2023] Open
Abstract
Analysis of eukaryotic genomes requires the detection and classification of transposable elements (TEs), a crucial but complex and time-consuming task. To improve the performance of tools that accomplish these tasks, Machine Learning approaches (ML) that leverage computer resources, such as GPUs (Graphical Processing Unit) and multiple CPU (Central Processing Unit) cores, have been adopted. However, until now, the use of ML techniques has mostly been limited to classification of TEs. Herein, a detection-classification strategy (named YORO) based on convolutional neural networks is adapted from computer vision (YOLO) to genomics. This approach enables the detection of genomic objects through the prediction of the position, length, and classification in large DNA sequences such as fully sequenced genomes. As a proof of concept, the internal protein-coding domains of LTR-retrotransposons are used to train the proposed neural network. Precision, recall, accuracy, F1-score, execution times and time ratios, as well as several graphical representations were used as metrics to measure performance. These promising results open the door for a new generation of Deep Learning tools for genomics. YORO architecture is available at https://github.com/simonorozcoarias/YORO.
Affiliation(s)
- Simon Orozco-Arias
  - Department of Computer Science, Universidad Autónoma de Manizales, Manizales, Colombia
  - Center for Technology Development Bioprocess and Agroindustry Plant, Department of Systems and Informatics, Universidad de Caldas, Manizales, Colombia
- Johan S. Piña
  - Department of Computer Science, Universidad Autónoma de Manizales, Manizales, Colombia
- Reinel Tabares-Soto
  - Center for Technology Development Bioprocess and Agroindustry Plant, Department of Systems and Informatics, Universidad de Caldas, Manizales, Colombia
  - Department of Electronics and Automation, Universidad Autónoma de Manizales, Manizales, Colombia
- Luis Castillo-Ossa
  - Center for Technology Development Bioprocess and Agroindustry Plant, Department of Systems and Informatics, Universidad de Caldas, Manizales, Colombia
- Gustavo Isaza
  - Center for Technology Development Bioprocess and Agroindustry Plant, Department of Systems and Informatics, Universidad de Caldas, Manizales, Colombia
- Romain Guyot
  - Department of Electronics and Automation, Universidad Autónoma de Manizales, Manizales, Colombia
  - Institut de Recherche pour le Développement, CIRAD, Univ. Montpellier, Montpellier, France

18
Karttunen K, Patel D, Xia J, Fei L, Palin K, Aaltonen L, Sahu B. Transposable elements as tissue-specific enhancers in cancers of endodermal lineage. Nat Commun 2023; 14:5313. [PMID: 37658059 PMCID: PMC10474299 DOI: 10.1038/s41467-023-41081-4] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Received: 12/20/2022] [Accepted: 08/23/2023] [Indexed: 09/03/2023] Open
Abstract
Transposable elements (TE) are repetitive genomic elements that harbor binding sites for human transcription factors (TF). A regulatory role for TEs has been suggested in embryonal development and diseases such as cancer but systematic investigation of their functions has been limited by their widespread silencing in the genome. Here, we utilize unbiased massively parallel reporter assay data using a whole human genome library to identify TEs with functional enhancer activity in two human cancer types of endodermal lineage, colorectal and liver cancers. We show that the identified TE enhancers are characterized by genomic features associated with active enhancers, such as epigenetic marks and TF binding. Importantly, we identify distinct TE subfamilies that function as tissue-specific enhancers, namely MER11- and LTR12-elements in colon and liver cancers, respectively. These elements are bound by distinct TFs in each cell type, and they have predicted associations to differentially expressed genes. In conclusion, these data demonstrate how different cancer types can utilize distinct TEs as tissue-specific enhancers, paving the way for comprehensive understanding of the role of TEs as bona fide enhancers in the cancer genomes.
Affiliation(s)
- Konsta Karttunen
  - Applied Tumor Genomics Program, Research Programs Unit, Faculty of Medicine, University of Helsinki, Helsinki, Finland
- Divyesh Patel
  - Applied Tumor Genomics Program, Research Programs Unit, Faculty of Medicine, University of Helsinki, Helsinki, Finland
  - iCAN Digital Precision Cancer Medicine Flagship, University of Helsinki, Helsinki, Finland
- Jihan Xia
  - Applied Tumor Genomics Program, Research Programs Unit, Faculty of Medicine, University of Helsinki, Helsinki, Finland
  - iCAN Digital Precision Cancer Medicine Flagship, University of Helsinki, Helsinki, Finland
- Liangru Fei
  - Applied Tumor Genomics Program, Research Programs Unit, Faculty of Medicine, University of Helsinki, Helsinki, Finland
- Kimmo Palin
  - Applied Tumor Genomics Program, Research Programs Unit, Faculty of Medicine, University of Helsinki, Helsinki, Finland
  - iCAN Digital Precision Cancer Medicine Flagship, University of Helsinki, Helsinki, Finland
- Lauri Aaltonen
  - Applied Tumor Genomics Program, Research Programs Unit, Faculty of Medicine, University of Helsinki, Helsinki, Finland
  - iCAN Digital Precision Cancer Medicine Flagship, University of Helsinki, Helsinki, Finland
- Biswajyoti Sahu
  - Applied Tumor Genomics Program, Research Programs Unit, Faculty of Medicine, University of Helsinki, Helsinki, Finland
  - iCAN Digital Precision Cancer Medicine Flagship, University of Helsinki, Helsinki, Finland
  - Medicum, Faculty of Medicine, University of Helsinki, Helsinki, Finland
  - Centre for Molecular Medicine Norway, University of Oslo, Oslo, Norway

19
Zhao P, Gu L, Gao Y, Pan Z, Liu L, Li X, Zhou H, Yu D, Han X, Qian L, Liu GE, Fang L, Wang Z. Young SINEs in pig genomes impact gene regulation, genetic diversity, and complex traits. Commun Biol 2023; 6:894. [PMID: 37652983 PMCID: PMC10471783 DOI: 10.1038/s42003-023-05234-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/02/2022] [Accepted: 08/09/2023] [Indexed: 09/02/2023] Open
Abstract
Transposable elements (TEs) are a major source of genetic polymorphisms and play a role in chromatin architecture, gene regulatory networks, and genomic evolution. However, their functional role in pigs and contributions to complex traits are largely unknown. We created a catalog of TEs (n = 3,087,929) in pigs and found that young SINEs were predominantly silenced by histone modifications, DNA methylation, and decreased accessibility. However, some transcripts from active young SINEs showed high tissue-specificity, as confirmed by analyzing 3570 RNA-seq samples. We also detected 211,067 dimorphic SINEs in 374 individuals, including 340 population-specific ones associated with local adaptation. Mapping these dimorphic SINEs to genome-wide associations of 97 complex traits in pigs, we found 54 candidate genes (e.g., ANK2 and VRTN) that might be mediated by TEs. Our findings highlight the important roles of young SINEs and provide a supplement for genotype-to-phenotype associations and modern breeding in pigs.
Affiliation(s)
- Pengju Zhao
  - Hainan Institute, Zhejiang University, Yongyou Industry Park, Yazhou Bay Sci-Tech City, Sanya, 572000, China
  - College of Animal Sciences, Zhejiang University, Hangzhou, Zhejiang, 310058, China
- Lihong Gu
  - Institute of Animal Science & Veterinary Medicine, Hainan Academy of Agricultural Sciences, No. 14 Xingdan Road, Haikou, 571100, China
- Yahui Gao
  - Animal Genomics and Improvement Laboratory, Beltsville Agricultural Research Center, Agricultural Research Service, USDA, Beltsville, MD, 20705, USA
- Zhangyuan Pan
  - Department of Animal Science, University of California, Davis, CA, 95616, USA
- Lei Liu
  - Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518124, China
- Xingzheng Li
  - Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, 518124, China
- Huaijun Zhou
  - Department of Animal Science, University of California, Davis, CA, 95616, USA
- Dongyou Yu
  - Hainan Institute, Zhejiang University, Yongyou Industry Park, Yazhou Bay Sci-Tech City, Sanya, 572000, China
  - College of Animal Sciences, Zhejiang University, Hangzhou, Zhejiang, 310058, China
- Xinyan Han
  - Hainan Institute, Zhejiang University, Yongyou Industry Park, Yazhou Bay Sci-Tech City, Sanya, 572000, China
  - College of Animal Sciences, Zhejiang University, Hangzhou, Zhejiang, 310058, China
- Lichun Qian
  - Hainan Institute, Zhejiang University, Yongyou Industry Park, Yazhou Bay Sci-Tech City, Sanya, 572000, China
  - College of Animal Sciences, Zhejiang University, Hangzhou, Zhejiang, 310058, China
- George E Liu
  - Animal Genomics and Improvement Laboratory, Beltsville Agricultural Research Center, Agricultural Research Service, USDA, Beltsville, MD, 20705, USA
- Lingzhao Fang
  - Center for Quantitative Genetics and Genomics, Aarhus University, Aarhus, 8000, Denmark
- Zhengguang Wang
  - Hainan Institute, Zhejiang University, Yongyou Industry Park, Yazhou Bay Sci-Tech City, Sanya, 572000, China
  - College of Animal Sciences, Zhejiang University, Hangzhou, Zhejiang, 310058, China

20
Alkailani MI, Gibbings D. The Regulation and Immune Signature of Retrotransposons in Cancer. Cancers (Basel) 2023; 15:4340. [PMID: 37686616 PMCID: PMC10486412 DOI: 10.3390/cancers15174340] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/25/2023] [Revised: 08/14/2023] [Accepted: 08/18/2023] [Indexed: 09/10/2023] Open
Abstract
Advances in sequencing technologies and the bioinformatic analysis of big data facilitate the study of jumping genes' activity in the human genome in cancer from a broad perspective. Retrotransposons, which move from one genomic site to another by a copy-and-paste mechanism, are regulated by various molecular pathways that may be disrupted during tumorigenesis. Active retrotransposons can stimulate type I IFN responses. Although accumulated evidence suggests that retrotransposons can induce inflammation, the research investigating the exact mechanism of triggering these responses is ongoing. Understanding these mechanisms could improve the therapeutic management of cancer through the use of retrotransposon-induced inflammation as a tool to instigate immune responses to tumors.
Affiliation(s)
- Maisa I. Alkailani
  - College of Health and Life Sciences, Hamad Bin Khalifa University, Qatar Foundation, Doha P.O. Box 34110, Qatar
- Derrick Gibbings
  - Department of Cellular and Molecular Medicine, Faculty of Medicine, University of Ottawa, Ottawa, ON K1H 8M5, Canada

21
Zhao P, Peng C, Fang L, Wang Z, Liu GE. Taming transposable elements in livestock and poultry: a review of their roles and applications. Genet Sel Evol 2023; 55:50. [PMID: 37479995 PMCID: PMC10362595 DOI: 10.1186/s12711-023-00821-2] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Received: 03/16/2023] [Accepted: 06/30/2023] [Indexed: 07/23/2023] Open
Abstract
Livestock and poultry play a significant role in human nutrition by converting agricultural by-products into high-quality proteins. To meet the growing demand for safe animal protein, genetic improvement of livestock must be done sustainably while minimizing negative environmental impacts. Transposable elements (TE) are important components of livestock and poultry genomes, contributing to their genetic diversity, chromatin states, gene regulatory networks, and complex traits of economic value. However, compared to other species, research on TE in livestock and poultry is still in its early stages. In this review, we analyze 72 studies published in the past 20 years, summarize the TE composition in livestock and poultry genomes, and focus on their potential roles in functional genomics. We also discuss bioinformatic tools and strategies for integrating multi-omics data with TE, and explore future directions, feasibility, and challenges of TE research in livestock and poultry. In addition, we suggest strategies to apply TE in basic biological research and animal breeding. Our goal is to provide a new perspective on the importance of TE in livestock and poultry genomes.
Affiliation(s)
- Pengju Zhao
  - Hainan Institute of Zhejiang University, Hainan Sanya, 572000, China
  - College of Animal Sciences, Zhejiang University, Zhejiang, Hangzhou, People's Republic of China
- Chen Peng
  - Hainan Institute of Zhejiang University, Hainan Sanya, 572000, China
  - College of Animal Sciences, Zhejiang University, Zhejiang, Hangzhou, People's Republic of China
- Lingzhao Fang
  - Center for Quantitative Genetics and Genomics, Aarhus University, 8000, Aarhus, Denmark
- Zhengguang Wang
  - Hainan Institute of Zhejiang University, Hainan Sanya, 572000, China
  - College of Animal Sciences, Zhejiang University, Zhejiang, Hangzhou, People's Republic of China
- George E Liu
  - Animal Genomics and Improvement Laboratory, Beltsville Agricultural Research Center, Agricultural Research Service, USDA, Beltsville, MD, 20705, USA

22
Garret P, Chevarin M, Vitobello A, Verdez S, Fournier C, Verloes A, Tisserant E, Vabres P, Prevel O, Philippe C, Denommé-Pichon AS, Bruel AL, Mau-Them FT, Safraou H, Boughalem A, Costa JM, Trost D, Thauvin-Robinet C, Faivre L, Duffourd Y. A second look at exome sequencing data: detecting mobile elements insertion in a rare disease cohort. Eur J Hum Genet 2023; 31:761-768. [PMID: 36450799 PMCID: PMC10326243 DOI: 10.1038/s41431-022-01250-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/14/2021] [Revised: 07/01/2022] [Accepted: 11/17/2022] [Indexed: 12/05/2022] Open
Abstract
About 0.3% of all variants are due to de novo mobile element insertions (MEIs). The massive development of next-generation sequencing has made it possible to identify MEIs on a large scale. We analyzed exome sequencing (ES) data from 3232 individuals (2410 probands) with developmental and/or neurological abnormalities, with MELT, a tool designed to identify MEIs. The results were filtered by frequency, impacted region and gene function. Following phenotype comparison, two candidates were identified in two unrelated probands. The first mobile element (ME) was found in a patient referred for poikilodermia. A homozygous insertion was identified in the FERMT1 gene involved in Kindler syndrome. RNA study confirmed its pathological impact on splicing. The second ME was a de novo Alu insertion in the GRIN2B gene involved in intellectual disability, and detected in a patient with a developmental disorder. The frequency of de novo exonic MEIs in our study is concordant with previous studies on ES data. This project, which aimed to identify pathological MEIs in the coding sequence of genes, confirms that including detection of MEs in the ES pipeline can increase the diagnostic rate. This work provides additional evidence that ES could be used alone as a diagnostic exam.
Affiliation(s)
- Philippine Garret
  - UMR1231 GAD, Inserm-Université Bourgogne-Franche Comté, Dijon, France
  - Laboratoire, CERBA, Saint-Ouen l'Aumône, France
- Martin Chevarin
  - UMR1231 GAD, Inserm-Université Bourgogne-Franche Comté, Dijon, France
  - Unité Fonctionnelle Innovation en Diagnostic génomique des maladies rares, FHU-TRANSLAD, Dijon University Hospital, Dijon, France
- Antonio Vitobello
  - UMR1231 GAD, Inserm-Université Bourgogne-Franche Comté, Dijon, France
  - Unité Fonctionnelle Innovation en Diagnostic génomique des maladies rares, FHU-TRANSLAD, Dijon University Hospital, Dijon, France
- Simon Verdez
  - UMR1231 GAD, Inserm-Université Bourgogne-Franche Comté, Dijon, France
  - Unité Fonctionnelle Innovation en Diagnostic génomique des maladies rares, FHU-TRANSLAD, Dijon University Hospital, Dijon, France
- Cyril Fournier
  - UMR 1231, Faculty of Medicine, University of Burgundy-iSITE-INSERM, Dijon, France
  - Unit for innovation in genetics and epigenetic in oncology, Dijon University Hospital, Dijon, France
- Alain Verloes
  - INSERM UMR1141, Université de Paris, Paris, France
  - Genetics Department, AP-HP Nord, Robert-Debré University Hospital, Paris, France
- Emilie Tisserant
  - UMR1231 GAD, Inserm-Université Bourgogne-Franche Comté, Dijon, France
  - Unité Fonctionnelle Innovation en Diagnostic génomique des maladies rares, FHU-TRANSLAD, Dijon University Hospital, Dijon, France
- Pierre Vabres
  - UMR1231 GAD, Inserm-Université Bourgogne-Franche Comté, Dijon, France
  - Centre de Référence maladies rares « maladies dermatologiques en mosaïque », service de dermatologie, FHU-TRANSLAD, Dijon University Hospital, Dijon, France
  - Service Dermatologie, Dijon University Hospital, Dijon, France
- Orlane Prevel
  - UMR1231 GAD, Inserm-Université Bourgogne-Franche Comté, Dijon, France
  - Service Dermatologie, Dijon University Hospital, Dijon, France
- Christophe Philippe
  - UMR1231 GAD, Inserm-Université Bourgogne-Franche Comté, Dijon, France
  - Unité Fonctionnelle Innovation en Diagnostic génomique des maladies rares, FHU-TRANSLAD, Dijon University Hospital, Dijon, France
- Anne-Sophie Denommé-Pichon
  - UMR1231 GAD, Inserm-Université Bourgogne-Franche Comté, Dijon, France
  - Unité Fonctionnelle Innovation en Diagnostic génomique des maladies rares, FHU-TRANSLAD, Dijon University Hospital, Dijon, France
  - Centre de Référence maladies rares « Anomalies du développement et syndromes malformatifs », centre de génétique, FHU-TRANSLAD, Dijon University Hospital, Dijon, France
- Ange-Line Bruel
  - UMR1231 GAD, Inserm-Université Bourgogne-Franche Comté, Dijon, France
  - Unité Fonctionnelle Innovation en Diagnostic génomique des maladies rares, FHU-TRANSLAD, Dijon University Hospital, Dijon, France
- Frédéric Tran Mau-Them
  - UMR1231 GAD, Inserm-Université Bourgogne-Franche Comté, Dijon, France
  - Unité Fonctionnelle Innovation en Diagnostic génomique des maladies rares, FHU-TRANSLAD, Dijon University Hospital, Dijon, France
  - Centre de Référence maladies rares « Anomalies du développement et syndromes malformatifs », centre de génétique, FHU-TRANSLAD, Dijon University Hospital, Dijon, France
  - Centre de Référence maladies rares « Déficiences intellectuelles de cause rare », centre de génétique, FHU-TRANSLAD, Dijon University Hospital, Dijon, France
- Hana Safraou
  - UMR1231 GAD, Inserm-Université Bourgogne-Franche Comté, Dijon, France
  - Unité Fonctionnelle Innovation en Diagnostic génomique des maladies rares, FHU-TRANSLAD, Dijon University Hospital, Dijon, France
  - Centre de Référence maladies rares « Anomalies du développement et syndromes malformatifs », centre de génétique, FHU-TRANSLAD, Dijon University Hospital, Dijon, France
- Christel Thauvin-Robinet
  - UMR1231 GAD, Inserm-Université Bourgogne-Franche Comté, Dijon, France
  - Unité Fonctionnelle Innovation en Diagnostic génomique des maladies rares, FHU-TRANSLAD, Dijon University Hospital, Dijon, France
  - Centre de Référence maladies rares « Déficiences intellectuelles de cause rare », centre de génétique, FHU-TRANSLAD, Dijon University Hospital, Dijon, France
- Laurence Faivre
  - UMR1231 GAD, Inserm-Université Bourgogne-Franche Comté, Dijon, France
  - Centre de Référence maladies rares « Anomalies du développement et syndromes malformatifs », centre de génétique, FHU-TRANSLAD, Dijon University Hospital, Dijon, France
- Yannis Duffourd
  - UMR1231 GAD, Inserm-Université Bourgogne-Franche Comté, Dijon, France
  - Unité Fonctionnelle Innovation en Diagnostic génomique des maladies rares, FHU-TRANSLAD, Dijon University Hospital, Dijon, France

23
Mohamed M, Sabot F, Varoqui M, Mugat B, Audouin K, Pélisson A, Fiston-Lavier AS, Chambeyron S. TrEMOLO: accurate transposable element allele frequency estimation using long-read sequencing data combining assembly and mapping-based approaches. Genome Biol 2023; 24:63. [PMID: 37013657 PMCID: PMC10069131 DOI: 10.1186/s13059-023-02911-2] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Received: 07/21/2022] [Accepted: 03/23/2023] [Indexed: 04/05/2023] [Open Access]
Abstract
Transposable Element MOnitoring with LOng-reads (TrEMOLO) is a new software tool that combines assembly- and mapping-based approaches to robustly detect transposable elements (TEs). Using high- or low-quality genome assemblies, TrEMOLO can detect most TE insertions and deletions and estimate their allele frequency in populations. Benchmarking with simulated data revealed that TrEMOLO outperforms other state-of-the-art computational tools. TE detection and frequency estimation by TrEMOLO were validated using simulated and experimental datasets. Therefore, TrEMOLO is a comprehensive and suitable tool to accurately study TE dynamics. TrEMOLO is available under GNU GPL3.0 at https://github.com/DrosophilaGenomeEvolution/TrEMOLO.
Affiliation(s)
- Mourdas Mohamed
  - Institute of Human Genetics, UMR9002, CNRS and Université de Montpellier, Montpellier, France
- François Sabot
  - DIADE, University of Montpellier, CIRAD, IRD, Montpellier, France
  - IFB - Southgreen Bioversity, CIRAD, INRAE, IRD, Montpellier, France
- Marion Varoqui
  - Institute of Human Genetics, UMR9002, CNRS and Université de Montpellier, Montpellier, France
- Bruno Mugat
  - Institute of Human Genetics, UMR9002, CNRS and Université de Montpellier, Montpellier, France
- Alain Pélisson
  - Institute of Human Genetics, UMR9002, CNRS and Université de Montpellier, Montpellier, France
- Anna-Sophie Fiston-Lavier
  - ISEM, Université Montpellier, CNRS, IRD, CIRAD, EPHE, Montpellier, France
  - Institut Universitaire de France (IUF), Paris, France
- Séverine Chambeyron
  - Institute of Human Genetics, UMR9002, CNRS and Université de Montpellier, Montpellier, France
24
Hu Y, Wang X, Xu Y, Yang H, Tong Z, Tian R, Xu S, Yu L, Guo Y, Shi P, Huang S, Yang G, Shi S, Wei F. Molecular mechanisms of adaptive evolution in wild animals and plants. SCIENCE CHINA. LIFE SCIENCES 2023; 66:453-495. [PMID: 36648611 PMCID: PMC9843154 DOI: 10.1007/s11427-022-2233-x] [Citation(s) in RCA: 24] [Impact Index Per Article: 24.0] [Received: 08/07/2022] [Accepted: 08/30/2022] [Indexed: 01/18/2023]
Abstract
Wild animals and plants have developed a variety of adaptive traits driven by adaptive evolution, an important strategy for species survival and persistence. Uncovering the molecular mechanisms of adaptive evolution is the key to understanding species diversification, phenotypic convergence, and inter-species interaction. As the genome sequences of more and more non-model organisms are becoming available, the focus of studies on molecular mechanisms of adaptive evolution has shifted from the candidate gene method to genetic mapping based on genome-wide scanning. In this study, we reviewed the latest research advances in wild animals and plants, focusing on adaptive traits, convergent evolution, and coevolution. Firstly, we focused on the adaptive evolution of morphological, behavioral, and physiological traits. Secondly, we reviewed the phenotypic convergences of life history traits and responding to environmental pressures, and the underlying molecular convergence mechanisms. Thirdly, we summarized the advances of coevolution, including the four main types: mutualism, parasitism, predation and competition. Overall, these latest advances greatly increase our understanding of the underlying molecular mechanisms for diverse adaptive traits and species interaction, demonstrating that the development of evolutionary biology has been greatly accelerated by multi-omics technologies. Finally, we highlighted the emerging trends and future prospects around the above three aspects of adaptive evolution.
Affiliation(s)
- Yibo Hu
  - CAS Key Lab of Animal Ecology and Conservation Biology, Chinese Academy of Sciences, Beijing, 100101, China
- Xiaoping Wang
  - State Key Laboratory for Conservation and Utilization of Bio-Resources in Yunnan, School of Life Sciences, Yunnan University, Kunming, 650091, China
- Yongchao Xu
  - State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Beijing, 100093, China
- Hui Yang
  - State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, 650201, China
- Zeyu Tong
  - Institute of Evolution and Ecology, School of Life Sciences, Central China Normal University, Wuhan, 430079, China
- Ran Tian
  - College of Life Sciences, Nanjing Normal University, Nanjing, 210023, China
- Shaohua Xu
  - State Key Laboratory of Biocontrol, Guangdong Key Lab of Plant Resources, School of Life Sciences, Sun Yat-sen University, Guangzhou, 510275, China
- Li Yu
  - State Key Laboratory for Conservation and Utilization of Bio-Resources in Yunnan, School of Life Sciences, Yunnan University, Kunming, 650091, China
- Yalong Guo
  - State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Beijing, 100093, China
- Peng Shi
  - State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, 650201, China
- Shuangquan Huang
  - Institute of Evolution and Ecology, School of Life Sciences, Central China Normal University, Wuhan, 430079, China
- Guang Yang
  - Southern Marine Science and Engineering Guangdong Laboratory (Guangzhou), Guangzhou, 511458, China
  - College of Life Sciences, Nanjing Normal University, Nanjing, 210023, China
- Suhua Shi
  - State Key Laboratory of Biocontrol, Guangdong Key Lab of Plant Resources, School of Life Sciences, Sun Yat-sen University, Guangzhou, 510275, China
- Fuwen Wei
  - CAS Key Lab of Animal Ecology and Conservation Biology, Chinese Academy of Sciences, Beijing, 100101, China
  - Southern Marine Science and Engineering Guangdong Laboratory (Guangzhou), Guangzhou, 511458, China
25
Raiyemo DA, Bobadilla LK, Tranel PJ. Genomic profiling of dioecious Amaranthus species provides novel insights into species relatedness and sex genes. BMC Biol 2023; 21:37. [PMID: 36804015 PMCID: PMC9940365 DOI: 10.1186/s12915-023-01539-9] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Received: 06/21/2022] [Accepted: 02/08/2023] [Indexed: 02/21/2023] [Open Access]
Abstract
BACKGROUND Amaranthus L. is a diverse genus consisting of domesticated, weedy, and non-invasive species distributed around the world. Nine species are dioecious, of which Amaranthus palmeri S. Watson and Amaranthus tuberculatus (Moq.) J.D. Sauer are troublesome weeds of agronomic crops in the USA and elsewhere. Shallow relationships among the dioecious Amaranthus species and the conservation of candidate genes within previously identified A. palmeri and A. tuberculatus male-specific regions of the Y (MSYs) in other dioecious species are poorly understood. In this study, seven genomes of dioecious amaranths were obtained by paired-end short-read sequencing and combined with short reads of seventeen species in the family Amaranthaceae from the NCBI database. The species were phylogenomically analyzed to understand their relatedness. Genome characteristics for the dioecious species were evaluated and coverage analysis was used to investigate the conservation of sequences within the MSY regions. RESULTS We provide genome size, heterozygosity, and ploidy level inference for seven newly sequenced dioecious Amaranthus species and two additional dioecious species from the NCBI database. We report a pattern of transposable element proliferation in the species, in which seven species had more Ty3 elements than copia elements while A. palmeri and A. watsonii had more copia elements than Ty3 elements, similar to the TE pattern in some monoecious amaranths. Using a Mash-based phylogenomic analysis, we accurately recovered taxonomic relationships among the dioecious Amaranthus species that were previously identified based on comparative morphology. Coverage analysis revealed eleven candidate gene models within the A. palmeri MSY region with male-enriched coverages, as well as regions on scaffold 19 with female-enriched coverage, based on A. watsonii read alignments. A previously reported FLOWERING LOCUS T (FT) within the A. tuberculatus MSY contig was also found to exhibit male-enriched coverages for three species closely related to A. tuberculatus but not for A. watsonii reads. Additional characterization of the A. palmeri MSY region revealed that 78% of the region is made of repetitive elements, typical of a sex determination region with reduced recombination. CONCLUSIONS The results of this study further increase our understanding of the relationships among the dioecious species of the Amaranthus genus and reveal genes with potential roles in sex function in the species.
Affiliation(s)
- Damilola A Raiyemo
  - Department of Crop Sciences, University of Illinois, Urbana, IL, 61801, USA
- Lucas K Bobadilla
  - Department of Crop Sciences, University of Illinois, Urbana, IL, 61801, USA
- Patrick J Tranel
  - Department of Crop Sciences, University of Illinois, Urbana, IL, 61801, USA
26
Pradier L, Bedhomme S. Ecology, more than antibiotics consumption, is the major predictor for the global distribution of aminoglycoside-modifying enzymes. eLife 2023; 12:77015. [PMID: 36785930 PMCID: PMC9928423 DOI: 10.7554/elife.77015] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Received: 01/12/2022] [Accepted: 01/24/2023] [Indexed: 02/15/2023] [Open Access]
Abstract
Antibiotic consumption and its abuses have been historically and repeatedly pointed out as the major driver of antibiotic resistance emergence and propagation. However, several examples show that resistance may persist despite substantial reductions in antibiotic use, and that other factors are at stake. Here, we study the temporal, spatial, and ecological distribution patterns of aminoglycoside resistance, by screening more than 160,000 publicly available genomes for 27 clusters of genes encoding aminoglycoside-modifying enzymes (AME genes). We find that AME genes display a ubiquitous pattern: about 25% of sequenced bacteria carry AME genes. These bacteria were sequenced from all the continents (except Antarctica) and terrestrial biomes, and belong to a wide range of phyla. By focusing on European countries between 1997 and 2018, we show that aminoglycoside consumption has little impact on the prevalence of AME-gene-carrying bacteria, whereas most variation in prevalence is observed among biomes. We further analyze the resemblance of resistome compositions across biomes: soil, wildlife, and human samples appear to be central to understanding the exchanges of AME genes between different ecological contexts. Together, these results support the idea that interventional strategies based on reducing antibiotic use should be complemented by a stronger control of exchanges, especially between ecosystems.
Affiliation(s)
- Léa Pradier
  - CEFE, CNRS, Univ Montpellier, EPHE, IRD, Montpellier, France
27
Eskier D, Arıbaş A, Karakülah G. PlanTEnrichment: A How-to Guide on Rapid Identification of Transposable Elements Associated with Regions of Interest in Select Plant Genomes. Methods Mol Biol 2023; 2703:59-70. [PMID: 37646937 DOI: 10.1007/978-1-0716-3389-2_5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 09/01/2023]
Abstract
Transposable elements (TEs) are repeat elements that can relocate or create novel copies of themselves in the genome and contribute to genomic complexity and expansion, via events such as chromosome recombination or regulation of gene expression. However, given the large number of such repeats across the genome, identifying repeats of interest can be a challenge even in well-annotated genomes, especially in more complex, TE-rich plant genomes. Here, we describe a protocol for PlanTEnrichment, a database we created comprising information on 11 plant genomes to analyze stress-associated TEs using publicly available data. By selecting a genome and providing a list of genes or genomic regions whose TE associations the user wants to identify, the user can rapidly obtain TE subfamilies found near the provided regions, as well as their superfamily and class, and the enrichment values of the repeats. The results also provide the locations of individual repeat instances found, alongside the input regions or genes they are associated with, and a bar graph of the top ten most significant repeat subfamilies identified. PlanTEnrichment is freely available at http://tools.ibg.deu.edu.tr/plantenrichment/ and can be used by researchers with rudimentary or no proficiency in computational analysis of TEs, allowing for expedience in the identification of TEs of interest and helping further our understanding of the potential contributions of TEs in plant genomes.
Affiliation(s)
- Doğa Eskier
  - İzmir International Biomedicine and Genome Institute, Dokuz Eylül University, İnciraltı, İzmir, Turkey
  - Bioinformatics Platform, İzmir Biomedicine and Genome Center (IBG), İnciraltı, İzmir, Turkey
- Alirıza Arıbaş
  - Bioinformatics Platform, İzmir Biomedicine and Genome Center (IBG), İnciraltı, İzmir, Turkey
- Gökhan Karakülah
  - İzmir International Biomedicine and Genome Institute, Dokuz Eylül University, İnciraltı, İzmir, Turkey
  - Bioinformatics Platform, İzmir Biomedicine and Genome Center (IBG), İnciraltı, İzmir, Turkey
28
Almeida da Paz M, Taher L. T3E: a tool for characterising the epigenetic profile of transposable elements using ChIP-seq data. Mob DNA 2022; 13:29. [PMID: 36451223 PMCID: PMC9710123 DOI: 10.1186/s13100-022-00285-z] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Received: 06/03/2022] [Accepted: 11/14/2022] [Indexed: 12/05/2022] [Open Access]
Abstract
BACKGROUND Despite the advent of Chromatin Immunoprecipitation Sequencing (ChIP-seq) having revolutionised our understanding of the mammalian genome's regulatory landscape, many challenges remain. In particular, because of their repetitive nature, the sequencing reads derived from transposable elements (TEs) pose a real bioinformatics challenge, to the point that standard analysis pipelines typically ignore reads whose genomic origin cannot be unambiguously ascertained. RESULTS We show that discarding ambiguously mapping reads may lead to a systematic underestimation of the number of reads associated with young TE families/subfamilies. We also provide evidence suggesting that the strategy of randomly permuting the location of the read mappings (or the TEs) that is often used to compute the background for enrichment calculations at TE families/subfamilies can result in both false positive and negative enrichments. To address these problems, we present the Transposable Element Enrichment Estimator (T3E), a tool that makes use of ChIP-seq data to characterise the epigenetic profile of associated TE families/subfamilies. T3E weights the number of read mappings assigned to the individual TE copies of a family/subfamily by the overall number of genomic loci to which the corresponding reads map, and this is done at the single nucleotide level. In addition, T3E computes ChIP-seq enrichment relative to a background estimated based on the distribution of the read mappings in the input control DNA. We demonstrated the capabilities of T3E on 23 different ChIP-seq libraries. T3E identified enrichments that were consistent with previous studies. Furthermore, T3E detected context-specific enrichments that are likely to pinpoint unexplored TE families/subfamilies with individual TE copies that have been frequently exapted as cis-regulatory elements during the evolution of mammalian regulatory networks. CONCLUSIONS T3E is a novel open-source computational tool (available for use at: https://github.com/michelleapaz/T3E) that overcomes some of the pitfalls associated with the analysis of ChIP-seq data arising from the repetitive mammalian genome and provides a framework to shed light on the epigenetics of entire TE families/subfamilies.
Affiliation(s)
- Leila Taher
  - Institute of Biomedical Informatics, Graz University of Technology, Graz, Austria
29
Miyao A, Yamanouchi U. Transposable element finder (TEF): finding active transposable elements from next generation sequencing data. BMC Bioinformatics 2022; 23:500. [PMID: 36418944 PMCID: PMC9682801 DOI: 10.1186/s12859-022-05011-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/29/2022] [Accepted: 10/26/2022] [Indexed: 11/24/2022] [Open Access]
Abstract
BACKGROUND Detecting new transposition events of transposable elements (TEs) from next-generation sequencing (NGS) data is difficult because the genome already contains older TE copies at many sites. The previously reported Transposon Insertion Finder (TIF) detects TE transpositions on the reference genome from NGS short reads using the end sequences of the target TE. TIF requires the sequence of the target TE and cannot detect transpositions of TEs with an unknown sequence. RESULT The new algorithm Transposable Element Finder (TEF) enables the detection of TE transpositions, even for TEs with an unknown sequence. TEF finds the transposed TEs themselves, whereas TIF detects the insertion sites of TEs with a known sequence. A transposition event is often accompanied by a target site duplication (TSD). Focusing on TSDs, we report two algorithms that detect both ends of a TE, its TSDs, and its target sites. One is based on grouping by TSDs and direct comparison of k-mers from NGS data, without similarity search. The other is based on junction mapping of candidate TE end sequences. Both methods succeeded in detecting both ends and the TSDs of known active TEs in several tests with rice, Arabidopsis and Drosophila data, and discovered several new TEs in new locations. PCR confirmed the detected transpositions of TEs in several test cases in rice. CONCLUSIONS By directly comparing NGS data between two samples, TEF detects transposed TEs together with the TSDs produced by transposition, the sequences of both TE ends, and the insertion positions. Genotypes of transpositions are verified by counting head and tail junctions and non-insertion sequences in NGS reads. TEF is easy to run and independent of any TE library, which makes it useful for detecting insertions of unknown TEs that are missed by common TE annotation pipelines.
Affiliation(s)
- Akio Miyao
  - Institute of Crop Science, National Agriculture and Food Research Organization, 2-1-2, Kannondai, Tsukuba, Ibaraki 305-8518 Japan
- Utako Yamanouchi
  - Institute of Crop Science, National Agriculture and Food Research Organization, 2-1-2, Kannondai, Tsukuba, Ibaraki 305-8518 Japan
30
Angileri KM, Bagia NA, Feschotte C. Transposon control as a checkpoint for tissue regeneration. Development 2022; 149:dev191957. [PMID: 36440631 PMCID: PMC10655923 DOI: 10.1242/dev.191957] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/11/2022] [Accepted: 10/03/2022] [Indexed: 11/29/2022]
Abstract
Tissue regeneration requires precise temporal control of cellular processes such as inflammatory signaling, chromatin remodeling and proliferation. The combination of these processes forms a unique microenvironment permissive to the expression, and potential mobilization of, transposable elements (TEs). Here, we develop the hypothesis that TE activation creates a barrier to tissue repair that must be overcome to achieve successful regeneration. We discuss how uncontrolled TE activity may impede tissue restoration and review mechanisms by which TE activity may be controlled during regeneration. We posit that the diversification and co-evolution of TEs and host control mechanisms may contribute to the wide variation in regenerative competency across tissues and species.
Affiliation(s)
- Krista M. Angileri
  - Department of Molecular Biology and Genetics, Cornell University, 526 Campus Rd, Ithaca, NY 14850, USA
- Nornubari A. Bagia
  - Department of Molecular Biology and Genetics, Cornell University, 526 Campus Rd, Ithaca, NY 14850, USA
- Cedric Feschotte
  - Department of Molecular Biology and Genetics, Cornell University, 526 Campus Rd, Ithaca, NY 14850, USA
31
Savage AL, Iacoangeli A, Schumann GG, Rubio-Roldan A, Garcia-Perez JL, Al Khleifat A, Koks S, Bubb VJ, Al-Chalabi A, Quinn JP. Characterisation of retrotransposon insertion polymorphisms in whole genome sequencing data from individuals with amyotrophic lateral sclerosis. Gene 2022; 843:146799. [PMID: 35963498 DOI: 10.1016/j.gene.2022.146799] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/24/2022] [Revised: 07/15/2022] [Accepted: 08/05/2022] [Indexed: 11/15/2022]
Abstract
The genetics of an individual is a crucial factor in understanding the risk of developing the neurodegenerative disease amyotrophic lateral sclerosis (ALS). There is still a large proportion of the heritability of ALS, particularly in sporadic cases, to be understood. Among others, active transposable elements drive inter-individual variability, and in humans long interspersed element 1 (LINE1, L1), Alu and SINE-VNTR-Alu (SVA) retrotransposons are a source of polymorphic insertions in the population. We undertook a pilot study to characterise the landscape of non-reference retrotransposon insertion polymorphisms (non-ref RIPs) in 15 control and 15 ALS individuals' whole genomes from Project MinE, an international project to identify potential genetic causes of ALS. The combination of two bioinformatics tools (mobile element locator tool (MELT) and TEBreak) identified on average 1250 Alu, 232 L1 and 77 SVA non-ref RIPs per genome across the 30 analysed. Further PCR validation of individual polymorphic retrotransposon insertions showed a similar level of accuracy for MELT and TEBreak. Our preliminary study did not identify a specific RIP or a significant difference in the total number of non-ref RIPs in ALS compared to control genomes. The use of multiple bioinformatic tools improved the accuracy of non-ref RIP detection and our study highlights the potential importance of studying these elements further in ALS.
Affiliation(s)
- Abigail L Savage
  - Department of Pharmacology and Therapeutics, Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 3BX, UK
- Alfredo Iacoangeli
  - Maurice Wohl Clinical Neuroscience Institute, Institute of Psychiatry, Psychology & Neuroscience, King's College London, London SE5 9RT, UK
  - Department of Biostatistics and Health Informatics, Institute of Psychiatry, Psychology & Neuroscience, King's College London, London SE5 8AF, UK
- Gerald G Schumann
  - Division of Medical Biotechnology, Paul-Ehrlich-Institut, Langen 63225, Germany
- Alejandro Rubio-Roldan
  - Department of Genomic Medicine and Department of Oncology, GENYO, Centre for Genomics & Oncology, PTS Granada, 18007, Spain
- Jose L Garcia-Perez
  - Department of Genomic Medicine and Department of Oncology, GENYO, Centre for Genomics & Oncology, PTS Granada, 18007, Spain
  - MRC-HGU Institute of Genetics and Molecular Medicine, University of Edinburgh, Edinburgh EH4 2XU, UK
- Ahmad Al Khleifat
  - Maurice Wohl Clinical Neuroscience Institute, Institute of Psychiatry, Psychology & Neuroscience, King's College London, London SE5 9RT, UK
- Sulev Koks
  - Perron Institute for Neurological and Translational Science, Perth, Western Australia 6009, Australia
  - Centre for Molecular Medicine and Innovative Therapeutics, Murdoch University, Perth, Western Australia 6150, Australia
- Vivien J Bubb
  - Department of Pharmacology and Therapeutics, Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 3BX, UK
- Ammar Al-Chalabi
  - Maurice Wohl Clinical Neuroscience Institute, Institute of Psychiatry, Psychology & Neuroscience, King's College London, London SE5 9RT, UK
  - Department of Neurology, King's College Hospital, London SE5 9RS, UK
- John P Quinn
  - Department of Pharmacology and Therapeutics, Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 3BX, UK
32
Wang J, Ren M, Yu J, Hu M, Wang X, Ma W, Jiang X, Cui J. Single-cell RNA sequencing highlights the functional role of human endogenous retroviruses in gallbladder cancer. EBioMedicine 2022; 85:104319. [PMCID: PMC9626538 DOI: 10.1016/j.ebiom.2022.104319] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/06/2022] [Revised: 09/05/2022] [Accepted: 10/11/2022] [Indexed: 11/11/2022] [Open Access]
Abstract
Background Gallbladder cancer (GBC), the most common malignancy of the biliary tract, shows late diagnosis and low survival rate and requires continued search for new diagnostic biomarkers and therapeutic targets. Human endogenous retroviruses (HERVs) are specifically prone to be reactivated in diverse cancers and are implicated in cancer progression and immunotherapy. Methods Single-cell RNA sequencing was performed on tumor tissues and paired adjacent tissues from 4 GBC patients. Dual-luciferase reporter assay was applied to measure enhancer activity of HERV sequences. Findings We dissected the cellular diversity and described the HERV transcriptomic landscape for GBC. We found that HERVs were transcribed in a cell type-specific manner and different HERV families were associated with diverse biological effects. HERVs could function as enhancers, presumably causing altered expression of neighboring genes. The transcription level of HERVH was gradually elevated with the malignant transformation of epithelial cells, suggesting HERVH may be a potential early diagnostic biomarker of GBC. HHLA2, a newly emerging immune checkpoint, was derived by HERVH, exhibited an expressional correlation with HERVH, and was identified as a promising target for immunotherapy. Interpretation Exploring the transcriptional landscape and potential functional impact of HERVs highlights the important role of HERVs in GBC and provides a fresh perspective on managing GBC. Funding This study was supported by the National Natural Science Foundation of China (31970176, 81972256) and the research grants from the Innovation Capacity Building Project of Jiangsu province (BM2020019).
Key Words
- gallbladder cancer
- single-cell RNA sequencing
- human endogenous retrovirus
- enhancer
- immune checkpoint
- HERVH
- GBC, gallbladder cancer
- HERV, human endogenous retrovirus
- scRNA-seq, single-cell RNA sequencing
- TME, tumor microenvironment
- WTA, whole transcriptome analysis
- DEG, differentially expressed gene
- CNV, copy number variation
- GO, gene ontology
- NK cell, natural killer cell
- NKT cell, natural killer T cell
- DC, dendritic cell
- ICS, intermediate cell state
- HHLA2, human endogenous retrovirus-H long terminal repeat-associating 2
- CD4+ Th cell, CD4+ T helper cell
- IgG, immunoglobulin G
- cDC, conventional DC
- Mo-DC, monocyte-derived DC
- CAF, cancer-associated fibroblast
- ECM, extracellular matrix
- iCAF, inflammatory CAF
- myoCAF, myo-cancer-associated fibroblast
- TE, transposable element
Affiliation(s)
- Jinghan Wang
  - Department of Hepatobiliary and Pancreatic Surgery, Shanghai East Hospital, Tongji University School of Medicine, Shanghai 200120, China
- Meng Ren
  - CAS Key Laboratory of Molecular Virology and Immunology, Institut Pasteur of Shanghai, Chinese Academy of Sciences, Shanghai 200031, China
  - Nanjing Advanced Academy of Life and Health, Nanjing 211135, China
- Jundan Yu
  - CAS Key Laboratory of Molecular Virology and Immunology, Institut Pasteur of Shanghai, Chinese Academy of Sciences, Shanghai 200031, China
  - University of Chinese Academy of Sciences, Beijing 100049, China
- Mingtai Hu
  - Department of Hepatobiliary and Pancreatic Surgery, Shanghai East Hospital, Tongji University School of Medicine, Shanghai 200120, China
- Xiaojing Wang
  - CAS Key Laboratory of Molecular Virology and Immunology, Institut Pasteur of Shanghai, Chinese Academy of Sciences, Shanghai 200031, China
  - University of Chinese Academy of Sciences, Beijing 100049, China
- Wencong Ma
  - Department of Hepatobiliary and Pancreatic Surgery, Shanghai East Hospital, Tongji University School of Medicine, Shanghai 200120, China
- Xiaoqing Jiang (corresponding author)
  - Department of Biliary Tract Surgery I, the Third Hospital of Naval Medical University, Shanghai 200438, China
- Jie Cui (corresponding author)
  - CAS Key Laboratory of Molecular Virology and Immunology, Institut Pasteur of Shanghai, Chinese Academy of Sciences, Shanghai 200031, China
  - Nanjing Advanced Academy of Life and Health, Nanjing 211135, China
33
Di Stefano L. All Quiet on the TE Front? The Role of Chromatin in Transposable Element Silencing. Cells 2022; 11:cells11162501. [PMID: 36010577 PMCID: PMC9406493 DOI: 10.3390/cells11162501] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/27/2022] [Revised: 07/27/2022] [Accepted: 08/03/2022] [Indexed: 01/09/2023] [Open Access]
Abstract
Transposable elements (TEs) are mobile genetic elements that constitute a sizeable portion of many eukaryotic genomes. Through their mobility, they represent a major source of genetic variation, and their activation can cause genetic instability and has been linked to aging, cancer and neurodegenerative diseases. Accordingly, tight regulation of TE transcription is necessary for normal development. Chromatin is at the heart of TE regulation; however, we still lack a comprehensive understanding of the precise role of chromatin marks in TE silencing and how chromatin marks are established and maintained at TE loci. In this review, I discuss evidence documenting the contribution of chromatin-associated proteins and histone marks in TE regulation across different species with an emphasis on Drosophila and mammalian systems.
Affiliation(s)
- Luisa Di Stefano
- Molecular, Cellular and Developmental Biology Department (MCD), Centre de Biologie Intégrative (CBI), University of Toulouse, CNRS, UPS, 31062 Toulouse, France
|
34
|
Yokoi K, Kimura K, Bono H. Revealing Landscapes of Transposable Elements in Apis Species by Meta-Analysis. INSECTS 2022; 13:insects13080698. [PMID: 36005323 PMCID: PMC9408917 DOI: 10.3390/insects13080698] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Received: 06/20/2022] [Revised: 07/29/2022] [Accepted: 08/01/2022] [Indexed: 12/04/2022]
Abstract
Transposable elements (TEs) are grouped into several families with diverse sequences. Owing to this diversity, the detection, classification, and annotation of TEs are difficult tasks. Moreover, simple comparisons of TEs among different species with different methods can lead to misinterpretations. The genome data of several honey bee (Apis) species are available in public databases. Therefore, we conducted a meta-analysis of TEs, using 11 sets of genome data for Apis species, in order to establish a “landscape of TEs”. Consensus TE sequences were constructed and their distributions in the Apis genomes were determined. Our results showed that four to seven of the TE families, among the 13 and 15 families detected in classes I and II, respectively, consisted mainly of Apis TEs, and that more DNA/TcMar-Mariner consensus sequences and copies were present in all Apis genomes tested. In addition, more consensus sequences and copy numbers of DNA/TcMar-Mariner were detected in Apis mellifera than in other Apis species. These results suggest that TcMar-Mariner might exert A. mellifera-specific effects on the host A. mellifera species. In conclusion, our unified approach enabled comparison of Apis genome sequences to determine the TE landscape, which provides novel evolutionary insights into Apis species.
Affiliation(s)
- Kakeru Yokoi
- Insect Design Technology Group, Division of Insect Advanced Technology, Institute of Agrobiological Sciences, National Agriculture and Food Research Organization (NARO), 1-2 Owashi, Tsukuba, Ibaraki 305-8634, Japan
- Correspondence: Tel.: +81-29-838-6129
| | - Kiyoshi Kimura
- Smart Livestock Facilities Group, Division of Advanced Feeding Technology Research, National Institute of Livestock and Grassland Science (NILGS), National Agriculture and Food Research Organization (NARO), Tsukuba, 2 Ikenodai, Tsukuba, Ibaraki 305-0901, Japan
| | - Hidemasa Bono
- Laboratory of BioDX, Genome Editing Innovation Center, Hiroshima University, 3-10-23 Kagamiyama, Higashi-Hiroshima City, Hiroshima 739-0046, Japan
- Laboratory of Genome Informatics, Graduate School of Integrated Sciences for Life, Hiroshima University, 3-10-23 Kagamiyama, Higashi-Hiroshima City, Hiroshima 739-0046, Japan
|
35
|
Lee Y, Ha U, Moon S. Ongoing endeavors to detect mobilization of transposable elements. BMB Rep 2022. [PMID: 35725016 PMCID: PMC9340088 DOI: 10.5483/bmbrep.2022.55.7.088] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/20/2022]
Abstract
Transposable elements (TEs) are DNA sequences capable of mobilization from one location to another in the genome. Since the discovery of the ‘Dissociation (Ds) locus’ by Barbara McClintock in maize (1), mounting evidence in the era of genomics indicates that a significant fraction of most eukaryotic genomes is composed of TE sequences, which are involved in various biological processes such as development, physiology, disease and evolution. Although technical advances in genomics have revealed numerous functional impacts of TEs across species, our understanding of TEs is still incomplete owing to challenges resulting from the complexity and abundance of TEs in the genome. In this mini-review, we briefly summarize the biology of TEs and their impacts on the host genome, emphasizing the importance of understanding the TE landscape in the genome. Then, we introduce recent endeavors, especially in vivo retrotransposition assays and long-read sequencing technology for identifying de novo insertions and TE polymorphisms, which will broaden our knowledge of the extraordinary relationship between genomic cohabitants and their host.
Affiliation(s)
- Yujeong Lee
- Department of Biological Sciences, Kangwon National University, Chuncheon 24341, Korea
| | - Una Ha
- Department of Biological Sciences, Kangwon National University, Chuncheon 24341, Korea
| | - Sungjin Moon
- Department of Biological Sciences, Kangwon National University, Chuncheon 24341, Korea
|
36
|
Fueyo R, Judd J, Feschotte C, Wysocka J. Roles of transposable elements in the regulation of mammalian transcription. Nat Rev Mol Cell Biol 2022; 23:481-497. [PMID: 35228718 PMCID: PMC10470143 DOI: 10.1038/s41580-022-00457-y] [Citation(s) in RCA: 110] [Impact Index Per Article: 55.0] [Accepted: 01/25/2022] [Indexed: 12/16/2022]
Abstract
Transposable elements (TEs) comprise about half of the mammalian genome. TEs often contain sequences capable of recruiting the host transcription machinery, which they use to express their own products and promote transposition. However, the regulatory sequences carried by TEs may affect host transcription long after the TEs have lost the ability to transpose. Recent advances in genome analysis and engineering have facilitated systematic interrogation of the regulatory activities of TEs. In this Review, we discuss diverse mechanisms by which TEs contribute to transcription regulation. Notably, TEs can donate enhancer and promoter sequences that influence the expression of host genes, modify 3D chromatin architecture and give rise to novel regulatory genes, including non-coding RNAs and transcription factors. We discuss how TEs spur regulatory evolution and facilitate the emergence of genetic novelties in mammalian physiology and development. By virtue of their repetitive and interspersed nature, TEs offer unique opportunities to dissect the effects of mutation and genomic context on the function and evolution of cis-regulatory elements. We argue that TE-centric studies hold the key to unlocking general principles of transcription regulation and evolution.
Affiliation(s)
- Raquel Fueyo
- Department of Chemical and Systems Biology, Stanford University School of Medicine, Stanford, CA, USA
| | - Julius Judd
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY, USA
| | - Cedric Feschotte
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY, USA.
| | - Joanna Wysocka
- Department of Chemical and Systems Biology, Stanford University School of Medicine, Stanford, CA, USA.
- Department of Developmental Biology, Stanford University School of Medicine, Stanford, CA, USA.
- Institute for Stem Cell Biology and Regenerative Medicine, Stanford University School of Medicine, Stanford, CA, USA.
- Howard Hughes Medical Institute, Stanford University School of Medicine, Stanford, CA, USA.
|
37
|
Riehl K, Riccio C, Miska EA, Hemberg M. TransposonUltimate: software for transposon classification, annotation and detection. Nucleic Acids Res 2022; 50:e64. [PMID: 35234904 PMCID: PMC9226531 DOI: 10.1093/nar/gkac136] [Citation(s) in RCA: 26] [Impact Index Per Article: 13.0] [Received: 06/09/2021] [Revised: 02/09/2022] [Accepted: 02/14/2022] [Indexed: 12/17/2022]
Abstract
Most genomes harbor a large number of transposons, and they play an important role in evolution and gene regulation. They are also of interest to clinicians as they are involved in several diseases, including cancer and neurodegeneration. Although several methods for transposon identification are available, they are often highly specialised towards specific tasks or classes of transposons, and they lack common standards such as a unified taxonomy scheme and output file format. We present TransposonUltimate, a powerful bundle of three modules for transposon classification, annotation, and detection of transposition events. TransposonUltimate comes as a Conda package under the GPL-3.0 licence, is well documented, and is easy to install through https://github.com/DerKevinRiehl/TransposonUltimate. We benchmark the classification module on the large TransposonDB covering 891,051 sequences to demonstrate that it outperforms the currently best existing solutions. The annotation and detection modules combine sixteen existing software tools, and we illustrate their use by annotating Caenorhabditis elegans, Rhizophagus irregularis and Oryza sativa subsp. japonica genomes. Finally, we use the detection module to discover 29 554 transposition events in the genomes of 20 wild type strains of C. elegans. Databases, assemblies, annotations and further findings can be downloaded from https://doi.org/10.5281/zenodo.5518085.
Affiliation(s)
- Kevin Riehl
- Gurdon Institute, University of Cambridge, Cambridge CB2 1QN, UK
| | - Cristian Riccio
- Gurdon Institute, University of Cambridge, Cambridge CB2 1QN, UK
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton CB10 1SA, UK
| | - Eric A Miska
- Gurdon Institute, University of Cambridge, Cambridge CB2 1QN, UK
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton CB10 1SA, UK
- Department of Genetics, University of Cambridge, Downing Street, Cambridge CB2 3EH, UK
| | - Martin Hemberg
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton CB10 1SA, UK
- Evergrande Center for Immunologic Diseases, Harvard Medical School and Brigham and Women’s Hospital, 75 Francis Street, Boston, MA 02215, USA
|
38
|
Femenias MM, Santos JC, Sites JW, Avila LJ, Morando M. ExplorATE: A new pipeline to explore active transposable elements from RNA-seq data. Bioinformatics 2022; 38:3361-3366. [PMID: 35608310 DOI: 10.1093/bioinformatics/btac354] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/15/2021] [Revised: 05/03/2022] [Accepted: 05/19/2022] [Indexed: 11/12/2022]
Abstract
Motivation: Transposable elements (TEs) are ubiquitous in genomes and many remain active. TEs comprise an important fraction of the transcriptomes with potential effects on the host genome, either by generating deleterious mutations or promoting evolutionary novelties. However, their functional study is limited by the difficulty in their identification and quantification, particularly in non-model organisms. Results: We developed a new pipeline (ExplorATE or Explore Active Transposable Elements) implemented in R and bash that allows the quantification of active TEs in both model and non-model organisms. ExplorATE creates TE-specific indexes and uses Selective Alignment (SA) to filter out co-transcribed transposons within genes based on alignment scores. Moreover, our software incorporates Wicker-like criteria to refine a set of target TEs and avoid spurious mapping. Based on simulated and real data, we show that the SA strategy adopted by ExplorATE achieved better estimates of non-co-transcribed elements than other available alignment-based or mapping-based software. ExplorATE results showed high congruence with alignment-based tools with and without a reference genome, yet ExplorATE required less execution time. Likewise, ExplorATE expands and complements most previous TE analyses by incorporating the co-transcription and multi-mapping effects during quantification, and provides a seamless integration with other downstream tools within the R environment. Availability: Source code is available at https://github.com/FemeniasM/ExplorATEproject and https://github.com/FemeniasM/ExplorATE_shell_script. Supplementary information: Supplementary data are available at Bioinformatics online.
Affiliation(s)
- Martin M Femenias
- Instituto Patagónico para el Estudio de los Ecosistemas Continentales (IPEEC-CONICET), Boulevard Almirante Brown 2915, Puerto Madryn, CT U9120ACD, Argentina
| | - Juan C Santos
- Department of Biological Sciences, St. John's University, Queens, NY, 11439, USA
| | - Jack W Sites
- Department of Biology and M.L. Bean Life Science Museum, Brigham Young University (BYU), Provo, UT, 84602, USA
| | - Luciano J Avila
- Instituto Patagónico para el Estudio de los Ecosistemas Continentales (IPEEC-CONICET), Boulevard Almirante Brown 2915, Puerto Madryn, CT U9120ACD, Argentina
| | - Mariana Morando
- Instituto Patagónico para el Estudio de los Ecosistemas Continentales (IPEEC-CONICET), Boulevard Almirante Brown 2915, Puerto Madryn, CT U9120ACD, Argentina
|
39
|
Genome-Wide Screening of Transposable Elements in the Whitefly, Bemisia tabaci (Hemiptera: Aleyrodidae), Revealed Insertions with Potential Insecticide Resistance Implications. INSECTS 2022; 13:insects13050396. [PMID: 35621732 PMCID: PMC9143410 DOI: 10.3390/insects13050396] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/15/2022] [Revised: 04/13/2022] [Accepted: 04/15/2022] [Indexed: 12/10/2022]
Abstract
Simple Summary: Transposable elements (TEs) are mobile DNA sequences hosted in the genomes of various organisms. These elements have the ability to mediate regulatory changes, which can result in changes in gene expression. Bemisia tabaci is an important agricultural pest that has been linked to several cases of insecticide resistance. In this study, we conducted a genome-wide screening of TEs in the B. tabaci genome using bioinformatics tools. Results revealed a total of 1,292,393 TE copies clustered into 4872 lineages. The TE insertion site analysis revealed 94 insertions within or near defensome genes.
Abstract: Transposable elements (TEs) are genetically mobile units that move from one site to another within a genome. These units can mediate regulatory changes that can result in massive changes in gene expression. In fact, a precise identification of TEs can allow the detection of the mechanisms involving these elements in gene regulation and genome evolution. In the present study, a genome-wide analysis of the Hemipteran pest Bemisia tabaci was conducted using bioinformatics tools to identify, annotate and estimate the age of TEs, in addition to their insertion sites within or near the defensome genes involved in insecticide resistance. Overall, 1,292,393 TE copies were identified in the B. tabaci genome, grouped into 4872 lineages. A total of 699 lineages were found to belong to Class I of TEs, 1348 belonged to Class II, and 2825 were uncategorized and form the largest part of TEs (28.81%). The TE age estimation revealed that the oldest TE invasion happened 14 million years ago (MYA) and the most recent occurred 0.2 MYA with the insertion of Class II TEs. The analysis of TE insertion sites in defensome genes revealed 94 insertions. Six of these TE insertions were found within or near previously identified differentially expressed insecticide resistance genes. These insertions may have a potential role in the observed insecticide resistance in these pests.
|
40
|
Storer JM, Hubley R, Rosen J, Smit AFA. Methodologies for the De novo Discovery of Transposable Element Families. Genes (Basel) 2022; 13:709. [PMID: 35456515 PMCID: PMC9025800 DOI: 10.3390/genes13040709] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Received: 03/23/2022] [Revised: 04/14/2022] [Accepted: 04/15/2022] [Indexed: 02/07/2023]
Abstract
The discovery and characterization of transposable element (TE) families are crucial tasks in the process of genome annotation. Careful curation of TE libraries for each organism is necessary as each has been exposed to a unique and often complex set of TE families. De novo methods have been developed; however, a fully automated and accurate approach to the development of complete libraries remains elusive. In this review, we cover established methods and recent developments in de novo TE analysis. We also present various methodologies used to assess these tools and discuss opportunities for further advancement of the field.
Affiliation(s)
- Arian F. A. Smit
- Institute for Systems Biology, Seattle, WA 98109, USA (J.M.S., R.H., J.R.)
|
41
|
Rech GE, Radío S, Guirao-Rico S, Aguilera L, Horvath V, Green L, Lindstadt H, Jamilloux V, Quesneville H, González J. Population-scale long-read sequencing uncovers transposable elements associated with gene expression variation and adaptive signatures in Drosophila. Nat Commun 2022; 13:1948. [PMID: 35413957 PMCID: PMC9005704 DOI: 10.1038/s41467-022-29518-8] [Citation(s) in RCA: 31] [Impact Index Per Article: 15.5] [Received: 03/22/2021] [Accepted: 03/15/2022] [Indexed: 12/16/2022]
Abstract
High quality reference genomes are crucial to understanding genome function, structure and evolution. The availability of reference genomes has allowed us to start inferring the role of genetic variation in biology, disease, and biodiversity conservation. However, analyses across organisms demonstrate that a single reference genome is not enough to capture the global genetic diversity present in populations. In this work, we generate 32 high-quality reference genomes for the well-known model species D. melanogaster and focus on the identification and analysis of transposable element variation as they are the most common type of structural variant. We show that integrating the genetic variation across natural populations from five climatic regions increases the number of detected insertions by 58%. Moreover, 26% to 57% of the insertions identified using long-reads were missed by short-reads methods. We also identify hundreds of transposable elements associated with gene expression variation and new TE variants likely to contribute to adaptive evolution in this species. Our results highlight the importance of incorporating the genetic variation present in natural populations to genomic studies, which is essential if we are to understand how genomes function and evolve. Even in well-studied species, there is still substantial natural genetic variation that has not been characterized. Here, the authors use long read sequencing to discover transposable elements in the Drosophila genome not detected by short read sequencing, and link them to gene expression.
Affiliation(s)
- Gabriel E Rech
- Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), 08003, Barcelona, Spain
| | - Santiago Radío
- Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), 08003, Barcelona, Spain
| | - Sara Guirao-Rico
- Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), 08003, Barcelona, Spain
| | - Laura Aguilera
- Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), 08003, Barcelona, Spain
| | - Vivien Horvath
- Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), 08003, Barcelona, Spain
| | - Llewellyn Green
- Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), 08003, Barcelona, Spain
| | - Hannah Lindstadt
- Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), 08003, Barcelona, Spain
| | - Josefa González
- Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), 08003, Barcelona, Spain.
|
42
|
Zaha S, Sakamoto Y, Nagasawa S, Sugano S, Suzuki A, Suzuki Y, Seki M. Whole-genome Methylation Analysis of APOBEC Enzyme-converted DNA (~5 kb) by Nanopore Sequencing. Bio Protoc 2022; 12:e4345. [PMID: 35592605 PMCID: PMC8918215 DOI: 10.21769/bioprotoc.4345] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/16/2022] [Revised: 10/24/2021] [Accepted: 01/18/2022] [Indexed: 12/29/2022]
Abstract
In recent years, DNA methylation research has been accelerated by the advent of nanopore sequencers. However, read length has been limited by the constraints of base conversion using the bisulfite method, making analysis of chromatin content difficult. The read length of the previous method combining bisulfite conversion and long-read sequencing was ~1.5 kb, even using targeted PCR. In this study, we have improved read length (~5 kb), by converting unmethylated cytosines to uracils with APOBEC enzymes, to reduce DNA fragmentation. The converted DNA was then sequenced using a PromethION nanopore sequencer. We have also developed a new analysis pipeline that accounts for base conversions, which are not present in conventional nanopore sequencing, as well as errors produced by nanopore sequencing.
Affiliation(s)
- Sumio Sugano
- Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, The University of Tokyo, Kashiwa, Chiba, Japan
- Institute of Kashiwa-no-ha Omics Gate, Kashiwa, Chiba, Japan
|
43
|
Evolutionary Genetics of Cacti: Research Biases, Advances and Prospects. Genes (Basel) 2022; 13:genes13030452. [PMID: 35328006 PMCID: PMC8952820 DOI: 10.3390/genes13030452] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Received: 01/25/2022] [Revised: 02/22/2022] [Accepted: 02/25/2022] [Indexed: 02/01/2023]
Abstract
Here, we present a review of the studies of evolutionary genetics (phylogenetics, population genetics, and phylogeography) using genetic data as well as genome scale assemblies in Cactaceae (Caryophyllales, Angiosperms), a major lineage of succulent plants with astonishing diversity on the American continent. To this end, we performed a literature survey (1992–2021) to obtain detailed information regarding key aspects of studies investigating cactus evolution. Specifically, we summarize the advances in the following aspects: molecular markers, species delimitation, phylogenetics, hybridization, biogeography, and genome assemblies. In brief, we observed substantial growth in the studies conducted with molecular markers in the past two decades. However, we found biases in taxonomic/geographic sampling and the use of traditional markers and statistical approaches. We discuss some methodological and social challenges for engaging the cactus community in genomic research. We also stressed the importance of integrative approaches, coalescent methods, and international collaboration to advance the understanding of cactus evolution.
|
44
|
Niu Y, Teng X, Zhou H, Shi Y, Li Y, Tang Y, Zhang P, Luo H, Kang Q, Xu T, He S. Characterizing mobile element insertions in 5675 genomes. Nucleic Acids Res 2022; 50:2493-2508. [PMID: 35212372 PMCID: PMC8934628 DOI: 10.1093/nar/gkac128] [Citation(s) in RCA: 15] [Impact Index Per Article: 7.5] [Received: 07/26/2021] [Revised: 02/07/2022] [Accepted: 02/11/2022] [Indexed: 12/30/2022]
Abstract
Mobile element insertions (MEIs) are a major class of structural variants (SVs) and have been linked to many human genetic disorders, including hemophilia, neurofibromatosis, and various cancers. However, human MEI resources from large-scale genome sequencing are still lacking compared to those for SNPs and SVs. Here, we report a comprehensive map of 36 699 non-reference MEIs constructed from 5675 genomes, comprising 2998 Chinese samples (∼26.2×, NyuWa) and 2677 samples from the 1000 Genomes Project (∼7.4×, 1KGP). We discovered that LINE-1 insertions were highly enriched in centromere regions, implying the role of chromosome context in retroelement insertion. After functional annotation, we estimated that MEIs are responsible for about 9.3% of all protein-truncating events per genome. Finally, we built a companion database named HMEID for public use. This resource represents the latest and largest genomewide study on MEIs and will have broad utility for exploration of human MEI findings.
Affiliation(s)
- Yiwei Niu
- Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China; College of Life Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
| | - Xueyi Teng
- Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100049, China
| | - Honghong Zhou
- Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China
| | - Yirong Shi
- Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100049, China
| | - Yanyan Li
- Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China; College of Life Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
| | - Yiheng Tang
- Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100049, China
| | - Peng Zhang
- Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China
| | - Huaxia Luo
- Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China
| | - Quan Kang
- Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China
| | - Tao Xu
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing 100049, China; National Laboratory of Biomacromolecules, CAS Center for Excellence in Biomacromolecules, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China
| | - Shunmin He
- Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China; College of Life Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
|
45
|
Russo A, Mayjonade B, Frei D, Potente G, Kellenberger RT, Frachon L, Copetti D, Studer B, Frey JE, Grossniklaus U, Schlüter PM. Low-Input High-Molecular-Weight DNA Extraction for Long-Read Sequencing From Plants of Diverse Families. FRONTIERS IN PLANT SCIENCE 2022; 13:883897. [PMID: 35665166 PMCID: PMC9161206 DOI: 10.3389/fpls.2022.883897] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Received: 02/25/2022] [Accepted: 04/21/2022] [Indexed: 05/16/2023]
Abstract
Long-read DNA sequencing technologies require high molecular weight (HMW) DNA of adequate purity and integrity, which can be difficult to isolate from plant material. Plant leaves usually contain high levels of carbohydrates and secondary metabolites that can impact DNA purity, affecting downstream applications. Several protocols and kits are available for HMW DNA extraction, but they usually require a high amount of input material and often lead to substantial DNA fragmentation, making sequencing suboptimal in terms of read length and data yield. We here describe a protocol for plant HMW DNA extraction from low input material (0.1 g) which is easy to follow and quick (2.5 h). This method successfully enabled us to extract HMW from four species from different families (Orchidaceae, Poaceae, Brassicaceae, Asteraceae). In the case of recalcitrant species, we show that an additional purification step is sufficient to deliver a clean DNA sample. We demonstrate the suitability of our protocol for long-read sequencing on the Oxford Nanopore Technologies PromethION® platform, with and without the use of a short fragment depletion kit.
Affiliation(s)
- Alessia Russo
- Department of Plant and Microbial Biology and Zurich-Basel Plant Science Centre, University of Zurich, Zurich, Switzerland
- Department of Plant Evolutionary Biology, Institute of Biology, University of Hohenheim, Stuttgart, Germany
- Department of Systematic and Evolutionary Botany and Zurich-Basel Plant Science Centre, University of Zurich, Zurich, Switzerland
- Correspondence: Alessia Russo
| | - Baptiste Mayjonade
- Laboratoire des Interactions Plantes Microbes Environnement (LIPME), INRAE, Toulouse, France
| | - Daniel Frei
- Department of Method Development and Analytics, Agroscope, Wädenswil, Switzerland
| | - Giacomo Potente
- Department of Systematic and Evolutionary Botany and Zurich-Basel Plant Science Centre, University of Zurich, Zurich, Switzerland
| | - Léa Frachon
- Department of Systematic and Evolutionary Botany and Zurich-Basel Plant Science Centre, University of Zurich, Zurich, Switzerland
| | - Dario Copetti
- Institute of Agricultural Sciences and Zurich-Basel Plant Science Centre, ETH Zürich, Zurich, Switzerland
| | - Bruno Studer
- Institute of Agricultural Sciences and Zurich-Basel Plant Science Centre, ETH Zürich, Zurich, Switzerland
| | - Jürg E. Frey
- Department of Method Development and Analytics, Agroscope, Wädenswil, Switzerland
| | - Ueli Grossniklaus
- Department of Plant and Microbial Biology and Zurich-Basel Plant Science Centre, University of Zurich, Zurich, Switzerland
| | - Philipp M. Schlüter
- Department of Plant Evolutionary Biology, Institute of Biology, University of Hohenheim, Stuttgart, Germany
- Department of Systematic and Evolutionary Botany and Zurich-Basel Plant Science Centre, University of Zurich, Zurich, Switzerland
- Correspondence: Philipp M. Schlüter
|
46
|
Zeng C, Takeda A, Sekine K, Osato N, Fukunaga T, Hamada M. Bioinformatics Approaches for Determining the Functional Impact of Repetitive Elements on Non-coding RNAs. Methods Mol Biol 2022; 2509:315-340. [PMID: 35796972 DOI: 10.1007/978-1-0716-2380-0_19] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 06/15/2023]
Abstract
With a large number of annotated non-coding RNAs (ncRNAs), repetitive sequences are found to constitute functional components (termed as repetitive elements) in ncRNAs that perform specific biological functions. Bioinformatics analysis is a powerful tool for improving our understanding of the role of repetitive elements in ncRNAs. This chapter summarizes recent findings that reveal the role of repetitive elements in ncRNAs. Furthermore, relevant bioinformatics approaches are systematically reviewed, which promises to provide valuable resources for studying the functional impact of repetitive elements on ncRNAs.
Affiliation(s)
- Chao Zeng
- Faculty of Science and Engineering, Waseda University, Tokyo, Japan.
- AIST-Waseda University Computational Bio Big-Data Open Innovation Laboratory (CBBD-OIL), Tokyo, Japan.
| | - Atsushi Takeda
- Faculty of Science and Engineering, Waseda University, Tokyo, Japan
| | - Kotaro Sekine
- Faculty of Science and Engineering, Waseda University, Tokyo, Japan
| | - Naoki Osato
- Faculty of Science and Engineering, Waseda University, Tokyo, Japan
| | - Tsukasa Fukunaga
- Waseda Institute for Advanced Study, Waseda University, Tokyo, Japan
| | - Michiaki Hamada
- Faculty of Science and Engineering, Waseda University, Tokyo, Japan.
- AIST-Waseda University Computational Bio Big-Data Open Innovation Laboratory (CBBD-OIL), Tokyo, Japan.
| |
|
47
|
Vargas-Chavez C, Longo Pendy NM, Nsango SE, Aguilera L, Ayala D, González J. Transposable element variants and their potential adaptive impact in urban populations of the malaria vector Anopheles coluzzii. Genome Res 2021; 32:189-202. [PMID: 34965939 PMCID: PMC8744685 DOI: 10.1101/gr.275761.121] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Received: 05/12/2021] [Accepted: 11/24/2021] [Indexed: 11/28/2022]
Abstract
Anopheles coluzzii is one of the primary vectors of human malaria in sub-Saharan Africa. Recently, it has spread into the main cities of Central Africa threatening vector control programs. The adaptation of An. coluzzii to urban environments partly results from an increased tolerance to organic pollution and insecticides. Some of the molecular mechanisms for ecological adaptation are known, but the role of transposable elements (TEs) in the adaptive processes of this species has not been studied yet. As a first step toward assessing the role of TEs in rapid urban adaptation, we sequenced using long reads six An. coluzzii genomes from natural breeding sites in two major Central Africa cities. We de novo annotated TEs in these genomes and in an additional high-quality An. coluzzii genome, and we identified 64 new TE families. TEs were nonrandomly distributed throughout the genome with significant differences in the number of insertions of several superfamilies across the studied genomes. We identified seven putatively active families with insertions near genes with functions related to vectorial capacity, and several TEs that may provide promoter and transcription factor binding sites to insecticide resistance and immune-related genes. Overall, the analysis of multiple high-quality genomes allowed us to generate the most comprehensive TE annotation in this species to date and identify several TE insertions that could potentially impact both genome architecture and the regulation of functionally relevant genes. These results provide a basis for future studies of the impact of TEs on the biology of An. coluzzii.
Affiliation(s)
- Carlos Vargas-Chavez
- Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), 08003 Barcelona, Spain
| | - Neil Michel Longo Pendy
- Centre Interdisciplinaire de Recherches Médicales de Franceville (CIRMF), BP 769, Franceville, Gabon; École Doctorale Régional (EDR) en Infectiologie Tropicale d'Afrique Centrale, BP 876, Franceville, Gabon
| | - Sandrine E Nsango
- Faculté de Médecine et des Sciences Pharmaceutiques, Université de Douala, BP 2701, Douala, Cameroun
| | - Laura Aguilera
- Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), 08003 Barcelona, Spain
| | - Diego Ayala
- Centre Interdisciplinaire de Recherches Médicales de Franceville (CIRMF), BP 769, Franceville, Gabon; Maladies Infectieuses et Vecteurs: Ecologie, Génétique, Evolution et Contrôle (MIVEGEC), Université Montpellier, CNRS, IRD, 64501 Montpellier, France
| | - Josefa González
- Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), 08003 Barcelona, Spain
| |
|
48
|
Delorme Q, Costa R, Mansour Y, Fiston-Lavier AS, Chateau A. Involving repetitive regions in scaffolding improvement. J Bioinform Comput Biol 2021; 19:2140016. [PMID: 34923926 DOI: 10.1142/s0219720021400163] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Indexed: 01/08/2023]
Abstract
In this paper, we investigate through a preliminary study the influence of repeat elements during the assembly process. We analyze the link between the presence and the nature of one type of repeat element, called transposable element (TE), and misassembly events in genome assemblies. We propose to improve assemblies by taking into account the presence of repeat elements, including TEs, during the scaffolding step. We analyze the results and relate the misassemblies to TEs before and after correction.
Affiliation(s)
- Quentin Delorme
- LIRMM, Univ Montpellier, CNRS, Montpellier, France; Laboratoire MIVEGEC (Université de Montpellier, CNRS 5290, IRD 229), Centre de Recherche en Écologie et Évolution de la Santé (CREES), Institut de Recherche pour le Développement (IRD), F-34394, Montpellier, France
| | - Rémy Costa
- LIRMM, Univ Montpellier, CNRS, Montpellier, France; IGH-UMR9002, Univ Montpellier, CNRS, Montpellier, France
| | - Yasmine Mansour
- LIRMM, Univ Montpellier, CNRS, Montpellier, France; ISEM, Univ Montpellier, CNRS, IRD, Montpellier, France
| | - Anna-Sophie Fiston-Lavier
- ISEM, Univ Montpellier, CNRS, IRD, Montpellier, France; Institut Universitaire de France (IUF), France
| | | |
|
49
|
Taylor D, Lowe R, Philippe C, Cheng KCL, Grant OA, Zabet NR, Cristofari G, Branco MR. Locus-specific chromatin profiling of evolutionarily young transposable elements. Nucleic Acids Res 2021; 50:e33. [PMID: 34908129 PMCID: PMC8989514 DOI: 10.1093/nar/gkab1232] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Received: 09/06/2021] [Revised: 11/15/2021] [Accepted: 12/02/2021] [Indexed: 01/13/2023] Open
Abstract
Despite a vast expansion in the availability of epigenomic data, our knowledge of the chromatin landscape at interspersed repeats remains highly limited by difficulties in mapping short-read sequencing data to these regions. In particular, little is known about the locus-specific regulation of evolutionarily young transposable elements (TEs), which have been implicated in genome stability, gene regulation and innate immunity in a variety of developmental and disease contexts. Here we propose an approach for generating locus-specific protein-DNA binding profiles at interspersed repeats, which leverages information on the spatial proximity between repetitive and non-repetitive genomic regions. We demonstrate that the combination of HiChIP and a newly developed mapping tool (PAtChER) yields accurate protein enrichment profiles at individual repetitive loci. Using this approach, we reveal previously unappreciated variation in the epigenetic profiles of young TE loci in mouse and human cells. Insights gained using our method will be invaluable for dissecting the molecular determinants of TE regulation and their impact on the genome.
Affiliation(s)
- Darren Taylor
- Blizard Institute, Barts and The London School of Medicine and Dentistry, QMUL, London E1 2AT, UK
| | - Robert Lowe
- Blizard Institute, Barts and The London School of Medicine and Dentistry, QMUL, London E1 2AT, UK
| | | | - Kevin C L Cheng
- Blizard Institute, Barts and The London School of Medicine and Dentistry, QMUL, London E1 2AT, UK
| | - Olivia A Grant
- Blizard Institute, Barts and The London School of Medicine and Dentistry, QMUL, London E1 2AT, UK; School of Life Sciences, University of Essex, Colchester, CO4 3SQ, UK
| | - Nicolae Radu Zabet
- Blizard Institute, Barts and The London School of Medicine and Dentistry, QMUL, London E1 2AT, UK
| | | | - Miguel R Branco
- Blizard Institute, Barts and The London School of Medicine and Dentistry, QMUL, London E1 2AT, UK
| |
|
50
|
Merkerova MD, Krejcik Z. Transposable elements and Piwi‑interacting RNAs in hemato‑oncology with a focus on myelodysplastic syndrome (Review). Int J Oncol 2021; 59:105. [PMID: 34779490 DOI: 10.3892/ijo.2021.5285] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/21/2021] [Accepted: 10/12/2021] [Indexed: 11/06/2022] Open
Abstract
Our current understanding of hematopoietic stem cell differentiation and the abnormalities that lead to leukemogenesis originates from the accumulation of knowledge regarding protein‑coding genes. However, the possible impact of transposable element (TE) mobilization and the expression of P‑element‑induced WImpy testis‑interacting RNAs (piRNAs) on leukemogenesis has been beyond the scope of scientific interest to date. The expression profiles of these molecules and their importance for human health have only been characterized recently due to the rapid progress of high‑throughput sequencing technology development. In the present review, current knowledge on the expression profile and function of TEs and piRNAs was summarized, with specific focus on their reported involvement in leukemogenesis and pathogenesis of myelodysplastic syndrome.
Affiliation(s)
| | - Zdenek Krejcik
- Institute of Hematology and Blood Transfusion, 128 20 Prague, Czech Republic
| |
|