1
Yang X, Wang H, Yu C. The Mechanism of APOBEC3B in Hepatitis B Virus Infection and HBV Related Hepatocellular Carcinoma Progression, Therapeutic and Prognostic Potential. Infect Drug Resist 2024; 17:4477-4486. [PMID: 39435460 PMCID: PMC11492903 DOI: 10.2147/idr.s484265] [Received: 06/25/2024] [Accepted: 10/11/2024] [Indexed: 10/23/2024] Open
Abstract
Hepatocellular carcinoma (HCC) is one of the most prevalent malignant tumors globally. Prominent risk factors include chronic hepatitis B (CHB) and chronic hepatitis C (CHC) virus infections, exposure to aflatoxin, alcohol abuse, diabetes, and obesity. The prevalence of hepatitis B virus (HBV) infection is substantial, and the significant proportion of asymptomatic carriers heightens the challenge in diagnosing and treating HCC, necessitating further and more comprehensive research. Apolipoprotein B mRNA editing catalytic polypeptide (APOBEC) family members are single-stranded DNA cytidine deaminases that can restrict viral replication. The APOBEC-related mutation pattern constitutes a primary characteristic of somatic mutations in various cancer types such as lung, breast, bladder, head and neck, cervix, and ovary. Symptoms in the early stages of HCC are often subtle and nonspecific, posing challenges in treatment and monitoring. This article primarily focuses on the established specific mechanism of action of the APOBEC3B (A3B) gene in the onset and progression of HBV-related HCC (HBV-HCC) through stimulating mutations in HBV, activating interleukin-6 (IL-6), and promoting reactive oxygen species (ROS) production, while also exploring the potential for A3B to serve as a therapeutic target and prognostic indicator in HBV-HCC.
Affiliation(s)
- Xiaochen Yang
- State Key Laboratory for Diagnosis and Treatment of Infectious Diseases, National Clinical Research Center for Infectious Diseases, Collaborative Innovation Center for Diagnosis and Treatment of Infectious Diseases, The First Affiliated Hospital, School of Medicine, Zhejiang University, Hangzhou, Zhejiang, People’s Republic of China
- Huanqiu Wang
- State Key Laboratory for Diagnosis and Treatment of Infectious Diseases, National Clinical Research Center for Infectious Diseases, Collaborative Innovation Center for Diagnosis and Treatment of Infectious Diseases, The First Affiliated Hospital, School of Medicine, Zhejiang University, Hangzhou, Zhejiang, People’s Republic of China
- Chengbo Yu
- State Key Laboratory for Diagnosis and Treatment of Infectious Diseases, National Clinical Research Center for Infectious Diseases, Collaborative Innovation Center for Diagnosis and Treatment of Infectious Diseases, The First Affiliated Hospital, School of Medicine, Zhejiang University, Hangzhou, Zhejiang, People’s Republic of China
2
de Oliveira FS, Azambuja M, Schemberger MO, Nascimento VD, Oliveira JIN, Wolf IR, Nogaroto V, Martins C, Vicari MR. Characterization of hAT DNA transposon superfamily in the genome of Neotropical fish Apareiodon sp. Mol Genet Genomics 2024; 299:96. [PMID: 39382723 DOI: 10.1007/s00438-024-02190-x] [Received: 12/06/2023] [Accepted: 09/28/2024] [Indexed: 10/10/2024]
Abstract
DNA transposons are diverse in fish genomes and have been described to generate genomic evolutionary novelties. hAT transposable element data are scarce in Teleostei genomes, making it challenging to conduct comparative genomic studies to understand their neutrality or function. This study aimed to perform a genomic and molecular characterization of hAT copies to assess the diversity of these elements and associate changes in these sequences to genomic and karyotypic novelties in Apareiodon sp. The data revealed that hAT TEs are highly abundant in the Apareiodon sp. genome, with few possibly autonomous copies. Highly conserved sequences with likely functional transposases were observed in nine hAT elements. A great diversity of hAT subgroups was observed, especially from Ac, Charlie, Blackjack, Tip100, hAT6, and hAT5, and a similar wave of hAT genomic invasion was identified in the genome for these six groups of hAT sequences. The data also revealed a distinct number of microsatellites within degenerated hAT copies. hAT sites were demonstrated to be dispersed in the Apareiodon sp. chromosomes and not involved in W chromosome-specific region differentiation. In conclusion, the genomic analysis revealed a great diversity of hAT elements, possible autonomous copies, and differentiation of degenerated transposable elements into tandem sequences.
Affiliation(s)
- Fernanda Souza de Oliveira
- Programa de Pós-Graduação em Genética, Universidade Federal do Paraná, Centro Politécnico, Avenida Coronel Francisco H. Dos Santos, 100, Curitiba, Paraná, 81531-990, Brazil
- Matheus Azambuja
- Programa de Pós-Graduação em Genética, Universidade Federal do Paraná, Centro Politécnico, Avenida Coronel Francisco H. Dos Santos, 100, Curitiba, Paraná, 81531-990, Brazil
- Michelle Orane Schemberger
- Programa de Pós-Graduação em Genética, Universidade Federal do Paraná, Centro Politécnico, Avenida Coronel Francisco H. Dos Santos, 100, Curitiba, Paraná, 81531-990, Brazil
- Viviane Demetrio Nascimento
- Programa de Pós-Graduação em Genética, Universidade Federal do Paraná, Centro Politécnico, Avenida Coronel Francisco H. Dos Santos, 100, Curitiba, Paraná, 81531-990, Brazil
- Jordana Inácio Nascimento Oliveira
- Departamento de Morfologia, Instituto de Biociências de Botucatu, Universidade Estadual Paulista, Distrito de Rubião Júnior, S/N, Botucatu, São Paulo, 18618-689, Brazil
- Ivan Rodrigo Wolf
- Departamento de Morfologia, Instituto de Biociências de Botucatu, Universidade Estadual Paulista, Distrito de Rubião Júnior, S/N, Botucatu, São Paulo, 18618-689, Brazil
- Viviane Nogaroto
- Departamento de Biologia Estrutural, Molecular e Genética, Universidade Estadual de Ponta Grossa, Av. Carlos Cavalcanti, 4748, Ponta Grossa, Paraná, 84030-900, Brazil
- Cesar Martins
- Departamento de Morfologia, Instituto de Biociências de Botucatu, Universidade Estadual Paulista, Distrito de Rubião Júnior, S/N, Botucatu, São Paulo, 18618-689, Brazil
- Marcelo Ricardo Vicari
- Programa de Pós-Graduação em Genética, Universidade Federal do Paraná, Centro Politécnico, Avenida Coronel Francisco H. Dos Santos, 100, Curitiba, Paraná, 81531-990, Brazil.
- Departamento de Biologia Estrutural, Molecular e Genética, Universidade Estadual de Ponta Grossa, Av. Carlos Cavalcanti, 4748, Ponta Grossa, Paraná, 84030-900, Brazil.
3
Ben Amara W, Djebbi S, Khemakhem MM. Evolutionary History of the DD41D Family of Tc1/Mariner Transposons in Two Mayetiola Species. Biochem Genet 2024:10.1007/s10528-024-10898-z. [PMID: 39117934 DOI: 10.1007/s10528-024-10898-z] [Received: 11/18/2023] [Accepted: 07/29/2024] [Indexed: 08/10/2024]
Abstract
Tc1/mariner elements are ubiquitous in eukaryotic genomes, including those of insects. They are diverse and divided into families and subfamilies. The DD34D family, including the mauritiana and irritans subfamilies, has already been identified in two closely related species of Cecidomyiids, M. destructor and M. hordei. In the current study, de novo and similarity-based methods allowed the identification, for the first time, of seven consensuses in M. destructor and two consensuses in M. hordei belonging to the DD41D family, whereas the in vitro method allowed the amplification of two and three elements in these two species, respectively. Most of the identified elements accumulated different mutations and long deletions spanning the N-terminal region of the transposase. Phylogenetic analyses showed that the DD41D elements clustered in two groups belonging to the rosa and Long-TIR subfamilies. The age estimation of the last transposition events of the identified Tc1/mariner elements in M. destructor showed different evolutionary histories. Indeed, irritans elements have oscillated between periods of silencing and reappearance, while rosa and mauritiana elements have shown regular activity with large recent bursts. The study of insertion sites showed that they are mostly intronic and that some recently transposed elements occurred in genes linked to putative DNA-binding domains and enzymes involved in metabolic chains. Thus, this study provides evidence of the existence of the DD41D family in two Mayetiola species and insight into their evolutionary history.
Affiliation(s)
- Wiem Ben Amara
- Laboratory of Biochemistry and Biotechnology (LR01ES05), Faculty of Sciences of Tunis, University of Tunis El Manar, 1068, Tunis, Tunisia
- Salma Djebbi
- Laboratory of Biochemistry and Biotechnology (LR01ES05), Faculty of Sciences of Tunis, University of Tunis El Manar, 1068, Tunis, Tunisia
- Maha Mezghani Khemakhem
- Laboratory of Biochemistry and Biotechnology (LR01ES05), Faculty of Sciences of Tunis, University of Tunis El Manar, 1068, Tunis, Tunisia.
4
Peona V, Martelossi J, Almojil D, Bocharkina J, Brännström I, Brown M, Cang A, Carrasco-Valenzuela T, DeVries J, Doellman M, Elsner D, Espíndola-Hernández P, Montoya GF, Gaspar B, Zagorski D, Hałakuc P, Ivanovska B, Laumer C, Lehmann R, Boštjančić LL, Mashoodh R, Mazzoleni S, Mouton A, Nilsson MA, Pei Y, Potente G, Provataris P, Pardos-Blas JR, Raut R, Sbaffi T, Schwarz F, Stapley J, Stevens L, Sultana N, Symonova R, Tahami MS, Urzì A, Yang H, Yusuf A, Pecoraro C, Suh A. Teaching transposon classification as a means to crowd source the curation of repeat annotation - a tardigrade perspective. Mob DNA 2024; 15:10. [PMID: 38711146 DOI: 10.1186/s13100-024-00319-8] [Received: 01/22/2024] [Accepted: 04/09/2024] [Indexed: 05/08/2024] Open
Abstract
BACKGROUND The advancement of sequencing technologies results in the rapid release of hundreds of new genome assemblies a year providing unprecedented resources for the study of genome evolution. Within this context, the significance of in-depth analyses of repetitive elements, transposable elements (TEs) in particular, is increasingly recognized in understanding genome evolution. Despite the plethora of available bioinformatic tools for identifying and annotating TEs, the phylogenetic distance of the target species from a curated and classified database of repetitive element sequences constrains any automated annotation effort. Moreover, manual curation of raw repeat libraries is deemed essential due to the frequent incompleteness of automatically generated consensus sequences. RESULTS Here, we present an example of a crowd-sourcing effort aimed at curating and annotating TE libraries of two non-model species built around a collaborative, peer-reviewed teaching process. Manual curation and classification are time-consuming processes that offer limited short-term academic rewards and are typically confined to a few research groups where methods are taught through hands-on experience. Crowd-sourcing efforts could therefore offer a significant opportunity to bridge the gap between learning the methods of curation effectively and empowering the scientific community with high-quality, reusable repeat libraries. CONCLUSIONS The collaborative manual curation of TEs from two tardigrade species, for which there were no TE libraries available, resulted in the successful characterization of hundreds of new and diverse TEs in a reasonable time frame. Our crowd-sourcing setting can be used as a teaching reference guide for similar projects: A hidden treasure awaits discovery within non-model organisms.
Affiliation(s)
- Valentina Peona
- Department of Organismal Biology - Systematic Biology, Evolutionary Biology Centre, Uppsala University, Uppsala, SE-752 36, Sweden.
- Swiss Ornithological Institute Vogelwarte, Sempach, CH-6204, Switzerland.
- Department of Bioinformatics and Genetics, Swedish Natural History Museum, Stockholm, Sweden.
- Jacopo Martelossi
- Department of Biological Geological and Environmental Science, University of Bologna, Via Selmi 3, Bologna, 40126, Italy.
- Dareen Almojil
- New York University Abu Dhabi, Saadiyat Island, United Arab Emirates
- Ioana Brännström
- Natural History Museum, Oslo University, Oslo, Norway
- Department of Ecology and Genetics, Uppsala University, Uppsala, Sweden
- Max Brown
- Anglia Ruskin University, East Rd, Cambridge, CB1 1PT, UK
- Tomàs Carrasco-Valenzuela
- Evolutionary Genetics Department, Leibniz Institute for Zoo and Wildlife Research, 10315, Berlin, Germany
- Berlin Center for Genomics in Biodiversity Research, 14195, Berlin, Germany
- Jon DeVries
- Reed College, Portland, OR, United States of America
- Meredith Doellman
- Department of Ecology and Evolution, The University of Chicago, Chicago, IL, 60637, USA
- Department of Biological Sciences, University of Notre Dame, Notre Dame, IN, 46556, USA
- Daniel Elsner
- Evolutionary Biology & Ecology, University of Freiburg, Freiburg, Germany
- Pamela Espíndola-Hernández
- Research Unit Comparative Microbiome Analysis (COMI), Helmholtz Zentrum München, Ingolstädter Landstraße 1, D-85764, Neuherberg, Germany
- Bence Gaspar
- Institute of Evolution and Ecology, University of Tuebingen, Tuebingen, Germany
- Danijela Zagorski
- Institute of Botany, Czech Academy of Sciences, Průhonice, Czech Republic
- Paweł Hałakuc
- Institute of Evolutionary Biology, Faculty of Biology, Biological and Chemical Research Centre, University of Warsaw, Warsaw, Poland
- Beti Ivanovska
- Institute of Genetics and Biotechnology, Hungarian University of Agriculture and Life Sciences, Budapest, Hungary
- Robert Lehmann
- Biological and Environmental Science and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal, Saudi Arabia
- Ljudevit Luka Boštjančić
- LOEWE Centre for Translational Biodiversity Genomics (LOEWE-TBG), Senckenberganlage 25, 60325, Frankfurt, Germany
- Rahia Mashoodh
- Department of Genetics, Environment & Evolution, Centre for Biodiversity & Environment Research, University College London, London, UK
- Sofia Mazzoleni
- Department of Ecology, Faculty of Science, Charles University, Prague, Czech Republic
- Alice Mouton
- INBIOS-Conservation Genetic Lab, University of Liege, Liege, Belgium
- Maria Anna Nilsson
- LOEWE Centre for Translational Biodiversity Genomics (LOEWE-TBG), Senckenberganlage 25, 60325, Frankfurt, Germany
- Yifan Pei
- Department of Organismal Biology - Systematic Biology, Evolutionary Biology Centre, Uppsala University, Uppsala, SE-752 36, Sweden
- Centre for Molecular Biodiversity Research, Leibniz Institute for the Analysis of Biodiversity Change, Adenauerallee 127, 53113, Bonn, Germany
- Giacomo Potente
- Department of Systematic and Evolutionary Botany, University of Zurich, Zurich, Switzerland
- Panagiotis Provataris
- German Cancer Research Center, NGS Core Facility, DKFZ-ZMBH Alliance, 69120, Heidelberg, Germany
- José Ramón Pardos-Blas
- Departamento de Biodiversidad y Biología Evolutiva, Museo Nacional de Ciencias Naturales (MNCN-CSIC), José Gutiérrez Abascal 2, Madrid, 28006, Spain
- Ravindra Raut
- Department of Biotechnology, National Institute of Technology Durgapur, Durgapur, India
- Tomasa Sbaffi
- Molecular Ecology Group (MEG), National Research Council of Italy - Water Research Institute (CNR-IRSA), Verbania, Italy
- Florian Schwarz
- Eurofins Genomics Europe Pharma and Diagnostics Products & Services Sales GmbH, Ebersberg, Germany
- Jessica Stapley
- Plant Pathology Group, Institute of Integrative Biology, ETH Zurich, Zurich, Switzerland
- Lewis Stevens
- Tree of Life, Wellcome Sanger Institute, Cambridge, CB10 1SA, UK
- Nusrat Sultana
- Department of Botany, Jagannath University, Dhaka, 1100, Bangladesh
- Radka Symonova
- Institute of Hydrobiology, Biology Centre of the Czech Academy of Sciences, České Budějovice, Czech Republic
- Mohadeseh S Tahami
- Department of Biological and Environmental Science, University of Jyväskylä, P.O. Box 35, Jyväskylä, 40014, Finland
- Alice Urzì
- Centogene GmbH, Am Strande 7, 18055, Rostock, Germany
- Heidi Yang
- Department of Ecology & Evolutionary Biology, University of California, Los Angeles, Los Angeles, CA, United States of America
- Abdullah Yusuf
- Zell- und Molekularbiologie der Pflanzen, Technische Universität Dresden, Dresden, Germany
- Alexander Suh
- Department of Organismal Biology - Systematic Biology, Evolutionary Biology Centre, Uppsala University, Uppsala, SE-752 36, Sweden.
- School of Biological Sciences, University of East Anglia, Norwich Research Park, Norwich, NR4 7TU, UK.
- Present address: Centre for Molecular Biodiversity Research, Leibniz Institute for the Analysis of Biodiversity Change, Adenauerallee 160, 53113, Bonn, Germany.
5
Rudenko V, Korotkov E. Study of Dispersed Repeats in the Cyanidioschyzon merolae Genome. Int J Mol Sci 2024; 25:4441. [PMID: 38674025 PMCID: PMC11050394 DOI: 10.3390/ijms25084441] [Received: 02/19/2024] [Revised: 04/08/2024] [Accepted: 04/15/2024] [Indexed: 04/28/2024] Open
Abstract
In this study, we applied the iterative procedure (IP) method to search for families of highly diverged dispersed repeats in the genome of Cyanidioschyzon merolae, which contains over 16 million bases. The algorithm included the construction of position weight matrices (PWMs) for repeat families and the identification of more dispersed repeats based on the PWMs using dynamic programming. The results showed that the C. merolae genome contained 20 repeat families comprising a total of 33,938 dispersed repeats, which is significantly more than has been previously found using other methods. The repeats varied in length from 108 to 600 bp (522.54 bp on average) and occupied more than 72% of the C. merolae genome, whereas previously identified repeats, including tandem repeats, have been shown to constitute only about 28%. The high genomic content of dispersed repeats and their location in the coding regions suggest a significant role in the regulation of the functional activity of the genome.
Affiliation(s)
- Valentina Rudenko
- Institute of Bioengineering, Research Center of Biotechnology of the Russian Academy of Sciences, Moscow 119071, Russia
6
Baril T, Galbraith J, Hayward A. Earl Grey: A Fully Automated User-Friendly Transposable Element Annotation and Analysis Pipeline. Mol Biol Evol 2024; 41:msae068. [PMID: 38577785 PMCID: PMC11003543 DOI: 10.1093/molbev/msae068] [Received: 11/24/2023] [Revised: 02/20/2024] [Accepted: 03/22/2024] [Indexed: 04/06/2024] Open
Abstract
Transposable elements (TEs) are major components of eukaryotic genomes and are implicated in a range of evolutionary processes. Yet, TE annotation and characterization remain challenging, particularly for nonspecialists, since existing pipelines are typically complicated to install, run, and extract data from. Current methods of automated TE annotation are also subject to issues that reduce overall quality, particularly (i) fragmented and overlapping TE annotations, leading to erroneous estimates of TE count and coverage, and (ii) repeat models represented by short sections of total TE length, with poor capture of 5' and 3' ends. To address these issues, we present Earl Grey, a fully automated TE annotation pipeline designed for user-friendly curation and annotation of TEs in eukaryotic genome assemblies. Using nine simulated genomes and an annotation of Drosophila melanogaster, we show that Earl Grey outperforms current widely used TE annotation methodologies in ameliorating the issues mentioned above while scoring highly in benchmarking for TE annotation and classification and being robust across genomic contexts. Earl Grey provides a comprehensive and fully automated TE annotation toolkit that provides researchers with paper-ready summary figures and outputs in standard formats compatible with other bioinformatics tools. Earl Grey has a modular format, with great scope for the inclusion of additional modules focused on further quality control and tailored analyses in future releases.
Affiliation(s)
- Tobias Baril
- Centre for Ecology and Conservation, University of Exeter, Penryn Campus, Cornwall TR10 9FE, UK
- Laboratory of Evolutionary Genetics, Institute of Biology, University of Neuchâtel, 2000 Neuchâtel, Switzerland
- James Galbraith
- Centre for Ecology and Conservation, University of Exeter, Penryn Campus, Cornwall TR10 9FE, UK
- Institute of Evolutionary Biology, University of Edinburgh, Edinburgh EH9 3FL, UK
- Alex Hayward
- Centre for Ecology and Conservation, University of Exeter, Penryn Campus, Cornwall TR10 9FE, UK
7
Hou L, Liu W, Zhang H, Li R, Liu M, Shi H, Wu L. Divergent composition and transposon-silencing activity of small RNAs in mammalian oocytes. Genome Biol 2024; 25:80. [PMID: 38532500 PMCID: PMC10964541 DOI: 10.1186/s13059-024-03214-w] [Received: 09/07/2022] [Accepted: 03/11/2024] [Indexed: 03/28/2024] Open
Abstract
BACKGROUND Small RNAs are essential for germ cell development and fertilization. However, fundamental questions remain, such as the level of conservation in small RNA composition between species and whether small RNAs control transposable elements in mammalian oocytes. RESULTS Here, we use high-throughput sequencing to profile small RNAs and poly(A)-bearing long RNAs in oocytes of 12 representative vertebrate species (including 11 mammals). The results show that miRNAs are generally expressed in the oocytes of each representative species (although at low levels), whereas endo-siRNAs are specific to mice. Notably, piRNAs are predominant in oocytes of all species (except mice) and vary widely in length. We find PIWIL3-associated piRNAs are widespread in mammals and generally lack 3'-2'-O-methylation. Additionally, sequence identity is low between homologous piRNAs in different species, even among those present in syntenic piRNA clusters. Despite the species-specific divergence, piRNAs retain the capacity to silence younger TE subfamilies in oocytes. CONCLUSIONS Collectively, our findings illustrate a high level of diversity in the small RNA populations of mammalian oocytes. Furthermore, we identify sequence features related to conserved roles of small RNAs in silencing TEs, providing a large-scale reference for future in-depth study of small RNA functions in oocytes.
Affiliation(s)
- Li Hou
- Key Laboratory of RNA Science and Engineering, Shanghai Institute of Biochemistry and Cell Biology, Center for Excellence in Molecular Cell Science Chinese Academy of Sciences, University of Chinese Academy of Sciences, Shanghai, 200031, China
- Wei Liu
- Key Laboratory of RNA Science and Engineering, Shanghai Institute of Biochemistry and Cell Biology, Center for Excellence in Molecular Cell Science Chinese Academy of Sciences, University of Chinese Academy of Sciences, Shanghai, 200031, China
- Hongdao Zhang
- Key Laboratory of RNA Science and Engineering, Shanghai Institute of Biochemistry and Cell Biology, Center for Excellence in Molecular Cell Science Chinese Academy of Sciences, University of Chinese Academy of Sciences, Shanghai, 200031, China
- Ronghong Li
- Key Laboratory of RNA Science and Engineering, Shanghai Institute of Biochemistry and Cell Biology, Center for Excellence in Molecular Cell Science Chinese Academy of Sciences, University of Chinese Academy of Sciences, Shanghai, 200031, China
- Miao Liu
- Shanghai-MOST Key Laboratory of Health and Disease Genomics, NHC Key Lab of Reproduction Regulation, Shanghai Institute for Biomedical and Pharmaceutical Technologies, Shanghai, 200032, China
- Huijuan Shi
- Shanghai-MOST Key Laboratory of Health and Disease Genomics, NHC Key Lab of Reproduction Regulation, Shanghai Institute for Biomedical and Pharmaceutical Technologies, Shanghai, 200032, China
- Ligang Wu
- Key Laboratory of RNA Science and Engineering, Shanghai Institute of Biochemistry and Cell Biology, Center for Excellence in Molecular Cell Science Chinese Academy of Sciences, University of Chinese Academy of Sciences, Shanghai, 200031, China.
8
Liu X, Zhao L, Majid M, Huang Y. Orthoptera-TElib: a library of Orthoptera transposable elements for TE annotation. Mob DNA 2024; 15:5. [PMID: 38486291 PMCID: PMC10941475 DOI: 10.1186/s13100-024-00316-x] [Received: 10/08/2023] [Accepted: 03/08/2024] [Indexed: 03/17/2024] Open
Abstract
Transposable elements (TEs) are a major component of eukaryotic genomes and are present in almost all eukaryotic organisms. TEs are highly dynamic between and within species, which significantly affects the general applicability of the TE databases. Orthoptera is the only known group in the class Insecta with a significantly enlarged genome (0.93-21.48 Gb). When analyzing these large genomes using the existing public TE databases, the efficiency of TE annotation is not satisfactory. To address this limitation, it is imperative to continually update the available TE resource libraries, and an Orthoptera-specific library is needed as more insect genomes become publicly available. Here, we used the complete genome data of 12 Orthoptera species to de novo annotate TEs, then manually re-annotated the unclassified TEs to construct a non-redundant Orthoptera-specific TE library: Orthoptera-TElib. Orthoptera-TElib contains 24,021 TE entries including the re-annotated results of 13,964 unknown TEs. The naming of TE entries in Orthoptera-TElib adopts the same naming as RepeatMasker and Dfam and is encoded as the three-level form of "level1/level2-level3". Orthoptera-TElib can be directly used as an input reference database and is compatible with mainstream repetitive sequence analysis software such as RepeatMasker and dnaPipeTE. When analyzing TEs of Orthoptera species, Orthoptera-TElib performs better TE annotation as compared to Dfam and Repbase regardless of using low-coverage sequencing or genome assembly data. The most improved TE annotation result is for Angaracris rhodopa, where annotated TE content increased from 7.89% of the genome to 53.28%. Finally, Orthoptera-TElib is stored in SQLite3 for the convenience of data updates and user access.
Affiliation(s)
- Xuanzeng Liu
- College of Life Sciences, Shaanxi Normal University, Xi'an, China
- Lina Zhao
- College of Life Sciences, Shaanxi Normal University, Xi'an, China
- Muhammad Majid
- College of Life Sciences, Shaanxi Normal University, Xi'an, China
- Yuan Huang
- College of Life Sciences, Shaanxi Normal University, Xi'an, China.
9
Gupta S, Petrov V, Garg V, Mueller-Roeber B, Fernie AR, Nikoloski Z, Gechev T. The genome of Haberlea rhodopensis provides insights into the mechanisms for tolerance to multiple extreme environments. Cell Mol Life Sci 2024; 81:117. [PMID: 38443747 PMCID: PMC10914886 DOI: 10.1007/s00018-024-05140-3] [Received: 08/12/2023] [Revised: 01/22/2024] [Accepted: 01/23/2024] [Indexed: 03/07/2024]
Abstract
Haberlea rhodopensis, a resurrection species, is the only plant known to be able to survive multiple extreme environments, including desiccation, freezing temperatures, and long-term darkness. However, the molecular mechanisms underlying tolerance to these stresses are poorly studied. Here, we present a high-quality genome of Haberlea and find that ~23.55% of the 44,306 genes are orphan. Comparative genomics analysis identified 89 significantly expanded gene families, of which 25 were specific to Haberlea. Moreover, we demonstrated that Haberlea preserves its resurrection potential even in prolonged complete darkness. Transcriptome profiling of plants subjected to desiccation, darkness, and low temperatures revealed both common and specific footprints of these stresses and their combinations. For example, PROTEIN PHOSPHATASE 2C (PP2C) genes were substantially induced in all stress combinations, while PHYTOCHROME INTERACTING FACTOR 1 (PIF1) and GROWTH RESPONSE FACTOR 4 (GRF4) were induced only in darkness. Additionally, 733 genes with unknown functions and three genes encoding transcription factors specific to Haberlea were specifically induced/repressed upon combination of stresses, rendering them attractive targets for future functional studies. The study provides a comprehensive understanding of the genomic architecture and reports details of the mechanisms of multi-stress tolerance of this resurrection species that will aid in developing strategies that allow crops to survive extreme and multiple abiotic stresses.
Affiliation(s)
- Saurabh Gupta
- Intercellular Macromolecular Transport, Max Planck Institute of Molecular Plant Physiology, Am Mühlenberg 1, 14476, Potsdam-Golm, Germany.
- Curtin Medical School, Curtin Health Innovation Research Institute (CHIRI), Curtin University, Perth, WA, 6102, Australia.
- Veselin Petrov
- Center of Plant Systems Biology and Biotechnology, 14 Knyaz Boris I Pokrastitel Str., 4023, Plovdiv, Bulgaria
- Department of Plant Physiology, Biochemistry and Genetics, Agricultural University Plovdiv, 12 Mendeleev Str., 4000, Plovdiv, Bulgaria
- Vanika Garg
- Molecular Biology, Institute of Biochemistry and Biology, University of Potsdam, Karl-Liebknecht-Str. 24-25, 14476, Potsdam-Golm, Germany
- State Agricultural Biotechnology Centre, Centre for Crop and Food Innovation, Food Futures Institute, Murdoch University, Murdoch, WA, 6150, Australia
- Bernd Mueller-Roeber
- Center of Plant Systems Biology and Biotechnology, 14 Knyaz Boris I Pokrastitel Str., 4023, Plovdiv, Bulgaria
- Molecular Biology, Institute of Biochemistry and Biology, University of Potsdam, Karl-Liebknecht-Str. 24-25, 14476, Potsdam-Golm, Germany
- Plant Signalling, Max Planck Institute of Molecular Plant Physiology, Am Mühlenberg 1, 14476, Potsdam-Golm, Germany
- Alisdair R Fernie
- Center of Plant Systems Biology and Biotechnology, 14 Knyaz Boris I Pokrastitel Str., 4023, Plovdiv, Bulgaria
- Central Metabolism, Max Planck Institute of Molecular Plant Physiology, Am Mühlenberg 1, 14476, Potsdam-Golm, Germany
- Zoran Nikoloski
- Center of Plant Systems Biology and Biotechnology, 14 Knyaz Boris I Pokrastitel Str., 4023, Plovdiv, Bulgaria
- Bioinformatics, Institute of Biochemistry and Biology, University of Potsdam, Karl-Liebknecht-Str. 24-25, 14476, Potsdam-Golm, Germany
- Systems Biology and Mathematical Modelling, Max Planck Institute of Molecular Plant Physiology, Am Mühlenberg 1, 14476, Potsdam-Golm, Germany
| | - Tsanko Gechev
- Center of Plant Systems Biology and Biotechnology, 14 Knyaz Boris I Pokrastitel Str., 4023, Plovdiv, Bulgaria.
- Department of Plant Physiology and Molecular Biology, Plovdiv University, 24 Tsar Assen Str., 4000, Plovdiv, Bulgaria.
| |
Collapse
|
10
|
Mandal AK. Recent insights into crosstalk between genetic parasites and their host genome. Brief Funct Genomics 2024; 23:15-23. [PMID: 36307128 PMCID: PMC10799329 DOI: 10.1093/bfgp/elac032] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/04/2022] [Revised: 09/14/2022] [Accepted: 09/21/2022] [Indexed: 01/21/2024] Open
Abstract
The bulk of higher-order organismal genomes consists of transposable element (TE) copies, i.e. genetic parasites. The host-parasite relation is multi-faceted, varying across genomic regions (genic versus intergenic), life-cycle stages, tissue types and, of course, health versus pathological states. The reach of functional genomics in investigating genotype-to-phenotype relations, though, has been limited when TEs are involved. The aim of this review is to highlight recent progress made in understanding how biochemical activity of TE origin interacts with the central dogma stages of the host genome. Such interaction can also modulate the immune context, which could have important repercussions in disease states where immunity has a role to play. Thus, the review aims to instigate ideas and action points around identifying evolutionary adaptations that the host genome and the genetic parasite have evolved and why they could be relevant.
Affiliation(s)
- Amit K Mandal
  - Corresponding author: A.K. Mandal, Nuffield Department of Surgical Sciences (NDS), University of Oxford, Old Road Campus Research building (ORCRB), Oxford OX3 7DQ, UK. Tel: +44 (0)1865 617123; Fax: +44 (0)1865 768876; E-mail:
11
Westerberg I, Ament-Velásquez SL, Vogan AA, Johannesson H. Evolutionary dynamics of the LTR-retrotransposon crapaud in the Podospora anserina species complex and the interaction with repeat-induced point mutations. Mob DNA 2024; 15:1. [PMID: 38218923 PMCID: PMC10787394 DOI: 10.1186/s13100-023-00311-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/31/2023] [Accepted: 12/22/2023] [Indexed: 01/15/2024] Open
Abstract
BACKGROUND The genome of the filamentous ascomycete Podospora anserina shows a relatively high abundance of retrotransposons compared to other interspersed repeats. The LTR-retrotransposon family crapaud is particularly abundant in the genome, and consists of multiple diverged sequence variations specifically localized in the 5' half of both long terminal repeats (LTRs). P. anserina is part of a recently diverged species-complex, which makes the system ideal to classify the crapaud family based on the observed LTR variation and to study the evolutionary dynamics, such as the diversification and bursts of the elements over recent evolutionary time. RESULTS We developed a sequence similarity network approach to classify the crapaud repeats of seven genomes representing the P. anserina species complex into 14 subfamilies. This method does not utilize a consensus sequence, but instead it connects any copies that share enough sequence similarity over a set sequence coverage. Based on phylogenetic analyses, we found that the crapaud repeats likely diversified in the ancestor of the complex and have had activity at different time points for different subfamilies. Furthermore, while we hypothesized that the evolution into multiple subfamilies could have been a direct effect of escaping the genome defense system of repeat induced point mutations, we found this not to be the case. CONCLUSIONS Our study contributes to the development of methods to classify transposable elements in fungi, and also highlights the intricate patterns of retrotransposon evolution over short timescales and under high mutational load caused by nucleotide-altering genome defense.
Affiliation(s)
- Ivar Westerberg
  - Department of Ecology, environmental and Plant Sciences, Stockholm University, Stockholm, 106 91, Sweden
- S Lorena Ament-Velásquez
  - Division of Population Genetics, Department of Zoology, Stockholm University, Stockholm, 106 91, Sweden
- Aaron A Vogan
  - Systematic Biology, Department of Organismal Biology, Uppsala University, Norbyvägen 18D, Uppsala, 752 36, Sweden
- Hanna Johannesson
  - Department of Ecology, environmental and Plant Sciences, Stockholm University, Stockholm, 106 91, Sweden
  - The Royal Swedish Academy of Sciences, Stockholm, 114 18, Sweden
12
Xiao Y, Xi Z, Wang F, Wang J. Genomic asymmetric epigenetic modification of transposable elements is involved in gene expression regulation of allopolyploid Brassica napus. The Plant Journal: for Cell and Molecular Biology 2024; 117:226-241. [PMID: 37797206 DOI: 10.1111/tpj.16491] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/01/2023] [Revised: 09/20/2023] [Accepted: 09/25/2023] [Indexed: 10/07/2023]
Abstract
Polyploids are common and have a wide geographical distribution and environmental adaptability. Allopolyploidy may lead to the activation of transposable elements (TE). However, the mechanism of epigenetic modification of TEs in the establishment and evolution of allopolyploids remains to be explored. We focused on the TEs of model allopolyploid Brassica napus (AnAnCnCn), exploring the TE characteristics of the genome, epigenetic modifications of TEs during allopolyploidization, and regulation of gene expression by TE methylation. In B. napus, approximately 50% of the genome was composed of TEs. TEs increased with proximity to genes, especially DNA transposons. TE methylation levels were negatively correlated with gene expression, and changes in TE methylation levels were able to regulate the expression of neighboring genes related to responses to light intensity and stress, which promoted powerful adaptation of allopolyploids to new environments. TEs can be synergistically regulated by RNA-directed DNA methylation pathways and histone modifications. The epigenetic modification levels of TEs tended to be similar to those of the diploid parents during the genome evolution of B. napus. The TEs of the An subgenome were more likely to be modified, and the imbalance in TE number and epigenetic modification level in the An and Cn subgenomes may lead to the establishment of subgenome dominance. Our study analyzed the characteristics of TE location, DNA methylation, siRNA, and histone modification in B. napus and highlighted the importance of TE epigenetic modifications during the allopolyploidy process, providing support for revealing the mechanism of allopolyploid formation and evolution.
Affiliation(s)
- Yafang Xiao
  - State Key Laboratory of Hybrid Rice, College of Life Sciences, Wuhan University, Wuhan, 430072, China
- Zengde Xi
  - State Key Laboratory of Hybrid Rice, College of Life Sciences, Wuhan University, Wuhan, 430072, China
- Fei Wang
  - State Key Laboratory of Hybrid Rice, College of Life Sciences, Wuhan University, Wuhan, 430072, China
- Jianbo Wang
  - State Key Laboratory of Hybrid Rice, College of Life Sciences, Wuhan University, Wuhan, 430072, China
13
Koonin EV, Krupovic M. New faces of prokaryotic mobile genetic elements: guide RNAs link transposition with host defense mechanisms. Current Opinion in Systems Biology 2023; 36:100473. [PMID: 37779558 PMCID: PMC10538440 DOI: 10.1016/j.coisb.2023.100473] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 10/03/2023]
Abstract
Most life forms harbor multiple, diverse mobile genetic elements (MGE) that widely differ in their rates and mechanisms of mobility. Recent findings on two classes of MGE in prokaryotes revealed a novel mechanism, RNA-guided transposition, where a transposon-encoded guide RNA directs the transposase to a unique site in the host genome. Tn7-like transposons, on multiple occasions, recruited CRISPR systems that lost the capacity to cleave target DNA and instead mediate RNA-guided transposition via CRISPR RNA. Conversely, the abundant transposon-associated, RNA-guided nucleases IscB and TnpB that appear to promote proliferation of IS200/IS605 and IS607 transposons were the likely evolutionary ancestors of type II and type V CRISPR systems, respectively. Thus, RNA-guided target recognition is a major biological phenomenon that connects MGE with host defense mechanisms. More RNA-guided defensive and MGE-associated functionalities are likely to be discovered.
Affiliation(s)
- Eugene V Koonin
  - National Center for Biotechnology Information, National Library of Medicine, Bethesda, MD, USA
- Mart Krupovic
  - Institut Pasteur, Université Paris Cité, CNRS UMR6047, Archaeal Virology Unit, 25 rue du Dr Roux, 75015 Paris
14
Lu Y, Wang Z, Wang Y, Chen Y, Tang D, Yu H. Genomic Comparison of Two Species of Samsoniella with Other Genera in the Family Cordycipitaceae. J Fungi (Basel) 2023; 9:1146. [PMID: 38132747 PMCID: PMC10744563 DOI: 10.3390/jof9121146] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/21/2023] [Revised: 11/12/2023] [Accepted: 11/25/2023] [Indexed: 12/23/2023] Open
Abstract
Whole genomes of Samsoniella hepiali ICMM 82-2 and S. yunnanensis YFCC 1527 were sequenced and annotated, as well as compared with whole genome sequences of other species in the family Cordycipitaceae. S. hepiali ICMM 82-2, S. hepiali FENG and S. yunnanensis YFCC 1527 had 54, 57 and 58 putative secondary metabolite biosynthetic gene clusters, respectively. S. hepiali had one unique domain and S. yunnanensis YFCC 1527 six. Both S. hepiali and S. yunnanensis YFCC 1527 had curvupallide-B, fumosorinone and fujikurin putative biosynthetic gene clusters. C. javanica had biosynthetic gene clusters for fumonisin. The 14 genomes had common domains, namely A-P-C-P-C and KS-AT-DH-ER-KR-ACP. The A-P-C-P-C domain may be involved in the biosynthesis of dimethylcoprogen. The maximum likelihood and the Bayesian inference trees of KS-AT-DH-ER-KR-ACP were highly consistent with the multigene phylogenetic tree for the 13 species of Cordycipitaceae. This study facilitates the discovery of novel biologically active SMs from Cordycipitaceae using heterologous expression and gene knockdown methods.
Affiliation(s)
- Yingling Lu
  - Yunnan Herbal Laboratory, College of Ecology and Environmental Sciences, Yunnan University, Kunming 650504, China; (Y.L.); (Z.W.); (Y.C.); (D.T.)
  - The International Joint Research Center for Sustainable Utilization of Cordyceps Bioresources in China and Southeast Asia, Yunnan University, Kunming 650091, China
  - Laboratory of Forest Plant Cultivation and Utilization, The Key Laboratory of Rare and Endangered Forest Plants of State Forestry Administration, Yunnan Academy of Forestry and Grassland, Kunming 650201, China
- Zhiqin Wang
  - Yunnan Herbal Laboratory, College of Ecology and Environmental Sciences, Yunnan University, Kunming 650504, China; (Y.L.); (Z.W.); (Y.C.); (D.T.)
  - The International Joint Research Center for Sustainable Utilization of Cordyceps Bioresources in China and Southeast Asia, Yunnan University, Kunming 650091, China
- Yi Wang
  - Laboratory of Forest Plant Cultivation and Utilization, The Key Laboratory of Rare and Endangered Forest Plants of State Forestry Administration, Yunnan Academy of Forestry and Grassland, Kunming 650201, China
- Yue Chen
  - Yunnan Herbal Laboratory, College of Ecology and Environmental Sciences, Yunnan University, Kunming 650504, China; (Y.L.); (Z.W.); (Y.C.); (D.T.)
  - The International Joint Research Center for Sustainable Utilization of Cordyceps Bioresources in China and Southeast Asia, Yunnan University, Kunming 650091, China
- Dexiang Tang
  - Yunnan Herbal Laboratory, College of Ecology and Environmental Sciences, Yunnan University, Kunming 650504, China; (Y.L.); (Z.W.); (Y.C.); (D.T.)
  - The International Joint Research Center for Sustainable Utilization of Cordyceps Bioresources in China and Southeast Asia, Yunnan University, Kunming 650091, China
- Hong Yu
  - Yunnan Herbal Laboratory, College of Ecology and Environmental Sciences, Yunnan University, Kunming 650504, China; (Y.L.); (Z.W.); (Y.C.); (D.T.)
  - The International Joint Research Center for Sustainable Utilization of Cordyceps Bioresources in China and Southeast Asia, Yunnan University, Kunming 650091, China
15
Gao D. Introduction of Plant Transposon Annotation for Beginners. Biology 2023; 12:1468. [PMID: 38132293 PMCID: PMC10741241 DOI: 10.3390/biology12121468] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/07/2023] [Revised: 11/21/2023] [Accepted: 11/23/2023] [Indexed: 12/23/2023]
Abstract
Transposons are mobile DNA sequences that make up large fractions of many plant genomes. They provide exclusive resources for tracking gene and genome evolution and for developing molecular tools for basic and applied research. Despite extensive efforts, it is still challenging to accurately annotate transposons, especially for beginners, as transposon prediction requires expertise in both transposon biology and bioinformatics. Moreover, the complexity of plant genomes and the dynamic evolution of transposons also make genome-wide transposon discovery difficult. This review summarizes the three major strategies for transposon detection (repeat-based, structure-based, and homology-based annotation) and introduces the transposon superfamilies identified in plants thus far, along with some related bioinformatics resources for detecting plant transposons. Furthermore, it describes transposon classification and explains why the terms 'autonomous' and 'non-autonomous' cannot be used to classify the superfamilies of transposons. Lastly, this review discusses how to identify misannotated transposons and improve the quality of the transposon database. This review provides helpful information about plant transposons and a beginner's guide to annotating these repetitive sequences.
Affiliation(s)
- Dongying Gao
  - Small Grains and Potato Germplasm Research Unit, USDA-ARS, Aberdeen, ID 83210, USA
16
Gao D, Fox-Fogle E. Identification of transcriptionally active transposons in Barley. BMC Genom Data 2023; 24:64. [PMID: 37925398 PMCID: PMC10625261 DOI: 10.1186/s12863-023-01170-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/07/2023] [Accepted: 10/30/2023] [Indexed: 11/06/2023] Open
Abstract
BACKGROUND The genomes of many major crops including barley (Hordeum vulgare) consist of numerous transposons. Despite their important roles in crop genome evolution and morphological variations, most of these elements are silent or truncated and unable to be mobile in host genomes. Thus far, only a very limited number of active transposons have been identified in plants. RESULTS We analyzed the barley full-length cDNA (FLcDNA) sequences and detected 71 unique FLcDNAs exhibiting significant sequence similarity to the extant transposase proteins. These FLcDNAs were then used to search against the genome of the malting barley cultivar 'Morex', and seven new intact transposons were identified. Sequence alignments indicated that six intact transposons contained the entire FLcDNAs whereas another one served as the 3' untranslated region (3' UTR) of a barley gene. Our reverse transcription-PCR (RT-PCR) experiment further confirmed the expression of these six transposons and revealed their differential expression. We conducted genome-wide transposon comparisons and detected polymorphisms of three transposon families between the genomes of 'Morex' and three other genotypes, including the wild barley (Hordeum spontaneum, B1K-04-12) and two cultivated barley varieties, 'Golden Promise' and 'Lasa Goumang'. Lastly, we screened the transcripts of all annotated barley genes and found that some transposons may serve as the coding regions (CDSs) or UTRs of barley genes. CONCLUSION We identified six newly expressed transposons in the barley genome and revealed the recent mobility of three transposon families. Our efforts provide a valuable resource for understanding the effects of transposons on barley genome evolution and for developing novel molecular tools for barley genetic improvement and other research.
Affiliation(s)
- Dongying Gao
  - Small Grains and Potato Germplasm Research Unit, USDA-ARS, Aberdeen, ID, 83210, USA
- Emma Fox-Fogle
  - Small Grains and Potato Germplasm Research Unit, USDA-ARS, Aberdeen, ID, 83210, USA
  - National Agricultural Statistical Service, USDA, Olympia, WA, 98501, USA
17
Shi S, Puzakov MV, Puzakova LV, Ulupova YN, Xiang K, Wang B, Gao B, Song C. Hiker, a new family of DNA transposons encoding transposases with DD35E motifs, displays a distinct phylogenetic relationship with most known DNA transposon families of IS630-Tc1-mariner (ITm). Mol Phylogenet Evol 2023; 188:107906. [PMID: 37586577 DOI: 10.1016/j.ympev.2023.107906] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/07/2023] [Revised: 08/13/2023] [Accepted: 08/13/2023] [Indexed: 08/18/2023]
Abstract
DNA transposons play a crucial role in determining the size and structure of eukaryotic genomes. In this study, a new family of IS630-Tc1-mariner (ITm) DNA transposons, named Hiker (HK), was identified. HK is characterized by a DD35E catalytic domain and is distinct from all previously known families of the ITm group. Phylogenetic analyses showed that DD35E/Hiker forms a monophyletic clade with DD34E/Gambol, indicating that they may represent a separate superfamily of ITm. A total of 178 Hiker species were identified, mainly in Actinopterygii (170), with one in Chondrichthyes, six in Anura and one in Mollusca. Gambol (GM) transposons, on the other hand, are found in invertebrates, with 18 in Arthropoda and one in Platyhelminthes. Hiker transposons have a total length ranging from 2.14 to 3.67 kb and contain a single open reading frame that encodes a protein of approximately 370 amino acids (range 311-413 aa). They are flanked by short terminal inverted repeats (TIRs) of 16-30 base pairs and two-base-pair (TA) target-site duplications. In contrast, most transposons of the Gambol family have a total length of 1.35-5.96 kb, encode a transposase protein of approximately 350 amino acids (range 306-374 aa), and are flanked by TIRs that range from 32 to 1097 bp in length. Both Hiker and Gambol transposases have several conserved motifs, including helix-turn-helix (HTH) motifs and a DDE domain. Our study observed multiple amplification waves and repeated horizontal transfer (HT) events of HK transposons in vertebrate genomes, indicating their role in diversifying and shaping the genomes of Actinopterygii, Chondrichthyes, and Anura. Conversely, GM transposons showed few HT events. According to cell-based transposition assays, most HK transposons are likely inactive due to the truncated DNA binding domains of their transposases. We present an updated classification of the ITm group based on these findings, which will enhance the understanding of both the evolution of ITm transposons and that of their hosts.
Affiliation(s)
- Shasha Shi
  - College of Animal Science & Technology, Yangzhou University, Yangzhou, Jiangsu 225009, China
- Mikhail V Puzakov
  - A.O. Kovalevsky Institute of Biology of the Southern Seas of RAS, Lenninsky ave, 38 119991, Moscow, Russia
- Ludmila V Puzakova
  - A.O. Kovalevsky Institute of Biology of the Southern Seas of RAS, Lenninsky ave, 38 119991, Moscow, Russia
- Yulia N Ulupova
  - A.O. Kovalevsky Institute of Biology of the Southern Seas of RAS, Lenninsky ave, 38 119991, Moscow, Russia
- Kuilin Xiang
  - College of Animal Science & Technology, Yangzhou University, Yangzhou, Jiangsu 225009, China
- Binqing Wang
  - College of Animal Science & Technology, Yangzhou University, Yangzhou, Jiangsu 225009, China
- Bo Gao
  - College of Animal Science & Technology, Yangzhou University, Yangzhou, Jiangsu 225009, China
- Chengyi Song
  - College of Animal Science & Technology, Yangzhou University, Yangzhou, Jiangsu 225009, China
18
Reinar WB, Tørresen OK, Nederbragt AJ, Matschiner M, Jentoft S, Jakobsen KS. Teleost genomic repeat landscapes in light of diversification rates and ecology. Mob DNA 2023; 14:14. [PMID: 37789366 PMCID: PMC10546739 DOI: 10.1186/s13100-023-00302-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/28/2023] [Accepted: 09/20/2023] [Indexed: 10/05/2023] Open
Abstract
Repetitive DNA makes up a considerable fraction of most eukaryotic genomes. In fish, transposable element (TE) activity has coincided with rapid species diversification. Here, we annotated the repetitive content in 100 genome assemblies, covering the major branches of the diverse lineage of teleost fish. We investigated whether TE content correlates with family-level net diversification rates and found support for a weak negative correlation. Further, we demonstrated that TE proportion correlates with genome size, but not with the proportion of short tandem repeats (STRs), which implies independent evolutionary paths. Marine and freshwater fish had large differences in STR content, with the most extreme propagation detected in the genomes of codfish species and Atlantic herring. Such a high density of STRs is likely to increase the mutational load, which we propose could be counterbalanced by high fecundity as seen in codfishes and herring.
Affiliation(s)
- Ole K Tørresen
  - Department of Biosciences, University of Oslo, Oslo, Norway
- Alexander J Nederbragt
  - Department of Biosciences, University of Oslo, Oslo, Norway
  - Department of Informatics, University of Oslo, Oslo, Norway
- Michael Matschiner
  - Department of Biosciences, University of Oslo, Oslo, Norway
  - University of Oslo, Natural History Museum, Oslo, Norway
- Sissel Jentoft
  - Department of Biosciences, University of Oslo, Oslo, Norway
19
Baril T, Pym A, Bass C, Hayward A. Transposon accumulation at xenobiotic gene family loci in aphids. Genome Res 2023; 33:1718-1733. [PMID: 37852781 PMCID: PMC10691553 DOI: 10.1101/gr.277820.123] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/20/2023] [Accepted: 08/29/2023] [Indexed: 10/20/2023]
Abstract
The evolution of resistance is a major challenge for the sustainable control of pests and pathogens. Thus, a deeper understanding of the evolutionary and genomic mechanisms underpinning resistance evolution is required to safeguard health and food production. Several studies have implicated transposable elements (TEs) in xenobiotic-resistance evolution in insects. However, analyses are generally restricted to one insect species and/or one or a few xenobiotic gene families (XGFs). We examine evidence for TE accumulation at XGFs by performing a comparative genomic analysis across 20 aphid genomes, considering major subsets of XGFs involved in metabolic resistance to insecticides: cytochrome P450s, glutathione S-transferases, esterases, UDP-glucuronosyltransferases, and ABC transporters. We find that TEs are significantly enriched at XGFs compared with other genes. XGFs show similar levels of TE enrichment to those of housekeeping genes. But unlike housekeeping genes, XGFs are not constitutively expressed in germline cells, supporting the selective enrichment of TEs at XGFs rather than enrichment owing to chromatin availability. Hotspots of extreme TE enrichment occur around certain XGFs. We find, in aphids of agricultural importance, particular enrichment of TEs around cytochrome P450 genes with known functions in the detoxification of synthetic insecticides. Our results provide evidence supporting a general role for TEs as a source of genomic variation at host XGFs and highlight the existence of considerable variability in TE content across XGFs and host species. These findings show the need for detailed functional verification analyses to clarify the significance of individual TE insertions and elucidate underlying mechanisms at TE-XGF hotspots.
Affiliation(s)
- Tobias Baril
  - Centre for Ecology and Conservation, University of Exeter, Penryn Campus, Cornwall TR10 9FE, United Kingdom
- Adam Pym
  - Centre for Ecology and Conservation, University of Exeter, Penryn Campus, Cornwall TR10 9FE, United Kingdom
- Chris Bass
  - Centre for Ecology and Conservation, University of Exeter, Penryn Campus, Cornwall TR10 9FE, United Kingdom
- Alex Hayward
  - Centre for Ecology and Conservation, University of Exeter, Penryn Campus, Cornwall TR10 9FE, United Kingdom
20
Liao X, Zhu W, Zhou J, Li H, Xu X, Zhang B, Gao X. Repetitive DNA sequence detection and its role in the human genome. Commun Biol 2023; 6:954. [PMID: 37726397 PMCID: PMC10509279 DOI: 10.1038/s42003-023-05322-y] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Received: 03/29/2023] [Accepted: 09/04/2023] [Indexed: 09/21/2023] Open
Abstract
Repetitive DNA sequences play critical roles in driving evolution, inducing variation, and regulating gene expression. In this review, we summarize the definition, arrangement, and structural characteristics of repeats. We also introduce the diverse biological functions of repeats and review existing methods for automatic repeat detection, classification, and masking. Finally, we analyze the type, structure, and regulation of repeats in the human genome and their role in the induction of complex diseases. We believe that this review will facilitate a comprehensive understanding of repeats and provide guidance for repeat annotation and in-depth exploration of their association with human diseases.
Affiliation(s)
- Xingyu Liao
  - Computational Bioscience Research Center (CBRC), Computer, Electrical and Mathematical Sciences and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal, 23955, Saudi Arabia
- Wufei Zhu
  - Department of Endocrinology, Yichang Central People's Hospital, The First College of Clinical Medical Science, China Three Gorges University, 443000, Yichang, P.R. China
- Juexiao Zhou
  - Computational Bioscience Research Center (CBRC), Computer, Electrical and Mathematical Sciences and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal, 23955, Saudi Arabia
- Haoyang Li
  - Computational Bioscience Research Center (CBRC), Computer, Electrical and Mathematical Sciences and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal, 23955, Saudi Arabia
- Xiaopeng Xu
  - Computational Bioscience Research Center (CBRC), Computer, Electrical and Mathematical Sciences and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal, 23955, Saudi Arabia
- Bin Zhang
  - Computational Bioscience Research Center (CBRC), Computer, Electrical and Mathematical Sciences and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal, 23955, Saudi Arabia
- Xin Gao
  - Computational Bioscience Research Center (CBRC), Computer, Electrical and Mathematical Sciences and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal, 23955, Saudi Arabia
21
Puzakov MV, Puzakova LV, Shi S, Cheresiz SV. maT and mosquito transposons in cnidarians: evolutionary history and intraspecific differences. Funct Integr Genomics 2023; 23:244. [PMID: 37454326 DOI: 10.1007/s10142-023-01175-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/09/2023] [Revised: 07/07/2023] [Accepted: 07/10/2023] [Indexed: 07/18/2023]
Abstract
Transposable elements exert a significant effect on the size and structure of eukaryotic genomes. Tc1/mariner superfamily elements represent a widely distributed and highly variable group of DNA transposons. Tc1/mariner elements include the TLE/DD34-38E, MLE/DD34D, maT/DD37D, Visitor/DD41D, Guest/DD39D, mosquito/DD37E, and L18/DD37E families, which have been studied to varying degrees. However, more detailed research into the patterns of prevalence and diversity of Tc1/mariner transposons would allow a better understanding of the coevolution of TEs and eukaryotic genomes. We performed a detailed analysis of the maT/DD37D family in Cnidaria. The study of 77 genomic assemblies demonstrated that maT transposons are found in a limited number of cnidarian species, belonging only to the classes Cubozoa (1 species), Hydrozoa (3 species) and Scyphozoa (5 species). The identified TEs were classified into 5 clades, with the representatives from Pelagiidae (class Scyphozoa) forming a separate clade of maT transposons that has not been described previously. Potentially functional copies of maT transposons were identified in the hydrae. The phylogenetic analysis and the studies of distribution among taxa and of the evolutionary dynamics of the elements suggest that the maT transposons of cnidarians are the descendants of several independent invasion events occurring at different periods of time. We also established that TEs of the mosquito/DD37E family are found in Hydridae (class Hydrozoa) only. A comparison of maT and mosquito prevalence in two genomic assemblies of Hydra viridissima revealed obvious differences, demonstrating that each individual organism might carry a unique mobilome pattern. The results of this research improve our understanding of the diversity and evolution of Tc1/mariner transposons and their effect on eukaryotic genomes.
Collapse
Affiliation(s)
- Mikhail V Puzakov
- A.O. Kovalevsky Institute of Biology of the Southern Seas of RAS, Lenninsky Eve., 38, Moscow, Russia, 119991.
| | - Lyudmila V Puzakova
- A.O. Kovalevsky Institute of Biology of the Southern Seas of RAS, Lenninsky Eve., 38, Moscow, Russia, 119991
| | - Shasha Shi
- College of Animal Science & Technology, Yangzhou University, Yangzhou, 225009, Jiangsu, China
| | - Sergey V Cheresiz
- V. Zelman Institute for Medicine and Psychology, Novosibirsk State University, Pirogova st., 1, Novosibirsk, Russia, 630090
- State Scientific Research Institute of Physiology and Basic Medicine, P.O. Box 237, Novosibirsk, Russia, 630117
| |
|
22
|
Zuo B, Nneji LM, Sun YB. Comparative genomics reveals insights into anuran genome size evolution. BMC Genomics 2023; 24:379. [PMID: 37415107 PMCID: PMC10324214 DOI: 10.1186/s12864-023-09499-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/06/2023] [Accepted: 06/30/2023] [Indexed: 07/08/2023] Open
Abstract
BACKGROUND Amphibians, particularly anurans, display enormous variation in genome size. Because whole-genome datasets were unavailable in the past, the genomic elements and evolutionary causes of anuran genome size variation are poorly understood. To address this, we analyzed whole-genome sequences of 14 anuran species ranging in size from 1.1 to 6.8 Gb. By annotating multiple genomic elements, we investigated the genomic correlates of anuran genome size variation and further examined whether genome size relates to habitat type. RESULTS Our results showed that intron expansion or contraction and transposable element (TE) diversity do not contribute significantly to genome size variation. However, the recent accumulation of TEs and the lack of deletion of ancient TEs primarily accounted for the evolution of anuran genome sizes. Our study showed that the abundance and density of simple repeat sequences positively correlate with genome size. Ancestral state reconstruction revealed that genome size exhibits a taxon-specific pattern of evolution, with the families Bufonidae and Pipidae experiencing extreme genome expansion and contraction events, respectively. Our results showed no relationship between genome size and habitat type, although species with large genomes are predominantly found in humid habitats. CONCLUSIONS Overall, our study identified the genomic elements and their evolutionary dynamics accounting for anuran genome size variation, thus paving a path to a greater understanding of genome size evolution in amphibians.
Affiliation(s)
- Bin Zuo
- Ministry of Education Key Laboratory for Transboundary Ecosecurity of Southwest China, Yunnan Key Laboratory of Plant Reproductive Adaptation and Evolutionary Ecology, Institute of Biodiversity, School of Ecology and Environmental Science, Yunnan University, Kunming, Yunnan, 650504, China
| | - Lotanna Micah Nneji
- Department of Ecology and Evolutionary Biology, Princeton University, Princeton, NJ, 08544, USA
| | - Yan-Bo Sun
- Ministry of Education Key Laboratory for Transboundary Ecosecurity of Southwest China, Yunnan Key Laboratory of Plant Reproductive Adaptation and Evolutionary Ecology, Institute of Biodiversity, School of Ecology and Environmental Science, Yunnan University, Kunming, Yunnan, 650504, China.
- Laboratory for Conservation and Utilization of Bio-resources, Yunnan University, Kunming, 650091, China.
| |
|
23
|
Xiang K, Puzakov M, Shi S, Diaby M, Ullah N, Gao B, Song C. Mosquito (MS), a DD37E Family of Tc1/Mariner, Displaying a Distinct Evolution Profile from DD37E/TRT and DD37E/L18. Genes (Basel) 2023; 14:1379. [PMID: 37510284 PMCID: PMC10379824 DOI: 10.3390/genes14071379] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/02/2023] [Revised: 06/26/2023] [Accepted: 06/26/2023] [Indexed: 07/30/2023] Open
Abstract
Diverse Tc1/mariner elements with the DD37E signature have been detected. However, their evolutionary relationships and profiles are largely unknown. Using bioinformatics methods, we defined the evolution profile of a Tc1/Mariner family that harbors a catalytic domain with the DD37E signature, and renamed it DD37E/Mosquito (MS). MS transposons form a separate monophyletic clade in the phylogenetic tree, distinct from the other two groups of elements with the DD37E signature, DD37E/L18 and DD37E/TRT (transposon related to Tc1), and show a taxonomic distribution very different from that of DD37E/TRT. MS is detected only in invertebrates, mostly in Arthropoda but also in Cnidaria, Ctenophora, Mollusca, Nematoda, and Platyhelminthes. MS elements have a total length of about 1.3 kb and contain an open reading frame (ORF) encoding a transposase of about 340 amino acids with a conserved DD37E catalytic domain. The terminal inverted repeat (TIR) lengths range from 19 bp to 203 bp, and the target site duplication (TSD) is TA. We also identified a few occurrences of MS horizontal transfer (HT) across lineages of Diptera. This paper analyzes the distribution, structural characteristics, phylogenetic evolution, and horizontal transfer of the MS family, which helps to supplement and refine the Tc1/Mariner superfamily and to uncover active transposons.
Affiliation(s)
- Kuilin Xiang
- College of Animal Science and Technology, Yangzhou University, Yangzhou 225009, China
| | - Mikhail Puzakov
- A.O. Kovalevsky Institute of Biology of the Southern Seas of RAS, Lenninsky Ave, 38, Moscow 119991, Russia
| | - Shasha Shi
- College of Animal Science and Technology, Yangzhou University, Yangzhou 225009, China
| | - Mohamed Diaby
- College of Animal Science and Technology, Yangzhou University, Yangzhou 225009, China
| | - Numan Ullah
- College of Animal Science and Technology, Yangzhou University, Yangzhou 225009, China
| | - Bo Gao
- College of Animal Science and Technology, Yangzhou University, Yangzhou 225009, China
| | - Chengyi Song
- College of Animal Science and Technology, Yangzhou University, Yangzhou 225009, China
| |
|
24
|
Li Y, Kim EJ, Voshall A, Moriyama EN, Cerutti H. Small RNAs >26 nt in length associate with AGO1 and are upregulated by nutrient deprivation in the alga Chlamydomonas. THE PLANT CELL 2023; 35:1868-1887. [PMID: 36945744 DOI: 10.1093/plcell/koad093] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Received: 12/21/2022] [Revised: 02/14/2023] [Accepted: 02/17/2023] [Indexed: 05/30/2023]
Abstract
Small RNAs (sRNAs) associate with ARGONAUTE (AGO) proteins forming effector complexes with key roles in gene regulation and defense responses against molecular parasites. In multicellular eukaryotes, extensive duplication and diversification of RNA interference (RNAi) components have resulted in intricate pathways for epigenetic control of gene expression. The unicellular alga Chlamydomonas reinhardtii also has a complex RNAi machinery, including 3 AGOs and 3 DICER-like proteins. However, little is known about the biogenesis and function of most endogenous sRNAs. We demonstrate here that Chlamydomonas contains uncommonly long (>26 nt) sRNAs that associate preferentially with AGO1. Somewhat reminiscent of animal PIWI-interacting RNAs, these >26 nt sRNAs are derived from moderately repetitive genomic clusters and their biogenesis is DICER-independent. Interestingly, the sequences generating these >26-nt sRNAs have been conserved and amplified in several Chlamydomonas species. Moreover, expression of these longer sRNAs increases substantially under nitrogen or sulfur deprivation, concurrently with the downregulation of predicted target transcripts. We hypothesize that the transposon-like sequences from which >26-nt sRNAs are produced might have been ancestrally targeted for silencing by the RNAi machinery but, during evolution, certain sRNAs might have fortuitously acquired endogenous target genes and become integrated into gene regulatory networks.
Affiliation(s)
- Yingshan Li
- School of Biological Sciences and Center for Plant Science Innovation, University of Nebraska-Lincoln, Nebraska-Lincoln, NE 68588-0666, USA
| | - Eun-Jeong Kim
- Department of Life Science, Chung-Ang University, Seoul 06974, Korea
| | - Adam Voshall
- School of Biological Sciences and Center for Plant Science Innovation, University of Nebraska-Lincoln, Nebraska-Lincoln, NE 68588-0666, USA
- Division of Genetics and Genomics, Boston Children's Hospital and Harvard Medical School, Boston, MA 02115, USA
| | - Etsuko N Moriyama
- School of Biological Sciences and Center for Plant Science Innovation, University of Nebraska-Lincoln, Nebraska-Lincoln, NE 68588-0666, USA
| | - Heriberto Cerutti
- School of Biological Sciences and Center for Plant Science Innovation, University of Nebraska-Lincoln, Nebraska-Lincoln, NE 68588-0666, USA
| |
|
25
|
Alastruey-Izquierdo A, Martín-Galiano AJ. The challenges of the genome-based identification of antifungal resistance in the clinical routine. Front Microbiol 2023; 14:1134755. [PMID: 37152754 PMCID: PMC10157239 DOI: 10.3389/fmicb.2023.1134755] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/30/2022] [Accepted: 04/05/2023] [Indexed: 05/09/2023] Open
Abstract
The increasing number of chronic and life-threatening infections caused by antimicrobial-resistant fungal isolates is of critical concern. Low DNA sequencing costs may facilitate the identification of the genomic profile leading to resistance, the resistome, to rationally optimize the design of antifungal therapies. However, compared to bacteria, initiatives for resistome detection in eukaryotic pathogens are underdeveloped. Firstly, reported mutations in antifungal targets leading to reduced susceptibility must be extensively collected from the literature to generate comprehensive databases. This information should be complemented with specific laboratory screenings to detect the highest possible number of relevant genetic changes in primary targets and associations between resistance and other genomic markers. Strikingly, some drug-resistant strains experience high-level genetic changes such as ploidy variation as well as duplications and reorganizations of specific chromosomes. Such variations involve allelic dominance, gene dosage increments and target expression regime effects that should be explicitly parameterized in antifungal resistome prediction algorithms. Clinical data indicate that predictors need to operate at the level of the precise pathogen species and drug, instead of just genus and drug class. The concomitant needs for mutation accuracy and assembly quality assurance suggest that hybrid sequencing approaches involving third-generation methods will be utilized. Moreover, fast fatal infections, like fungemia and meningitis, will further require that both sequencing and analysis facilities be available in-house. Altogether, the complex nature of antifungal resistance demands that extensive sequencing, data acquisition and processing, bioinformatic analysis pipelines, and standard protocols be in place before genome-based protocols are applied in the clinical setting.
Affiliation(s)
- Ana Alastruey-Izquierdo
- Mycology Reference Laboratory, National Centre for Microbiology, Instituto de Salud Carlos III, Madrid, Spain
- Center for Biomedical Research in Network in Infectious Diseases (CIBERINFEC-CB21/13/00105), Instituto de Salud Carlos III, Madrid, Spain
| | | |
|
26
|
Yushkova E, Moskalev A. Transposable elements and their role in aging. Ageing Res Rev 2023; 86:101881. [PMID: 36773759 DOI: 10.1016/j.arr.2023.101881] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/17/2022] [Revised: 01/16/2023] [Accepted: 02/07/2023] [Indexed: 02/12/2023]
Abstract
Transposable elements (TEs) are an important part of eukaryotic genomes. The role of somatic transposition in aging, carcinogenesis, and other age-related diseases has been determined. This review discusses the fundamental properties of TEs and their complex interactions with cellular processes, which are crucial for understanding the diverse effects of their activity on the genetics and epigenetics of the organism. The interactions of TEs with recombination, replication, repair, and chromosomal regulation; the ability of TEs to maintain a balance between their own activity and repression, the involvement of TEs in the creation of new or alternative genes, the expression of coding/non-coding RNA, and the role in DNA damage and modification of regulatory networks are reviewed. The contribution of the derepressed TEs to age-dependent effects in individual cells/tissues in different organisms was assessed. Conflicting information about TE activity under stress as well as theories of aging mechanisms related to TEs is discussed. On the one hand, transposition activity in response to stressors can lead to organisms acquiring adaptive innovations of great importance for evolution at the population level. On the other hand, the TE expression can cause decreased longevity and stress tolerance at the individual level. The specific features of TE effects on aging processes in germline and soma and the ways of their regulation in cells are highlighted. Recent results considering somatic mutations in normal human and animal tissues are indicated, with the emphasis on their possible functional consequences. In the context of aging, the correlation between somatic TE activation and age-related changes in the number of proteins required for heterochromatin maintenance and longevity regulation was analyzed. 
One of the original features of this review is a discussion of not only the effects of TE insertions and the associated consequences for germline cell dynamics and the somatic genome, but also the differences between transposon- and retrotransposon-mediated structural genome changes and possible phenotypic characteristics associated with aging and various age-related pathologies. Based on the analysis of published data, a hypothesis was formulated about the influence of species-specific features of the number, composition, and distribution of TEs on the aging dynamics of different animal genomes.
Affiliation(s)
- Elena Yushkova
- Laboratory of Geroprotective and Radioprotective Technologies, Institute of Biology, Komi Science Center, Ural Branch, Russian Academy of Sciences, 28 Kommunisticheskaya st., 167982 Syktyvkar, Russian Federation
| | - Alexey Moskalev
- Laboratory of Geroprotective and Radioprotective Technologies, Institute of Biology, Komi Science Center, Ural Branch, Russian Academy of Sciences, 28 Kommunisticheskaya st., 167982 Syktyvkar, Russian Federation; Laboratory of Genetics and Epigenetics of Aging, Russian Clinical Research Center for Gerontology, Pirogov Russian National Research Medical University, Moscow 129226, Russian Federation; Longaevus Technologies, London, UK.
| |
|
27
|
Stamidis N, Żylicz JJ. RNA-mediated heterochromatin formation at repetitive elements in mammals. EMBO J 2023; 42:e111717. [PMID: 36847618 PMCID: PMC10106986 DOI: 10.15252/embj.2022111717] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/20/2022] [Revised: 12/12/2022] [Accepted: 02/07/2023] [Indexed: 03/01/2023] Open
Abstract
The failure to repress transcription of repetitive genomic elements can lead to catastrophic genome instability and is associated with various human diseases. As such, multiple parallel mechanisms cooperate to ensure repression and heterochromatinization of these elements, especially during germline development and early embryogenesis. A vital question in the field is how specificity in establishing heterochromatin at repetitive elements is achieved. Apart from trans-acting protein factors, recent evidence points to a role of different RNA species in targeting repressive histone marks and DNA methylation to these sites in mammals. Here, we review recent discoveries on this topic and predominantly focus on the role of RNA methylation, piRNAs, and other localized satellite RNAs.
Affiliation(s)
- Nikolaos Stamidis
- Novo Nordisk Foundation Center for Stem Cell Medicine, reNEW, University of Copenhagen, Copenhagen, Denmark
| | - Jan Jakub Żylicz
- Novo Nordisk Foundation Center for Stem Cell Medicine, reNEW, University of Copenhagen, Copenhagen, Denmark
| |
|
28
|
IS481EU Shows a New Connection between Eukaryotic and Prokaryotic DNA Transposons. BIOLOGY 2023; 12:biology12030365. [PMID: 36979057 PMCID: PMC10045372 DOI: 10.3390/biology12030365] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/23/2022] [Revised: 02/03/2023] [Accepted: 02/24/2023] [Indexed: 03/03/2023]
Abstract
The DDD/E transposase gene is the most abundant gene in nature, and many DNA transposons in all three domains of life use it for their transposition. A substantial number of eukaryotic DNA transposons show similarity to prokaryotic insertion sequences (ISs). The presence of IS481-like DNA transposons was indicated in the genome of Trichomonas vaginalis. Here, we surveyed IS481-like eukaryotic sequences using a bioinformatics approach and report a group of eukaryotic IS481-like DNA transposons, designated IS481EU, from parabasalids including T. vaginalis. The lengths of target site duplications (TSDs) of IS481EU are around 4 bp, around 15 bp, or around 25 bp, and strikingly, these discrete TSD lengths can be observed even within a single IS481EU family. Phylogenetic analysis indicated close relationships of IS481EU with some of the prokaryotic IS481 family members. IS481EU was not well separated from IS3EU/GingerRoot in the phylogenetic analysis, but was distinct from other eukaryotic DNA transposons including Ginger1 and Ginger2. The unique characteristics of IS481EU in protein sequences and the distribution of TSD lengths support its placement as a new superfamily of eukaryotic DNA transposons.
|
29
|
Yuan X, Li Y, Luo T, Bi W, Yu J, Wang Y. Genomic Analysis of the Xanthoria elegans and Polyketide Synthase Gene Mining Based on the Whole Genome. MYCOBIOLOGY 2023; 51:36-48. [PMID: 36846628 PMCID: PMC9946308 DOI: 10.1080/12298093.2023.2175428] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/08/2022] [Revised: 12/29/2022] [Accepted: 12/29/2022] [Indexed: 06/18/2023]
Abstract
Xanthoria elegans is a lichen symbiosis that inhabits extreme environments and can absorb UV-B. We report the de novo sequencing and assembly of the X. elegans genome. The whole genome was approximately 44.63 Mb, with a GC content of 40.69%. Genome assembly generated 207 scaffolds with an N50 length of 563,100 bp and an N90 length of 122,672 bp. The genome comprised 9,581 genes, some of which encode enzymes involved in the secondary metabolism of compounds such as terpenes and polyketides. To better understand the mechanisms of UV-B absorption and adaptation to extreme environments in X. elegans, we searched the genome for secondary metabolite genes and gene clusters using genome mining and bioinformatics analysis. The results revealed 7 NR-PKSs, 12 HR-PKSs and 2 hybrid PKS-PKSs in X. elegans, all belonging to Type I PKS (T1PKS) according to their domain architecture. Phylogenetic analysis and BGC comparison linked putative products to two NR-PKSs and three HR-PKSs: the putative products of the two NR-PKSs were emodin xanthrone (most likely parietin) and mycophenolic acid, and the putative products of the three HR-PKSs were soppilines, (+)-asperlin and macrolactone brefeldin A, respectively. For these 5 PKSs from X. elegans, a correlation between the SM carbon skeleton and the PKS genes was established based on domain architecture, phylogenetic analysis and BGC comparison. Although the function of 16 PKSs remains unclear, the findings emphasize that the genes from X. elegans represent an unexploited source of novel polyketides and support the utilization of lichen gene resources.
Affiliation(s)
- Xiaolong Yuan
- Hubei Key Laboratory of Economic Forest Germplasm Improvement and Resources Comprehensive Utilization, Huanggang Normal University, Huanggang, Hubei, People’s Republic of China
- Yunnan Key Laboratory of Forest Plant Cultivation and Utilization/National Forestry and Grassland Administration Key Laboratory of Yunnan Rare and Endangered Species Conservation and Propagation, Yunnan Academy of Forestry and Grassland, Kunming, Yunnan, People’s Republic of China
| | - Yunqing Li
- Yunnan Key Laboratory of Forest Plant Cultivation and Utilization/National Forestry and Grassland Administration Key Laboratory of Yunnan Rare and Endangered Species Conservation and Propagation, Yunnan Academy of Forestry and Grassland, Kunming, Yunnan, People’s Republic of China
| | - Ting Luo
- Yunnan Key Laboratory of Forest Plant Cultivation and Utilization/National Forestry and Grassland Administration Key Laboratory of Yunnan Rare and Endangered Species Conservation and Propagation, Yunnan Academy of Forestry and Grassland, Kunming, Yunnan, People’s Republic of China
| | - Wei Bi
- Yunnan Key Laboratory of Forest Plant Cultivation and Utilization/National Forestry and Grassland Administration Key Laboratory of Yunnan Rare and Endangered Species Conservation and Propagation, Yunnan Academy of Forestry and Grassland, Kunming, Yunnan, People’s Republic of China
| | - Jiaojun Yu
- Hubei Key Laboratory of Economic Forest Germplasm Improvement and Resources Comprehensive Utilization, Huanggang Normal University, Huanggang, Hubei, People’s Republic of China
| | - Yi Wang
- Yunnan Key Laboratory of Forest Plant Cultivation and Utilization/National Forestry and Grassland Administration Key Laboratory of Yunnan Rare and Endangered Species Conservation and Propagation, Yunnan Academy of Forestry and Grassland, Kunming, Yunnan, People’s Republic of China
| |
|
30
|
Gasparotto E, Burattin FV, Di Gioia V, Panepuccia M, Ranzani V, Marasca F, Bodega B. Transposable Elements Co-Option in Genome Evolution and Gene Regulation. Int J Mol Sci 2023; 24:ijms24032610. [PMID: 36768929 PMCID: PMC9917352 DOI: 10.3390/ijms24032610] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/23/2022] [Revised: 01/26/2023] [Accepted: 01/28/2023] [Indexed: 01/31/2023] Open
Abstract
The genome is no longer regarded as a fixed and inert entity but rather as moldable matter that is continuously evolving and adapting. Within this frame, Transposable Elements (TEs), which are ubiquitous, mobile, repetitive elements, are now considered a living portion of genomes, whose functions, although long considered "dark", are coming to light. Here we review how, besides the detrimental effects that TE mobilization can induce, TEs have shaped genomes into their current form, promoting genome sizing, genomic rearrangements and shuffling of DNA sequences. Although TEs are mostly represented in genomes by evolutionarily old, short, degenerated, and sedentary fossils, they have been thoroughly co-opted by their hosts as a prolific and original source of regulatory instruments for the control of gene transcription and genome organization in the nuclear space. For these reasons, the deregulation of TE expression and/or activity is implicated in the onset and progression of several diseases. It is likely that we have revealed only the outermost layers of TE functions. Further studies on this portion of the genome are required to unlock novel regulatory functions that could also be exploited for diagnostic and therapeutic approaches.
Affiliation(s)
- Erica Gasparotto
- Fondazione INGM, Istituto Nazionale di Genetica Molecolare “Enrica e Romeo Invernizzi”, 20122 Milan, Italy
- SEMM, European School of Molecular Medicine, 20139 Milan, Italy
| | - Filippo Vittorio Burattin
- Fondazione INGM, Istituto Nazionale di Genetica Molecolare “Enrica e Romeo Invernizzi”, 20122 Milan, Italy
- Department of Biosciences, University of Milan, 20133 Milan, Italy
| | - Valeria Di Gioia
- Fondazione INGM, Istituto Nazionale di Genetica Molecolare “Enrica e Romeo Invernizzi”, 20122 Milan, Italy
- SEMM, European School of Molecular Medicine, 20139 Milan, Italy
| | - Michele Panepuccia
- Fondazione INGM, Istituto Nazionale di Genetica Molecolare “Enrica e Romeo Invernizzi”, 20122 Milan, Italy
| | - Valeria Ranzani
- Fondazione INGM, Istituto Nazionale di Genetica Molecolare “Enrica e Romeo Invernizzi”, 20122 Milan, Italy
| | - Federica Marasca
- Fondazione INGM, Istituto Nazionale di Genetica Molecolare “Enrica e Romeo Invernizzi”, 20122 Milan, Italy
- Department of Clinical Sciences and Community Health, University of Milan, 20122 Milan, Italy
| | - Beatrice Bodega
- Fondazione INGM, Istituto Nazionale di Genetica Molecolare “Enrica e Romeo Invernizzi”, 20122 Milan, Italy
- Department of Biosciences, University of Milan, 20133 Milan, Italy
| |
|
31
|
Devaux CA, Pontarotti P, Nehari S, Raoult D. 'Cannibalism' of exogenous DNA sequences: The ancestral form of adaptive immunity which entails recognition of danger. Front Immunol 2022; 13:989707. [PMID: 36618387 PMCID: PMC9816338 DOI: 10.3389/fimmu.2022.989707] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Received: 07/08/2022] [Accepted: 12/05/2022] [Indexed: 12/24/2022] Open
Abstract
Adaptive immunity is a sophisticated form of immune response capable of retaining the molecular memory of a very great diversity of target antigens (epitopes) as non-self. It is capable of reactivating itself upon a second encounter of an immunoglobulin or T-cell receptor antigen-binding site with a known epitope that had previously primed the host immune system. It has long been considered that adaptive immunity is a highly evolved form of non-self recognition that appeared quite late in speciation and complemented a more generalist response called innate immunity. Innate immunity offers a relatively non-specific defense (although mediated by sensors that can specifically recognize viral or bacterial compounds) and does not retain a memory of the danger. But this notion of a recent acquisition of adaptive immunity is challenged by the fact that another form of specific recognition mechanism, which may specifically protect the cell against external danger, already existed in prokaryotes. This recognition mechanism can be considered a primitive form of specific (adaptive) non-self recognition. It is based on the fact that many archaea and bacteria use a genome editing system that confers the ability to appropriate viral DNA sequences, allowing prokaryotes to prevent host damage through a mechanism very similar to adaptive immunity. This is variously called 'endogenization of foreign DNA', 'viral DNA predation' or, more pictorially, 'DNA cannibalism'. For several years evidence has been accumulating that highlights the crucial role of endogenization of foreign DNA in the fundamental processes related to adaptive immunity, leading to a change in the dogma that adaptive immunity appeared late in speciation.
Affiliation(s)
- Christian A. Devaux
- Aix-Marseille University, Institut de recherche pour le développement (IRD), Assistance Publique Hôpitaux de Marseille (APHM), MEPHI, Institut Hospitalo-universitaire (IHU)-Méditerranée Infection, Marseille, France; Department of Biological Sciences, Centre National de la Recherche Scientifique, Centre National de la Recherche Scientifique (CNRS)-SNC5039, Marseille, France; *Correspondence: Christian A. Devaux
| | - Pierre Pontarotti
- Aix-Marseille University, Institut de recherche pour le développement (IRD), Assistance Publique Hôpitaux de Marseille (APHM), MEPHI, Institut Hospitalo-universitaire (IHU)-Méditerranée Infection, Marseille, France; Department of Biological Sciences, Centre National de la Recherche Scientifique, Centre National de la Recherche Scientifique (CNRS)-SNC5039, Marseille, France
| | - Sephora Nehari
- Aix-Marseille University, Institut de recherche pour le développement (IRD), Assistance Publique Hôpitaux de Marseille (APHM), MEPHI, Institut Hospitalo-universitaire (IHU)-Méditerranée Infection, Marseille, France
| | - Didier Raoult
- Aix-Marseille University, Institut de recherche pour le développement (IRD), Assistance Publique Hôpitaux de Marseille (APHM), MEPHI, Institut Hospitalo-universitaire (IHU)-Méditerranée Infection, Marseille, France
| |
|
32
|
Modenini G, Abondio P, Boattini A. The coevolution between APOBEC3 and retrotransposons in primates. Mob DNA 2022; 13:27. [PMID: 36443831 PMCID: PMC9706992 DOI: 10.1186/s13100-022-00283-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/06/2022] [Accepted: 10/31/2022] [Indexed: 12/02/2022] Open
Abstract
Retrotransposons are genetic elements with the ability to replicate in the genome using reverse transcriptase: they have been associated with the development of different biological structures, such as the Central Nervous System (CNS), and their high mutagenic potential has been linked to various diseases, including cancer and neurological disorders. Throughout evolution and over time, Primates and Homo had to cope with infections from viruses and bacteria, and also with endogenous retroelements. Therefore, host genomes have evolved numerous methods to counteract the activity of endogenous and exogenous pathogens, and the APOBEC3 family of mutators is a prime example of a defensive mechanism in this context. In most Primates, there are seven members of the APOBEC3 family of deaminase proteins: among their functions, there is the ability to inhibit the mobilization of retrotransposons and the functionality of viruses. The evolution of the APOBEC3 proteins found in Primates is correlated with the expansion of two major families of retrotransposons, i.e. ERV and LINE-1. In this review, we will discuss how the rapid expansion of the APOBEC3 family is linked to the evolution of retrotransposons, highlighting the strong evolutionary arms race that characterized the history of APOBEC3s and endogenous retroelements in Primates. Moreover, the possible role of this relationship will be assessed in the context of embryonic development and brain-associated diseases.
Affiliation(s)
- Giorgia Modenini
- Department of Biological, Geological and Environmental Sciences, University of Bologna, Bologna, Italy
| | - Paolo Abondio
- Department of Biological, Geological and Environmental Sciences, University of Bologna, Bologna, Italy; Department of Cultural Heritage, University of Bologna, Ravenna, Italy
| | - Alessio Boattini
- Department of Biological, Geological and Environmental Sciences, University of Bologna, Bologna, Italy
| |
|
33
|
Qi X, Wang H, Chen S, Feng J, Chen H, Qin Z, Blilou I, Deng Y. The genome of single-petal jasmine (Jasminum sambac) provides insights into heat stress tolerance and aroma compound biosynthesis. FRONTIERS IN PLANT SCIENCE 2022; 13:1045194. [PMID: 36340389 PMCID: PMC9627619 DOI: 10.3389/fpls.2022.1045194] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Received: 09/15/2022] [Accepted: 10/03/2022] [Indexed: 06/16/2023]
Abstract
Jasmine [Jasminum sambac (L.) Aiton] is a commercially important cultivated plant species known for its fragrant flowers used in the perfume industry, medicine and cosmetics. In the present study, we obtained a draft genome for the J. sambac cultivar 'Danbanmoli' (JSDB, a single-petal phenotype). We showed that the final genome of J. sambac was 520.80 Mb in size (contig N50 = 145.43 kb; scaffold N50 = 145.53 kb) and comprised 35,363 genes. Our analyses revealed that the J. sambac genome has undergone only an ancient whole-genome duplication (WGD) event. We estimated that the lineage that has given rise to J. sambac diverged from the lineage leading to Osmanthus fragrans and Olea europaea approximately 31.1 million years ago (Mya). On the basis of a combination of genomic and transcriptomic analyses, we identified 92 transcription factors (TFs) and 206 genes related to heat stress response. Based on a combination of genomic, transcriptomic and metabolomic analyses, a range of aroma compounds and genes involved in the benzenoid/phenylpropanoid and terpenoid biosynthesis pathways were identified. In the newly assembled J. sambac genome, we identified a total of 122 MYB, 122 bHLH and 69 WRKY genes. Our assembled J. sambac JSDB genome provides a foundation for studying the molecular mechanisms of heat stress tolerance, improving jasmine flowers, and dissecting their fragrance.
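The contig and scaffold N50 values quoted in this abstract follow the standard assembly statistic: sort the sequences from longest to shortest, and N50 is the length of the sequence at which the running total first reaches half of the assembly size. The sketch below illustrates that definition only. The helper name `n50` and the toy lengths are assumptions made for illustration, not data or code from the JSDB assembly.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths.

    N50 is the length L such that sequences of length >= L together
    cover at least half of the total assembly length.
    """
    if not lengths:
        raise ValueError("need at least one sequence length")
    total = sum(lengths)
    running = 0
    # Walk from the longest sequence to the shortest.
    for length in sorted(lengths, reverse=True):
        running += length
        # Integer comparison avoids floating-point halves.
        if 2 * running >= total:
            return length


# Hypothetical toy contig lengths (arbitrary units), not real data.
toy_contigs = [80, 70, 50, 40, 30, 20, 10]
toy_n50 = n50(toy_contigs)
print(toy_n50)
```

With these toy values the total is 300, and the two longest contigs (80 + 70) already reach 150, so the reported N50 is 70.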
Affiliation(s)
- Xiangyu Qi
- Jiangsu Key Laboratory for Horticultural Crop Genetic Improvement, Institute of Leisure Agriculture, Jiangsu Academy of Agricultural Sciences, Nanjing, China
| | - Huadi Wang
- Jiangsu Key Laboratory for Horticultural Crop Genetic Improvement, Institute of Leisure Agriculture, Jiangsu Academy of Agricultural Sciences, Nanjing, China
- School of Life Sciences, Jiangsu University, Zhenjiang, China
| | - Shuangshuang Chen
- Jiangsu Key Laboratory for Horticultural Crop Genetic Improvement, Institute of Leisure Agriculture, Jiangsu Academy of Agricultural Sciences, Nanjing, China
| | - Jing Feng
- Jiangsu Key Laboratory for Horticultural Crop Genetic Improvement, Institute of Leisure Agriculture, Jiangsu Academy of Agricultural Sciences, Nanjing, China
| | - Huijie Chen
- Jiangsu Key Laboratory for Horticultural Crop Genetic Improvement, Institute of Leisure Agriculture, Jiangsu Academy of Agricultural Sciences, Nanjing, China
| | - Ziyi Qin
- Jiangsu Key Laboratory for Horticultural Crop Genetic Improvement, Institute of Leisure Agriculture, Jiangsu Academy of Agricultural Sciences, Nanjing, China
- College of Horticulture, Nanjing Agricultural University, Nanjing, China
| | - Ikram Blilou
- Biological and Environmental Sciences and Engineering, King Abdullah University of Science and Technology, Thuwal, Saudi Arabia
| | - Yanming Deng
- Jiangsu Key Laboratory for Horticultural Crop Genetic Improvement, Institute of Leisure Agriculture, Jiangsu Academy of Agricultural Sciences, Nanjing, China
- School of Life Sciences, Jiangsu University, Zhenjiang, China
- College of Horticulture, Nanjing Agricultural University, Nanjing, China
| |
|
34
|
Cerbin S, Ou S, Li Y, Sun Y, Jiang N. Distinct composition and amplification dynamics of transposable elements in sacred lotus (Nelumbo nucifera Gaertn.). The Plant Journal: for Cell and Molecular Biology 2022; 112:172-192. [PMID: 35959634 PMCID: PMC9804982 DOI: 10.1111/tpj.15938] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/21/2022] [Revised: 07/19/2022] [Accepted: 08/08/2022] [Indexed: 06/15/2023]
Abstract
Sacred lotus (Nelumbo nucifera Gaertn.) is a basal eudicot plant with a unique lifestyle, physiological features, and evolutionary characteristics. Here we report the unique profile of transposable elements (TEs) in the genome, using a manually curated repeat library. TEs account for 59% of the genome, and hAT (Ac/Ds) elements alone represent 8%, more than in any other known plant genome. About 18% of the lotus genome is comprised of Copia LTR retrotransposons, and over 25% of them are associated with non-canonical termini (non-TGCA). Such high abundance of non-canonical LTR retrotransposons has not been reported for any other organism. TEs are very abundant in genic regions, with retrotransposons enriched in introns and DNA transposons primarily in flanking regions of genes. The recent insertion of TEs in introns has led to significant intron size expansion, with a total of 200 Mb in the 28 455 genes. This is accompanied by declining TE activity in intergenic regions, suggesting distinct control efficacy of TE amplification in different genomic compartments. Despite the prevalence of TEs in genic regions, some genes are associated with fewer TEs, such as those involved in fruit ripening and stress responses. Other genes are enriched with TEs, and genes in epigenetic pathways are the most associated with TEs in introns, indicating a dynamic interaction between TEs and the host surveillance machinery. The dramatic differential abundance of TEs with genes involved in different biological processes as well as the variation of target preference of different TEs suggests the composition and activity of TEs influence the path of evolution.
Affiliation(s)
- Stefan Cerbin
- Department of Horticulture, Michigan State University, 1066 Bogue Street, East Lansing, MI 48824, USA
- Present address: Department of Ecology & Evolutionary Biology, University of Kansas, 1200 Sunnyside Avenue, Lawrence, KS 66045, USA
| | - Shujun Ou
- Department of Horticulture, Michigan State University, 1066 Bogue Street, East Lansing, MI 48824, USA
- Present address: Department of Computer Science, Johns Hopkins University, Baltimore, MD 21218, USA
| | - Yang Li
- Department of Electrical Engineering, City University of Hong Kong, Kowloon, Hong Kong SAR, China
| | - Yanni Sun
- Department of Electrical Engineering, City University of Hong Kong, Kowloon, Hong Kong SAR, China
| | - Ning Jiang
- Department of Horticulture, Michigan State University, 1066 Bogue Street, East Lansing, MI 48824, USA
| |
|
35
|
Hayashi S, Honda Y, Kanesaki E, Koga A. Marsupial satellite DNA as faithful reflections of long terminal repeat (LTR) retroelement structure. Genome 2022; 65:469-478. [PMID: 35930809 DOI: 10.1139/gen-2022-0039] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Indexed: 11/22/2022]
Abstract
Long terminal repeat (LTR) retroelements, including endogenous retroviruses, are one of the origins of satellite DNAs. However, the vast majority of satellite DNAs originating from LTR retroelements consist of parts of the element. In addition, they frequently contain sequences unrelated to that element. Here we report a novel marsupial satellite DNA (named walbRep) that contains, and consists solely of, the entire sequence of an LTR retroelement (the walb element). As is common with LTR retroelements, walb copies exhibit length variation. We focused on the abundance of copies of a specific length (2.7 kb) in the genome of the red-necked wallaby. Cloning and analyses of long genomic DNA fragments revealed a satellite DNA in which the LTR sequence (0.4 kb) and the sequence of the internal region of a nonautonomous walb copy (2.3 kb) were repeated alternately. The junctions between these two components exhibited the same end-to-end arrangements as those in the walb element. This satellite organization could be accounted for by a simple formation model that includes slippage during chromosome pairing followed by homologous recombination but does not invoke any other types of rearrangements. We discuss the possible reasons why satellite DNAs having such structures are rarely found in mammals.
Affiliation(s)
| | - Yusuke Honda
- Noichi Zoological Park of Kochi Prefecture, Konan, Japan;
| | | | | |
|
36
|
Riehl K, Riccio C, Miska EA, Hemberg M. TransposonUltimate: software for transposon classification, annotation and detection. Nucleic Acids Res 2022; 50:e64. [PMID: 35234904 PMCID: PMC9226531 DOI: 10.1093/nar/gkac136] [Citation(s) in RCA: 30] [Impact Index Per Article: 15.0] [Received: 06/09/2021] [Revised: 02/09/2022] [Accepted: 02/14/2022] [Indexed: 12/17/2022] Open
Abstract
Most genomes harbor a large number of transposons, and they play an important role in evolution and gene regulation. They are also of interest to clinicians as they are involved in several diseases, including cancer and neurodegeneration. Although several methods for transposon identification are available, they are often highly specialised towards specific tasks or classes of transposons, and they lack common standards such as a unified taxonomy scheme and output file format. We present TransposonUltimate, a powerful bundle of three modules for transposon classification, annotation, and detection of transposition events. TransposonUltimate comes as a Conda package under the GPL-3.0 licence, is well documented, and is easy to install through https://github.com/DerKevinRiehl/TransposonUltimate. We benchmark the classification module on the large TransposonDB covering 891,051 sequences to demonstrate that it outperforms the currently best existing solutions. The annotation and detection modules combine sixteen existing software tools, and we illustrate their use by annotating Caenorhabditis elegans, Rhizophagus irregularis and Oryza sativa subsp. japonica genomes. Finally, we use the detection module to discover 29 554 transposition events in the genomes of 20 wild type strains of C. elegans. Databases, assemblies, annotations and further findings can be downloaded from https://doi.org/10.5281/zenodo.5518085.
Affiliation(s)
- Kevin Riehl
- Gurdon Institute, University of Cambridge, Cambridge CB2 1QN, UK
| | - Cristian Riccio
- Gurdon Institute, University of Cambridge, Cambridge CB2 1QN, UK
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton CB10 1SA, UK
| | - Eric A Miska
- Gurdon Institute, University of Cambridge, Cambridge CB2 1QN, UK
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton CB10 1SA, UK
- Department of Genetics, University of Cambridge, Downing Street, Cambridge CB2 3EH, UK
| | - Martin Hemberg
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton CB10 1SA, UK
- Evergrande Center for Immunologic Diseases, Harvard Medical School and Brigham and Women’s Hospital, 75 Francis Street, Boston, MA 02215, USA
| |
|
37
|
Puzakov MV, Puzakova LV. Prevalence, Diversity, and Evolution of L18 (DD37E) Transposons in the Genomes of Cnidarians. Mol Biol 2022. [DOI: 10.1134/s0026893322030104] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Indexed: 01/05/2023]
|
38
|
Etchegaray E, Dechaud C, Barbier J, Naville M, Volff JN. Diversity of Harbinger-like Transposons in Teleost Fish Genomes. Animals (Basel) 2022; 12:ani12111429. [PMID: 35681893 PMCID: PMC9179366 DOI: 10.3390/ani12111429] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/13/2022] [Revised: 05/23/2022] [Accepted: 05/30/2022] [Indexed: 11/16/2022] Open
Abstract
Simple Summary: The study of transposable elements, which are repeated DNA sequences that can insert into new locations in genomes, is of particular interest to genome evolution, as they are sources of mutations but also of new regulatory and coding sequences. Teleost fish are a species-rich clade presenting a high diversity of transposable elements, both quantitatively and qualitatively, making them a very attractive group to investigate the evolution of mobile sequences. We studied Harbinger-like DNA transposons, which are widespread from plants to vertebrates but absent from mammalian genomes. These elements code for both a transposase and a Myb-like protein. We observed high variability in the genomic composition of Harbinger-like sequences in teleost fish. While Harbinger transposons might have been present in a common ancestor of all the fish species studied, ISL2EU elements were possibly gained by horizontal transfer at the base of teleost fish. Transposase and Myb-like protein phylogenies of Harbinger transposons indicated unique origins of the association between both genes and suggested that recombination was rare between transposon sublineages. Finally, we report one case of Harbinger horizontal transfer between divergent fish species and the transcriptional activity of both Harbinger and ISL2EU transposons in teleost fish. There was male-biased expression in the gonads of the medaka fish.
Abstract: Harbinger elements are DNA transposons that are widespread from plants to vertebrates but absent from mammalian genomes. Among vertebrates, teleost fish are the clade presenting not only the largest number of species but also the highest diversity of transposable elements, both quantitatively and qualitatively, making them a very attractive group to investigate the evolution of mobile sequences. We studied Harbinger DNA transposons and the distantly related ISL2EU elements in fish, focusing on representative teleost species compared to the spotted gar, the coelacanth, the elephant shark and the amphioxus. We observed high variability in the genomic composition of Harbinger-like sequences in teleost fish, as they covered 0.002–0.14% of the genome, when present. While Harbinger transposons might have been present in a common ancestor of all the fish species studied here, with secondary loss in elephant shark, our results suggest that ISL2EU elements were gained by horizontal transfer at the base of teleost fish 200–300 million years ago, and that there was secondary loss in a common ancestor of pufferfishes and stickleback. Harbinger transposons code for a transposase and a Myb-like protein. We reconstructed and compared molecular phylogenies of both proteins to get insights into the evolution of Harbinger transposons in fish. Transposase and Myb-like protein phylogenies showed global congruent evolution, indicating unique origin of the association between both genes and suggesting rare recombination between transposon sublineages. Finally, we report one case of Harbinger horizontal transfer between divergent fish species and the transcriptional activity of both Harbinger and ISL2EU transposons in teleost fish. There was male-biased expression in the gonads of the medaka fish.
|
39
|
Guan Z, Shi S, Diaby M, Danley P, Ullah N, Puzakov M, Gao B, Song C. Horizontal transfer of Buster transposons across multiple phyla and classes of animals. Mol Phylogenet Evol 2022; 173:107506. [PMID: 35595006 DOI: 10.1016/j.ympev.2022.107506] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Received: 11/15/2021] [Revised: 03/06/2022] [Accepted: 04/05/2022] [Indexed: 10/18/2022]
Abstract
Transposable elements (TEs) are mobile genetic elements that are broadly distributed across both prokaryotes and eukaryotes and play an important role in shaping the genome evolution of their hosts. hAT elements are thought to be the most widespread cut-and-paste DNA transposons found throughout the tree of life. Buster is a recently recognized family of hAT. However, the evolutionary profile of the Buster family, such as its taxonomic distribution, evolutionary pattern, and activities, remains largely unknown. We conducted a systematic analysis of the evolutionary landscape of the Buster family and found that most Buster transposons are 1.72-4.66 kilobases (kb) in length, encode 500-736-amino acid (aa) transposases and are flanked by short (10-18 bp) terminal inverted repeats (TIRs) and 8 bp target site duplications (TSDs). The Buster family is widely distributed in 609 species, involving eight classes of invertebrates and most lineages of vertebrates (including mammals). Horizontal transfer events were detected across multiple phyla and classes of animals, which may have contributed to their wide distribution, and both parasites and invasive species may facilitate HT events of Buster in vertebrates. Our data also suggest that Buster transposons are young, highly active, and appear as intact copies in multiple lineages of animals. High percentages of intact copies (>30%) were identified in some Arthropoda, Actinopterygii, Agnatha, and reptile species, and some of these may be active. These data will help increase understanding of the evolution of the hAT superfamily and its impact on eukaryotic genome evolution.
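The structural hallmarks named in this abstract can be stated concretely: a terminal inverted repeat (TIR) means the first k bases of the element equal the reverse complement of its last k bases, and a target site duplication (TSD) means the same short sequence appears directly on both sides of the insertion. The sketch below checks both properties on a made-up toy sequence. The function names, the toy sequences, and the exact-match criterion are assumptions for illustration only; they are not the authors' pipeline, and real annotation tools usually tolerate mismatches.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Reverse complement of an uppercase A/C/G/T string."""
    return seq.translate(_COMPLEMENT)[::-1]


def tir_length(element, max_len=18):
    """Length of the longest perfect TIR (up to max_len) at the element ends."""
    best = 0
    for k in range(1, min(max_len, len(element) // 2) + 1):
        if element[:k] == reverse_complement(element[-k:]):
            best = k
        else:
            # A perfect TIR of length k implies one of every shorter length,
            # so the first failure ends the search.
            break
    return best


def has_tsd(left_flank, right_flank, size=8):
    """True if the bases just left and just right of the element are identical."""
    if len(left_flank) < size or len(right_flank) < size:
        return False
    return left_flank[-size:] == right_flank[:size]


# Hypothetical toy element: 10 bp TIR + short internal region + inverted TIR.
toy_tir = "CAGTGGTTCA"
toy_element = toy_tir + "ACGTTTGA" + reverse_complement(toy_tir)
toy_tsd = "GATTACAG"  # hypothetical 8 bp target site

print(tir_length(toy_element))
print(has_tsd("TTT" + toy_tsd, toy_tsd + "CCC"))
```

Exact matching keeps the sketch short; allowing a mismatch budget when comparing the two ends would be a natural extension for diverged copies.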
Affiliation(s)
- Zhongxia Guan
- College of Animal Science & Technology, Yangzhou University, Yangzhou, Jiangsu 225009, China
| | - Shasha Shi
- College of Animal Science & Technology, Yangzhou University, Yangzhou, Jiangsu 225009, China
| | - Mohamed Diaby
- College of Animal Science & Technology, Yangzhou University, Yangzhou, Jiangsu 225009, China
| | - Patrick Danley
- University of Pittsburgh Medical Center, Pittsburgh, PA 15213, USA
| | - Numan Ullah
- College of Animal Science & Technology, Yangzhou University, Yangzhou, Jiangsu 225009, China
| | - Mikhail Puzakov
- A.O. Kovalevsky Institute of Biology of the Southern Seas of RAS, Nakhimov av., 2, Sevastopol 299011, Russia
| | - Bo Gao
- College of Animal Science & Technology, Yangzhou University, Yangzhou, Jiangsu 225009, China
| | - Chengyi Song
- College of Animal Science & Technology, Yangzhou University, Yangzhou, Jiangsu 225009, China.
| |
|
40
|
De Miccolis Angelini RM, Landi L, Raguseo C, Pollastro S, Faretra F, Romanazzi G. Tracking of Diversity and Evolution in the Brown Rot Fungi Monilinia fructicola, Monilinia fructigena, and Monilinia laxa. Front Microbiol 2022; 13:854852. [PMID: 35356516 PMCID: PMC8959702 DOI: 10.3389/fmicb.2022.854852] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/14/2022] [Accepted: 02/15/2022] [Indexed: 11/13/2022] Open
Abstract
Monilinia species are among the most devastating fungi worldwide as they cause brown rot and blossom blight on fruit trees. To understand the molecular bases of their pathogenic lifestyles, we compared the newly assembled genomes of single strains of Monilinia fructicola, M. fructigena and M. laxa, with those of Botrytis cinerea and Sclerotinia sclerotiorum, as the closest species within Sclerotiniaceae. Phylogenomic analysis of orthologous proteins and syntenic investigation suggest that M. laxa is closer to M. fructigena than M. fructicola, and is closest to the other investigated Sclerotiniaceae species. This indicates that M. laxa was the earliest result of the speciation process. Distinct evolutionary profiles were observed for transposable elements (TEs). M. fructicola and M. laxa showed older bursts of TE insertions, which were affected (mainly in M. fructicola) by repeat-induced point (RIP) mutation gene silencing mechanisms. These suggest frequent occurrence of the sexual process in M. fructicola. More recent TE expansion linked with low RIP action was observed in M. fructigena, with very little in S. sclerotiorum and B. cinerea. The detection of active non-syntenic TEs is indicative of horizontal gene transfer and has resulted in alterations in specific gene functions. Analysis of candidate effectors, biosynthetic gene clusters for secondary metabolites and carbohydrate-active enzymes indicated that the genus Monilinia has multiple virulence mechanisms to infect host plants, including toxins, cell-death elicitors, putative virulence factors and cell-wall-degrading enzymes. Some species-specific pathogenic factors might explain differences in terms of host plant and organ preferences between M. fructigena and the other two Monilinia species.
Affiliation(s)
| | - Lucia Landi
- Department of Agricultural, Food and Environmental Sciences, Marche Polytechnic University, Ancona, Italy
| | - Celeste Raguseo
- Department of Soil, Plant and Food Sciences, University of Bari Aldo Moro, Bari, Italy
| | - Stefania Pollastro
- Department of Soil, Plant and Food Sciences, University of Bari Aldo Moro, Bari, Italy
| | - Francesco Faretra
- Department of Soil, Plant and Food Sciences, University of Bari Aldo Moro, Bari, Italy
| | - Gianfranco Romanazzi
- Department of Agricultural, Food and Environmental Sciences, Marche Polytechnic University, Ancona, Italy
| |
|
41
|
Hoyt SJ, Storer JM, Hartley GA, Grady PGS, Gershman A, de Lima LG, Limouse C, Halabian R, Wojenski L, Rodriguez M, Altemose N, Rhie A, Core LJ, Gerton JL, Makalowski W, Olson D, Rosen J, Smit AFA, Straight AF, Vollger MR, Wheeler TJ, Schatz MC, Eichler EE, Phillippy AM, Timp W, Miga KH, O’Neill RJ. From telomere to telomere: The transcriptional and epigenetic state of human repeat elements. Science 2022; 376:eabk3112. [PMID: 35357925 PMCID: PMC9301658 DOI: 10.1126/science.abk3112] [Citation(s) in RCA: 121] [Impact Index Per Article: 60.5] [Indexed: 12/12/2022]
Abstract
Mobile elements and repetitive genomic regions are sources of lineage-specific genomic innovation and uniquely fingerprint individual genomes. Comprehensive analyses of such repeat elements, including those found in more complex regions of the genome, require a complete, linear genome assembly. We present a de novo repeat discovery and annotation of the T2T-CHM13 human reference genome. We identified previously unknown satellite arrays, expanded the catalog of variants and families for repeats and mobile elements, characterized classes of complex composite repeats, and located retroelement transduction events. We detected nascent transcription and delineated CpG methylation profiles to define the structure of transcriptionally active retroelements in humans, including those in centromeres. These data expand our insight into the diversity, distribution, and evolution of repetitive regions that have shaped the human genome.
Affiliation(s)
- Savannah J. Hoyt
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, CT, USA
| | | | - Gabrielle A. Hartley
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, CT, USA
| | - Patrick G. S. Grady
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, CT, USA
| | - Ariel Gershman
- Department of Molecular Biology and Genetics, Johns Hopkins University, Baltimore, MD, USA
| | | | - Charles Limouse
- Department of Biochemistry, Stanford University, Stanford, CA, USA
| | - Reza Halabian
- Institute of Bioinformatics, Faculty of Medicine, University of Münster, Münster, Germany
| | - Luke Wojenski
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, CT, USA
| | - Matias Rodriguez
- Institute of Bioinformatics, Faculty of Medicine, University of Münster, Münster, Germany
| | - Nicolas Altemose
- Department of Bioengineering, University of California, Berkeley, Berkeley, CA, USA
| | - Arang Rhie
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Leighton J. Core
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, CT, USA
- Institute for Systems Genomics, University of Connecticut, Storrs, CT, USA
| | | | - Wojciech Makalowski
- Institute of Bioinformatics, Faculty of Medicine, University of Münster, Münster, Germany
| | - Daniel Olson
- Department of Computer Science, University of Montana, Missoula, MT, USA
| | - Jeb Rosen
- Institute for Systems Biology, Seattle, WA, USA
| | | | | | - Mitchell R. Vollger
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Travis J. Wheeler
- Department of Computer Science, University of Montana, Missoula, MT, USA
| | - Michael C. Schatz
- Department of Computer Science and Department of Biology, Johns Hopkins University, Baltimore, MD, USA
| | - Evan E. Eichler
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
- Howard Hughes Medical Institute, University of Washington, Seattle, WA, USA
| | - Adam M. Phillippy
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
| | - Winston Timp
- Department of Molecular Biology and Genetics, Johns Hopkins University, Baltimore, MD, USA
- Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD, USA
| | - Karen H. Miga
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, Santa Cruz, CA, USA
| | - Rachel J. O’Neill
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, CT, USA
- Institute for Systems Genomics, University of Connecticut, Storrs, CT, USA
- Department of Genetics and Genome Sciences, UConn Health, Farmington, CT, USA
| |
|
42
|
Miniature Inverted-Repeat Transposable Elements (MITEs) in the Two Lepidopteran Genomes of Helicoverpa armigera and Helicoverpa zea. Insects 2022; 13:insects13040313. [PMID: 35447755 PMCID: PMC9033116 DOI: 10.3390/insects13040313] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/21/2022] [Revised: 03/10/2022] [Accepted: 03/20/2022] [Indexed: 02/04/2023]
Abstract
Simple Summary: Miniature inverted-repeat transposable elements (MITEs) are non-autonomous transposable elements that play important roles in genome organization and evolution. Helicoverpa armigera and Helicoverpa zea show a high number of reported cases of insecticide resistance worldwide, having evolved resistance against pyrethroids, organophosphates, carbamates, organochlorines, and recently to the macrocyclic lactone spinosad and several Bacillus thuringiensis toxins. In the present study, we conducted a genome screening of MITEs in the H. armigera and H. zea genomes using bioinformatics approaches, and the results revealed a total of 3570 and 7405 MITE sequences in the H. armigera and H. zea genomes, respectively. Among these MITEs, we highlighted eleven MITE insertions in the H. armigera defensome genes and only one MITE insertion in those of H. zea.
Abstract: Miniature inverted-repeat transposable elements (MITEs) are ubiquitous, non-autonomous class II transposable elements. The moths Helicoverpa armigera and Helicoverpa zea are recognized as the two most serious pest species within the genus. Moreover, these pests have the ability to develop insecticide resistance. In the present study, we conducted a genome-wide analysis of MITEs present in H. armigera and H. zea genomes using the bioinformatics tool MITE tracker. Overall, 3570 and 7405 MITE sequences were identified in H. armigera and H. zea genomes, respectively. Comparative analysis of identified MITE sequences in the two genomes led to the identification of 18 families, comprising 140 MITE members in H. armigera and 161 MITE members in H. zea. Based on target site duplication (TSD) sequences, the identified families were classified into three superfamilies (PIF/harbinger, Tc1/mariner and CACTA). Copy numbers varied from 6 to 469 for each MITE family. Finally, the analysis of MITE insertion sites in defensome genes showed intronic insertions of 11 MITEs in the cytochrome P450, ATP-binding cassette transporter (ABC) and esterase genes in H. armigera, whereas for H. zea only one MITE was retrieved in the ABC-C2 gene. These insertions could thus be involved in the insecticide resistance observed in these pests.
|
43
|
Niu Y, Teng X, Zhou H, Shi Y, Li Y, Tang Y, Zhang P, Luo H, Kang Q, Xu T, He S. Characterizing mobile element insertions in 5675 genomes. Nucleic Acids Res 2022; 50:2493-2508. [PMID: 35212372 PMCID: PMC8934628 DOI: 10.1093/nar/gkac128] [Citation(s) in RCA: 16] [Impact Index Per Article: 8.0] [Received: 07/26/2021] [Revised: 02/07/2022] [Accepted: 02/11/2022] [Indexed: 12/30/2022] Open
Abstract
Mobile element insertions (MEIs) are a major class of structural variants (SVs) and have been linked to many human genetic disorders, including hemophilia, neurofibromatosis, and various cancers. However, human MEI resources from large-scale genome sequencing are still lacking compared to those for SNPs and SVs. Here, we report a comprehensive map of 36 699 non-reference MEIs constructed from 5675 genomes, comprising 2998 Chinese samples (∼26.2×, NyuWa) and 2677 samples from the 1000 Genomes Project (∼7.4×, 1KGP). We discovered that LINE-1 insertions were highly enriched in centromere regions, implying the role of chromosome context in retroelement insertion. After functional annotation, we estimated that MEIs are responsible for about 9.3% of all protein-truncating events per genome. Finally, we built a companion database named HMEID for public use. This resource represents the latest and largest genomewide study on MEIs and will have broad utility for exploration of human MEI findings.
Affiliation(s)
- Yiwei Niu
- Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
| | - Xueyi Teng
- Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Honghong Zhou
- Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China
| | - Yirong Shi
- Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Yanyan Li
- Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
| | - Yiheng Tang
- Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Peng Zhang
- Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China
| | - Huaxia Luo
- Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China
| | - Quan Kang
- Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China
| | - Tao Xu
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
- National Laboratory of Biomacromolecules, CAS Center for Excellence in Biomacromolecules, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China
| | - Shunmin He
- Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China
- College of Life Sciences, University of Chinese Academy of Sciences, Beijing 100049, China
|
44
|
Li Y, Jiang N, Sun Y. AnnoSINE: a short interspersed nuclear elements annotation tool for plant genomes. PLANT PHYSIOLOGY 2022; 188:955-970. [PMID: 34792587 PMCID: PMC8825457 DOI: 10.1093/plphys/kiab524] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Received: 07/16/2021] [Accepted: 10/01/2021] [Indexed: 06/13/2023]
Abstract
Short interspersed nuclear elements (SINEs) are a widespread type of small transposable element (TE). With increasing evidence for their impact on gene function and genome evolution in plants, accurate genome-scale SINE annotation becomes a fundamental step for studying the regulatory roles of SINEs and their relationship with other components in the genomes. Despite the overall promising progress made in TE annotation, SINE annotation remains a major challenge. Unlike some other TEs, SINEs are short and heterogeneous, and they usually lack well-conserved sequence or structural features. Thus, current SINE annotation tools have either low sensitivity or high false discovery rates. Given the demand and challenges, we aimed to provide a more accurate and efficient SINE annotation tool for plant genomes. The pipeline starts with maximizing the pool of SINE candidates via profile hidden Markov model-based homology search and de novo SINE search using structural features. Then, it excludes the false positives by integrating all known features of SINEs and the features of other types of TEs that can often be misannotated as SINEs. As a result, the pipeline substantially improves the tradeoff between sensitivity and accuracy, with both values close to or over 90%. We tested our tool in Arabidopsis thaliana and rice (Oryza sativa), and the results show that our tool competes favorably against existing SINE annotation tools. The simplicity and effectiveness of this tool would potentially be useful for generating more accurate SINE annotations for other plant species. The pipeline is freely available at https://github.com/yangli557/AnnoSINE.
Affiliation(s)
- Yang Li
- Department of Electrical Engineering, City University of Hong Kong, Kowloon, Hong Kong SAR, China
| | - Ning Jiang
- Department of Horticulture, Michigan State University, East Lansing, Michigan 48824, USA
| | - Yanni Sun
- Department of Electrical Engineering, City University of Hong Kong, Kowloon, Hong Kong SAR, China
|
45
|
Gazolla CB, Ludwig A, de Moura Gama J, Bruschi DP. Evolutionary dynamics of DIRS-like and Ngaro-like retrotransposons in Xenopus laevis and Xenopus tropicalis genomes. G3 GENES|GENOMES|GENETICS 2022; 12:6430978. [PMID: 34792579 PMCID: PMC9210276 DOI: 10.1093/g3journal/jkab391] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/04/2021] [Accepted: 11/01/2021] [Indexed: 12/02/2022]
Abstract
Anuran genomes have a large number and diversity of transposable elements, but are little explored, mainly in relation to their molecular structure and evolutionary dynamics. Here, we investigated the retrotransposons containing tyrosine recombinase (YR) (order DIRS) in the genome of Xenopus tropicalis and Xenopus laevis. These anurans show 2n = 20 and the 2n = 36 karyotypes, respectively. They diverged about 48 million years ago (mya) and X. laevis had an allotetraploid origin (around 17–18 mya). Our investigation is based on the analysis of the molecular structure and the phylogenetic relationships of 95 DIRS families of Xenopus belonging to DIRS-like and Ngaro-like superfamilies. We were able to identify molecular signatures in the 5' and 3' noncoding terminal regions, preserved open reading frames, and conserved domains that are specific to distinguish each superfamily. We recognize two ancient amplification waves of DIRS-like elements that occurred in the ancestor of both species and a higher density of the old/degenerate copies detected in both subgenomes of X. laevis. More recent amplification waves are seen in X. tropicalis (less than 3.2 mya) and X. laevis (around 10 mya) corroborating with transcriptional activity evidence. All DIRS-like families were found in both X. laevis subgenomes, while a few were most represented in the L subgenome. Ngaro-like elements presented less diversity and quantity in X. tropicalis and X. laevis genomes, although potentially active copies were found in both species and this is consistent with a recent amplification wave seen in the evolutionary landscape. Our findings highlight a differential diversity-level and evolutionary dynamics of the YR retrotransposons in X. tropicalis and X. laevis species expanding our comprehension of the behavior of these elements in both genomes during the diversification process.
Affiliation(s)
- Camilla Borges Gazolla
- Departamento de Genética, Laboratório de Citogenética Evolutiva e Conservação Animal (LabCECA), Universidade Federal do Paraná, Curitiba, PR 80060-000, Brazil
- Departamento de Genética, Programa de Pós-Graduação em Genética (PPG-GEN), Universidade Federal do Paraná (UFPR), Curitiba, PR 80060-000, Brazil
| | - Adriana Ludwig
- Laboratório de Ciências e Tecnologias Aplicadas em Saúde (LaCTAS), Instituto Carlos Chagas—Fiocruz-PR, Curitiba, PR 81350-010, Brazil
| | - Joana de Moura Gama
- Departamento de Genética, Programa de Pós-Graduação em Genética (PPG-GEN), Universidade Federal do Paraná (UFPR), Curitiba, PR 80060-000, Brazil
| | - Daniel Pacheco Bruschi
- Departamento de Genética, Laboratório de Citogenética Evolutiva e Conservação Animal (LabCECA), Universidade Federal do Paraná, Curitiba, PR 80060-000, Brazil
|
46
|
Potente G, Léveillé-Bourret É, Yousefi N, Choudhury RR, Keller B, Diop SI, Duijsings D, Pirovano W, Lenhard M, Szövényi P, Conti E. Comparative Genomics Elucidates the Origin of a Supergene Controlling Floral Heteromorphism. Mol Biol Evol 2022; 39:msac035. [PMID: 35143659 PMCID: PMC8859637 DOI: 10.1093/molbev/msac035] [Citation(s) in RCA: 17] [Impact Index Per Article: 8.5] [Indexed: 11/25/2022] Open
Abstract
Supergenes are nonrecombining genomic regions ensuring the coinheritance of multiple, coadapted genes. Despite the importance of supergenes in adaptation, little is known on how they originate. A classic example of supergene is the S locus controlling heterostyly, a floral heteromorphism occurring in 28 angiosperm families. In Primula, heterostyly is characterized by the cooccurrence of two complementary, self-incompatible floral morphs and is controlled by five genes clustered in the hemizygous, ca. 300-kb S locus. Here, we present the first chromosome-scale genome assembly of any heterostylous species, that of Primula veris (cowslip). By leveraging the high contiguity of the P. veris assembly and comparative genomic analyses, we demonstrated that the S-locus evolved via multiple, asynchronous gene duplications and independent gene translocations. Furthermore, we discovered a new whole-genome duplication in Ericales that is specific to the Primula lineage. We also propose a mechanism for the origin of S-locus hemizygosity via nonhomologous recombination involving the newly discovered two pairs of CFB genes flanking the S locus. Finally, we detected only weak signatures of degeneration in the S locus, as predicted for hemizygous supergenes. The present study provides a useful resource for future research addressing key questions on the evolution of supergenes in general and the S locus in particular: How do supergenes arise? What is the role of genome architecture in the evolution of complex adaptations? Is the molecular architecture of heterostyly supergenes across angiosperms similar to that of Primula?
Affiliation(s)
- Giacomo Potente
- Department of Systematic and Evolutionary Botany, University of Zurich, Zurich, Switzerland
- BaseClear BV, Leiden, The Netherlands
- Zurich-Basel Plant Science Center, Zurich, Switzerland
| | - Étienne Léveillé-Bourret
- Department of Systematic and Evolutionary Botany, University of Zurich, Zurich, Switzerland
- Département de Sciences Biologiques, Institut de Recherche en Biologie Végétale, Université de Montréal, Montréal, Canada
| | - Narjes Yousefi
- Department of Systematic and Evolutionary Botany, University of Zurich, Zurich, Switzerland
| | - Rimjhim Roy Choudhury
- Department of Systematic and Evolutionary Botany, University of Zurich, Zurich, Switzerland
| | - Barbara Keller
- Department of Systematic and Evolutionary Botany, University of Zurich, Zurich, Switzerland
| | - Seydina Issa Diop
- Department of Systematic and Evolutionary Botany, University of Zurich, Zurich, Switzerland
- BaseClear BV, Leiden, The Netherlands
- Zurich-Basel Plant Science Center, Zurich, Switzerland
| | - Michael Lenhard
- Institute for Biochemistry and Biology, University of Potsdam, Potsdam-Golm, Germany
| | - Péter Szövényi
- Department of Systematic and Evolutionary Botany, University of Zurich, Zurich, Switzerland
- Zurich-Basel Plant Science Center, Zurich, Switzerland
| | - Elena Conti
- Department of Systematic and Evolutionary Botany, University of Zurich, Zurich, Switzerland
- Zurich-Basel Plant Science Center, Zurich, Switzerland
|
47
|
Genome Sequencing of Hericium coralloides by a Combination of PacBio RS II and Next-Generation Sequencing Platforms. Int J Genomics 2022; 2022:4017654. [PMID: 35141329 PMCID: PMC8820905 DOI: 10.1155/2022/4017654] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/27/2021] [Revised: 12/07/2021] [Accepted: 01/09/2022] [Indexed: 11/17/2022] Open
Abstract
The fruiting bodies or mycelia of Hericium coralloides (H. coralloides) contain many physiologically active compounds that are used to treat various diseases, including cardiovascular disorders and cancers. However, the genome of H. coralloides has not been sequenced, which hinders further investigations into aspects such as bioactivity or evolutionary events. The present study is aimed at (i) performing de novo sequencing of the assembled genome; (ii) mapping the reads from PE400 DNA into the assembled genome; (iii) identifying the full length of all the repeated sequences; and (iv) annotating protein-coding genes using GO, eggNOG, and KEGG databases. The assembled genome comprised 55,905,675 bp, including 307 contigs. The mapping rate of reads obtained from PE400 DNA in the assembled genome was 92.46%. We identified 2,525 repeated sequences of 1,423,274 bp length. We predicted ncRNAs of 48,895 bp and 11,736 genes encoding proteins that were annotated in the GO, eggNOG, and KEGG databases. We are the first to sequence the entire H. coralloides genome (NCBI; Assembly: ASM367540v1), which will serve as a reference for studying the evolutionary diversification of edible and medicinal mushrooms and facilitate the application of bioactivity in H. coralloides.
|
48
|
Timmons CM, Shazib SUA, Katz LA. Epigenetic influences of mobile genetic elements on ciliate genome architecture and evolution. J Eukaryot Microbiol 2022; 69:e12891. [PMID: 35100457 DOI: 10.1111/jeu.12891] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Received: 11/22/2021] [Revised: 01/20/2022] [Accepted: 01/22/2022] [Indexed: 11/27/2022]
Abstract
Mobile genetic elements (MGEs) are transient genetic material that can move either within a single organism's genome or between individuals or species. While historically considered 'junk' DNA (i.e. deleterious or at best neutral), more recent studies reveal the adaptive advantages MGEs provide in lineages across the tree of life. Ciliates, a group of single-celled microbial eukaryotes characterized by nuclear dimorphism, exemplify how epigenetic influences from MGEs shape genome architecture and patterns of molecular evolution. Ciliate nuclear dimorphism may have evolved as a response to transposon invasion and ciliates have since co-opted transposons to carry out programmed DNA deletion. Another example of the effect of MGEs is in providing mechanisms for lateral gene transfer from bacteria, which introduces genetic diversity and, in several cases, drives ecological specialization in ciliates. As a third example, the integration of viral DNA, likely through transduction, provides new genetic material and can change the way host cells defend themselves against other viral pathogens. We argue that the acquisition of MGEs through non-Mendelian patterns of inheritance, coupled with their effects on ciliate genome architecture and expression and persistence throughout evolutionary history, exemplify how the transmission of mobile elements should be considered a mechanism of transgenerational epigenetic inheritance.
Affiliation(s)
- Caitlin M Timmons
- Department of Biological Sciences, Smith College, Northampton, Massachusetts, 01063, USA
| | - Shahed U A Shazib
- Department of Biological Sciences, Smith College, Northampton, Massachusetts, 01063, USA
| | - Laura A Katz
- Department of Biological Sciences, Smith College, Northampton, Massachusetts, 01063, USA
|
49
|
Deneweth J, Van de Peer Y, Vermeirssen V. Nearby transposable elements impact plant stress gene regulatory networks: a meta-analysis in A. thaliana and S. lycopersicum. BMC Genomics 2022; 23:18. [PMID: 34983397 PMCID: PMC8725346 DOI: 10.1186/s12864-021-08215-8] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Received: 06/15/2021] [Accepted: 11/09/2021] [Indexed: 12/12/2022] Open
Abstract
BACKGROUND Transposable elements (TE) make up a large portion of many plant genomes and are playing innovative roles in genome evolution. Several TEs can contribute to gene regulation by influencing expression of nearby genes as stress-responsive regulatory motifs. To delineate TE-mediated plant stress regulatory networks, we took a 2-step computational approach consisting of identifying TEs in the proximity of stress-responsive genes, followed by searching for cis-regulatory motifs in these TE sequences and linking them to known regulatory factors. Through a systematic meta-analysis of RNA-seq expression profiles and genome annotations, we investigated the relation between the presence of TE superfamilies upstream, downstream or within introns of nearby genes and the differential expression of these genes in various stress conditions in the TE-poor Arabidopsis thaliana and the TE-rich Solanum lycopersicum.
RESULTS We found that stress conditions frequently expressed genes having members of various TE superfamilies in their genomic proximity, such as SINE upon proteotoxic stress and Copia and Gypsy upon heat stress in A. thaliana, and EPRV and hAT upon infection, and Harbinger, LINE and Retrotransposon upon light stress in S. lycopersicum. These stress-specific gene-proximal TEs were mostly located within introns and more detected near upregulated than downregulated genes. Similar stress conditions were often related to the same TE superfamily. Additionally, we detected both novel and known motifs in the sequences of those TEs pointing to regulatory cooption of these TEs upon stress. Next, we constructed the regulatory network of TFs that act through binding these TEs to their target genes upon stress and discovered TE-mediated regulons targeted by TFs such as BRB/BPC, HD, HSF, GATA, NAC, DREB/CBF and MYB factors in Arabidopsis and AP2/ERF/B3, NAC, NF-Y, MYB, CXC and HD factors in tomato.
CONCLUSIONS Overall, we map TE-mediated plant stress regulatory networks using numerous stress expression profile studies for two contrasting plant species to study the regulatory role TEs play in the response to stress. As TE-mediated gene regulation allows plants to adapt more rapidly to new environmental conditions, this study contributes to the future development of climate-resilient plants.
Affiliation(s)
- Jan Deneweth
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Ghent, Belgium
| | - Yves Van de Peer
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Ghent, Belgium
- VIB Center for Plant Systems Biology, Ghent, Belgium
- Department of Biochemistry, Genetics and Microbiology, University of Pretoria, Pretoria, South Africa
| | - Vanessa Vermeirssen
- Department of Biomedical Molecular Biology, Ghent University, Ghent, Belgium
- Department of Biomolecular Medicine, Ghent University, Ghent, Belgium
- Lab for Computational Biology, Integromics and Gene Regulation (CBIGR), Cancer Research Institute Ghent (CRIG), Ghent, Belgium
|
50
|
Dayama G, Bulekova K, Lau NC. Extending and Running the Mosquito Small RNA Genomics Resource Pipeline. Methods Mol Biol 2022; 2509:341-352. [PMID: 35796973 PMCID: PMC10100135 DOI: 10.1007/978-1-0716-2380-0_20] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/26/2022]
Abstract
The Mosquito Small RNA Genomics (MSRG) resource is a repository of analyses on the small RNA transcriptomes of mosquito cell cultures and somatic and gonadal tissues. This resource allows for comparing the regulation dynamics of small RNAs generated from transposons and viruses across mosquito species. This chapter covers the procedures to set up the MSRG resource pipeline as a new installation by detailing the necessary collection of genome reference and annotation files and lists of microRNAs (miRNAs) hairpin sequences, transposon repeats consensus sequences, and virus genome sequences. Proper execution of the MSRG resource pipeline yields outputs amenable to biologists to further analyze with desktop and spreadsheet software to gain insights into the balance between arthropod endogenous small RNA populations and the proportions of virus-derived small RNAs that include Piwi-interacting RNAs (piRNAs) and endogenous small interfering RNAs (siRNAs).
Affiliation(s)
- Gargi Dayama
- Boston University School of Medicine, Department of Biochemistry, Boston University Bioinformatics Program, Boston, MA, USA
| | - Katia Bulekova
- Boston University Research Computing Services, Information Services and Technology, Boston, MA, USA
| | - Nelson C Lau
- Boston University School of Medicine, Department of Biochemistry, Boston University Bioinformatics Program, Boston, MA, USA
|