1
Biryukov M, Ustyantsev K. Origin and Evolution of Plant Long Terminal Repeat Retrotransposons with Additional Ribonuclease H. Genome Biol Evol 2023; 15:evad161. [PMID: 37697050 PMCID: PMC10508981 DOI: 10.1093/gbe/evad161] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/01/2023] [Revised: 08/08/2023] [Accepted: 09/01/2023] [Indexed: 09/13/2023] [Open Access]
Abstract
Retroviruses originated from long terminal repeat retrotransposons (LTR-RTs) through several structural adaptations. One such modification was the arrangement of an additional ribonuclease H (aRH) domain next to native RH, followed by degradation and subfunctionalization of the latter. We previously showed that this retrovirus-like structure independently evolved in Tat LTR-RTs in flowering plants, proposing its origin from sequential rearrangements of ancestral Tat structures identified in lycophytes and conifers. However, most nonflowering plant genome assemblies were not available at that time, therefore masking the history of aRH acquisition by Tat and challenging our hypothesis. Here, we revisited Tat's evolution scenario upon the aRH acquisition by covering most of the extant plant phyla. We show that Tat evolved and obtained aRH in an ancestor of land plants. Importantly, we found the retrovirus-like structure in clubmosses, hornworts, ferns, and gymnosperms, suggesting its ancient origin, broad propagation, and yet-to-be-understood benefit for the LTR-RTs' adaptation.
Affiliation(s)
- Mikhail Biryukov
  - Sector of Molecular and Genetic Mechanisms of Regeneration, Institute of Cytology and Genetics SB RAS, Novosibirsk, Russia
  - Novosibirsk State University, Novosibirsk, Russia
- Kirill Ustyantsev
  - Sector of Molecular and Genetic Mechanisms of Regeneration, Institute of Cytology and Genetics SB RAS, Novosibirsk, Russia
2
DARTS: An Algorithm for Domain-Associated Retrotransposon Search in Genome Assemblies. Genes (Basel) 2021; 13:genes13010009. [PMID: 35052350 PMCID: PMC8775202 DOI: 10.3390/genes13010009] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Received: 11/30/2021] [Revised: 12/16/2021] [Accepted: 12/17/2021] [Indexed: 01/08/2023] [Open Access]
Abstract
Retrotransposons comprise a substantial fraction of eukaryotic genomes, reaching the highest proportions in plants. Therefore, identification and annotation of retrotransposons is an important task in studying the regulation and evolution of plant genomes. The majority of computational tools for mining transposable elements (TEs) are designed for subsequent genome repeat masking, often leaving aside the element lineage classification and its protein domain composition. Additionally, studies focused on the diversity and evolution of a particular group of retrotransposons often require substantial customization efforts from researchers to adapt existing software to their needs. Here, we developed DARTS (Domain-Associated Retrotransposon Search), a computational pipeline to mine sequences of protein-coding retrotransposons based on the sequences of their conserved protein domains. Using the most abundant group of TEs in plants, long terminal repeat (LTR) retrotransposons (LTR-RTs), we show that DARTS has radically higher sensitivity for LTR-RT identification compared to the widely accepted tool LTRharvest. DARTS can be easily customized for specific user needs. As a result, DARTS returns a set of structurally annotated nucleotide and amino acid sequences which can be readily used in subsequent comparative and phylogenetic analyses. DARTS may assist researchers interested in the discovery and detailed analysis of the diversity and evolution of retrotransposons, LTR-RTs, and other protein-coding TEs.
3
Nelson DR, Hazzouri KM, Lauersen KJ, Jaiswal A, Chaiboonchoe A, Mystikou A, Fu W, Daakour S, Dohai B, Alzahmi A, Nobles D, Hurd M, Sexton J, Preston MJ, Blanchette J, Lomas MW, Amiri KMA, Salehi-Ashtiani K. Large-scale genome sequencing reveals the driving forces of viruses in microalgal evolution. Cell Host Microbe 2021; 29:250-266.e8. [PMID: 33434515 DOI: 10.1016/j.chom.2020.12.005] [Citation(s) in RCA: 29] [Impact Index Per Article: 9.7] [Received: 06/22/2020] [Revised: 10/08/2020] [Accepted: 11/18/2020] [Indexed: 01/08/2023]
Abstract
Being integral primary producers in diverse ecosystems, microalgal genomes could be mined for ecological insights, but representative genome sequences are lacking for many phyla. We cultured and sequenced 107 microalgae species from 11 different phyla indigenous to varied geographies and climates. This collection was used to resolve genomic differences between saltwater and freshwater microalgae. Freshwater species showed domain-centric ontology enrichment for nuclear and nuclear membrane functions, while saltwater species were enriched in organellar and cellular membrane functions. Further, marine species contained significantly more viral families in their genomes (p = 8e-4). Sequences from Chlorovirus, Coccolithovirus, Pandoravirus, Marseillevirus, Tupanvirus, and other viruses were found integrated into the genomes of algae from marine environments. These viral-origin sequences were found to be expressed and code for a wide variety of functions. Together, this study comprehensively defines the expanse of protein-coding and viral elements in microalgal genomes and posits a unified adaptive strategy for algal halotolerance.
Affiliation(s)
- David R Nelson
  - Center for Genomics and Systems Biology, New York University Abu Dhabi, Abu Dhabi, UAE
- Khaled M Hazzouri
  - Khalifa Center for Genetic Engineering and Biotechnology (KCGEB), UAE University, Al Ain, Abu Dhabi, UAE
  - Biology Department, College of Science, UAE University, Al Ain, Abu Dhabi, UAE
- Kyle J Lauersen
  - Biological and Environmental Sciences and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal, Kingdom of Saudi Arabia
- Ashish Jaiswal
  - Division of Science and Math, New York University Abu Dhabi, Abu Dhabi, UAE
- Alexandra Mystikou
  - Center for Genomics and Systems Biology, New York University Abu Dhabi, Abu Dhabi, UAE
- Weiqi Fu
  - Division of Science and Math, New York University Abu Dhabi, Abu Dhabi, UAE
- Sarah Daakour
  - Center for Genomics and Systems Biology, New York University Abu Dhabi, Abu Dhabi, UAE
- Bushra Dohai
  - Division of Science and Math, New York University Abu Dhabi, Abu Dhabi, UAE
- Amnah Alzahmi
  - Center for Genomics and Systems Biology, New York University Abu Dhabi, Abu Dhabi, UAE
- David Nobles
  - UTEX Culture Collection of Algae at the University of Texas at Austin, Austin, TX, USA
- Mark Hurd
  - National Center for Marine Algae and Microbiota, East Boothbay, ME, USA
- Julie Sexton
  - National Center for Marine Algae and Microbiota, East Boothbay, ME, USA
- Michael J Preston
  - National Center for Marine Algae and Microbiota, East Boothbay, ME, USA
- Joan Blanchette
  - National Center for Marine Algae and Microbiota, East Boothbay, ME, USA
- Michael W Lomas
  - National Center for Marine Algae and Microbiota, East Boothbay, ME, USA
- Khaled M A Amiri
  - Khalifa Center for Genetic Engineering and Biotechnology (KCGEB), UAE University, Al Ain, Abu Dhabi, UAE
  - Biology Department, College of Science, UAE University, Al Ain, Abu Dhabi, UAE
- Kourosh Salehi-Ashtiani
  - Center for Genomics and Systems Biology, New York University Abu Dhabi, Abu Dhabi, UAE
  - Division of Science and Math, New York University Abu Dhabi, Abu Dhabi, UAE
4
Li SF, Li JR, Wang J, Dong R, Jia KL, Zhu HW, Li N, Yuan JH, Deng CL, Gao WJ. Cytogenetic and genomic organization analyses of chloroplast DNA invasions in the nuclear genome of Asparagus officinalis L. provides signatures of evolutionary complexity and informativity in sex chromosome evolution. BMC Plant Biol 2019; 19:361. [PMID: 31419941 PMCID: PMC6698032 DOI: 10.1186/s12870-019-1975-8] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Received: 04/04/2019] [Accepted: 08/13/2019] [Indexed: 05/03/2023]
Abstract
BACKGROUND: The transfer of chloroplast DNA into the nuclear genome is a common process in plants. These transfers form nuclear integrants of plastid DNAs (NUPTs), which are thought to be driving forces in genome evolution, including sex chromosome evolution. In this study, NUPTs in the genome of a dioecious plant, Asparagus officinalis L., were systematically analyzed in order to investigate the characteristics of NUPTs in the nuclear genome and the relationship between NUPTs and sex chromosome evolution in this species.
RESULTS: A total of 3155 NUPT insertions were detected, and they represented approximately 0.06% of the nuclear genome. About 45% of the NUPTs were organized in clusters. These clusters were derived from various evolutionary events. The Y chromosome contained the highest number and largest proportion of NUPTs, suggesting more accumulation of NUPTs on sex chromosomes. NUPTs were distributed widely in all of the chromosomes, and some regions preferred these insertions. The highest density of NUPTs was found in a 47 kb region in the Y chromosome; more than 75% of this region was occupied by NUPTs. Further cytogenetic and sequence alignment analysis revealed that this region was likely the centromeric region of the sex chromosomes. On the other hand, the male-specific region of the Y chromosome (MSY) and the adjacent regions did not have NUPT insertions.
CONCLUSIONS: These results indicated that NUPTs were involved in shaping the genome of A. officinalis through a complicated process. NUPTs may play important roles in the centromere shaping of the sex chromosomes of A. officinalis, but were not implicated in MSY formation.
Affiliation(s)
- Shu-Fen Li
  - College of Life Sciences, Henan Normal University, Xinxiang, 453007 China
- Jia-Rong Li
  - College of Life Sciences, Henan Normal University, Xinxiang, 453007 China
- Jin Wang
  - College of Life Sciences, Henan Normal University, Xinxiang, 453007 China
- Ran Dong
  - College of Life Sciences, Henan Normal University, Xinxiang, 453007 China
- Ke-Li Jia
  - College of Life Sciences, Henan Normal University, Xinxiang, 453007 China
  - SanQuan Medical College, Xinxiang Medical University, Xinxiang, 453003 China
- Hong-Wei Zhu
  - College of Life Sciences, Henan Normal University, Xinxiang, 453007 China
- Ning Li
  - College of Life Sciences, Henan Normal University, Xinxiang, 453007 China
- Jin-Hong Yuan
  - College of Life Sciences, Henan Normal University, Xinxiang, 453007 China
- Chuan-Liang Deng
  - College of Life Sciences, Henan Normal University, Xinxiang, 453007 China
- Wu-Jun Gao
  - College of Life Sciences, Henan Normal University, Xinxiang, 453007 China
5
Orozco-Arias S, Isaza G, Guyot R. Retrotransposons in Plant Genomes: Structure, Identification, and Classification through Bioinformatics and Machine Learning. Int J Mol Sci 2019; 20:E3837. [PMID: 31390781 PMCID: PMC6696364 DOI: 10.3390/ijms20153837] [Citation(s) in RCA: 34] [Impact Index Per Article: 6.8] [Received: 06/21/2019] [Revised: 07/31/2019] [Accepted: 08/02/2019] [Indexed: 01/26/2023] [Open Access]
Abstract
Transposable elements (TEs) are genomic units able to move within the genome of virtually all organisms. Due to their natural repetitive numbers and their high structural diversity, the identification and classification of TEs remain a challenge in sequenced genomes. Although TEs were initially regarded as "junk DNA", it has been demonstrated that they play key roles in chromosome structures, gene expression, and regulation, as well as adaptation and evolution. A highly reliable annotation of these elements is, therefore, crucial to better understand genome functions and their evolution. To date, much bioinformatics software has been developed to address TE detection and classification processes, but many problematic aspects remain, such as the reliability, precision, and speed of the analyses. Machine learning and deep learning are algorithms that can make automatic predictions and decisions in a wide variety of scientific applications. They have been tested in bioinformatics and, more specifically, for TE classification, with encouraging results. In this review, we will discuss important aspects of TEs, such as their structure, importance in the evolution and architecture of the host, and their current classifications and nomenclatures. We will also address current methods and their limitations in identifying and classifying TEs.
Affiliation(s)
- Simon Orozco-Arias
  - Department of Computer Science, Universidad Autónoma de Manizales, Manizales 170001, Colombia
  - Department of Systems and Informatics, Universidad de Caldas, Manizales 170001, Colombia
- Gustavo Isaza
  - Department of Systems and Informatics, Universidad de Caldas, Manizales 170001, Colombia
- Romain Guyot
  - Department of Electronics and Automatization, Universidad Autónoma de Manizales, Manizales 170001, Colombia
  - Institut de Recherche pour le Développement, CIRAD, University Montpellier, 34000 Montpellier, France
6
Morozov SY, Lezzhov AA, Lazareva EA, Erokhina TN, Solovyev AG. Potential Role of Accessory Domains in Polyproteins Encoded by Retrotransposons in Anti-viral Defense of Host Cells. Front Microbiol 2019; 9:3193. [PMID: 30687243 PMCID: PMC6338049 DOI: 10.3389/fmicb.2018.03193] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/26/2018] [Accepted: 12/10/2018] [Indexed: 11/30/2022] [Open Access]
Affiliation(s)
- Sergey Y Morozov
  - A. N. Belozersky Institute of Physico-Chemical Biology, Moscow State University, Moscow, Russia
  - Department of Virology, Biological Faculty, Moscow State University, Moscow, Russia
- Alexander A Lezzhov
  - Faculty of Bioengineering and Bioinformatics, Moscow State University, Moscow, Russia
- Ekaterina A Lazareva
  - Department of Virology, Biological Faculty, Moscow State University, Moscow, Russia
- Tatiana N Erokhina
  - Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Science, Moscow, Russia
- Andrey G Solovyev
  - A. N. Belozersky Institute of Physico-Chemical Biology, Moscow State University, Moscow, Russia
  - Department of Virology, Biological Faculty, Moscow State University, Moscow, Russia
  - Institute of Molecular Medicine, Sechenov First Moscow State Medical University, Moscow, Russia
7
Martín-Peciña M, Ruiz-Ruano FJ, Camacho JPM, Dodsworth S. Phylogenetic signal of genomic repeat abundances can be distorted by random homoplasy: a case study from hominid primates. Zool J Linn Soc 2018. [DOI: 10.1093/zoolinnean/zly077] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Indexed: 11/14/2022]
Affiliation(s)
- María Martín-Peciña
  - Departamento de Genética, Facultad de Ciencias, Universidad de Granada, Granada, Spain
- Juan Pedro M Camacho
  - Departamento de Genética, Facultad de Ciencias, Universidad de Granada, Granada, Spain
- Steven Dodsworth
  - School of Biological and Chemical Sciences, Queen Mary University of London, London, UK
  - School of Life Sciences, University of Bedfordshire, University Square, Luton, UK
8
Recurrent acquisition of cytosine methyltransferases into eukaryotic retrotransposons. Nat Commun 2018; 9:1341. [PMID: 29632298 PMCID: PMC5890265 DOI: 10.1038/s41467-018-03724-9] [Citation(s) in RCA: 28] [Impact Index Per Article: 4.7] [Received: 11/26/2017] [Accepted: 03/07/2018] [Indexed: 01/27/2023] [Open Access]
Abstract
Transposable elements are in a constant arms race with the silencing mechanisms of their host genomes. One silencing mechanism commonly used by many eukaryotes is dependent on cytosine methylation, a covalent modification of DNA deposited by C5 cytosine methyltransferases (DNMTs). Here, we report how two distantly related eukaryotic lineages, dinoflagellates and charophytes, have independently incorporated DNMTs into the coding regions of distinct retrotransposon classes. Concomitantly, we show that dinoflagellates of the genus Symbiodinium have evolved cytosine methylation patterns unlike any other eukaryote, with most of the genome methylated at CG dinucleotides. Finally, we demonstrate the ability of retrotransposon DNMTs to methylate CGs de novo, suggesting that retrotransposons could self-methylate retrotranscribed DNA. Together, this is an example of how retrotransposons incorporate host-derived genes involved in DNA methylation. In some cases, this event could have implications for the composition and regulation of the host epigenomic environment.
9
Arkhipova IR. Using bioinformatic and phylogenetic approaches to classify transposable elements and understand their complex evolutionary histories. Mob DNA 2017; 8:19. [PMID: 29225705 PMCID: PMC5718144 DOI: 10.1186/s13100-017-0103-2] [Citation(s) in RCA: 60] [Impact Index Per Article: 8.6] [Received: 10/12/2017] [Accepted: 11/28/2017] [Indexed: 12/11/2022] [Open Access]
Abstract
In recent years, much attention has been paid to comparative genomic studies of transposable elements (TEs) and the ensuing problems of their identification, classification, and annotation. Different approaches and diverse automated pipelines are being used to catalogue and categorize mobile genetic elements in the ever-increasing number of prokaryotic and eukaryotic genomes, with little or no connectivity between different domains of life. Here, an overview of the current picture of TE classification and evolutionary relationships is presented, updating the diversity of TE types uncovered in sequenced genomes. A tripartite TE classification scheme is proposed to account for their replicative, integrative, and structural components, and the need to expand in vitro and in vivo studies of their structural and biological properties is emphasized. Bioinformatic studies have now become front and center of novel TE discovery, and experimental pursuits of these discoveries hold great promise for both basic and applied science.
Affiliation(s)
- Irina R Arkhipova
  - Josephine Bay Paul Center for Comparative Molecular Biology and Evolution, Marine Biological Laboratory, Woods Hole, MA 02543 USA