1
|
Orozco-Arias S, Dupeyron M, Gutiérrez-Duque D, Tabares-Soto R, Guyot R. High nucleotide similarity of three Copia lineage LTR retrotransposons among plant genomes. Genome 2023; 66:51-61. [PMID: 36623262 DOI: 10.1139/gen-2022-0026] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2023]
Abstract
Transposable elements (TEs) are mobile elements found in the majority of eukaryotic genomes. TEs deeply impact the structure and evolution of chromosomes and can induce mutations affecting coding genes. In plants, the major group of TEs is long terminal repeat retrotransposons (LTR-RTs). They are classified into superfamilies (Gypsy, Copia) and subclassified into lineages. Horizontal transfer (HT), defined as the nonsexual transmission of genetic material between species, is a process allowing LTR-RTs to invade a new genome. Although this phenomenon was considered rare, recent studies demonstrate numerous transfers of LTR-RTs. This study aims to determine which LTR-RT lineages are shared with high similarity among 69 plant genomes. We identified and classified 88 450 LTR-RTs and determined 143 cases of high similarities between pairs of genomes. Most of them involved three Copia lineages (Oryco/Ivana, Retrofit/Ale, and Tork/Tar/Ikeros). A detailed analysis of three cases of high similarities involving Tork/Tar/Ikeros group shows an uneven distribution in the phylogeny of the elements and incongruence with between phylogenetic trees topologies, indicating they could be originated from HTs. Overall, our results suggest that LTR-RT Copia lineages share outstanding similarity between distant species and may likely be involved in HT mechanisms more frequent than initially estimated.
Collapse
Affiliation(s)
- Simon Orozco-Arias
- Department of Computer Sciences, Universidad Autónoma de Manizales, Colombia.,Department of Systems and Informatics, Universidad de Caldas, Colombia
| | - Mathilde Dupeyron
- Institut de Recherche pour le Développement, IRD, CIRAD, Université de Montpellier, France
| | | | - Reinel Tabares-Soto
- Department of Systems and Informatics, Universidad de Caldas, Colombia.,Department of Electronics and Automatization, Universidad Autónoma de Manizales, Colombia
| | - Romain Guyot
- Institut de Recherche pour le Développement, IRD, CIRAD, Université de Montpellier, France.,Department of Electronics and Automatization, Universidad Autónoma de Manizales, Colombia
| |
Collapse
|
2
|
Kui L, Majeed A, Wang X, Yang Z, Chen J, He L, Di Y, Li X, Qian Z, Jiao Y, Wang G, Liu L, Xu R, Gu S, Yang Q, Chen S, Lou H, Meng Y, Xie L, Xu F, Shen Q, Singh A, Gruber K, Pan Y, Hao T, Dong Y, Li F. A chromosome-level genome assembly for Erianthus fulvus provides insights into its biofuel potential and facilitates breeding for improvement of sugarcane. PLANT COMMUNICATIONS 2023:100562. [PMID: 36814384 PMCID: PMC10363513 DOI: 10.1016/j.xplc.2023.100562] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/14/2022] [Revised: 12/21/2022] [Accepted: 02/16/2023] [Indexed: 06/18/2023]
Abstract
Erianthus produces substantial biomass, exhibits a good Brix value, and shows wide environmental adaptability, making it a potential biofuel plant. In contrast to closely related sorghum and sugarcane, Erianthus can grow in degraded soils, thus releasing pressure on agricultural lands used for biofuel production. However, the lack of genomic resources for Erianthus hinders its genetic improvement, thus limiting its potential for biofuel production. In the present study, we generated a chromosome-scale reference genome for Erianthus fulvus Nees. The genome size estimated by flow cytometry was 937 Mb, and the assembled genome size was 902 Mb, covering 96.26% of the estimated genome size. A total of 35 065 protein-coding genes were predicted, and 67.89% of the genome was found to be repetitive. A recent whole-genome duplication occurred approximately 74.10 million years ago in the E. fulvus genome. Phylogenetic analysis showed that E. fulvus is evolutionarily closer to S. spontaneum and diverged after S. bicolor. Three of the 10 chromosomes of E. fulvus formed through rearrangements of ancestral chromosomes. Phylogenetic reconstruction of the Saccharum complex revealed a polyphyletic origin of the complex and a sister relationship of E. fulvus with Saccharum sp., excluding S. arundinaceum. On the basis of the four amino acid residues that provide substrate specificity, the E. fulvus SWEET proteins were classified as mono- and disaccharide sugar transporters. Ortho-QTL genes identified for 10 biofuel-related traits may aid in the rapid screening of E. fulvus populations to enhance breeding programs for improved biofuel production. The results of this study provide valuable insights for breeding programs aimed at improving biofuel production in E. fulvus and enhancing sugarcane introgression programs.
Collapse
Affiliation(s)
- Ling Kui
- Sugarcane Research Institute of Yunnan Agricultural University, Kunming, Yunnan 650201, China; Shenzhen Qianhai Shekou Free Trade Zone Hospital, Shenzhen 518067, China
| | - Aasim Majeed
- Plant Molecular Genetics Laboratory, School of Agricultural Biotechnology, Punjab Agricultural University, Ludhiana, India
| | - Xianhong Wang
- Sugarcane Research Institute of Yunnan Agricultural University, Kunming, Yunnan 650201, China; College of Agronomy and Biotechnology of Yunnan Agricultural University, Kunming, Yunnan 650201, China; The Key Laboratory of Crop Production and Smart Agriculture of Yunnan Province, Kunming, Yunnan 650201, China
| | - Zijiang Yang
- College of Agronomy and Biotechnology of Yunnan Agricultural University, Kunming, Yunnan 650201, China
| | - Jian Chen
- International Genome Center, Jiangsu University, Zhenjiang, Jiangsu 212013, China
| | - Lilian He
- Sugarcane Research Institute of Yunnan Agricultural University, Kunming, Yunnan 650201, China; College of Agronomy and Biotechnology of Yunnan Agricultural University, Kunming, Yunnan 650201, China
| | - Yining Di
- College of Agronomy and Biotechnology of Yunnan Agricultural University, Kunming, Yunnan 650201, China
| | - Xuzhen Li
- State Key Laboratory for Conservation and Utilization of Bio-Resources in Yunnan, Yunnan Agricultural University, Kunming, Yunnan 650201, China; Yunnan Plateau Characteristic Agriculture Industry Research Institute, Kunming, Yunnan 650201, China
| | - Zhenfeng Qian
- College of Agronomy and Biotechnology of Yunnan Agricultural University, Kunming, Yunnan 650201, China
| | - Yinming Jiao
- Shenzhen Qianhai Shekou Free Trade Zone Hospital, Shenzhen 518067, China
| | - Guoyun Wang
- Shenzhen Qianhai Shekou Free Trade Zone Hospital, Shenzhen 518067, China
| | - Lufeng Liu
- Sugarcane Research Institute of Yunnan Agricultural University, Kunming, Yunnan 650201, China; The Key Laboratory of Crop Production and Smart Agriculture of Yunnan Province, Kunming, Yunnan 650201, China
| | - Rong Xu
- College of Agronomy and Biotechnology of Yunnan Agricultural University, Kunming, Yunnan 650201, China
| | - Shujie Gu
- College of Agronomy and Biotechnology of Yunnan Agricultural University, Kunming, Yunnan 650201, China
| | - Qinghui Yang
- Sugarcane Research Institute of Yunnan Agricultural University, Kunming, Yunnan 650201, China; College of Agronomy and Biotechnology of Yunnan Agricultural University, Kunming, Yunnan 650201, China
| | - Shuying Chen
- Sugarcane Research Institute of Yunnan Agricultural University, Kunming, Yunnan 650201, China; College of Agronomy and Biotechnology of Yunnan Agricultural University, Kunming, Yunnan 650201, China
| | - Hongbo Lou
- Sugarcane Research Institute of Yunnan Agricultural University, Kunming, Yunnan 650201, China; College of Agronomy and Biotechnology of Yunnan Agricultural University, Kunming, Yunnan 650201, China
| | - Yu Meng
- College of Agronomy and Biotechnology of Yunnan Agricultural University, Kunming, Yunnan 650201, China
| | - Linyan Xie
- College of Agronomy and Biotechnology of Yunnan Agricultural University, Kunming, Yunnan 650201, China
| | - Fu Xu
- College of Agronomy and Biotechnology of Yunnan Agricultural University, Kunming, Yunnan 650201, China
| | - Qingqing Shen
- College of Agronomy and Biotechnology of Yunnan Agricultural University, Kunming, Yunnan 650201, China
| | - Amit Singh
- Institute of Molecular Biosciences, University of Graz, 8010 Graz, Austria
| | - Karl Gruber
- Institute of Molecular Biosciences, University of Graz, 8010 Graz, Austria
| | - Yunbing Pan
- State Key Laboratory for Conservation and Utilization of Bio-Resources in Yunnan, Yunnan Agricultural University, Kunming, Yunnan 650201, China; Yunnan Plateau Characteristic Agriculture Industry Research Institute, Kunming, Yunnan 650201, China
| | - Tingting Hao
- State Key Laboratory for Conservation and Utilization of Bio-Resources in Yunnan, Yunnan Agricultural University, Kunming, Yunnan 650201, China; Yunnan Plateau Characteristic Agriculture Industry Research Institute, Kunming, Yunnan 650201, China
| | - Yang Dong
- State Key Laboratory for Conservation and Utilization of Bio-Resources in Yunnan, Yunnan Agricultural University, Kunming, Yunnan 650201, China; Yunnan Plateau Characteristic Agriculture Industry Research Institute, Kunming, Yunnan 650201, China.
| | - Fusheng Li
- Sugarcane Research Institute of Yunnan Agricultural University, Kunming, Yunnan 650201, China; College of Agronomy and Biotechnology of Yunnan Agricultural University, Kunming, Yunnan 650201, China; The Key Laboratory of Crop Production and Smart Agriculture of Yunnan Province, Kunming, Yunnan 650201, China.
| |
Collapse
|
3
|
A Practical Guide on Computational Tools and Databases for Transposable Elements in Plants. Methods Mol Biol 2021. [PMID: 33900590 DOI: 10.1007/978-1-0716-1134-0_3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/01/2023]
Abstract
In the age of big data, obtaining precise information about the research topic of interesting is extremely important. Keeping this in mind, this chapter focuses on providing a practical knowledge guide about computational tools and databases of transposable elements (TE) in plants. For that, we organize and present this text in three sections: (1) a discussion about tools and databases on this theme; (2) hands-on of how to use a few of them; (3) an exploratory data analysis on public TE data. Finally, we are going deep to present the main challenges and possible solutions to improve resources and tools.
Collapse
|
4
|
Xu CQ, Liu H, Zhou SS, Zhang DX, Zhao W, Wang S, Chen F, Sun YQ, Nie S, Jia KH, Jiao SQ, Zhang RG, Yun QZ, Guan W, Wang X, Gao Q, Bennetzen JL, Maghuly F, Porth I, Van de Peer Y, Wang XR, Ma Y, Mao JF. Genome sequence of Malania oleifera, a tree with great value for nervonic acid production. Gigascience 2019; 8:giy164. [PMID: 30689848 PMCID: PMC6377399 DOI: 10.1093/gigascience/giy164] [Citation(s) in RCA: 29] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2018] [Revised: 11/12/2018] [Accepted: 12/17/2018] [Indexed: 01/27/2023] Open
Abstract
BACKGROUND Malania oleifera, a member of the Olacaceae family, is an IUCN red listed tree, endemic and restricted to the Karst region of southwest China. This tree's seed is valued for its high content of precious fatty acids (especially nervonic acid). However, studies on its genetic makeup and fatty acid biogenesis are severely hampered by a lack of molecular and genetic tools. FINDINGS We generated 51 Gb and 135 Gb of raw DNA sequences, using Pacific Biosciences (PacBio) single-molecule real-time and 10× Genomics sequencing, respectively. A final genome assembly, with a scaffold N50 size of 4.65 Mb and a total length of 1.51 Gb, was obtained by primary assembly based on PacBio long reads plus scaffolding with 10× Genomics reads. Identified repeats constituted ∼82% of the genome, and 24,064 protein-coding genes were predicted with high support. The genome has low heterozygosity and shows no evidence for recent whole genome duplication. Metabolic pathway genes relating to the accumulation of long-chain fatty acid were identified and studied in detail. CONCLUSIONS Here, we provide the first genome assembly and gene annotation for M. oleifera. The availability of these resources will be of great importance for conservation biology and for the functional genomics of nervonic acid biosynthesis.
Collapse
Affiliation(s)
- Chao-Qun Xu
- Beijing Advanced Innovation Center for Tree Breeding by Molecular Design, National Engineering Laboratory for Tree Breeding, School of Nature Conservation, College of Biological Sciences and Technology, Beijing Forestry University, Beijing, 100083, China
| | - Hui Liu
- Beijing Advanced Innovation Center for Tree Breeding by Molecular Design, National Engineering Laboratory for Tree Breeding, School of Nature Conservation, College of Biological Sciences and Technology, Beijing Forestry University, Beijing, 100083, China
| | - Shan-Shan Zhou
- Beijing Advanced Innovation Center for Tree Breeding by Molecular Design, National Engineering Laboratory for Tree Breeding, School of Nature Conservation, College of Biological Sciences and Technology, Beijing Forestry University, Beijing, 100083, China
| | - Dong-Xu Zhang
- College of Life Science, Datong University, Datong, Shanxi, 037009, China
| | - Wei Zhao
- Beijing Advanced Innovation Center for Tree Breeding by Molecular Design, National Engineering Laboratory for Tree Breeding, School of Nature Conservation, College of Biological Sciences and Technology, Beijing Forestry University, Beijing, 100083, China
| | - Sihai Wang
- Yunnan Key Laboratory of Forest Plant Cultivation and Utilization, State Forestry Administration Key Laboratory of Yunnan Rare and Endangered Species Conservation and Propagation, Yunnan Academy of Forestry, Kunming, Yunnan, 650201, China
| | - Fu Chen
- The Camellia Institute, Yunnan Academy of Forestry, Guangnan, Yunnan, 663300, China
| | - Yan-Qiang Sun
- Beijing Advanced Innovation Center for Tree Breeding by Molecular Design, National Engineering Laboratory for Tree Breeding, School of Nature Conservation, College of Biological Sciences and Technology, Beijing Forestry University, Beijing, 100083, China
| | - Shuai Nie
- Beijing Advanced Innovation Center for Tree Breeding by Molecular Design, National Engineering Laboratory for Tree Breeding, School of Nature Conservation, College of Biological Sciences and Technology, Beijing Forestry University, Beijing, 100083, China
| | - Kai-Hua Jia
- Beijing Advanced Innovation Center for Tree Breeding by Molecular Design, National Engineering Laboratory for Tree Breeding, School of Nature Conservation, College of Biological Sciences and Technology, Beijing Forestry University, Beijing, 100083, China
| | - Si-Qian Jiao
- Beijing Advanced Innovation Center for Tree Breeding by Molecular Design, National Engineering Laboratory for Tree Breeding, School of Nature Conservation, College of Biological Sciences and Technology, Beijing Forestry University, Beijing, 100083, China
| | - Ren-Gang Zhang
- Beijing Ori-Gene Science and Technology Co. Ltd, Beijing, 102206, China
| | - Quan-Zheng Yun
- Beijing Ori-Gene Science and Technology Co. Ltd, Beijing, 102206, China
| | - Wenbin Guan
- Beijing Advanced Innovation Center for Tree Breeding by Molecular Design, National Engineering Laboratory for Tree Breeding, School of Nature Conservation, College of Biological Sciences and Technology, Beijing Forestry University, Beijing, 100083, China
| | - Xuewen Wang
- The Camellia Institute, Yunnan Academy of Forestry, Guangnan, Yunnan, 663300, China
- Department of Genetics, University of Georgia, Athens, GA 30602, USA
| | - Qiong Gao
- Beijing Advanced Innovation Center for Tree Breeding by Molecular Design, National Engineering Laboratory for Tree Breeding, School of Nature Conservation, College of Biological Sciences and Technology, Beijing Forestry University, Beijing, 100083, China
| | - Jeffrey L Bennetzen
- The Camellia Institute, Yunnan Academy of Forestry, Guangnan, Yunnan, 663300, China
- Department of Genetics, University of Georgia, Athens, GA 30602, USA
| | - Fatemeh Maghuly
- Plant Biotechnology Unit (PBU), Dept. Biotechnology, BOKU-VIBT, University of Natural Resources and Life Sciences, Muthgasse 18, Vienna 1190, Austria
| | - Ilga Porth
- Département des sciences du bois et de la forêt, 1030, Avenue de la Médecine, Université Laval, Québec (Québec) G1V 0A6, Canada
- Institute for System and Integrated Biology, Pavillon Charles-Eugène-Marchand, 1030, Avenue de la Médecine, Université Laval, Québec (Québec) G1V 0A6, Canada
- Centre d'Étude de la Forêt, 1030, Avenue de la Médecine, Université Laval, Québec (Québec) G1V 0A6, Canada
| | - Yves Van de Peer
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Ghent 9052, Belgium
- VIB Center for Plant Systems Biology, Ghent 9052, Belgium
- Centre for Microbial Ecology and Genomics, Department of Biochemistry, Genetics and Microbiology Genetics, University of Pretoria, Private bag X20, Pretoria 0028, South Africa
| | - Xiao-Ru Wang
- Beijing Advanced Innovation Center for Tree Breeding by Molecular Design, National Engineering Laboratory for Tree Breeding, School of Nature Conservation, College of Biological Sciences and Technology, Beijing Forestry University, Beijing, 100083, China
- Department of Ecology and Environmental Science, UPSC, Umeå University, Umeå SE-901 87, Sweden
| | - Yongpeng Ma
- Yunnan Key Laboratory for Integrative Conservation of Plant Species with Extremely Small Population, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, 650201, China
| | - Jian-Feng Mao
- Beijing Advanced Innovation Center for Tree Breeding by Molecular Design, National Engineering Laboratory for Tree Breeding, School of Nature Conservation, College of Biological Sciences and Technology, Beijing Forestry University, Beijing, 100083, China
| |
Collapse
|
5
|
Homologous Capsid Proteins Testify to the Common Ancestry of Retroviruses, Caulimoviruses, Pseudoviruses, and Metaviruses. J Virol 2017; 91:JVI.00210-17. [PMID: 28356531 DOI: 10.1128/jvi.00210-17] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
|
6
|
Tørresen OK, Star B, Jentoft S, Reinar WB, Grove H, Miller JR, Walenz BP, Knight J, Ekholm JM, Peluso P, Edvardsen RB, Tooming-Klunderud A, Skage M, Lien S, Jakobsen KS, Nederbragt AJ. An improved genome assembly uncovers prolific tandem repeats in Atlantic cod. BMC Genomics 2017; 18:95. [PMID: 28100185 PMCID: PMC5241972 DOI: 10.1186/s12864-016-3448-x] [Citation(s) in RCA: 115] [Impact Index Per Article: 16.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2016] [Accepted: 12/20/2016] [Indexed: 01/06/2023] Open
Abstract
BACKGROUND The first Atlantic cod (Gadus morhua) genome assembly published in 2011 was one of the early genome assemblies exclusively based on high-throughput 454 pyrosequencing. Since then, rapid advances in sequencing technologies have led to a multitude of assemblies generated for complex genomes, although many of these are of a fragmented nature with a significant fraction of bases in gaps. The development of long-read sequencing and improved software now enable the generation of more contiguous genome assemblies. RESULTS By combining data from Illumina, 454 and the longer PacBio sequencing technologies, as well as integrating the results of multiple assembly programs, we have created a substantially improved version of the Atlantic cod genome assembly. The sequence contiguity of this assembly is increased fifty-fold and the proportion of gap-bases has been reduced fifteen-fold. Compared to other vertebrates, the assembly contains an unusual high density of tandem repeats (TRs). Indeed, retrospective analyses reveal that gaps in the first genome assembly were largely associated with these TRs. We show that 21% of the TRs across the assembly, 19% in the promoter regions and 12% in the coding sequences are heterozygous in the sequenced individual. CONCLUSIONS The inclusion of PacBio reads combined with the use of multiple assembly programs drastically improved the Atlantic cod genome assembly by successfully resolving long TRs. The high frequency of heterozygous TRs within or in the vicinity of genes in the genome indicate a considerable standing genomic variation in Atlantic cod populations, which is likely of evolutionary importance.
Collapse
Affiliation(s)
- Ole K. Tørresen
- Centre for Ecological and Evolutionary Synthesis, Department of Biosciences, University of Oslo, Oslo, NO-0316 Norway
| | - Bastiaan Star
- Centre for Ecological and Evolutionary Synthesis, Department of Biosciences, University of Oslo, Oslo, NO-0316 Norway
| | - Sissel Jentoft
- Centre for Ecological and Evolutionary Synthesis, Department of Biosciences, University of Oslo, Oslo, NO-0316 Norway
- Department of Natural Sciences, University of Agder, Kristiansand, NO-4604 Norway
| | - William B. Reinar
- Centre for Ecological and Evolutionary Synthesis, Department of Biosciences, University of Oslo, Oslo, NO-0316 Norway
| | - Harald Grove
- Centre for Integrative Genetics (CIGENE), Department of Animal and Aquacultural Sciences, Norwegian University of Life Sciences, Ås, NO-1432 Norway
| | - Jason R. Miller
- J. Craig Venter Institute, 9704 Medical Center Drive, Rockville, 20850 MD USA
| | - Brian P. Walenz
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, 20892 MD USA
| | - James Knight
- Yale School of Medicine, Yale University, New Haven, 06520 CT USA
| | | | | | | | - Ave Tooming-Klunderud
- Centre for Ecological and Evolutionary Synthesis, Department of Biosciences, University of Oslo, Oslo, NO-0316 Norway
| | - Morten Skage
- Centre for Ecological and Evolutionary Synthesis, Department of Biosciences, University of Oslo, Oslo, NO-0316 Norway
| | - Sigbjørn Lien
- Centre for Integrative Genetics (CIGENE), Department of Animal and Aquacultural Sciences, Norwegian University of Life Sciences, Ås, NO-1432 Norway
| | - Kjetill S. Jakobsen
- Centre for Ecological and Evolutionary Synthesis, Department of Biosciences, University of Oslo, Oslo, NO-0316 Norway
| | - Alexander J. Nederbragt
- Centre for Ecological and Evolutionary Synthesis, Department of Biosciences, University of Oslo, Oslo, NO-0316 Norway
- Biomedical Informatics Research Group, Department of Informatics, University of Oslo, Oslo, NO-0316 Norway
| |
Collapse
|
7
|
Mourier T, Mollerup S, Vinner L, Hansen TA, Kjartansdóttir KR, Guldberg Frøslev T, Snogdal Boutrup T, Nielsen LP, Willerslev E, Hansen AJ. Characterizing novel endogenous retroviruses from genetic variation inferred from short sequence reads. Sci Rep 2015; 5:15644. [PMID: 26493184 PMCID: PMC4616055 DOI: 10.1038/srep15644] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2015] [Accepted: 09/21/2015] [Indexed: 02/06/2023] Open
Abstract
From Illumina sequencing of DNA from brain and liver tissue from the lion, Panthera leo, and tumor samples from the pike-perch, Sander lucioperca, we obtained two assembled sequence contigs with similarity to known retroviruses. Phylogenetic analyses suggest that the pike-perch retrovirus belongs to the epsilonretroviruses, and the lion retrovirus to the gammaretroviruses. To determine if these novel retroviral sequences originate from an endogenous retrovirus or from a recently integrated exogenous retrovirus, we assessed the genetic diversity of the parental sequences from which the short Illumina reads are derived. First, we showed by simulations that we can robustly infer the level of genetic diversity from short sequence reads. Second, we find that the measures of nucleotide diversity inferred from our retroviral sequences significantly exceed the level observed from Human Immunodeficiency Virus infections, prompting us to conclude that the novel retroviruses are both of endogenous origin. Through further simulations, we rule out the possibility that the observed elevated levels of nucleotide diversity are the result of co-infection with two closely related exogenous retroviruses.
Collapse
Affiliation(s)
- Tobias Mourier
- Centre for GeoGenetics, Museum of Natural History of Denmark, University of Copenhagen, Copenhagen, Denmark
| | - Sarah Mollerup
- Centre for GeoGenetics, Museum of Natural History of Denmark, University of Copenhagen, Copenhagen, Denmark
| | - Lasse Vinner
- Centre for GeoGenetics, Museum of Natural History of Denmark, University of Copenhagen, Copenhagen, Denmark
| | - Thomas Arn Hansen
- Centre for GeoGenetics, Museum of Natural History of Denmark, University of Copenhagen, Copenhagen, Denmark
| | - Kristín Rós Kjartansdóttir
- Centre for GeoGenetics, Museum of Natural History of Denmark, University of Copenhagen, Copenhagen, Denmark
| | - Tobias Guldberg Frøslev
- Centre for GeoGenetics, Museum of Natural History of Denmark, University of Copenhagen, Copenhagen, Denmark
| | - Torsten Snogdal Boutrup
- Section for Virology, National Veterinary Institute, Technical University of Denmark, Frederiksberg, Denmark
| | - Lars Peter Nielsen
- Department for Autoimmunology and Biomarkers, Statens Serum Institut, Copenhagen, Denmark
| | - Eske Willerslev
- Centre for GeoGenetics, Museum of Natural History of Denmark, University of Copenhagen, Copenhagen, Denmark
| | - Anders J Hansen
- Centre for GeoGenetics, Museum of Natural History of Denmark, University of Copenhagen, Copenhagen, Denmark
| |
Collapse
|
8
|
Beulé T, Agbessi MD, Dussert S, Jaligot E, Guyot R. Genome-wide analysis of LTR-retrotransposons in oil palm. BMC Genomics 2015; 16:795. [PMID: 26470789 PMCID: PMC4608283 DOI: 10.1186/s12864-015-2023-1] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2015] [Accepted: 10/07/2015] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The oil palm (Elaeis guineensis Jacq.) is a major cultivated crop and the world's largest source of edible vegetable oil. The genus Elaeis comprises two species E. guineensis, the commercial African oil palm and E. oleifera, which is used in oil palm genetic breeding. The recent publication of both the African oil palm genome assembly and the first draft sequence of its Latin American relative now allows us to tackle the challenge of understanding the genome composition, structure and evolution of these palm genomes through the annotation of their repeated sequences. METHODS In this study, we identified, annotated and compared Transposable Elements (TE) from the African and Latin American oil palms. In a first step, Transposable Element databases were built through de novo detection in both genome sequences then the TE content of both genomes was estimated. Then putative full-length retrotransposons with Long Terminal Repeats (LTRs) were further identified in the E. guineensis genome for characterization of their structural diversity, copy number and chromosomal distribution. Finally, their relative expression in several tissues was determined through in silico analysis of publicly available transcriptome data. RESULTS Our results reveal a congruence in the transpositional history of LTR retrotransposons between E. oleifera and E. guineensis, especially the Sto-4 family. Also, we have identified and described 583 full-length LTR-retrotransposons in the Elaeis guineensis genome. Our work shows that these elements are most likely no longer mobile and that no recent insertion event has occurred. Moreover, the analysis of chromosomal distribution suggests a preferential insertion of Copia elements in gene-rich regions, whereas Gypsy elements appear to be evenly distributed throughout the genome. CONCLUSIONS Considering the high proportion of LTR retrotransposon in the oil palm genome, our work will contribute to a greater understanding of their impact on genome organization and evolution. Moreover, the knowledge gained from this study constitutes a valuable resource for both the improvement of genome annotation and the investigation of the evolutionary history of palms.
Collapse
Affiliation(s)
- Thierry Beulé
- CIRAD, UMR DIADE (IRD, UM), 34394, Montpellier, France.
| | | | | | | | - Romain Guyot
- IRD, UMR IPME (IRD, CIRAD, UM), 34394, Montpellier, France.
| |
Collapse
|
9
|
Faber-Hammond JJ, Phillips RB, Brown KH. Comparative Analysis of the Shared Sex-Determination Region (SDR) among Salmonid Fishes. Genome Biol Evol 2015; 7:1972-87. [PMID: 26112966 PMCID: PMC4524489 DOI: 10.1093/gbe/evv123] [Citation(s) in RCA: 50] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023] Open
Abstract
Salmonids present an excellent model for studying evolution of young sex-chromosomes. Within the genus, Oncorhynchus, at least six independent sex-chromosome pairs have evolved, many unique to individual species. This variation results from the movement of the sex-determining gene, sdY, throughout the salmonid genome. While sdY is known to define sexual differentiation in salmonids, the mechanism of its movement throughout the genome has remained elusive due to high frequencies of repetitive elements, rDNA sequences, and transposons surrounding the sex-determining regions (SDR). Despite these difficulties, bacterial artificial chromosome (BAC) library clones from both rainbow trout and Atlantic salmon containing the sdY region have been reported. Here, we report the sequences for these BACs as well as the extended sequence for the known SDR in Chinook gained through genome walking methods. Comparative analysis allowed us to study the overlapping SDRs from three unique salmonid Y chromosomes to define the specific content, size, and variation present between the species. We found approximately 4.1 kb of orthologous sequence common to all three species, which contains the genetic content necessary for masculinization. The regions contain transposable elements that may be responsible for the translocations of the SDR throughout salmonid genomes and we examine potential mechanistic roles of each one.
Collapse
Affiliation(s)
- Joshua J Faber-Hammond
- Department of Biology, Portland State University School of Biological Sciences, Washington State University Vancouver
| | - Ruth B Phillips
- School of Biological Sciences, Washington State University Vancouver Center for Reproductive Biology, Washington State University, Pullman
| | - Kim H Brown
- Department of Biology, Portland State University
| |
Collapse
|
10
|
Disruption of the petal identity gene APETALA3-3 is highly correlated with loss of petals within the buttercup family (Ranunculaceae). Proc Natl Acad Sci U S A 2013; 110:5074-9. [PMID: 23479615 DOI: 10.1073/pnas.1219690110] [Citation(s) in RCA: 66] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Absence of petals, or being apetalous, is usually one of the most important features that characterizes a group of flowering plants at high taxonomic ranks (i.e., family and above). The apetalous condition, however, appears to be the result of parallel or convergent evolution with unknown genetic causes. Here we show that within the buttercup family (Ranunculaceae), apetalous genera in at least seven different lineages were all derived from petalous ancestors, indicative of parallel petal losses. We also show that independent petal losses within this family were strongly associated with decreased or eliminated expression of a single floral organ identity gene, APETALA3-3 (AP3-3), apparently owing to species-specific molecular lesions. In an apetalous mutant of Nigella, insertion of a transposable element into the second intron has led to silencing of the gene and transformation of petals into sepals. In several naturally occurring apetalous genera, such as Thalictrum, Beesia, and Enemion, the gene has either been lost altogether or disrupted by deletions in coding or regulatory regions. In Clematis, a large genus in which petalous species evolved secondarily from apetalous ones, the gene exhibits hallmarks of a pseudogene. These results suggest that, as a petal identity gene, AP3-3 has been silenced or down-regulated by different mechanisms in different evolutionary lineages. This also suggests that petal identity did not evolve many times independently across the Ranunculaceae but was lost in numerous instances. The genetic mechanisms underlying the independent petal losses, however, may be complex, with disruption of AP3-3 being either cause or effect.
Collapse
|
11
|
Steinbauerová V, Neumann P, Novák P, Macas J. A widespread occurrence of extra open reading frames in plant Ty3/gypsy retrotransposons. Genetica 2012; 139:1543-55. [PMID: 22544262 DOI: 10.1007/s10709-012-9654-9] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2012] [Accepted: 04/16/2012] [Indexed: 01/21/2023]
Abstract
Long terminal repeat (LTR) retrotransposons make up substantial parts of most higher plant genomes where they accumulate due to their replicative mode of transposition. Although the transposition is facilitated by proteins encoded within the gag-pol region which is common to all autonomous elements, some LTR retrotransposons were found to potentially carry an additional protein coding capacity represented by extra open reading frames located upstream or downstream of gag-pol. In this study, we performed a comprehensive in silico survey and comparative analysis of these extra open reading frames (ORFs) in the group of Ty3/gypsy LTR retrotransposons as the first step towards our understanding of their origin and function. We found that extra ORFs occur in all three major lineages of plant Ty3/gypsy elements, being the most frequent in the Tat lineage where most (77 %) of identified elements contained extra ORFs. This lineage was also characterized by the highest diversity of extra ORF arrangement (position and orientation) within the elements. On the other hand, all of these ORFs could be classified into only two broad groups based on their mutual similarities or the presence of short conserved motifs in their inferred protein sequences. In the Athila lineage, the extra ORFs were confined to the element 3' regions but they displayed much higher sequence diversity compared to those found in Tat. In the lineage of Chromoviruses the extra ORFs were relatively rare, occurring only in 5' regions of a group of elements present in a single plant family (Poaceae). In all three lineages, most extra ORFs lacked sequence similarities to characterized gene sequences or functional protein domains, except for two Athila-like elements with similarities to LOGL4 gene and part of the Chromoviruses extra ORFs that displayed partial similarity to histone H3 gene. Thus, in these cases the extra ORFs most likely originated by transduction or recombination of cellular gene sequences. In addition, the protein domain which is otherwise associated with DNA transposons have been detected in part of the Tat-like extra ORFs, pointing to their origin from an insertion event of a mobile element.
Collapse
Affiliation(s)
- Veronika Steinbauerová
- Institute of Plant Molecular Biology, Biology Centre ASCR, Branišovská 31, Ceske Budejovice, Czech Republic
| | | | | | | |
Collapse
|
12
|
Park M, Park J, Kim S, Kwon JK, Park HM, Bae IH, Yang TJ, Lee YH, Kang BC, Choi D. Evolution of the large genome in Capsicum annuum occurred through accumulation of single-type long terminal repeat retrotransposons and their derivatives. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2012; 69:1018-29. [PMID: 22074025 DOI: 10.1111/j.1365-313x.2011.04851.x] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/20/2023]
Abstract
Although plant genome sizes are extremely diverse, the mechanism underlying the expansion of huge genomes that did not experience whole-genome duplication has not been elucidated. The pepper, Capsicum annuum, is an excellent model for studies of genome expansion due to its large genome size (2700 Mb) and the absence of whole genome duplication. As most of the pepper genome structure has been identified as constitutive heterochromatin, we investigated the evolution of this region in detail. Our findings show that the constitutive heterochromatin in pepper was actively expanded 20.0-7.5 million years ago through a massive accumulation of single-type Ty3/Gypsy-like elements that belong to the Del subgroup. Interestingly, derivatives of the Del elements, such as non-autonomous long terminal repeat retrotransposons and long-unit tandem repeats, played important roles in the expansion of constitutive heterochromatic regions. This expansion occurred not only in the existing heterochromatic regions but also into the euchromatic regions. Furthermore, our results revealed a repeat of unit length 18-24 kb. This repeat was found not only in the pepper genome but also in the other solanaceous species, such as potato and tomato. These results represent a characteristic mechanism for large genome evolution in plants.
Collapse
Affiliation(s)
- Minkyu Park
- Interdisciplinary Program in Agriculture Biotechnology, Seoul National University, Seoul 151-921, Korea
| | | | | | | | | | | | | | | | | | | |
Collapse
|
13
|
Ahmed S, Shafiuddin MD, Azam MS, Islam MS, Ghosh A, Khan H. Identification and characterization of jute LTR retrotransposons:: Their abundance, heterogeneity and transcriptional activity. Mob Genet Elements 2011; 1:18-28. [PMID: 22016842 PMCID: PMC3190282 DOI: 10.4161/mge.1.1.16433] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2011] [Revised: 05/06/2011] [Accepted: 05/07/2011] [Indexed: 12/12/2022] Open
Abstract
Long Terminal Repeat (LTR) retrotransposons constitute a significant part of eukaryotic genomes and play an important role in genome evolution especially in plants. Jute is an important fiber crop with a large genome of 1,250 Mbps. This genome is still mostly unexplored. In this study we aimed at identifying and characterizing the LTR retrotransposons of jute with a view to understanding the jute genome better. In this study, the Reverse Transcriptase domain of Ty1-copia and Ty3-gypsy LTR retrotransposons of jute were amplified by degenerate primers and their expressions were examined by reverse transcription PCR. Copy numbers of reverse transcriptase (RT) genes of Ty1-copia and Ty3-gypsy elements were determined by dot blot analysis. Sequence analysis revealed higher heterogeneity among Ty1-copia retrotransposons than Ty3-gypsy and clustered each of them in three groups. Copy number of RT genes in Ty1-copia was found to be higher than that of Ty3-gypsy elements from dot blot hybridization. Cumulatively Ty1-copia and Ty3-gypsy may constitute around 19% of the jute genome where two groups of Ty1-copia were found to be transcriptionally active. Since the LTR retrotransposons constitute a large portion of jute genome, these findings imply the importance of these elements in the evolution of jute genome.
Collapse
Affiliation(s)
- Salim Ahmed
- Department of Genetic Engineering and Biotechnology; University of Dhaka; Dhaka, Bangladesh
| | - MD Shafiuddin
- Department of Biochemistry and Molecular Biology; University of Dhaka; Dhaka, Bangladesh
| | - Muhammad Shafiul Azam
- Department of Genetic Engineering and Biotechnology; University of Dhaka; Dhaka, Bangladesh
| | | | - Ajit Ghosh
- International Center for Genetic Engineering and Biotechnology; New Delhi, India
| | - Haseena Khan
- Department of Biochemistry and Molecular Biology; University of Dhaka; Dhaka, Bangladesh
| |
Collapse
|
14
|
Plant centromeric retrotransposons: a structural and cytogenetic perspective. Mob DNA 2011; 2:4. [PMID: 21371312 PMCID: PMC3059260 DOI: 10.1186/1759-8753-2-4] [Citation(s) in RCA: 138] [Impact Index Per Article: 10.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2010] [Accepted: 03/03/2011] [Indexed: 12/12/2022] Open
Abstract
Background The centromeric and pericentromeric regions of plant chromosomes are colonized by Ty3/gypsy retrotransposons, which, on the basis of their reverse transcriptase sequences, form the chromovirus CRM clade. Despite their potential importance for centromere evolution and function, they have remained poorly characterized. In this work, we aimed to carry out a comprehensive survey of CRM clade elements with an emphasis on their diversity, structure, chromosomal distribution and transcriptional activity. Results We have surveyed a set of 190 CRM elements belonging to 81 different retrotransposon families, derived from 33 host species and falling into 12 plant families. The sequences at the C-terminus of their integrases were unexpectedly heterogeneous, despite the understanding that they are responsible for targeting to the centromere. This variation allowed the division of the CRM clade into the three groups A, B and C, and the members of each differed considerably with respect to their chromosomal distribution. The differences in chromosomal distribution coincided with variation in the integrase C-terminus sequences possessing a putative targeting domain (PTD). A majority of the group A elements possess the CR motif and are concentrated in the centromeric region, while members of group C have the type II chromodomain and are dispersed throughout the genome. Although representatives of the group B lack a PTD of any type, they appeared to be localized preferentially in the centromeres of tested species. All tested elements were found to be transcriptionally active. Conclusions Comprehensive analysis of the CRM clade elements showed that genuinely centromeric retrotransposons represent only a fraction of the CRM clade (group A). These centromeric retrotransposons represent an active component of centromeres of a wide range of angiosperm species, implying that they play an important role in plant centromere evolution. In addition, their transcriptional activity is consistent with the notion that the transcription of centromeric retrotransposons has a role in normal centromere function.
Collapse
|
15
|
Park M, Jo S, Kwon JK, Park J, Ahn JH, Kim S, Lee YH, Yang TJ, Hur CG, Kang BC, Kim BD, Choi D. Comparative analysis of pepper and tomato reveals euchromatin expansion of pepper genome caused by differential accumulation of Ty3/Gypsy-like elements. BMC Genomics 2011; 12:85. [PMID: 21276256 PMCID: PMC3042944 DOI: 10.1186/1471-2164-12-85] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2010] [Accepted: 01/29/2011] [Indexed: 11/23/2022] Open
Abstract
Background Among the Solanaceae plants, the pepper genome is three times larger than that of tomato. Although the gene repertoire and gene order of both species are well conserved, the cause of the genome-size difference is not known. To determine the causes for the expansion of pepper euchromatic regions, we compared the pepper genome to that of tomato. Results For sequence-level analysis, we generated 35.6 Mb of pepper genomic sequences from euchromatin enriched 1,245 pepper BAC clones. The comparative analysis of orthologous gene-rich regions between both species revealed insertion of transposons exclusively in the pepper sequences, maintaining the gene order and content. The most common type of the transposon found was the LTR retrotransposon. Phylogenetic comparison of the LTR retrotransposons revealed that two groups of Ty3/Gypsy-like elements (Tat and Athila) were overly accumulated in the pepper genome. The FISH analysis of the pepper Tat elements showed a random distribution in heterochromatic and euchromatic regions, whereas the tomato Tat elements showed heterochromatin-preferential accumulation. Conclusions Compared to tomato pepper euchromatin doubled its size by differential accumulation of a specific group of Ty3/Gypsy-like elements. Our results could provide an insight on the mechanism of genome evolution in the Solanaceae family.
Collapse
Affiliation(s)
- Minkyu Park
- Interdisciplinary Program in Agriculture Biotechnology, Seoul National University, Seoul 151-921, Korea
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
16
|
Llorens C, Futami R, Covelli L, Domínguez-Escribá L, Viu JM, Tamarit D, Aguilar-Rodríguez J, Vicente-Ripolles M, Fuster G, Bernet GP, Maumus F, Munoz-Pomer A, Sempere JM, Latorre A, Moya A. The Gypsy Database (GyDB) of mobile genetic elements: release 2.0. Nucleic Acids Res 2011; 39:D70-4. [PMID: 21036865 PMCID: PMC3013669 DOI: 10.1093/nar/gkq1061] [Citation(s) in RCA: 236] [Impact Index Per Article: 18.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023] Open
Abstract
This article introduces the second release of the Gypsy Database of Mobile Genetic Elements (GyDB 2.0): a research project devoted to the evolutionary dynamics of viruses and transposable elements based on their phylogenetic classification (per lineage and protein domain). The Gypsy Database (GyDB) is a long-term project that is continuously progressing, and that owing to the high molecular diversity of mobile elements requires to be completed in several stages. GyDB 2.0 has been powered with a wiki to allow other researchers participate in the project. The current database stage and scope are long terminal repeats (LTR) retroelements and relatives. GyDB 2.0 is an update based on the analysis of Ty3/Gypsy, Retroviridae, Ty1/Copia and Bel/Pao LTR retroelements and the Caulimoviridae pararetroviruses of plants. Among other features, in terms of the aforementioned topics, this update adds: (i) a variety of descriptions and reviews distributed in multiple web pages; (ii) protein-based phylogenies, where phylogenetic levels are assigned to distinct classified elements; (iii) a collection of multiple alignments, lineage-specific hidden Markov models and consensus sequences, called GyDB collection; (iv) updated RefSeq databases and BLAST and HMM servers to facilitate sequence characterization of new LTR retroelement and caulimovirus queries; and (v) a bibliographic server. GyDB 2.0 is available at http://gydb.org.
Collapse
Affiliation(s)
- Carlos Llorens
- Biotechvana, Parc Científic, Universitat de València, Calle Catedrático José Beltrán 2, 46980 Paterna, València, Spain.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
17
|
Novikova OS. Diversity and evolution of LTR retrotransposons in the genome of Phanerochaete chrysosporium (Fungi: Basidiomycota). RUSS J GENET+ 2010. [DOI: 10.1134/s1022795410060013] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
|
18
|
Breen J, Wicker T, Kong X, Zhang J, Ma W, Paux E, Feuillet C, Appels R, Bellgard M. A highly conserved gene island of three genes on chromosome 3B of hexaploid wheat: diverse gene function and genomic structure maintained in a tightly linked block. BMC PLANT BIOLOGY 2010; 10:98. [PMID: 20507561 PMCID: PMC3017796 DOI: 10.1186/1471-2229-10-98] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/27/2009] [Accepted: 05/27/2010] [Indexed: 05/29/2023]
Abstract
BACKGROUND The complexity of the wheat genome has resulted from waves of retrotransposable element insertions. Gene deletions and disruptions generated by the fast replacement of repetitive elements in wheat have resulted in disruption of colinearity at a micro (sub-megabase) level among the cereals. In view of genomic changes that are possible within a given time span, conservation of genes between species tends to imply an important functional or regional constraint that does not permit a change in genomic structure. The ctg1034 contig completed in this paper was initially studied because it was assigned to the Sr2 resistance locus region, but detailed mapping studies subsequently assigned it to the long arm of 3B and revealed its unusual features. RESULTS BAC shotgun sequencing of the hexaploid wheat (Triticum aestivum cv. Chinese Spring) genome has been used to assemble a group of 15 wheat BACs from the chromosome 3B physical map FPC contig ctg1034 into a 783,553 bp genomic sequence. This ctg1034 sequence was annotated for biological features such as genes and transposable elements. A three-gene island was identified among >80% repetitive DNA sequence. Using bioinformatics analysis there were no observable similarity in their gene functions. The ctg1034 gene island also displayed complete conservation of gene order and orientation with syntenic gene islands found in publicly available genome sequences of Brachypodium distachyon, Oryza sativa, Sorghum bicolor and Zea mays, even though the intergenic space and introns were divergent. CONCLUSION We propose that ctg1034 is located within the heterochromatic C-band region of deletion bin 3BL7 based on the identification of heterochromatic tandem repeats and presence of significant matches to chromodomain-containing gypsy LTR retrotransposable elements. We also speculate that this location, among other highly repetitive sequences, may account for the relative stability in gene order and orientation within the gene island.Sequence data from this article have been deposited with the GenBank Data Libraries under accession no. GQ422824.
Collapse
Affiliation(s)
- James Breen
- Centre for Comparative Genomics (CCG), Murdoch University, South Street, Perth 6150, Australia
- Molecular Plant Breeding Co-operative Research Centre (MPBCRC) Murdoch University, South Street, Perth 6150, Australia
| | - Thomas Wicker
- Institute of Plant Biology, University Zurich, Zollikerstrasse 107, Zurich, CH-8008 Switzerland
| | - Xiuying Kong
- Key Laboratory of Crop Germplasm Resources and Utilization, MOA/Institute of Crop Sciences, CAAS/The Key Facility for Crop Gene Resources and Genetic Improvement, Beijing 100081, China
| | - Juncheng Zhang
- Key Laboratory of Crop Germplasm Resources and Utilization, MOA/Institute of Crop Sciences, CAAS/The Key Facility for Crop Gene Resources and Genetic Improvement, Beijing 100081, China
| | - Wujun Ma
- Centre for Comparative Genomics (CCG), Murdoch University, South Street, Perth 6150, Australia
- State Agricultural Biotechnology Centre (SABC), Murdoch University, South Street, Perth 6150, Australia
- Department of Agriculture and Food, Western Australia (DAFWA), 3 Baron Hay Court, Perth, 6151 Australia
| | - Etienne Paux
- UMR 1095 Génétique, Diversité et Ecophysiologie des Céréales, INRA Site de Crouël, 63100 Clermont-ferrand, France
| | - Catherine Feuillet
- UMR 1095 Génétique, Diversité et Ecophysiologie des Céréales, INRA Site de Crouël, 63100 Clermont-ferrand, France
| | - Rudi Appels
- Centre for Comparative Genomics (CCG), Murdoch University, South Street, Perth 6150, Australia
| | - Matthew Bellgard
- Centre for Comparative Genomics (CCG), Murdoch University, South Street, Perth 6150, Australia
| |
Collapse
|
19
|
Gottlieb AM, Poggio L. Genomic screening in dioecious "yerba mate" tree (Ilex paraguariensis A. St. Hill., Aquifoliaceae) through representational difference analysis. Genetica 2010; 138:567-78. [PMID: 20221672 DOI: 10.1007/s10709-010-9449-9] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2009] [Accepted: 09/25/2009] [Indexed: 01/31/2023]
Abstract
The "yerba mate" tree, Ilex paraguariensis, is a functionally dioecious crop species with economic relevance in several South American countries. We report a genomic screening accomplished through representational difference analysis (RDA) in male and female I. paraguariensis trees. The aim of the present paper was to investigate the occurrence of sex-related genomic differences in order to develop an early gender detection molecular method that could help reducing energy inputs during the "yerba mate" processing and that could be suitable for breeding programs. An intra-experiment redundancy was detected via SSCP analysis and sequence characterization. Taking together both reciprocal RDA assays, fragments isolated can be discriminated into three main categories. The first category of fragments shows spurious affinities with available deposited sequences and could be considered as specific to I. paraguariensis. The second category comprises sequences identified as organellar or ribosomal plant DNA. Sequences grouped in the third category involve clones akin to conserved domains of retrotransposons (RNaseH, integrases and/or chromodomains) from at least two distinct lineages of Ty3/Gypsy retrotransposons and one from Ty1/Copia retroelements, which in addition are associated to sex determination regions of the Solanaceae, Caricaceae and Salicaceae. A contig sequence was assembled that codes for an integrase core domain and a chromodomain. The phylogenetic analysis of the so-called IPRE (for I. paraguariensis retroelement) integrase domain indicates that it belongs to the Del lineage of the Chromoviridae. This is the first report of mobile elements isolated and detected from the "yerba mate" tree. Although RDA derived fragments, so far tested, have been retrieved from both sexes with similar sequences, association to sex related regions cannot be completely discarded. Implications of present results are further discussed.
Collapse
Affiliation(s)
- Alexandra Marina Gottlieb
- Laboratorio de Citogenética y Evolución (LaCyE), Departamento de Ecología, Genética y Evolución, Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires, Intendente Güiraldes y Costanera Norte s/n, 4to. Piso, Pabellón II, Ciudad Universitaria, C1428EHA, Ciudad Autónoma de Buenos Aires, Argentina.
| | | |
Collapse
|
20
|
Steinbiss S, Willhoeft U, Gremme G, Kurtz S. Fine-grained annotation and classification of de novo predicted LTR retrotransposons. Nucleic Acids Res 2010; 37:7002-13. [PMID: 19786494 PMCID: PMC2790888 DOI: 10.1093/nar/gkp759] [Citation(s) in RCA: 187] [Impact Index Per Article: 13.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/04/2022] Open
Abstract
Long terminal repeat (LTR) retrotransposons and endogenous retroviruses (ERVs) are transposable elements in eukaryotic genomes well suited for computational identification. De novo identification tools determine the position of potential LTR retrotransposon or ERV insertions in genomic sequences. For further analysis, it is desirable to obtain an annotation of the internal structure of such candidates. This article presents LTRdigest, a novel software tool for automated annotation of internal features of putative LTR retrotransposons. It uses local alignment and hidden Markov model-based algorithms to detect retrotransposon-associated protein domains as well as primer binding sites and polypurine tracts. As an example, we used LTRdigest results to identify 88 (near) full-length ERVs in the chromosome 4 sequence of Mus musculus, separating them from truncated insertions and other repeats. Furthermore, we propose a work flow for the use of LTRdigest in de novo LTR retrotransposon classification and perform an exemplary de novo analysis on the Drosophila melanogaster genome as a proof of concept. Using a new method solely based on the annotations generated by LTRdigest, 518 potential LTR retrotransposons were automatically assigned to 62 candidate groups. Representative sequences from 41 of these 62 groups were matched to reference sequences with >80% global sequence similarity.
Collapse
Affiliation(s)
- Sascha Steinbiss
- Center for Bioinformatics, University of Hamburg, Bundesstrasse 43, 20146 Hamburg, Germany.
| | | | | | | |
Collapse
|
21
|
Bao W, Kapitonov VV, Jurka J. Ginger DNA transposons in eukaryotes and their evolutionary relationships with long terminal repeat retrotransposons. Mob DNA 2010; 1:3. [PMID: 20226081 PMCID: PMC2836005 DOI: 10.1186/1759-8753-1-3] [Citation(s) in RCA: 41] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2009] [Accepted: 01/25/2010] [Indexed: 12/12/2022] Open
Abstract
Background In eukaryotes, long terminal repeat (LTR) retrotransposons such as Copia, BEL and Gypsy integrate their DNA copies into the host genome using a particular type of DDE transposase called integrase (INT). The Gypsy INT-like transposase is also conserved in the Polinton/Maverick self-synthesizing DNA transposons and in the 'cut and paste' DNA transposons known as TDD-4 and TDD-5. Moreover, it is known that INT is similar to bacterial transposases that belong to the IS3, IS481, IS30 and IS630 families. It has been suggested that LTR retrotransposons evolved from a non-LTR retrotransposon fused with a DNA transposon in early eukaryotes. In this paper we analyze a diverse superfamily of eukaryotic cut and paste DNA transposons coding for INT-like transposase and discuss their evolutionary relationship to LTR retrotransposons. Results A new diverse eukaryotic superfamily of DNA transposons, named Ginger (for 'Gypsy INteGrasE Related') DNA transposons is defined and analyzed. Analogously to the IS3 and IS481 bacterial transposons, the Ginger termini resemble those of the Gypsy LTR retrotransposons. Currently, Ginger transposons can be divided into two distinct groups named Ginger1 and Ginger2/Tdd. Elements from the Ginger1 group are characterized by approximately 40 to 270 base pair (bp) terminal inverted repeats (TIRs), and are flanked by CCGG-specific or CCGT-specific target site duplication (TSD) sequences. The Ginger1-encoded transposases contain an approximate 400 amino acid N-terminal portion sharing high amino acid identity to the entire Gypsy-encoded integrases, including the YPYY motif, zinc finger, DDE domain, and, importantly, the GPY/F motif, a hallmark of Gypsy and endogenous retrovirus (ERV) integrases. Ginger1 transposases also contain additional C-terminal domains: ovarian tumor (OTU)-like protease domain or Ulp1 protease domain. In vertebrate genomes, at least two host genes, which were previously thought to be derived from the Gypsy integrases, apparently have evolved from the Ginger1 transposase genes. We also introduce a second Ginger group, designated Ginger2/Tdd, which includes the previously reported DNA transposon TDD-4. Conclusions The Ginger superfamily represents eukaryotic DNA transposons closely related to LTR retrotransposons. Ginger elements provide new insights into the evolution of transposable elements and certain transposable element (TE)-derived genes.
Collapse
Affiliation(s)
- Weidong Bao
- Genetic Information Research Institute, Mountain View, CA, USA.
| | | | | |
Collapse
|
22
|
Macas J, Koblízková A, Navrátilová A, Neumann P. Hypervariable 3' UTR region of plant LTR-retrotransposons as a source of novel satellite repeats. Gene 2009; 448:198-206. [PMID: 19563868 DOI: 10.1016/j.gene.2009.06.014] [Citation(s) in RCA: 58] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2009] [Revised: 06/17/2009] [Accepted: 06/19/2009] [Indexed: 11/15/2022]
Abstract
The repetitive sequence PisTR-A has an unusual organization in the pea (Pisum sativum) genome, being present both as short dispersed repeats as well as long arrays of tandemly arranged satellite DNA. Cloning, sequencing and FISH analysis of both PisTR-A variants revealed that the former occurs in the genome embedded within the sequence of Ty3/gypsy-like Ogre elements, whereas the latter forms homogenized arrays of satellite repeats at several genomic loci. The Ogre elements carry the PisTR-A sequences in their 3' untranslated region (UTR) separating the gag-pol region from the 3' LTR. This region was found to be highly variable among pea Ogre elements, and includes a number of other tandem repeats along with or instead of PisTR-A. Bioinformatic analysis of LTR-retrotransposons mined from available plant genomic sequence data revealed that the frequent occurrence of variable tandem repeats within 3' UTRs is a typical feature of the Tat lineage of plant retrotransposons. Comparison of these repeats to known plant satellite sequences uncovered two other instances of satellites with sequence similarity to a Tat-like retrotransposon 3' UTR regions. These observations suggest that some retrotransposons may significantly contribute to satellite DNA evolution by generating a library of short repeat arrays that can subsequently be dispersed through the genome and eventually further amplified and homogenized into novel satellite repeats.
Collapse
Affiliation(s)
- Jirí Macas
- Biology Centre ASCR, Institute of Plant Molecular Biology, Branisovská 31, Ceské Budejovice, CZ-37005, Czech Republic.
| | | | | | | |
Collapse
|
23
|
de Setta N, Van Sluys MA, Capy P, Carareto CMA. Multiple invasions of Gypsy and Micropia retroelements in genus Zaprionus and melanogaster subgroup of the genus Drosophila. BMC Evol Biol 2009; 9:279. [PMID: 19954522 PMCID: PMC2797524 DOI: 10.1186/1471-2148-9-279] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2009] [Accepted: 12/02/2009] [Indexed: 11/23/2022] Open
Abstract
Background The Zaprionus genus shares evolutionary features with the melanogaster subgroup, such as space and time of origin. Although little information about the transposable element content in the Zaprionus genus had been accumulated, some of their elements appear to be more closely related with those of the melanogaster subgroup, indicating that these two groups of species were involved in horizontal transfer events during their evolution. Among these elements, the Gypsy and the Micropia retroelements were chosen for screening in seven species of the two Zaprionus subgenera, Anaprionus and Zaprionus. Results Screening allowed the identification of diverse Gypsy and Micropia retroelements only in species of the Zaprionus subgenus, showing that they are transcriptionally active in the sampled species. The sequences of each retroelement were closely related to those of the melanogaster species subgroup, and the most parsimonious hypothesis would be that 15 horizontal transfer events shaped their evolution. The Gypsy retroelement of the melanogaster subgroup probably invaded the Zaprionus genomes about 11 MYA. In contrast, the Micropia retroelement may have been introduced into the Zaprionus subgenus and the melanogaster subgroup from an unknown donor more recently (~3 MYA). Conclusion Gypsy and Micropia of Zaprionus and melanogaster species share similar evolutionary patterns. The sharing of evolutionary, ecological and ethological features probably allowed these species to pass through a permissive period of transposable element invasion, explaining the proposed waves of horizontal transfers.
Collapse
Affiliation(s)
- Nathalia de Setta
- Department of Biology, UNESP - São Paulo State University, São José do Rio Preto, SP, Brazil.
| | | | | | | |
Collapse
|
24
|
Gao D, Gill N, Kim HR, Walling JG, Zhang W, Fan C, Yu Y, Ma J, SanMiguel P, Jiang N, Cheng Z, Wing RA, Jiang J, Jackson SA. A lineage-specific centromere retrotransposon in Oryza brachyantha. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2009; 60:820-831. [PMID: 19702667 DOI: 10.1111/j.1365-313x.2009.04005.x] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/28/2023]
Abstract
Most eukaryotic centromeres contain large quantities of repetitive DNA, such as satellite repeats and retrotransposons. Unlike most transposons in plant genomes, the centromeric retrotransposon (CR) family is conserved over long evolutionary periods among a majority of the grass species. CR elements are highly concentrated in centromeres, and are likely to play a role in centromere function. In order to study centromere evolution in the Oryza (rice) genus, we sequenced the orthologous region to centromere 8 of Oryza sativa from a related species, Oryza brachyantha. We found that O. brachyantha does not have the canonical CRR (CR of rice) found in the centromeres of all other Oryza species. Instead, a new Ty3-gypsy (Metaviridae) retroelement (FRetro3) was found to colonize the centromeres of this species. This retroelement is found in high copy numbers in the O. brachyantha genome, but not in other Oryza genomes, and based on the dating of long terminal repeats (LTRs) of FRetro3 it was amplified in the genome in the last few million years. Interestingly, there is a high level of removal of FRetro3 based on solo-LTRs to full-length elements, and this rapid turnover may have played a role in the replacement of the canonical CRR with the new element by active deletion. Comparison with previously described ChIP cloning data revealed that FRetro3 is found in CENH3-associated chromatin sequences. Thus, within a single lineage of the Oryza genus, the canonical component of grass centromeres has been replaced with a new retrotransposon that has all the hallmarks of a centromeric retroelement.
Collapse
Affiliation(s)
- Dongying Gao
- Molecular and Evolutionary Genetics, Purdue University, 915 W. State Street, West Lafayette, IN 47907, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
25
|
Llorens C, Muñoz-Pomer A, Bernad L, Botella H, Moya A. Network dynamics of eukaryotic LTR retroelements beyond phylogenetic trees. Biol Direct 2009; 4:41. [PMID: 19883502 PMCID: PMC2774666 DOI: 10.1186/1745-6150-4-41] [Citation(s) in RCA: 94] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2009] [Accepted: 11/02/2009] [Indexed: 02/07/2023] Open
Abstract
BACKGROUND Sequencing projects have allowed diverse retroviruses and LTR retrotransposons from different eukaryotic organisms to be characterized. It is known that retroviruses and other retro-transcribing viruses evolve from LTR retrotransposons and that this whole system clusters into five families: Ty3/Gypsy, Retroviridae, Ty1/Copia, Bel/Pao and Caulimoviridae. Phylogenetic analyses usually show that these split into multiple distinct lineages but what is yet to be understood is how deep evolution occurred in this system. RESULTS We combined phylogenetic and graph analyses to investigate the history of LTR retroelements both as a tree and as a network. We used 268 non-redundant LTR retroelements, many of them introduced for the first time in this work, to elucidate all possible LTR retroelement phylogenetic patterns. These were superimposed over the tree of eukaryotes to investigate the dynamics of the system, at distinct evolutionary times. Next, we investigated phenotypic features such as duplication and variability of amino acid motifs, and several differences in genomic ORF organization. Using this information we characterized eight reticulate evolution markers to construct phenotypic network models. CONCLUSION The evolutionary history of LTR retroelements can be traced as a time-evolving network that depends on phylogenetic patterns, epigenetic host-factors and phenotypic plasticity. The Ty1/Copia and the Ty3/Gypsy families represent the oldest patterns in this network that we found mimics eukaryotic macroevolution. The emergence of the Bel/Pao, Retroviridae and Caulimoviridae families in this network can be related with distinct inflations of the Ty3/Gypsy family, at distinct evolutionary times. This suggests that Ty3/Gypsy ancestors diversified much more than their Ty1/Copia counterparts, at distinct geological eras. Consistent with the principle of preferential attachment, the connectivities among phenotypic markers, taken as network-represented combinations, are power-law distributed. This evidences an inflationary mode of evolution where the system diversity; 1) expands continuously alternating vertical and gradual processes of phylogenetic divergence with episodes of modular, saltatory and reticulate evolution; 2) is governed by the intrinsic capability of distinct LTR retroelement host-communities to self-organize their phenotypes according to emergent laws characteristic of complex systems. REVIEWERS This article was reviewed by Eugene V. Koonin, Eric Bapteste, and Enmanuelle Lerat (nominated by King Jordan).
Collapse
Affiliation(s)
- Carlos Llorens
- Institut Cavanilles de Biodiversitat i Biologia Evolutiva (ICBIBE), Universitat de València, Paterna, Valencia, Spain
- Biotechvana, Parc Científic, Universitat de València, Paterna, Valencia, Spain
| | - Alfonso Muñoz-Pomer
- Biotechvana, Parc Científic, Universitat de València, Paterna, Valencia, Spain
- Departamento de Sistemas Informáticos y Computación (DSIC), Universitat Politècnica de València, Valencia, Spain
| | - Lucia Bernad
- Biotechvana, Parc Científic, Universitat de València, Paterna, Valencia, Spain
| | - Hector Botella
- Institut Cavanilles de Biodiversitat i Biologia Evolutiva (ICBIBE), Universitat de València, Paterna, Valencia, Spain
- Área de Paleontología, Dpto. Geología, Universitat de València, Paterna, Valencia, Spain
| | - Andrés Moya
- Institut Cavanilles de Biodiversitat i Biologia Evolutiva (ICBIBE), Universitat de València, Paterna, Valencia, Spain
- Centro Superior de Investigación en Salud Pública (CSISP), Valencia, Spain
- CIBER de Epidemiología y Salud Pública (CIBERESP), Barcelona, Spain
| |
Collapse
|
26
|
Llorens C, Futami R, Renaud G, Moya A. Bioinformatic flowchart and database to investigate the origins and diversity of clan AA peptidases. Biol Direct 2009; 4:3. [PMID: 19173708 PMCID: PMC2642776 DOI: 10.1186/1745-6150-4-3] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2009] [Accepted: 01/27/2009] [Indexed: 02/07/2023] Open
Abstract
BACKGROUND Clan AA of aspartic peptidases relates the family of pepsin monomers evolutionarily with all dimeric peptidases encoded by eukaryotic LTR retroelements. Recent findings describing various pools of single-domain nonviral host peptidases, in prokaryotes and eukaryotes, indicate that the diversity of clan AA is larger than previously thought. The ensuing approach to investigate this enzyme group is by studying its phylogeny. However, clan AA is a difficult case to study due to the low similarity and different rates of evolution. This work is an ongoing attempt to investigate the different clan AA families to understand the cause of their diversity. RESULTS In this paper, we describe in-progress database and bioinformatic flowchart designed to characterize the clan AA protein domain based on all possible protein families through ancestral reconstructions, sequence logos, and hidden markov models (HMMs). The flowchart includes the characterization of a major consensus sequence based on 6 amino acid patterns with correspondence with Andreeva's model, the structural template describing the clan AA peptidase fold. The set of tools is work in progress we have organized in a database within the GyDB project, referred to as Clan AA Reference Database http://gydb.uv.es/gydb/phylogeny.php?tree=caard. CONCLUSION The pre-existing classification combined with the evolutionary history of LTR retroelements permits a consistent taxonomical collection of sequence logos and HMMs. This set is useful for gene annotation but also a reference to evaluate the diversity of, and the relationships among, the different families. Comparisons among HMMs suggest a common ancestor for all dimeric clan AA peptidases that is halfway between single-domain nonviral peptidases and those coded by Ty3/Gypsy LTR retroelements. Sequence logos reveal how all clan AA families follow similar protein domain architecture related to the peptidase fold. In particular, each family nucleates a particular consensus motif in the sequence position related to the flap. The different motifs constitute a network where an alanine-asparagine-like variable motif predominates, instead of the canonical flap of the HIV-1 peptidase and closer relatives.
Collapse
Affiliation(s)
- Carlos Llorens
- Institut Cavanilles de Biodiversitat i Biología Evolutiva, Universitat de València, Paterna, Valencia, Spain
- Biotechvana, Parc Cientific, Universitat de Valencia, Paterna, Valencia, Spain
| | - Ricardo Futami
- Biotechvana, Parc Cientific, Universitat de Valencia, Paterna, Valencia, Spain
| | - Gabriel Renaud
- Biotechvana, Parc Cientific, Universitat de Valencia, Paterna, Valencia, Spain
| | - Andrés Moya
- Institut Cavanilles de Biodiversitat i Biología Evolutiva, Universitat de València, Paterna, Valencia, Spain
- CIBER de Epidemiología y Salud Pública (CIBERESP), Barcelona, Spain
| |
Collapse
|
27
|
A transitional endogenous lentivirus from the genome of a basal primate and implications for lentivirus evolution. Proc Natl Acad Sci U S A 2008; 105:20362-7. [PMID: 19075221 DOI: 10.1073/pnas.0807873105] [Citation(s) in RCA: 149] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023] Open
Abstract
Lentiviruses chronically infect a broad range of mammalian species and have been transmitted from primates to humans, giving rise to multiple outbreaks of HIV infection over the past century. Although the circumstances surrounding these recent zoonoses are becoming clearer, the nature and timescale of interaction between lentiviruses and primates remains unknown. Here, we report the discovery of an endogenous lentivirus in the genome of the gray mouse lemur (Microcebus murinus), a strepsirrhine primate from Madagascar, demonstrating that lentiviruses are capable of invading the primate germ line. Phylogenetic analysis places gray mouse lemur prosimian immunodeficiency virus (pSIVgml) basal to all known primate lentiviruses and, consistent with this, its genomic organization is intermediate between the nonprimate lentiviruses and their more derived primate counterparts. Thus, pSIVgml represents the first unambiguous example of a viral transitional form, revealing the acquisition and loss of genomic features during lentiviral evolution. Furthermore, because terrestrial mammal populations in Madagascar and Africa are likely to have been isolated from one another for at least 14 million years, the presence of pSIVgml in the gray mouse lemur genome indicates that lentiviruses must have been infecting primates for at least this period of time, or have been transmitted between Malagasy and African primate populations by a vector species capable of traversing the Mozambique channel. The discovery of pSIVgml illustrates the utility of endogenous sequences for the study of contemporary retroviruses and indicates that primate lentiviruses may be considerably older and more broadly distributed than previously thought.
Collapse
|
28
|
Bae YA, Ahn JS, Kim SH, Rhyu MG, Kong Y, Cho SY. PwRn1, a novel Ty3/gypsy-like retrotransposon of Paragonimus westermani: molecular characters and its differentially preserved mobile potential according to host chromosomal polyploidy. BMC Genomics 2008; 9:482. [PMID: 18851759 PMCID: PMC2582038 DOI: 10.1186/1471-2164-9-482] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2008] [Accepted: 10/14/2008] [Indexed: 11/25/2022] Open
Abstract
Background Retrotransposons have been known to involve in the remodeling and evolution of host genome. These reverse transcribing elements, which show a complex evolutionary pathway with diverse intermediate forms, have been comprehensively analyzed from a wide range of host genomes, while the information remains limited to only a few species in the phylum Platyhelminthes. Results A LTR retrotransposon and its homologs with a strong phylogenetic affinity toward CsRn1 of Clonorchis sinensis were isolated from a trematode parasite Paragonimus westermani via a degenerate PCR method and from an insect species Anopheles gambiae by in silico analysis of the whole mosquito genome, respectively. These elements, designated PwRn1 and AgCR-1 – AgCR-14 conserved unique features including a t-RNATrp primer binding site and the unusual CHCC signature of Gag proteins. Their flanking LTRs displayed >97% nucleotide identities and thus, these elements were likely to have expanded recently in the trematode and insect genomes. They evolved heterogeneous expression strategies: a single fused ORF, two separate ORFs with an identical reading frame and two ORFs overlapped by -1 frameshifting. Phylogenetic analyses suggested that the elements with the separate ORFs had evolved from an ancestral form(s) with the overlapped ORFs. The mobile potential of PwRn1 was likely to be maintained differentially in association with the karyotype of host genomes, as was examined by the presence/absence of intergenomic polymorphism and mRNA transcripts. Conclusion Our results on the structural diversity of CsRn1-like elements can provide a molecular tool to dissect a more detailed evolutionary episode of LTR retrotransposons. The PwRn1-associated genomic polymorphism, which is substantial in diploids, will also be informative in addressing genomic diversification following inter-/intra-specific hybridization in P. westermani populations.
Collapse
Affiliation(s)
- Young-An Bae
- Department of Molecular Parasitology and Samsung Biomedical Research Institute, Sungkyunkwan University School of Medicine, Suwon, Gyeonggi-do 440-746, Korea.
| | | | | | | | | | | |
Collapse
|
29
|
Llorens C, Fares MA, Moya A. Relationships of gag-pol diversity between Ty3/Gypsy and Retroviridae LTR retroelements and the three kings hypothesis. BMC Evol Biol 2008; 8:276. [PMID: 18842133 PMCID: PMC2577118 DOI: 10.1186/1471-2148-8-276] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2008] [Accepted: 10/08/2008] [Indexed: 02/07/2023] Open
Abstract
BACKGROUND The origin of vertebrate retroviruses (Retroviridae) is yet to be thoroughly investigated, but due to their similarity and identical gag-pol (and env) genome structure, it is accepted that they evolve from Ty3/Gypsy LTR retroelements the retrotransposons and retroviruses of plants, fungi and animals. These 2 groups of LTR retroelements code for 3 proteins rarely studied due to the high variability - gag polyprotein, protease and GPY/F module. In relation to 3 previously proposed Retroviridae classes I, II and II, investigation of the above proteins conclusively uncovers important insights regarding the ancient history of Ty3/Gypsy and Retroviridae LTR retroelements. RESULTS We performed a comprehensive study of 120 non-redundant Ty3/Gypsy and Retroviridae LTR retroelements. Phylogenetic reconstruction inferred based on the concatenated analysis of the gag and pol polyproteins shows a robust phylogenetic signal regarding the clustering of OTUs. Evaluation of gag and pol polyproteins separately yields discordant information. While pol signal supports the traditional perspective (2 monophyletic groups), gag polyprotein describes an alternative scenario where each Retroviridae class can be distantly related with one or more Ty3/Gypsy lineages. We investigated more in depth this evidence through comparative analyses performed based on the gag polyprotein, the protease and the GPY/F module. Our results indicate that contrary to the traditional monophyletic view of the origin of vertebrate retroviruses, the Retroviridae class I is a molecular fossil, preserving features that were probably predominant among Ty3/Gypsy ancestors predating the split of plants, fungi and animals. In contrast, classes II and III maintain other phenotypes that emerged more recently during Ty3/Gypsy evolution. CONCLUSION The 3 Retroviridae classes I, II and III exhibit phenotypic differences that delineate a network never before reported between Ty3/Gypsy and Retroviridae LTR retroelements. This new scenario reveals how the diversity of vertebrate retroviruses is polyphyletically recurrent into the Ty3/Gypsy evolution, i.e. older than previously thought. The simplest hypothesis to explain this finding is that classes I, II and III trace back to at least 3 Ty3/Gypsy ancestors that emerged at different evolutionary times prior to protostomes-deuterostomes divergence. We have called this "the three kings hypothesis" concerning the origin of vertebrate retroviruses.
Collapse
Affiliation(s)
- Carlos Llorens
- Institut Cavanilles de Biodiversitat i Biología Evolutiva, Universitat de València, Polígono de la coma S/N, Paterna, Valencia, Spain
- Biotechvana, Parc Cientific, Universitat de Valencia, Paterna, Lab 16D Polígono de la coma S/N, Paterna, Valencia, Spain
| | - Mario A Fares
- Department of Genetics, University of Dublín, Trinity Collage Dublín, Dublín 2, Ireland
| | - Andres Moya
- Institut Cavanilles de Biodiversitat i Biología Evolutiva, Universitat de València, Polígono de la coma S/N, Paterna, Valencia, Spain
- CIBER de Epidemiología y Sal ud Pública (CIBERESP), Spain
| |
Collapse
|
30
|
Tempel S, Jurka M, Jurka J. VisualRepbase: an interface for the study of occurrences of transposable element families. BMC Bioinformatics 2008; 9:345. [PMID: 18710569 PMCID: PMC2527617 DOI: 10.1186/1471-2105-9-345] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2008] [Accepted: 08/18/2008] [Indexed: 12/20/2022] Open
Abstract
Background Repbase is a reference database of eukaryotic repetitive DNA, which includes prototypic sequences of repeats and basic information described in annotations. Repbase already has software for entering new sequence families and for comparing the user's sequence with the database of consensus sequences. Results We describe the software named VisualRepbase and the associated database, which allow for displaying and analyzing all occurrences of transposable element families present in an annotated genome. VisualRepbase is a Java-based interface which can download selected occurrences of transposable elements, show the distribution of given families on the chromosome, and present the localization of these occurrences with regard to gene annotations and other families of transposable elements in Repbase. In addition, it has several features for saving the graphical representation of occurrences, saving all sequences in FASTA format, and searching and saving all annotated genes that are surrounded by these occurrences. Conclusion VisualRepbase is available as a downloadable version. It can be found at .
Collapse
Affiliation(s)
- Sébastien Tempel
- Genetic Information Research Institute, 1925 Landings Dr, Mountain View, CA 94043, USA.
| | | | | |
Collapse
|
31
|
LTR retrotransposon landscape in Medicago truncatula: more rapid removal than in rice. BMC Genomics 2008; 9:382. [PMID: 18691433 PMCID: PMC2533021 DOI: 10.1186/1471-2164-9-382] [Citation(s) in RCA: 45] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2008] [Accepted: 08/10/2008] [Indexed: 11/16/2022] Open
Abstract
Background Long terminal repeat retrotransposons (LTR elements) are ubiquitous Eukaryotic TEs that transpose through RNA intermediates. Accounting for significant proportion of many plant genomes, LTR elements have been well established as one of the major forces underlying the evolution of plant genome size, structure and function. The accessibility of more than 40% of genomic sequences of the model legume Medicago truncatula (Mt) has made the comprehensive study of its LTR elements possible. Results We use a newly developed tool LTR_FINDER to identify LTR retrotransposons in the Mt genome and detect 526 full-length elements as well as a great number of copies related to them. These elements constitute about 9.6% of currently available genomic sequences. They are classified into 85 families of which 64 are reported for the first time. The majority of the LTR retrotransposons belong to either Copia or Gypsy superfamily and the others are categorized as TRIMs or LARDs by their length. We find that the copy-number of Copia-like families is 3 times more than that of Gypsy-like ones but the latter contribute more to the genome. The analysis of PBS and protein-coding domain structure of the LTR families reveals that they tend to use only 4–5 types of tRNAs and many families have quite conservative ORFs besides known TE domains. For several important families, we describe in detail their abundance, conservation, insertion time and structure. We investigate the amplification-deletion pattern of the elements and find that the detectable full-length elements are relatively young and most of them were inserted within the last 0.52 MY. We also estimate that more than ten million bp of the Mt genomic sequences have been removed by the deletion of LTR elements and the removal of the full-length structures in Mt has been more rapid than in rice. Conclusion This report is the first comprehensive description and analysis of LTR retrotransposons in the Mt genome. Many important novel LTR families were discovered and their evolution is elucidated. Our results may outline the LTR retrotransposon landscape of the model legume.
Collapse
|