1
|
Boland DJ, Cornejo-Corona I, Browne DR, Murphy RL, Mullet J, Okada S, Devarenne TP. Reclassification of Botryococcus braunii chemical races into separate species based on a comparative genomics analysis. PLoS One 2024; 19:e0304144. [PMID: 39074348 DOI: 10.1371/journal.pone.0304144] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2023] [Accepted: 05/07/2024] [Indexed: 07/31/2024] Open
Abstract
The colonial green microalga Botryococcus braunii is well known for producing liquid hydrocarbons that can be utilized as biofuel feedstocks. B. braunii is taxonomically classified as a single species made up of three chemical races, A, B, and L, that are mainly distinguished by the hydrocarbons produced. We previously reported a B race draft nuclear genome, and here we report the draft nuclear genomes for the A and L races. A comparative genomic study of the three B. braunii races and 14 other algal species within Chlorophyta revealed significant differences in the genomes of each race of B. braunii. Phylogenomically, there was a clear divergence of the three races with the A race diverging earlier than both the B and L races, and the B and L races diverging from a later common ancestor not shared by the A race. DNA repeat content analysis suggested the B race had more repeat content than the A or L races. Orthogroup analysis revealed the B. braunii races displayed more gene orthogroup diversity than three closely related Chlamydomonas species, with nearly 24-36% of all genes in each B. braunii race being specific to each race. This analysis suggests the three races are distinct species based on sufficient differences in their respective genomes. We propose reclassification of the three chemical races to the following species names: Botryococcus alkenealis (A race), Botryococcus braunii (B race), and Botryococcus lycopadienor (L race).
Collapse
Affiliation(s)
- Devon J Boland
- Department of Biochemistry and Biophysics, Texas A & M University, College Station, Texas, United States of America
- Texas A&M Institute for Genome Sciences & Society (TIGSS), College Station, Texas, United States of America
| | - Ivette Cornejo-Corona
- Department of Biochemistry and Biophysics, Texas A & M University, College Station, Texas, United States of America
| | - Daniel R Browne
- Department of Biochemistry and Biophysics, Texas A & M University, College Station, Texas, United States of America
- AI & Computational Biology, LanzaTech Inc., Skokie, Illinois, United States of America
| | - Rebecca L Murphy
- Department of Biochemistry and Biophysics, Texas A & M University, College Station, Texas, United States of America
- Biology Department, Centenary College of Louisiana, Shreveport, Louisiana, United States of America
| | - John Mullet
- Department of Biochemistry and Biophysics, Texas A & M University, College Station, Texas, United States of America
| | - Shigeru Okada
- Laboratory of Aquatic Natural Products Chemistry, Graduate School of Agricultural and Life Sciences, The University of Tokyo, Yayoi, Bunkyo, Tokyo, Japan
| | - Timothy P Devarenne
- Department of Biochemistry and Biophysics, Texas A & M University, College Station, Texas, United States of America
| |
Collapse
|
2
|
Silva GS, Souza MM, Pamponét VDCC, Micheli F, de Melo CAF, de Oliveira SG, Costa EA. Cytogenomic Characterization of Transposable Elements and Satellite DNA in Passiflora L. Species. Genes (Basel) 2024; 15:418. [PMID: 38674353 PMCID: PMC11049143 DOI: 10.3390/genes15040418] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2024] [Revised: 03/17/2024] [Accepted: 03/22/2024] [Indexed: 04/28/2024] Open
Abstract
The species Passiflora alata, P. cincinnata, and P. edulis have great economic value due to the use of their fruits for human consumption. In this study, we compared the repetitive genome fractions of these three species. The compositions of the repetitive DNA of these three species' genomes were analyzed using clustering and identification of the repetitive sequences with RepeatExplorer. It was found that repetitive DNA content represents 74.70%, 66.86%, and 62.24% of the genome of P. alata, P. edulis, and P. cincinnata, respectively. LTR Ty3/Gypsy retrotransposons represent the highest genome proportions in P. alata and P. edulis, while Ty1/Copia comprises the largest proportion of P. cincinnata genome. Chromosomal mapping by Fluorescent In Situ Hybridization (FISH) showed that LTR retrotransposons have a dispersed distribution along chromosomes. The subtelomeric region of chromosomes is where 145 bp satellite DNA is located, suggesting that these elements may play important roles in genome structure and organization in these species. In this work, we obtained the first global characterization of the composition of repetitive DNA in Passiflora, showing that an increase in genome size is related to an increase in repetitive DNA, which represents an important evolutionary route for these species.
Collapse
Affiliation(s)
- Gonçalo Santos Silva
- Laboratório de Melhoramento de Plantas, Departamento de Ciências Biológicas, Universidade Estadual de Santa Cruz (UESC), Ilhéus 45662-900, BA, Brazil; (G.S.S.); (V.d.C.C.P.); (F.M.); (C.A.F.d.M.); (E.A.C.)
| | - Margarete Magalhães Souza
- Laboratório de Melhoramento de Plantas, Departamento de Ciências Biológicas, Universidade Estadual de Santa Cruz (UESC), Ilhéus 45662-900, BA, Brazil; (G.S.S.); (V.d.C.C.P.); (F.M.); (C.A.F.d.M.); (E.A.C.)
| | - Vanessa de Carvalho Cayres Pamponét
- Laboratório de Melhoramento de Plantas, Departamento de Ciências Biológicas, Universidade Estadual de Santa Cruz (UESC), Ilhéus 45662-900, BA, Brazil; (G.S.S.); (V.d.C.C.P.); (F.M.); (C.A.F.d.M.); (E.A.C.)
| | - Fabienne Micheli
- Laboratório de Melhoramento de Plantas, Departamento de Ciências Biológicas, Universidade Estadual de Santa Cruz (UESC), Ilhéus 45662-900, BA, Brazil; (G.S.S.); (V.d.C.C.P.); (F.M.); (C.A.F.d.M.); (E.A.C.)
- CIRAD, UMR AGAP, F-34398 Montpellier, France
| | - Cláusio Antônio Ferreira de Melo
- Laboratório de Melhoramento de Plantas, Departamento de Ciências Biológicas, Universidade Estadual de Santa Cruz (UESC), Ilhéus 45662-900, BA, Brazil; (G.S.S.); (V.d.C.C.P.); (F.M.); (C.A.F.d.M.); (E.A.C.)
| | - Sárah Gomes de Oliveira
- Departamento de Botânica, Instituto de Biociências, Universidade de São Paulo (USP), São Paulo 01049-010, SP, Brazil;
| | - Eduardo Almeida Costa
- Laboratório de Melhoramento de Plantas, Departamento de Ciências Biológicas, Universidade Estadual de Santa Cruz (UESC), Ilhéus 45662-900, BA, Brazil; (G.S.S.); (V.d.C.C.P.); (F.M.); (C.A.F.d.M.); (E.A.C.)
| |
Collapse
|
3
|
Raiyemo DA, Bobadilla LK, Tranel PJ. Genomic profiling of dioecious Amaranthus species provides novel insights into species relatedness and sex genes. BMC Biol 2023; 21:37. [PMID: 36804015 PMCID: PMC9940365 DOI: 10.1186/s12915-023-01539-9] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2022] [Accepted: 02/08/2023] [Indexed: 02/21/2023] Open
Abstract
BACKGROUND Amaranthus L. is a diverse genus consisting of domesticated, weedy, and non-invasive species distributed around the world. Nine species are dioecious, of which Amaranthus palmeri S. Watson and Amaranthus tuberculatus (Moq.) J.D. Sauer are troublesome weeds of agronomic crops in the USA and elsewhere. Shallow relationships among the dioecious Amaranthus species and the conservation of candidate genes within previously identified A. palmeri and A. tuberculatus male-specific regions of the Y (MSYs) in other dioecious species are poorly understood. In this study, seven genomes of dioecious amaranths were obtained by paired-end short-read sequencing and combined with short reads of seventeen species in the family Amaranthaceae from NCBI database. The species were phylogenomically analyzed to understand their relatedness. Genome characteristics for the dioecious species were evaluated and coverage analysis was used to investigate the conservation of sequences within the MSY regions. RESULTS We provide genome size, heterozygosity, and ploidy level inference for seven newly sequenced dioecious Amaranthus species and two additional dioecious species from the NCBI database. We report a pattern of transposable element proliferation in the species, in which seven species had more Ty3 elements than copia elements while A. palmeri and A. watsonii had more copia elements than Ty3 elements, similar to the TE pattern in some monoecious amaranths. Using a Mash-based phylogenomic analysis, we accurately recovered taxonomic relationships among the dioecious Amaranthus species that were previously identified based on comparative morphology. Coverage analysis revealed eleven candidate gene models within the A. palmeri MSY region with male-enriched coverages, as well as regions on scaffold 19 with female-enriched coverage, based on A. watsonii read alignments. A previously reported FLOWERING LOCUS T (FT) within A. tuberculatus MSY contig was also found to exhibit male-enriched coverages for three species closely related to A. tuberculatus but not for A. watsonii reads. Additional characterization of the A. palmeri MSY region revealed that 78% of the region is made of repetitive elements, typical of a sex determination region with reduced recombination. CONCLUSIONS The results of this study further increase our understanding of the relationships among the dioecious species of the Amaranthus genus as well as revealed genes with potential roles in sex function in the species.
Collapse
Affiliation(s)
- Damilola A Raiyemo
- Department of Crop Sciences, University of Illinois, Urbana, IL, 61801, USA
| | - Lucas K Bobadilla
- Department of Crop Sciences, University of Illinois, Urbana, IL, 61801, USA
| | - Patrick J Tranel
- Department of Crop Sciences, University of Illinois, Urbana, IL, 61801, USA.
| |
Collapse
|
4
|
Phylogenomics and genome size evolution in Amomum s. s. (Zingiberaceae): Comparison of traditional and modern sequencing methods. Mol Phylogenet Evol 2023; 178:107666. [PMID: 36384185 DOI: 10.1016/j.ympev.2022.107666] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2021] [Revised: 11/05/2022] [Accepted: 11/08/2022] [Indexed: 11/15/2022]
Abstract
BACKGROUND AND AIMS A targeted enrichment NGS approach was used to construct the phylogeny of Amomum Roxb. (Zingiberaceae). Phylogenies based on hundreds of nuclear genes, the whole plastome and the rDNA cistron were compared with an ITS-based phylogeny. Trends in genome size (GS) evolution were examined, chromosomes were counted and the geographical distribution of phylogenetic lineages was evaluated. METHODS In total, 92 accessions of 54 species were analysed. ITS was obtained for 79 accessions, 37 accessions were processed with Hyb-Seq and sequences from 449 nuclear genes, the whole cpDNA, and the rDNA cistron were analysed using concatenation, coalescence and supertree approaches. The evolution of absolute GS was analysed in a phylogenetic and geographical context. The chromosome numbers of 12 accessions were counted. KEY RESULTS Four groups were recognised in all datasets though their mutual relationships differ among datasets. While group A (A. subulatum and A. petaloideum) is basal to the remaining groups in the nuclear gene phylogeny, in the cpDNA topology it is sister to group B (A. repoeense and related species) and, in the ITS topology, it is sister to group D (the Elettariopsis lineage). The former Elettariopsis makes a monophyletic group. There is an increasing trend in GS during evolution. The largest GS values were found in group D in two tetraploid taxa, A. cinnamomeum and A. aff. biphyllum (both 2n = 96 chromosomes). The rest varied in GS (2C = 3.54-8.78 pg) with a constant chromosome number 2n = 48. There is a weak connection between phylogeny, GS and geography in Amomum. CONCLUSIONS Amomum consists of four groups, and the former Elettariopsis is monophyletic. Species in this group have the largest GS. Two polyploids were found and GS greatly varied in the rest of Amomum.
Collapse
|
5
|
Lu R, Liu J, Wang X, Song Z, Ji X, Li N, Ma G, Sun X. Chromosome-Level Genome Assembly of a Fragrant Japonica Rice Cultivar 'Changxianggeng 1813' Provides Insights into Genomic Variations between Fragrant and Non-Fragrant Japonica Rice. Int J Mol Sci 2022; 23:9705. [PMID: 36077110 PMCID: PMC9456513 DOI: 10.3390/ijms23179705] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2022] [Revised: 08/24/2022] [Accepted: 08/24/2022] [Indexed: 11/16/2022] Open
Abstract
East Asia has an abundant resource of fragrant japonica rice that is gaining increasing interest among both consumers and producers. However, genomic resources and in particular complete genome sequences currently available for the breeding of fragrant japonica rice are still scarce. Here, integrating Nanopore long-read sequencing, Illumina short-read sequencing, and Hi-C methods, we presented a high-quality chromosome-level genome assembly (~378.78 Mb) for a new fragrant japonica cultivar ‘Changxianggeng 1813’, with 31,671 predicated protein-coding genes. Based on the annotated genome sequence, we demonstrated that it was the badh2-E2 type of deletion (a 7-bp deletion in the second exon) that caused fragrance in ‘Changxianggeng 1813’. Comparative genomic analyses revealed that multiple gene families involved in the abiotic stress response were expanded in the ‘Changxianggeng 1813’ genome, which further supported the previous finding that no generalized loss of abiotic stress tolerance associated with the fragrance phenotype. Although the ‘Changxianggeng 1813’ genome showed high genomic synteny with the genome of the non-fragrant japonica rice cultivar Nipponbare, a total of 289,970 single nucleotide polymorphisms (SNPs), 96,093 small insertion-deletion polymorphisms (InDels), and 8690 large structure variants (SVs, >1000 bp) were identified between them. Together, these genomic resources will be valuable for elucidating the mechanisms underlying economically important traits and have wide-ranging implications for genomics-assisted breeding in fragrant japonica rice.
Collapse
Affiliation(s)
- Ruisen Lu
- Institute of Botany, Jiangsu Province and Chinese Academy of Sciences, Nanjing 210014, China
| | - Jia Liu
- Institute of Botany, Jiangsu Province and Chinese Academy of Sciences, Nanjing 210014, China
| | - Xuegang Wang
- Changshu Agricultural Science Research Institute, Changshu 215500, China
| | - Zhao Song
- Guangdong Academy of Forestry, Guangzhou 510520, China
| | - Xiangdong Ji
- Changshu Agricultural Science Research Institute, Changshu 215500, China
| | - Naiwei Li
- Institute of Botany, Jiangsu Province and Chinese Academy of Sciences, Nanjing 210014, China
| | - Gang Ma
- Changshu Agricultural Science Research Institute, Changshu 215500, China
| | - Xiaoqin Sun
- Institute of Botany, Jiangsu Province and Chinese Academy of Sciences, Nanjing 210014, China
| |
Collapse
|
6
|
Dai SF, Zhu XG, Hutang GR, Li JY, Tian JQ, Jiang XH, Zhang D, Gao LZ. Genome Size Variation and Evolution Driven by Transposable Elements in the Genus Oryza. FRONTIERS IN PLANT SCIENCE 2022; 13:921937. [PMID: 35874017 PMCID: PMC9301470 DOI: 10.3389/fpls.2022.921937] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/16/2022] [Accepted: 05/16/2022] [Indexed: 05/08/2023]
Abstract
Genome size variation and evolutionary forces behind have been long pursued in flowering plants. The genus Oryza, consisting of approximately 25 wild species and two cultivated rice, harbors eleven extant genome types, six of which are diploid (AA, BB, CC, EE, FF, and GG) and five of which are tetraploid (BBCC, CCDD, HHJJ, HHKK, and KKLL). To obtain the most comprehensive knowledge of genome size variation in the genus Oryza, we performed flow cytometry experiments and estimated genome sizes of 166 accessions belonging to 16 non-AA genome Oryza species. k-mer analyses were followed to verify the experimental results of the two accessions for each species. Our results showed that genome sizes largely varied fourfold in the genus Oryza, ranging from 279 Mb in Oryza brachyantha (FF) to 1,203 Mb in Oryza ridleyi (HHJJ). There was a 2-fold variation (ranging from 570 to 1,203 Mb) in genome size among the tetraploid species, while the diploid species had 3-fold variation, ranging from 279 Mb in Oryza brachyantha (FF) to 905 Mb in Oryza australiensis (EE). The genome sizes of the tetraploid species were not always two times larger than those of the diploid species, and some diploid species even had larger genome sizes than those of tetraploids. Nevertheless, we found that genome sizes of newly formed allotetraploids (BBCC-) were almost equal to totaling genome sizes of their parental progenitors. Our results showed that the species belonging to the same genome types had similar genome sizes, while genome sizes exhibited a gradually decreased trend during the evolutionary process in the clade with AA, BB, CC, and EE genome types. Comparative genomic analyses further showed that the species with different rice genome types may had experienced dissimilar amplification histories of retrotransposons, resulting in remarkably different genome sizes. On the other hand, the closely related rice species may have experienced similar amplification history. We observed that the contents of transposable elements, long terminal repeats (LTR) retrotransposons, and particularly LTR/Gypsy retrotransposons varied largely but were significantly correlated with genome sizes. Therefore, this study demonstrated that LTR retrotransposons act as an active driver of genome size variation in the genus Oryza.
Collapse
Affiliation(s)
- Shuang-feng Dai
- Institution of Genomics and Bioinformatics, South China Agricultural University, Guangzhou, China
| | - Xun-ge Zhu
- Plant Germplasm and Genomics Center, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, China
| | - Ge-rang Hutang
- Plant Germplasm and Genomics Center, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, China
| | - Jia-yue Li
- Institution of Genomics and Bioinformatics, South China Agricultural University, Guangzhou, China
| | - Jia-qi Tian
- Institution of Genomics and Bioinformatics, South China Agricultural University, Guangzhou, China
| | - Xian-hui Jiang
- Institution of Genomics and Bioinformatics, South China Agricultural University, Guangzhou, China
| | - Dan Zhang
- College of Tropical Crops, Hainan University, Haikou, China
| | - Li-zhi Gao
- Institution of Genomics and Bioinformatics, South China Agricultural University, Guangzhou, China
- Plant Germplasm and Genomics Center, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, China
- College of Tropical Crops, Hainan University, Haikou, China
- *Correspondence: Li-zhi Gao,
| |
Collapse
|
7
|
Baiakhmetov E, Guyomar C, Shelest E, Nobis M, Gudkova PD. The first draft genome of feather grasses using SMRT sequencing and its implications in molecular studies of Stipa. Sci Rep 2021; 11:15345. [PMID: 34321531 PMCID: PMC8319324 DOI: 10.1038/s41598-021-94068-w] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2020] [Accepted: 06/24/2021] [Indexed: 11/22/2022] Open
Abstract
The Eurasian plant Stipa capillata is the most widespread species within feather grasses. Many taxa of the genus are dominants in steppe plant communities and can be used for their classification and in studies related to climate change. Moreover, some species are of economic importance mainly as fodder plants and can be used for soil remediation processes. Although large-scale molecular data has begun to appear, there is still no complete or draft genome for any Stipa species. Thus, here we present a single-molecule long-read sequencing dataset generated using the Pacific Biosciences Sequel System. A draft genome of about 1004 Mb was obtained with a contig N50 length of 351 kb. Importantly, here we report 81,224 annotated protein-coding genes, present 77,614 perfect and 58 unique imperfect SSRs, reveal the putative allopolyploid nature of S. capillata, investigate the evolutionary history of the genus, demonstrate structural heteroplasmy of the chloroplast genome and announce for the first time the mitochondrial genome in Stipa. The assembled nuclear, mitochondrial and chloroplast genomes provide a significant source of genetic data for further works on phylogeny, hybridisation and population studies within Stipa and the grass family Poaceae.
Collapse
Affiliation(s)
- Evgenii Baiakhmetov
- Institute of Botany, Faculty of Biology, Jagiellonian University, Gronostajowa 3, 30-387, Kraków, Poland. .,Research Laboratory 'Herbarium', National Research Tomsk State University, Lenin 36 Ave., Tomsk, 634050, Russia.
| | - Cervin Guyomar
- German Centre for Integrative Biodiversity Research (iDiv), Puschstrasse 4, 04103, Leipzig, Germany.,Institute for Genetics, Environment and Plant Protection (IGEPP), Agrocampus Ouest, INRAE, University of Rennes 1, 35650, Le Rheu, France
| | - Ekaterina Shelest
- German Centre for Integrative Biodiversity Research (iDiv), Puschstrasse 4, 04103, Leipzig, Germany.,Centre for Enzyme Innovation, University of Portsmouth, Portsmouth, PO1 2UP, UK
| | - Marcin Nobis
- Institute of Botany, Faculty of Biology, Jagiellonian University, Gronostajowa 3, 30-387, Kraków, Poland. .,Research Laboratory 'Herbarium', National Research Tomsk State University, Lenin 36 Ave., Tomsk, 634050, Russia.
| | - Polina D Gudkova
- Research Laboratory 'Herbarium', National Research Tomsk State University, Lenin 36 Ave., Tomsk, 634050, Russia.,Department of Biology, Altai State University, Lenin 61 Ave., Barnaul, Russia, 656049
| |
Collapse
|
8
|
Yañez-Santos AM, Paz RC, Paz-Sepúlveda PB, Urdampilleta JD. Full-length LTR retroelements in Capsicum annuum revealed a few species-specific family bursts with insertional preferences. Chromosome Res 2021; 29:261-284. [PMID: 34086192 DOI: 10.1007/s10577-021-09663-4] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2020] [Revised: 05/04/2021] [Accepted: 05/06/2021] [Indexed: 01/01/2023]
Abstract
Capsicum annuum is a species that has undergone an expansion of the size of its genome caused mainly by the amplification of repetitive DNA sequences, including mobile genetic elements. Based on information obtained from sequencing the genome of pepper, the estimated fraction of retroelements is approximately 81%, and previous results revealed an important contribution of lineages derived from Gypsy superfamily. However, the dynamics of the retroelements in the C. annuum genome is poorly understood. In this way, the present work seeks to investigate the phylogenetic diversity and genomic abundance of the families of autonomous (complete and intact) LTR retroelements from C. annuum and inspect their distribution along its chromosomes. In total, we identified 1151 structurally full-length retroelements (340 Copia; 811 Gypsy) grouped in 124 phylogenetic families in the base of their retrotranscriptase. All the evolutive lineages of LTR retroelements identified in plants were present in pepper; however, three of them comprise 83% of the entire LTR retroelements population, the lineages Athila, Del/Tekay, and Ale/Retrofit. From them, only three families represent 70.8% of the total number of the identified retroelements. A massive family-specific wave of amplification of two of them occurred in the last 0.5 Mya (GypsyCa_16; CopiaCa_01), whereas the third is more ancient and occurred 3.0 Mya (GypsyCa_13). Fluorescent in situ hybridization performed with family and lineage-specific probes revealed contrasting patterns of chromosomal affinity. Our results provide a database of the populations LTR retroelements specific to C. annuum genome. The most abundant families were analyzed according to chromosome insertional preferences, suppling useful tools to the design of retroelement-based markers specific to the species.
Collapse
Affiliation(s)
- Anahí Mara Yañez-Santos
- CIGEOBIO (FCEFyN, UNSJ/CONICET), Av. Ignacio de la Roza 590 (Oeste), J5402DCS, Rivadavia, San Juan, Argentina.,Instituto Multidisciplinario de Biología Vegetal (IMBIV), Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET)-Universidad Nacional de Córdoba (UNC), Córdoba, Argentina
| | - Rosalía Cristina Paz
- CIGEOBIO (FCEFyN, UNSJ/CONICET), Av. Ignacio de la Roza 590 (Oeste), J5402DCS, Rivadavia, San Juan, Argentina.
| | - Paula Beatriz Paz-Sepúlveda
- Instituto Multidisciplinario de Biología Celular (IMBICE), Consejo Nacional de Investigaciones Científicas y Técnicas de la República Argentina (CONICET) - Comisión de Investigaciones Científicas (CIC) - Universidad Nacional de La Plata (UNLP), La Plata, Argentina
| | - Juan Domingo Urdampilleta
- Instituto Multidisciplinario de Biología Vegetal (IMBIV), Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET)-Universidad Nacional de Córdoba (UNC), Córdoba, Argentina
| |
Collapse
|
9
|
Yu H, Lin T, Meng X, Du H, Zhang J, Liu G, Chen M, Jing Y, Kou L, Li X, Gao Q, Liang Y, Liu X, Fan Z, Liang Y, Cheng Z, Chen M, Tian Z, Wang Y, Chu C, Zuo J, Wan J, Qian Q, Han B, Zuccolo A, Wing RA, Gao C, Liang C, Li J. A route to de novo domestication of wild allotetraploid rice. Cell 2021; 184:1156-1170.e14. [PMID: 33539781 DOI: 10.1016/j.cell.2021.01.013] [Citation(s) in RCA: 201] [Impact Index Per Article: 67.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2020] [Revised: 12/02/2020] [Accepted: 01/11/2021] [Indexed: 12/25/2022]
Abstract
Cultivated rice varieties are all diploid, and polyploidization of rice has long been desired because of its advantages in genome buffering, vigorousness, and environmental robustness. However, a workable route remains elusive. Here, we describe a practical strategy, namely de novo domestication of wild allotetraploid rice. By screening allotetraploid wild rice inventory, we identified one genotype of Oryza alta (CCDD), polyploid rice 1 (PPR1), and established two important resources for its de novo domestication: (1) an efficient tissue culture, transformation, and genome editing system and (2) a high-quality genome assembly discriminated into two subgenomes of 12 chromosomes apiece. With these resources, we show that six agronomically important traits could be rapidly improved by editing O. alta homologs of the genes controlling these traits in diploid rice. Our results demonstrate the possibility that de novo domesticated allotetraploid rice can be developed into a new staple cereal to strengthen world food security.
Collapse
Affiliation(s)
- Hong Yu
- State Key Laboratory of Plant Genomics, and National Center for Plant Gene Research, Institute of Genetics and Developmental Biology, Innovation Academy for Seed Design, Chinese Academy of Sciences, Beijing 100101, China.
| | - Tao Lin
- State Key Laboratory of Plant Genomics, and National Center for Plant Gene Research, Institute of Genetics and Developmental Biology, Innovation Academy for Seed Design, Chinese Academy of Sciences, Beijing 100101, China
| | - Xiangbing Meng
- State Key Laboratory of Plant Genomics, and National Center for Plant Gene Research, Institute of Genetics and Developmental Biology, Innovation Academy for Seed Design, Chinese Academy of Sciences, Beijing 100101, China
| | - Huilong Du
- State Key Laboratory of Plant Genomics, and National Center for Plant Gene Research, Institute of Genetics and Developmental Biology, Innovation Academy for Seed Design, Chinese Academy of Sciences, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100049, China
| | - Jingkun Zhang
- State Key Laboratory of Plant Genomics, and National Center for Plant Gene Research, Institute of Genetics and Developmental Biology, Innovation Academy for Seed Design, Chinese Academy of Sciences, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100049, China
| | - Guifu Liu
- State Key Laboratory of Plant Genomics, and National Center for Plant Gene Research, Institute of Genetics and Developmental Biology, Innovation Academy for Seed Design, Chinese Academy of Sciences, Beijing 100101, China
| | - Mingjiang Chen
- State Key Laboratory of Plant Genomics, and National Center for Plant Gene Research, Institute of Genetics and Developmental Biology, Innovation Academy for Seed Design, Chinese Academy of Sciences, Beijing 100101, China
| | - Yanhui Jing
- State Key Laboratory of Plant Genomics, and National Center for Plant Gene Research, Institute of Genetics and Developmental Biology, Innovation Academy for Seed Design, Chinese Academy of Sciences, Beijing 100101, China
| | - Liquan Kou
- State Key Laboratory of Plant Genomics, and National Center for Plant Gene Research, Institute of Genetics and Developmental Biology, Innovation Academy for Seed Design, Chinese Academy of Sciences, Beijing 100101, China
| | - Xiuxiu Li
- State Key Laboratory of Plant Genomics, and National Center for Plant Gene Research, Institute of Genetics and Developmental Biology, Innovation Academy for Seed Design, Chinese Academy of Sciences, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100049, China
| | - Qiang Gao
- State Key Laboratory of Plant Genomics, and National Center for Plant Gene Research, Institute of Genetics and Developmental Biology, Innovation Academy for Seed Design, Chinese Academy of Sciences, Beijing 100101, China
| | - Yan Liang
- State Key Laboratory of Plant Genomics, and National Center for Plant Gene Research, Institute of Genetics and Developmental Biology, Innovation Academy for Seed Design, Chinese Academy of Sciences, Beijing 100101, China
| | - Xiangdong Liu
- Guangdong Laboratory of Lingnan Modern Agriculture, Guangzhou 510642, China; State Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, South China Agricultural University, Guangzhou 510642, China
| | - Zhilan Fan
- National Field Genebank for Wild Rice (Guangzhou), Rice Research Institute, Guangdong Academy of Agricultural Sciences, Guangzhou 510640, China
| | - Yuntao Liang
- Rice Research Institute, Guangxi Academy of Agricultural Science, Nanning 530007, China
| | - Zhukuan Cheng
- State Key Laboratory of Plant Genomics, and National Center for Plant Gene Research, Institute of Genetics and Developmental Biology, Innovation Academy for Seed Design, Chinese Academy of Sciences, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100049, China
| | - Mingsheng Chen
- State Key Laboratory of Plant Genomics, and National Center for Plant Gene Research, Institute of Genetics and Developmental Biology, Innovation Academy for Seed Design, Chinese Academy of Sciences, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100049, China
| | - Zhixi Tian
- University of Chinese Academy of Sciences, Beijing 100049, China; State Key Laboratory of Plant Cell and Chromosome Engineering, Institute of Genetics and Developmental Biology, Innovation Academy for Seed Design, Chinese Academy of Sciences, Beijing 100101, China
| | - Yonghong Wang
- State Key Laboratory of Plant Genomics, and National Center for Plant Gene Research, Institute of Genetics and Developmental Biology, Innovation Academy for Seed Design, Chinese Academy of Sciences, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100049, China
| | - Chengcai Chu
- State Key Laboratory of Plant Genomics, and National Center for Plant Gene Research, Institute of Genetics and Developmental Biology, Innovation Academy for Seed Design, Chinese Academy of Sciences, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100049, China
| | - Jianru Zuo
- State Key Laboratory of Plant Genomics, and National Center for Plant Gene Research, Institute of Genetics and Developmental Biology, Innovation Academy for Seed Design, Chinese Academy of Sciences, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100049, China
| | - Jianmin Wan
- National Key Facility for Crop Gene Resources and Genetic Improvement, Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing 100081, China
| | - Qian Qian
- State Key Laboratory of Rice Biology, China National Rice Research Institute, Chinese Academy of Agricultural Sciences, Hangzhou 310006, China
| | - Bin Han
- National Center of Plant Gene Research Shanghai Institute of Plant Physiology and Ecology, Shanghai Institutes for Biological Sciences and CAS Center of Excellence for Molecular Plant Sciences, Chinese Academy of Sciences, Shanghai 200233, China
| | - Andrea Zuccolo
- Center for Desert Agriculture, Biological and Environmental Sciences & Engineering Division (BESE), King Abdullah University of Science and Technology (KAUST), Thuwal 23955-6900, Saudi Arabia; Institute of Life Sciences, Scuola Superiore Sant'Anna, Pisa 56127, Italy
| | - Rod A Wing
- Center for Desert Agriculture, Biological and Environmental Sciences & Engineering Division (BESE), King Abdullah University of Science and Technology (KAUST), Thuwal 23955-6900, Saudi Arabia; Arizona Genomics Institute, School of Plant Sciences, University of Arizona, Tucson, AZ 85721, USA
| | - Caixia Gao
- University of Chinese Academy of Sciences, Beijing 100049, China; State Key Laboratory of Plant Cell and Chromosome Engineering, Institute of Genetics and Developmental Biology, Innovation Academy for Seed Design, Chinese Academy of Sciences, Beijing 100101, China.
| | - Chengzhi Liang
- State Key Laboratory of Plant Genomics, and National Center for Plant Gene Research, Institute of Genetics and Developmental Biology, Innovation Academy for Seed Design, Chinese Academy of Sciences, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100049, China.
| | - Jiayang Li
- State Key Laboratory of Plant Genomics, and National Center for Plant Gene Research, Institute of Genetics and Developmental Biology, Innovation Academy for Seed Design, Chinese Academy of Sciences, Beijing 100101, China; University of Chinese Academy of Sciences, Beijing 100049, China; Guangdong Laboratory of Lingnan Modern Agriculture, Guangzhou 510642, China.
| |
Collapse
|
10
|
Nan H, Li W, Lin YL, Gao LZ. Genome-Wide Analysis of WRKY Genes and Their Response to Salt Stress in the Wild Progenitor of Asian Cultivated Rice, Oryza rufipogon. Front Genet 2020. [DOI: 10.3389/fgene.2020.00359] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
|
11
|
Choi JY, Lye ZN, Groen SC, Dai X, Rughani P, Zaaijer S, Harrington ED, Juul S, Purugganan MD. Nanopore sequencing-based genome assembly and evolutionary genomics of circum-basmati rice. Genome Biol 2020; 21:21. [PMID: 32019604 PMCID: PMC7001208 DOI: 10.1186/s13059-020-1938-2] [Citation(s) in RCA: 48] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2019] [Accepted: 01/17/2020] [Indexed: 01/23/2023] Open
Abstract
Background The circum-basmati group of cultivated Asian rice (Oryza sativa) contains many iconic varieties and is widespread in the Indian subcontinent. Despite its economic and cultural importance, a high-quality reference genome is currently lacking, and the group’s evolutionary history is not fully resolved. To address these gaps, we use long-read nanopore sequencing and assemble the genomes of two circum-basmati rice varieties. Results We generate two high-quality, chromosome-level reference genomes that represent the 12 chromosomes of Oryza. The assemblies show a contig N50 of 6.32 Mb and 10.53 Mb for Basmati 334 and Dom Sufid, respectively. Using our highly contiguous assemblies, we characterize structural variations segregating across circum-basmati genomes. We discover repeat expansions not observed in japonica—the rice group most closely related to circum-basmati—as well as the presence and absence variants of over 20 Mb, one of which is a circum-basmati-specific deletion of a gene regulating awn length. We further detect strong evidence of admixture between the circum-basmati and circum-aus groups. This gene flow has its greatest effect on chromosome 10, causing both structural variation and single-nucleotide polymorphism to deviate from genome-wide history. Lastly, population genomic analysis of 78 circum-basmati varieties shows three major geographically structured genetic groups: Bhutan/Nepal, India/Bangladesh/Myanmar, and Iran/Pakistan. Conclusion The availability of high-quality reference genomes allows functional and evolutionary genomic analyses providing genome-wide evidence for gene flow between circum-aus and circum-basmati, describes the nature of circum-basmati structural variation, and reveals the presence/absence variation in this important and iconic rice variety group.
Collapse
Affiliation(s)
- Jae Young Choi
- Center for Genomics and Systems Biology, Department of Biology, New York University, New York, NY, USA.
| | - Zoe N Lye
- Center for Genomics and Systems Biology, Department of Biology, New York University, New York, NY, USA
| | - Simon C Groen
- Center for Genomics and Systems Biology, Department of Biology, New York University, New York, NY, USA
| | | | | | | | | | - Sissel Juul
- Oxford Nanopore Technologies, New York, NY, USA
| | - Michael D Purugganan
- Center for Genomics and Systems Biology, Department of Biology, New York University, New York, NY, USA. .,Center for Genomics and Systems Biology, NYU Abu Dhabi Research Institute, New York University Abu Dhabi, Abu Dhabi, United Arab Emirates.
| |
Collapse
|
12
|
Gichuki DK, Ma L, Zhu Z, Du C, Li Q, Hu G, Zhong Z, Li H, Wang Q, Xin H. Genome size, chromosome number determination, and analysis of the repetitive elements in Cissus quadrangularis. PeerJ 2019; 7:e8201. [PMID: 31875149 PMCID: PMC6927348 DOI: 10.7717/peerj.8201] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2019] [Accepted: 11/13/2019] [Indexed: 02/03/2023] Open
Abstract
Cissus quadrangularis (Vitaceae) is a perennial climber endemic to Africa and is characterized by succulent angular stems. The plant grows in arid and semi-arid regions of Africa especially in the African savanna. The stem of C. quadrangularis has a wide range of applications in both human and animal medicine, but there is limited cytogenetic information available for this species. In this study, the chromosome number, genome size, and genome composition for C. quadrangularis were determined. Flow cytometry results indicated that the genome size of C. quadrangularis is approximately 2C = 1.410 pg. Fluorescence microscopy combined with DAPI stain showed the chromosome numbers to be 2n = 48. It is likely that C. quadrangularis has a tetraploid genome after considering the basic chromosome numbers in Cissus genus (n = 10, 11, or 12). A combination of low-throughput genome sequencing and bioinformatics analysis allowed identification and quantification of repetitive elements that make up about 52% of the C. quadrangularis genome, which was dominated by LTR-retrotransposons. Two LTR superfamilies were identified as Copia and Gypsy, with 24% and 15% of the annotated clusters, respectively. The comparison of repeat elements for C. quadrangularis, Vitis vinifera, and four other selected members in the Cissus genus revealed a high diversity in the repetitive element components, which could suggest recent amplification events in the Cissus genus. Our data provides a platform for further studies on the phylogeny and karyotype evolution in this genus and in the family Vitaceae.
Collapse
Affiliation(s)
- Duncan Kiragu Gichuki
- CAS Key Laboratory of Plant Germplasm Enhancement and Specialty Agriculture, Wuhan Botanical Garden, The Innovative Academy of Seed Design, Chinese Academy of Sciences, Wuhan, China
- Center of Economic Botany, Core Botanical Gardens, Chinese Academy of Sciences, Wuhan, China
- University of Chinese Academy of Sciences, Beijing, Peoples Republic of China
| | - Lu Ma
- Shenzhen Tobeacon Technology Co. Ltd., Shenzhen, Peoples Republic of China
| | - Zhenfei Zhu
- CAS Key Laboratory of Plant Germplasm Enhancement and Specialty Agriculture, Wuhan Botanical Garden, The Innovative Academy of Seed Design, Chinese Academy of Sciences, Wuhan, China
- Center of Economic Botany, Core Botanical Gardens, Chinese Academy of Sciences, Wuhan, China
- University of Chinese Academy of Sciences, Beijing, Peoples Republic of China
| | - Chang Du
- CAS Key Laboratory of Plant Germplasm Enhancement and Specialty Agriculture, Wuhan Botanical Garden, The Innovative Academy of Seed Design, Chinese Academy of Sciences, Wuhan, China
- Center of Economic Botany, Core Botanical Gardens, Chinese Academy of Sciences, Wuhan, China
| | - Qingyun Li
- CAS Key Laboratory of Plant Germplasm Enhancement and Specialty Agriculture, Wuhan Botanical Garden, The Innovative Academy of Seed Design, Chinese Academy of Sciences, Wuhan, China
- Center of Economic Botany, Core Botanical Gardens, Chinese Academy of Sciences, Wuhan, China
| | - Guangwan Hu
- CAS Key Laboratory of Plant Germplasm Enhancement and Specialty Agriculture, Wuhan Botanical Garden, The Innovative Academy of Seed Design, Chinese Academy of Sciences, Wuhan, China
- Center of Economic Botany, Core Botanical Gardens, Chinese Academy of Sciences, Wuhan, China
| | - Zhixiang Zhong
- CAS Key Laboratory of Plant Germplasm Enhancement and Specialty Agriculture, Wuhan Botanical Garden, The Innovative Academy of Seed Design, Chinese Academy of Sciences, Wuhan, China
- Center of Economic Botany, Core Botanical Gardens, Chinese Academy of Sciences, Wuhan, China
| | - Honglin Li
- CAS Key Laboratory of Plant Germplasm Enhancement and Specialty Agriculture, Wuhan Botanical Garden, The Innovative Academy of Seed Design, Chinese Academy of Sciences, Wuhan, China
- Center of Economic Botany, Core Botanical Gardens, Chinese Academy of Sciences, Wuhan, China
| | - Qingfeng Wang
- CAS Key Laboratory of Plant Germplasm Enhancement and Specialty Agriculture, Wuhan Botanical Garden, The Innovative Academy of Seed Design, Chinese Academy of Sciences, Wuhan, China
- Center of Economic Botany, Core Botanical Gardens, Chinese Academy of Sciences, Wuhan, China
| | - Haiping Xin
- CAS Key Laboratory of Plant Germplasm Enhancement and Specialty Agriculture, Wuhan Botanical Garden, The Innovative Academy of Seed Design, Chinese Academy of Sciences, Wuhan, China
- Center of Economic Botany, Core Botanical Gardens, Chinese Academy of Sciences, Wuhan, China
| |
Collapse
|
13
|
Van-Lume B, Mata-Sucre Y, Báez M, Ribeiro T, Huettel B, Gagnon E, Leitch IJ, Pedrosa-Harand A, Lewis GP, Souza G. Evolutionary convergence or homology? Comparative cytogenomics of Caesalpinia group species (Leguminosae) reveals diversification in the pericentromeric heterochromatic composition. PLANTA 2019; 250:2173-2186. [PMID: 31696317 DOI: 10.1007/s00425-019-03287-z] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/09/2019] [Accepted: 09/25/2019] [Indexed: 05/02/2023]
Abstract
We demonstrated by cytogenomic analysis that the proximal heterochromatin of the Northeast Brazilian species of Caesalpinia group is enriched with phylogenetically conserved Ty3/Gypsy-Tekay RT, but diverge in the presence of Ty3/Gypsy-Athila RT and satDNA. The Caesalpinia Group includes 225 species and 27 monophyletic genera of which four occur in Northeastern Brazil: Erythrostemon (1 sp.), Cenostigma (7 spp.), Libidibia (1 sp.), and Paubrasilia (1 sp.). The last three genera are placed in different clades in the Caesalpinia Group phylogeny, and yet they are characterized by having a numerically stable karyotype 2n = 24 (16 M+8A) and GC-rich heterochromatic bands (chromomycin A3 positive/CMA+ bands) in the proximal chromosome regions. To characterize the composition of their heterochromatin and test for the homology of these chromosomal regions, genomic DNA was extracted from Cenostigma microphyllum, Libidibia ferrea, and Paubrasilia echinata, and sequenced at low coverage using the Illumina platform. The genomic repetitive fractions were characterized using a Galaxy/RepeatExplorer-Elixir platform. The most abundant elements of each genome were chromosomally located by fluorescent in situ hybridization (FISH) and compared to the CMA+ heterochromatin distribution. The repetitive fraction of the genomes of C. microphyllum, L. ferrea, and P. echinata were estimated to be 41.70%, 38.44%, and 72.51%, respectively. Ty3/Gypsy retrotransposons (RT), specifically the Tekay lineage, were the most abundant repeats in each of the three genomes. FISH mapping revealed species-specific patterns for the Tekay elements in the proximal regions of the chromosomes, co-localized with CMA+ bands. Other species-specific patterns were observed, e.g., for the Ty3/Gypsy RT Athila elements which were found in all the proximal heterochromatin of L. ferrea or restricted to the acrocentric chromosomes of C. microphyllum. This Athila labeling co-localized with satellite DNAs (satDNAs). Although the Caesalpinia Group diverged around 55 Mya, our results suggest an ancestral colonization of Tekay RT in the proximal heterochromatin. Thus, the present-day composition of the pericentromeric heterochromatin in these Northeast Brazilian species is a combination of the maintenance of an ancestral Tekay distribution with a species-specific accumulation of other repeats.
Collapse
Affiliation(s)
- Brena Van-Lume
- Laboratory of Plant Cytogenetics and Evolution, Department of Botany, Federal University of Pernambuco, Rua Nelson Chaves S/N, Cidade Universitária, Recife, PE, 50670-420, Brazil
| | - Yennifer Mata-Sucre
- Laboratory of Plant Cytogenetics and Evolution, Department of Botany, Federal University of Pernambuco, Rua Nelson Chaves S/N, Cidade Universitária, Recife, PE, 50670-420, Brazil
| | - Mariana Báez
- Laboratory of Plant Cytogenetics and Evolution, Department of Botany, Federal University of Pernambuco, Rua Nelson Chaves S/N, Cidade Universitária, Recife, PE, 50670-420, Brazil
| | - Tiago Ribeiro
- Laboratory of Plant Cytogenetics and Evolution, Department of Botany, Federal University of Pernambuco, Rua Nelson Chaves S/N, Cidade Universitária, Recife, PE, 50670-420, Brazil
- Department of Botany and Ecology, Institute of Biosciences, Federal University of Mato Grosso, Av. Fernando Correa da Costa, 2.367, Boa Esperança, Cuiabá, MT, 78060-900, Brazil
| | | | - Edeline Gagnon
- Royal Botanic Garden Edinburgh, 20A Inverleith Row, Edinburgh, EH3 5NZ, UK
| | - Ilia J Leitch
- Department of Comparative Plant and Fungal Biology, Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3AB, UK
| | - Andrea Pedrosa-Harand
- Laboratory of Plant Cytogenetics and Evolution, Department of Botany, Federal University of Pernambuco, Rua Nelson Chaves S/N, Cidade Universitária, Recife, PE, 50670-420, Brazil
| | - Gwilym P Lewis
- Department of Comparative Plant and Fungal Biology, Royal Botanic Gardens, Kew, Richmond, Surrey, TW9 3AB, UK
| | - Gustavo Souza
- Laboratory of Plant Cytogenetics and Evolution, Department of Botany, Federal University of Pernambuco, Rua Nelson Chaves S/N, Cidade Universitária, Recife, PE, 50670-420, Brazil.
| |
Collapse
|
14
|
Draft genome sequence of first monocot-halophytic species Oryza coarctata reveals stress-specific genes. Sci Rep 2018; 8:13698. [PMID: 30209320 PMCID: PMC6135824 DOI: 10.1038/s41598-018-31518-y] [Citation(s) in RCA: 32] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2018] [Accepted: 08/21/2018] [Indexed: 11/08/2022] Open
Abstract
Oryza coarctata (KKLL; 2n = 4x = 48, 665 Mb) also known as Porteresia coarctata is an extreme halophyte species of genus Oryza. Using Illumina and Nanopore reads, we achieved the assembled genome size of 569.9 Mb, accounting 85.69% of the estimated genome size with N50 of 1.85 Mb and 19.89% repetitive region. We also found 230,968 simple sequence repeats (SSRs) and 5,512 non-coding RNAs (ncRNAs). The functional annotation of predicted 33,627 protein-coding genes and 4,916 transcription factors revealed that high salinity adaptation of this species is due to the exclusive or excessive presence of stress-specific genes as compared to rice. We have identified 8 homologs to salt-tolerant SOS1 genes, one of the three main components of salt overly sensitive (SOS) signal pathway. On the other hand, the phylogenetic analysis of the assembled chloroplast (134.75 kb) and mitochondrial genome (491.06 kb) favours the conservative nature of these organelle genomes within Oryza taxon.
Collapse
|
15
|
Tamayo-Ordóñez YJ, Narváez-Zapata JA, Tamayo-Ordóñez MC, Sánchez-Teyer LF. Retroelements and DNA Methylation Could Contribute to Diversity of 5S rDNA in Agave L. J Mol Evol 2018; 86:404-423. [DOI: 10.1007/s00239-018-9856-6] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2018] [Accepted: 07/03/2018] [Indexed: 01/21/2023]
|
16
|
Hou F, Ma B, Xin Y, Kuang L, He N. Horizontal transfers of LTR retrotransposons in seven species of Rosales. Genome 2018; 61:587-594. [PMID: 29958091 DOI: 10.1139/gen-2017-0208] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023]
Abstract
Horizontal transposable element transfer (HTT) events have occurred among a large number of species and play important roles in the composition and evolution of eukaryotic genomes. HTTs are also regarded as effective forces in promoting genomic variation and biological innovation. In the present study, HTT events were identified and analyzed in seven sequenced species of Rosales using bioinformatics methods by comparing sequence conservation and Ka/Ks value of reverse transcriptase (RT) with 20 conserved genes, estimating the dating of HTTs, and analyzing the phylogenetic relationships. Seven HTT events involving long terminal repeat (LTR) retrotransposons, two HTTs between Morus notabilis and Ziziphus jujuba, and five between Malus domestica and Pyrus bretschneideri were identified. Further analysis revealed that these LTR retrotransposons had functional structures, and the copy insertion times were lower than the dating of HTTs, particularly in Mn.Zj.1 and Md.Pb.3. Altogether, the results demonstrate that LTR retrotransposons still have potential transposition activity in host genomes. These results indicate that HTT events are another strategy for exchanging genetic material among species and are important for the evolution of genomes.
Collapse
Affiliation(s)
- Fei Hou
- State Key Laboratory of Silkworm Genome Biology, Southwest University, Beibei, Chongqing 400715, China.,State Key Laboratory of Silkworm Genome Biology, Southwest University, Beibei, Chongqing 400715, China
| | - Bi Ma
- State Key Laboratory of Silkworm Genome Biology, Southwest University, Beibei, Chongqing 400715, China.,State Key Laboratory of Silkworm Genome Biology, Southwest University, Beibei, Chongqing 400715, China
| | - Youchao Xin
- State Key Laboratory of Silkworm Genome Biology, Southwest University, Beibei, Chongqing 400715, China.,State Key Laboratory of Silkworm Genome Biology, Southwest University, Beibei, Chongqing 400715, China
| | - Lulu Kuang
- State Key Laboratory of Silkworm Genome Biology, Southwest University, Beibei, Chongqing 400715, China.,State Key Laboratory of Silkworm Genome Biology, Southwest University, Beibei, Chongqing 400715, China
| | - Ningjia He
- State Key Laboratory of Silkworm Genome Biology, Southwest University, Beibei, Chongqing 400715, China.,State Key Laboratory of Silkworm Genome Biology, Southwest University, Beibei, Chongqing 400715, China
| |
Collapse
|
17
|
De novo genome assembly of Oryza granulata reveals rapid genome expansion and adaptive evolution. Commun Biol 2018; 1:84. [PMID: 30271965 PMCID: PMC6123737 DOI: 10.1038/s42003-018-0089-4] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2017] [Accepted: 06/08/2018] [Indexed: 12/18/2022] Open
Abstract
The wild relatives of rice have adapted to different ecological environments and constitute a useful reservoir of agronomic traits for genetic improvement. Here we present the ~777 Mb de novo assembled genome sequence of Oryza granulata. Recent bursts of long-terminal repeat retrotransposons, especially RIRE2, led to a rapid twofold increase in genome size after O. granulata speciation. Universal centromeric tandem repeats are absent within its centromeres, while gypsy-type LTRs constitute the main centromere-specific repetitive elements. A total of 40,116 protein-coding genes were predicted in O. granulata, which is close to that of Oryza sativa. Both the copy number and function of genes involved in photosynthesis and energy production have undergone positive selection during the evolution of O. granulata, which might have facilitated its adaptation to the low light habitats. Together, our findings reveal the rapid genome expansion, distinctive centromere organization, and adaptive evolution of O. granulata. Zhigang Wu, Dongming Fang, Rui Yang, et al. present the genome assembly of a wild rice species Oryza granulata, revealing critical insights about the rapid genome expansion and evolution observed in the Oryza genus. They find that recent bursts of LTR retrotransposons have led to the rapid increase in O. granulate genome size following speciation.
Collapse
|
18
|
Dluhošová J, Ištvánek J, Nedělník J, Řepková J. Red Clover ( Trifolium pratense) and Zigzag Clover ( T. medium) - A Picture of Genomic Similarities and Differences. FRONTIERS IN PLANT SCIENCE 2018; 9:724. [PMID: 29922311 PMCID: PMC5996420 DOI: 10.3389/fpls.2018.00724] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/22/2018] [Accepted: 05/14/2018] [Indexed: 05/29/2023]
Abstract
The genus clover (Trifolium sp.) is one of the most economically important genera in the Fabaceae family. More than 10 species are grown as manure plants or forage legumes. Red clover's (T. pratense) genome size is one of the smallest in the Trifolium genus, while many clovers with potential breeding value have much larger genomes. Zigzag clover (T. medium) is closely related to the sequenced red clover; however, its genome is approximately 7.5x larger. Currently, almost nothing is known about the architecture of this large genome and differences between these two clover species. We sequenced the T. medium genome (2n = 8x = 64) with ∼23× coverage and managed to partially assemble 492.7 Mbp of its genomic sequence. A thorough comparison between red clover and zigzag clover sequencing reads resulted in the successful validation of 7 T. pratense- and 45 T. medium-specific repetitive elements. The newly discovered repeats led to the set-up of the first partial T. medium karyotype. Newly discovered red clover and zigzag clover tandem repeats were summarized. The structure of centromere-specific satellite repeat resembling that of T. repens was inferred in T. pratense. Two repeats, TrM300 and TrM378, showed a specific localization into centromeres of a half of all zigzag clover chromosomes; TrM300 on eight chromosomes and TrM378 on 24 chromosomes. A comparison with the red clover draft sequence was also used to mine more than 105,000 simple sequence repeats (SSRs) and 1,170,000 single nucleotide variants (SNVs). The presented data obtained from the sequencing of zigzag clover represent the first glimpse on the genomic sequence of this species. Centromeric repeats indicated its allopolyploid origin and naturally occurring homogenization of the centromeric repeat motif was somehow prevented. Using various repeats, highly uniform 64 chromosomes were separated into eight types of chromosomes. Zigzag clover genome underwent substantial chromosome rearrangements and cannot be counted as a true octoploid. The resulting data, especially the large number of predicted SSRs and SNVs, may have great potential for further research of the legume family and for rapid advancements in clover breeding.
Collapse
Affiliation(s)
- Jana Dluhošová
- Department of Experimental Biology, Faculty of Science, Masaryk University, Brno, Czechia
| | - Jan Ištvánek
- Department of Experimental Biology, Faculty of Science, Masaryk University, Brno, Czechia
| | | | - Jana Řepková
- Department of Experimental Biology, Faculty of Science, Masaryk University, Brno, Czechia
| |
Collapse
|
19
|
Chowrasia S, Rawal HC, Mazumder A, Gaikwad K, Sharma TR, Singh NK, Mondal TK. Oryza coarctata Roxb. COMPENDIUM OF PLANT GENOMES 2018. [DOI: 10.1007/978-3-319-71997-9_8] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
|
20
|
Li X, Guo K, Zhu X, Chen P, Li Y, Xie G, Wang L, Wang Y, Persson S, Peng L. Domestication of rice has reduced the occurrence of transposable elements within gene coding regions. BMC Genomics 2017; 18:55. [PMID: 28068923 PMCID: PMC5223533 DOI: 10.1186/s12864-016-3454-z] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2016] [Accepted: 12/22/2016] [Indexed: 11/19/2022] Open
Abstract
Background Transposable elements (TEs) are prominent features in many plant genomes, and patterns of TEs in closely related rice species are thus proposed as an ideal model to study TEs roles in the context of plant genome evolution. As TEs may contribute to improved rice growth and grain quality, it is of pivotal significance for worldwide food security and biomass production. Results We analyzed three cultivated rice species and their closest five wild relatives for distribution and content of TEs in their genomes. Despite that the three cultivar rice species contained similar copies and more total TEs, their genomes contained much longer TEs as compared to their wild relatives. Notably, TEs were largely depleted from genomic regions that corresponded to genes in the cultivated species, while this was not the case for their wild relatives. Gene ontology and gene homology analyses revealed that while certain genes contained TEs in all the wild species, the closest homologs in the cultivated species were devoid of them. This distribution of TEs is surprising as the cultivated species are more distantly related to each other as compared to their closest wild relative. Hence, cultivated rice species have more similar TE distributions among their genes as compared to their closest wild relatives. We, furthermore, exemplify how genes that are conferring important rice traits can be regulated by TE associations. Conclusions This study demonstrate that the cultivation of rice has led to distinct genomic distribution of TEs, and that certain rice traits are closely associated with TE distribution patterns. Hence, the results provide means to better understand TE-dependent rice traits and the potential to genetically engineer rice for better performance. Electronic supplementary material The online version of this article (doi:10.1186/s12864-016-3454-z) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Xukai Li
- Biomass and Bioenergy Research Centre, Huazhong Agricultural University, Wuhan, Hubei, 430070, People's Republic of China.,National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan, Hubei, 430070, People's Republic of China.,College of Plant Science and Technology, Huazhong Agricultural University, Wuhan, Hubei, 430070, People's Republic of China
| | - Kai Guo
- Biomass and Bioenergy Research Centre, Huazhong Agricultural University, Wuhan, Hubei, 430070, People's Republic of China.,National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan, Hubei, 430070, People's Republic of China.,College of Life Science and Technology, Huazhong Agricultural University, Wuhan, Hubei, 430070, People's Republic of China
| | - Xiaobo Zhu
- Biomass and Bioenergy Research Centre, Huazhong Agricultural University, Wuhan, Hubei, 430070, People's Republic of China.,National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan, Hubei, 430070, People's Republic of China.,College of Plant Science and Technology, Huazhong Agricultural University, Wuhan, Hubei, 430070, People's Republic of China
| | - Peng Chen
- Biomass and Bioenergy Research Centre, Huazhong Agricultural University, Wuhan, Hubei, 430070, People's Republic of China.,National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan, Hubei, 430070, People's Republic of China.,College of Plant Science and Technology, Huazhong Agricultural University, Wuhan, Hubei, 430070, People's Republic of China
| | - Ying Li
- Biomass and Bioenergy Research Centre, Huazhong Agricultural University, Wuhan, Hubei, 430070, People's Republic of China.,National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan, Hubei, 430070, People's Republic of China.,College of Plant Science and Technology, Huazhong Agricultural University, Wuhan, Hubei, 430070, People's Republic of China
| | - Guosheng Xie
- Biomass and Bioenergy Research Centre, Huazhong Agricultural University, Wuhan, Hubei, 430070, People's Republic of China.,National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan, Hubei, 430070, People's Republic of China.,College of Plant Science and Technology, Huazhong Agricultural University, Wuhan, Hubei, 430070, People's Republic of China
| | - Lingqiang Wang
- Biomass and Bioenergy Research Centre, Huazhong Agricultural University, Wuhan, Hubei, 430070, People's Republic of China.,National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan, Hubei, 430070, People's Republic of China.,College of Plant Science and Technology, Huazhong Agricultural University, Wuhan, Hubei, 430070, People's Republic of China
| | - Yanting Wang
- Biomass and Bioenergy Research Centre, Huazhong Agricultural University, Wuhan, Hubei, 430070, People's Republic of China.,National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan, Hubei, 430070, People's Republic of China.,College of Plant Science and Technology, Huazhong Agricultural University, Wuhan, Hubei, 430070, People's Republic of China
| | - Staffan Persson
- Biomass and Bioenergy Research Centre, Huazhong Agricultural University, Wuhan, Hubei, 430070, People's Republic of China. .,College of Plant Science and Technology, Huazhong Agricultural University, Wuhan, Hubei, 430070, People's Republic of China. .,School of Biosciences, University of Melbourne, Melbourne, VIC, 3010, Australia.
| | - Liangcai Peng
- Biomass and Bioenergy Research Centre, Huazhong Agricultural University, Wuhan, Hubei, 430070, People's Republic of China. .,National Key Laboratory of Crop Genetic Improvement, Huazhong Agricultural University, Wuhan, Hubei, 430070, People's Republic of China. .,College of Plant Science and Technology, Huazhong Agricultural University, Wuhan, Hubei, 430070, People's Republic of China.
| |
Collapse
|
21
|
Lwin AK, Bertolini E, Pè ME, Zuccolo A. Genomic skimming for identification of medium/highly abundant transposable elements in Arundo donax and Arundo plinii. Mol Genet Genomics 2016; 292:157-171. [PMID: 27778102 DOI: 10.1007/s00438-016-1263-3] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2016] [Accepted: 10/17/2016] [Indexed: 11/29/2022]
Abstract
Transposable elements (TEs) are the most abundant genetic material for almost all eukaryotic genomes. Their effects on the host genomes range from an extensive size variation to the regulation of gene expression, altering gene function and creating new genes. Because of TEs pivotal contribute to the host genome structure and regulation, their identification and characterization provide a wealth of useful data for gaining an in-depth understanding of host genome functioning. The giant reed (Arundo donax) is a perennial rhizomatous C3 grass, octadecaploid, with an estimated nuclear genome size of 2744 Mbp. It is a promising feedstock for second-generation biofuels and biomethane production. To identify and characterize the most repetitive TEs in the genomes of A. donax and its ancestral A. plinii species, we carried out low-coverage whole genome shotgun sequencing for both species. Using a de novo repeat identification approach, 33,041 and 28,237 non-redundant repetitive sequences were identified and characterized in A. donax and A. plinii genomes, representing 37.55 and 31.68% of each genome, respectively. Comparative phylogenetic analyses, including the major TE classes identified in A. donax and A. plinii, together with rice and maize TE paralogs, were carried out to understand the evolutionary relationship of the most abundant TE classes. Highly conserved copies of RIRE1-like Ty1-Copia elements were discovered in two Arundo spp. in which they represented nearly 3% of each genomic sequence. We identified and characterized the medium/highly repetitive TEs in two unexplored polyploid genomes, thus generating useful information for the study of the genomic structure, composition, and functioning of these two non-model species. We provided a valuable resource that could be exploited in any effort aimed at sequencing and assembling these two genomes.
Collapse
Affiliation(s)
- Aung Kyaw Lwin
- Institute of Life Sciences, Scuola Superiore Sant'Anna, Piazza Martiri della Libertà, 33, 56127, Pisa, Italy.,Sugarcane Research and Seed Farm, Pyinmana, Nay Pyi Taw, Myanmar
| | - Edoardo Bertolini
- Institute of Life Sciences, Scuola Superiore Sant'Anna, Piazza Martiri della Libertà, 33, 56127, Pisa, Italy
| | - Mario Enrico Pè
- Institute of Life Sciences, Scuola Superiore Sant'Anna, Piazza Martiri della Libertà, 33, 56127, Pisa, Italy
| | - Andrea Zuccolo
- Institute of Life Sciences, Scuola Superiore Sant'Anna, Piazza Martiri della Libertà, 33, 56127, Pisa, Italy.
| |
Collapse
|
22
|
Wang J, Yu Y, Tao F, Zhang J, Copetti D, Kudrna D, Talag J, Lee S, Wing RA, Fan C. DNA methylation changes facilitated evolution of genes derived from Mutator-like transposable elements. Genome Biol 2016; 17:92. [PMID: 27154274 PMCID: PMC4858842 DOI: 10.1186/s13059-016-0954-8] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2015] [Accepted: 04/14/2016] [Indexed: 01/17/2023] Open
Abstract
Background Mutator-like transposable elements, a class of DNA transposons, exist pervasively in both prokaryotic and eukaryotic genomes, with more than 10,000 copies identified in the rice genome. These elements can capture ectopic genomic sequences that lead to the formation of new gene structures. Here, based on whole-genome comparative analyses, we comprehensively investigated processes and mechanisms of the evolution of putative genes derived from Mutator-like transposable elements in ten Oryza species and the outgroup Leersia perieri, bridging ~20 million years of evolutionary history. Results Our analysis identified thousands of putative genes in each of the Oryza species, a large proportion of which have evidence of expression and contain chimeric structures. Consistent with previous reports, we observe that the putative Mutator-like transposable element-derived genes are generally GC-rich and mainly derive from GC-rich parental sequences. Furthermore, we determine that Mutator-like transposable elements capture parental sequences preferentially from genomic regions with low methylation levels and high recombination rates. We explicitly show that methylation levels in the internal and terminated inverted repeat regions of these elements, which might be directed by the 24-nucleotide small RNA-mediated pathway, are different and change dynamically over evolutionary time. Lastly, we demonstrate that putative genes derived from Mutator-like transposable elements tend to be expressed in mature pollen, which have undergone de-methylation programming, thereby providing a permissive expression environment for newly formed/transposable element-derived genes. Conclusions Our results suggest that DNA methylation may be a primary mechanism to facilitate the origination, survival, and regulation of genes derived from Mutator-like transposable elements, thus contributing to the evolution of gene innovation and novelty in plant genomes. Electronic supplementary material The online version of this article (doi:10.1186/s13059-016-0954-8) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Jun Wang
- Department of Biological Sciences, Wayne State University, 5047 Gullen Mall, Detroit, MI, 48202, USA
| | - Yeisoo Yu
- Arizona Genomics Institute, BIO5 Institute and School of Plant Sciences, University of Arizona, Tucson, AZ, 85721, USA
| | - Feng Tao
- Department of Biological Sciences, Wayne State University, 5047 Gullen Mall, Detroit, MI, 48202, USA
| | - Jianwei Zhang
- Arizona Genomics Institute, BIO5 Institute and School of Plant Sciences, University of Arizona, Tucson, AZ, 85721, USA
| | - Dario Copetti
- Arizona Genomics Institute, BIO5 Institute and School of Plant Sciences, University of Arizona, Tucson, AZ, 85721, USA
| | - Dave Kudrna
- Arizona Genomics Institute, BIO5 Institute and School of Plant Sciences, University of Arizona, Tucson, AZ, 85721, USA
| | - Jayson Talag
- Arizona Genomics Institute, BIO5 Institute and School of Plant Sciences, University of Arizona, Tucson, AZ, 85721, USA
| | - Seunghee Lee
- Arizona Genomics Institute, BIO5 Institute and School of Plant Sciences, University of Arizona, Tucson, AZ, 85721, USA
| | - Rod A Wing
- Arizona Genomics Institute, BIO5 Institute and School of Plant Sciences, University of Arizona, Tucson, AZ, 85721, USA.,T.T. Chang Genetics Resources Center, International Rice Research Institute, Los Baños, Laguna, 4031, Philippines
| | - Chuanzhu Fan
- Department of Biological Sciences, Wayne State University, 5047 Gullen Mall, Detroit, MI, 48202, USA.
| |
Collapse
|
23
|
Mlinarec J, Franjević D, Harapin J, Besendorfer V. The impact of the Tekay chromoviral elements on genome organisation and evolution of Anemone s.l. (Ranunculaceae). PLANT BIOLOGY (STUTTGART, GERMANY) 2016; 18:332-347. [PMID: 26370195 DOI: 10.1111/plb.12393] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/14/2015] [Accepted: 09/10/2015] [Indexed: 06/05/2023]
Abstract
We studied the highly abundant chromoviral Tekay clade in species from three sister genera - Anemone, Pulsatilla and Hepatica (Ranunculaceae). With this clade, we performed a concomitant survey of its phylogenetic diversity, chromosomal organisation and transcriptional activity in Anemone s.l. in order to investigate dynamics of the Tekay elements at a finer scale than previously achieved in this or any other flowering clade. The phylogenetic tree built from Tekay sequences conformed to expected evolutionary relationships of the species; exceptions being A. nemorosa and A. sylvestris, which appeared more closely related that expected, and we invoke hybridisation events to explain the observed topology. The separation of elements into six clusters could be explained by episodic bursts of activity since divergence from a common ancestor at different points in their respective evolutionary histories. In Anemone s.l. the Tekay elements do not have a preferential position on chromosomes, i.e. they can have a: (i) centromeric/pericentromeric position; (ii) interstitial position in DAPI-positive AT-rich heterochromatic regions; can be (iii) dispersed throughout chromosomes; or even (iv) be absent from large heterochromatic blocks. Widespread transcriptional activity of the Tekay elements in Anemone s.l. taxa indicate that some copies of Tekay elements could still be active in this plant group, contributing to genome evolution and speciation within Anemone s.l. Identification of Tekay elements in Anemone s.l. provides valuable information for understanding how different localisation patterns might help to facilitate plant genome organisation in a structural and functional manner.
Collapse
Affiliation(s)
- J Mlinarec
- Division of Biology, Department of Molecular Biology, Faculty of Science, University of Zagreb, Zagreb, Croatia
| | - D Franjević
- Division of Biology, Zoology Department, Faculty of Science, University of Zagreb, Zagreb, Croatia
| | - J Harapin
- Division of Biology, Department of Molecular Biology, Faculty of Science, University of Zagreb, Zagreb, Croatia
| | - V Besendorfer
- Division of Biology, Department of Molecular Biology, Faculty of Science, University of Zagreb, Zagreb, Croatia
| |
Collapse
|
24
|
Gebre YG, Bertolini E, Pè ME, Zuccolo A. Identification and characterization of abundant repetitive sequences in Eragrostis tef cv. Enatite genome. BMC PLANT BIOLOGY 2016; 16:39. [PMID: 26833063 PMCID: PMC4736629 DOI: 10.1186/s12870-016-0725-4] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/15/2016] [Accepted: 01/28/2016] [Indexed: 06/05/2023]
Abstract
BACKGROUND Eragrostis tef is an allotetraploid (2n = 4 × = 40) annual, C4 grass with an estimated nuclear genome size of 730 Mbp. It is widely grown in Ethiopia, where it provides basic nutrition for more than half of the population. Although a draft assembly of the E. tef genome was made available in 2014, characterization of the repetitive portion of the E. tef genome has not been a subject of a detailed analysis. Repetitive sequences constitute most of the DNA in eukaryotic genomes. Transposable elements are usually the most abundant repetitive component in plant genomes. They contribute to genome size variation, cause mutations, can result in chromosomal rearrangements, and influence gene regulation. An extensive and in depth characterization of the repetitive component is essential in understanding the evolution and function of the genome. RESULTS Using new paired-end sequence data and a de novo repeat identification strategy, we identified the most repetitive elements in the E. tef genome. Putative repeat sequences were annotated based on similarity to known repeat groups in other grasses. Altogether we identified 1,389 medium/highly repetitive sequences that collectively represent about 27% of the teff genome. Phylogenetic analyses of the most important classes of TEs were carried out in a comparative framework including paralog elements from rice and maize. Finally, an abundant tandem repeat accounting for more than 4% of the whole genome was identified and partially characterized. CONCLUSIONS Analyzing a large sample of randomly sheared reads we obtained a library of the repetitive sequences of E. tef. The approach we used was designed to avoid underestimation of repeat contribution; such underestimation is characteristic of whole genome assembly projects. The data collected represent a valuable resource for further analysis of the genome of this important orphan crop.
Collapse
Affiliation(s)
- Yohannes Gedamu Gebre
- Institute of Life Sciences, Scuola Superiore Sant'Anna, Piazza Martiri della Libertà, 33-56127, Pisa, Italy.
- Department of Dryland Crop and Horticultural Sciences, College of Dryland Agriculture and Natural Resources, Mekelle University, P.O.Box 231, Mekelle, Ethiopia.
| | - Edoardo Bertolini
- Institute of Life Sciences, Scuola Superiore Sant'Anna, Piazza Martiri della Libertà, 33-56127, Pisa, Italy.
| | - Mario Enrico Pè
- Institute of Life Sciences, Scuola Superiore Sant'Anna, Piazza Martiri della Libertà, 33-56127, Pisa, Italy.
| | - Andrea Zuccolo
- Institute of Life Sciences, Scuola Superiore Sant'Anna, Piazza Martiri della Libertà, 33-56127, Pisa, Italy.
| |
Collapse
|
25
|
Macas J, Novák P, Pellicer J, Čížková J, Koblížková A, Neumann P, Fuková I, Doležel J, Kelly LJ, Leitch IJ. In Depth Characterization of Repetitive DNA in 23 Plant Genomes Reveals Sources of Genome Size Variation in the Legume Tribe Fabeae. PLoS One 2015; 10:e0143424. [PMID: 26606051 PMCID: PMC4659654 DOI: 10.1371/journal.pone.0143424] [Citation(s) in RCA: 118] [Impact Index Per Article: 13.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2015] [Accepted: 11/04/2015] [Indexed: 01/30/2023] Open
Abstract
The differential accumulation and elimination of repetitive DNA are key drivers of genome size variation in flowering plants, yet there have been few studies which have analysed how different types of repeats in related species contribute to genome size evolution within a phylogenetic context. This question is addressed here by conducting large-scale comparative analysis of repeats in 23 species from four genera of the monophyletic legume tribe Fabeae, representing a 7.6-fold variation in genome size. Phylogenetic analysis and genome size reconstruction revealed that this diversity arose from genome size expansions and contractions in different lineages during the evolution of Fabeae. Employing a combination of low-pass genome sequencing with novel bioinformatic approaches resulted in identification and quantification of repeats making up 55–83% of the investigated genomes. In turn, this enabled an analysis of how each major repeat type contributed to the genome size variation encountered. Differential accumulation of repetitive DNA was found to account for 85% of the genome size differences between the species, and most (57%) of this variation was found to be driven by a single lineage of Ty3/gypsy LTR-retrotransposons, the Ogre elements. Although the amounts of several other lineages of LTR-retrotransposons and the total amount of satellite DNA were also positively correlated with genome size, their contributions to genome size variation were much smaller (up to 6%). Repeat analysis within a phylogenetic framework also revealed profound differences in the extent of sequence conservation between different repeat types across Fabeae. In addition to these findings, the study has provided a proof of concept for the approach combining recent developments in sequencing and bioinformatics to perform comparative analyses of repetitive DNAs in a large number of non-model species without the need to assemble their genomes.
Collapse
Affiliation(s)
- Jiří Macas
- Biology Centre of the Czech Academy of Sciences, Institute of Plant Molecular Biology, České Budějovice, Czech Republic
- * E-mail:
| | - Petr Novák
- Biology Centre of the Czech Academy of Sciences, Institute of Plant Molecular Biology, České Budějovice, Czech Republic
| | - Jaume Pellicer
- Jodrell Laboratory, Royal Botanic Gardens, Kew, Richmond, Surrey, United Kingdom
| | - Jana Čížková
- Institute of Experimental Botany, Olomouc, Centre of the Region Haná for Biotechnological and Agricultural Research, Olomouc, Czech Republic
| | - Andrea Koblížková
- Biology Centre of the Czech Academy of Sciences, Institute of Plant Molecular Biology, České Budějovice, Czech Republic
| | - Pavel Neumann
- Biology Centre of the Czech Academy of Sciences, Institute of Plant Molecular Biology, České Budějovice, Czech Republic
| | - Iva Fuková
- Biology Centre of the Czech Academy of Sciences, Institute of Plant Molecular Biology, České Budějovice, Czech Republic
| | - Jaroslav Doležel
- Institute of Experimental Botany, Olomouc, Centre of the Region Haná for Biotechnological and Agricultural Research, Olomouc, Czech Republic
| | - Laura J. Kelly
- School of Biological and Chemical Sciences, Queen Mary University of London, London, United Kingdom
| | - Ilia J. Leitch
- Jodrell Laboratory, Royal Botanic Gardens, Kew, Richmond, Surrey, United Kingdom
| |
Collapse
|
26
|
Schatz MC, Maron LG, Stein JC, Hernandez Wences A, Gurtowski J, Biggers E, Lee H, Kramer M, Antoniou E, Ghiban E, Wright MH, Chia JM, Ware D, McCouch SR, McCombie WR. Whole genome de novo assemblies of three divergent strains of rice, Oryza sativa, document novel gene space of aus and indica. Genome Biol 2015; 15:506. [PMID: 25468217 DOI: 10.1186/preaccept-2784872521277375] [Citation(s) in RCA: 101] [Impact Index Per Article: 11.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2014] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The use of high throughput genome-sequencing technologies has uncovered a large extent of structural variation in eukaryotic genomes that makes important contributions to genomic diversity and phenotypic variation. When the genomes of different strains of a given organism are compared, whole genome resequencing data are typically aligned to an established reference sequence. However, when the reference differs in significant structural ways from the individuals under study, the analysis is often incomplete or inaccurate. RESULTS Here, we use rice as a model to demonstrate how improvements in sequencing and assembly technology allow rapid and inexpensive de novo assembly of next generation sequence data into high-quality assemblies that can be directly compared using whole genome alignment to provide an unbiased assessment. Using this approach, we are able to accurately assess the "pan-genome" of three divergent rice varieties and document several megabases of each genome absent in the other two. CONCLUSIONS Many of the genome-specific loci are annotated to contain genes, reflecting the potential for new biological properties that would be missed by standard reference-mapping approaches. We further provide a detailed analysis of several loci associated with agriculturally important traits, including the S5 hybrid sterility locus, the Sub1 submergence tolerance locus, the LRK gene cluster associated with improved yield, and the Pup1 cluster associated with phosphorus deficiency, illustrating the utility of our approach for biological discovery. All of the data and software are openly available to support further breeding and functional studies of rice and other species.
Collapse
|
27
|
Castanera R, Pérez G, López L, Sancho R, Santoyo F, Alfaro M, Gabaldón T, Pisabarro AG, Oguiza JA, Ramírez L. Highly expressed captured genes and cross-kingdom domains present in Helitrons create novel diversity in Pleurotus ostreatus and other fungi. BMC Genomics 2014; 15:1071. [PMID: 25480150 PMCID: PMC4289320 DOI: 10.1186/1471-2164-15-1071] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2014] [Accepted: 11/14/2014] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Helitrons are class-II eukaryotic transposons that transpose via a rolling circle mechanism. Due to their ability to capture and mobilize gene fragments, they play an important role in the evolution of their host genomes. We have used a bioinformatics approach for the identification of helitrons in two Pleurotus ostreatus genomes using de novo detection and homology-based searching. We have analyzed the presence of helitron-captured genes as well as the expansion of helitron-specific helicases in fungi and performed a phylogenetic analysis of their conserved domains with other representative eukaryotic species. RESULTS Our results show the presence of two helitron families in P. ostreatus that disrupt gene colinearity and cause a lack of synteny between their genomes. Both putative autonomous and non-autonomous helitrons were transcriptionally active, and some of them carried highly expressed captured genes of unknown origin and function. In addition, both families contained eukaryotic, bacterial and viral domains within the helitron's boundaries. A phylogenetic reconstruction of RepHel helicases using the Helitron-like and PIF1-like helicase conserved domains revealed a polyphyletic origin for eukaryotic helitrons. CONCLUSION P. ostreatus helitrons display features similar to other eukaryotic helitrons and do not tend to capture host genes or gene fragments. The occurrence of genes probably captured from other hosts inside the helitrons boundaries pose the hypothesis that an ancient horizontal transfer mechanism could have taken place. The viral domains found in some of these genes and the polyphyletic origin of RepHel helicases in the eukaryotic kingdom suggests that virus could have played a role in a putative lateral transfer of helitrons within the eukaryotic kingdom. The high similarity of some helitrons, along with the transcriptional activity of its RepHel helicases indicates that these elements are still active in the genome of P. ostreatus.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | | | | | - Lucía Ramírez
- Department of Agrarian Production, Genetics and Microbiology Research Group, Public University of Navarre, 31006 Pamplona, Navarre, Spain.
| |
Collapse
|
28
|
Schatz MC, Maron LG, Stein JC, Wences AH, Gurtowski J, Biggers E, Lee H, Kramer M, Antoniou E, Ghiban E, Wright MH, Chia JM, Ware D, McCouch SR, McCombie WR. Whole genome de novo assemblies of three divergent strains of rice, Oryza sativa, document novel gene space of aus and indica. Genome Biol 2014. [PMID: 25468217 PMCID: PMC4268812 DOI: 10.1186/s13059-014-0506-z] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022] Open
Abstract
Background The use of high throughput genome-sequencing technologies has uncovered a large extent of structural variation in eukaryotic genomes that makes important contributions to genomic diversity and phenotypic variation. When the genomes of different strains of a given organism are compared, whole genome resequencing data are typically aligned to an established reference sequence. However, when the reference differs in significant structural ways from the individuals under study, the analysis is often incomplete or inaccurate. Results Here, we use rice as a model to demonstrate how improvements in sequencing and assembly technology allow rapid and inexpensive de novo assembly of next generation sequence data into high-quality assemblies that can be directly compared using whole genome alignment to provide an unbiased assessment. Using this approach, we are able to accurately assess the ‘pan-genome’ of three divergent rice varieties and document several megabases of each genome absent in the other two. Conclusions Many of the genome-specific loci are annotated to contain genes, reflecting the potential for new biological properties that would be missed by standard reference-mapping approaches. We further provide a detailed analysis of several loci associated with agriculturally important traits, including the S5 hybrid sterility locus, the Sub1 submergence tolerance locus, the LRK gene cluster associated with improved yield, and the Pup1 cluster associated with phosphorus deficiency, illustrating the utility of our approach for biological discovery. All of the data and software are openly available to support further breeding and functional studies of rice and other species. Electronic supplementary material The online version of this article (doi:10.1186/s13059-014-0506-z) contains supplementary material, which is available to authorized users.
Collapse
|
29
|
Rapid diversification of five Oryza AA genomes associated with rice adaptation. Proc Natl Acad Sci U S A 2014; 111:E4954-62. [PMID: 25368197 DOI: 10.1073/pnas.1418307111] [Citation(s) in RCA: 119] [Impact Index Per Article: 11.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Comparative genomic analyses among closely related species can greatly enhance our understanding of plant gene and genome evolution. We report de novo-assembled AA-genome sequences for Oryza nivara, Oryza glaberrima, Oryza barthii, Oryza glumaepatula, and Oryza meridionalis. Our analyses reveal massive levels of genomic structural variation, including segmental duplication and rapid gene family turnover, with particularly high instability in defense-related genes. We show, on a genomic scale, how lineage-specific expansion or contraction of gene families has led to their morphological and reproductive diversification, thus enlightening the evolutionary process of speciation and adaptation. Despite strong purifying selective pressures on most Oryza genes, we documented a large number of positively selected genes, especially those genes involved in flower development, reproduction, and resistance-related processes. These diversifying genes are expected to have played key roles in adaptations to their ecological niches in Asia, South America, Africa and Australia. Extensive variation in noncoding RNA gene numbers, function enrichment, and rates of sequence divergence might also help account for the different genetic adaptations of these rice species. Collectively, these resources provide new opportunities for evolutionary genomics, numerous insights into recent speciation, a valuable database of functional variation for crop improvement, and tools for efficient conservation of wild rice germplasm.
Collapse
|
30
|
First evidence of mariner-like transposons in the genome of the marine microalga Amphora acutiuscula (Bacillariophyta). Protist 2014; 165:730-44. [PMID: 25250954 DOI: 10.1016/j.protis.2014.08.003] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2014] [Revised: 08/11/2014] [Accepted: 08/18/2014] [Indexed: 12/17/2022]
Abstract
Mariner-like elements (MLEs) are transposable elements able to move in the host genomes by a "cut and paste" mechanism. They have been found in numerous organisms. We succeeded in amplifying complete and truncated MLEs in the marine diatom Amphora acutiuscula. Full-length MLEs of 2,100bp delimited by imperfect Terminal Inverted Repeats revealed an intact Open Reading Frame, suggesting that the MLEs could be active. The DNA binding domain of the corresponding putative transposase could have two Helix-Turn-Helix and a Nuclear Location Site motifs, and its catalytic domain includes a particular triad of aspartic acids DD43D not previously reported. The number of copies was estimated to be 38, including approximately 20 full-length elements. Phylogenetic analysis shows that these peculiar MLEs differ from plant and other stramenopile MLEs and that they could constitute a new sub-family of Tc1-mariner elements.
Collapse
|
31
|
Novák P, Hřibová E, Neumann P, Koblížková A, Doležel J, Macas J. Genome-wide analysis of repeat diversity across the family Musaceae. PLoS One 2014; 9:e98918. [PMID: 24932725 PMCID: PMC4059648 DOI: 10.1371/journal.pone.0098918] [Citation(s) in RCA: 46] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2014] [Accepted: 05/08/2014] [Indexed: 11/18/2022] Open
Abstract
Background The banana family (Musaceae) includes genetically a diverse group of species and their diploid and polyploid hybrids that are widely cultivated in the tropics. In spite of their socio-economic importance, the knowledge of Musaceae genomes is basically limited to draft genome assemblies of two species, Musa acuminata and M. balbisiana. Here we aimed to complement this information by analyzing repetitive genome fractions of six species selected to represent various phylogenetic groups within the family. Results Low-pass sequencing of M. acuminata, M. ornata, M. textilis, M. beccarii, M. balbisiana, and Ensete gilletii genomes was performed using a 454/Roche platform. Sequence reads were subjected to analysis of their overall intra- and inter-specific similarities and, all major repeat families were quantified using graph-based clustering. Maximus/SIRE and Angela lineages of Ty1/copia long terminal repeat (LTR) retrotransposons and the chromovirus lineage of Ty3/gypsy elements were found to make up most of highly repetitive DNA in all species (14–34.5% of the genome). However, there were quantitative differences and sequence variations detected for classified repeat families as well as for the bulk of total repetitive DNA. These differences were most pronounced between species from different taxonomic sections of the Musaceae family, whereas pairs of closely related species (M. acuminata/M. ornata and M. beccarii/M. textilis) shared similar populations of repetitive elements. Conclusions This study provided the first insight into the composition and sequence variation of repetitive parts of Musaceae genomes. It allowed identification of repetitive sequences specific for a single species or a group of species that can be utilized as molecular markers in breeding programs and generated computational resources that will be instrumental in repeat masking and annotation in future genome assembly projects.
Collapse
Affiliation(s)
- Petr Novák
- Biology Centre ASCR, Institute of Plant Molecular Biology, České Budějovice, Czech Republic
- * E-mail:
| | - Eva Hřibová
- Institute of Experimental Botany, Centre of the Region Haná for Biotechnological and Agricultural Research, Olomouc, Czech Republic
| | - Pavel Neumann
- Biology Centre ASCR, Institute of Plant Molecular Biology, České Budějovice, Czech Republic
| | - Andrea Koblížková
- Biology Centre ASCR, Institute of Plant Molecular Biology, České Budějovice, Czech Republic
| | - Jaroslav Doležel
- Institute of Experimental Botany, Centre of the Region Haná for Biotechnological and Agricultural Research, Olomouc, Czech Republic
| | - Jiří Macas
- Biology Centre ASCR, Institute of Plant Molecular Biology, České Budějovice, Czech Republic
| |
Collapse
|
32
|
Li SF, Gao WJ, Zhao XP, Dong TY, Deng CL, Lu LD. Analysis of transposable elements in the genome of Asparagus officinalis from high coverage sequence data. PLoS One 2014; 9:e97189. [PMID: 24810432 PMCID: PMC4014616 DOI: 10.1371/journal.pone.0097189] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2013] [Accepted: 04/16/2014] [Indexed: 11/19/2022] Open
Abstract
Asparagus officinalis is an economically and nutritionally important vegetable crop that is widely cultivated and is used as a model dioecious species to study plant sex determination and sex chromosome evolution. To improve our understanding of its genome composition, especially with respect to transposable elements (TEs), which make up the majority of the genome, we performed Illumina HiSeq2000 sequencing of both male and female asparagus genomes followed by bioinformatics analysis. We generated 17 Gb of sequence (12×coverage) and assembled them into 163,406 scaffolds with a total cumulated length of 400 Mbp, which represent about 30% of asparagus genome. Overall, TEs masked about 53% of the A. officinalis assembly. Majority of the identified TEs belonged to LTR retrotransposons, which constitute about 28% of genomic DNA, with Ty1/copia elements being more diverse and accumulated to higher copy numbers than Ty3/gypsy. Compared with LTR retrotransposons, non-LTR retrotransposons and DNA transposons were relatively rare. In addition, comparison of the abundance of the TE groups between male and female genomes showed that the overall TE composition was highly similar, with only slight differences in the abundance of several TE groups, which is consistent with the relatively recent origin of asparagus sex chromosomes. This study greatly improves our knowledge of the repetitive sequence construction of asparagus, which facilitates the identification of TEs responsible for the early evolution of plant sex chromosomes and is helpful for further studies on this dioecious plant.
Collapse
Affiliation(s)
- Shu-Fen Li
- College of Life Sciences, Henan Normal University, Xinxiang, Henan, China
- Key Laboratory for Microorganisms and Functional Molecules, University of Henan Province, Xinxiang, China
| | - Wu-Jun Gao
- College of Life Sciences, Henan Normal University, Xinxiang, Henan, China
- Key Laboratory for Microorganisms and Functional Molecules, University of Henan Province, Xinxiang, China
- * E-mail:
| | - Xin-Peng Zhao
- College of Life Sciences, Henan Normal University, Xinxiang, Henan, China
| | - Tian-Yu Dong
- College of Life Sciences, Henan Normal University, Xinxiang, Henan, China
| | - Chuan-Liang Deng
- College of Life Sciences, Henan Normal University, Xinxiang, Henan, China
- Key Laboratory for Microorganisms and Functional Molecules, University of Henan Province, Xinxiang, China
| | - Long-Dou Lu
- College of Life Sciences, Henan Normal University, Xinxiang, Henan, China
- Key Laboratory for Microorganisms and Functional Molecules, University of Henan Province, Xinxiang, China
| |
Collapse
|
33
|
Ferguson AA, Zhao D, Jiang N. Selective acquisition and retention of genomic sequences by Pack-Mutator-like elements based on guanine-cytosine content and the breadth of expression. PLANT PHYSIOLOGY 2013; 163:1419-32. [PMID: 24028844 PMCID: PMC3813661 DOI: 10.1104/pp.113.223271] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/07/2023]
Abstract
The process of gene duplication followed by sequence and functional divergence is important for the generation of new genes. Pack-MULEs, nonautonomous Mutator-like elements (MULEs) that carry genic sequence(s), are potentially involved in generating new open reading frames and regulating parental gene expression. These elements are identified in many plant genomes and are most abundant in rice (Oryza sativa). Despite the abundance of Pack-MULEs, the mechanism by which parental genes are captured by Pack-MULEs remains largely unknown. In this study, we identified all MULEs in rice and examined factors likely important for sequence acquisition. Terminal inverted repeat MULEs are the predominant MULE type and account for the majority of the Pack-MULEs. In addition to genic sequences, rice MULEs capture guanine-cytosine (GC)-rich intergenic sequences, albeit at a much lower frequency. MULEs carrying nontransposon sequences have longer terminal inverted repeats and higher GC content in terminal and subterminal regions. An overrepresentation of genes with known functions and genes with orthologs among parental genes of Pack-MULEs is observed in rice, maize (Zea mays), and Arabidopsis (Arabidopsis thaliana), suggesting preferential acquisition for bona fide genes by these elements. Pack-MULEs selectively acquire/retain parental sequences through a combined effect of GC content and breadth of expression, with GC content playing a stronger role. Increased GC content and number of tissues with detectable expression result in higher chances of a gene being acquired by Pack-MULEs. Such selective acquisition/retention provides these elements greater chances of carrying functional sequences that may provide new genetic resources for the evolution of new genes or the modification of existing genes.
Collapse
|
34
|
Lee SI, Park KC, Son JH, Hwang YJ, Lim KB, Song YS, Kim JH, Kim NS. Isolation and characterization of novel Ty1-copia-like retrotransposons from lily. Genome 2013; 56:495-503. [DOI: 10.1139/gen-2013-0088] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/27/2023]
Abstract
Species of the genus Lilium are well known for their large genomes. Although expansion of noncoding repeated DNA is believed to account for this genome size, retroelement del Ty3-gypsy is the only one described so far in the genus Lilium. We isolated Ty1-copia elements from Lilium longiflorum and named them LIREs (lily retrotransposons). The long terminal repeats, primer binding site, and polypurine tract sequences are highly similar among the LIRE elements, indicating that they are in the same lineage. Although the protein-coding regions were highly decayed, the sequence motifs of the integrase, reverse transcriptase, and RNase H domains were identifiable as belonging to the order of Ty1-copia elements. Phylogenetic analysis and primer binding site sequences revealed that these elements belonged to the Ale lineage among the six lineages of plant Ty1-copia elements. Base substitutions in the long terminal repeats estimated that the integration times of the LIRE Ty1-copia elements were between 0.7 and 5.5 mya. In situ hybridization showed that the LIRE elements were present in all the chromosomes of L. longiflorum and L. lancifolium, but absent in centromeres, telomeres, and 45S rRNA sites in both species. The LIRE elements were present very abundantly in species of the genus Lilium, but absent in other genera of the family Liliaceae, implying that the LIRE elements might have contributed to the expansion of the genome in the genus Lilium.
Collapse
Affiliation(s)
- Sung-Il Lee
- BK21 Training Program, Department of Molecular Bioscience, Kangwon National University, Chuncheon, 200-701, Korea
- Department of Molecular Bioscience, Kangwon National University, Chuncheon, 200-701, Korea
| | - Kyong-Cheul Park
- Institute of Biosciences and Biotechnology, Kangwon National University, Chuncheon, 200-701, Korea
| | - Jae-Han Son
- BK21 Training Program, Department of Molecular Bioscience, Kangwon National University, Chuncheon, 200-701, Korea
- Department of Molecular Bioscience, Kangwon National University, Chuncheon, 200-701, Korea
| | - Youn-Jung Hwang
- Department of Horticulture, Kyungbook National University, Daegu, Korea
| | - Ki-Byung Lim
- Department of Horticulture, Kyungbook National University, Daegu, Korea
| | - Ye-Su Song
- Department of Horticulture, Kangwon National University, Chuncheon, 200-701, Korea
| | - Jong-Hwa Kim
- Department of Horticulture, Kangwon National University, Chuncheon, 200-701, Korea
| | - Nam-Soo Kim
- BK21 Training Program, Department of Molecular Bioscience, Kangwon National University, Chuncheon, 200-701, Korea
- Institute of Biosciences and Biotechnology, Kangwon National University, Chuncheon, 200-701, Korea
| |
Collapse
|
35
|
Shangguan L, Han J, Kayesh E, Sun X, Zhang C, Pervaiz T, Wen X, Fang J. Evaluation of genome sequencing quality in selected plant species using expressed sequence tags. PLoS One 2013; 8:e69890. [PMID: 23922843 PMCID: PMC3726750 DOI: 10.1371/journal.pone.0069890] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2013] [Accepted: 06/14/2013] [Indexed: 02/02/2023] Open
Abstract
BACKGROUND With the completion of genome sequencing projects for more than 30 plant species, large volumes of genome sequences have been produced and stored in online databases. Advancements in sequencing technologies have reduced the cost and time of whole genome sequencing enabling more and more plants to be subjected to genome sequencing. Despite this, genome sequence qualities of multiple plants have not been evaluated. METHODOLOGY/PRINCIPAL FINDING Integrity and accuracy were calculated to evaluate the genome sequence quality of 32 plants. The integrity of a genome sequence is presented by the ratio of chromosome size and genome size (or between scaffold size and genome size), which ranged from 55.31% to nearly 100%. The accuracy of genome sequence was presented by the ratio between matched EST and selected ESTs where 52.93% ∼ 98.28% and 89.02% ∼ 98.85% of the randomly selected clean ESTs could be mapped to chromosome and scaffold sequences, respectively. According to the integrity, accuracy and other analysis of each plant species, thirteen plant species were divided into four levels. Arabidopsis thaliana, Oryza sativa and Zea mays had the highest quality, followed by Brachypodium distachyon, Populus trichocarpa, Vitis vinifera and Glycine max, Sorghum bicolor, Solanum lycopersicum and Fragaria vesca, and Lotus japonicus, Medicago truncatula and Malus × domestica in that order. Assembling the scaffold sequences into chromosome sequences should be the primary task for the remaining nineteen species. Low GC content and repeat DNA influences genome sequence assembly. CONCLUSION The quality of plant genome sequences was found to be lower than envisaged and thus the rapid development of genome sequencing projects as well as research on bioinformatics tools and the algorithms of genome sequence assembly should provide increased processing and correction of genome sequences that have already been published.
Collapse
Affiliation(s)
- Lingfei Shangguan
- College of Horticulture, Nanjing Agricultural University, Nanjing City, Jiangsu Province, China
| | - Jian Han
- College of Horticulture, Nanjing Agricultural University, Nanjing City, Jiangsu Province, China
| | - Emrul Kayesh
- College of Horticulture, Nanjing Agricultural University, Nanjing City, Jiangsu Province, China
| | - Xin Sun
- College of Horticulture, Nanjing Agricultural University, Nanjing City, Jiangsu Province, China
| | - Changqing Zhang
- College of Horticulture, Jinling Institute of Technology, Nanjing City, Jiangsu Province, China
| | - Tariq Pervaiz
- College of Horticulture, Nanjing Agricultural University, Nanjing City, Jiangsu Province, China
| | - Xicheng Wen
- College of Horticulture, Nanjing Agricultural University, Nanjing City, Jiangsu Province, China
| | - Jinggui Fang
- College of Horticulture, Nanjing Agricultural University, Nanjing City, Jiangsu Province, China
| |
Collapse
|
36
|
Liu D, Zeng SH, Chen JJ, Zhang YJ, Xiao G, Zhu LY, Wang Y. First insights into the large genome of Epimedium sagittatum (Sieb. et Zucc) Maxim, a Chinese Ttaditional medicinal plant. Int J Mol Sci 2013; 14:13559-76. [PMID: 23807511 PMCID: PMC3742203 DOI: 10.3390/ijms140713559] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2013] [Revised: 05/16/2013] [Accepted: 06/06/2013] [Indexed: 11/30/2022] Open
Abstract
Epimedium sagittatum (Sieb. et Zucc) Maxim is a member of the Berberidaceae family of basal eudicot plants, widely distributed and used as a traditional medicinal plant in China for therapeutic effects on many diseases with a long history. Recent data shows that E. sagittatum has a relatively large genome, with a haploid genome size of ~4496 Mbp, divided into a small number of only 12 diploid chromosomes (2n = 2x = 12). However, little is known about Epimedium genome structure and composition. Here we present the analysis of 691 kb of high-quality genomic sequence derived from 672 randomly selected plasmid clones of E. sagittatum genomic DNA, representing ~0.0154% of the genome. The sampled sequences comprised at least 78.41% repetitive DNA elements and 2.51% confirmed annotated gene sequences, with a total GC% content of 39%. Retrotransposons represented the major class of transposable element (TE) repeats identified (65.37% of all TE repeats), particularly LTR (Long Terminal Repeat) retrotransposons (52.27% of all TE repeats). Chromosome analysis and Fluorescence in situ Hybridization of Gypsy-Ty3 retrotransposons were performed to survey the E. sagittatum genome at the cytological level. Our data provide the first insights into the composition and structure of the E. sagittatum genome, and will facilitate the functional genomic analysis of this valuable medicinal plant.
Collapse
Affiliation(s)
- Di Liu
- Key Laboratory of Plant Germplasm Enhancement and Specialty Agriculture, Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan 430074, China; E-Mails: (D.L.); (J.-J.C.); (Y.-J.Z.); (G.X.)
- University of Chinese Academy of Sciences, Beijing 100039, China
| | - Shao-Hua Zeng
- Key Laboratory of Plant Resources Conservation and Sustainable Utilization, South China Botanical Garden, Chinese Academy of Sciences, Guangzhou 510650, China; E-Mail:
| | - Jian-Jun Chen
- Key Laboratory of Plant Germplasm Enhancement and Specialty Agriculture, Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan 430074, China; E-Mails: (D.L.); (J.-J.C.); (Y.-J.Z.); (G.X.)
| | - Yan-Jun Zhang
- Key Laboratory of Plant Germplasm Enhancement and Specialty Agriculture, Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan 430074, China; E-Mails: (D.L.); (J.-J.C.); (Y.-J.Z.); (G.X.)
| | - Gong Xiao
- Key Laboratory of Plant Germplasm Enhancement and Specialty Agriculture, Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan 430074, China; E-Mails: (D.L.); (J.-J.C.); (Y.-J.Z.); (G.X.)
- University of Chinese Academy of Sciences, Beijing 100039, China
| | - Lin-Yao Zhu
- Wuhan Vegetable Research Station, Wuhan 430065, China; E-Mail:
| | - Ying Wang
- Key Laboratory of Plant Germplasm Enhancement and Specialty Agriculture, Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan 430074, China; E-Mails: (D.L.); (J.-J.C.); (Y.-J.Z.); (G.X.)
| |
Collapse
|
37
|
Whole-genome sequencing of Oryza brachyantha reveals mechanisms underlying Oryza genome evolution. Nat Commun 2013; 4:1595. [PMID: 23481403 PMCID: PMC3615480 DOI: 10.1038/ncomms2596] [Citation(s) in RCA: 139] [Impact Index Per Article: 12.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2012] [Accepted: 02/11/2013] [Indexed: 01/12/2023] Open
Abstract
The wild species of the genus Oryza contain a largely untapped reservoir of agronomically important genes for rice improvement. Here we report the 261-Mb de novo assembled genome sequence of Oryza brachyantha. Low activity of long-terminal repeat retrotransposons and massive internal deletions of ancient long-terminal repeat elements lead to the compact genome of Oryza brachyantha. We model 32,038 protein-coding genes in the Oryza brachyantha genome, of which only 70% are located in collinear positions in comparison with the rice genome. Analysing breakpoints of non-collinear genes suggests that double-strand break repair through non-homologous end joining has an important role in gene movement and erosion of collinearity in the Oryza genomes. Transition of euchromatin to heterochromatin in the rice genome is accompanied by segmental and tandem duplications, further expanded by transposable element insertions. The high-quality reference genome sequence of Oryza brachyantha provides an important resource for functional and evolutionary studies in the genus Oryza. The wild rice species can be used as germplasm resources for this crop’s genetic improvement. Here Chen and colleagues report the de novo sequencing of the O. brachyantha genome, and identify the origin of genome size variation, the role of gene movement and its implications on heterochromatin evolution in the rice genome.
Collapse
|
38
|
Han MJ, Shen YH, Xu MS, Liang HY, Zhang HH, Zhang Z. Identification and evolution of the silkworm helitrons and their contribution to transcripts. DNA Res 2013; 20:471-84. [PMID: 23771679 PMCID: PMC3789558 DOI: 10.1093/dnares/dst024] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023] Open
Abstract
In this study, we developed a structure-based approach to identify Helitrons in four lepidopterans and systematically analysed Helitrons in the silkworm genome. We found that the content of Helitrons varied greatly among genomes. The silkworm genome harboured 67,555 Helitron-related sequences that could be classified into 21 families and accounted for ≈ 4.23% of the genome. Thirteen of the families were new. Three families were putatively autonomous and included the replication initiator motif and helicase domain. The silkworm Helitrons were widely and randomly distributed in the genome. Most Helitron families radiated within the past 2 million years and experienced a single burst of expansion. These Helitron families captured 3724 gene fragments and contributed to at least 1.4% of the silkworm full-length cDNAs, suggesting important roles of Helitrons in the evolution of the silkworm genes. In addition, we found that some new Helitrons were generated by combinations of other Helitrons. Overall, the results presented in this study provided insights into the generation and evolution of Helitron transposons and their contribution to transcripts.
Collapse
Affiliation(s)
- Min-Jin Han
- 1State Key Laboratory of Silkworm Genome Biology, The Key Sericultural Laboratory of Agricultural Ministry, Southwest University, Chongqing 400715, China
| | | | | | | | | | | |
Collapse
|
39
|
Hertweck KL. Assembly and comparative analysis of transposable elements from low coverage genomic sequence data in Asparagales. Genome 2013; 56:487-94. [PMID: 24168669 DOI: 10.1139/gen-2013-0042] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
Abstract
The research field of comparative genomics is moving from a focus on genes to a more holistic view including the repetitive complement. This study aimed to characterize relative proportions of the repetitive fraction of large, complex genomes in a nonmodel system. The monocotyledonous plant order Asparagales (onion, asparagus, agave) comprises some of the largest angiosperm genomes and represents variation in both genome size and structure (karyotype). Anonymous, low coverage, single-end Illumina data from 11 exemplar Asparagales taxa were assembled using a de novo method. Resulting contigs were annotated using a reference library of available monocot repetitive sequences. Mapping reads to contigs provided rough estimates of relative proportions of each type of transposon in the nuclear genome. The results were parsed into general repeat types and synthesized with genome size estimates and a phylogenetic context to describe the pattern of transposable element evolution among these lineages. The major finding is that although some lineages in Asparagales exhibit conservation in repeat proportions, there is generally wide variation in types and frequency of repeats. This approach is an appropriate first step in characterizing repeats in evolutionary lineages with a paucity of genomic resources.
Collapse
Affiliation(s)
- Kate L Hertweck
- National Evolutionary Synthesis Center, 2024 West Main Street, Suite A200, Durham, NC 27705, USA
| |
Collapse
|
40
|
Hastie AR, Dong L, Smith A, Finklestein J, Lam ET, Huo N, Cao H, Kwok PY, Deal KR, Dvorak J, Luo MC, Gu Y, Xiao M. Rapid genome mapping in nanochannel arrays for highly complete and accurate de novo sequence assembly of the complex Aegilops tauschii genome. PLoS One 2013; 8:e55864. [PMID: 23405223 PMCID: PMC3566107 DOI: 10.1371/journal.pone.0055864] [Citation(s) in RCA: 123] [Impact Index Per Article: 11.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2012] [Accepted: 01/03/2013] [Indexed: 02/04/2023] Open
Abstract
Next-generation sequencing (NGS) technologies have enabled high-throughput and low-cost generation of sequence data; however, de novo genome assembly remains a great challenge, particularly for large genomes. NGS short reads are often insufficient to create large contigs that span repeat sequences and to facilitate unambiguous assembly. Plant genomes are notorious for containing high quantities of repetitive elements, which combined with huge genome sizes, makes accurate assembly of these large and complex genomes intractable thus far. Using two-color genome mapping of tiling bacterial artificial chromosomes (BAC) clones on nanochannel arrays, we completed high-confidence assembly of a 2.1-Mb, highly repetitive region in the large and complex genome of Aegilops tauschii, the D-genome donor of hexaploid wheat (Triticum aestivum). Genome mapping is based on direct visualization of sequence motifs on single DNA molecules hundreds of kilobases in length. With the genome map as a scaffold, we anchored unplaced sequence contigs, validated the initial draft assembly, and resolved instances of misassembly, some involving contigs <2 kb long, to dramatically improve the assembly from 75% to 95% complete.
Collapse
Affiliation(s)
- Alex R. Hastie
- BioNano Genomics, San Diego, California, United States of America
| | - Lingli Dong
- Genomics and Gene Discovery Research Unit, United States Department of Agriculture - Agricultural Research Service, Albany, California, United States of America
- Department of Plant Sciences, University of California Davis, Davis, California, United States of America
| | - Alexis Smith
- BioNano Genomics, San Diego, California, United States of America
| | - Jeff Finklestein
- BioNano Genomics, San Diego, California, United States of America
| | - Ernest T. Lam
- BioNano Genomics, San Diego, California, United States of America
| | - Naxin Huo
- Genomics and Gene Discovery Research Unit, United States Department of Agriculture - Agricultural Research Service, Albany, California, United States of America
- Department of Plant Sciences, University of California Davis, Davis, California, United States of America
| | - Han Cao
- BioNano Genomics, San Diego, California, United States of America
| | - Pui-Yan Kwok
- Institute for Human Genetics, University of California San Francisco, San Francisco, California, United States of America
| | - Karin R. Deal
- Department of Plant Sciences, University of California Davis, Davis, California, United States of America
| | - Jan Dvorak
- Department of Plant Sciences, University of California Davis, Davis, California, United States of America
| | - Ming-Cheng Luo
- Department of Plant Sciences, University of California Davis, Davis, California, United States of America
| | - Yong Gu
- Genomics and Gene Discovery Research Unit, United States Department of Agriculture - Agricultural Research Service, Albany, California, United States of America
- Department of Plant Sciences, University of California Davis, Davis, California, United States of America
- * E-mail: (MX); (YG)
| | - Ming Xiao
- BioNano Genomics, San Diego, California, United States of America
- * E-mail: (MX); (YG)
| |
Collapse
|
41
|
Estep MC, DeBarry JD, Bennetzen JL. The dynamics of LTR retrotransposon accumulation across 25 million years of panicoid grass evolution. Heredity (Edinb) 2013; 110:194-204. [PMID: 23321774 PMCID: PMC3554455 DOI: 10.1038/hdy.2012.99] [Citation(s) in RCA: 61] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2012] [Revised: 10/22/2012] [Accepted: 10/23/2012] [Indexed: 11/09/2022] Open
Abstract
Sample sequence analysis was employed to investigate the repetitive DNAs that were most responsible for the evolved variation in genome content across seven panicoid grasses with >5-fold variation in genome size and different histories of polyploidy. In all cases, the most abundant repeats were LTR retrotransposons, but the particular families that had become dominant were found to be different in the Pennisetum, Saccharum, Sorghum and Zea lineages. One element family, Huck, has been very active in all of the studied species over the last few million years. This suggests the transmittal of an active or quiescent autonomous set of Huck elements to this lineage at the founding of the panicoids. Similarly, independent recent activity of Ji and Opie elements in Zea and of Leviathan elements in Sorghum and Saccharum species suggests that members of these families with exceptional activation potential were present in the genome(s) of the founders of these lineages. In a detailed analysis of the Zea lineage, the combined action of several families of LTR retrotransposons were observed to have approximately doubled the genome size of Zea luxurians relative to Zea mays and Zea diploperennis in just the last few million years. One of the LTR retrotransposon amplification bursts in Zea may have been initiated by polyploidy, but the great majority of transposable element activations are not. Instead, the results suggest random activation of a few or many LTR retrotransposons families in particular lineages over evolutionary time, with some families especially prone to future activation and hyper-amplification.
Collapse
Affiliation(s)
- M C Estep
- Department of Genetics, University of Georgia, Athens, GA, USA
| | - J D DeBarry
- Department of Genetics, University of Georgia, Athens, GA, USA
| | - J L Bennetzen
- Department of Genetics, University of Georgia, Athens, GA, USA
| |
Collapse
|
42
|
Piednoël M, Aberer AJ, Schneeweiss GM, Macas J, Novak P, Gundlach H, Temsch EM, Renner SS. Next-generation sequencing reveals the impact of repetitive DNA across phylogenetically closely related genomes of Orobanchaceae. Mol Biol Evol 2012; 29:3601-11. [PMID: 22723303 PMCID: PMC3859920 DOI: 10.1093/molbev/mss168] [Citation(s) in RCA: 66] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022] Open
Abstract
We used next-generation sequencing to characterize the genomes of nine species of Orobanchaceae of known phylogenetic relationships, different life forms, and including a polyploid species. The study species are the autotrophic, nonparasitic Lindenbergia philippensis, the hemiparasitic Schwalbea americana, and seven nonphotosynthetic parasitic species of Orobanche (Orobanche crenata, Orobanche cumana, Orobanche gracilis (tetraploid), and Orobanche pancicii) and Phelipanche (Phelipanche lavandulacea, Phelipanche purpurea, and Phelipanche ramosa). Ty3/Gypsy elements comprise 1.93%-28.34% of the nine genomes and Ty1/Copia elements comprise 8.09%-22.83%. When compared with L. philippensis and S. americana, the nonphotosynthetic species contain higher proportions of repetitive DNA sequences, perhaps reflecting relaxed selection on genome size in parasitic organisms. Among the parasitic species, those in the genus Orobanche have smaller genomes but higher proportions of repetitive DNA than those in Phelipanche, mostly due to a diversification of repeats and an accumulation of Ty3/Gypsy elements. Genome downsizing in the tetraploid O. gracilis probably led to sequence loss across most repeat types.
Collapse
Affiliation(s)
- Mathieu Piednoël
- Systematic Botany and Mycology, University of Munich (LMU), Munich, Germany
| | - Andre J. Aberer
- Scientific Computing Group, Heidelberg Institute for Theoretical Studies, Heidelberg, Germany
| | - Gerald M. Schneeweiss
- Department of Systematic and Evolutionary Botany, University of Vienna, Vienna, Austria
| | - Jiri Macas
- Institute of Plant Molecular Biology, Biology Centre ASCR, České Budějovice, Czech Republic
| | - Petr Novak
- Institute of Plant Molecular Biology, Biology Centre ASCR, České Budějovice, Czech Republic
| | - Heidrun Gundlach
- Institute for Bioinformatics and System Biology, Helmholtz Center Munich German Research Center for Environmental Health, Neuherberg, Germany
| | - Eva M. Temsch
- Department of Systematic and Evolutionary Botany, University of Vienna, Vienna, Austria
| | - Susanne S. Renner
- Systematic Botany and Mycology, University of Munich (LMU), Munich, Germany
| |
Collapse
|
43
|
Chénais B, Caruso A, Hiard S, Casse N. The impact of transposable elements on eukaryotic genomes: from genome size increase to genetic adaptation to stressful environments. Gene 2012; 509:7-15. [PMID: 22921893 DOI: 10.1016/j.gene.2012.07.042] [Citation(s) in RCA: 195] [Impact Index Per Article: 16.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2012] [Revised: 07/16/2012] [Accepted: 07/25/2012] [Indexed: 11/25/2022]
Abstract
Transposable elements (TEs) are present in roughly all genomes. These mobile DNA sequences are able to invade genomes and their impact on genome evolution is substantial. The mobility of TEs can induce the appearance of deleterious mutations, gene disruption and chromosome rearrangements, but transposition activity also has positive aspects and the mutational activities of TEs contribute to the genetic diversity of organisms. This short review aims to give a brief overview of the impact TEs may have on animal and plant genome structure and expression, and the relationship between TEs and the stress response of organisms, including insecticide resistance.
Collapse
Affiliation(s)
- Benoît Chénais
- Université du Maine, EA2160 Mer Molécules Santé, UFR Sciences et Techniques, Avenue Olivier Messiaen, F-72085 Le Mans, France.
| | | | | | | |
Collapse
|
44
|
Seibt KM, Wenke T, Wollrab C, Junghans H, Muders K, Dehmer KJ, Diekmann K, Schmidt T. Development and application of SINE-based markers for genotyping of potato varieties. TAG. THEORETICAL AND APPLIED GENETICS. THEORETISCHE UND ANGEWANDTE GENETIK 2012; 125:185-196. [PMID: 22371142 DOI: 10.1007/s00122-012-1825-7] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/11/2011] [Accepted: 02/11/2012] [Indexed: 05/31/2023]
Abstract
Potato variety discrimination based on morphological traits is laborious and influenced by the environment, while currently applied molecular markers are either expensive or time-consuming in development or application. SINEs, short interspersed nuclear elements, are retrotransposons with a high copy number in plant genomes representing a potential source for new markers. We developed a marker system for potato genotyping, designated inter-SINE amplified polymorphism (ISAP). Based on nine potato SINE families recently characterized (Wenke et al. in Plant Cell 23:3117-3128, 2011), we designed species-specific SINE primers. From the resulting 153 primer combinations, highly informative primer sets were selected for potato variety analysis regarding number of bands, quality of the banding pattern, and the degree of polymorphism. Fragments representing ISAPs can be separated by conventional agarose gel electrophoresis; however, automation with a capillary sequencer is feasible. Two selected SINE families, SolS-IIIa and SolS-IV, were shown to be highly but differently amplified in Solanaceae, Solaneae tribe, including wild and cultivated potatoes, tomato, and eggplant. Fluorescent in situ hybridization demonstrated the genome-wide distribution of SolS-IIIa and SolS-IV along potato chromosomes, which is the basis for genotype discrimination and differentiation of somaclonal variants by ISAP markers.
Collapse
Affiliation(s)
- Kathrin M Seibt
- Institute of Botany, Dresden University of Technology, 01062, Dresden, Germany
| | | | | | | | | | | | | | | |
Collapse
|
45
|
Ai B, Wang ZS, Ge S. GENOME SIZE IS NOT CORRELATED WITH EFFECTIVE POPULATION SIZE IN THEORYZASPECIES. Evolution 2012; 66:3302-10. [DOI: 10.1111/j.1558-5646.2012.01674.x] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
46
|
Steinbauerová V, Neumann P, Novák P, Macas J. A widespread occurrence of extra open reading frames in plant Ty3/gypsy retrotransposons. Genetica 2012; 139:1543-55. [PMID: 22544262 DOI: 10.1007/s10709-012-9654-9] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2012] [Accepted: 04/16/2012] [Indexed: 01/21/2023]
Abstract
Long terminal repeat (LTR) retrotransposons make up substantial parts of most higher plant genomes where they accumulate due to their replicative mode of transposition. Although the transposition is facilitated by proteins encoded within the gag-pol region which is common to all autonomous elements, some LTR retrotransposons were found to potentially carry an additional protein coding capacity represented by extra open reading frames located upstream or downstream of gag-pol. In this study, we performed a comprehensive in silico survey and comparative analysis of these extra open reading frames (ORFs) in the group of Ty3/gypsy LTR retrotransposons as the first step towards our understanding of their origin and function. We found that extra ORFs occur in all three major lineages of plant Ty3/gypsy elements, being the most frequent in the Tat lineage where most (77 %) of identified elements contained extra ORFs. This lineage was also characterized by the highest diversity of extra ORF arrangement (position and orientation) within the elements. On the other hand, all of these ORFs could be classified into only two broad groups based on their mutual similarities or the presence of short conserved motifs in their inferred protein sequences. In the Athila lineage, the extra ORFs were confined to the element 3' regions but they displayed much higher sequence diversity compared to those found in Tat. In the lineage of Chromoviruses the extra ORFs were relatively rare, occurring only in 5' regions of a group of elements present in a single plant family (Poaceae). In all three lineages, most extra ORFs lacked sequence similarities to characterized gene sequences or functional protein domains, except for two Athila-like elements with similarities to LOGL4 gene and part of the Chromoviruses extra ORFs that displayed partial similarity to histone H3 gene. Thus, in these cases the extra ORFs most likely originated by transduction or recombination of cellular gene sequences. In addition, the protein domain which is otherwise associated with DNA transposons have been detected in part of the Tat-like extra ORFs, pointing to their origin from an insertion event of a mobile element.
Collapse
Affiliation(s)
- Veronika Steinbauerová
- Institute of Plant Molecular Biology, Biology Centre ASCR, Branišovská 31, Ceske Budejovice, Czech Republic
| | | | | | | |
Collapse
|
47
|
Cenci A, Combes MC, Lashermes P. Genome evolution in diploid and tetraploid Coffea species as revealed by comparative analysis of orthologous genome segments. PLANT MOLECULAR BIOLOGY 2012; 78:135-45. [PMID: 22086332 DOI: 10.1007/s11103-011-9852-3] [Citation(s) in RCA: 40] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/07/2011] [Accepted: 11/04/2011] [Indexed: 05/10/2023]
Abstract
Sequence comparison of orthologous regions enables estimation of the divergence between genomes, analysis of their evolution and detection of particular features of the genomes, such as sequence rearrangements and transposable elements. Despite the economic importance of Coffea species, little genomic information is currently available. Coffea is a relatively young genus that includes more than one hundred diploid species and a single tetraploid species. Three Coffea orthologous regions of 470-900 kb were analyzed and compared: both subgenomes of allotetraploid Coffea arabica (contributed by the diploid species Coffea eugenioides and Coffea canephora) and the genome of diploid C. canephora. Sequence divergence was calculated on global alignments or on coding and non-coding sequences separately. A search for transposable elements detected 43 retrotransposons and 198 transposons in the sequences analyzed. Comparative insertion analysis made it possible to locate 165 TE insertions in the phylogenetic tree of the three genomes/subgenomes. In the tetraploid C. arabica, a homoeologous non-reciprocal transposition (HNRT) was detected and characterized: a 50 kb region of the C. eugenioides derived subgenome replaced the C. canephora derived counterpart. Comparative sequence analysis on three Coffea genomes/subgenomes revealed almost perfect gene synteny, low sequence divergence and a high number of shared transposable elements. Compared to the results of similar analysis in other genera (Aegilops/Triticum and Oryza), Coffea genomes/subgenomes appeared to be dramatically less diverged, which is consistent with the relatively recent radiation of the Coffea genus. Based on nucleotide substitution frequency, the HNRT was dated at 10,000-50,000 years BP, which is also the most recent estimation of the origin of C. arabica.
Collapse
Affiliation(s)
- Alberto Cenci
- IRD-Institut de Recherche pour le Développement, UMR RPB (CIRAD, IRD, Université Montpellier II), BP 64501, Montpellier, France.
| | | | | |
Collapse
|
48
|
Sun C, Shepard DB, Chong RA, López Arriaza J, Hall K, Castoe TA, Feschotte C, Pollock DD, Mueller RL. LTR retrotransposons contribute to genomic gigantism in plethodontid salamanders. Genome Biol Evol 2011; 4:168-83. [PMID: 22200636 PMCID: PMC3318908 DOI: 10.1093/gbe/evr139] [Citation(s) in RCA: 121] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/22/2011] [Indexed: 01/20/2023] Open
Abstract
Among vertebrates, most of the largest genomes are found within the salamanders, a clade of amphibians that includes 613 species. Salamander genome sizes range from ~14 to ~120 Gb. Because genome size is correlated with nucleus and cell sizes, as well as other traits, morphological evolution in salamanders has been profoundly affected by genomic gigantism. However, the molecular mechanisms driving genomic expansion in this clade remain largely unknown. Here, we present the first comparative analysis of transposable element (TE) content in salamanders. Using high-throughput sequencing, we generated genomic shotgun data for six species from the Plethodontidae, the largest family of salamanders. We then developed a pipeline to mine TE sequences from shotgun data in taxa with limited genomic resources, such as salamanders. Our summaries of overall TE abundance and diversity for each species demonstrate that TEs make up a substantial portion of salamander genomes, and that all of the major known types of TEs are represented in salamanders. The most abundant TE superfamilies found in the genomes of our six focal species are similar, despite substantial variation in genome size. However, our results demonstrate a major difference between salamanders and other vertebrates: salamander genomes contain much larger amounts of long terminal repeat (LTR) retrotransposons, primarily Ty3/gypsy elements. Thus, the extreme increase in genome size that occurred in salamanders was likely accompanied by a shift in TE landscape. These results suggest that increased proliferation of LTR retrotransposons was a major molecular mechanism contributing to genomic expansion in salamanders.
Collapse
Affiliation(s)
- Cheng Sun
- Department of Biology, Colorado State University
| | - Donald B. Shepard
- Department of Biology, Colorado State University
- Current address: Department of Fisheries, Wildlife and Conservation Biology; University of Minnesota
| | | | | | - Kathryn Hall
- Department of Biochemistry and Molecular Genetics, University of Colorado School of Medicine
| | - Todd A. Castoe
- Department of Biochemistry and Molecular Genetics, University of Colorado School of Medicine
| | | | - David D. Pollock
- Department of Biochemistry and Molecular Genetics, University of Colorado School of Medicine
| | | |
Collapse
|
49
|
Macas J, Kejnovský E, Neumann P, Novák P, Koblížková A, Vyskot B. Next generation sequencing-based analysis of repetitive DNA in the model dioecious [corrected] plant Silene latifolia. PLoS One 2011; 6:e27335. [PMID: 22096552 PMCID: PMC3212565 DOI: 10.1371/journal.pone.0027335] [Citation(s) in RCA: 82] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2011] [Accepted: 10/14/2011] [Indexed: 01/04/2023] Open
Abstract
Background Silene latifolia is a dioceous plant with well distinguished X and Y chromosomes that is used as a model to study sex determination and sex chromosome evolution in plants. However, efficient utilization of this species has been hampered by the lack of large-scale sequencing resources and detailed analysis of its genome composition, especially with respect to repetitive DNA, which makes up the majority of the genome. Methodology/Principal Findings We performed low-pass 454 sequencing followed by similarity-based clustering of 454 reads in order to identify and characterize sequences of all major groups of S. latifolia repeats. Illumina sequencing data from male and female genomes were also generated and employed to quantify the genomic proportions of individual repeat families. The majority of identified repeats belonged to LTR-retrotransposons, constituting about 50% of genomic DNA, with Ty3/gypsy elements being more frequent than Ty1/copia. While there were differences between the male and female genome in the abundance of several repeat families, their overall repeat composition was highly similar. Specific localization patterns on sex chromosomes were found for several satellite repeats using in situ hybridization with probes based on k-mer frequency analysis of Illumina sequencing data. Conclusions/Significance This study provides comprehensive information about the sequence composition and abundance of repeats representing over 60% of the S. latifolia genome. The results revealed generally low divergence in repeat composition between the sex chromosomes, which is consistent with their relatively recent origin. In addition, the study generated various data resources that are available for future exploration of the S. latifolia genome.
Collapse
Affiliation(s)
- Jiří Macas
- Biology Centre of the Academy of Sciences of the Czech Republic, Institute of Plant Molecular Biology, České Budějovice, Czech Republic.
| | | | | | | | | | | |
Collapse
|
50
|
Fang SA, Eu TI, Chung MC. Isolation and characterization of genome-specific markers in Oryza species with the BB genome. PLANT SCIENCE : AN INTERNATIONAL JOURNAL OF EXPERIMENTAL PLANT BIOLOGY 2011; 181:300-308. [PMID: 21763541 DOI: 10.1016/j.plantsci.2011.06.004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/16/2010] [Revised: 05/18/2011] [Accepted: 06/06/2011] [Indexed: 05/31/2023]
Abstract
Wild species of rice with many valuable agronomic traits are an important genetic resource for improving cultivated rice by wide hybridization. Genome- or chromosome-specific markers are useful for monitoring genome introgression and for identifying genome components. From 47 random amplified polymorphic DNAs (RAPDs) of nine Oryza species, three bands (Ogla225, Opun225, and Opun246) were found to be genome specific with distinct sizes. Their specificities were further characterized by Southern hybridization, sequence analysis, and fluorescent in situ hybridization (FISH). Ogla225 is specifically amplified from the AA genome but homologous sequences were conserved among Oryza species. Opun225 occurs at a low copy number although is specifically amplified from Oryza punctata. There are estimated 2000-3300 repeats of Opun246 in each haploid genome of Oryza species with the BB or BBCC genome. Clusters of Opun246 repeats were detected at heterochromatic regions on almost all chromosomes of the BB genomes by FISH. Opun246 may be a useful marker for monitoring the introgression of BB genome or for identifying the conserved components of BB genome in genetic resource. The results from this study and our previous study both indicate that numerous unique repeats play role in the differentiation of the BB genome from other Oryza genomes.
Collapse
Affiliation(s)
- Shao-An Fang
- Institute of Plant and Microbial Biology, Academia Sinica, Taipei, Taiwan
| | | | | |
Collapse
|