1
|
Mahdhi A, Mars M, Rejili M. Members of Ensifer and Rhizobium genera are new bacterial endosymbionts nodulating Pisum sativum (L.). FEMS Microbiol Ecol 2023; 99:fiad001. [PMID: 36597782 DOI: 10.1093/femsec/fiad001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2022] [Revised: 12/27/2022] [Accepted: 01/02/2023] [Indexed: 01/05/2023] Open
Abstract
A total of 84 Pisum sativum legume nodulating bacteria (LNB) were isolated from seven geographical sites from southern Tunisia. Phylogenetic analyses based on partial sequences of 16S rRNA gene and the housekeeping genes glnII, and recA grouped strains into six clusters, four of which belonged to the genus Rhizobium and two to the Ensifer genus. Among Rhizobium clusters, 41 strains were affiliated to Rhizobium leguminosarum, two strains to R. pisi, two strains to R. etli, and interestingly two strains belonged to previously undescribed Rhizobium species. The remaining two strains were closely related to Ensifer medicae (two strains) and Ensifer meliloti (two strains). A symbiotic nodC gene-based phylogeny and host specificity test showed that all Rhizobium strains nodulating pea belonged to the symbiovar viciae, whereas the Ensifer strains were associated with the symbiovar meliloti never described to date. All strains under investigation differed in the number of induced root nodules and the effectiveness of atmospheric nitrogen fixation. The R. leguminosarum PsZA23, R. leguminosarum PsGBL42, and E. medicae PsTA22a, forming the most effective symbiosis with the plant host, are potential candidates for inoculation programs.
Collapse
Affiliation(s)
- A Mahdhi
- Laboratory of Biodiversity and Valorization of Arid Areas Bioresources (BVBAA) - Faculty of Sciences of Gabes, University of Gabes, Erriadh, Zrig 6072, Gabes, Tunisia
| | - M Mars
- Laboratory of Biodiversity and Valorization of Arid Areas Bioresources (BVBAA) - Faculty of Sciences of Gabes, University of Gabes, Erriadh, Zrig 6072, Gabes, Tunisia
| | - M Rejili
- Laboratory of Biodiversity and Valorization of Arid Areas Bioresources (BVBAA) - Faculty of Sciences of Gabes, University of Gabes, Erriadh, Zrig 6072, Gabes, Tunisia
- Department of Life Sciences, College of Sciences, Al Imam Mohammad Ibn Saud Islamic University (IMSIU), Riyadh, 11623, Saudi Arabia
| |
Collapse
|
2
|
Fominaya A, Loarce Y, González JM, Ferrer E. Cytogenetic evidence supports Avena insularis being closely related to hexaploid oats. PLoS One 2021; 16:e0257100. [PMID: 34653181 PMCID: PMC8519437 DOI: 10.1371/journal.pone.0257100] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2021] [Accepted: 10/01/2021] [Indexed: 11/19/2022] Open
Abstract
Cytogenetic observations, phylogenetic studies and genome analysis using high-density genetic markers have suggested a tetraploid Avena species carrying the C and D genomes (formerly C and A) to be the donor of all hexaploid oats (AACCDD). However, controversy surrounds which of the three extant CCDD tetraploid species—A. insularis, A. magna and A. murphyi—is most closely related to hexaploid oats. The present work describes a comparative karyotype analysis of these three CCDD tetraploid species and two hexaploid species, A. sativa and A. byzantina. This involved the use of FISH with six simple sequence repeats (SSRs) with the motifs CT, AAC, AAG, ACG, ATC and ACT, two repeated ribosomal sequences, and C genome-specific repetitive DNA. The hybridization pattern of A. insularis with oligonucleotide (AC)10 was also determined and compared with those previously published for A. sativa and A. byzantina. Significant differences in the 5S sites and SSR hybridization patterns of A. murphyi compared to the other CCDD species rule out its being directly involved in the origin of the hexaploids. In contrast, the repetitive and SSR hybridization patterns shown by the D genome chromosomes, and by most of the C genome chromosomes of A. magna and A. insularis, can be equated with the corresponding chromosomes of the hexaploids. Several chromosome hybridization signals seen for A. insularis, but not for A. magna, were shared with the hexaploid oats species, especially with A. byzantina. These diagnostic signals add weight to the idea that the extant A. insularis, or a direct ancestor of it, is the most closely related progenitor of hexaploid oats. The similarity of the chromosome hybridization patterns of the hexaploids and CCDD tetraploids was taken as being indicative of homology. A common chromosome nomenclature for CCDD species based on that of the hexaploid species is proposed.
Collapse
Affiliation(s)
- Araceli Fominaya
- Department of Biomedicine and Biotechnology, University of Alcalá, Alcalá de Henares, Madrid, Spain
| | - Yolanda Loarce
- Department of Biomedicine and Biotechnology, University of Alcalá, Alcalá de Henares, Madrid, Spain
| | - Juan M. González
- Department of Biomedicine and Biotechnology, University of Alcalá, Alcalá de Henares, Madrid, Spain
| | - Esther Ferrer
- Department of Biomedicine and Biotechnology, University of Alcalá, Alcalá de Henares, Madrid, Spain
- * E-mail:
| |
Collapse
|
3
|
Kuhn GCS, Heringer P, Dias GB. Structure, Organization, and Evolution of Satellite DNAs: Insights from the Drosophila repleta and D. virilis Species Groups. PROGRESS IN MOLECULAR AND SUBCELLULAR BIOLOGY 2021; 60:27-56. [PMID: 34386871 DOI: 10.1007/978-3-030-74889-0_2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
The fact that satellite DNAs (satDNAs) in eukaryotes are abundant genomic components, can perform functional roles, but can also change rapidly across species while being homogenous within a species, makes them an intriguing and fascinating genomic component to study. It is also becoming clear that satDNAs represent an important piece in genome architecture and that changes in their structure, organization, and abundance can affect the evolution of genomes and species in many ways. Since the discovery of satDNAs more than 50 years ago, species from the Drosophila genus have continuously been used as models to study several aspects of satDNA biology. These studies have been largely concentrated in D. melanogaster and closely related species from the Sophophora subgenus, even though the vast majority of all Drosophila species belong to the Drosophila subgenus. This chapter highlights some studies on the satDNA structure, organization, and evolution in two species groups from the Drosophila subgenus: the repleta and virilis groups. We also discuss and review the classification of other abundant tandem repeats found in these species in the light of the current information available.
Collapse
Affiliation(s)
- Gustavo C S Kuhn
- Departamento de Genética, Ecologia e Evolução, Universidade Federal de Minas Gerais (UFMG), Belo Horizonte, MG, Brazil.
| | - Pedro Heringer
- Departamento de Genética, Ecologia e Evolução, Universidade Federal de Minas Gerais (UFMG), Belo Horizonte, MG, Brazil
| | - Guilherme Borges Dias
- Department of Genetics and Institute of Bioinformatics, University of Georgia, Athens, GA, USA
| |
Collapse
|
4
|
Lopes ALK, Kriegová E, Lukeš J, Krieger MA, Ludwig A. Distribution of Merlin in eukaryotes and first report of DNA transposons in kinetoplastid protists. PLoS One 2021; 16:e0251133. [PMID: 33956864 PMCID: PMC8101967 DOI: 10.1371/journal.pone.0251133] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2020] [Accepted: 04/21/2021] [Indexed: 11/19/2022] Open
Abstract
DNA transposons are defined as repeated DNA sequences that can move within the host genome through the action of transposases. The transposon superfamily Merlin was originally found mainly in animal genomes. Here, we describe a global distribution of the Merlin in animals, fungi, plants and protists, reporting for the first time their presence in Rhodophyceae, Metamonada, Discoba and Alveolata. We identified a great variety of potentially active Merlin families, some containing highly imperfect terminal inverted repeats and internal tandem repeats. Merlin-related sequences with no evidence of mobilization capacity were also observed and may be products of domestication. The evolutionary trees support that Merlin is likely an ancient superfamily, with early events of diversification and secondary losses, although repeated re-invasions probably occurred in some groups, which would explain its diversity and discontinuous distribution. We cannot rule out the possibility that the Merlin superfamily is the product of multiple horizontal transfers of related prokaryotic insertion sequences. Moreover, this is the first account of a DNA transposon in kinetoplastid flagellates, with conserved Merlin transposase identified in Bodo saltans and Perkinsela sp., whereas it is absent in trypanosomatids. Based on the level of conservation of the transposase and overlaps of putative open reading frames with Merlin, we propose that in protists it may serve as a raw material for gene emergence.
Collapse
Affiliation(s)
- Ana Luisa Kalb Lopes
- Pós-Graduação em Biologia Celular e Molecular, Universidade Federal do Paraná, Curitiba, PR, Brazil
- Laboratório de Ciências e Tecnologias Aplicadas em Saúde (LaCTAS), Instituto Carlos Chagas, Fundação Oswaldo Cruz (Fiocruz), Curitiba, PR, Brazil
- Instituto de Biologia Molecular do Paraná, Curitiba, PR, Brazil
| | - Eva Kriegová
- Institute of Parasitology, Biology Center, Czech Academy of Sciences, České Budějovice (Budweis), Czech Republic
| | - Julius Lukeš
- Institute of Parasitology, Biology Center, Czech Academy of Sciences, České Budějovice (Budweis), Czech Republic
- Faculty of Sciences, University of South Bohemia, České Budějovice (Budweis), Czech Republic
| | - Marco Aurélio Krieger
- Pós-Graduação em Biologia Celular e Molecular, Universidade Federal do Paraná, Curitiba, PR, Brazil
- Laboratório de Ciências e Tecnologias Aplicadas em Saúde (LaCTAS), Instituto Carlos Chagas, Fundação Oswaldo Cruz (Fiocruz), Curitiba, PR, Brazil
| | - Adriana Ludwig
- Laboratório de Ciências e Tecnologias Aplicadas em Saúde (LaCTAS), Instituto Carlos Chagas, Fundação Oswaldo Cruz (Fiocruz), Curitiba, PR, Brazil
- * E-mail:
| |
Collapse
|
5
|
Retrotransposable Elements: DNA Fingerprinting and the Assessment of Genetic Diversity. Methods Mol Biol 2021; 2222:263-286. [PMID: 33301099 DOI: 10.1007/978-1-0716-0997-2_15] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023]
Abstract
Retrotransposable elements (RTEs) are highly common mobile genetic elements that are composed of several classes and make up the majority of eukaryotic genomes. The "copy-out and paste-in" life cycle of replicative transposition in these dispersive and ubiquitous RTEs leads to new genome insertions without excision of the original element. RTEs are important drivers of species diversity; they exhibit great variety in structure, size, and mechanisms of transposition, making them important putative components in genome evolution. Accordingly, various applications have been developed to explore the polymorphisms in RTE insertion patterns. These applications include conventional or anchored polymerase chain reaction (PCR) and quantitative or digital PCR with primers designed for the 5' or 3' junction. Marker systems exploiting these PCR methods can be easily developed and are inexpensively used in the absence of extensive genome sequence data. The main inter-repeat amplification polymorphism techniques include inter-retrotransposon amplified polymorphism (IRAP), retrotransposon microsatellite amplified polymorphism (REMAP), and Inter-Primer Binding Site (iPBS) for PCR amplification with a single or two primers.
Collapse
|
6
|
Khapilina O, Raiser O, Danilova A, Shevtsov V, Turzhanova A, Kalendar R. DNA profiling and assessment of genetic diversity of relict species Allium altaicum Pall. on the territory of Altai. PeerJ 2021; 9:e10674. [PMID: 33510974 PMCID: PMC7798630 DOI: 10.7717/peerj.10674] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2020] [Accepted: 12/09/2020] [Indexed: 12/18/2022] Open
Abstract
Analysis of the genetic diversity of natural populations of threatened and endangered species of plants is a main aspect of conservation strategy. The endangered species Allium altaicum is a relict plant of the Ice Age and natural populations are located in extreme climatic conditions of Kazakstan's Altai Mountains. Mobile genetic elements and other interspersed repeats are basic components of a eukaryote genome, which can activate under stress conditions and indirectly promote the survival of an organism against environmental stresses. Detections of chromosomal changes related to recombination processes of mobile genetic elements are performed by various PCR methods. These methods are based on interspersed repeat sequences and are an effective tool for research of biological diversity of plants and their variability. In our research, we used conservative sequences of tRNA primer binding sites (PBS) when initializing the retrotransposon replication as PCR primers to research the genetic diversity of 12 natural populations of A. altaicum found in various ecogeographic conditions of the Kazakhstani Altai. High efficiency of the PBS amplification method used was observed already at the intrapopulation level. Unique amplicons representative of a certain population were found at the intrapopulation level. Analysis of molecular dispersion revealed that the biodiversity of populations of mountainous and lowland A. altaicum is due to intrapopulation differences for climatic zones of habitation. This is likely conditional upon predominance of vegetative reproduction over seed reproduction in some populations. In the case of vegetative reproduction, somatic recombination related to the activity of mobile genetic elements are preserved in subsequent generations. This leads to an increase of intrapopulation genetic diversity. Thus, high genetic diversity was observed in populations such as A. altaicum located in the territory of the Kalbinskii Altai, whereas the minimum diversity was observed in the populations of the Leninororsk ecogeographic group. Distinctions between these populations were also identified depending on the areas of their distribution. Low-land and mid-mountain living environments are characterized by a great variety of shapes and plasticity. This work allowed us to obtain new genetic data on the structure of A. altaicum populations on the territory of the Kazakhstan Altai for the subsequent development of preservation and reproduction strategies for this relict species.
Collapse
Affiliation(s)
| | - Olesya Raiser
- National Center for Biotechnology, Nur-Sultan, Kazakhstan
| | | | | | | | - Ruslan Kalendar
- Department of Agricultural Sciences, University of Helsinki, Helsinki, Finland.,National Laboratory Astana, Nazarbayev University, Nur-Sultan, Aqmola, Kazakhstan
| |
Collapse
|
7
|
Kalendar R, Raskina O, Belyayev A, Schulman AH. Long Tandem Arrays of Cassandra Retroelements and Their Role in Genome Dynamics in Plants. Int J Mol Sci 2020; 21:ijms21082931. [PMID: 32331257 PMCID: PMC7215508 DOI: 10.3390/ijms21082931] [Citation(s) in RCA: 23] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2020] [Revised: 04/15/2020] [Accepted: 04/17/2020] [Indexed: 02/07/2023] Open
Abstract
Retrotransposable elements are widely distributed and diverse in eukaryotes. Their copy number increases through reverse-transcription-mediated propagation, while they can be lost through recombinational processes, generating genomic rearrangements. We previously identified extensive structurally uniform retrotransposon groups in which no member contains the gag, pol, or env internal domains. Because of the lack of protein-coding capacity, these groups are non-autonomous in replication, even if transcriptionally active. The Cassandra element belongs to the non-autonomous group called terminal-repeat retrotransposons in miniature (TRIM). It carries 5S RNA sequences with conserved RNA polymerase (pol) III promoters and terminators in its long terminal repeats (LTRs). Here, we identified multiple extended tandem arrays of Cassandra retrotransposons within different plant species, including ferns. At least 12 copies of repeated LTRs (as the tandem unit) and internal domain (as a spacer), giving a pattern that resembles the cellular 5S rRNA genes, were identified. A cytogenetic analysis revealed the specific chromosomal pattern of the Cassandra retrotransposon with prominent clustering at and around 5S rDNA loci. The secondary structure of the Cassandra retroelement RNA is predicted to form super-loops, in which the two LTRs are complementary to each other and can initiate local recombination, leading to the tandem arrays of Cassandra elements. The array structures are conserved for Cassandra retroelements of different species. We speculate that recombination events similar to those of 5S rRNA genes may explain the wide variation in Cassandra copy number. Likewise, the organization of 5S rRNA gene sequences is very variable in flowering plants; part of what is taken for 5S gene copy variation may be variation in Cassandra number. The role of the Cassandra 5S sequences remains to be established.
Collapse
Affiliation(s)
- Ruslan Kalendar
- Department of Agricultural Sciences, University of Helsinki, P.O. Box 27 (Latokartanonkaari 5), FI-00014 Helsinki, Finland
- RSE “National Center for Biotechnology”, Korgalzhyn Highway 13/5, Nur-Sultan 010000, Kazakhstan
- Correspondence: (R.K.); (A.H.S.)
| | - Olga Raskina
- Institute of Evolution, University of Haifa, Mount Carmel, Haifa 31905, Israel;
| | - Alexander Belyayev
- Laboratory of Molecular Cytogenetics and Karyology, Institute of Botany of the ASCR, Zámek 1, CZ-252 43 Průhonice, Czech Republic;
| | - Alan H. Schulman
- Natural Resources Institute Finland (Luke), Latokartanonkaari 9, FI-00790 Helsinki, Finland
- Institute of Biotechnology and Viikki Plant Science Centre, University of Helsinki, P.O. Box 65, FI-00014 Helsinki, Finland
- Correspondence: (R.K.); (A.H.S.)
| |
Collapse
|
8
|
Divashuk MG, Karlov GI, Kroupin PY. Copy Number Variation of Transposable Elements in Thinopyrum intermedium and Its Diploid Relative Species. PLANTS (BASEL, SWITZERLAND) 2019; 9:E15. [PMID: 31877707 PMCID: PMC7020174 DOI: 10.3390/plants9010015] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/03/2019] [Revised: 12/05/2019] [Accepted: 12/17/2019] [Indexed: 12/14/2022]
Abstract
Diploid and polyploid wild species of Triticeae have complex relationships, and the understanding of their evolution and speciation could help to increase the usability of them in wheat breeding as a source of genetic diversity. The diploid species Pseudoroegneria spicata (St), Thinopyrum bessarabicum (Jb), Dasypyrum villosum (V) derived from a hypothetical common ancestor are considered to be possible subgenome donors in hexaploid species Th. intermedium (JrJvsSt, where indices r, v, and s stand for the partial relation to the genomes of Secale, Dasypyrum, and Pseudoroegneria, respectively). We quantified 10 families of transposable elements (TEs) in P. spicata, Th. bessarabicum, D. villosum (per one genome), and Th. intermedium (per one average subgenome) using the quantitative real time PCR assay and compared their abundance within the studied genomes as well as between them. Sabrina was the most abundant among all studied elements in P. spicata, D. villosum, and Th. intermedium, and among Ty3/Gypsy elements in all studied species. Among Ty1/Copia elements, Angela-A and WIS-A showed the highest and close abundance with the exception of D. villosum, and comprised the majority of all studied elements in Th. bessarabicum. Sabrina, BAGY2, and Angela-A showed similar abundance among diploids and in Th. intermedium hexaploid; Latidu and Barbara demonstrated sharp differences between diploid genomes. The relationships between genomes of Triticeae species based on the studied TE abundance and the role of TEs in speciation and polyploidization in the light of the current phylogenetic models is discussed.
Collapse
Affiliation(s)
- Mikhail G. Divashuk
- Laboratory of Applied Genomics and Crop Breeding, All-Russia Research Institute of Agricultural Biotechnology, Moscow 127550, Russia; (M.G.D.)
- Centre for Molecular Biotechnology, Russian State Agrarian University-Timiryazev Agricultural Academy, Moscow 127550, Russia
| | - Gennady I. Karlov
- Laboratory of Applied Genomics and Crop Breeding, All-Russia Research Institute of Agricultural Biotechnology, Moscow 127550, Russia; (M.G.D.)
- Centre for Molecular Biotechnology, Russian State Agrarian University-Timiryazev Agricultural Academy, Moscow 127550, Russia
| | - Pavel Yu. Kroupin
- Laboratory of Applied Genomics and Crop Breeding, All-Russia Research Institute of Agricultural Biotechnology, Moscow 127550, Russia; (M.G.D.)
- Centre for Molecular Biotechnology, Russian State Agrarian University-Timiryazev Agricultural Academy, Moscow 127550, Russia
| |
Collapse
|
9
|
Paço A, Freitas R, Vieira-da-Silva A. Conversion of DNA Sequences: From a Transposable Element to a Tandem Repeat or to a Gene. Genes (Basel) 2019; 10:E1014. [PMID: 31817529 PMCID: PMC6947457 DOI: 10.3390/genes10121014] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2019] [Revised: 11/18/2019] [Accepted: 11/29/2019] [Indexed: 01/24/2023] Open
Abstract
Eukaryotic genomes are rich in repetitive DNA sequences grouped in two classes regarding their genomic organization: tandem repeats and dispersed repeats. In tandem repeats, copies of a short DNA sequence are positioned one after another within the genome, while in dispersed repeats, these copies are randomly distributed. In this review we provide evidence that both tandem and dispersed repeats can have a similar organization, which leads us to suggest an update to their classification based on the sequence features, concretely regarding the presence or absence of retrotransposons/transposon specific domains. In addition, we analyze several studies that show that a repetitive element can be remodeled into repetitive non-coding or coding sequences, suggesting (1) an evolutionary relationship among DNA sequences, and (2) that the evolution of the genomes involved frequent repetitive sequence reshuffling, a process that we have designated as a "DNA remodeling mechanism". The alternative classification of the repetitive DNA sequences here proposed will provide a novel theoretical framework that recognizes the importance of DNA remodeling for the evolution and plasticity of eukaryotic genomes.
Collapse
Affiliation(s)
- Ana Paço
- MED-Mediterranean Institute for Agriculture, Environment and Development, University of Évora, 7002–554 Évora, Portugal;
| | - Renata Freitas
- IBMC-Institute for Molecular and Cell Biology, University of Porto, R. Campo Alegre 823, 4150–180 Porto, Portugal;
- I3S-Institute for Innovation and Health Research, University of Porto, Rua Alfredo Allen, 208, 4200–135 Porto, Portugal
- ICBAS-Institute of Biomedical Sciences Abel Salazar, University of Porto, 4050-313 Porto, Portugal
| | - Ana Vieira-da-Silva
- MED-Mediterranean Institute for Agriculture, Environment and Development, University of Évora, 7002–554 Évora, Portugal;
| |
Collapse
|
10
|
Kalendar R, Tselykh TV, Khassenov B, Ramanculov EM. Introduction on Using the FastPCR Software and the Related Java Web Tools for PCR and Oligonucleotide Assembly and Analysis. Methods Mol Biol 2018; 1620:33-64. [PMID: 28540698 DOI: 10.1007/978-1-4939-7060-5_2] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Abstract
This chapter introduces the FastPCR software as an integrated tool environment for PCR primer and probe design, which predicts properties of oligonucleotides based on experimental studies of the PCR efficiency. The software provides comprehensive facilities for designing primers for most PCR applications and their combinations. These include the standard PCR as well as the multiplex, long-distance, inverse, real-time, group-specific, unique, overlap extension PCR for multi-fragments assembling cloning and loop-mediated isothermal amplification (LAMP). It also contains a built-in program to design oligonucleotide sets both for long sequence assembly by ligase chain reaction and for design of amplicons that tile across a region(s) of interest. The software calculates the melting temperature for the standard and degenerate oligonucleotides including locked nucleic acid (LNA) and other modifications. It also provides analyses for a set of primers with the prediction of oligonucleotide properties, dimer and G/C-quadruplex detection, linguistic complexity as well as a primer dilution and resuspension calculator. The program consists of various bioinformatical tools for analysis of sequences with the GC or AT skew, CG% and GA% content, and the purine-pyrimidine skew. It also analyzes the linguistic sequence complexity and performs generation of random DNA sequence as well as restriction endonucleases analysis. The program allows to find or create restriction enzyme recognition sites for coding sequences and supports the clustering of sequences. It performs efficient and complete detection of various repeat types with visual display. The FastPCR software allows the sequence file batch processing that is essential for automation. The program is available for download at http://primerdigital.com/fastpcr.html , and its online version is located at http://primerdigital.com/tools/pcr.html .
Collapse
Affiliation(s)
- Ruslan Kalendar
- National Center for Biotechnology, 010000, Astana, Kazakhstan. .,PrimerDigital Ltd, 00710, Helsinki, Finland.
| | - Timofey V Tselykh
- Biochemistry and Developmental Biology, Medicum, University of Helsinki, PL63, 00014, Helsinki, Finland.,Minerva Medical Research Institute, Biomedicum 2U, Tukholmankatu 8, 00290, Helsinki, Finland
| | - Bekbolat Khassenov
- National Center for Biotechnology,, 13/5 Kurgalzhynskoye Hwy, Astana, Kazakhstan
| | - Erlan M Ramanculov
- National Center for Biotechnology,, 13/5 Kurgalzhynskoye Hwy, Astana, Kazakhstan
| |
Collapse
|
11
|
Pisupati R, Vergara D, Kane NC. Diversity and evolution of the repetitive genomic content in Cannabis sativa. BMC Genomics 2018; 19:156. [PMID: 29466945 PMCID: PMC5822635 DOI: 10.1186/s12864-018-4494-3] [Citation(s) in RCA: 25] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2017] [Accepted: 01/24/2018] [Indexed: 01/13/2023] Open
Abstract
Background The repetitive content of the genome, once considered to be “junk DNA”, is in fact an essential component of genomic architecture and evolution. In this study, we used the genomes of three varieties of Cannabis sativa, three varieties of Humulus lupulus and one genotype of Morus notabilis to explore their repetitive content using a graph-based clustering method, designed to explore and compare repeat content in genomes that have not been fully assembled. Results The repetitive content in the C. sativa genome is mainly composed of the retrotransposons LTR/Copia and LTR/Gypsy (14% and 14.8%, respectively), ribosomal DNA (2%), and low-complexity sequences (29%). We observed a recent copy number expansion in some transposable element families. Simple repeats and low complexity regions of the genome show higher intra and inter species variation. Conclusions As with other sequenced genomes, the repetitive content of C. sativa’s genome exhibits a wide range of evolutionary patterns. Some repeat types have patterns of diversity consistent with expansions followed by losses in copy number, while others may have expanded more slowly and reached a steady state. Still, other repetitive sequences, particularly ribosomal DNA (rDNA), show signs of concerted evolution playing a major role in homogenizing sequence variation. Electronic supplementary material The online version of this article (10.1186/s12864-018-4494-3) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Rahul Pisupati
- Department of Biotechnology, Indian Institute of Technology, Kharagpur, 721302, India.,Present address: Gregor Mendel Institute, Dr. Bohr-gasse 3, Vienna, 1030, Austria
| | - Daniela Vergara
- Ecology and Evolutionary Biology, University of Colorado, Boulder, 80302, USA
| | - Nolan C Kane
- Ecology and Evolutionary Biology, University of Colorado, Boulder, 80302, USA.
| |
Collapse
|
12
|
Kalendar R, Amenov A, Daniyarov A. Use of retrotransposon-derived genetic markers to analyse genomic variability in plants. FUNCTIONAL PLANT BIOLOGY : FPB 2018; 46:15-29. [PMID: 30939255 DOI: 10.1071/fp18098] [Citation(s) in RCA: 29] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/17/2018] [Accepted: 08/23/2018] [Indexed: 06/09/2023]
Abstract
Transposable elements (TEs) are common mobile genetic elements comprising several classes and making up the majority of eukaryotic genomes. The movement and accumulation of TEs has been a major force shaping the genes and genomes of most organisms. Most eukaryotic genomes are dominated by retrotransposons and minimal DNA transposon accumulation. The 'copy and paste' lifecycle of replicative transposition produces new genome insertions without excising the original element. Horizontal TE transfer among lineages is rare. TEs represent a reservoir of potential genomic instability and RNA-level toxicity. Many TEs appear static and nonfunctional, but some are capable of replicating and mobilising to new positions, and somatic transposition events have been observed. The overall structure of retrotransposons and the domains responsible for the phases of their replication are highly conserved in all eukaryotes. TEs are important drivers of species diversity and exhibit great variety in their structure, size and transposition mechanisms, making them important putative actors in evolution. Because TEs are abundant in plant genomes, various applications have been developed to exploit polymorphisms in TE insertion patterns, including conventional or anchored PCR, and quantitative or digital PCR with primers for the 5' or 3' junction. Alternatively, the retrotransposon junction can be mapped using high-throughput next-generation sequencing and bioinformatics. With these applications, TE insertions can be rapidly, easily and accurately identified, or new TE insertions can be found. This review provides an overview of the TE-based applications developed for plant species and assesses the contributions of TEs to the analysis of plants' genetic diversity.
Collapse
Affiliation(s)
- Ruslan Kalendar
- Department of Agricultural Sciences, PO Box 27 (Latokartanonkaari 5), FI-00014 University of Helsinki, Helsinki, Finland
| | - Asset Amenov
- RSE 'National Center for Biotechnology', 13/5 Kurgalzhynskoye Road, Astana, 010000, Kazakhstan
| | - Asset Daniyarov
- RSE 'National Center for Biotechnology', 13/5 Kurgalzhynskoye Road, Astana, 010000, Kazakhstan
| |
Collapse
|
13
|
Palacios-Gimenez OM, Dias GB, de Lima LG, Kuhn GCES, Ramos É, Martins C, Cabral-de-Mello DC. High-throughput analysis of the satellitome revealed enormous diversity of satellite DNAs in the neo-Y chromosome of the cricket Eneoptera surinamensis. Sci Rep 2017; 7:6422. [PMID: 28743997 PMCID: PMC5527012 DOI: 10.1038/s41598-017-06822-8] [Citation(s) in RCA: 48] [Impact Index Per Article: 6.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2017] [Accepted: 06/19/2017] [Indexed: 12/31/2022] Open
Abstract
Satellite DNAs (satDNAs) constitute large portion of eukaryote genomes, comprising non-protein-coding sequences tandemly repeated. They are mostly found in heterochromatic regions of chromosomes such as around centromere or near telomeres, in intercalary heterochromatin, and often in non-recombining segments of sex chromosomes. We examined the satellitome in the cricket Eneoptera surinamensis (2n = 9, neo-X1X2Y, males) to characterize the molecular evolution of its neo-sex chromosomes. To achieve this, we analyzed illumina reads using graph-based clustering and complementary analyses. We found an unusually high number of 45 families of satDNAs, ranging from 4 bp to 517 bp, accounting for about 14% of the genome and showing different modular structures and high diversity of arrays. FISH mapping revealed that satDNAs are located mostly in C-positive pericentromeric regions of the chromosomes. SatDNAs enrichment was also observed in the neo-sex chromosomes in comparison to autosomes. Especially astonishing accumulation of satDNAs loci was found in the highly differentiated neo-Y, including 39 satDNAs over-represented in this chromosome, which is the greatest satDNAs diversity yet reported for sex chromosomes. Our results suggest possible involvement of satDNAs in genome increasing and in molecular differentiation of the neo-sex chromosomes in this species, contributing to the understanding of sex chromosome composition and evolution in Orthoptera.
Collapse
Affiliation(s)
| | - Guilherme Borges Dias
- Departamento de Biologia Geral, Universidade Federal de Minas Gerais, Belo Horizonte, MG, Brazil
| | - Leonardo Gomes de Lima
- Departamento de Biologia Geral, Universidade Federal de Minas Gerais, Belo Horizonte, MG, Brazil
| | | | - Érica Ramos
- UNESP - Univ Estadual Paulista, Instituto de Biociências/IB, Departamento de Morfologia, Botucatu, São Paulo, Brazil
| | - Cesar Martins
- UNESP - Univ Estadual Paulista, Instituto de Biociências/IB, Departamento de Morfologia, Botucatu, São Paulo, Brazil
| | | |
Collapse
|
14
|
Babii A, Kovalchuk S, Glazko T, Kosovsky G, Glazko V. Helitrons and Retrotransposons Are Co-localized in Bos taurus Genomes. Curr Genomics 2017; 18:278-286. [PMID: 28659723 PMCID: PMC5476951 DOI: 10.2174/1389202918666161108143909] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2016] [Revised: 08/26/2016] [Accepted: 10/27/2016] [Indexed: 11/24/2022] Open
Abstract
Background: DNA transposons helitrons are mobile genetic elements responsible for major movements of the genetic material within and across different genomes. This ability makes helitrons suitable candidate elements for the development of new approaches of multilocus genotyping of live-stock animals, along with the well-known microsatellite loci. Objective: We aimed to estimate the informativeness of helitron and microsatellite markers in assessing the consolidation and the “gene pool” standards of two commercial dairy cattle breeds (Ayrshire breed and holsteinized Black-and-White cattle) and one local breed of Kalmyk cattle, and to reveal any inter-breed difference in the organization of genomic regions flanked by helitrons in the studied cattle breeds. Method: We used the combination of two highly-polymorphic genomic elements – helitrons and trinu-cleotide microsatellites (AGC)6G and (GAG)6C, respectively – for genome scanning of the sampled groups of cattle. Also, we pyrosequenced the genomic regions flanked by the inverted repeats of 3'-end of Heligloria family of helitron fragments. Results: Generally, the both combinations of markers generated polymorphic spectra, based on which certain interbreed differentiation could be observed. The analysis of the identified interspersed repeats suggests that in factory and local cattle the genomic regions flanked by helitron fragments are shaped differently and contain different superfamilies of transposable elements, especially retrotransposons. Conclusion: Despite the well-known fact of retrotransposon-dependent microsatellite expansion, our data suggest that, in the cattle genome, the DNA transposons and microsatellites can also be found in close neighbourhood, and that helitrons and retrotransposons may form domains of increased variability – targets for factors of artificial selection.
Collapse
Affiliation(s)
- A Babii
- Federal State Budget Scientific Institution Center of Experimental Embryology and Reproductive Biotechnologies, Moscow, Russia
| | - S Kovalchuk
- Federal State Budget Scientific Institution Center of Experimental Embryology and Reproductive Biotechnologies, Moscow, Russia
| | - T Glazko
- Federal State Budget Scientific Institution Center of Experimental Embryology and Reproductive Biotechnologies, Moscow, Russia.,Federal State Educational Institution of Higher Education Russian State Agrarian University - MAA named after K. A. Timiryazev, Moscow, Russia
| | - G Kosovsky
- Federal State Budget Scientific Institution Center of Experimental Embryology and Reproductive Biotechnologies, Moscow, Russia
| | - V Glazko
- Federal State Budget Scientific Institution Center of Experimental Embryology and Reproductive Biotechnologies, Moscow, Russia.,Federal State Educational Institution of Higher Education Russian State Agrarian University - MAA named after K. A. Timiryazev, Moscow, Russia
| |
Collapse
|
15
|
FastPCR: An in silico tool for fast primer and probe design and advanced sequence analysis. Genomics 2017; 109:312-319. [PMID: 28502701 DOI: 10.1016/j.ygeno.2017.05.005] [Citation(s) in RCA: 88] [Impact Index Per Article: 12.6] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2016] [Revised: 05/09/2017] [Accepted: 05/11/2017] [Indexed: 01/16/2023]
Abstract
Polymerase chain reaction (PCR) is one of the most important laboratory techniques used in molecular biology, genetics and molecular diagnostics. The success of a PCR-based method largely depends on the correct nucleic acid sequence analysis in silico prior to a wet-bench experiment. Here, we report the development of an online Java-based software for virtual PCR on linear or circular DNA templates and multiple primer or probe search from large or small databases. Primer or probe sensitivity and specificity are predicted by searching a database to find sequences with an optimal number of mismatches, similarity and stability. The software determines primer location, orientation, efficiency of binding and calculates primer melting temperatures for standard and degenerate oligonucleotides. The software is suitable for batch file processing, which is essential for automation when working with large amounts of data. The online Java software is available for download at http://primerdigital.com/tools/pcr.html. Accession numbers for the sequences resulting from this study: EU140956 EU177767 EU867815 EU882730 FJ975775-FJ975780 HM481419 HM481420 KC686837-KC686839 KM262797.
Collapse
|
16
|
Rey-Baños R, Sáenz de Miera LE, García P, Pérez de la Vega M. Obtaining retrotransposon sequences, analysis of their genomic distribution and use of retrotransposon-derived genetic markers in lentil (Lens culinaris Medik.). PLoS One 2017; 12:e0176728. [PMID: 28448614 PMCID: PMC5407846 DOI: 10.1371/journal.pone.0176728] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2016] [Accepted: 04/14/2017] [Indexed: 12/02/2022] Open
Abstract
Retrotransposons with long terminal repeats (LTR-RTs) are widespread mobile elements in eukaryotic genomes. We obtained a total of 81 partial LTR-RT sequences from lentil corresponding to internal retrotransposon components and LTRs. Sequences were obtained by PCR from genomic DNA. Approximately 37% of the LTR-RT internal sequences presented premature stop codons, pointing out that these elements must be non-autonomous. LTR sequences were obtained using the iPBS technique which amplifies sequences between LTR-RTs. A total of 193 retrotransposon-derived genetic markers, mainly iPBS, were used to obtain a genetic linkage map from 94 F7 inbred recombinant lines derived from the cross between the cultivar Lupa and the wild ancestor L. culinaris subsp. orientalis. The genetic map included 136 markers located in eight linkage groups. Clusters of tightly linked retrotransposon-derived markers were detected in linkage groups LG1, LG2, and LG6, hence denoting a non-random genomic distribution. Phylogenetic analyses identified the LTR-RT families in which internal and LTR sequences are included. Ty3-gypsy elements were more frequent than Ty1-copia, mainly due to the high Ogre element frequency in lentil, as also occurs in other species of the tribe Vicieae. LTR and internal sequences were used to analyze in silico their distribution among the contigs of the lentil draft genome. Up to 8.8% of the lentil contigs evidenced the presence of at least one LTR-RT similar sequence. A statistical analysis suggested a non-random distribution of these elements within of the lentil genome. In most cases (between 97% and 72%, depending on the LTR-RT type) none of the internal sequences flanked by the LTR sequence pair was detected, suggesting that defective and non-autonomous LTR-RTs are very frequent in lentil. Results support that LTR-RTs are abundant and widespread throughout of the lentil genome and that they are a suitable source of genetic markers useful to carry out further genetic analyses.
Collapse
Affiliation(s)
- Rita Rey-Baños
- Área de Genética, Dpto. de Biología Molecular, Universidad de León, León, Spain
| | - Luis E. Sáenz de Miera
- Área de Genética, Dpto. de Biología Molecular, Universidad de León, León, Spain
- * E-mail:
| | - Pedro García
- Área de Genética, Dpto. de Biología Molecular, Universidad de León, León, Spain
| | | |
Collapse
|
17
|
Patterns of Evolutionary Trajectories and Domestication History within the Genus Hordeum Assessed by REMAP Markers. J Mol Evol 2017; 84:116-128. [PMID: 28168328 DOI: 10.1007/s00239-016-9779-z] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2016] [Accepted: 12/29/2016] [Indexed: 10/20/2022]
Abstract
The patterns of genetic diversity related to the taxonomy and domestication history of 85 accessions representing the main four species of the genus Hordeum were examined by retrotransposon-microsatellite amplified polymorphism (REMAP) markers based on the retrotransposon BARE-1. A substantial level of genetic polymorphisms at among- and within-species level was observed showing that this retrotransposon family and its adjacent genomic regions has been a target for genome dynamics during the evolution and domestication of barley. The obtained data are consistent with the current taxonomic status within the genus Hordeum. Similar level of genetic diversity was observed between the wild and the domesticated barley accessions suggesting that transposable elements` activity and accumulation may counteract the decrease of genome-wide diversity following domestication. In addition, eco-geographical sub-genome pools of the cultivated barley were identified in support to the theory of multiple origins of domestication within the genus Hordeum. We also provide conclusions about the relationship between accessions of different species and the putative routes of barley domestication. In conclusion, the retrotransposon BARE-1 stands as a reliable and perspective DNA marker for the assessment of the phylogenetic and domestication history in the genus Hordeum and other crop species.
Collapse
|
18
|
Abstract
The polymerase chain reaction (PCR) is fundamental to molecular biology and is the most important practical molecular technique for the research laboratory. The principle of this technique has been further used and applied in plenty of other simple or complex nucleic acid amplification technologies (NAAT). In parallel to laboratory "wet bench" experiments for nucleic acid amplification technologies, in silico or virtual (bioinformatics) approaches have been developed, among which in silico PCR analysis. In silico NAAT analysis is a useful and efficient complementary method to ensure the specificity of primers or probes for an extensive range of PCR applications from homology gene discovery, molecular diagnosis, DNA fingerprinting, and repeat searching. Predicting sensitivity and specificity of primers and probes requires a search to determine whether they match a database with an optimal number of mismatches, similarity, and stability. In the development of in silico bioinformatics tools for nucleic acid amplification technologies, the prospects for the development of new NAAT or similar approaches should be taken into account, including forward-looking and comprehensive analysis that is not limited to only one PCR technique variant. The software FastPCR and the online Java web tool are integrated tools for in silico PCR of linear and circular DNA, multiple primer or probe searches in large or small databases and for advanced search. These tools are suitable for processing of batch files that are essential for automation when working with large amounts of data. The FastPCR software is available for download at http://primerdigital.com/fastpcr.html and the online Java version at http://primerdigital.com/tools/pcr.html .
Collapse
|
19
|
Guan R, Zhao Y, Zhang H, Fan G, Liu X, Zhou W, Shi C, Wang J, Liu W, Liang X, Fu Y, Ma K, Zhao L, Zhang F, Lu Z, Lee SMY, Xu X, Wang J, Yang H, Fu C, Ge S, Chen W. Draft genome of the living fossil Ginkgo biloba. Gigascience 2016. [PMID: 27871309 DOI: 10.1186/s13742-016-0154-1pmid:27871309] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/08/2023] Open
Abstract
BACKGROUND Ginkgo biloba L. (Ginkgoaceae) is one of the most distinctive plants. It possesses a suite of fascinating characteristics including a large genome, outstanding resistance/tolerance to abiotic and biotic stresses, and dioecious reproduction, making it an ideal model species for biological studies. However, the lack of a high-quality genome sequence has been an impediment to our understanding of its biology and evolution. FINDINGS The 10.61 Gb genome sequence containing 41,840 annotated genes was assembled in the present study. Repetitive sequences account for 76.58% of the assembled sequence, and long terminal repeat retrotransposons (LTR-RTs) are particularly prevalent. The diversity and abundance of LTR-RTs is due to their gradual accumulation and a remarkable amplification between 16 and 24 million years ago, and they contribute to the long introns and large genome. Whole genome duplication (WGD) may have occurred twice, with an ancient WGD consistent with that shown to occur in other seed plants, and a more recent event specific to ginkgo. Abundant gene clusters from tandem duplication were also evident, and enrichment of expanded gene families indicates a remarkable array of chemical and antibacterial defense pathways. CONCLUSIONS The ginkgo genome consists mainly of LTR-RTs resulting from ancient gradual accumulation and two WGD events. The multiple defense mechanisms underlying the characteristic resilience of ginkgo are fostered by a remarkable enrichment in ancient duplicated and ginkgo-specific gene clusters. The present study sheds light on sequencing large genomes, and opens an avenue for further genetic and evolutionary research.
Collapse
Affiliation(s)
- Rui Guan
- BGI-Shenzhen, Shenzhen, 518083, China
- BGI-Qingdao, Qingdao, 266555, China
- State Key Laboratory of Bioelectronics, School of Biological Science and Medical Engineering, Southeast University, Nanjing, 210096, China
| | - Yunpeng Zhao
- The Key Laboratory of Conservation Biology for Endangered Wildlife of the Ministry of Education, College of Life Sciences, Zhejiang University, Hangzhou, 310058, China
- Laboratory of Systematic & Evolutionary Botany and Biodiversity, Institute of Ecology and Conservation Center for Gene Resources of Endangered Wildlife, Zhejiang University, Hangzhou, 310058, China
| | - He Zhang
- BGI-Shenzhen, Shenzhen, 518083, China
- BGI-Qingdao, Qingdao, 266555, China
- Stanley Ho Centre for Emerging Infectious Diseases, Faculty of Medicine, The Chinese University of Hong Kong, Shatin, Hong Kong
| | - Guangyi Fan
- BGI-Shenzhen, Shenzhen, 518083, China
- BGI-Qingdao, Qingdao, 266555, China
- State Key Laboratory of Quality Research in Chinese Medicine and Institute of Chinese Medical Sciences, Macao, China
| | - Xin Liu
- BGI-Shenzhen, Shenzhen, 518083, China
| | - Wenbin Zhou
- The Key Laboratory of Conservation Biology for Endangered Wildlife of the Ministry of Education, College of Life Sciences, Zhejiang University, Hangzhou, 310058, China
- Laboratory of Systematic & Evolutionary Botany and Biodiversity, Institute of Ecology and Conservation Center for Gene Resources of Endangered Wildlife, Zhejiang University, Hangzhou, 310058, China
| | | | | | - Weiqing Liu
- BGI-Wuhan, BGI-Shenzhen, Wuhan, 430074, China
| | | | - Yuanyuan Fu
- BGI-Shenzhen, Shenzhen, 518083, China
- State Key Laboratory of Bioelectronics, School of Biological Science and Medical Engineering, Southeast University, Nanjing, 210096, China
| | | | - Lijun Zhao
- The Key Laboratory of Conservation Biology for Endangered Wildlife of the Ministry of Education, College of Life Sciences, Zhejiang University, Hangzhou, 310058, China
- Laboratory of Systematic & Evolutionary Botany and Biodiversity, Institute of Ecology and Conservation Center for Gene Resources of Endangered Wildlife, Zhejiang University, Hangzhou, 310058, China
| | - Fumin Zhang
- State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Beijing, 100093, China
| | - Zuhong Lu
- State Key Laboratory of Bioelectronics, School of Biological Science and Medical Engineering, Southeast University, Nanjing, 210096, China
| | - Simon Ming-Yuen Lee
- State Key Laboratory of Quality Research in Chinese Medicine and Institute of Chinese Medical Sciences, Macao, China
| | - Xun Xu
- BGI-Shenzhen, Shenzhen, 518083, China
| | - Jian Wang
- BGI-Shenzhen, Shenzhen, 518083, China
- James D. Watson Institute of Genome Sciences, Hangzhou, 310058, China
| | - Huanming Yang
- BGI-Shenzhen, Shenzhen, 518083, China
- James D. Watson Institute of Genome Sciences, Hangzhou, 310058, China
| | - Chengxin Fu
- The Key Laboratory of Conservation Biology for Endangered Wildlife of the Ministry of Education, College of Life Sciences, Zhejiang University, Hangzhou, 310058, China.
- Laboratory of Systematic & Evolutionary Botany and Biodiversity, Institute of Ecology and Conservation Center for Gene Resources of Endangered Wildlife, Zhejiang University, Hangzhou, 310058, China.
| | - Song Ge
- State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Beijing, 100093, China.
| | - Wenbin Chen
- BGI-Shenzhen, Shenzhen, 518083, China.
- BGI-Qingdao, Qingdao, 266555, China.
| |
Collapse
|
20
|
Guan R, Zhao Y, Zhang H, Fan G, Liu X, Zhou W, Shi C, Wang J, Liu W, Liang X, Fu Y, Ma K, Zhao L, Zhang F, Lu Z, Lee SMY, Xu X, Wang J, Yang H, Fu C, Ge S, Chen W. Draft genome of the living fossil Ginkgo biloba. Gigascience 2016; 5:49. [PMID: 27871309 PMCID: PMC5118899 DOI: 10.1186/s13742-016-0154-1] [Citation(s) in RCA: 157] [Impact Index Per Article: 19.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2016] [Accepted: 11/01/2016] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Ginkgo biloba L. (Ginkgoaceae) is one of the most distinctive plants. It possesses a suite of fascinating characteristics including a large genome, outstanding resistance/tolerance to abiotic and biotic stresses, and dioecious reproduction, making it an ideal model species for biological studies. However, the lack of a high-quality genome sequence has been an impediment to our understanding of its biology and evolution. FINDINGS The 10.61 Gb genome sequence containing 41,840 annotated genes was assembled in the present study. Repetitive sequences account for 76.58% of the assembled sequence, and long terminal repeat retrotransposons (LTR-RTs) are particularly prevalent. The diversity and abundance of LTR-RTs is due to their gradual accumulation and a remarkable amplification between 16 and 24 million years ago, and they contribute to the long introns and large genome. Whole genome duplication (WGD) may have occurred twice, with an ancient WGD consistent with that shown to occur in other seed plants, and a more recent event specific to ginkgo. Abundant gene clusters from tandem duplication were also evident, and enrichment of expanded gene families indicates a remarkable array of chemical and antibacterial defense pathways. CONCLUSIONS The ginkgo genome consists mainly of LTR-RTs resulting from ancient gradual accumulation and two WGD events. The multiple defense mechanisms underlying the characteristic resilience of ginkgo are fostered by a remarkable enrichment in ancient duplicated and ginkgo-specific gene clusters. The present study sheds light on sequencing large genomes, and opens an avenue for further genetic and evolutionary research.
Collapse
Affiliation(s)
- Rui Guan
- BGI-Shenzhen, Shenzhen, 518083, China
- BGI-Qingdao, Qingdao, 266555, China
- State Key Laboratory of Bioelectronics, School of Biological Science and Medical Engineering, Southeast University, Nanjing, 210096, China
| | - Yunpeng Zhao
- The Key Laboratory of Conservation Biology for Endangered Wildlife of the Ministry of Education, College of Life Sciences, Zhejiang University, Hangzhou, 310058, China
- Laboratory of Systematic & Evolutionary Botany and Biodiversity, Institute of Ecology and Conservation Center for Gene Resources of Endangered Wildlife, Zhejiang University, Hangzhou, 310058, China
| | - He Zhang
- BGI-Shenzhen, Shenzhen, 518083, China
- BGI-Qingdao, Qingdao, 266555, China
- Stanley Ho Centre for Emerging Infectious Diseases, Faculty of Medicine, The Chinese University of Hong Kong, Shatin, Hong Kong
| | - Guangyi Fan
- BGI-Shenzhen, Shenzhen, 518083, China
- BGI-Qingdao, Qingdao, 266555, China
- State Key Laboratory of Quality Research in Chinese Medicine and Institute of Chinese Medical Sciences, Macao, China
| | - Xin Liu
- BGI-Shenzhen, Shenzhen, 518083, China
| | - Wenbin Zhou
- The Key Laboratory of Conservation Biology for Endangered Wildlife of the Ministry of Education, College of Life Sciences, Zhejiang University, Hangzhou, 310058, China
- Laboratory of Systematic & Evolutionary Botany and Biodiversity, Institute of Ecology and Conservation Center for Gene Resources of Endangered Wildlife, Zhejiang University, Hangzhou, 310058, China
| | | | | | - Weiqing Liu
- BGI-Wuhan, BGI-Shenzhen, Wuhan, 430074, China
| | | | - Yuanyuan Fu
- BGI-Shenzhen, Shenzhen, 518083, China
- State Key Laboratory of Bioelectronics, School of Biological Science and Medical Engineering, Southeast University, Nanjing, 210096, China
| | | | - Lijun Zhao
- The Key Laboratory of Conservation Biology for Endangered Wildlife of the Ministry of Education, College of Life Sciences, Zhejiang University, Hangzhou, 310058, China
- Laboratory of Systematic & Evolutionary Botany and Biodiversity, Institute of Ecology and Conservation Center for Gene Resources of Endangered Wildlife, Zhejiang University, Hangzhou, 310058, China
| | - Fumin Zhang
- State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Beijing, 100093, China
| | - Zuhong Lu
- State Key Laboratory of Bioelectronics, School of Biological Science and Medical Engineering, Southeast University, Nanjing, 210096, China
| | - Simon Ming-Yuen Lee
- State Key Laboratory of Quality Research in Chinese Medicine and Institute of Chinese Medical Sciences, Macao, China
| | - Xun Xu
- BGI-Shenzhen, Shenzhen, 518083, China
| | - Jian Wang
- BGI-Shenzhen, Shenzhen, 518083, China
- James D. Watson Institute of Genome Sciences, Hangzhou, 310058, China
| | - Huanming Yang
- BGI-Shenzhen, Shenzhen, 518083, China
- James D. Watson Institute of Genome Sciences, Hangzhou, 310058, China
| | - Chengxin Fu
- The Key Laboratory of Conservation Biology for Endangered Wildlife of the Ministry of Education, College of Life Sciences, Zhejiang University, Hangzhou, 310058, China.
- Laboratory of Systematic & Evolutionary Botany and Biodiversity, Institute of Ecology and Conservation Center for Gene Resources of Endangered Wildlife, Zhejiang University, Hangzhou, 310058, China.
| | - Song Ge
- State Key Laboratory of Systematic and Evolutionary Botany, Institute of Botany, Chinese Academy of Sciences, Beijing, 100093, China.
| | - Wenbin Chen
- BGI-Shenzhen, Shenzhen, 518083, China.
- BGI-Qingdao, Qingdao, 266555, China.
| |
Collapse
|
21
|
Lwin AK, Bertolini E, Pè ME, Zuccolo A. Genomic skimming for identification of medium/highly abundant transposable elements in Arundo donax and Arundo plinii. Mol Genet Genomics 2016; 292:157-171. [PMID: 27778102 DOI: 10.1007/s00438-016-1263-3] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2016] [Accepted: 10/17/2016] [Indexed: 11/29/2022]
Abstract
Transposable elements (TEs) are the most abundant genetic material for almost all eukaryotic genomes. Their effects on the host genomes range from an extensive size variation to the regulation of gene expression, altering gene function and creating new genes. Because of TEs pivotal contribute to the host genome structure and regulation, their identification and characterization provide a wealth of useful data for gaining an in-depth understanding of host genome functioning. The giant reed (Arundo donax) is a perennial rhizomatous C3 grass, octadecaploid, with an estimated nuclear genome size of 2744 Mbp. It is a promising feedstock for second-generation biofuels and biomethane production. To identify and characterize the most repetitive TEs in the genomes of A. donax and its ancestral A. plinii species, we carried out low-coverage whole genome shotgun sequencing for both species. Using a de novo repeat identification approach, 33,041 and 28,237 non-redundant repetitive sequences were identified and characterized in A. donax and A. plinii genomes, representing 37.55 and 31.68% of each genome, respectively. Comparative phylogenetic analyses, including the major TE classes identified in A. donax and A. plinii, together with rice and maize TE paralogs, were carried out to understand the evolutionary relationship of the most abundant TE classes. Highly conserved copies of RIRE1-like Ty1-Copia elements were discovered in two Arundo spp. in which they represented nearly 3% of each genomic sequence. We identified and characterized the medium/highly repetitive TEs in two unexplored polyploid genomes, thus generating useful information for the study of the genomic structure, composition, and functioning of these two non-model species. We provided a valuable resource that could be exploited in any effort aimed at sequencing and assembling these two genomes.
Collapse
Affiliation(s)
- Aung Kyaw Lwin
- Institute of Life Sciences, Scuola Superiore Sant'Anna, Piazza Martiri della Libertà, 33, 56127, Pisa, Italy.,Sugarcane Research and Seed Farm, Pyinmana, Nay Pyi Taw, Myanmar
| | - Edoardo Bertolini
- Institute of Life Sciences, Scuola Superiore Sant'Anna, Piazza Martiri della Libertà, 33, 56127, Pisa, Italy
| | - Mario Enrico Pè
- Institute of Life Sciences, Scuola Superiore Sant'Anna, Piazza Martiri della Libertà, 33, 56127, Pisa, Italy
| | - Andrea Zuccolo
- Institute of Life Sciences, Scuola Superiore Sant'Anna, Piazza Martiri della Libertà, 33, 56127, Pisa, Italy.
| |
Collapse
|
22
|
Gebre YG, Bertolini E, Pè ME, Zuccolo A. Identification and characterization of abundant repetitive sequences in Eragrostis tef cv. Enatite genome. BMC PLANT BIOLOGY 2016; 16:39. [PMID: 26833063 PMCID: PMC4736629 DOI: 10.1186/s12870-016-0725-4] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/15/2016] [Accepted: 01/28/2016] [Indexed: 06/05/2023]
Abstract
BACKGROUND Eragrostis tef is an allotetraploid (2n = 4 × = 40) annual, C4 grass with an estimated nuclear genome size of 730 Mbp. It is widely grown in Ethiopia, where it provides basic nutrition for more than half of the population. Although a draft assembly of the E. tef genome was made available in 2014, characterization of the repetitive portion of the E. tef genome has not been a subject of a detailed analysis. Repetitive sequences constitute most of the DNA in eukaryotic genomes. Transposable elements are usually the most abundant repetitive component in plant genomes. They contribute to genome size variation, cause mutations, can result in chromosomal rearrangements, and influence gene regulation. An extensive and in depth characterization of the repetitive component is essential in understanding the evolution and function of the genome. RESULTS Using new paired-end sequence data and a de novo repeat identification strategy, we identified the most repetitive elements in the E. tef genome. Putative repeat sequences were annotated based on similarity to known repeat groups in other grasses. Altogether we identified 1,389 medium/highly repetitive sequences that collectively represent about 27% of the teff genome. Phylogenetic analyses of the most important classes of TEs were carried out in a comparative framework including paralog elements from rice and maize. Finally, an abundant tandem repeat accounting for more than 4% of the whole genome was identified and partially characterized. CONCLUSIONS Analyzing a large sample of randomly sheared reads we obtained a library of the repetitive sequences of E. tef. The approach we used was designed to avoid underestimation of repeat contribution; such underestimation is characteristic of whole genome assembly projects. The data collected represent a valuable resource for further analysis of the genome of this important orphan crop.
Collapse
Affiliation(s)
- Yohannes Gedamu Gebre
- Institute of Life Sciences, Scuola Superiore Sant'Anna, Piazza Martiri della Libertà, 33-56127, Pisa, Italy.
- Department of Dryland Crop and Horticultural Sciences, College of Dryland Agriculture and Natural Resources, Mekelle University, P.O.Box 231, Mekelle, Ethiopia.
| | - Edoardo Bertolini
- Institute of Life Sciences, Scuola Superiore Sant'Anna, Piazza Martiri della Libertà, 33-56127, Pisa, Italy.
| | - Mario Enrico Pè
- Institute of Life Sciences, Scuola Superiore Sant'Anna, Piazza Martiri della Libertà, 33-56127, Pisa, Italy.
| | - Andrea Zuccolo
- Institute of Life Sciences, Scuola Superiore Sant'Anna, Piazza Martiri della Libertà, 33-56127, Pisa, Italy.
| |
Collapse
|
23
|
Chapuis MP, Plantamp C, Streiff R, Blondin L, Piou C. Microsatellite evolutionary rate and pattern in Schistocerca gregaria inferred from direct observation of germline mutations. Mol Ecol 2015; 24:6107-19. [PMID: 26562076 DOI: 10.1111/mec.13465] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2015] [Revised: 11/05/2015] [Accepted: 11/06/2015] [Indexed: 01/21/2023]
Abstract
Unravelling variation among taxonomic orders regarding the rate of evolution in microsatellites is crucial for evolutionary biology and population genetics research. The mean mutation rate of microsatellites tends to be lower in arthropods than in vertebrates, but data are scarce and mostly concern accumulation of mutations in model species. Based on parent-offspring segregations and a hierarchical Bayesian model, the mean rate of mutation in the orthopteran insect Schistocerca gregaria was estimated at 2.1e(-4) per generation per untranscribed dinucleotide locus. This is close to vertebrate estimates and one order of magnitude higher than estimates from species of other arthropod orders, such as Drosophila melanogaster and Daphnia pulex. We also found evidence of a directional bias towards expansions even for long alleles and exceptionally large ranges of allele sizes. Finally, at transcribed microsatellites, the mean rate of mutation was half the rate found at untranscribed loci and the mutational model deviated from that usually considered, with most mutations involving multistep changes that avoid disrupting the reading frame. Our direct estimates of mutation rate were discussed in the light of peculiar biological and genomic features of S. gregaria, including specificities in mismatch repair and the dependence of its activity to allele length. Shedding new light on the mutational dynamics of grasshopper microsatellites is of critical importance for a number of research fields. As an illustration, we showed how our findings improve microsatellite application in population genetics, by obtaining a more precise estimation of S. gregaria effective population size from a published data set based on the same microsatellites.
Collapse
Affiliation(s)
- M-P Chapuis
- CIRAD, UMR CBGP, Montpellier, F-34398, France
| | - C Plantamp
- Laboratoire de Biométrie et Biologie Evolutive, CNRS, UMR 5558, Université Lyon 1, Villeurbanne, 69622, France
| | - R Streiff
- INRA, UMR CBGP, Montpellier, F-34398, France.,INRA, UMR DGIMI, Montpellier, F-34000, France
| | - L Blondin
- CIRAD, UPR B-AMR, Montpellier, F-34398, France
| | - C Piou
- CIRAD, UMR CBGP, Montpellier, F-34398, France
| |
Collapse
|
24
|
Macas J, Novák P, Pellicer J, Čížková J, Koblížková A, Neumann P, Fuková I, Doležel J, Kelly LJ, Leitch IJ. In Depth Characterization of Repetitive DNA in 23 Plant Genomes Reveals Sources of Genome Size Variation in the Legume Tribe Fabeae. PLoS One 2015; 10:e0143424. [PMID: 26606051 PMCID: PMC4659654 DOI: 10.1371/journal.pone.0143424] [Citation(s) in RCA: 118] [Impact Index Per Article: 13.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2015] [Accepted: 11/04/2015] [Indexed: 01/30/2023] Open
Abstract
The differential accumulation and elimination of repetitive DNA are key drivers of genome size variation in flowering plants, yet there have been few studies which have analysed how different types of repeats in related species contribute to genome size evolution within a phylogenetic context. This question is addressed here by conducting large-scale comparative analysis of repeats in 23 species from four genera of the monophyletic legume tribe Fabeae, representing a 7.6-fold variation in genome size. Phylogenetic analysis and genome size reconstruction revealed that this diversity arose from genome size expansions and contractions in different lineages during the evolution of Fabeae. Employing a combination of low-pass genome sequencing with novel bioinformatic approaches resulted in identification and quantification of repeats making up 55–83% of the investigated genomes. In turn, this enabled an analysis of how each major repeat type contributed to the genome size variation encountered. Differential accumulation of repetitive DNA was found to account for 85% of the genome size differences between the species, and most (57%) of this variation was found to be driven by a single lineage of Ty3/gypsy LTR-retrotransposons, the Ogre elements. Although the amounts of several other lineages of LTR-retrotransposons and the total amount of satellite DNA were also positively correlated with genome size, their contributions to genome size variation were much smaller (up to 6%). Repeat analysis within a phylogenetic framework also revealed profound differences in the extent of sequence conservation between different repeat types across Fabeae. In addition to these findings, the study has provided a proof of concept for the approach combining recent developments in sequencing and bioinformatics to perform comparative analyses of repetitive DNAs in a large number of non-model species without the need to assemble their genomes.
Collapse
Affiliation(s)
- Jiří Macas
- Biology Centre of the Czech Academy of Sciences, Institute of Plant Molecular Biology, České Budějovice, Czech Republic
- * E-mail:
| | - Petr Novák
- Biology Centre of the Czech Academy of Sciences, Institute of Plant Molecular Biology, České Budějovice, Czech Republic
| | - Jaume Pellicer
- Jodrell Laboratory, Royal Botanic Gardens, Kew, Richmond, Surrey, United Kingdom
| | - Jana Čížková
- Institute of Experimental Botany, Olomouc, Centre of the Region Haná for Biotechnological and Agricultural Research, Olomouc, Czech Republic
| | - Andrea Koblížková
- Biology Centre of the Czech Academy of Sciences, Institute of Plant Molecular Biology, České Budějovice, Czech Republic
| | - Pavel Neumann
- Biology Centre of the Czech Academy of Sciences, Institute of Plant Molecular Biology, České Budějovice, Czech Republic
| | - Iva Fuková
- Biology Centre of the Czech Academy of Sciences, Institute of Plant Molecular Biology, České Budějovice, Czech Republic
| | - Jaroslav Doležel
- Institute of Experimental Botany, Olomouc, Centre of the Region Haná for Biotechnological and Agricultural Research, Olomouc, Czech Republic
| | - Laura J. Kelly
- School of Biological and Chemical Sciences, Queen Mary University of London, London, United Kingdom
| | - Ilia J. Leitch
- Jodrell Laboratory, Royal Botanic Gardens, Kew, Richmond, Surrey, United Kingdom
| |
Collapse
|
25
|
MuTAnT: a family of Mutator-like transposable elements targeting TA microsatellites in Medicago truncatula. Genetica 2015; 143:433-40. [PMID: 25981486 PMCID: PMC4486113 DOI: 10.1007/s10709-015-9842-5] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2015] [Accepted: 05/10/2015] [Indexed: 12/31/2022]
Abstract
Transposable elements (TEs) are mobile DNA segments, abundant and dynamic in plant genomes. Because their mobility can be potentially deleterious to the host, a variety of mechanisms evolved limiting that negative impact, one of them being preference for a specific target insertion site. Here, we describe a family of Mutator-like DNA transposons in Medicago truncatula targeting TA microsatellites. We identified 218 copies of MuTAnTs and an element carrying a complete ORF encoding a mudrA-like transposase. Most insertion sites are flanked by a variable number of TA tandem repeats, indicating that MuTAnTs are specifically targeting TA microsatellites. Other TE families flanked by TA repeats (e.g. TAFT elements in maize) were described previously, however we identified the first putative autonomous element sharing that characteristics with a related group of short non-autonomous transposons.
Collapse
|
26
|
Zuccolo A, Scofield DG, De Paoli E, Morgante M. The Ty1-copia LTR retroelement family PARTC is highly conserved in conifers over 200 MY of evolution. Gene 2015; 568:89-99. [PMID: 25982862 DOI: 10.1016/j.gene.2015.05.028] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2015] [Revised: 04/06/2015] [Accepted: 05/11/2015] [Indexed: 11/26/2022]
Abstract
Long Terminal Repeat retroelements (LTR-RTs) are a major component of many plant genomes. Although well studied and described in angiosperms, their features and dynamics are poorly understood in gymnosperms. Representative complete copies of a Ty1-copia element isolate in Picea abies and named PARTC were identified in six other conifer species (Picea glauca, Pinus sylvestris, Pinus taeda, Abies sibirica, Taxus baccata and Juniperus communis) covering more than 200 million years of evolution. Here we characterized the structure of this element, assessed its abundance across conifers, studied the modes and timing of its amplification, and evaluated the degree of conservation of its extant copies at nucleotide level over distant species. We demonstrated that the element is ancient, abundant, widespread and its paralogous copies are present in the genera Picea, Pinus and Abies as an LTR-RT family. The amplification leading to the extant copies of PARTC occurred over long evolutionary times spanning 10s of MY and mostly took place after the speciation of the conifers analyzed. The level of conservation of PARTC is striking and may be explained by low substitution rates and limited removal mechanisms for LTR-RTs. These PARTC features and dynamics are representative of a more general scenario for LTR-RTs in gymnosperms quite different from that characterizing the vast majority of LTR-RT elements in angiosperms.
Collapse
Affiliation(s)
- Andrea Zuccolo
- Institute of Life Sciences, Scuola Superiore Sant'Anna, 56127 Pisa, Italy; Istituto di Genomica Applicata, Via J. Linussio 51, 33100 Udine, Italy.
| | - Douglas G Scofield
- Department of Evolutionary Biology, Evolutionary Biology Centre, Uppsala University, SE-75236 Uppsala, Sweden
| | - Emanuele De Paoli
- Università degli Studi di Udine, Via delle Scienze 208, 33100 Udine, Italy
| | - Michele Morgante
- Istituto di Genomica Applicata, Via J. Linussio 51, 33100 Udine, Italy; Università degli Studi di Udine, Via delle Scienze 208, 33100 Udine, Italy
| |
Collapse
|
27
|
Dias GB, Svartman M, Delprat A, Ruiz A, Kuhn GCS. Tetris is a foldback transposon that provided the building blocks for an emerging satellite DNA of Drosophila virilis. Genome Biol Evol 2014; 6:1302-13. [PMID: 24858539 PMCID: PMC4079207 DOI: 10.1093/gbe/evu108] [Citation(s) in RCA: 32] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022] Open
Abstract
Transposable elements (TEs) and satellite DNAs (satDNAs) are abundant components of most eukaryotic genomes studied so far and their impact on evolution has been the focus of several studies. A number of studies linked TEs with satDNAs, but the nature of their evolutionary relationships remains unclear. During in silico analyses of the Drosophila virilis assembled genome, we found a novel DNA transposon we named Tetris based on its modular structure and diversity of rearranged forms. We aimed to characterize Tetris and investigate its role in generating satDNAs. Data mining and sequence analysis showed that Tetris is apparently nonautonomous, with a structure similar to foldback elements, and present in D. virilis and D. americana. Herein, we show that Tetris shares the final portions of its terminal inverted repeats (TIRs) with DAIBAM, a previously described miniature inverted transposable element implicated in the generation of chromosome inversions. Both elements are likely to be mobilized by the same autonomous TE. Tetris TIRs contain approximately 220-bp internal tandem repeats that we have named TIR-220. We also found TIR-220 repeats making up longer (kb-size) satDNA-like arrays. Using bioinformatic, phylogenetic and cytogenomic tools, we demonstrated that Tetris has contributed to shaping the genomes of D. virilis and D. americana, providing internal tandem repeats that served as building blocks for the amplification of satDNA arrays. The β-heterochromatic genomic environment seemed to have favored such amplification. Our results imply for the first time a role for foldback elements in generating satDNAs.
Collapse
Affiliation(s)
- Guilherme B Dias
- Departamento de Biologia Geral, Universidade Federal de Minas Gerais, Belo Horizonte, MG, Brazil
| | - Marta Svartman
- Departamento de Biologia Geral, Universidade Federal de Minas Gerais, Belo Horizonte, MG, Brazil
| | - Alejandra Delprat
- Departament de Genètica i de Microbiologia, Universitat Autònoma de Barcelona, Bellaterra, Catalunya, Spain
| | - Alfredo Ruiz
- Departament de Genètica i de Microbiologia, Universitat Autònoma de Barcelona, Bellaterra, Catalunya, Spain
| | - Gustavo C S Kuhn
- Departamento de Biologia Geral, Universidade Federal de Minas Gerais, Belo Horizonte, MG, Brazil
| |
Collapse
|
28
|
SSR genetic linkage map construction of pea (Pisum sativum L.) based on Chinese native varieties. ACTA ACUST UNITED AC 2014. [DOI: 10.1016/j.cj.2014.03.004] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
|
29
|
Almeida NF, Leitão ST, Caminero C, Torres AM, Rubiales D, Vaz Patto MC. Transferability of molecular markers from major legumes to Lathyrus spp. for their application in mapping and diversity studies. Mol Biol Rep 2013; 41:269-83. [DOI: 10.1007/s11033-013-2860-4] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2013] [Accepted: 10/31/2013] [Indexed: 12/01/2022]
|
30
|
Chang W, Jääskeläinen M, Li SP, Schulman AH. BARE retrotransposons are translated and replicated via distinct RNA pools. PLoS One 2013; 8:e72270. [PMID: 23940808 PMCID: PMC3735527 DOI: 10.1371/journal.pone.0072270] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2013] [Accepted: 07/14/2013] [Indexed: 01/02/2023] Open
Abstract
The replication of Long Terminal Repeat (LTR) retrotransposons, which can constitute over 80% of higher plant genomes, resembles that of retroviruses. A major question for retrotransposons and retroviruses is how the two conflicting roles of their transcripts, in translation and reverse transcription, are balanced. Here, we show that the BARE retrotransposon, despite its organization into just one open reading frame, produces three distinct classes of transcripts. One is capped, polyadenylated, and translated, but cannot be copied into cDNA. The second is not capped or polyadenylated, but is destined for packaging and ultimate reverse transcription. The third class is capped, polyadenylated, and spliced to favor production of a subgenomic RNA encoding only Gag, the protein forming virus-like particles. Moreover, the BARE2 subfamily, which cannot synthesize Gag and is parasitic on BARE1, does not produce the spliced sub-genomic RNA for translation but does make the replication competent transcripts, which are packaged into BARE1 particles. To our knowledge, this is first demonstration of distinct RNA pools for translation and transcription for any retrotransposon.
Collapse
Affiliation(s)
- Wei Chang
- Institute of Biotechnology, Viikki Biocenter, University of Helsinki, Helsinki, Finland
| | - Marko Jääskeläinen
- Institute of Biotechnology, Viikki Biocenter, University of Helsinki, Helsinki, Finland
| | - Song-ping Li
- Genome-Scale Biology Program, University of Helsinki, Biomedicum, Helsinki, Finland
| | - Alan H. Schulman
- Institute of Biotechnology, Viikki Biocenter, University of Helsinki, Helsinki, Finland
- Biotechnology and Food Research, MTT Agrifood Research Finland, Jokioinen, Finland
- * E-mail:
| |
Collapse
|
31
|
Yeoh CC, Balcerowicz M, Zhang L, Jaudal M, Brocard L, Ratet P, Putterill J. Fine mapping links the FTa1 flowering time regulator to the dominant spring1 locus in Medicago. PLoS One 2013; 8:e53467. [PMID: 23308229 PMCID: PMC3538541 DOI: 10.1371/journal.pone.0053467] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2012] [Accepted: 11/29/2012] [Indexed: 12/27/2022] Open
Abstract
To extend our understanding of flowering time control in eudicots, we screened for mutants in the model legume Medicago truncatula (Medicago). We identified an early flowering mutant, spring1, in a T-DNA mutant screen, but spring1 was not tagged and was deemed a somaclonal mutant. We backcrossed the mutant to wild type R108. The F1 plants and the majority of F2 plants were early flowering like spring1, strongly indicating that spring1 conferred monogenic, dominant early flowering. We hypothesized that the spring1 phenotype resulted from over expression of an activator of flowering. Previously, a major QTL for flowering time in different Medicago accessions was located to an interval on chromosome 7 with six candidate flowering-time activators, including a CONSTANS gene, MtCO, and three FLOWERING LOCUS T (FT) genes. Hence we embarked upon linkage mapping using 29 markers from the MtCO/FT region on chromosome 7 on two populations developed by crossing spring1 with Jester. Spring1 mapped to an interval of ∼0.5 Mb on chromosome 7 that excluded MtCO, but contained 78 genes, including the three FT genes. Of these FT genes, only FTa1 was up-regulated in spring1 plants. We then investigated global gene expression in spring1 and R108 by microarray analysis. Overall, they had highly similar gene expression and apart from FTa1, no genes in the mapping interval were differentially expressed. Two MADS transcription factor genes, FRUITFULLb (FULb) and SUPPRESSOR OF OVER EXPRESSION OF CONSTANS1a (SOC1a), that were up-regulated in spring1, were also up-regulated in transgenic Medicago over-expressing FTa1. This suggested that their differential expression in spring1 resulted from the increased abundance of FTa1. A 6255 bp genomic FTa1 fragment, including the complete 5' region, was sequenced, but no changes were observed indicating that the spring1 mutation is not a DNA sequence difference in the FTa1 promoter or introns.
Collapse
Affiliation(s)
- Chin Chin Yeoh
- School of Biological Sciences, University of Auckland, Auckland, New Zealand
| | - Martin Balcerowicz
- School of Biological Sciences, University of Auckland, Auckland, New Zealand
| | - Lulu Zhang
- School of Biological Sciences, University of Auckland, Auckland, New Zealand
| | - Mauren Jaudal
- School of Biological Sciences, University of Auckland, Auckland, New Zealand
| | - Lysiane Brocard
- Institut des Sciences du Végétal, CNRS, Gif sur Yvette, France
| | - Pascal Ratet
- Institut des Sciences du Végétal, CNRS, Gif sur Yvette, France
| | - Joanna Putterill
- School of Biological Sciences, University of Auckland, Auckland, New Zealand
| |
Collapse
|
32
|
Stolle E, Kidner JH, Moritz RFA. Patterns of evolutionary conservation of microsatellites (SSRs) suggest a faster rate of genome evolution in Hymenoptera than in Diptera. Genome Biol Evol 2013; 5:151-62. [PMID: 23292136 PMCID: PMC3595035 DOI: 10.1093/gbe/evs133] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 12/21/2012] [Indexed: 12/25/2022] Open
Abstract
Microsatellites, or simple sequence repeats (SSRs), are common and widespread DNA elements in genomes of many organisms. However, their dynamics in genome evolution is unclear, whereby they are thought to evolve neutrally. More available genome sequences along with dated phylogenies allowed for studying the evolution of these repetitive DNA elements along evolutionary time scales. This could be used to compare rates of genome evolution. We show that SSRs in insects can be retained for several hundred million years. Different types of microsatellites seem to be retained longer than others. By comparing Dipteran with Hymenopteran species, we found very similar patterns of SSR loss during their evolution, but both taxa differ profoundly in the rate. Relative to divergence time, Diptera lost SSRs twice as fast as Hymenoptera. The loss of SSRs on the Drosophila melanogaster X-chromosome was higher than on the other chromosomes. However, accounting for generation time, the Diptera show an 8.5-fold slower rate of SSR loss than the Hymenoptera, which, in contrast to previous studies, suggests a faster genome evolution in the latter. This shows that generation time differences can have a profound effect. A faster genome evolution in these insects could be facilitated by several factors very different to Diptera, which is discussed in light of our results on the haplodiploid D. melanogaster X-chromosome. Furthermore, large numbers of SSRs can be found to be in synteny and thus could be exploited as a tool to investigate genome structure and evolution.
Collapse
Affiliation(s)
- Eckart Stolle
- Department of Zoology, Institute of Biology, Martin-Luther-University Halle-Wittenberg, Halle (Saale), Germany.
| | | | | |
Collapse
|
33
|
Novák K, Biedermannová E, Vondrys J. Functional markers delimiting a Medicago orthologue of pea symbiotic gene NOD3. EUPHYTICA 2012; 186:761-777. [PMID: 0 DOI: 10.1007/s10681-011-0586-8] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/29/2010] [Accepted: 11/12/2011] [Indexed: 05/21/2023]
|
34
|
|
35
|
Mogil LS, Slowikowski K, Laten HM. Computational and experimental analyses of retrotransposon-associated minisatellite DNAs in the soybean genome. BMC Bioinformatics 2012; 13 Suppl 2:S13. [PMID: 22536864 PMCID: PMC3305785 DOI: 10.1186/1471-2105-13-s2-s13] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/14/2023] Open
Abstract
BACKGROUND Retrotransposons are mobile DNA elements that spread through genomes via the action of element-encoded reverse transcriptases. They are ubiquitous constituents of most eukaryotic genomes, especially those of higher plants. The pericentromeric regions of soybean (Glycine max) chromosomes contain >3,200 intact copies of the Gmr9/GmOgre retrotransposon. Between the 3' end of the coding region and the long terminal repeat, this retrotransposon family contains a polymorphic minisatellite region composed of five distinct, interleaved minisatellite families. To better understand the possible role and origin of retrotransposon-associated minisatellites, a computational project to map and physically characterize all members of these families in the G. max genome, irrespective of their association with Gmr9, was undertaken. METHODS A computational pipeline was developed to map and analyze the organization and distribution of five Gmr9-associated minisatellites throughout the soybean genome. Polymerase chain reaction amplifications were used to experimentally assess the computational outputs. RESULTS A total of 63,841 copies of Gmr9-associated minisatellites were recovered from the assembled G. max genome. Ninety percent were associated with Gmr9, an additional 9% with other annotated retrotransposons, and 1% with uncharacterized repetitive DNAs. Monomers were tandemly interleaved and repeated up to 149 times per locus. CONCLUSIONS The computational pipeline enabled a fast, accurate, and detailed characterization of known minisatellites in a large, downloaded DNA database, and PCR amplification supported the general organization of these arrays.
Collapse
Affiliation(s)
- Lauren S Mogil
- Program in Bioinformatics Loyola University Chicago, 1032 W. Sheridan Rd, Chicago, IL 60660 USA
- Department of Biology, Loyola University Chicago, 1032 W. Sheridan Rd, Chicago, IL 60660 USA
- Present address: Department of Biochemistry and Molecular Biology, Mayo Graduate School, Rochester, MN 55905 USA
| | - Kamil Slowikowski
- Program in Bioinformatics Loyola University Chicago, 1032 W. Sheridan Rd, Chicago, IL 60660 USA
| | - Howard M Laten
- Program in Bioinformatics Loyola University Chicago, 1032 W. Sheridan Rd, Chicago, IL 60660 USA
- Department of Biology, Loyola University Chicago, 1032 W. Sheridan Rd, Chicago, IL 60660 USA
| |
Collapse
|
36
|
Cieslarová J, Hanáček P, Fialová E, Hýbl M, Smýkal P. Estimation of pea (Pisum sativum L.) microsatellite mutation rate based on pedigree and single-seed descent analyses. J Appl Genet 2011; 52:391-401. [PMID: 21769669 DOI: 10.1007/s13353-011-0058-9] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2011] [Revised: 06/21/2011] [Accepted: 06/22/2011] [Indexed: 02/07/2023]
Abstract
Microsatellites, or simple sequence repeats (SSRs) are widespread class of repetitive DNA sequences, used in population genetics, genetic diversity and mapping studies. In spite of the SSR utility, the genetic and evolutionary mechanisms are not fully understood. We have investigated three microsatellite loci with different position in the pea (Pisum sativum L.) genome, the A9 locus residing in LTR region of abundant retrotransposon, AD270 as intergenic and AF016458 located in 5'untranslated region of expressed gene. Comparative analysis of a 35 pair samples from seven pea varieties propagated by single-seed descent for ten generations, revealed single 4 bp mutation in 10th generation sample at AD270 locus corresponding to stepwise increase in one additional ATCT repeat unit. The estimated mutation rate was 4.76 × 10(-3) per locus per generation, with a 95% confidence interval of 1.2 × 10(-4) to 2.7 × 10(-2). The comparison of cv. Bohatýr accessions retrieved from different collections, showed intra-, inter-accession variation and differences in flanking and repeat sequences. Fragment size and sequence alternations were also found in long term in vitro organogenic culture, established at 1983, indicative of somatic mutation process. The evidence of homoplasy was detected across of unrelated pea genotypes, which adversaly affects the reliability of diversity estimates not only for diverse germplasm but also highly bred material. The findings of this study have important implications for Pisum phylogeny studies, variety identification and registration process in pea breeding where mutation rate influences the genetic diversity and the effective population size estimates.
Collapse
Affiliation(s)
- Jaroslava Cieslarová
- Department of Plant Biology, Mendel University, Zemědělská, Brno, Czech Republic
| | | | | | | | | |
Collapse
|
37
|
Novák P, Neumann P, Macas J. Graph-based clustering and characterization of repetitive sequences in next-generation sequencing data. BMC Bioinformatics 2010; 11:378. [PMID: 20633259 PMCID: PMC2912890 DOI: 10.1186/1471-2105-11-378] [Citation(s) in RCA: 307] [Impact Index Per Article: 21.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2010] [Accepted: 07/15/2010] [Indexed: 01/23/2023] Open
Abstract
BACKGROUND The investigation of plant genome structure and evolution requires comprehensive characterization of repetitive sequences that make up the majority of higher plant nuclear DNA. Since genome-wide characterization of repetitive elements is complicated by their high abundance and diversity, novel approaches based on massively-parallel sequencing are being adapted to facilitate the analysis. It has recently been demonstrated that the low-pass genome sequencing provided by a single 454 sequencing reaction is sufficient to capture information about all major repeat families, thus providing the opportunity for efficient repeat investigation in a wide range of species. However, the development of appropriate data mining tools is required in order to fully utilize this sequencing data for repeat characterization. RESULTS We adapted a graph-based approach for similarity-based partitioning of whole genome 454 sequence reads in order to build clusters made of the reads derived from individual repeat families. The information about cluster sizes was utilized for assessing the proportion and composition of repeats in the genomes of two model species, Pisum sativum and Glycine max, differing in genome size and 454 sequencing coverage. Moreover, statistical analysis and visual inspection of the topology of the cluster graphs using a newly developed program tool, SeqGrapheR, were shown to be helpful in distinguishing basic types of repeats and investigating sequence variability within repeat families. CONCLUSIONS Repetitive regions of plant genomes can be efficiently characterized by the presented graph-based analysis and the graph representation of repeats can be further used to assess the variability and evolutionary divergence of repeat families, discover and characterize novel elements, and aid in subsequent assembly of their consensus sequences.
Collapse
Affiliation(s)
- Petr Novák
- Biology Centre ASCR, Institute of Plant Molecular Biology, Branisovska 31, Ceske Budejovice, CZ-37005, Czech Republic
| | - Pavel Neumann
- Biology Centre ASCR, Institute of Plant Molecular Biology, Branisovska 31, Ceske Budejovice, CZ-37005, Czech Republic
| | - Jiří Macas
- Biology Centre ASCR, Institute of Plant Molecular Biology, Branisovska 31, Ceske Budejovice, CZ-37005, Czech Republic
| |
Collapse
|
38
|
Comparative genomic analysis reveals species-dependent complexities that explain difficulties with microsatellite marker development in molluscs. Heredity (Edinb) 2010; 106:78-87. [PMID: 20424639 DOI: 10.1038/hdy.2010.36] [Citation(s) in RCA: 37] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022] Open
Abstract
Reliable population DNA molecular markers are difficult to develop for molluscs, the reasons for which are largely unknown. Identical protocols for microsatellite marker development were implemented in three gastropods. Success rates were lower for Gibbula cineraria compared to Littorina littorea and L. saxatilis. Comparative genomic analysis of 47.2 kb of microsatellite containing sequences (MCS) revealed a high incidence of cryptic repetitive DNA in their flanking regions. The majority of these were novel, and could be grouped into DNA families based upon sequence similarities. Significant inter-specific variation in abundance of cryptic repetitive DNA and DNA families was observed. Repbase scans show that a large proportion of cryptic repetitive DNA was identified as transposable elements (TEs). We argue that a large number of TEs and their transpositional activity may be linked to differential rates of DNA multiplication and recombination. This is likely to be an important factor explaining inter-specific variation in genome stability and hence microsatellite marker development success rates. Gastropods also differed significantly in the type of TEs classes (autonomous vs non-autonomous) observed. We propose that dissimilar transpositional mechanisms differentiate the TE classes in terms of their propensity for transposition, fixation and/or silencing. Consequently, the phylogenetic conservation of non-autonomous TEs, such as CvA, suggests that dispersal of these elements may have behaved as microsatellite-inducing elements. Results seem to indicate that, compared to autonomous, non-autonomous TEs maybe have a more active role in genome rearrangement processes. The implications of the findings for genomic rearrangement, stability and marker development are discussed.
Collapse
|
39
|
Macas J, Koblízková A, Navrátilová A, Neumann P. Hypervariable 3' UTR region of plant LTR-retrotransposons as a source of novel satellite repeats. Gene 2009; 448:198-206. [PMID: 19563868 DOI: 10.1016/j.gene.2009.06.014] [Citation(s) in RCA: 58] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2009] [Revised: 06/17/2009] [Accepted: 06/19/2009] [Indexed: 11/15/2022]
Abstract
The repetitive sequence PisTR-A has an unusual organization in the pea (Pisum sativum) genome, being present both as short dispersed repeats as well as long arrays of tandemly arranged satellite DNA. Cloning, sequencing and FISH analysis of both PisTR-A variants revealed that the former occurs in the genome embedded within the sequence of Ty3/gypsy-like Ogre elements, whereas the latter forms homogenized arrays of satellite repeats at several genomic loci. The Ogre elements carry the PisTR-A sequences in their 3' untranslated region (UTR) separating the gag-pol region from the 3' LTR. This region was found to be highly variable among pea Ogre elements, and includes a number of other tandem repeats along with or instead of PisTR-A. Bioinformatic analysis of LTR-retrotransposons mined from available plant genomic sequence data revealed that the frequent occurrence of variable tandem repeats within 3' UTRs is a typical feature of the Tat lineage of plant retrotransposons. Comparison of these repeats to known plant satellite sequences uncovered two other instances of satellites with sequence similarity to a Tat-like retrotransposon 3' UTR regions. These observations suggest that some retrotransposons may significantly contribute to satellite DNA evolution by generating a library of short repeat arrays that can subsequently be dispersed through the genome and eventually further amplified and homogenized into novel satellite repeats.
Collapse
Affiliation(s)
- Jirí Macas
- Biology Centre ASCR, Institute of Plant Molecular Biology, Branisovská 31, Ceské Budejovice, CZ-37005, Czech Republic.
| | | | | | | |
Collapse
|