1
|
Kalendar R, Shustov AV, Schulman AH. Palindromic Sequence-Targeted (PST) PCR, Version 2: An Advanced Method for High-Throughput Targeted Gene Characterization and Transposon Display. FRONTIERS IN PLANT SCIENCE 2021; 12:691940. [PMID: 34239528 PMCID: PMC8258406 DOI: 10.3389/fpls.2021.691940] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/07/2021] [Accepted: 05/20/2021] [Indexed: 05/28/2023]
Abstract
Genome walking (GW), a strategy for capturing previously unsequenced DNA fragments that are in proximity to a known sequence tag, is currently predominantly based on PCR. Recently developed PCR-based methods allow for combining of sequence-specific primers with designed capturing primers capable of annealing to unknown DNA targets, thereby offering the rapidity and effectiveness of PCR. This study presents a methodological improvement to the previously described GW technique known as palindromic sequence-targeted PCR (PST-PCR). Like PST-PCR, this new method (called PST-PCR v.2) relies on targeting of capturing primers to palindromic sequences arbitrarily present in natural DNA templates. PST-PCR v.2 consists of two rounds of PCR. The first round uses a combination of one sequence-specific primer with one capturing (PST) primer. The second round uses a combination of a single (preferred) or two universal primers; one anneals to a 5' tail attached to the sequence-specific primer and the other anneals to a different 5' tail attached to the PST primer. The key advantage of PST-PCR v.2 is the convenience of using a single universal primer with invariable sequences in GW processes involving various templates. The entire procedure takes approximately 2-3 h to produce the amplified PCR fragment, which contains a portion of a template flanked by the sequence-specific and capturing primers. PST-PCR v.2 is highly suitable for simultaneous work with multiple samples. For this reason, PST-PCR v.2 can be applied beyond the classical task of GW for studies in population genetics, in which PST-PCR v.2 is a preferred alternative to amplified fragment length polymorphism (AFLP) or next-generation sequencing. Furthermore, the conditions for PST-PCR v.2 are easier to optimize, as only one sequence-specific primer is used. This reduces non-specific random amplified polymorphic DNA (RAPD)-like amplification and formation of non-templated amplification. Importantly, akin to the previous version, PST-PCR v.2 is not sensitive to template DNA sequence complexity or quality. This study illustrates the utility of PST-PCR v.2 for transposon display (TD), which is a method to characterize inter- or intra-specific variability related to transposon integration sites. The Ac transposon sequence in the maize (Zea mays) genome was used as a sequence tag during the TD procedure to characterize the Ac integration sites.
Collapse
Affiliation(s)
- Ruslan Kalendar
- National Laboratory Astana, Nazarbayev University, Nur-Sultan, Kazakhstan
- Viikki Plant Science Centre, HiLIFE Institute of Biotechnology, University of Helsinki, Helsinki, Finland
| | | | - Alan H. Schulman
- Viikki Plant Science Centre, HiLIFE Institute of Biotechnology, University of Helsinki, Helsinki, Finland
- Natural Resources Institute Finland (Luke), Helsinki, Finland
| |
Collapse
|
2
|
Retrotransposable Elements: DNA Fingerprinting and the Assessment of Genetic Diversity. Methods Mol Biol 2021; 2222:263-286. [PMID: 33301099 DOI: 10.1007/978-1-0716-0997-2_15] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023]
Abstract
Retrotransposable elements (RTEs) are highly common mobile genetic elements that are composed of several classes and make up the majority of eukaryotic genomes. The "copy-out and paste-in" life cycle of replicative transposition in these dispersive and ubiquitous RTEs leads to new genome insertions without excision of the original element. RTEs are important drivers of species diversity; they exhibit great variety in structure, size, and mechanisms of transposition, making them important putative components in genome evolution. Accordingly, various applications have been developed to explore the polymorphisms in RTE insertion patterns. These applications include conventional or anchored polymerase chain reaction (PCR) and quantitative or digital PCR with primers designed for the 5' or 3' junction. Marker systems exploiting these PCR methods can be easily developed and are inexpensively used in the absence of extensive genome sequence data. The main inter-repeat amplification polymorphism techniques include inter-retrotransposon amplified polymorphism (IRAP), retrotransposon microsatellite amplified polymorphism (REMAP), and Inter-Primer Binding Site (iPBS) for PCR amplification with a single or two primers.
Collapse
|
3
|
Branco I, Choupina A. Bioinformatics: new tools and applications in life science and personalized medicine. Appl Microbiol Biotechnol 2021; 105:937-951. [PMID: 33404829 DOI: 10.1007/s00253-020-11056-2] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2020] [Revised: 11/29/2020] [Accepted: 12/09/2020] [Indexed: 11/28/2022]
Abstract
While we have a basic understanding of the functioning of the gene when coding sequences of specific proteins, we feel the lack of information on the role that DNA has on specific diseases or functions of thousands of proteins that are produced. Bioinformatics combines the methods used in the collection, storage, identification, analysis, and correlation of this huge and complex information. All this work produces an "ocean" of information that can only be "sailed" with the help of computerized methods. The goal is to provide scientists with the right means to explain normal biological processes, dysfunctions of these processes which give rise to disease and approaches that allow the discovery of new medical cures. Recently, sequencing platforms, a large scale of genomes and transcriptomes, have created new challenges not only to the genomics but especially for bioinformatics. The intent of this article is to compile a list of tools and information resources used by scientists to treat information from the massive sequencing of recent platforms to new generations and the applications of this information in different areas of life sciences including medicine. KEY POINTS: • Biological data mining • Omic approaches • From genotype to phenotype.
Collapse
Affiliation(s)
- Iuliia Branco
- Centro de Investigação de Montanha (CIMO), Instituto Politécnico de Bragança, Campus de Santa Apolónia, 5300-253, Bragança, Portugal
| | - Altino Choupina
- Centro de Investigação de Montanha (CIMO), Instituto Politécnico de Bragança, Campus de Santa Apolónia, 5300-253, Bragança, Portugal.
| |
Collapse
|
4
|
Kalendar R, Raskina O, Belyayev A, Schulman AH. Long Tandem Arrays of Cassandra Retroelements and Their Role in Genome Dynamics in Plants. Int J Mol Sci 2020; 21:ijms21082931. [PMID: 32331257 PMCID: PMC7215508 DOI: 10.3390/ijms21082931] [Citation(s) in RCA: 23] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2020] [Revised: 04/15/2020] [Accepted: 04/17/2020] [Indexed: 02/07/2023] Open
Abstract
Retrotransposable elements are widely distributed and diverse in eukaryotes. Their copy number increases through reverse-transcription-mediated propagation, while they can be lost through recombinational processes, generating genomic rearrangements. We previously identified extensive structurally uniform retrotransposon groups in which no member contains the gag, pol, or env internal domains. Because of the lack of protein-coding capacity, these groups are non-autonomous in replication, even if transcriptionally active. The Cassandra element belongs to the non-autonomous group called terminal-repeat retrotransposons in miniature (TRIM). It carries 5S RNA sequences with conserved RNA polymerase (pol) III promoters and terminators in its long terminal repeats (LTRs). Here, we identified multiple extended tandem arrays of Cassandra retrotransposons within different plant species, including ferns. At least 12 copies of repeated LTRs (as the tandem unit) and internal domain (as a spacer), giving a pattern that resembles the cellular 5S rRNA genes, were identified. A cytogenetic analysis revealed the specific chromosomal pattern of the Cassandra retrotransposon with prominent clustering at and around 5S rDNA loci. The secondary structure of the Cassandra retroelement RNA is predicted to form super-loops, in which the two LTRs are complementary to each other and can initiate local recombination, leading to the tandem arrays of Cassandra elements. The array structures are conserved for Cassandra retroelements of different species. We speculate that recombination events similar to those of 5S rRNA genes may explain the wide variation in Cassandra copy number. Likewise, the organization of 5S rRNA gene sequences is very variable in flowering plants; part of what is taken for 5S gene copy variation may be variation in Cassandra number. The role of the Cassandra 5S sequences remains to be established.
Collapse
Affiliation(s)
- Ruslan Kalendar
- Department of Agricultural Sciences, University of Helsinki, P.O. Box 27 (Latokartanonkaari 5), FI-00014 Helsinki, Finland
- RSE “National Center for Biotechnology”, Korgalzhyn Highway 13/5, Nur-Sultan 010000, Kazakhstan
- Correspondence: (R.K.); (A.H.S.)
| | - Olga Raskina
- Institute of Evolution, University of Haifa, Mount Carmel, Haifa 31905, Israel;
| | - Alexander Belyayev
- Laboratory of Molecular Cytogenetics and Karyology, Institute of Botany of the ASCR, Zámek 1, CZ-252 43 Průhonice, Czech Republic;
| | - Alan H. Schulman
- Natural Resources Institute Finland (Luke), Latokartanonkaari 9, FI-00790 Helsinki, Finland
- Institute of Biotechnology and Viikki Plant Science Centre, University of Helsinki, P.O. Box 65, FI-00014 Helsinki, Finland
- Correspondence: (R.K.); (A.H.S.)
| |
Collapse
|
5
|
High-throughput retrotransposon-based genetic diversity of maize germplasm assessment and analysis. Mol Biol Rep 2020; 47:1589-1603. [PMID: 31919750 DOI: 10.1007/s11033-020-05246-4] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2019] [Accepted: 01/03/2020] [Indexed: 01/08/2023]
Abstract
Maize is one of the world's most important crops and a model for grass genome research. Long terminal repeat (LTR) retrotransposons comprise most of the maize genome; their ability to produce new copies makes them efficient high-throughput genetic markers. Inter-retrotransposon-amplified polymorphisms (IRAPs) were used to study the genetic diversity of maize germplasm. Five LTR retrotransposons (Huck, Tekay, Opie, Ji, and Grande) were chosen, based on their large number of copies in the maize genome, whereas polymerase chain reaction primers were designed based on consensus LTR sequences. The LTR primers showed high quality and reproducible DNA fingerprints, with a total of 677 bands including 392 polymorphic bands showing 58% polymorphism between maize hybrid lines. These markers were used to identify genetic similarities among all lines of maize. Analysis of genetic similarity was carried out based on polymorphic amplicon profiles and genetic similarity phylogeny analysis. This diversity was expected to display ecogeographical patterns of variation and local adaptation. The clustering method showed that the varieties were grouped into three clusters differing in ecogeographical origin. Each of these clusters comprised divergent hybrids with convergent characters. The clusters reflected the differences among maize hybrids and were in accordance with their pedigree. The IRAP technique is an efficient high-throughput genetic marker-generating method.
Collapse
|
6
|
Jia B, Li X, Liu W, Lu C, Lu X, Ma L, Li YY, Wei C. GLAPD: Whole Genome Based LAMP Primer Design for a Set of Target Genomes. Front Microbiol 2019; 10:2860. [PMID: 31921040 PMCID: PMC6923652 DOI: 10.3389/fmicb.2019.02860] [Citation(s) in RCA: 25] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2019] [Accepted: 11/26/2019] [Indexed: 11/23/2022] Open
Abstract
Loop-mediated isothermal amplification (LAMP) technology has been applied in a wide range of fields such as detection of foodborne bacteria and clinical pathogens due to its simplicity and efficiency. However, existing LAMP primer designing systems require a conserved gene or a short genome region as input, and they can’t design group-specific primers. With the growing number of whole genomes available, it is possible to design better primers to target a set of genomes with high specificity based on whole genomes. We present here a whole Genome based LAMP primer designer (GLAPD), a new system to design LAMP primer for a set of target genomes using whole genomes. Candidate single primer regions are identified genome wide and then combined into LAMP primer sets. For a given set of target genomes, only primer sets amplifying them and only these genomes will be output. In order to accelerate the primer designing, a GPU version is provided as well. The effectiveness of primers designed by GLAPD has been assessed for a wide range of foodborne bacteria. GLAPD can be accessed at http://cgm.sjtu.edu.cn/GLAPD/ or https://github.com/jiqingxiaoxi/GLAPD.git. A simple online version is also supplied to help users to learn and test GLAPD: http://cgm.sjtu.edu.cn/GLAPD/online/.
Collapse
Affiliation(s)
- Ben Jia
- Department of Bioinformatics and Biostatistics, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, China
| | - Xueling Li
- Shanghai Center for Bioinformation Technology, Shanghai, China
| | - Wei Liu
- Shanghai Center for Bioinformation Technology, Shanghai, China
| | - Changde Lu
- Shanghai Center for Bioinformation Technology, Shanghai, China
| | - Xiaoting Lu
- Shanghai Center for Bioinformation Technology, Shanghai, China
| | - Liangxiao Ma
- Shanghai Center for Bioinformation Technology, Shanghai, China
| | - Yuan-Yuan Li
- Shanghai Center for Bioinformation Technology, Shanghai, China
| | - Chaochun Wei
- Department of Bioinformatics and Biostatistics, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, China.,Shanghai Center for Bioinformation Technology, Shanghai, China
| |
Collapse
|
7
|
Palindromic sequence-targeted (PST) PCR: a rapid and efficient method for high-throughput gene characterization and genome walking. Sci Rep 2019; 9:17707. [PMID: 31776407 PMCID: PMC6881309 DOI: 10.1038/s41598-019-54168-0] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2019] [Accepted: 11/07/2019] [Indexed: 11/23/2022] Open
Abstract
Genome walking (GW) refers to the capture and sequencing of unknown regions in a long DNA molecule that are adjacent to a region with a known sequence. A novel PCR-based method, palindromic sequence-targeted PCR (PST-PCR), was developed. PST-PCR is based on a distinctive design of walking primers and special thermal cycling conditions. The walking primers (PST primers) match palindromic sequences (PST sites) that are randomly distributed in natural DNA. The PST primers have palindromic sequences at their 3′-ends. Upstream of the palindromes there is a degenerate sequence (8–12 nucleotides long); defined adapters are present at the 5′-termini. The thermal cycling profile has a linear amplification phase and an exponential amplification phase differing in annealing temperature. Changing the annealing temperature to switch the amplification phases at a defined cycle controls the balance between sensitivity and specificity. In contrast to traditional genome walking methods, PST-PCR is rapid (two to three hours to produce GW fragments) as it uses only one or two PCR rounds. Using PST-PCR, previously unknown regions (the promoter and intron 1) of the VRN1 gene of Timothy-grass (Phleum pratense L.) were captured for sequencing. In our experience, PST-PCR had higher throughput and greater convenience in comparison to other GW methods.
Collapse
|
8
|
Shirshikov FV, Pekov YA, Miroshnikov KA. MorphoCatcher: a multiple-alignment based web tool for target selection and designing taxon-specific primers in the loop-mediated isothermal amplification method. PeerJ 2019; 7:e6801. [PMID: 31086739 PMCID: PMC6487805 DOI: 10.7717/peerj.6801] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2018] [Accepted: 03/18/2019] [Indexed: 11/20/2022] Open
Abstract
BACKGROUND Advantages of loop-mediated isothermal amplification in molecular diagnostics allow to consider the method as a promising technology of nucleic acid detection in agriculture and medicine. A bioinformatics tool that provides rapid screening and selection of target nucleotide sequences with subsequent taxon-specific primer design toward polymorphic orthologous genes, not only unique or conserved common regions of genome, would contribute to the development of more specific and sensitive diagnostic assays. However, considering features of the original software for primer selection, also known as the PrimerExplorer (Eiken Chemical Co. LTD, Tokyo, Japan), the taxon-specific primer design using multiple sequence alignments of orthologs or even viral genomes with conservative architecture is still complicated. FINDINGS Here, MorphoCatcher is introduced as a fast and simple web plugin for PrimerExplorer with a clear interface. It enables an execution of multiple-alignment based search of taxon-specific mutations, visual screening and selection of target sequences, and easy-to-start specific primer design using the PrimerExplorer software. The combination of MorphoCatcher and PrimerExplorer allows to perform processing of the multiple alignments of orthologs for informative sliding-window plot analysis, which is used to identify the sequence regions with a high density of taxon-specific mutations and cover them by the primer ends for better specificity of amplification. CONCLUSIONS We hope that this new bioinformatics tool developed for target selection and taxon-specific primer design, called the MorphoCatcher, will gain more popularity of the loop-mediated isothermal amplification method for molecular diagnostics community. MorphoCatcher is a simple web plugin tool for the PrimerExplorer software which is freely available only for non-commercial and academic users at http://morphocatcher.ru.
Collapse
Affiliation(s)
- Fedor V Shirshikov
- Shemyakin & Ovchinnikov Institute of Bioorganic Chemistry RAS, Moscow, Russia
| | - Yuri A Pekov
- Lomonosov Moscow State University, Moscow, Russia
| | | |
Collapse
|
9
|
Natural History of a Satellite DNA Family: From the Ancestral Genome Component to Species-Specific Sequences, Concerted and Non-Concerted Evolution. Int J Mol Sci 2019; 20:ijms20051201. [PMID: 30857296 PMCID: PMC6429384 DOI: 10.3390/ijms20051201] [Citation(s) in RCA: 25] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2019] [Revised: 03/04/2019] [Accepted: 03/06/2019] [Indexed: 12/20/2022] Open
Abstract
Satellite DNA (satDNA) is the most variable fraction of the eukaryotic genome. Related species share a common ancestral satDNA library and changing of any library component in a particular lineage results in interspecific differences. Although the general developmental trend is clear, our knowledge of the origin and dynamics of satDNAs is still fragmentary. Here, we explore whole genome shotgun Illumina reads using the RepeatExplorer (RE) pipeline to infer satDNA family life stories in the genomes of Chenopodium species. The seven diploids studied represent separate lineages and provide an example of a species complex typical for angiosperms. Application of the RE pipeline allowed by similarity searches a determination of the satDNA family with a basic monomer of ~40 bp and to trace its transformation from the reconstructed ancestral to the species-specific sequences. As a result, three types of satDNA family evolutionary development were distinguished: (i) concerted evolution with mutation and recombination events; (ii) concerted evolution with a trend toward increased complexity and length of the satellite monomer; and (iii) non-concerted evolution, with low levels of homogenization and multidirectional trends. The third type is an example of entire repeatome transformation, thus producing a novel set of satDNA families, and genomes showing non-concerted evolution are proposed as a significant source for genomic diversity.
Collapse
|
10
|
Genetic analysis of the grapevine genotypes of the Russian Vitis ampelographic collection using iPBS markers. Genetica 2019; 147:91-101. [PMID: 30783944 DOI: 10.1007/s10709-019-00055-5] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2018] [Accepted: 02/12/2019] [Indexed: 12/11/2022]
Abstract
Cultivated grapevine (Vitis vinifera L. ssp. sativa D.C.) is one of the oldest agricultural crops, each variety comprising an array of clones obtained by vegetative propagation from a selected vine grown from a single seedling. Most clones within a variety are identical, but some show a different form of accession, giving rise to new divergent phenotypes. Understanding the associations among the genotypes within a variety is crucial to efficient management and effective grapevine improvement. Inter-primer binding-site (iPBS) markers may aid in determining the new clones inside closely related genotypes. Following this idea, iPBS markers were used to assess the genetic variation of 33 grapevine genotypes collected from Russia. We used molecular markers to identify the differences among and within five grapevine clonal populations and analysed the variation, using clustering and statistical approaches. Four of a total of 30 PBS primers were selected, based on amplification efficiency. Polymerase chain reaction (PCR) with PBS primers resulted in a total of 1412 bands ranging from 300 to 6000 bp, with a polymorphism ratio of 44%, ranging from 58 to 75 bands per group. In total, were identified seven private bands in 33 genotypes. Results of molecular variance analysis showed that 40% of the total variation was observed within groups and only 60% between groups. Cluster analysis clearly showed that grapevine genotypes are highly divergent and possess abundant genetic diversities. The iPBS PCR-based genome fingerprinting technology used in this study effectively differentiated genotypes into five grapevine groups and indicated that iPBS markers are useful tools for clonal selection. The number of differences between clones was sufficient to identify them as separate clones of studied varieties containing unique mutations. Our previous phenotypic and phenological studies have confirmed that these genotypes differ from those of maternal plants. This work emphasized the need for a better understanding of the genotypic differences among closely related varieties of grapevine and has implications for the management of its selection processes.
Collapse
|
11
|
Novel synthetic nucleotides of notifiable dengue (1-4), Japanese encephalitis, yellow fever and Zika flaviviruses. Future Sci OA 2018; 5:FSO353. [PMID: 30652021 PMCID: PMC6331751 DOI: 10.4155/fsoa-2018-0081] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2018] [Accepted: 09/27/2018] [Indexed: 11/30/2022] Open
Abstract
Aim: To produce synthetic nucleotides of notifiable dengue virus (1–4 types), Japanese encephalitis, yellow fever and Zika flaviviruses. These notifiable flaviviruses, particularly dengue and Zika, are problematic mosquito-borne infections in the Philippines, as well as in those countries with tropical and subtropical climates. Method: An algorithmic design formulation of overlap extension – polymerase chain reaction (OE-PCR) was performed to propagate 50–60 oligomer lengths of select notifiable flaviviral RNAs to DNA nucleotides via the two-step process of OE-PCR. Result: Algorithmic OE-PCR design formulation efficiently produced 253–256 bp of notifiable flaviviruses. Comparing the newly designed algorithmic OE-PCR with existing executable programs demonstrated it to be efficient and useful in generating accurate sequences of synthetic flaviviral nucleotides. Conclusion: The efficiently and accurately produced novel synthetic nucleotides of notifiable dengue virus 1–4, Japanese encephalitis, yellow fever and Zika flaviviruses using OE-PCR is useful in understanding the dynamics of flaviviral species and holds potential for the development of synthetic nucleotide-based immunogens. Dengue virus (1–4), Japanese encephalitis and Zika fevers are notable mosquito-borne infections that continue to be problematic in the Philippines and its southeast Asian neighbors. Flaviviral yellow fever affects other tropical countries. This study aimed to effectively and efficiently synthesize flaviviral nucleotides through a predictively accurate algorithmic overlap extension-polymerase chain reaction design. The cost-saving arithmetic formulation has enabled the advanced production of 253–256 bp flaviviral nucleotide lengths. The newly synthesized and sequenced products are now ready for further experimentation and open the door for immunological exploration.
Collapse
|