1
|
Panda S, Swain SK, Sahu BP, Sarangi R. Insights into genome plasticity and gene regulation in Orientia tsutsugamushi through genome-wide mining of microsatellite markers. 3 Biotech 2023; 13:366. [PMID: 37840877 PMCID: PMC10575825 DOI: 10.1007/s13205-023-03795-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2023] [Accepted: 09/25/2023] [Indexed: 10/17/2023] Open
Abstract
Microsatellite markers are being used for molecular identification and characterization as well as estimation of evolution patterns due to their highly polymorphic nature. The repeats hold 40% of the entire genome of Orientia tsutsugamushi (OT), but not yet been characterized. Thus, we investigated the genome-wide presence of microsatellites within nine complete genomes of OT and analyzed their distribution pattern, composition, and complexity. The in-silico study revealed that the genome of OT enriched with microsatellites having a total of 126,187 SSRs and 10,374 cSSRs throughout the genome, of which 70% and 30% are represented within the coding and non-coding regions, respectively. The relative density (RD) and relative abundance (RA) of SSRs were 42-44.43/kb and 6.25-6.59/kb, while for cSSRs this value ranged from 7.06 to 8.1/kb and 0.50 to 0.55/kb, respectively. However, RA and RD were weakly correlated with genome size and incidence of microsatellites. The mononucleotide repeats (54.55%) were prevalent over di- (33.22%), tri- (11.88%), tetra- (0.27%), penta- (0.02%), hexanucleotide (0.04%) repeats, with poly (A/T) richness over poly (G/C). The motif composition of cSSRs revealed that maximum cSSRs were made up of two microsatellites having unique duplication patterns such as AT-x-AT and CG-x-CG. To our knowledge, this is the first study of microsatellites in the OT genome, where characterization of such variations in repeat sequences would be important in deciphering the origin, rate of mutation, and role of repeat sequences in the genome. More numbers of microsatellites represented within the coding region provide an insight into the genome plasticity that may interfere with gene regulation to mitigate host-pathogen interaction and evolution of the species.
Collapse
Affiliation(s)
- Subhasmita Panda
- Department of Pediatrics, IMS and SUM Hospital, Siksha ‘O’ Anusandhan (Deemed to be University), K8, Kalinga Nagar, Bhubaneswar, Odisha 751003 India
| | - Subrat Kumar Swain
- Medical Research Laboratory, IMS and SUM Hospital, Siksha ‘O’ Anusandhan (Deemed to be University), K8, Kalinga Nagar, Bhubaneswar, Odisha 751003 India
| | - Basanta Pravas Sahu
- School of Biological Sciences, The University of Hong Kong, Pokfulam, Hong Kong
| | - Rachita Sarangi
- Department of Pediatrics, IMS and SUM Hospital, Siksha “O” Anusandhan (Deemed to be University), K8, Kalinga Nagar, Bhubaneswar, Odisha 751003 India
| |
Collapse
|
2
|
Jain A, Sharma PC. Occurrence and distribution of compound microsatellites in the genomes of three economically important virus families. INFECTION GENETICS AND EVOLUTION 2021; 92:104853. [PMID: 33839312 DOI: 10.1016/j.meegid.2021.104853] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Subscribe] [Scholar Register] [Received: 01/17/2021] [Revised: 04/01/2021] [Accepted: 04/04/2021] [Indexed: 11/15/2022]
Abstract
Microsatellites are nonrandom hypervariable iterations of one to six nucleotides, existing across the coding as well as noncoding regions of virtually all known genomes, arising primarily due to polymerase slippage and unequal crossing over during replication events. Two or more perfect microsatellites located in close proximity form compound microsatellites. We studied the distribution of compound microsatellites in 118 ssDNA virus genomes belonging to three economically important virus families, namely Anelloviridae, Circoviridae, and Parvoviridae, known to predominantly infect livestock and humans. Among these virus families, 0-58.49% of perfect microsatellites were involved in the formation of compound microsatellites, the majority being located in the coding regions. No clear relationship existed between the genomic features (genome size and GC%) and compound microsatellite characteristics (relative abundance and relative density). The majority of the compound microsatellites resulted from di-SSR couples. A strong positive relationship was observed between the maximum distance value and length of compound microsatellite, percentage of microsatellites involved in the compound microsatellite formation, and relative microsatellite density. The degree of variability among microsatellite characteristics studied was largely a species-specific phenomenon. A major proportion of compound microsatellites was represented by similar motif combinations. The findings of the present study will help in better understanding of the structural, functional, and evolutionary role of compound microsatellites prevailing in the smaller genomes.
Collapse
Affiliation(s)
- Ankit Jain
- Merck Life Science Pvt. Ltd, Sector-17, Chandigarh, India
| | - Prakash C Sharma
- University School of Biotechnology, Guru Gobind Singh Indraprastha University, Dwarka Sector-16 C, New Delhi 11078, India.
| |
Collapse
|
3
|
Laskar R, Jilani MG, Ali S. Implications of genome simple sequence repeats signature in 98 Polyomaviridae species. 3 Biotech 2021; 11:35. [PMID: 33432281 PMCID: PMC7787124 DOI: 10.1007/s13205-020-02583-w] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2020] [Accepted: 11/02/2020] [Indexed: 01/21/2023] Open
Abstract
The analysis of simple sequence repeats (SSRs) in 98 genomes across four genera of the family Polyomaviridae was performed. The genome size ranged from 3962 (BM87) to 7369 bp (BM85) but maximum genomes were in the range of 5-5.5 kb. The GC% had an average of 42% and ranged between 34.69 (BM95) and 52.35 (BM81). A total of 3036 SSRs and 223 cSSRs were extracted using IMEx with incident frequency from 18 to 56 and 0 to 7, respectively. The most prevalent mono-nucleotide repeat motif was "T" (48.95%) followed by "A" (33.48%). "AT/TA" was the most prevalent dinucleotide motif closely followed by "CT/TC". The distribution was expectedly more in the coding region with 77.6% SSRs of which nearly half were in Large T Antigen (LTA) gene. Notably, most viruses with humans, apes and related species as host exhibited exclusivity of mono-nucleotide repeats in AT region, a proposed predictive marker for determination of humans as host in the virus in course of its evolution. Each genome has a unique SSR signature which is pivotal for viral evolution particularly in terms of host divergence. SUPPLEMENTARY INFORMATION The online version contains supplementary material available at 10.1007/s13205-020-02583-w.
Collapse
Affiliation(s)
- Rezwanuzzaman Laskar
- Clinical and Applied Genomics (CAG) Laboratory, Department of Biological Sciences, Aliah University, IIA/27, Newtown, Kolkata, 700160 India
| | - Md Gulam Jilani
- Clinical and Applied Genomics (CAG) Laboratory, Department of Biological Sciences, Aliah University, IIA/27, Newtown, Kolkata, 700160 India
| | - Safdar Ali
- Clinical and Applied Genomics (CAG) Laboratory, Department of Biological Sciences, Aliah University, IIA/27, Newtown, Kolkata, 700160 India
| |
Collapse
|
4
|
Comparative analysis, distribution, and characterization of microsatellites in Orf virus genome. Sci Rep 2020; 10:13852. [PMID: 32807836 PMCID: PMC7431841 DOI: 10.1038/s41598-020-70634-6] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2019] [Accepted: 07/01/2020] [Indexed: 11/09/2022] Open
Abstract
Genome-wide in-silico identification of microsatellites or simple sequence repeats (SSRs) in the Orf virus (ORFV), the causative agent of contagious ecthyma has been carried out to investigate the type, distribution and its potential role in the genome evolution. We have investigated eleven ORFV strains, which resulted in the presence of 1,036-1,181 microsatellites per strain. The further screening revealed the presence of 83-107 compound SSRs (cSSRs) per genome. Our analysis indicates the dinucleotide (76.9%) repeats to be the most abundant, followed by trinucleotide (17.7%), mononucleotide (4.9%), tetranucleotide (0.4%) and hexanucleotide (0.2%) repeats. The Relative Abundance (RA) and Relative Density (RD) of these SSRs varied between 7.6-8.4 and 53.0-59.5 bp/kb, respectively. While in the case of cSSRs, the RA and RD ranged from 0.6-0.8 and 12.1-17.0 bp/kb, respectively. Regression analysis of all parameters like the incident of SSRs, RA, and RD significantly correlated with the GC content. But in a case of genome size, except incident SSRs, all other parameters were non-significantly correlated. Nearly all cSSRs were composed of two microsatellites, which showed no biasedness to a particular motif. Motif duplication pattern, such as, (C)-x-(C), (TG)-x-(TG), (AT)-x-(AT), (TC)- x-(TC) and self-complementary motifs, such as (GC)-x-(CG), (TC)-x-(AG), (GT)-x-(CA) and (TC)-x-(AG) were observed in the cSSRs. Finally, in-silico polymorphism was assessed, followed by in-vitro validation using PCR analysis and sequencing. The thirteen polymorphic SSR markers developed in this study were further characterized by mapping with the sequence present in the database. The results of the present study indicate that these SSRs could be a useful tool for identification, analysis of genetic diversity, and understanding the evolutionary status of the virus.
Collapse
|
5
|
Alam CM, Iqbal A, Sharma A, Schulman AH, Ali S. Microsatellite Diversity, Complexity, and Host Range of Mycobacteriophage Genomes of the Siphoviridae Family. Front Genet 2019; 10:207. [PMID: 30923537 PMCID: PMC6426759 DOI: 10.3389/fgene.2019.00207] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2018] [Accepted: 02/26/2019] [Indexed: 01/21/2023] Open
Abstract
The incidence, distribution, and variation of simple sequence repeats (SSRs) in viruses is instrumental in understanding the functional and evolutionary aspects of repeat sequences. Full-length genome sequences retrieved from NCBI were used for extraction and analysis of repeat sequences using IMEx software. We have also developed two MATLAB-based tools for extraction of gene locations from GenBank in tabular format and simulation of this data with SSR incidence data. Present study encompassing 147 Mycobacteriophage genomes revealed 25,284 SSRs and 1,127 compound SSRs (cSSRs) through IMEx. Mono- to hexa-nucleotide motifs were present. The SSR count per genome ranged from 78 (M100) to 342 (M58) while cSSRs incidence ranged from 1 (M138) to 17 (M28, M73). Though cSSRs were present in all the genomes, their frequency and SSR to cSSR conversion percentage varied from 1.08 (M138 with 93 SSRs) to 8.33 (M116 with 96 SSRs). In terms of localization, the SSRs were predominantly localized to coding regions (∼78%). Interestingly, genomes of around 50 kb contained a similar number of SSRs/cSSRs to that in a 110 kb genome, suggesting functional relevance for SSRs which was substantiated by variation in motif constitution between species with different host range. The three species with broad host range (M97, M100, M116) have around 90% of their mono-nucleotide repeat motifs composed of G or C and only M16 has both A and T mononucleotide motifs. Around 20% of the di-nucleotide repeat motifs in the genomes exhibiting a broad host range were CT/TC, which were either absent or represented to a much lesser extent in the other genomes.
Collapse
Affiliation(s)
- Chaudhary Mashhood Alam
- Luke/BI Plant Genome Dynamics Lab, Institute of Biotechnology and Viikki Plant Science Centre, University of Helsinki, Helsinki, Finland.,Ingenious e-Brain Solutions, Gurugram, India
| | - Asif Iqbal
- PIRO Technologies Private Limited, New Delhi, India
| | - Anjana Sharma
- Department of Biomedical Sciences, SRCASW, University of Delhi, New Delhi, India
| | - Alan H Schulman
- Luke/BI Plant Genome Dynamics Lab, Institute of Biotechnology and Viikki Plant Science Centre, University of Helsinki, Helsinki, Finland.,Natural Resources Institute Finland (Luke), Helsinki, Finland
| | - Safdar Ali
- Department of Biomedical Sciences, SRCASW, University of Delhi, New Delhi, India.,Department of Biological Sciences, Aliah University, Kolkata, India
| |
Collapse
|
6
|
Comparative analysis on precise distribution-patterns of microsatellites in HIV-1 with differential statistical method. GENE REPORTS 2018. [DOI: 10.1016/j.genrep.2018.06.007] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
7
|
Genome-wide In Silico Analysis, Characterization and Identification of Microsatellites in Spodoptera littoralis Multiple nucleopolyhedrovirus (SpliMNPV). Sci Rep 2016; 6:33741. [PMID: 27650818 PMCID: PMC5030640 DOI: 10.1038/srep33741] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2016] [Accepted: 09/01/2016] [Indexed: 01/10/2023] Open
Abstract
In this study, we undertook a survey to analyze the distribution and frequency of microsatellites or Simple Sequence Repeats (SSRs) in Spodoptera littoralis multiple nucleopolyhedrovirus (SpliMNPV) genome (isolate AN-1956). Out of the 55 microsatellite motifs, identified in the SpliMNPV-AN1956 genome using in silico analysis (inclusive of mono-, di-, tri- and hexa-nucleotide repeats), 39 were found to be distributed within coding regions (cSSRs), whereas 16 were observed to lie within intergenic or noncoding regions. Among the 39 motifs located in coding regions, 21 were located in annotated functional genes whilst 18 were identified in unknown functional genes (hypothetical proteins). Among the identified motifs, trinucleotide (80%) repeats were found to be the most abundant followed by dinucleotide (13%), mononucleotide (5%) and hexanucleotide (2%) repeats. The 39 motifs located within coding regions were further validated in vitro by using PCR analysis, while the 21 motifs located within known functional genes (15 genes) were characterized using nucleotide sequencing. A comparison of the sequence analysis data of the 21 sequenced cSSRs with the published sequences is presented. Finally, the developed SSR markers of the 39 motifs were further mapped/localized onto the SpliMNPV-AN1956 genome. In conclusion, the SSR markers specific to SpliMNPV, developed in this study, could be a useful tool for the identification of isolates and analysis of genetic diversity and viral evolutionary status.
Collapse
|
8
|
Survey and analysis of simple sequence repeats (SSRs) in three genomes of Candida species. Gene 2016; 584:129-35. [PMID: 26883055 DOI: 10.1016/j.gene.2016.02.018] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2015] [Revised: 01/15/2016] [Accepted: 02/12/2016] [Indexed: 11/23/2022]
Abstract
Simple sequence repeats (SSRs) or microsatellites, which composed of tandem repeated short units of 1-6 bp, have been paying attention continuously. Here, the distribution, composition and polymorphism of microsatellites and compound microsatellites were analyzed in three available genomes of Candida species (Candida dubliniensis, Candida glabrata and Candida orthopsilosis). The results show that there were 118,047, 66,259 and 61,119 microsatellites in genomes of C. dubliniensis, C. glabrata and C. orthopsilosis, respectively. The SSRs covered more than 1/3 length of genomes in the three species. The microsatellites, which just consist of bases A and (or) T, such as (A)n, (T)n, (AT)n, (TA)n, (AAT)n, (TAA)n, (TTA)n, (ATA)n, (ATT)n and (TAT)n, were predominant in the three genomes. The length of microsatellites was focused on 6 bp and 9 bp either in the three genomes or in its coding sequences. What's more, the relative abundance (19.89/kbp) and relative density (167.87 bp/kbp) of SSRs in sequence of mitochondrion of C. glabrata were significantly great than that in any one of genomes or chromosomes of the three species. In addition, the distance between any two adjacent microsatellites was an important factor to influence the formation of compound microsatellites. The analysis may be helpful for further studying the roles of microsatellites in genomes' origination, organization and evolution of Candida species.
Collapse
|
9
|
Comparative analysis of microsatellites and compound microsatellites in T4-like viruses. Gene 2016; 575:695-701. [DOI: 10.1016/j.gene.2015.09.053] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2015] [Revised: 09/16/2015] [Accepted: 09/21/2015] [Indexed: 01/27/2023]
|
10
|
GEORGE B, GEORGE B, AWASTHI M, SINGH RN. In silico genome-wide identification and analysis of microsatelliterepeats in the largest RNA virus family (Closteroviridae). Turk J Biol 2016. [DOI: 10.3906/biy-1503-11] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023] Open
|
11
|
Basharat Z, Yasmin A. Survey of compound microsatellites in multiple Lactobacillus genomes. Can J Microbiol 2015; 61:898-902. [PMID: 26445296 DOI: 10.1139/cjm-2015-0136] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023]
Abstract
Distinct simple sequence repeats with 2 or more individual microsatellites joined together or lying adjacent to each other are identified as compound microsatellites. Investigation of such composite microsatellites in the genomes of genus Lactobacillus was the aim of this study. In silico inspection of microsatellite clustering in genomes of 14 Lactobacillus species revealed a wealth of compound microsatellites. All of the mined compound microsatellites were imperfect, were composed of variant motifs, and increased in all genomes, with maximum distance (dMAX) increments of 10 to 50. The majority of these repeats were present in the coding regions. A correlation of microsatellite to compound microsatellite density was detected. The difference established in compound microsatellite division among eukaryotes, Escherichia coli, and lactobacilli is suggestive of diverse genomic features and elementary distinction between creation and fixation methods of compound microsatellites among these organisms.
Collapse
Affiliation(s)
- Zarrin Basharat
- Microbiology and Biotechnology Research Laboratory, Department of Environmental Sciences, Fatima Jinnah Women University 46000, Pakistan.,Microbiology and Biotechnology Research Laboratory, Department of Environmental Sciences, Fatima Jinnah Women University 46000, Pakistan
| | - Azra Yasmin
- Microbiology and Biotechnology Research Laboratory, Department of Environmental Sciences, Fatima Jinnah Women University 46000, Pakistan.,Microbiology and Biotechnology Research Laboratory, Department of Environmental Sciences, Fatima Jinnah Women University 46000, Pakistan
| |
Collapse
|
12
|
George B, Alam CM, Kumar RV, Gnanasekaran P, Chakraborty S. Potential linkage between compound microsatellites and recombination in geminiviruses: Evidence from comparative analysis. Virology 2015; 482:41-50. [DOI: 10.1016/j.virol.2015.03.003] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2014] [Revised: 02/16/2015] [Accepted: 03/05/2015] [Indexed: 01/10/2023]
|
13
|
George B, George B, awasthi M, Singh RN. Genome wide survey and analysis of microsatellites in Tombusviridae family. Genes Genomics 2015. [DOI: 10.1007/s13258-015-0295-0] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
|
14
|
George B, Bhatt BS, Awasthi M, George B, Singh AK. Comparative analysis of microsatellites in chloroplast genomes of lower and higher plants. Curr Genet 2015; 61:665-77. [PMID: 25999216 DOI: 10.1007/s00294-015-0495-9] [Citation(s) in RCA: 39] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2015] [Revised: 05/05/2015] [Accepted: 05/08/2015] [Indexed: 12/29/2022]
Abstract
Microsatellites, or simple sequence repeats (SSRs), contain repetitive DNA sequence where tandem repeats of one to six base pairs are present number of times. Chloroplast genome sequences have been shown to possess extensive variations in the length, number and distribution of SSRs. However, a comparative analysis of chloroplast microsatellites is not available. Considering their potential importance in generating genomic diversity, we have systematically analysed the abundance and distribution of simple and compound microsatellites in 164 sequenced chloroplast genomes from wide range of plants. The key findings of these studies are (1) a large number of mononucleotide repeats as compared to SSR(2-6)(di-, tri-, tetra-, penta-, hexanucleotide repeats) are present in all chloroplast genomes investigated, (2) lower plants such as algae show wide variation in relative abundance, density and distribution of microsatellite repeats as compared to flowering plants, (3) longer SSRs are excluded from coding regions of most chloroplast genomes, (4) GC content has a weak influence on number, relative abundance and relative density of mononucleotide as well as SSR(2-6). However, GC content strongly showed negative correlation with relative density (R (2) = 0.5, P < 0.05) and relative abundance (R (2) = 0.6, P < 0.05) of cSSRs. In summary, our comparative studies of chloroplast genomes illustrate the variable distribution of microsatellites and revealed that chloroplast genome of smaller plants possesses relatively more genomic diversity compared to higher plants.
Collapse
Affiliation(s)
- Biju George
- Blessy Software Solution, Sector 4/441, Malviya Nagar, Jaipur, 302017, Rajasthan, India.
| | - Bhavin S Bhatt
- School of Life Sciences, Central University of Gujarat, Gandhinagar, 382030, Gujarat, India
| | - Mayur Awasthi
- Mahatma Gandhi Chitrakoot Gramodaya Vishwavidhyalaya, Satna, 485334, Madhya Pradesh, India
| | - Binu George
- Blessy Software Solution, Sector 4/441, Malviya Nagar, Jaipur, 302017, Rajasthan, India
| | - Achuit K Singh
- School of Life Sciences, Central University of Gujarat, Gandhinagar, 382030, Gujarat, India.
| |
Collapse
|
15
|
Mashhood Alam C, Sharfuddin C, Ali S. Analysis of Simple and Imperfect Microsatellites in Ebolavirus Species and Other Genomes of Filoviridae Family. ACTA ACUST UNITED AC 2015. [DOI: 10.17795/gct-26404] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|
16
|
Genome wide survey of microsatellites in ssDNA viruses infecting vertebrates. Gene 2014; 552:209-18. [DOI: 10.1016/j.gene.2014.09.032] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2014] [Revised: 08/15/2014] [Accepted: 09/15/2014] [Indexed: 01/26/2023]
|
17
|
The analysis of microsatellites and compound microsatellites in 56 complete genomes of Herpesvirales. Gene 2014; 551:103-9. [DOI: 10.1016/j.gene.2014.08.054] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2014] [Revised: 08/09/2014] [Accepted: 08/26/2014] [Indexed: 01/13/2023]
|
18
|
Alam CM, Singh AK, Sharfuddin C, Ali S. In- silico exploration of thirty alphavirus genomes for analysis of the simple sequence repeats. Meta Gene 2014; 2:694-705. [PMID: 25606453 PMCID: PMC4287844 DOI: 10.1016/j.mgene.2014.09.005] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2014] [Revised: 09/08/2014] [Accepted: 09/10/2014] [Indexed: 11/29/2022] Open
Abstract
The compilation of simple sequence repeats (SSRs) in viruses and its analysis with reference to incidence, distribution and variation would be instrumental in understanding the functional and evolutionary aspects of repeat sequences. Present study encompasses the analysis of SSRs across 30 species of alphaviruses. The full length genome sequences, assessed from NCBI were used for extraction and analysis of repeat sequences using IMEx software. The repeats of different motif sizes (mono- to penta-nucleotide) observed therein exhibited variable incidence across the species. Expectedly, mononucleotide A/T was the most prevalent followed by dinucleotide AG/GA and trinucleotide AAG/GAA in these genomes. The conversion of SSRs to imperfect microsatellite or compound microsatellite (cSSR) is low. cSSR, primarily constituted by variant motifs accounted for up to 12.5% of the SSRs. Interestingly, seven species lacked cSSR in their genomes. However, the SSR and cSSR are predominantly localized to the coding region ORFs for non structural protein and structural proteins. The relative frequencies of different classes of simple and compound microsatellites within and across genomes have been highlighted. This is the first analysis of SSR and cSSR in alphaviruses. We analysed differential frequency and distribution patterns of SSRs and cSSRs. We studied localization of SSR and cSSR in alphaviruses proteomics This study would help in better understanding of evolutionary biology of alphaviruses.
Collapse
Affiliation(s)
| | - Avadhesh Kumar Singh
- Department of Biomedical Sciences, SRCASW, University of Delhi, Vasundhara Enclave, New Delhi 110096, India
| | | | - Safdar Ali
- Department of Biomedical Sciences, SRCASW, University of Delhi, Vasundhara Enclave, New Delhi 110096, India
| |
Collapse
|
19
|
George B, Gnanasekaran P, Jain SK, Chakraborty S. Genome wide survey and analysis of small repetitive sequences in caulimoviruses. INFECTION GENETICS AND EVOLUTION 2014; 27:15-24. [PMID: 24999243 DOI: 10.1016/j.meegid.2014.06.018] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/06/2014] [Revised: 06/01/2014] [Accepted: 06/22/2014] [Indexed: 12/19/2022]
Abstract
Microsatellites are known to exhibit ubiquitous presence across all kingdoms of life including viruses. Members of the Caulimoviridae family severely affect growth of vegetable and fruit plants and reduce economic yield in diverse cropping systems worldwide. Here, we analyzed the nature and distribution of both simple and complex microsatellites present in complete genome of 44 species of Caulimoviridae. Our results showed, in all analyzed genomes, genome size and GC content had a weak influence on number, relative abundance and relative density of microsatellites, respectively. For each genome, mono- and dinucleotide repeats were found to be highly predominant and are overrepresented in genome of majority of caulimoviruses. AT/TA and GAA/AAG/AGA was the most abundant di- and trinucleotide repeat motif, respectively. Repeats larger than trinucleotide were rarely found in these genomes. Comparative study of occurrence, abundance and density of microsatellite among available RNA and DNA viral genomes indicated that simple repeats were least abundant in genomes of caulimoviruses. Polymorphic repeats even though rare were observed in the large intergenic region of the genome, indicating strand slippage and/or unequal recombination processes do occur in caulimoviruses. To our knowledge, this is the first analysis of microsatellites occurring in any dsDNA viral genome. Characterization of such variations in repeat sequences would be important in deciphering the origin, mutational processes, and role of repeat sequences in viral genomes.
Collapse
Affiliation(s)
- Biju George
- Molecular Virology Laboratory, School of Life Sciences, Jawaharlal Nehru University, New Delhi 110067, India
| | - Prabu Gnanasekaran
- Molecular Virology Laboratory, School of Life Sciences, Jawaharlal Nehru University, New Delhi 110067, India
| | - S K Jain
- Department of Biotechnology, Jamia Hamdard University, New Delhi, Delhi 110062, India
| | - Supriya Chakraborty
- Molecular Virology Laboratory, School of Life Sciences, Jawaharlal Nehru University, New Delhi 110067, India.
| |
Collapse
|
20
|
Singh AK, Alam CM, Sharfuddin C, Ali S. Frequency and distribution of simple and compound microsatellites in forty-eight Human papillomavirus (HPV) genomes. INFECTION GENETICS AND EVOLUTION 2014; 24:92-8. [PMID: 24662441 DOI: 10.1016/j.meegid.2014.03.010] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Subscribe] [Scholar Register] [Received: 01/18/2014] [Revised: 03/02/2014] [Accepted: 03/12/2014] [Indexed: 12/14/2022]
Abstract
Simple sequence repeats (SSRs) are tandem-repeated sequences ubiquitously present but differentially distributed across genomes. Present study is a systematic analysis for incidence, composition and complexity of different microsatellites in 48 representative Human papillomavirus (HPV) genomes. The analysis revealed a total of 1868 SSRs and 120 cSSRs. However, four genomes (HPV-60, HPV-92, HPV-112 and HPV-136) lacked any cSSR content; while HPV-31 accounted for a maximum of 10 cSSRs. An overall increase in cSSR% with higher dMAX was observed. The SSRs and cSSRs were prevalent in coding regions. Poly(A/T) repeats were significantly more abundant than poly(G/C) repeats possibly due to high (A/T) content of the HPV genomes. Further, higher prevalence of di-nucleotide repeats over tri-nucleotide repeats may be attributed to instability of former because of higher slippage rate. An in-depth study of the satellite sequences would provide an insight into the imperfections and evolution of microsatellites.
Collapse
Affiliation(s)
- Avadhesh Kumar Singh
- Department of Biomedical Sciences, SRCASW, University of Delhi, Vasundhara Enclave, New Delhi 110096, India
| | | | | | - Safdar Ali
- Department of Biomedical Sciences, SRCASW, University of Delhi, Vasundhara Enclave, New Delhi 110096, India.
| |
Collapse
|
21
|
Alam CM, Singh AK, Sharfuddin C, Ali S. Incidence, complexity and diversity of simple sequence repeats across potexvirus genomes. Gene 2014; 537:189-96. [DOI: 10.1016/j.gene.2014.01.007] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2013] [Revised: 11/15/2013] [Accepted: 01/04/2014] [Indexed: 01/18/2023]
|
22
|
Genome-wide scan for analysis of simple and imperfect microsatellites in diverse carlaviruses. INFECTION GENETICS AND EVOLUTION 2014; 21:287-94. [DOI: 10.1016/j.meegid.2013.11.018] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/19/2013] [Revised: 11/15/2013] [Accepted: 11/21/2013] [Indexed: 01/08/2023]
|
23
|
In-silico analysis of simple and imperfect microsatellites in diverse tobamovirus genomes. Gene 2013; 530:193-200. [DOI: 10.1016/j.gene.2013.08.046] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2013] [Revised: 08/10/2013] [Accepted: 08/13/2013] [Indexed: 11/20/2022]
|
24
|
Alam CM, George B, Sharfuddin C, Jain S, Chakraborty S. Occurrence and analysis of imperfect microsatellites in diverse potyvirus genomes. Gene 2013; 521:238-44. [DOI: 10.1016/j.gene.2013.02.045] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2012] [Revised: 02/14/2013] [Accepted: 02/19/2013] [Indexed: 12/30/2022]
|