1
|
Tang J, Yao D, Zhou H, Du L, Daroch M. Reevaluation of Parasynechococcus-like Strains and Genomic Analysis of Their Microsatellites and Compound Microsatellites. PLANTS (BASEL, SWITZERLAND) 2022; 11:plants11081060. [PMID: 35448788 PMCID: PMC9024877 DOI: 10.3390/plants11081060] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/28/2022] [Revised: 04/06/2022] [Accepted: 04/11/2022] [Indexed: 05/28/2023]
Abstract
Morphologically similar to Synechococcus, a large number of Parasynechococcus strains were misclassified, resulting in extreme underestimation of their genetic diversity. In this study, 80 Synechococcus-like strains were reevaluated using a combination of 16S rRNA phylogeny and genomic approach, identifying 54 strains as Parasynechococcus-like strains and showing considerably intragenus genetic divergence among the subclades identified. Further, bioinformatics analysis disclosed diversified patterns of distribution, abundance, density, and diversity of microsatellites (SSRs) and compound microsatellites (CSSRs) in genomes of these Parasynechococcus-like strains. Variations of SSRs and CSSRs were observed amongst phylotypes and subclades. Both SSRs and CSSRs were in particular unequally distributed among genomes. Dinucleotide SSRs were the most widespread, while the genomes showed two patterns in the second most abundant repeat type (mononucleotide or trinucleotide SSRs). Both SSRs and CSSRs were predominantly observed in coding regions. These two types of microsatellites showed positive correlation with genome size (p < 0.01) but negative correlation with GC content (p < 0.05). Additionally, the motif (A)n, (AG)n and (AGC)n was a major one in the corresponding category. Meanwhile, distinctive motifs of CSSRs were found in 39 genomes. This study characterizes SSRs and CSSRs in genomes of Parasynechococcus-like strains and will be useful as a prerequisite for future studies regarding their distribution, function, and evolution. Moreover, the identified SSRs may facilitate fast acclimation of Parasynechococcus-like strains to fluctuating environments and contribute to the extensive distribution of Parasynechococcus species in global marine environments.
Collapse
Affiliation(s)
- Jie Tang
- School of Food and Bioengineering, Chengdu University, Chengdu 610106, China; (J.T.); (D.Y.); (H.Z.); (L.D.)
| | - Dan Yao
- School of Food and Bioengineering, Chengdu University, Chengdu 610106, China; (J.T.); (D.Y.); (H.Z.); (L.D.)
| | - Huizhen Zhou
- School of Food and Bioengineering, Chengdu University, Chengdu 610106, China; (J.T.); (D.Y.); (H.Z.); (L.D.)
| | - Lianming Du
- School of Food and Bioengineering, Chengdu University, Chengdu 610106, China; (J.T.); (D.Y.); (H.Z.); (L.D.)
| | - Maurycy Daroch
- School of Environment and Energy, Peking University Shenzhen Graduate School, 2199 Lishui Road, Shenzhen 518055, China
| |
Collapse
|
2
|
Meng B, Epp N, Wijaya W, Mrázek J, Hoover TR. Methylation Motifs in Promoter Sequences May Contribute to the Maintenance of a Conserved m5C Methyltransferase in Helicobacter pylori. Microorganisms 2021; 9:microorganisms9122474. [PMID: 34946076 PMCID: PMC8706393 DOI: 10.3390/microorganisms9122474] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2021] [Revised: 11/23/2021] [Accepted: 11/25/2021] [Indexed: 01/22/2023] Open
Abstract
DNA methylomes of Helicobacter pylori strains are complex due to the large number of DNA methyltransferases (MTases) they possess. H. pylori J99 M.Hpy99III is a 5-methylcytosine (m5C) MTase that converts GCGC motifs to Gm5CGC. Homologs of M.Hpy99III are found in essentially all H. pylori strains. Most of these homologs are orphan MTases that lack a cognate restriction endonuclease, and their retention in H. pylori strains suggest they have roles in gene regulation. To address this hypothesis, green fluorescent protein (GFP) reporter genes were constructed with six putative promoters that had a GCGC motif in the extended −10 region, and the expression of the reporter genes was compared in wild-type H. pylori G27 and a mutant lacking the M.Hpy99III homolog (M.HpyGIII). The expression of three of the GFP reporter genes was decreased significantly in the mutant lacking M.HpyGIII. In addition, the growth rate of the H. pylori G27 mutant lacking M.HpyGIII was reduced markedly compared to that of the wild type. These findings suggest that the methylation of the GCGC motif in many H. pylori GCGC-containing promoters is required for the robust expression of genes controlled by these promoters, which may account for the universal retention of M.Hpy99III homologs in H. pylori strains.
Collapse
Affiliation(s)
- Bowen Meng
- Department of Microbiology, University of Georgia, Athens, GA 30602, USA; (B.M.); (N.E.); (W.W.)
| | - Naomi Epp
- Department of Microbiology, University of Georgia, Athens, GA 30602, USA; (B.M.); (N.E.); (W.W.)
| | - Winsen Wijaya
- Department of Microbiology, University of Georgia, Athens, GA 30602, USA; (B.M.); (N.E.); (W.W.)
| | - Jan Mrázek
- Department of Microbiology and Institute of Bioinformatics, University of Georgia, Athens, GA 30602, USA;
| | - Timothy R. Hoover
- Department of Microbiology, University of Georgia, Athens, GA 30602, USA; (B.M.); (N.E.); (W.W.)
- Correspondence: ; Tel.: +1-706-542-2675
| |
Collapse
|
3
|
Yao D, Cheng L, Du L, Li M, Daroch M, Tang J. Genome-Wide Investigation and Analysis of Microsatellites and Compound Microsatellites in Leptolyngbya-like Species, Cyanobacteria. Life (Basel) 2021; 11:life11111258. [PMID: 34833134 PMCID: PMC8619395 DOI: 10.3390/life11111258] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2021] [Revised: 11/03/2021] [Accepted: 11/16/2021] [Indexed: 01/12/2023] Open
Abstract
Microsatellites (simple sequence repeats, SSRs) are ubiquitously distributed in almost all known genomes. Here, the first investigation was designed to examine the SSRs and compound microsatellites (CSSRs) in genomes of Leptolyngbya-like strains. The results disclosed diversified patterns of distribution, abundance, density, and diversity of SSRs and CSSRs in genomes, indicating that they may be subject to rapid evolutionary change. The numbers of SSRs and CSSRs were extremely unevenly distributed among genomes, ranging from 11,086 to 24,000 and from 580 to 1865, respectively. Dinucleotide SSRs were the most abundant category in 31 genomes, while the other 15 genomes followed the pattern: mono- > di- > trinucleotide SSRs. The patterns related to SSRs and CSSRs showed differences among phylogenetic groups. Both SSRs and CSSRs were overwhelmingly distributed in coding regions. The numbers of SSRs and CSSRs were significantly positively correlated with genome size (p < 0.01) and negatively correlated with GC content (p < 0.05). Moreover, the motif (A/C)n and (AG)n was predominant in mononucleotide and dinucleotide SSRs, and unique motifs of CSSRs were identified in 39 genomes. This study provides the first insight into SSRs and CSSRs in genomes of Leptolyngbya-like strains and will be useful to understanding their distribution, predicting their function, and tracking their evolution. Additionally, the identified SSRs may provide an evolutionary advantage of fast adaptation to environmental changes and may play an important role in the cosmopolitan distribution of Leptolyngbya strains to globally diverse niches.
Collapse
Affiliation(s)
- Dan Yao
- Key Laboratory of Coarse Cereal Processing, Ministry of Agriculture and Rural Affairs, Chengdu University, Chengdu 610106, China; (D.Y.); (L.D.)
| | - Lei Cheng
- Beijing Engineering and Technology Research Center of Food Additives, Beijing Technology & Business University, Beijing 100048, China;
| | - Lianming Du
- Key Laboratory of Coarse Cereal Processing, Ministry of Agriculture and Rural Affairs, Chengdu University, Chengdu 610106, China; (D.Y.); (L.D.)
| | - Meijin Li
- School of Environment and Energy, Peking University Shenzhen Graduate School, Shenzhen 518055, China; (M.L.); (M.D.)
| | - Maurycy Daroch
- School of Environment and Energy, Peking University Shenzhen Graduate School, Shenzhen 518055, China; (M.L.); (M.D.)
| | - Jie Tang
- Key Laboratory of Coarse Cereal Processing, Ministry of Agriculture and Rural Affairs, Chengdu University, Chengdu 610106, China; (D.Y.); (L.D.)
- Correspondence: ; Tel.: +86-028-84616063
| |
Collapse
|
4
|
An empirical analysis of mtSSRs: could microsatellite distribution patterns explain the evolution of mitogenomes in plants? Funct Integr Genomics 2021; 22:35-53. [PMID: 34751851 DOI: 10.1007/s10142-021-00815-7] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2021] [Revised: 10/18/2021] [Accepted: 10/19/2021] [Indexed: 10/19/2022]
Abstract
Microsatellites (SSRs) are tandem repeat sequences in eukaryote genomes, including plant cytoplasmic genomes. The mitochondrial genome (mtDNA) has been shown to vary in size, number, and distribution of SSRs among different plant groups. Thus, SSRs contribute with genomic diversity in mtDNAs. However, the abundance, distribution, and evolutionary significance of SSRs in mtDNA from a wide range of algae and plants have not been explored. In this study, the mtDNAs of 204 plant and algal species were investigated related to the presence of SSRs. The number of SSRs was positively correlated with genome size. Its distribution is dependent on plant and algal groups analyzed, although the cluster analysis indicates the conservation of some common motifs in algal and terrestrial plants that reflect common ancestry of groups. Many SSRs in coding and non-coding regions can be useful for molecular markers. Moreover, mitochondrial SSRs are highly abundant, representing an important source for natural or induced genetic variation, i.e., for biotechnological approaches that can modulate mtDNA gene regulation. Thus, this comparative study increases the understanding of the plant and algal SSR evolution and brings perspectives for further studies.
Collapse
|
5
|
Merker Breyer G, Malvessi Cattani A, Silveira Schrank I, Maboni Siqueira F. The influence of regulatory elements on Mycoplasma hyopneumoniae 7448 transcriptional response during oxidative stress and heat shock. Mol Biol Rep 2021; 49:139-147. [PMID: 34676505 DOI: 10.1007/s11033-021-06851-7] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2021] [Accepted: 10/15/2021] [Indexed: 11/26/2022]
Abstract
BACKGROUND The comprehension of genome organization and gene modulation is essential for understanding pathogens' infection mechanisms. Mycoplasma hyopneumoniae 7448 genome is organized in transcriptional units (TUs), which are flanked by regulatory elements such as putative promoters, terminators and repetitive sequences. Yet the relationship between the presence of these elements and bacterial responses during stress conditions remains unclear. Thus, in this study, in silico and RT-qPCR analyses were associated to determine the effect of regulatory elements in gene expression regulation upon heat shock and oxidative stress conditions. METHODS AND RESULTS Thirteen TU's organizational profiles were found based on promoters and terminators distribution. Differential expression in genes sharing the same TUs was observed, suggesting the activity of internal regulatory elements. Moreover, 88.8% of tested genes were differentially expressed under oxidative stress in comparison to the control condition, being 81.3% of them surrounded by their own regulatory elements. Similarly, under heat shock, 44.4% of the genes showed regulation when compared to control condition, being 75.0% of them surrounded by their own regulatory elements. CONCLUSIONS Altogether, this data suggests the activity of internal regulatory elements in gene modulation of M. hyopneumoniae 7448 transcription.
Collapse
Affiliation(s)
- Gabriela Merker Breyer
- Laboratory of Veterinary Bacteriology, Veterinary Pathology Department, Universidade Federal do Rio Grande do Sul, Porto Alegre, RS, Brazil
| | - Amanda Malvessi Cattani
- Graduate Program in Cell and Molecular Biology, Biotechnology Center, Universidade Federal do Rio Grande do Sul, Porto Alegre, RS, Brazil
| | - Irene Silveira Schrank
- Graduate Program in Cell and Molecular Biology, Biotechnology Center, Universidade Federal do Rio Grande do Sul, Porto Alegre, RS, Brazil
| | - Franciele Maboni Siqueira
- Laboratory of Veterinary Bacteriology, Veterinary Pathology Department, Universidade Federal do Rio Grande do Sul, Porto Alegre, RS, Brazil.
| |
Collapse
|
6
|
Zhang H, Li D, Zhao X, Pan S, Wu X, Peng S, Huang H, Shi R, Tan Z. Relatively semi-conservative replication and a folded slippage model for short tandem repeats. BMC Genomics 2020; 21:563. [PMID: 32807079 PMCID: PMC7430839 DOI: 10.1186/s12864-020-06949-5] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2020] [Accepted: 07/27/2020] [Indexed: 12/11/2022] Open
Abstract
Background The ubiquitous presence of short tandem repeats (STRs) in virtually all genomes implicates their functional relevance, while a widely-accepted definition of STR is yet to be established. Previous studies majorly focus on relatively longer STRs, while shorter repeats were generally excluded. Herein, we have adopted a more generous criteria to define shorter repeats, which has led to the definition of a much larger number of STRs that lack prior analysis. Using this definition, we analyzed the short repeats in 55 randomly selected segments in 55 randomly selected genomic sequences from a fairly wide range of species covering animals, plants, fungi, protozoa, bacteria, archaea and viruses. Results Our analysis reveals a high percentage of short repeats in all 55 randomly selected segments, indicating that the universal presence of high-content short repeats could be a common characteristic of genomes across all biological kingdoms. Therefore, it is reasonable to assume a mechanism for continuous production of repeats that can make the replicating process relatively semi-conservative. We have proposed a folded replication slippage model that considers the geometric space of nucleotides and hydrogen bond stability to explain the mechanism more explicitly, with improving the existing straight-line slippage model. The folded slippage model can explain the expansion and contraction of mono- to hexa- nucleotide repeats with proper folding angles. Analysis of external forces in the folding template strands also suggests that expansion exists more commonly than contraction in the short tandem repeats. Conclusion The folded replication slippage model provides a reasonable explanation for the continuous occurrences of simple sequence repeats in genomes. This model also contributes to the explanation of STR-to-genome evolution and is an alternative model that complements semi-conservative replication.
Collapse
Affiliation(s)
- Hongxi Zhang
- Bioinformatics Center, College of Biology, Hunan University, Changsha, 410082, China
| | - Douyue Li
- Bioinformatics Center, College of Biology, Hunan University, Changsha, 410082, China
| | - Xiangyan Zhao
- Bioinformatics Center, College of Biology, Hunan University, Changsha, 410082, China
| | - Saichao Pan
- Bioinformatics Center, College of Biology, Hunan University, Changsha, 410082, China
| | - Xiaolong Wu
- Bioinformatics Center, College of Biology, Hunan University, Changsha, 410082, China
| | - Shan Peng
- Bioinformatics Center, College of Biology, Hunan University, Changsha, 410082, China
| | - Hanrou Huang
- Bioinformatics Center, College of Biology, Hunan University, Changsha, 410082, China
| | - Ruixue Shi
- Bioinformatics Center, College of Biology, Hunan University, Changsha, 410082, China
| | - Zhongyang Tan
- Bioinformatics Center, College of Biology, Hunan University, Changsha, 410082, China.
| |
Collapse
|
7
|
Structures and stability of simple DNA repeats from bacteria. Biochem J 2020; 477:325-339. [PMID: 31967649 PMCID: PMC7015867 DOI: 10.1042/bcj20190703] [Citation(s) in RCA: 23] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2019] [Revised: 12/20/2019] [Accepted: 01/03/2020] [Indexed: 01/12/2023]
Abstract
DNA is a fundamentally important molecule for all cellular organisms due to its biological role as the store of hereditary, genetic information. On the one hand, genomic DNA is very stable, both in chemical and biological contexts, and this assists its genetic functions. On the other hand, it is also a dynamic molecule, and constant changes in its structure and sequence drive many biological processes, including adaptation and evolution of organisms. DNA genomes contain significant amounts of repetitive sequences, which have divergent functions in the complex processes that involve DNA, including replication, recombination, repair, and transcription. Through their involvement in these processes, repetitive DNA sequences influence the genetic instability and evolution of DNA molecules and they are located non-randomly in all genomes. Mechanisms that influence such genetic instability have been studied in many organisms, including within human genomes where they are linked to various human diseases. Here, we review our understanding of short, simple DNA repeats across a diverse range of bacteria, comparing the prevalence of repetitive DNA sequences in different genomes. We describe the range of DNA structures that have been observed in such repeats, focusing on their propensity to form local, non-B-DNA structures. Finally, we discuss the biological significance of such unusual DNA structures and relate this to studies where the impacts of DNA metabolism on genetic stability are linked to human diseases. Overall, we show that simple DNA repeats in bacteria serve as excellent and tractable experimental models for biochemical studies of their cellular functions and influences.
Collapse
|
8
|
Bosshard L, Peischl S, Ackermann M, Excoffier L. Mutational and Selective Processes Involved in Evolution during Bacterial Range Expansions. Mol Biol Evol 2020; 36:2313-2327. [PMID: 31241150 DOI: 10.1093/molbev/msz148] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open
Abstract
Bacterial populations have been shown to accumulate deleterious mutations during spatial expansions that overall decrease their fitness and ability to grow. However, it is unclear if and how they can respond to selection in face of this mutation load. We examine here if artificial selection can counteract the negative effects of range expansions. We examined the molecular evolution of 20 mutator lines selected for fast expansions (SEL) and compared them to 20 other mutator lines freely expanding without artificial selection (CONTROL). We find that the colony size of all 20 SEL lines have increased relative to the ancestral lines, unlike CONTROL lines, showing that enough beneficial mutations are produced during spatial expansions to counteract the negative effect of expansion load. Importantly, SEL and CONTROL lines have similar numbers of mutations indicating that they evolved for the same number of generations and that increased fitness is not due to a purging of deleterious mutations. We find that loss of function mutations better explain the increased colony size of SEL lines than nonsynonymous mutations or a combination of the two. Interestingly, most loss of function mutations are found in simple sequence repeats (SSRs) located in genes involved in gene regulation and gene expression. We postulate that such potentially reversible mutations could play a major role in the rapid adaptation of bacteria to changing environmental conditions by shutting down expensive genes and adjusting gene expression.
Collapse
Affiliation(s)
- Lars Bosshard
- CMPG, Institute of Ecology an Evolution, University of Berne, Berne, Switzerland.,Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Stephan Peischl
- Swiss Institute of Bioinformatics, Lausanne, Switzerland.,Interfaculty Bioinformatics Unit, University of Berne, Berne, Switzerland
| | - Martin Ackermann
- Institute of Biogeochemistry and Pollutant Dynamics, Swiss Federal Institute of Technology Zurich (ETH Zürich), Zürich, Switzerland.,Department of Environmental Microbiology, Swiss Federal Institute of Aquatic Science and Technology (Eawag), Dübendorf, Switzerland
| | - Laurent Excoffier
- CMPG, Institute of Ecology an Evolution, University of Berne, Berne, Switzerland.,Swiss Institute of Bioinformatics, Lausanne, Switzerland
| |
Collapse
|
9
|
Mahfooz S, Srivastava A, Yadav MC, Tahoor A. Comparative genomics in phytopathogenic prokaryotes reveals the higher relative abundance and density of long-SSRs in the smallest prokaryotic genome. 3 Biotech 2019; 9:340. [PMID: 31478033 DOI: 10.1007/s13205-019-1872-8] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2018] [Accepted: 08/13/2019] [Indexed: 10/26/2022] Open
Abstract
Frequency and distribution of long-SSRs were studied in 18 phytopathogenic prokaryotes. Higher relative abundance of the long-SSRs was observed in phytopathogenic prokaryotes when compared to non-pathogenic control. The frequency of these SSRs was positively correlated with size and GC content of the genomes of phytopathogenic prokaryotes. Interestingly, phytopathogens with higher GC content in the genome were found to posses longer repeat motifs of SSRs, whereas those having lesser GC content were harbouring shorter repeat motifs. Higher abundance of tri- and hexa-nucleotide repeat motifs were the characteristic of actinomycetes, where as higher abundance of mono- and tetra-nucleotide repeats were the characteristic of the mollicutes. The maximum relative abundance and relative density of SSR were found in the smallest genome of host-adapted pathogen Aster yellow, however, length of microsatellite repeat units was the least. On the basis of presence of SSRs in the housekeeping genes, a phylogenetic relationship between these phytopathogenic prokaryotes was deduced and compared with the phylogeny developed based on 16S ribosomal RNA gene.
Collapse
|
10
|
Ledenyova ML, Tkachenko GA, Shpak IM. Imperfect and Compound Microsatellites in the Genomes of Burkholderia pseudomallei Strains. Mol Biol 2019. [DOI: 10.1134/s0026893319010084] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
11
|
Alam CM, Iqbal A, Sharma A, Schulman AH, Ali S. Microsatellite Diversity, Complexity, and Host Range of Mycobacteriophage Genomes of the Siphoviridae Family. Front Genet 2019; 10:207. [PMID: 30923537 PMCID: PMC6426759 DOI: 10.3389/fgene.2019.00207] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2018] [Accepted: 02/26/2019] [Indexed: 01/21/2023] Open
Abstract
The incidence, distribution, and variation of simple sequence repeats (SSRs) in viruses is instrumental in understanding the functional and evolutionary aspects of repeat sequences. Full-length genome sequences retrieved from NCBI were used for extraction and analysis of repeat sequences using IMEx software. We have also developed two MATLAB-based tools for extraction of gene locations from GenBank in tabular format and simulation of this data with SSR incidence data. Present study encompassing 147 Mycobacteriophage genomes revealed 25,284 SSRs and 1,127 compound SSRs (cSSRs) through IMEx. Mono- to hexa-nucleotide motifs were present. The SSR count per genome ranged from 78 (M100) to 342 (M58) while cSSRs incidence ranged from 1 (M138) to 17 (M28, M73). Though cSSRs were present in all the genomes, their frequency and SSR to cSSR conversion percentage varied from 1.08 (M138 with 93 SSRs) to 8.33 (M116 with 96 SSRs). In terms of localization, the SSRs were predominantly localized to coding regions (∼78%). Interestingly, genomes of around 50 kb contained a similar number of SSRs/cSSRs to that in a 110 kb genome, suggesting functional relevance for SSRs which was substantiated by variation in motif constitution between species with different host range. The three species with broad host range (M97, M100, M116) have around 90% of their mono-nucleotide repeat motifs composed of G or C and only M16 has both A and T mononucleotide motifs. Around 20% of the di-nucleotide repeat motifs in the genomes exhibiting a broad host range were CT/TC, which were either absent or represented to a much lesser extent in the other genomes.
Collapse
Affiliation(s)
- Chaudhary Mashhood Alam
- Luke/BI Plant Genome Dynamics Lab, Institute of Biotechnology and Viikki Plant Science Centre, University of Helsinki, Helsinki, Finland.,Ingenious e-Brain Solutions, Gurugram, India
| | - Asif Iqbal
- PIRO Technologies Private Limited, New Delhi, India
| | - Anjana Sharma
- Department of Biomedical Sciences, SRCASW, University of Delhi, New Delhi, India
| | - Alan H Schulman
- Luke/BI Plant Genome Dynamics Lab, Institute of Biotechnology and Viikki Plant Science Centre, University of Helsinki, Helsinki, Finland.,Natural Resources Institute Finland (Luke), Helsinki, Finland
| | - Safdar Ali
- Department of Biomedical Sciences, SRCASW, University of Delhi, New Delhi, India.,Department of Biological Sciences, Aliah University, Kolkata, India
| |
Collapse
|
12
|
Mrázek J, Karls AC. In silico simulations of occurrence of transcription factor binding sites in bacterial genomes. BMC Evol Biol 2019; 19:67. [PMID: 30823869 PMCID: PMC6397444 DOI: 10.1186/s12862-019-1381-8] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2018] [Accepted: 02/01/2019] [Indexed: 11/16/2022] Open
Abstract
Background Interactions between transcription factors and their specific binding sites are a key component of regulation of gene expression. Until recently, it was generally assumed that most bacterial transcription factor binding sites are located at or near promoters. However, several recent works utilizing high-throughput technology to detect transcription factor binding sites in bacterial genomes found a large number of binding sites in unexpected locations, particularly inside genes, as opposed to known or expected promoter regions. While some of these intragenic binding sites likely have regulatory functions, an alternative scenario is that many of these binding sites arise by chance in the absence of selective constraints. The latter possibility was supported by in silico simulations for σ54 binding sites in Salmonella. Results In this work, we extend these simulations to more than forty transcription factors from E. coli and other bacteria. The results suggest that binding sites for all analyzed transcription factors are likely to arise throughout the genome by random genetic drift and many transcription factor binding sites found in genomes may not have specific regulatory functions. In addition, when comparing observed and expected patterns of occurrence of binding sites in genomes, we observed distinct differences among different transcription factors. Conclusions We speculate that transcription factor binding sites randomly occurring throughout the genome could be beneficial in promoting emergence of new regulatory interactions and thus facilitating evolution of gene regulatory networks. Electronic supplementary material The online version of this article (10.1186/s12862-019-1381-8) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Jan Mrázek
- Department of Microbiology, University of Georgia, Athens, GA, USA. .,Institute of Bioinformatics, University of Georgia, Athens, GA, USA.
| | - Anna C Karls
- Department of Microbiology, University of Georgia, Athens, GA, USA
| |
Collapse
|
13
|
Orlov M, Garanina I, Fisunov GY, Sorokin A. Comparative Analysis of Mycoplasma gallisepticum vlhA Promoters. Front Genet 2018; 9:569. [PMID: 30519256 PMCID: PMC6258824 DOI: 10.3389/fgene.2018.00569] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2018] [Accepted: 11/06/2018] [Indexed: 12/15/2022] Open
Abstract
Mycoplasma gallisepticum is an intracellular parasite affecting respiratory tract of poultry that belongs to class Mollicutes. M. gallisepticum features numerous variable lipoprotein hemagglutinin genes (vlhA) that play a role in immune escape. The vlhA promoters have a set of distinct properties in comparison to promoters of the other genes. The vlhA promoters carry a variable GAA repeats region at approximately 40 nts upstream of transcription start site. The promoters have been considered active only in the presence of exactly 12 GAA repeats. The mechanisms of vlhA expression regulation and GAA number variation are not described. Here we tried to understand these mechanisms using different computational methods. We conducted a comparative analysis among several M. gallisepticum strains. Nucleotide sequences analysis showed the presence of highly conserved regions flanking repeated trinucleotides that are not linked to GAA number variation. VlhA genes with 12 GAA repeats and their orthologs in 12 M. gallisepticum strains are more conserved than other vlhA genes and have narrower GAA number distribution. We conducted comparative analysis of physicochemical profiles of M. gallisepticum vlhA and sigma-70 promoters. Stress-induced duplex destabilization (SIDD) profiles showed that sigma-70 group is characterized by the common to prokaryotic promoters sharp maxima while vlhA promoters are hardly destabilized with the region between GAA repeats and transcription start site having zero opening probability. Electrostatic potential profiles of vlhA promoters indicate the presence of the distinct patterns that appear to govern initial stages of specific DNA-protein recognition. Open state dynamics profiles of vlhA demonstrate the pattern that might facilitate transcription bubble formation. Obtained data could be the basis for experimental identification of mechanisms of phase variation in M. gallisepticum.
Collapse
Affiliation(s)
- Mikhail Orlov
- Institute of Cell Biophysics, Russian Academy of Sciences, Pushchino, Russia
| | - Irina Garanina
- Federal Research and Clinical Center of Physical-Chemical Medicine, Federal Medical-Biological Agency, Moscow, Russia
| | - Gleb Y Fisunov
- Federal Research and Clinical Center of Physical-Chemical Medicine, Federal Medical-Biological Agency, Moscow, Russia
| | - Anatoly Sorokin
- Institute of Cell Biophysics, Russian Academy of Sciences, Pushchino, Russia
| |
Collapse
|
14
|
Miravet-Verde S, Lloréns-Rico V, Serrano L. Alternative transcriptional regulation in genome-reduced bacteria. Curr Opin Microbiol 2017; 39:89-95. [PMID: 29154025 DOI: 10.1016/j.mib.2017.10.022] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2017] [Accepted: 10/26/2017] [Indexed: 12/11/2022]
Abstract
Transcription is a core process of bacterial physiology, and as such it must be tightly controlled, so that bacterial cells maintain steady levels of each RNA molecule in homeostasis and modify them in response to perturbations. The major regulators of transcription in bacteria (and in eukaryotes) are transcription factors. However, in genome-reduced bacteria, the limited number of these proteins is insufficient to explain the variety of responses shown upon changes in their environment. Thus, alternative regulators may play a central role in orchestrating RNA levels in these microorganisms. These alternative mechanisms rely on intrinsic features within DNA and RNA molecules, suggesting they are ancestral mechanisms shared among bacteria that could have an increased relevance on transcriptional regulation in minimal cells. In this review, we summarize the alternative elements that can regulate transcript abundance in genome-reduced bacteria and how they contribute to the RNA homeostasis at different levels.
Collapse
Affiliation(s)
- Samuel Miravet-Verde
- EMBL/CRG Systems Biology Research Unit, Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr. Aiguader 88, 08003 Barcelona, Spain; Universitat Pompeu Fabra (UPF), Dr. Aiguader 88, 08003 Barcelona, Spain
| | - Verónica Lloréns-Rico
- EMBL/CRG Systems Biology Research Unit, Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr. Aiguader 88, 08003 Barcelona, Spain; Universitat Pompeu Fabra (UPF), Dr. Aiguader 88, 08003 Barcelona, Spain
| | - Luis Serrano
- EMBL/CRG Systems Biology Research Unit, Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr. Aiguader 88, 08003 Barcelona, Spain; Universitat Pompeu Fabra (UPF), Dr. Aiguader 88, 08003 Barcelona, Spain; Institució Catalana de Recerca i Estudis Avançats (ICREA), Pg. Lluis Companys 23, 08010 Barcelona, Spain.
| |
Collapse
|
15
|
Raman G, Park V, Kwak M, Lee B, Park S. Characterization of the complete chloroplast genome of Arabis stellari and comparisons with related species. PLoS One 2017; 12:e0183197. [PMID: 28809950 PMCID: PMC5557495 DOI: 10.1371/journal.pone.0183197] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2017] [Accepted: 07/31/2017] [Indexed: 01/25/2023] Open
Abstract
Arabis stellari var. japonica is an ornamental plant of the Brassicaceae family, and is widely distributed in South Korea. However, no information is available about its molecular biology and no genomic study has been performed on A. stellari. In this paper, the authors report the complete chloroplast genome sequence of A. stellari. The plastome of A. stellari was 153,683 bp in length with 36.4% GC and included a pair of inverted repeats (IRs) of 26,423 bp that separated a large single-copy (LSC) region of 82,807 bp and a small single-copy (SSC) region of 18,030 bp. It was also found to contain 113 unique genes, of which 79 were protein-coding genes, 30 were transfer RNAs, and four were ribosomal RNAs. The gene content and organization of the A. stellari chloroplast genome were similar to those of other Brassicaceae genomes except for the absence of the rps16 protein-coding gene. A total of 991 SSRs were identified in the genome. The chloroplast genome of A. stellari was compared with closely related species of the Brassicaceae family. Comparative analysis showed a minor divergence occurred in the protein-coding matK, ycf1, ccsA, accD and rpl22 genes and that the KA/KS nucleotide substitution ratio of the ndhA genes of A. stellari and A. hirsuta was 1.35135. The genes infA and rps16 were absent in the Arabis genus and phylogenetic evolutionary studies revealed that these genes evolved independently. However, phylogenetic analysis showed that the positions of Brassicaceae species are highly conserved. The present study provides A. stellari genomic information that may be found useful in conservation and molecular phylogenetic studies on Brassicaceae.
Collapse
Affiliation(s)
- Gurusamy Raman
- Department of Life Sciences, Yeungnam University, Gyeongsan, Gyeongsan-buk, Republic of Korea
| | - Veronica Park
- Mcneil high school, Austin, Texas, United States of America
| | - Myounghai Kwak
- Plant Resources Division, National Institute of Biological Resources of Korea, Incheon, Republic of Korea
| | - Byoungyoon Lee
- Plant Resources Division, National Institute of Biological Resources of Korea, Incheon, Republic of Korea
| | - SeonJoo Park
- Department of Life Sciences, Yeungnam University, Gyeongsan, Gyeongsan-buk, Republic of Korea
- * E-mail:
| |
Collapse
|
16
|
Novel DNA Binding and Regulatory Activities for σ 54 (RpoN) in Salmonella enterica Serovar Typhimurium 14028s. J Bacteriol 2017; 199:JB.00816-16. [PMID: 28373272 DOI: 10.1128/jb.00816-16] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2016] [Accepted: 03/27/2017] [Indexed: 01/13/2023] Open
Abstract
The variable sigma (σ) subunit of the bacterial RNA polymerase (RNAP) holoenzyme, which is responsible for promoter specificity and open complex formation, plays a strategic role in the response to environmental changes. Salmonella enterica serovar Typhimurium utilizes the housekeeping σ70 and five alternative sigma factors, including σ54 The σ54-RNAP differs from other σ-RNAP holoenzymes in that it forms a stable closed complex with the promoter and requires ATP hydrolysis by an activated cognate bacterial enhancer binding protein (bEBP) to transition to an open complex and initiate transcription. In S. Typhimurium, σ54-dependent promoters normally respond to one of 13 different bEBPs, each of which is activated under a specific growth condition. Here, we utilized a constitutively active, promiscuous bEBP to perform a genome-wide identification of σ54-RNAP DNA binding sites and the transcriptome of the σ54 regulon of S. Typhimurium. The position and context of many of the identified σ54 RNAP DNA binding sites suggest regulatory roles for σ54-RNAP that connect the σ54 regulon to regulons of other σ factors to provide a dynamic response to rapidly changing environmental conditions.IMPORTANCE The alternative sigma factor σ54 (RpoN) is required for expression of genes involved in processes with significance in agriculture, bioenergy production, bioremediation, and host-microbe interactions. The characterization of the σ54 regulon of the versatile pathogen S. Typhimurium has expanded our understanding of the scope of the σ54 regulon and how it links to other σ regulons within the complex regulatory network for gene expression in bacteria.
Collapse
|
17
|
Cattani AM, Siqueira FM, Guedes RLM, Schrank IS. Repetitive Elements in Mycoplasma hyopneumoniae Transcriptional Regulation. PLoS One 2016; 11:e0168626. [PMID: 28005945 PMCID: PMC5179023 DOI: 10.1371/journal.pone.0168626] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2016] [Accepted: 12/02/2016] [Indexed: 12/15/2022] Open
Abstract
Transcriptional regulation, a multiple-step process, is still poorly understood in the important pig pathogen Mycoplasma hyopneumoniae. Basic motifs like promoters and terminators have already been described, but no other cis-regulatory elements have been found. DNA repeat sequences have been shown to be an interesting potential source of cis-regulatory elements. In this work, a genome-wide search for tandem and palindromic repetitive elements was performed in the intergenic regions of all coding sequences from M. hyopneumoniae strain 7448. Computational analysis demonstrated the presence of 144 tandem repeats and 1,171 palindromic elements. The DNA repeat sequences were distributed within the 5' upstream regions of 86% of transcriptional units of M. hyopneumoniae strain 7448. Comparative analysis between distinct repetitive sequences found in related mycoplasma genomes demonstrated different percentages of conservation among pathogenic and nonpathogenic strains. qPCR assays revealed differential expression among genes showing variable numbers of repetitive elements. In addition, repeats found in 206 genes already described to be differentially regulated under different culture conditions of M. hyopneumoniae strain 232 showed almost 80% conservation in relation to M. hyopneumoniae strain 7448 repeats. Altogether, these findings suggest a potential regulatory role of tandem and palindromic DNA repeats in the M. hyopneumoniae transcriptional profile.
Collapse
Affiliation(s)
- Amanda Malvessi Cattani
- Centro de Biotecnologia, Programa de Pós-Graduação em Biologia Celular e Molecular, Universidade Federal do Rio Grande do Sul (UFRGS), Porto Alegre, Rio Grande do Sul, Brazil
| | - Franciele Maboni Siqueira
- Centro de Biotecnologia, Programa de Pós-Graduação em Biologia Celular e Molecular, Universidade Federal do Rio Grande do Sul (UFRGS), Porto Alegre, Rio Grande do Sul, Brazil
| | - Rafael Lucas Muniz Guedes
- Laboratório de Bioinformática, Laboratório Nacional de Computação Científica (LNCC), Petrópolis, Rio de Janeiro, Brazil
| | - Irene Silveira Schrank
- Centro de Biotecnologia, Programa de Pós-Graduação em Biologia Celular e Molecular, Universidade Federal do Rio Grande do Sul (UFRGS), Porto Alegre, Rio Grande do Sul, Brazil
- Centro de Biotecnologia, Departamento de Biologia Molecular e Biotecnologia, Instituto de Biociências, Universidade Federal do Rio Grande do Sul (UFRGS), Porto Alegre, Rio Grande do Sul, Brazil
- * E-mail:
| |
Collapse
|
18
|
Zhang J, Song X, Ma MJ, Xiao L, Kenri T, Sun H, Ptacek T, Li S, Waites KB, Atkinson TP, Shibayama K, Dybvig K, Feng Y. Inter- and intra-strain variability of tandem repeats in Mycoplasma pneumoniae based on next-generation sequencing data. Future Microbiol 2016; 12:119-129. [PMID: 27728978 DOI: 10.2217/fmb-2016-0111] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022] Open
Abstract
AIM To characterize inter- and intra-strain variability of variable-number tandem repeats (VNTRs) in Mycoplasma pneumoniae to determine the optimal multilocus VNTR analysis scheme for improved strain typing. METHODS Whole genome assemblies and next-generation sequencing data from diverse M. pneumoniae isolates were used to characterize VNTRs and their variability, and to compare the strain discriminability of new VNTR and existing markers. RESULTS We identified 13 VNTRs including five reported previously. These VNTRs displayed different levels of inter- and intra-strain copy number variations. All new markers showed similar or higher discriminability compared with existing VNTR markers and the P1 typing system. CONCLUSION Our study provides novel insights into VNTR variations and potential new multilocus VNTR analysis schemes for improved genotyping of M. pneumoniae.
Collapse
Affiliation(s)
- Jing Zhang
- Department of Pathogenic Biology, Chongqing Medical University, Chongqing, China.,Critical Care Medicine Department, NIH Clinical Center, National Institutes of Health, Bethesda, MD, USA
| | - Xiaohong Song
- Critical Care Medicine Department, NIH Clinical Center, National Institutes of Health, Bethesda, MD, USA.,Department of Pharmacology, Sichuan University, Chengdu, Sichuan, China
| | | | - Li Xiao
- Department of Medicine, University of Alabama at Birmingham, AL, USA
| | - Tsuyoshi Kenri
- Department of Bacteriology II, National Institute of Infectious Diseases, Musashimurayama, Tokyo, Japan
| | - Hongmei Sun
- Department of Bacteriology, Capital Institute of Pediatrics, Beijing, China
| | - Travis Ptacek
- Department of Microbiology, University of Alabama at Birmingham, AL, USA
| | - Shaoli Li
- Department of Bacteriology, Capital Institute of Pediatrics, Beijing, China
| | - Ken B Waites
- Department of Pathology, University of Alabama at Birmingham, AL, USA
| | | | - Keigo Shibayama
- Department of Bacteriology II, National Institute of Infectious Diseases, Musashimurayama, Tokyo, Japan
| | - Kevin Dybvig
- Department of Microbiology, University of Alabama at Birmingham, AL, USA
| | - Yanmei Feng
- Department of Pulmonary Medicine, The First Affiliated Hospital, Chongqing Medical University, Chongqing, China
| |
Collapse
|
19
|
Defying Muller's Ratchet: Ancient Heritable Endobacteria Escape Extinction through Retention of Recombination and Genome Plasticity. mBio 2016; 7:mBio.02057-15. [PMID: 27329757 PMCID: PMC4916391 DOI: 10.1128/mbio.02057-15] [Citation(s) in RCA: 33] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Abstract
Heritable endobacteria, which are transmitted from one host generation to the next, are subjected to evolutionary forces that are different from those experienced by free-living bacteria. In particular, they suffer consequences of Muller’s ratchet, a mechanism that leads to extinction of small asexual populations due to fixation of slightly deleterious mutations combined with the random loss of the most-fit genotypes, which cannot be recreated without recombination. Mycoplasma-related endobacteria (MRE) are heritable symbionts of fungi from two ancient lineages, Glomeromycota (arbuscular mycorrhizal fungi) and Mucoromycotina. Previous studies revealed that MRE maintain unusually diverse populations inside their hosts and may have been associated with fungi already in the early Paleozoic. Here we show that MRE are vulnerable to genomic degeneration and propose that they defy Muller’s ratchet thanks to retention of recombination and genome plasticity. We suggest that other endobacteria may be capable of raising similar defenses against Muller’s ratchet.
Collapse
|
20
|
Survey and analysis of simple sequence repeats (SSRs) in three genomes of Candida species. Gene 2016; 584:129-35. [PMID: 26883055 DOI: 10.1016/j.gene.2016.02.018] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2015] [Revised: 01/15/2016] [Accepted: 02/12/2016] [Indexed: 11/23/2022]
Abstract
Simple sequence repeats (SSRs) or microsatellites, which composed of tandem repeated short units of 1-6 bp, have been paying attention continuously. Here, the distribution, composition and polymorphism of microsatellites and compound microsatellites were analyzed in three available genomes of Candida species (Candida dubliniensis, Candida glabrata and Candida orthopsilosis). The results show that there were 118,047, 66,259 and 61,119 microsatellites in genomes of C. dubliniensis, C. glabrata and C. orthopsilosis, respectively. The SSRs covered more than 1/3 length of genomes in the three species. The microsatellites, which just consist of bases A and (or) T, such as (A)n, (T)n, (AT)n, (TA)n, (AAT)n, (TAA)n, (TTA)n, (ATA)n, (ATT)n and (TAT)n, were predominant in the three genomes. The length of microsatellites was focused on 6 bp and 9 bp either in the three genomes or in its coding sequences. What's more, the relative abundance (19.89/kbp) and relative density (167.87 bp/kbp) of SSRs in sequence of mitochondrion of C. glabrata were significantly great than that in any one of genomes or chromosomes of the three species. In addition, the distance between any two adjacent microsatellites was an important factor to influence the formation of compound microsatellites. The analysis may be helpful for further studying the roles of microsatellites in genomes' origination, organization and evolution of Candida species.
Collapse
|
21
|
Comparative analysis of microsatellites and compound microsatellites in T4-like viruses. Gene 2016; 575:695-701. [DOI: 10.1016/j.gene.2015.09.053] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2015] [Revised: 09/16/2015] [Accepted: 09/21/2015] [Indexed: 01/27/2023]
|
22
|
GEORGE B, GEORGE B, AWASTHI M, SINGH RN. In silico genome-wide identification and analysis of microsatelliterepeats in the largest RNA virus family (Closteroviridae). Turk J Biol 2016. [DOI: 10.3906/biy-1503-11] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023] Open
|
23
|
Rehm C, Wurmthaler LA, Li Y, Frickey T, Hartig JS. Investigation of a Quadruplex-Forming Repeat Sequence Highly Enriched in Xanthomonas and Nostoc sp. PLoS One 2015; 10:e0144275. [PMID: 26695179 PMCID: PMC4692102 DOI: 10.1371/journal.pone.0144275] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2015] [Accepted: 11/16/2015] [Indexed: 12/04/2022] Open
Abstract
In prokaryotes simple sequence repeats (SSRs) with unit sizes of 1–5
nucleotides (nt) are causative for phase and antigenic variation. Although an
increased abundance of heptameric repeats was noticed in bacteria, reports about SSRs
of 6–9 nt are rare. In particular G-rich repeat sequences with the propensity
to fold into G-quadruplex (G4) structures have received little attention. In silico
analysis of prokaryotic genomes show putative G4 forming sequences to be abundant.
This report focuses on a surprisingly enriched G-rich repeat of the type
GGGNATC in Xanthomonas and cyanobacteria
such as Nostoc. We studied in detail the genomes of
Xanthomonas campestris pv. campestris ATCC 33913
(Xcc), Xanthomonas axonopodis pv.
citri str. 306 (Xac), and Nostoc
sp. strain PCC7120 (Ana). In all three organisms repeats
are spread all over the genome with an over-representation in non-coding regions.
Extensive variation of the number of repetitive units was observed with repeat
numbers ranging from two up to 26 units. However a clear preference for four units
was detected. The strong bias for four units coincides with the requirement of four
consecutive G-tracts for G4 formation. Evidence for G4 formation of the consensus
repeat sequences was found in biophysical studies utilizing CD spectroscopy. The
G-rich repeats are preferably located between aligned open reading frames (ORFs) and
are under-represented in coding regions or between divergent ORFs. The G-rich repeats
are preferentially located within a distance of 50 bp upstream of an ORF on the
anti-sense strand or within 50 bp from the stop codon on the sense strand. Analysis
of whole transcriptome sequence data showed that the majority of repeat sequences are
transcribed. The genetic loci in the vicinity of repeat regions show increased
genomic stability. In conclusion, we introduce and characterize a special class of
highly abundant and wide-spread quadruplex-forming repeat sequences in bacteria.
Collapse
Affiliation(s)
- Charlotte Rehm
- Department of Chemistry and Konstanz Research School Chemical Biology (KoRS-CB), University of Konstanz, Universitätsstr. 10, 78457 Konstanz, Germany
| | - Lena A Wurmthaler
- Department of Chemistry and Konstanz Research School Chemical Biology (KoRS-CB), University of Konstanz, Universitätsstr. 10, 78457 Konstanz, Germany
| | - Yuanhao Li
- Department of Biology, University of Konstanz, Universitätsstr. 10, 78457 Konstanz, Germany
| | - Tancred Frickey
- Department of Biology, University of Konstanz, Universitätsstr. 10, 78457 Konstanz, Germany
| | - Jörg S Hartig
- Department of Chemistry and Konstanz Research School Chemical Biology (KoRS-CB), University of Konstanz, Universitätsstr. 10, 78457 Konstanz, Germany
| |
Collapse
|
24
|
Holder IT, Wagner S, Xiong P, Sinn M, Frickey T, Meyer A, Hartig JS. Intrastrand triplex DNA repeats in bacteria: a source of genomic instability. Nucleic Acids Res 2015; 43:10126-42. [PMID: 26450966 PMCID: PMC4666352 DOI: 10.1093/nar/gkv1017] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2015] [Accepted: 09/21/2015] [Indexed: 01/10/2023] Open
Abstract
Repetitive nucleic acid sequences are often prone to form secondary structures distinct from B-DNA. Prominent examples of such structures are DNA triplexes. We observed that certain intrastrand triplex motifs are highly conserved and abundant in prokaryotic genomes. A systematic search of 5246 different prokaryotic plasmids and genomes for intrastrand triplex motifs was conducted and the results summarized in the ITxF database available online at http://bioinformatics.uni-konstanz.de/utils/ITxF/. Next we investigated biophysical and biochemical properties of a particular G/C-rich triplex motif (TM) that occurs in many copies in more than 260 bacterial genomes by CD and nuclear magnetic resonance spectroscopy as well as in vivo footprinting techniques. A characterization of putative properties and functions of these unusually frequent nucleic acid motifs demonstrated that the occurrence of the TM is associated with a high degree of genomic instability. TM-containing genomic loci are significantly more rearranged among closely related Escherichia coli strains compared to control sites. In addition, we found very high frequencies of TM motifs in certain Enterobacteria and Cyanobacteria that were previously described as genetically highly diverse. In conclusion we link intrastrand triplex motifs with the induction of genomic instability. We speculate that the observed instability might be an adaptive feature of these genomes that creates variation for natural selection to act upon.
Collapse
Affiliation(s)
- Isabelle T Holder
- Department of Chemistry and Konstanz Research School Chemical Biology (KoRS-CB), University of Konstanz, Universitätsstrasse 10, 78457 Konstanz, Germany
| | - Stefanie Wagner
- Department of Chemistry and Konstanz Research School Chemical Biology (KoRS-CB), University of Konstanz, Universitätsstrasse 10, 78457 Konstanz, Germany
| | - Peiwen Xiong
- Department of Biology, University of Konstanz, Universitätsstrasse 10, 78457 Konstanz, Germany
| | - Malte Sinn
- Department of Chemistry and Konstanz Research School Chemical Biology (KoRS-CB), University of Konstanz, Universitätsstrasse 10, 78457 Konstanz, Germany
| | - Tancred Frickey
- Department of Biology, University of Konstanz, Universitätsstrasse 10, 78457 Konstanz, Germany
| | - Axel Meyer
- Department of Biology, University of Konstanz, Universitätsstrasse 10, 78457 Konstanz, Germany
| | - Jörg S Hartig
- Department of Chemistry and Konstanz Research School Chemical Biology (KoRS-CB), University of Konstanz, Universitätsstrasse 10, 78457 Konstanz, Germany
| |
Collapse
|
25
|
Wei W, Davis RE, Suo X, Zhao Y. Occurrence, distribution and possible functional roles of simple sequence repeats in phytoplasma genomes. Int J Syst Evol Microbiol 2015; 65:2748-2760. [DOI: 10.1099/ijs.0.000273] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/24/2023] Open
Abstract
Phytoplasmas are unculturable, cell-wall-less bacteria that parasitize plants and insects. This transkingdom life cycle requires rapid responses to vastly different environments, including transitions from plant phloem sieve elements to various insect tissues and alternations among diverse plant hosts. Features that enable such flexibility in other microbes include simple sequence repeats (SSRs) — mutation-prone, phase-variable short DNA tracts that function as ‘evolutionary rheostats’ and enhance rapid adaptations. To gain insights into the occurrence, distribution and potentially functional roles of SSRs in phytoplasmas, we performed computational analysis on the genomes of five completely sequenced phytoplasma strains, ‘Candidatus Phytoplasma asteris’-related strains OYM and AYWB, ‘Candidatus Phytoplasma australiense’-related strains CBWB and SLY and ‘Candidatus Phytoplasma mali’-related strain AP-AT. The overall density of SSRs in phytoplasma genomes was higher than in representative strains of other prokaryotes. While mono- and trinucleotide SSRs were significantly overrepresented in the phytoplasma genomes, dinucleotide SSRs and other higher-order SSRs were underrepresented. The occurrence and distribution of long SSRs in the prophage islands and phytoplasma-unique genetic loci indicated that SSRs played a role in compounding the complexity of sequence mosaics in individual genomes and in increasing allelic diversity among genomes. Findings from computational analyses were further complemented by an examination of SSRs in varied additional phytoplasma strains, with a focus on potential contingency genes. Some SSRs were located in regions that could profoundly alter the regulation of transcription and translation of affected genes and/or the composition of protein products.
Collapse
Affiliation(s)
- Wei Wei
- Molecular Plant Pathology Laboratory, USDA-Agricultural Research Service, Beltsville, MD, 20705, USA
| | - Robert E. Davis
- Molecular Plant Pathology Laboratory, USDA-Agricultural Research Service, Beltsville, MD, 20705, USA
| | - Xiaobing Suo
- Molecular Plant Pathology Laboratory, USDA-Agricultural Research Service, Beltsville, MD, 20705, USA
| | - Yan Zhao
- Molecular Plant Pathology Laboratory, USDA-Agricultural Research Service, Beltsville, MD, 20705, USA
| |
Collapse
|
26
|
George B, George B, awasthi M, Singh RN. Genome wide survey and analysis of microsatellites in Tombusviridae family. Genes Genomics 2015. [DOI: 10.1007/s13258-015-0295-0] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
|
27
|
Zou G, Bello-Orti B, Aragon V, Tucker AW, Luo R, Ren P, Bi D, Zhou R, Jin H. The Cipher Code of Simple Sequence Repeats in "Vampire Pathogens". Sci Rep 2015. [PMID: 26215592 PMCID: PMC4516964 DOI: 10.1038/srep12441] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022] Open
Abstract
Blood inside mammals is a forbidden area for the majority of prokaryotic microbes; however, red blood cells tropism microbes, like “vampire pathogens” (VP), succeed in matching scarce nutrients and surviving strong immunity reactions. Here, we found VP of Mycoplasma, Rhizobiales, and Rickettsiales showed significantly higher counts of (AG)n dimeric simple sequence repeats (Di-SSRs) in the genomes, coding and non-coding regions than non Vampire Pathogens (N_VP). Regression analysis indicated a significant correlation between GC content and the span of (AG)n-Di-SSR variation. Gene Ontology (GO) terms with abundance of (AG)3-Di-SSRs shared by the VP strains were associated with purine nucleotide metabolism (FDR < 0.01), indicating an adaptation to the limited availability of purine and nucleotide precursors in blood. Di-amino acids coded by (AG)n-Di-SSRs included all three six-fold code amino acids (Arg, Leu and Ser) and significantly higher counts of Di-amino acids coded by (AG)3, (GA)3, and (TC)3 in VP than N_VP. Furthermore, significant differences (P < 0.001) on the numbers of triplexes formed from (AG)n-Di-SSRs between VP and N_VP in Mycoplasma suggested the potential role of (AG)n-Di-SSRs in gene regulation.
Collapse
Affiliation(s)
- Geng Zou
- State Key Laboratory of Agricultural Microbiology, College of Veterinary Medicine, Huazhong Agricultural University, Wuhan 430070, China
| | - Bernardo Bello-Orti
- Centre de Recerca en Sanitat Animal (CReSA), UAB-IRTA, Campus de la Universitat Autònoma de Barcelona, 08193 Bellaterra, Spain
| | - Virginia Aragon
- 1] Centre de Recerca en Sanitat Animal (CReSA), UAB-IRTA, Campus de la Universitat Autònoma de Barcelona, 08193 Bellaterra, Spain [2] Institut de Recercai Tecnologia Agroalimentàries (IRTA), Barcelona, Spain
| | - Alexander W Tucker
- Department of Veterinary Medicine, University of Cambridge, Cambridge, United Kingdom
| | - Rui Luo
- State Key Laboratory of Agricultural Microbiology, College of Veterinary Medicine, Huazhong Agricultural University, Wuhan 430070, China
| | - Pinxing Ren
- State Key Laboratory of Agricultural Microbiology, College of Veterinary Medicine, Huazhong Agricultural University, Wuhan 430070, China
| | - Dingren Bi
- State Key Laboratory of Agricultural Microbiology, College of Veterinary Medicine, Huazhong Agricultural University, Wuhan 430070, China
| | - Rui Zhou
- State Key Laboratory of Agricultural Microbiology, College of Veterinary Medicine, Huazhong Agricultural University, Wuhan 430070, China
| | - Hui Jin
- State Key Laboratory of Agricultural Microbiology, College of Veterinary Medicine, Huazhong Agricultural University, Wuhan 430070, China
| |
Collapse
|
28
|
Specificity and Strain-Typing Capabilities of Nanorod Array-Surface Enhanced Raman Spectroscopy for Mycoplasma pneumoniae Detection. PLoS One 2015; 10:e0131831. [PMID: 26121242 PMCID: PMC4487258 DOI: 10.1371/journal.pone.0131831] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2014] [Accepted: 06/07/2015] [Indexed: 01/29/2023] Open
Abstract
Mycoplasma pneumoniae is a cell wall-less bacterial pathogen of the human respiratory tract that accounts for > 20% of all community-acquired pneumonia (CAP). At present the most effective means for detection and strain-typing is quantitative polymerase chain reaction (qPCR), which can exhibit excellent sensitivity and specificity but requires separate tests for detection and genotyping, lacks standardization between available tests and between labs, and has limited practicality for widespread, point-of-care use. We have developed and previously described a silver nanorod array-surface enhanced Raman Spectroscopy (NA-SERS) biosensing platform capable of detecting M. pneumoniae with statistically significant specificity and sensitivity in simulated and true clinical throat swab samples, and the ability to distinguish between reference strains of the two main genotypes of M. pneumoniae. Furthermore, we have established a qualitative lower endpoint of detection for NA-SERS of < 1 genome equivalent (cell/μl) and a quantitative multivariate detection limit of 5.3 ± 1 cells/μl. Here we demonstrate using partial least squares- discriminatory analysis (PLS-DA) of sample spectra that NA-SERS correctly identified M. pneumoniae clinical isolates from globally diverse origins and distinguished these from a panel of 12 other human commensal and pathogenic mycoplasma species with 100% cross-validated statistical accuracy. Furthermore, PLS-DA correctly classified by strain type all 30 clinical isolates with 96% cross-validated accuracy for type 1 strains, 98% cross-validated accuracy for type 2 strains, and 90% cross-validated accuracy for type 2V strains.
Collapse
|
29
|
George B, Bhatt BS, Awasthi M, George B, Singh AK. Comparative analysis of microsatellites in chloroplast genomes of lower and higher plants. Curr Genet 2015; 61:665-77. [PMID: 25999216 DOI: 10.1007/s00294-015-0495-9] [Citation(s) in RCA: 39] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2015] [Revised: 05/05/2015] [Accepted: 05/08/2015] [Indexed: 12/29/2022]
Abstract
Microsatellites, or simple sequence repeats (SSRs), contain repetitive DNA sequence where tandem repeats of one to six base pairs are present number of times. Chloroplast genome sequences have been shown to possess extensive variations in the length, number and distribution of SSRs. However, a comparative analysis of chloroplast microsatellites is not available. Considering their potential importance in generating genomic diversity, we have systematically analysed the abundance and distribution of simple and compound microsatellites in 164 sequenced chloroplast genomes from wide range of plants. The key findings of these studies are (1) a large number of mononucleotide repeats as compared to SSR(2-6)(di-, tri-, tetra-, penta-, hexanucleotide repeats) are present in all chloroplast genomes investigated, (2) lower plants such as algae show wide variation in relative abundance, density and distribution of microsatellite repeats as compared to flowering plants, (3) longer SSRs are excluded from coding regions of most chloroplast genomes, (4) GC content has a weak influence on number, relative abundance and relative density of mononucleotide as well as SSR(2-6). However, GC content strongly showed negative correlation with relative density (R (2) = 0.5, P < 0.05) and relative abundance (R (2) = 0.6, P < 0.05) of cSSRs. In summary, our comparative studies of chloroplast genomes illustrate the variable distribution of microsatellites and revealed that chloroplast genome of smaller plants possesses relatively more genomic diversity compared to higher plants.
Collapse
Affiliation(s)
- Biju George
- Blessy Software Solution, Sector 4/441, Malviya Nagar, Jaipur, 302017, Rajasthan, India.
| | - Bhavin S Bhatt
- School of Life Sciences, Central University of Gujarat, Gandhinagar, 382030, Gujarat, India
| | - Mayur Awasthi
- Mahatma Gandhi Chitrakoot Gramodaya Vishwavidhyalaya, Satna, 485334, Madhya Pradesh, India
| | - Binu George
- Blessy Software Solution, Sector 4/441, Malviya Nagar, Jaipur, 302017, Rajasthan, India
| | - Achuit K Singh
- School of Life Sciences, Central University of Gujarat, Gandhinagar, 382030, Gujarat, India.
| |
Collapse
|
30
|
Minimal genomes of mycoplasma-related endobacteria are plastic and contain host-derived genes for sustained life within Glomeromycota. Proc Natl Acad Sci U S A 2015; 112:7791-6. [PMID: 25964324 DOI: 10.1073/pnas.1501676112] [Citation(s) in RCA: 59] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Arbuscular mycorrhizal fungi (AMF, Glomeromycota) colonize roots of the majority of terrestrial plants. They provide essential minerals to their plant hosts and receive photosynthates in return. All major lineages of AMF harbor endobacteria classified as Mollicutes, and known as mycoplasma-related endobacteria (MRE). Except for their substantial intrahost genetic diversity and ability to transmit vertically, virtually nothing is known about the life history of these endobacteria. To understand MRE biology, we sequenced metagenomes of three MRE populations, each associated with divergent AMF hosts. We found that each AMF species harbored a genetically distinct group of MRE. Despite vertical transmission, all MRE populations showed extensive chromosomal rearrangements, which we attributed to genetic recombination, activity of mobile elements, and a history of plectroviral invasion. The MRE genomes are characterized by a highly reduced gene content, indicating metabolic dependence on the fungal host, with the mechanism of energy production remaining unclear. Several MRE genes encode proteins with domains involved in protein-protein interactions with eukaryotic hosts. In addition, the MRE genomes harbor genes horizontally acquired from AMF. Some of these genes encode small ubiquitin-like modifier (SUMO) proteases specific to the SUMOylation systems of eukaryotes, which MRE likely use to manipulate their fungal host. The extent of MRE genome plasticity and reduction, along with the large number of horizontally acquired host genes, suggests a high degree of adaptation to the fungal host. These features, together with the ubiquity of the MRE-Glomeromycota associations, emphasize the significance of MRE in the biology of Glomeromycota.
Collapse
|
31
|
The analysis of microsatellites and compound microsatellites in 56 complete genomes of Herpesvirales. Gene 2014; 551:103-9. [DOI: 10.1016/j.gene.2014.08.054] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2014] [Revised: 08/09/2014] [Accepted: 08/26/2014] [Indexed: 01/13/2023]
|
32
|
George B, Gnanasekaran P, Jain SK, Chakraborty S. Genome wide survey and analysis of small repetitive sequences in caulimoviruses. INFECTION GENETICS AND EVOLUTION 2014; 27:15-24. [PMID: 24999243 DOI: 10.1016/j.meegid.2014.06.018] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/06/2014] [Revised: 06/01/2014] [Accepted: 06/22/2014] [Indexed: 12/19/2022]
Abstract
Microsatellites are known to exhibit ubiquitous presence across all kingdoms of life including viruses. Members of the Caulimoviridae family severely affect growth of vegetable and fruit plants and reduce economic yield in diverse cropping systems worldwide. Here, we analyzed the nature and distribution of both simple and complex microsatellites present in complete genome of 44 species of Caulimoviridae. Our results showed, in all analyzed genomes, genome size and GC content had a weak influence on number, relative abundance and relative density of microsatellites, respectively. For each genome, mono- and dinucleotide repeats were found to be highly predominant and are overrepresented in genome of majority of caulimoviruses. AT/TA and GAA/AAG/AGA was the most abundant di- and trinucleotide repeat motif, respectively. Repeats larger than trinucleotide were rarely found in these genomes. Comparative study of occurrence, abundance and density of microsatellite among available RNA and DNA viral genomes indicated that simple repeats were least abundant in genomes of caulimoviruses. Polymorphic repeats even though rare were observed in the large intergenic region of the genome, indicating strand slippage and/or unequal recombination processes do occur in caulimoviruses. To our knowledge, this is the first analysis of microsatellites occurring in any dsDNA viral genome. Characterization of such variations in repeat sequences would be important in deciphering the origin, mutational processes, and role of repeat sequences in viral genomes.
Collapse
Affiliation(s)
- Biju George
- Molecular Virology Laboratory, School of Life Sciences, Jawaharlal Nehru University, New Delhi 110067, India
| | - Prabu Gnanasekaran
- Molecular Virology Laboratory, School of Life Sciences, Jawaharlal Nehru University, New Delhi 110067, India
| | - S K Jain
- Department of Biotechnology, Jamia Hamdard University, New Delhi, Delhi 110062, India
| | - Supriya Chakraborty
- Molecular Virology Laboratory, School of Life Sciences, Jawaharlal Nehru University, New Delhi 110067, India.
| |
Collapse
|
33
|
Huang Y, Mrázek J. Assessing diversity of DNA structure-related sequence features in prokaryotic genomes. DNA Res 2014; 21:285-97. [PMID: 24408877 PMCID: PMC4060949 DOI: 10.1093/dnares/dst057] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2023] Open
Abstract
Prokaryotic genomes are diverse in terms of their nucleotide and oligonucleotide composition as well as presence of various sequence features that can affect physical properties of the DNA molecule. We present a survey of local sequence patterns which have a potential to promote non-canonical DNA conformations (i.e. different from standard B-DNA double helix) and interpret the results in terms of relationships with organisms' habitats, phylogenetic classifications, and other characteristics. Our present work differs from earlier similar surveys not only by investigating a wider range of sequence patterns in a large number of genomes but also by using a more realistic null model to assess significant deviations. Our results show that simple sequence repeats and Z-DNA-promoting patterns are generally suppressed in prokaryotic genomes, whereas palindromes and inverted repeats are over-represented. Representation of patterns that promote Z-DNA and intrinsic DNA curvature increases with increasing optimal growth temperature (OGT), and decreases with increasing oxygen requirement. Additionally, representations of close direct repeats, palindromes and inverted repeats exhibit clear negative trends with increasing OGT. The observed relationships with environmental characteristics, particularly OGT, suggest possible evolutionary scenarios of structural adaptation of DNA to particular environmental niches.
Collapse
Affiliation(s)
- Yongjie Huang
- Institute of Bioinformatics, University of Georgia, Athens, GA 30602, USA
| | - Jan Mrázek
- Institute of Bioinformatics, University of Georgia, Athens, GA 30602, USA Department of Microbiology, University of Georgia, Athens, GA 30602, USA
| |
Collapse
|
34
|
Moyano AJ, Feliziani S, Di Rienzo JA, Smania AM. Simple sequence repeats together with mismatch repair deficiency can bias mutagenic pathways in Pseudomonas aeruginosa during chronic lung infection. PLoS One 2013; 8:e80514. [PMID: 24278287 PMCID: PMC3837008 DOI: 10.1371/journal.pone.0080514] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2013] [Accepted: 10/04/2013] [Indexed: 11/18/2022] Open
Abstract
Pseudomonas aeruginosa is an opportunistic pathogen that chronically infects the airways of cystic fibrosis (CF) patients and undergoes a process of genetic adaptation based on mutagenesis. We evaluated the role of mononucleotide G:C and A:T simple sequence repeats (SSRs) in this adaptive process. An in silico survey of the genome sequences of 7 P. aeruginosa strains showed that mononucleotide G:C SSRs but not A:T SSRs were greatly under-represented in coding regions, suggesting a strong counterselection process for G:C SSRs with lengths >5 bp but not for A:T SSRs. A meta-analysis of published whole genome sequence data for a P. aeruginosa strain from a CF patient with chronic airway infection showed that G:C SSRs but not A:T SSRs were frequently mutated during the infection process through the insertion or deletion of one or more SSR subunits. The mutation tendency of G:C SSRs was length-dependent and increased exponentially as a function of SSR length. When this strain naturally became a stable Mismatch Repair System (MRS)-deficient mutator, the degree of increase of G:C SSRs mutations (5-fold) was much higher than that of other types of mutation (2.2-fold or less). Sequence analysis of several mutated genes reported for two different collections, both containing mutator and non-mutator strains of P. aeruginosa from CF chronic infections, showed that the proportion of G:C SSR mutations was significantly higher in mutators than in non-mutators, whereas no such difference was observed for A:T SSR mutations. Our findings, taken together, provide genome-scale evidences that under a MRS-deficient background, long G:C SSRs are able to stochastically bias mutagenic pathways by making the genes in which they are harbored more prone to mutation. The combination of MRS deficiency and virulence-related genes that contain long G:C SSRs is therefore a matter of concern in P. aeruginosa CF chronic infection.
Collapse
Affiliation(s)
- Alejandro J. Moyano
- Centro de Investigaciones en Química Biológica de Córdoba (CIQUIBIC), CONICET, Departamento de Química Biológica, Facultad de Ciencias Químicas, Universidad Nacional de Córdoba, Córdoba, Argentina
| | - Sofía Feliziani
- Centro de Investigaciones en Química Biológica de Córdoba (CIQUIBIC), CONICET, Departamento de Química Biológica, Facultad de Ciencias Químicas, Universidad Nacional de Córdoba, Córdoba, Argentina
| | - Julio A. Di Rienzo
- Estadística y Biometría, Facultad de Ciencias Agropecuarias, Universidad Nacional de Córdoba, Córdoba, Argentina
| | - Andrea M. Smania
- Centro de Investigaciones en Química Biológica de Córdoba (CIQUIBIC), CONICET, Departamento de Química Biológica, Facultad de Ciencias Químicas, Universidad Nacional de Córdoba, Córdoba, Argentina
- * E-mail:
| |
Collapse
|
35
|
A genome-wide analysis of simple sequence repeats in maize and the development of polymorphism markers from next-generation sequence data. BMC Res Notes 2013; 6:403. [PMID: 24099602 PMCID: PMC3828028 DOI: 10.1186/1756-0500-6-403] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2013] [Accepted: 09/12/2013] [Indexed: 01/10/2023] Open
Abstract
Background Maize (Zea mays ssp. mays L.), as the most important plant for staple food of several million people, animal feed and bioenergy productions, is widely cultivated around the world. Simple sequence repeats (SSRs) are widely used as molecular markers in maize genetics and breeding, but only two thousands pairs of SSRs have been published currently, which hardly satisfies for the increasing needs of geneticists and breeders. Furthermore, the increasing studies have revealed that SSRs also play a vital role in functional regulation and evolution. It is fortunate that the development of sequencing technology and bio-software provides the basis for characterization and development of SSRs in maize. Results In this study, MISA was applied to identify overall 179,681 SSRs in maize reference genome B73, with an average distance of 11.46 Kbp. Their distributions within the genome in different regions were non-random, and the density followed in a descending order of UTR, promotor, intron, intergenic and CDS. Meanwhile, 82,694 (46.02%) SSRs with unique flanking sequences were selected, and then applied to analyze the polymorphism of next-generation sequencing data from 345 maize inbred lines and data from maize reference genome B73. There were 58,946 SSRs with length information results in ten or more than ten genomes, accounting for 71.28% of SSRs with unique flanking sequences, while 55,621 SSRs had polymorphism, with an average PIC value of 0.498. 250 pairs of SSR primers in different genomic regions covering all maize chromosomes were randomly chosen for the experimental validation, with an average PIC value of 0.63 in 11 elite maize inbred lines. Conclusions Our work provided insight into the non-random distribution spatterns and compositions of SSRs in different regions of maize genome, and also developed more polymorphic SSR markers using next-generation sequencing reads. The genome-wide SSRs polymorphism markers could be useful for genetic analysis and marker-assisted selection in breeding practice, and it was also proved to be high efficient for molecular marker development via next-generation sequencing reads.
Collapse
|
36
|
Zhou K, Aertsen A, Michiels CW. The role of variable DNA tandem repeats in bacterial adaptation. FEMS Microbiol Rev 2013; 38:119-41. [PMID: 23927439 DOI: 10.1111/1574-6976.12036] [Citation(s) in RCA: 100] [Impact Index Per Article: 9.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2013] [Revised: 07/13/2013] [Accepted: 07/26/2013] [Indexed: 01/05/2023] Open
Abstract
DNA tandem repeats (TRs), also designated as satellite DNA, are inter- or intragenic nucleotide sequences that are repeated two or more times in a head-to-tail manner. Because TR tracts are prone to strand-slippage replication and recombination events that cause the TR copy number to increase or decrease, loci containing TRs are hypermutable. An increasing number of examples illustrate that bacteria can exploit this instability of TRs to reversibly shut down or modulate the function of specific genes, allowing them to adapt to changing environments on short evolutionary time scales without an increased overall mutation rate. In this review, we discuss the prevalence and distribution of inter- and intragenic TRs in bacteria and the mechanisms of their instability. In addition, we review evidence demonstrating a role of TR variations in bacterial adaptation strategies, ranging from immune evasion and tissue tropism to the modulation of environmental stress tolerance. Nevertheless, while bioinformatic analysis reveals that most bacterial genomes contain a few up to several dozens of intra- and intergenic TRs, only a small fraction of these have been functionally studied to date.
Collapse
Affiliation(s)
- Kai Zhou
- Department of Microbial and Molecular Systems (M²S), Faculty of Bioscience Engineering, Laboratory of Food Microbiology and Leuven Food Science and Nutrition Research Centre (LFoRCe), KU Leuven, Leuven, Belgium
| | | | | |
Collapse
|
37
|
Algire MA, Montague MG, Vashee S, Lartigue C, Merryman C. A Type III restriction-modification system in Mycoplasma mycoides subsp. capri. Open Biol 2013; 2:120115. [PMID: 23155485 PMCID: PMC3498834 DOI: 10.1098/rsob.120115] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2012] [Accepted: 10/10/2012] [Indexed: 12/24/2022] Open
Abstract
The sequenced genome of Mycoplasma mycoides subsp. capri revealed the presence of a Type III restriction–modification system (MmyCI). The methyltransferase (modification) subunit of MmyCI (M.MmyCI) was shown to recognize the sequence 5′-TGAG-3′ and methylate the adenine. The coding region of the methyltransferase gene contains 12 consecutive AG dinucleotide repeats that result in a translational termination at a TAA codon immediately beyond the repeat region. This strain does not have MmyCI activity. A clone was found with 10 AG repeats such that the gene is in frame, and this strain has MmyCI activity, suggesting that the expression of the MmyCI methyltransferase may be phase variable.
Collapse
Affiliation(s)
- Mikkel A Algire
- The J. Craig Venter Institute, 9704 Medical Center Drive, Rockville, MD 20850, USA.
| | | | | | | | | |
Collapse
|
38
|
Zhao X, Tian Y, Yang R, Feng H, Ouyang Q, Tian Y, Tan Z, Li M, Niu Y, Jiang J, Shen G, Yu R. Coevolution between simple sequence repeats (SSRs) and virus genome size. BMC Genomics 2012; 13:435. [PMID: 22931422 PMCID: PMC3585866 DOI: 10.1186/1471-2164-13-435] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2012] [Accepted: 08/18/2012] [Indexed: 12/26/2022] Open
Abstract
Background Relationship between the level of repetitiveness in genomic sequence and genome size has been investigated by making use of complete prokaryotic and eukaryotic genomes, but relevant studies have been rarely made in virus genomes. Results In this study, a total of 257 viruses were examined, which cover 90% of genera. The results showed that simple sequence repeats (SSRs) is strongly, positively and significantly correlated with genome size. Certain repeat class is distributed in a certain range of genome sequence length. Mono-, di- and tri- repeats are widely distributed in all virus genomes, tetra- SSRs as a common component consist in genomes which more than 100 kb in size; in the range of genome < 100 kb, genomes containing penta- and hexa- SSRs are not more than 50%. Principal components analysis (PCA) indicated that dinucleotide repeat affects the differences of SSRs most strongly among virus genomes. Results showed that SSRs tend to accumulate in larger virus genomes; and the longer genome sequence, the longer repeat units. Conclusions We conducted this research standing on the height of the whole virus. We concluded that genome size is an important factor in affecting the occurrence of SSRs; hosts are also responsible for the variances of SSRs content to a certain degree.
Collapse
Affiliation(s)
- Xiangyan Zhao
- Chinese Academy of Inspection and Quarantine, Beijing, 100029, China
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
39
|
Abel J, Mrázek J. Differences in DNA curvature-related sequence periodicity between prokaryotic chromosomes and phages, and relationship to chromosomal prophage content. BMC Genomics 2012; 13:188. [PMID: 22587570 PMCID: PMC3431218 DOI: 10.1186/1471-2164-13-188] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2011] [Accepted: 05/07/2012] [Indexed: 02/07/2023] Open
Abstract
Background Periodic spacing of A-tracts (short runs of A or T) with the DNA helical period of ~10–11 bp is characteristic of intrinsically bent DNA. In eukaryotes, the DNA bending is related to chromatin structure and nucleosome positioning. However, the physiological role of strong sequence periodicity detected in many prokaryotic genomes is not clear. Results We developed measures of intensity and persistency of DNA curvature-related sequence periodicity and applied them to prokaryotic chromosomes and phages. The results indicate that strong periodic signals present in chromosomes are generally absent in phage genomes. Moreover, chromosomes containing prophages are less likely to possess a persistent periodic signal than chromosomes with no prophages. Conclusions Absence of DNA curvature-related sequence periodicity in phages could arise from constraints associated with DNA packaging in the viral capsid. Lack of prophages in chromosomes with persistent periodic signal suggests that the sequence periodicity and concomitant DNA curvature could play a role in protecting the chromosomes from integration of phage DNA.
Collapse
Affiliation(s)
- Jacob Abel
- Department of Microbiology, University of Georgia, Athens, GA 30602, USA
| | | |
Collapse
|
40
|
Identification and characterization of a lipopolysaccharide α,2,3-sialyltransferase from the human pathogen Helicobacter bizzozeronii. J Bacteriol 2012; 194:2540-50. [PMID: 22408169 DOI: 10.1128/jb.00126-12] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023] Open
Abstract
Terminal sialic acid in the lipopolysaccharides (LPSs) of mucosal pathogens is an important virulence factor. Here we report the characterization of a Helicobacter sialyltransferase involved in the biosynthesis of sialylated LPS in Helicobacter bizzozeronii, the only non-pylori gastric Helicobacter species isolated from humans thus far. Starting from the genome sequences of canine and human strains, we identified potential sialyltransferases downstream of three genes involved in the biosynthesis of N-acetylneuraminic acid. One of these candidates showed monofunctional α,2,3-sialyltransferase activity with a preference for N-acetyllactosamine as a substrate. The LPSs from different strains were shown by SDS-PAGE and high-performance anion-exchange chromatography with pulsed amperometric detection (HPAEC-PAD) to contain sialic acid after neuraminidase treatment. The expression of this sialyltransferase and sialyl-LPS appeared to be a phase-variable characteristic common to both human and canine H. bizzozeronii strains. The sialylation site of the LPSs of two H. bizzozeronii strains was determined to be NeuAc-Hex-HexNAc, suggesting terminal 3'-sialyl-LacNAc. Moreover, serological typing revealed the possible presence of sialyl-Lewis X in two additional strains, indicating that H. bizzozeronii could also mimic the surface glycans of mammalian cells. The expression of sialyl-glycans may influence the adaptation process of H. bizzozeronii during the host jump from dogs to humans.
Collapse
|
41
|
Schott T, Kondadi PK, Hänninen ML, Rossi M. Comparative genomics of Helicobacter pylori and the human-derived Helicobacter bizzozeronii CIII-1 strain reveal the molecular basis of the zoonotic nature of non-pylori gastric Helicobacter infections in humans. BMC Genomics 2011; 12:534. [PMID: 22039924 PMCID: PMC3234257 DOI: 10.1186/1471-2164-12-534] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2011] [Accepted: 10/31/2011] [Indexed: 01/01/2023] Open
Abstract
BACKGROUND The canine Gram-negative Helicobacter bizzozeronii is one of seven species in Helicobacter heilmannii sensu lato that are detected in 0.17-2.3% of the gastric biopsies of human patients with gastric symptoms. At the present, H. bizzozeronii is the only non-pylori gastric Helicobacter sp. cultivated from human patients and is therefore a good alternative model of human gastric Helicobacter disease. We recently sequenced the genome of the H. bizzozeronii human strain CIII-1, isolated in 2008 from a 47-year old Finnish woman suffering from severe dyspeptic symptoms. In this study, we performed a detailed comparative genome analysis with H. pylori, providing new insights into non-pylori Helicobacter infections and the mechanisms of transmission between the primary animal host and humans. RESULTS H. bizzozeronii possesses all the genes necessary for its specialised life in the stomach. However, H. bizzozeronii differs from H. pylori by having a wider metabolic flexibility in terms of its energy sources and electron transport chain. Moreover, H. bizzozeronii harbours a higher number of methyl-accepting chemotaxis proteins, allowing it to respond to a wider spectrum of environmental signals. In this study, H. bizzozeronii has been shown to have high level of genome plasticity. We were able to identify a total of 43 contingency genes, 5 insertion sequences (ISs), 22 mini-IS elements, 1 genomic island and a putative prophage. Although H. bizzozeronii lacks homologues of some of the major H. pylori virulence genes, other candidate virulence factors are present. In particular, we identified a polysaccharide lyase (HBZC1_15820) as a potential new virulence factor of H. bizzozeronii. CONCLUSIONS The comparative genome analysis performed in this study increased the knowledge of the biology of gastric Helicobacter species. In particular, we propose the hypothesis that the high metabolic versatility and the ability to react to a range of environmental signals, factors which differentiate H. bizzozeronii as well as H. felis and H. suis from H. pylori, are the molecular basis of the of the zoonotic nature of H. heilmannii sensu lato infection in humans.
Collapse
Affiliation(s)
- Thomas Schott
- Department of Food Hygiene and Environmental Health (DFHEH), Faculty of Veterinary Medicine, University of Helsinki, P.O. Box 66, FI-00014 University of Helsinki, Finland
| | - Pradeep K Kondadi
- Department of Food Hygiene and Environmental Health (DFHEH), Faculty of Veterinary Medicine, University of Helsinki, P.O. Box 66, FI-00014 University of Helsinki, Finland
| | - Marja-Liisa Hänninen
- Department of Food Hygiene and Environmental Health (DFHEH), Faculty of Veterinary Medicine, University of Helsinki, P.O. Box 66, FI-00014 University of Helsinki, Finland
| | - Mirko Rossi
- Department of Food Hygiene and Environmental Health (DFHEH), Faculty of Veterinary Medicine, University of Helsinki, P.O. Box 66, FI-00014 University of Helsinki, Finland
| |
Collapse
|
42
|
Chen M, Tan Z, Zeng G. Microsatellite is an important component of complete Hepatitis C virus genomes. INFECTION GENETICS AND EVOLUTION 2011; 11:1646-54. [DOI: 10.1016/j.meegid.2011.06.012] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/11/2011] [Revised: 06/02/2011] [Accepted: 06/16/2011] [Indexed: 12/15/2022]
|
43
|
Sequence analysis of the p1 adhesin gene of Mycoplasma pneumoniae in clinical isolates collected in Beijing in 2008 to 2009. J Clin Microbiol 2011; 49:3000-3. [PMID: 21697320 DOI: 10.1128/jcm.00105-11] [Citation(s) in RCA: 41] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
The p1 genes of 60 Mycoplasma pneumoniae clinical isolates were sequenced and compared to previously reported p1 gene sequences. An AGT trinucleotide variable-number tandem repeat was identified that ranged in copy number from 5 to 14 among the isolates. In addition, a novel p1 gene variant named 2c was identified in 6 of the isolates.
Collapse
|
44
|
Victoria FC, da Maia LC, de Oliveira AC. In silico comparative analysis of SSR markers in plants. BMC PLANT BIOLOGY 2011; 11:15. [PMID: 21247422 PMCID: PMC3037304 DOI: 10.1186/1471-2229-11-15] [Citation(s) in RCA: 48] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/10/2010] [Accepted: 01/19/2011] [Indexed: 05/08/2023]
Abstract
BACKGROUND The adverse environmental conditions impose extreme limitation to growth and plant development, restricting the genetic potential and reflecting on plant yield losses. The progress obtained by classic plant breeding methods aiming at increasing abiotic stress tolerances have not been enough to cope with increasing food demands. New target genes need to be identified to reach this goal, which requires extensive studies of the related biological mechanisms. Comparative analyses in ancestral plant groups can help to elucidate yet unclear biological processes. RESULTS In this study, we surveyed the occurrence patterns of expressed sequence tag-derived microsatellite markers for model plants. A total of 13,133 SSR markers were discovered using the SSRLocator software in non-redundant EST databases made for all eleven species chosen for this study. The dimer motifs are more frequent in lower plant species, such as green algae and mosses, and the trimer motifs are more frequent for the majority of higher plant groups, such as monocots and dicots. With this in silico study we confirm several microsatellite plant survey results made with available bioinformatics tools. CONCLUSIONS The comparative studies of EST-SSR markers among all plant lineages is well suited for plant evolution studies as well as for future studies of transferability of molecular markers.
Collapse
Affiliation(s)
- Filipe C Victoria
- Plant Genomics and Breeding Center, Faculdade de Agronomia Eliseu Maciel, Universidade Federal de Pelotas, RS, Brasil
- Graduate Program in Biotechnology, Universidade Federal de Pelotas, RS, Brasil
| | - Luciano C da Maia
- Plant Genomics and Breeding Center, Faculdade de Agronomia Eliseu Maciel, Universidade Federal de Pelotas, RS, Brasil
| | - Antonio Costa de Oliveira
- Plant Genomics and Breeding Center, Faculdade de Agronomia Eliseu Maciel, Universidade Federal de Pelotas, RS, Brasil
| |
Collapse
|
45
|
Tian X, Strassmann JE, Queller DC. Genome nucleotide composition shapes variation in simple sequence repeats. Mol Biol Evol 2010; 28:899-909. [PMID: 20943830 DOI: 10.1093/molbev/msq266] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022] Open
Abstract
Simple sequence repeats (SSRs) or microsatellites are a common component of genomes but vary greatly across species in their abundance. We tested the hypothesis that this variation is due in part to AT/GC content of genomes, with genomes biased toward either high AT or high CG generating more short random repeats that are long enough to enhance expansion through slippage during replication. To test this hypothesis, we identified repeats with perfect tandem iterations of 1-6 bp from 25 protists with complete or near-complete genome sequences. As expected, the density and the frequency are highly related to genome AT content, with excellent fits to quadratic regressions with minima near a 50% AT content and rising toward both extremes. Within species, the same trends hold, except the limited variation in AT content within each species places each mainly on the descending (GC rich), middle, or ascending (AT rich) part of the curve. The base usages of repeat motifs are also significantly correlated with genome nucleotide compositions: Percentages of AT-rich motifs rise with the increase of genome AT content but vice versa for GC-rich subgroups. Amino acid homopolymer repeats also show the expected quadratic relationship, with higher abundance in species with AT content biased in either direction. Our results show that genome nucleotide composition explains up to half of the variance in the abundance and motif constitution of SSRs.
Collapse
Affiliation(s)
- Xiangjun Tian
- Department of Ecology and Evolutionary Biology, Rice University, USA
| | | | | |
Collapse
|
46
|
Comparative analysis of sequence periodicity among prokaryotic genomes points to differences in nucleoid structure and a relationship to gene expression. J Bacteriol 2010; 192:3763-72. [PMID: 20494989 DOI: 10.1128/jb.00149-10] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
Regular spacing of short runs of A or T nucleotides in DNA sequences with a period close to the helical period of the DNA double helix has been associated with intrinsic DNA bending and nucleosome positioning in eukaryotes. Analogous periodic signals were also observed in prokaryotic genomes. While the exact role of this periodicity in prokaryotes is not known, it has been proposed to facilitate the DNA packaging in the prokaryotic nucleoid and/or to promote negative or positive supercoiling. We developed a methodology for assessments of intragenomic heterogeneity of these periodic patterns and applied it in analysis of 1,025 prokaryotic chromosomes. This technique allows more detailed analysis of sequence periodicity than previous methods where sequence periodicity was assessed in an integral form across the whole chromosome. We found that most genomes have the periodic signal confined to several chromosomal segments while most of the chromosome lacks a strong sequence periodicity. Moreover, there are significant differences among different prokaryotes in both the intensity and persistency of sequence periodicity related to DNA curvature. We proffer that the prokaryotic nucleoid consists of relatively rigid sections stabilized by short intrinsically bent DNA segments and characterized by locally strong periodic patterns alternating with regions featuring a weak periodic signal, which presumably permits higher structural flexibility. This model applies to most bacteria and archaea. In genomes with an exceptionally persistent periodic signal, highly expressed genes tend to concentrate in aperiodic sections, suggesting that structural heterogeneity of the nucleoid is related to local differences in transcriptional activity.
Collapse
|
47
|
Mrázek J. Finding sequence motifs in prokaryotic genomes--a brief practical guide for a microbiologist. Brief Bioinform 2009; 10:525-36. [PMID: 19553402 DOI: 10.1093/bib/bbp032] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Finding significant nucleotide sequence motifs in prokaryotic genomes can be divided into three types of tasks: (1) supervised motif finding, where a sample of motif sequences is used to find other similar sequences in genomes; (2) unsupervised motif finding, which typically relates to the task of finding regulatory motifs and protein binding sites and (3) exploratory motif finding, which aims to identify potential functionally significant sequence motifs as those that are unusual in some statistical sense. This article provides a conceptual overview for each type of task, a brief description of basic algorithms used in their solution, and a review of selected relevant software available online.
Collapse
Affiliation(s)
- Jan Mrázek
- Department of Microbiology and Institute of Bioinformatics, University of Georgia, Athens, GA 30602-2605, USA.
| |
Collapse
|
48
|
Mrazek J. Phylogenetic Signals in DNA Composition: Limitations and Prospects. Mol Biol Evol 2009; 26:1163-9. [DOI: 10.1093/molbev/msp032] [Citation(s) in RCA: 46] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2023] Open
|
49
|
Wilton J, Jenkins C, Cordwell SJ, Falconer L, Minion FC, Oneal DC, Djordjevic MA, Connolly A, Barchia I, Walker MJ, Djordjevic SP. Mhp493 (P216) is a proteolytically processed, cilium and heparin binding protein of Mycoplasma hyopneumoniae. Mol Microbiol 2008; 71:566-82. [PMID: 19040640 DOI: 10.1111/j.1365-2958.2008.06546.x] [Citation(s) in RCA: 56] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
Abstract
Mycoplasma hyopneumoniae induces respiratory disease in swine by colonizing cilia causing ciliostasis, cilial loss and epithelial cell death. Heparin binds to M. hyopneumoniae cells in a dose-dependent manner and blocks its ability to adhere to porcine cilia. We show here that Mhp493 (P216), a paralogue of the cilium adhesin P97 (Mhp183), is cleaved between amino acids 1040 and 1089 generating surface-accessible, heparin-binding proteins P120 and P85. Antiphosphoserine antibodies recognized P85 in 2-D immunoblotting studies and TiO(2) chromatography of trypsin digests of P85 isolated a single peptide with an m/z of 917.3. A phosphoserine residue in the tryptic peptide (90)VSELpSFR(96) (position 94 in P85) was identified by MALDI-MS/MS. Polyhistidine fusion proteins (F1(P216), F2(P216), F3(P216)) spanning Mhp493 bound heparin with biologically significant Kd values, and heparin, fucoidan and mucin inhibited this interaction. Latex beads coated with F1(P216), F2(P216) and F3(P216) adhered to and entered porcine kidney epithelial-like (PK15) cell monolayers. Microtitre plate-based assays showed that sequences within P120 and P85 bind to porcine cilia and are recognized by serum antibodies elicited during infection by M. hyopneumoniae. Mhp493 contributes significantly to the surface architecture of M. hyopneumoniae and is the first cilium adhesin to be described that lacks an R1 cilium-binding domain.
Collapse
Affiliation(s)
- Jody Wilton
- NSW Department of Primary Industries, Elizabeth Macarthur Agricultural Institute, Camden, NSW 2570, Australia
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
50
|
Guo X, Mrázek J. Long simple sequence repeats in host-adapted pathogens localize near genes encoding antigens, housekeeping genes, and pseudogenes. J Mol Evol 2008; 67:497-509. [PMID: 18925380 DOI: 10.1007/s00239-008-9166-5] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2008] [Revised: 09/03/2008] [Accepted: 09/03/2008] [Indexed: 10/21/2022]
Abstract
Simple sequence repeats (SSRs) in DNA sequences are tandem iterations of a single nucleotide or a short oligonucleotide. SSRs are subject to slipped-strand mutations and a common source of phase variation in bacteria and antigenic variation in pathogens. Significantly long SSRs are generally rare in prokaryotic genomes, and long SSRs composed of iterations of mono-, di-, tri-, and tetranucleotides are mostly restricted to host-adapted pathogens. We present new results concerning associations between long SSRs and genes related to different cellular functions in genomes of host-adapted pathogens. We found that in the majority of the analyzed genomes, at least some of the genes associated with SSRs encode potential antigens, which is expected if the primary function of SSRs is their contribution to antigenic variation. However, we also found a number of long SSRs associated with housekeeping genes, including rRNA and tRNA genes, genes encoding ribosomal proteins, amino acyl-tRNA synthetases, chaperones, and important metabolic enzymes. Many of these genes are probably essential and it is unlikely that they are phase-variable. Few statistically significant associations between SSRs and gene functional classifications were detected, suggesting that most long SSRs are not related to a particular cellular function or process. Long SSRs in Mycobacterium leprae are mostly associated with pseudogenes and may be contributing to gene loss following the adaptation to an obligate pathogenic lifestyle. We speculate that LSSRs may have played a similar role in genome reduction of other host-adapted pathogens.
Collapse
Affiliation(s)
- Xiangxue Guo
- Department of Microbiology, University of Georgia, Athens, GA 30602-2605, USA
| | | |
Collapse
|