201
|
Lebedinsky AV, Chernyh NA, Bonch-Osmolovskaya EA. Phylogenetic systematics of microorganisms inhabiting thermal environments. BIOCHEMISTRY (MOSCOW) 2007; 72:1299-312. [DOI: 10.1134/s0006297907120048] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
202
|
Kunin V, Sorek R, Hugenholtz P. Evolutionary conservation of sequence and secondary structures in CRISPR repeats. Genome Biol 2007; 8:R61. [PMID: 17442114 PMCID: PMC1896005 DOI: 10.1186/gb-2007-8-4-r61] [Citation(s) in RCA: 309] [Impact Index Per Article: 18.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2006] [Revised: 01/24/2007] [Accepted: 04/18/2007] [Indexed: 11/25/2022] Open
Abstract
The categorisation and structural analysis of Clustered Regularly Interspaced Short Palindromic Repeats (CRISPRs) sequences from 195 microbial genomes show that repeats from diverse organisms can be grouped based on sequence similarity, and that some groups have pronounced secondary structures with compensatory base changes. Background Clustered regularly interspaced short palindromic repeats (CRISPRs) are a novel class of direct repeats, separated by unique spacer sequences of similar length, that are present in approximately 40% of bacterial and most archaeal genomes analyzed to date. More than 40 gene families, called CRISPR-associated sequences (CASs), appear in conjunction with these repeats and are thought to be involved in the propagation and functioning of CRISPRs. It has been recently shown that CRISPR provides acquired resistance against viruses in prokaryotes. Results Here we analyze CRISPR repeats identified in 195 microbial genomes and show that they can be organized into multiple clusters based on sequence similarity. Some of the clusters present stable, highly conserved RNA secondary structures, while others lack detectable structures. Stable secondary structures exhibit multiple compensatory base changes in the stem region, indicating evolutionary and functional conservation. Conclusion We show that the repeat-based classification corresponds to, and expands upon, a previously reported CAS gene-based classification, including specific relationships between CRISPR and CAS subtypes.
Collapse
Affiliation(s)
- Victor Kunin
- DOE Joint Genome Institute, 2800 Mitchell Drive, Walnut Creek, CA 94598, USA
| | - Rotem Sorek
- DOE Joint Genome Institute, 2800 Mitchell Drive, Walnut Creek, CA 94598, USA
| | - Philip Hugenholtz
- DOE Joint Genome Institute, 2800 Mitchell Drive, Walnut Creek, CA 94598, USA
| |
Collapse
|
203
|
Lin Z, Nei M, Ma H. The origins and early evolution of DNA mismatch repair genes--multiple horizontal gene transfers and co-evolution. Nucleic Acids Res 2007; 35:7591-603. [PMID: 17965091 PMCID: PMC2190696 DOI: 10.1093/nar/gkm921] [Citation(s) in RCA: 80] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Abstract
To understand the evolutionary process of the DNA mismatch repair system, we conducted systematic phylogenetic analysis of its key components, the bacterial MutS and MutL genes and their eukaryotic homologs. Based on genome-wide homolog searches, we identified three new MutS subfamilies (MutS3-5) in addition to the previously studied MutS1 and MutS2 subfamilies. Detailed evolutionary analysis strongly suggests that frequent ancient horizontal gene transfer (HGT) occurred with both MutS and MutL genes from bacteria to eukaryotes and/or archaea. Our results further imply that the origins of mismatch repair system in eukaryotes and archaea are largely attributed to ancient HGT from bacteria instead of vertical evolution. Specifically, the eukaryotic MutS and MutL homologs likely originated from endosymbiotic ancestors of mitochondria or chloroplasts, indicating that not only archaea, but also bacteria are important sources of eukaryotic DNA metabolic genes. The archaeal MutS1 and MutL homologs were also acquired from bacteria simultaneously through HGT. Moreover, the distribution and evolution profiles of the MutS1 and MutL genes suggest that they have undergone long-term coevolution. Our work presents an overall portrait of the evolution of these important genes in DNA metabolism and also provides further understanding about the early evolution of cellular organisms.
Collapse
Affiliation(s)
- Zhenguo Lin
- Department of Biology and Institute of Molecular Evolutionary Genetics, Pennsylvania State University, University Park, PA 16802, USA
| | | | | |
Collapse
|
204
|
Fröls S, Gordon PMK, Panlilio MA, Duggin IG, Bell SD, Sensen CW, Schleper C. Response of the hyperthermophilic archaeon Sulfolobus solfataricus to UV damage. J Bacteriol 2007; 189:8708-18. [PMID: 17905990 PMCID: PMC2168930 DOI: 10.1128/jb.01016-07] [Citation(s) in RCA: 110] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
In order to characterize the genome-wide transcriptional response of the hyperthermophilic, aerobic crenarchaeote Sulfolobus solfataricus to UV damage, we used high-density DNA microarrays which covered 3,368 genetic features encoded on the host genome, as well as the genes of several extrachromosomal genetic elements. While no significant up-regulation of genes potentially involved in direct DNA damage reversal was observed, a specific transcriptional UV response involving 55 genes could be dissected. Although flow cytometry showed only modest perturbation of the cell cycle, strong modulation of the transcript levels of the Cdc6 replication initiator genes was observed. Up-regulation of an operon encoding Mre11 and Rad50 homologs pointed to induction of recombinational repair. Consistent with this, DNA double-strand breaks were observed between 2 and 8 h after UV treatment, possibly resulting from replication fork collapse at damaged DNA sites. The strong transcriptional induction of genes which potentially encode functions for pilus formation suggested that conjugational activity might lead to enhanced exchange of genetic material. In support of this, a statistical microscopic analysis demonstrated that large cell aggregates formed upon UV exposure. Together, this provided supporting evidence to a link between recombinational repair and conjugation events.
Collapse
Affiliation(s)
- Sabrina Fröls
- Center of Geobiology, Dept. Biology, Jahnebakken 5, N-5020 Bergen, Norway
| | | | | | | | | | | | | |
Collapse
|
205
|
Brügger K, Chen L, Stark M, Zibat A, Redder P, Ruepp A, Awayez M, She Q, Garrett RA, Klenk HP. The genome of Hyperthermus butylicus: a sulfur-reducing, peptide fermenting, neutrophilic Crenarchaeote growing up to 108 degrees C. ARCHAEA-AN INTERNATIONAL MICROBIOLOGICAL JOURNAL 2007; 2:127-35. [PMID: 17350933 PMCID: PMC2686385 DOI: 10.1155/2007/745987] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Abstract
Hyperthermus butylicus, a hyperthermophilic neutrophile and anaerobe, is a member of the archaeal kingdom Crenarchaeota. Its genome consists of a single circular chromosome of 1,667,163 bp with a 53.7% G+C content. A total of 1672 genes were annotated, of which 1602 are protein-coding, and up to a third are specific to H. butylicus. In contrast to some other crenarchaeal genomes, a high level of GUG and UUG start codons are predicted. Two cdc6 genes are present, but neither could be linked unambiguously to an origin of replication. Many of the predicted metabolic gene products are associated with the fermentation of peptide mixtures including several peptidases with diverse specificities, and there are many encoded transporters. Most of the sulfur-reducing enzymes, hydrogenases and electron-transfer proteins were identified which are associated with energy production by reducing sulfur to H(2)S. Two large clusters of regularly interspaced repeats (CRISPRs) are present, one of which is associated with a crenarchaeal-type cas gene superoperon; none of the spacer sequences yielded good sequence matches with known archaeal chromosomal elements. The genome carries no detectable transposable or integrated elements, no inteins, and introns are exclusive to tRNA genes. This suggests that the genome structure is quite stable, possibly reflecting a constant, and relatively uncompetitive, natural environment.
Collapse
Affiliation(s)
- Kim Brügger
- Danish Archaea Centre, Institute of Molecular Biology, Copenhagen University, Sølvgade 83H, 1307 Copenhagen K, Denmark
- These authors contributed equally to the project
| | - Lanming Chen
- Danish Archaea Centre, Institute of Molecular Biology, Copenhagen University, Sølvgade 83H, 1307 Copenhagen K, Denmark
- These authors contributed equally to the project
| | - Markus Stark
- e.gene Biotechnologie GmbH, Poeckinger Fussweg 7a, 82340 Feldafing, Germany
- Formerly EPIDAUROS Biotechnologie AG, Genes and Genome Analysis Team
| | - Arne Zibat
- Formerly EPIDAUROS Biotechnologie AG, Genes and Genome Analysis Team
| | - Peter Redder
- Danish Archaea Centre, Institute of Molecular Biology, Copenhagen University, Sølvgade 83H, 1307 Copenhagen K, Denmark
| | - Andreas Ruepp
- Formerly EPIDAUROS Biotechnologie AG, Genes and Genome Analysis Team
- Present address: Institut für Bioinformatik, GSF-Forschungszentrum für Umwelt und Gesundheit, Ingolstädter Landstrasse 1, 85764 Neuherberg, Germany
| | - Mariana Awayez
- Danish Archaea Centre, Institute of Molecular Biology, Copenhagen University, Sølvgade 83H, 1307 Copenhagen K, Denmark
| | - Qunxin She
- Danish Archaea Centre, Institute of Molecular Biology, Copenhagen University, Sølvgade 83H, 1307 Copenhagen K, Denmark
| | - Roger A. Garrett
- Danish Archaea Centre, Institute of Molecular Biology, Copenhagen University, Sølvgade 83H, 1307 Copenhagen K, Denmark
- Editing author
| | - Hans-Peter Klenk
- e.gene Biotechnologie GmbH, Poeckinger Fussweg 7a, 82340 Feldafing, Germany
- Formerly EPIDAUROS Biotechnologie AG, Genes and Genome Analysis Team
- Corresponding author ()
| |
Collapse
|
206
|
Ruan L, Xu X. Sequence analysis and characterizations of two novel plasmids isolated from Thermus sp. 4C. Plasmid 2007; 58:84-7. [PMID: 17521723 DOI: 10.1016/j.plasmid.2007.04.001] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2007] [Revised: 03/30/2007] [Accepted: 04/02/2007] [Indexed: 11/22/2022]
Abstract
Two novel plasmids, named pS4C and pL4C, were isolated from the thermophilic bacterium Thermus sp. 4C. The pS4C with a length of 5015bp and 58.25% of G+C content, contains 9 putative open reading frames (ORFs). The larger plasmid, pL4C, consisting of 21,248bp, has a G+C content of 68.60% and 34 putative ORFs. Both plasmids encode their own replication protein. The ORF 22 of pL4C and the ORF 4 of pS4C encode proteins with high sequence similarities to integrase (97%) and transposase (97%), respectively, which are both involved in DNA rearrangement and exchange. Furthermore, sequence analysis of pL4C also showed that several plasmid-encoded genes may be involved in DNA modification and repair, such as DNA G:T-mismatch repair endonuclease and micrococcal nuclease-like protein. These proteins may be involved in raising the repair efficiency and other minor editing needs. Interestingly, the elimination of plasmids significantly lowered the growth temperature of Thermus sp. 4C. Few reports dealing with the DNA repair enzymes on the plasmid from Thermus strains were published so far.
Collapse
Affiliation(s)
- Lingwei Ruan
- School of Life Sciences, Xiamen University, Xiamen, PR China
| | | |
Collapse
|
207
|
Barrangou R, Fremaux C, Deveau H, Richards M, Boyaval P, Moineau S, Romero DA, Horvath P. CRISPR provides acquired resistance against viruses in prokaryotes. Science 2007; 315:1709-12. [PMID: 17379808 DOI: 10.1126/science.1138140] [Citation(s) in RCA: 3917] [Impact Index Per Article: 230.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Clustered regularly interspaced short palindromic repeats (CRISPR) are a distinctive feature of the genomes of most Bacteria and Archaea and are thought to be involved in resistance to bacteriophages. We found that, after viral challenge, bacteria integrated new spacers derived from phage genomic sequences. Removal or addition of particular spacers modified the phage-resistance phenotype of the cell. Thus, CRISPR, together with associated cas genes, provided resistance against phages, and resistance specificity is determined by spacer-phage sequence similarity.
Collapse
|
208
|
Shinkai A, Kira S, Nakagawa N, Kashihara A, Kuramitsu S, Yokoyama S. Transcription activation mediated by a cyclic AMP receptor protein from Thermus thermophilus HB8. J Bacteriol 2007; 189:3891-901. [PMID: 17369302 PMCID: PMC1913326 DOI: 10.1128/jb.01739-06] [Citation(s) in RCA: 65] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023] Open
Abstract
The extremely thermophilic bacterium Thermus thermophilus HB8, which belongs to the phylum Deinococcus-Thermus, has an open reading frame encoding a protein belonging to the cyclic AMP (cAMP) receptor protein (CRP) family present in many bacteria. The protein named T. thermophilus CRP is highly homologous to the CRP family proteins from the phyla Firmicutes, Actinobacteria, and Cyanobacteria, and it forms a homodimer and interacts with cAMP. CRP mRNA and intracellular cAMP were detected in this strain, which did not drastically fluctuate during cultivation in a rich medium. The expression of several genes was altered upon disruption of the T. thermophilus CRP gene. We found six CRP-cAMP-dependent promoters in in vitro transcription assays involving DNA fragments containing the upstream regions of the genes exhibiting decreased expression in the CRP disruptant, indicating that the CRP is a transcriptional activator. The consensus T. thermophilus CRP-binding site predicted upon nucleotide sequence alignment is 5'-(C/T)NNG(G/T)(G/T)C(A/C)N(A/T)NNTCACAN(G/C)(G/C)-3'. This sequence is unique compared with the known consensus binding sequences of CRP family proteins. A putative -10 hexamer sequence resides at 18 to 19 bp downstream of the predicted T. thermophilus CRP-binding site. The CRP-regulated genes found in this study comprise clustered regularly interspaced short palindromic repeat (CRISPR)-associated (cas) ones, and the genes of a putative transcriptional regulator, a protein containing the exonuclease III-like domain of DNA polymerase, a GCN5-related acetyltransferase homolog, and T. thermophilus-specific proteins of unknown function. These results suggest a role for cAMP signal transduction in T. thermophilus and imply the T. thermophilus CRP is a cAMP-responsive regulator.
Collapse
Affiliation(s)
- Akeo Shinkai
- RIKEN SPring-8 Center, Harima Institute, 1-1-1 Kouto, Sayo, Hyogo 679-5148, Japan.
| | | | | | | | | | | |
Collapse
|
209
|
Conners SB, Mongodin EF, Johnson MR, Montero CI, Nelson KE, Kelly RM. Microbial biochemistry, physiology, and biotechnology of hyperthermophilic Thermotoga species. FEMS Microbiol Rev 2006; 30:872-905. [PMID: 17064285 DOI: 10.1111/j.1574-6976.2006.00039.x] [Citation(s) in RCA: 90] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/16/2023] Open
Abstract
High-throughput sequencing of microbial genomes has allowed the application of functional genomics methods to species lacking well-developed genetic systems. For the model hyperthermophile Thermotoga maritima, microarrays have been used in comparative genomic hybridization studies to investigate diversity among Thermotoga species. Transcriptional data have assisted in prediction of pathways for carbohydrate utilization, iron-sulfur cluster synthesis and repair, expolysaccharide formation, and quorum sensing. Structural genomics efforts aimed at the T. maritima proteome have yielded hundreds of high-resolution datasets and predicted functions for uncharacterized proteins. The information gained from genomics studies will be particularly useful for developing new biotechnology applications for T. maritima enzymes.
Collapse
Affiliation(s)
- Shannon B Conners
- Department of Chemical and Biomolecular Engineering, North Carolina State University, Raleigh, NC 27695-7905, USA
| | | | | | | | | | | |
Collapse
|
210
|
Mattison K, Wilbur JS, So M, Brennan RG. Structure of FitAB from Neisseria gonorrhoeae Bound to DNA Reveals a Tetramer of Toxin-Antitoxin Heterodimers Containing Pin Domains and Ribbon-Helix-Helix Motifs. J Biol Chem 2006; 281:37942-51. [PMID: 16982615 DOI: 10.1074/jbc.m605198200] [Citation(s) in RCA: 111] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022] Open
Abstract
Neisseria gonorrhoeae is a sexually transmitted pathogen that initiates infections in humans by adhering to the mucosal epithelium of the urogenital tract. The bacterium then enters the apical region of the cell and traffics across the cell to exit into the subepithelial matrix. Mutations in the fast intracellular trafficking (fitAB) locus cause the bacteria to transit a polarized epithelial monolayer more quickly than the wild-type parent and to replicate within cells at an accelerated rate. Here, we describe the crystal structure of the toxin-antitoxin heterodimer, FitAB, bound to a high affinity 36-bp DNA fragment from the fitAB promoter. FitA, the antitoxin, binds DNA through its ribbon-helix-helix motif and is tethered to FitB, the toxin, to form a heterodimer by the insertion of a four turn alpha-helix into an extensive FitB hydrophobic pocket. FitB is composed of a PIN (PilT N terminus) domain, with a central, twisted, 5-stranded parallel beta-sheet that is open on one side and flanked by five alpha-helices. FitB in the context of the FitAB complex does not display nuclease activity against tested PIN substrates. The FitAB complex points to the mechanism by which antitoxins with RHH motifs can block the activity of toxins with PIN domains. Interactions between two FitB molecules result in the formation of a tetramer of FitAB heterodimers, which binds to the 36-bp DNA fragment and provides an explanation for how FitB enhances the DNA binding affinity of FitA.
Collapse
Affiliation(s)
- Kirsten Mattison
- Department of Biochemistry, Oregon Health and Science University, Portland, Oregon 97239, USA
| | | | | | | |
Collapse
|
211
|
Kang J, Blaser MJ. Bacterial populations as perfect gases: genomic integrity and diversification tensions in Helicobacter pylori. Nat Rev Microbiol 2006; 4:826-36. [PMID: 17041630 DOI: 10.1038/nrmicro1528] [Citation(s) in RCA: 115] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]
Abstract
Microorganisms that persist in single hosts face particular challenges. Helicobacter pylori, an obligate bacterial parasite of the human stomach, has evolved a lifestyle that features interstrain competition and intraspecies cooperation, both of which involve horizontal gene transfer. Microbial species must maintain genomic integrity, yet H. pylori has evolved a complex nonlinear system for diversification that exists in dynamic tension with the mechanisms for ensuring fidelity. Here, we review these tensions and propose that they create a dynamic pool of genetic variants that is sufficiently genetically diverse to allow H. pylori to occupy all of the potential niches in the stomach.
Collapse
Affiliation(s)
- Josephine Kang
- Departments of Medicine and Microbiology, New York University School of Medicine, New York, New York, 10016 USA
| | | |
Collapse
|
212
|
Schwarzenlander C, Averhoff B. Characterization of DNA transport in the thermophilic bacterium Thermus thermophilus HB27. FEBS J 2006; 273:4210-8. [PMID: 16939619 DOI: 10.1111/j.1742-4658.2006.05416.x] [Citation(s) in RCA: 42] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]
Abstract
Horizontal gene transfer has been a major force for genome plasticity over evolutionary history, and is largely responsible for fitness-enhancing traits, including antibiotic resistance and virulence factors. In particular, for adaptation of prokaryotes to extreme environments, lateral gene transfer seems to have played a crucial role. Recently, by performing a genome-wide mutagenesis approach with Thermus thermophilus HB27, we identified the first genes in a thermophilic bacterium for the uptake of free DNA, a process called natural transformation. Here, we present the first data on the biochemistry and bioenergetics of the DNA transport process in this thermophile. We report that linear and circular plasmid DNA are equally well taken up with a high maximal velocity of 1.5 microg DNA.(mg protein)(-1).min(-1), demonstrating an extremely efficient binding and uptake rate of 40 kb.s(-1).cell(-1). Uncouplers and ATPase inhibitors immediately inhibited DNA uptake, providing clear evidence that DNA translocation in HB27 is an energy-dependent process. DNA uptake studies with genomic DNA of Bacteria, Archaea and Eukarya revealed that Thermus thermophilus HB27 takes up DNA from members of all three domains of life. We propose that the extraordinary broad substrate specificity of the highly efficient Thermus thermophilus HB27 DNA uptake system may contribute significantly to thermoadaptation of Thermus thermophilus HB27 and to interdomain DNA transfer in hot environments.
Collapse
Affiliation(s)
- Cornelia Schwarzenlander
- Molecular Microbiology & Bioenergetics, Institute of Molecular Biosciences, Johann Wolfgang Goethe University Frankfurt/Main, Frankfurt, Germany
| | | |
Collapse
|
213
|
Lillestøl R, Redder P, Garrett RA, Brügger K. A putative viral defence mechanism in archaeal cells. ARCHAEA-AN INTERNATIONAL MICROBIOLOGICAL JOURNAL 2006; 2:59-72. [PMID: 16877322 PMCID: PMC2685585 DOI: 10.1155/2006/542818] [Citation(s) in RCA: 216] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
Clusters of regularly spaced direct repeats, separated by unconserved spacer sequences, are ubiquitous in archaeal chromosomes and occur in some plasmids. Some clusters constitute around 1% of chromosomal DNA. Similarly structured clusters, generally smaller, also occur in some bacterial chromosomes. Although early studies implicated these clusters in segregation/partition functions, recent evidence suggests that the spacer sequences derive from extrachromosomal elements, and, primarily, viruses. This has led to the proposal that the clusters provide a defence against viral propagation in cells, and that both the mode of inhibition of viral propagation and the mechanism of adding spacer-repeat units to clusters, are dependent on RNAs transcribed from the clusters. Moreover, the putative inhibitory apparatus (piRNA-based) may be evolutionarily related to the interference RNA systems (siRNA and miRNA), which are common in eukarya. Here, we analyze all the current data on archaeal repeat clusters and provide some new insights into their diverse structures, transcriptional properties and mode of structural development. The results are consistent with larger cluster transcripts being processed at the centers of the repeat sequences and being further trimmed by exonucleases to yield a dominant, intracellular RNA species, which corresponds approximately to the size of a spacer. Furthermore, analysis of the extensive clusters of Sulfolobus solfataricus strains P1 and P2B provides support for the presence of a flanking sequence adjoining a cluster being a prerequisite for the incorporation of new spacer-repeat units, which occurs between the flanking sequence and the cluster. An archaeal database summarizing the data will be maintained at http://dac.molbio.ku.dk/dbs/SRSR/.
Collapse
Affiliation(s)
- Reidun Lillestøl
- Institute of Molecular Biology, University of Copenhagen, Sølvgade 83H, DK1307 Copenhagen K, Denmark
| | - Peter Redder
- Institute of Molecular Biology, University of Copenhagen, Sølvgade 83H, DK1307 Copenhagen K, Denmark
| | - Roger A. Garrett
- Institute of Molecular Biology, University of Copenhagen, Sølvgade 83H, DK1307 Copenhagen K, Denmark
- Corresponding author ()
| | - Kim Brügger
- Institute of Molecular Biology, University of Copenhagen, Sølvgade 83H, DK1307 Copenhagen K, Denmark
| |
Collapse
|
214
|
Abstract
UvrD, a highly conserved helicase involved in mismatch repair, nucleotide excision repair (NER), and recombinational repair, plays a critical role in maintaining genomic stability and facilitating DNA lesion repair in many prokaryotic species. In this report, we focus on the UvrD homolog in Helicobacter pylori, a genetically diverse organism that lacks many known DNA repair proteins, including those involved in mismatch repair and recombinational repair, and that is noted for high levels of inter- and intragenomic recombination and mutation. H. pylori contains numerous DNA repeats in its compact genome and inhabits an environment rich in DNA-damaging agents that can lead to increased rearrangements between such repeats. We find that H. pylori UvrD functions to repair DNA damage and limit homologous recombination and DNA damage-induced genomic rearrangements between DNA repeats. Our results suggest that UvrD and other NER pathway proteins play a prominent role in maintaining genome integrity, especially after DNA damage; thus, NER may be especially critical in organisms such as H. pylori that face high-level genotoxic stress in vivo.
Collapse
Affiliation(s)
- Josephine Kang
- Department of Medicine, New York University School of Medicine, 550 First Avenue, New York, NY 10016, USA.
| | | |
Collapse
|
215
|
Williams E, Lowe TM, Savas J, DiRuggiero J. Microarray analysis of the hyperthermophilic archaeon Pyrococcus furiosus exposed to gamma irradiation. Extremophiles 2006; 11:19-29. [PMID: 16896524 DOI: 10.1007/s00792-006-0002-9] [Citation(s) in RCA: 62] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2006] [Accepted: 05/16/2006] [Indexed: 12/15/2022]
Abstract
The remarkable survival of the hyperthermophilic archaeon Pyrococcus furiosus to ionizing radiation was previously demonstrated. Using a time course study and whole-genome microarray analyses of mRNA transcript levels, the genes and regulatory pathways involved in the repair of lesions produced by ionizing irradiation (oxidative damage and DNA strand breaks) in P. furiosus were investigated. Data analyses showed that radA, encoding the archaeal homolog of the RecA/Rad51 recombinase, was moderately up regulated by irradiation and that a putative DNA-repair gene cluster was specifically induced by exposure to ionizing radiation. This novel repair system appears to be unique to thermophilic archaea and bacteria and is suspected to be involved in translesion synthesis. Genes that encode for a putative Dps-like iron-chelating protein and two membrane-bound oxidoreductases were differentially expressed following gamma irradiation, potentially in response to oxidative stress. Surprisingly, the many systems involved in oxygen detoxification and redox homeostasis appeared to be constitutively expressed. Finally, we identified several transcriptional regulators and protein kinases highly regulated in response to gamma irradiation.
Collapse
Affiliation(s)
- Ernest Williams
- Department of Cell Biology and Molecular Genetics, University of Maryland, College Park, MD 20742, USA
| | | | | | | |
Collapse
|
216
|
Brüggemann H, Chen C. Comparative genomics of Thermus thermophilus: Plasticity of the megaplasmid and its contribution to a thermophilic lifestyle. J Biotechnol 2006; 124:654-61. [PMID: 16713647 DOI: 10.1016/j.jbiotec.2006.03.043] [Citation(s) in RCA: 57] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2005] [Revised: 02/01/2006] [Accepted: 03/29/2006] [Indexed: 11/24/2022]
Abstract
The bacterium Thermus thermophilus grows at temperatures up to 85 degrees C and is equipped with thermostable enzymes of biotechnological interest. The recently decoded genomes of two strains of T. thermophilus, HB27 and HB8, each composed of a chromosome and a megaplasmid, must certainly encode specific strategies to encounter the thermophile challenge. Here, a genome comparison was undertaken to distinguish common functions from the flexible gene pool, which gave some clues about the biological traits involved in a thermophile lifestyle. The chromosomes were highly conserved, with about 100 strain-specific genes probably reflecting adaptations to the corresponding biological niche, such as metabolic specialities and distinct cell surface determinates including type IV pili. The two megaplasmids showed an elevated plasticity. Upon comparison and re-examination of their gene content, both megaplasmids seem to be implicated in assisting thermophilic growth: a large portion of their genes are apparently involved in DNA repair functions. About 30 plasmid-encoded genes exhibit sequence and domain composition similarity to a predicted DNA repair system specific for thermophilic Archaea and bacteria. Moreover, the plasmid-encoded carotenoid biosynthesis gene cluster is interlocked with genes involved in UV-induced DNA damage repair. This illustrates the importance of DNA protection and repair at elevated growth temperatures.
Collapse
Affiliation(s)
- Holger Brüggemann
- Institut Pasteur, Department of Genomes and Genetics, Unité Genomics of Microbial Pathogens, 28 Rue du Docteur Roux, 75724 Paris Cedex 15, France.
| | | |
Collapse
|
217
|
Warner DF, Mizrahi V. Tuberculosis chemotherapy: the influence of bacillary stress and damage response pathways on drug efficacy. Clin Microbiol Rev 2006; 19:558-70. [PMID: 16847086 PMCID: PMC1539104 DOI: 10.1128/cmr.00060-05] [Citation(s) in RCA: 114] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
The global tuberculosis (TB) control effort is focused on interrupting transmission of the causative agent, Mycobacterium tuberculosis, through chemotherapeutic intervention in active infectious disease. The insufficiency of this approach is manifest in the inexorable annual increase in TB infection and mortality rates and the emergence of multidrug-resistant isolates. Critically, the limited efficacy of the current frontline anti-TB drug combination suggests that heterogeneity of host and bacillary physiologies might impair drug activity. This review explores the possibility that strategies enabling adaptation of M. tuberculosis to hostile in vivo conditions might contribute to the subversion of anti-TB chemotherapy. In particular, evidence that infecting bacilli are exposed to environmental and host immune-mediated DNA-damaging insults suggests a role for error-prone DNA repair synthesis in the generation of chromosomally encoded antibiotic resistance mutations. The failure of frontline anti-TB drugs to sterilize a population of susceptible bacilli is independent of genetic resistance, however, and instead implies the operation of alternative tolerance mechanisms. Specifically, it is proposed that the emergence of persister subpopulations might depend on the switch to an altered metabolic state mediated by the stringent response alarmone, (p)ppGpp, possibly involving some or all of the many toxin-antitoxin modules identified in the M. tuberculosis genome.
Collapse
Affiliation(s)
- Digby F Warner
- Molecular Mycobacteriology Research Unit, Centre of Excellence for Biomedical TB Research, School of Pathology, University of the Witwatersrand and NHLS, P.O. Box 1038, Johannesburg 2000, South Africa.
| | | |
Collapse
|
218
|
Pasamontes A, Garcia-Vallve S. Use of a multi-way method to analyze the amino acid composition of a conserved group of orthologous proteins in prokaryotes. BMC Bioinformatics 2006; 7:257. [PMID: 16709240 PMCID: PMC1489954 DOI: 10.1186/1471-2105-7-257] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2005] [Accepted: 05/18/2006] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Amino acids in proteins are not used equally. Some of the differences in the amino acid composition of proteins are between species (mainly due to nucleotide composition and lifestyle) and some are between proteins from the same species (related to protein function, expression or subcellular localization, for example). As several factors contribute to the different amino acid usage in proteins, it is difficult both to analyze these differences and to separate the contributions made by each factor. RESULTS Using a multi-way method called Tucker3, we have analyzed the amino composition of a set of 64 orthologous groups of proteins present in 62 archaea and bacteria. This dataset corresponds to essential proteins such as ribosomal proteins, tRNA synthetases and translational initiation or elongation factors, which are common to all the species analyzed. The Tucker3 model can be used to study the amino acid variability within and between species by taking into consideration the tridimensionality of the data set. We found that the main factor behind the amino acid composition of proteins is independent of the organism or protein function analyzed. This factor must be related to the biochemical characteristics of each amino acid. The difference between the non-ribosomal proteins and the ribosomal proteins (which are rich in arginine and lysine) is the main factor behind the differences in amino acid composition within species, while G+C content and optimal growth temperature are the main factors behind the differences in amino acid usage between species. CONCLUSION We show that a multi-way method is useful for comparing the amino acid composition of several groups of orthologous proteins from the same group of species. This kind of dataset is extremely useful for detecting differences between and within species.
Collapse
Affiliation(s)
- Alberto Pasamontes
- Chemometrics, Qualimetrics and Nanosensors Group, Analytical and Organic Chemistry Department, Rovira i Virgili University (URV). Campus Sescelades, c/Marcelli Domingo s/n., 43007 Tarragona, Spain
| | - Santiago Garcia-Vallve
- Evolutionary Genomics Group, Biochemistry and Biotechnology Department, Rovira i Virgili University (URV). Campus Sescelades, c/Marcelli Domingo s/n., 43007 Tarragona, Spain
| |
Collapse
|
219
|
Godde JS, Bickerton A. The repetitive DNA elements called CRISPRs and their associated genes: evidence of horizontal transfer among prokaryotes. J Mol Evol 2006; 62:718-29. [PMID: 16612537 DOI: 10.1007/s00239-005-0223-z] [Citation(s) in RCA: 232] [Impact Index Per Article: 12.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2005] [Accepted: 01/12/2006] [Indexed: 01/25/2023]
Abstract
We have found direct DNA repeats 21-47 bp in length interspersed with nonrepetitive sequences of similar length, or clustered regularly interspaced short palindromic repeats (CRISPRs) in a wide range of diverse prokaryotes, including many Archaeal and Eubacterial species. A number of cas, CRISPR-associated genes have also been characterized in many of the same organisms. Phylogenetic analysis of these cas genes suggests that the CRISPR loci have been propagated via HGT, horizontal gene transfer. We suggest a mechanism by which this HGT has occurred, namely, that the CRISPR loci can be carried between cells on megaplasmids > or = 40 kb in length.
Collapse
Affiliation(s)
- James S Godde
- Department of Biology, Monmouth College, Monmouth, IL 61462, USA.
| | | |
Collapse
|
220
|
Makarova KS, Grishin NV, Shabalina SA, Wolf YI, Koonin EV. A putative RNA-interference-based immune system in prokaryotes: computational analysis of the predicted enzymatic machinery, functional analogies with eukaryotic RNAi, and hypothetical mechanisms of action. Biol Direct 2006; 1:7. [PMID: 16545108 PMCID: PMC1462988 DOI: 10.1186/1745-6150-1-7] [Citation(s) in RCA: 804] [Impact Index Per Article: 44.7] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2006] [Accepted: 03/16/2006] [Indexed: 11/10/2022] Open
Abstract
Background All archaeal and many bacterial genomes contain Clustered Regularly Interspaced Short Palindrome Repeats (CRISPR) and variable arrays of the CRISPR-associated (cas) genes that have been previously implicated in a novel form of DNA repair on the basis of comparative analysis of their protein product sequences. However, the proximity of CRISPR and cas genes strongly suggests that they have related functions which is hard to reconcile with the repair hypothesis. Results The protein sequences of the numerous cas gene products were classified into ~25 distinct protein families; several new functional and structural predictions are described. Comparative-genomic analysis of CRISPR and cas genes leads to the hypothesis that the CRISPR-Cas system (CASS) is a mechanism of defense against invading phages and plasmids that functions analogously to the eukaryotic RNA interference (RNAi) systems. Specific functional analogies are drawn between several components of CASS and proteins involved in eukaryotic RNAi, including the double-stranded RNA-specific helicase-nuclease (dicer), the endonuclease cleaving target mRNAs (slicer), and the RNA-dependent RNA polymerase. However, none of the CASS components is orthologous to its apparent eukaryotic functional counterpart. It is proposed that unique inserts of CRISPR, some of which are homologous to fragments of bacteriophage and plasmid genes, function as prokaryotic siRNAs (psiRNA), by base-pairing with the target mRNAs and promoting their degradation or translation shutdown. Specific hypothetical schemes are developed for the functioning of the predicted prokaryotic siRNA system and for the formation of new CRISPR units with unique inserts encoding psiRNA conferring immunity to the respective newly encountered phages or plasmids. The unique inserts in CRISPR show virtually no similarity even between closely related bacterial strains which suggests their rapid turnover, on evolutionary scale. Corollaries of this finding are that, even among closely related prokaryotes, the most commonly encountered phages and plasmids are different and/or that the dominant phages and plasmids turn over rapidly. Conclusion We proposed previously that Cas proteins comprise a novel DNA repair system. The association of the cas genes with CRISPR and, especially, the presence, in CRISPR units, of unique inserts homologous to phage and plasmid genes make us abandon this hypothesis. It appears most likely that CASS is a prokaryotic system of defense against phages and plasmids that functions via the RNAi mechanism. The functioning of this system seems to involve integration of fragments of foreign genes into archaeal and bacterial chromosomes yielding heritable immunity to the respective agents. However, it appears that this inheritance is extremely unstable on the evolutionary scale such that the repertoires of unique psiRNAs are completely replaced even in closely related prokaryotes, presumably, in response to rapidly changing repertoires of dominant phages and plasmids. This article was reviewed by: Eric Bapteste, Patrick Forterre, and Martijn Huynen. Open peer review Reviewed by Eric Bapteste, Patrick Forterre, and Martijn Huynen. For the full reviews, please go to the Reviewers' comments section.
Collapse
Affiliation(s)
- Kira S Makarova
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - Nick V Grishin
- Department of Biochemistry, University of Texas Southwestern Medical Center, 5323 Harry Hines Blvd, Dallas, TX 75390-9050, USA
| | - Svetlana A Shabalina
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - Yuri I Wolf
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - Eugene V Koonin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| |
Collapse
|
221
|
Pavlov YI, Shcherbakova PV, Rogozin IB. Roles of DNA Polymerases in Replication, Repair, and Recombination in Eukaryotes. INTERNATIONAL REVIEW OF CYTOLOGY 2006; 255:41-132. [PMID: 17178465 DOI: 10.1016/s0074-7696(06)55002-8] [Citation(s) in RCA: 97] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
Abstract
The functioning of the eukaryotic genome depends on efficient and accurate DNA replication and repair. The process of replication is complicated by the ongoing decomposition of DNA and damage of the genome by endogenous and exogenous factors. DNA damage can alter base coding potential resulting in mutations, or block DNA replication, which can lead to double-strand breaks (DSB) and to subsequent chromosome loss. Replication is coordinated with DNA repair systems that operate in cells to remove or tolerate DNA lesions. DNA polymerases can serve as sensors in the cell cycle checkpoint pathways that delay cell division until damaged DNA is repaired and replication is completed. Eukaryotic DNA template-dependent DNA polymerases have different properties adapted to perform an amazingly wide spectrum of DNA transactions. In this review, we discuss the structure, the mechanism, and the evolutionary relationships of DNA polymerases and their possible functions in the replication of intact and damaged chromosomes, DNA damage repair, and recombination.
Collapse
Affiliation(s)
- Youri I Pavlov
- Eppley Institute for Research in Cancer and Allied Diseases, Departments of Biochemistry and Molecular Biology, and Pathology and Microbiology, University of Nebraska Medical Center, Omaha, Nebraska 68198-6805, USA
| | | | | |
Collapse
|
222
|
Bolotin A, Quinquis B, Sorokin A, Ehrlich SD. Clustered regularly interspaced short palindrome repeats (CRISPRs) have spacers of extrachromosomal origin. MICROBIOLOGY-SGM 2005; 151:2551-2561. [PMID: 16079334 DOI: 10.1099/mic.0.28048-0] [Citation(s) in RCA: 1059] [Impact Index Per Article: 55.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/12/2023]
Abstract
Numerous prokaryote genomes contain structures known as clustered regularly interspaced short palindromic repeats (CRISPRs), composed of 25-50 bp repeats separated by unique sequence spacers of similar length. CRISPR structures are found in the vicinity of four genes named cas1 to cas4. In silico analysis revealed another cluster of three genes associated with CRISPR structures in many bacterial species, named here as cas1B, cas5 and cas6, and also revealed a certain number of spacers that have homology with extant genes, most frequently derived from phages, but also derived from other extrachromosomal elements. Sequence analysis of CRISPR structures from 24 strains of Streptococcus thermophilus and Streptococcus vestibularis confirmed the homology of spacers with extrachromosomal elements. Phage sensitivity of S. thermophilus strains appears to be correlated with the number of spacers in the CRISPR locus the strain carries. The authors suggest that the spacer elements are the traces of past invasions by extrachromosomal elements, and hypothesize that they provide the cell immunity against phage infection, and more generally foreign DNA expression, by coding an anti-sense RNA. The presence of gene fragments in CRISPR structures and the nuclease motifs in cas genes of both cluster types suggests that CRISPR formation involves a DNA degradation step.
Collapse
Affiliation(s)
- Alexander Bolotin
- Génétique Microbienne, Institut National de la Recherche Agronomique, Domaine de Vilvert, 78352 Jouy en Josas CEDEX, France
| | - Benoit Quinquis
- Génétique Microbienne, Institut National de la Recherche Agronomique, Domaine de Vilvert, 78352 Jouy en Josas CEDEX, France
| | - Alexei Sorokin
- Génétique Microbienne, Institut National de la Recherche Agronomique, Domaine de Vilvert, 78352 Jouy en Josas CEDEX, France
| | - S Dusko Ehrlich
- Génétique Microbienne, Institut National de la Recherche Agronomique, Domaine de Vilvert, 78352 Jouy en Josas CEDEX, France
| |
Collapse
|
223
|
Haft DH, Selengut J, Mongodin EF, Nelson KE. A guild of 45 CRISPR-associated (Cas) protein families and multiple CRISPR/Cas subtypes exist in prokaryotic genomes. PLoS Comput Biol 2005; 1:e60. [PMID: 16292354 PMCID: PMC1282333 DOI: 10.1371/journal.pcbi.0010060] [Citation(s) in RCA: 683] [Impact Index Per Article: 35.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2005] [Accepted: 10/05/2005] [Indexed: 11/19/2022] Open
Abstract
Clustered regularly interspaced short palindromic repeats (CRISPRs) are a family of DNA direct repeats found in many prokaryotic genomes. Repeats of 21-37 bp typically show weak dyad symmetry and are separated by regularly sized, nonrepetitive spacer sequences. Four CRISPR-associated (Cas) protein families, designated Cas1 to Cas4, are strictly associated with CRISPR elements and always occur near a repeat cluster. Some spacers originate from mobile genetic elements and are thought to confer "immunity" against the elements that harbor these sequences. In the present study, we have systematically investigated uncharacterized proteins encoded in the vicinity of these CRISPRs and found many additional protein families that are strictly associated with CRISPR loci across multiple prokaryotic species. Multiple sequence alignments and hidden Markov models have been built for 45 Cas protein families. These models identify family members with high sensitivity and selectivity and classify key regulators of development, DevR and DevS, in Myxococcus xanthus as Cas proteins. These identifications show that CRISPR/cas gene regions can be quite large, with up to 20 different, tandem-arranged cas genes next to a repeat cluster or filling the region between two repeat clusters. Distinctive subsets of the collection of Cas proteins recur in phylogenetically distant species and correlate with characteristic repeat periodicity. The analyses presented here support initial proposals of mobility of these units, along with the likelihood that loci of different subtypes interact with one another as well as with host cell defensive, replicative, and regulatory systems. It is evident from this analysis that CRISPR/cas loci are larger, more complex, and more heterogeneous than previously appreciated.
Collapse
Affiliation(s)
- Daniel H Haft
- The Institute for Genomic Research, Rockville, Maryland, USA.
| | | | | | | |
Collapse
|
224
|
Omelchenko MV, Wolf YI, Gaidamakova EK, Matrosova VY, Vasilenko A, Zhai M, Daly MJ, Koonin EV, Makarova KS. Comparative genomics of Thermus thermophilus and Deinococcus radiodurans: divergent routes of adaptation to thermophily and radiation resistance. BMC Evol Biol 2005; 5:57. [PMID: 16242020 PMCID: PMC1274311 DOI: 10.1186/1471-2148-5-57] [Citation(s) in RCA: 137] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2005] [Accepted: 10/20/2005] [Indexed: 01/02/2023] Open
Abstract
Background Thermus thermophilus and Deinococcus radiodurans belong to a distinct bacterial clade but have remarkably different phenotypes. T. thermophilus is a thermophile, which is relatively sensitive to ionizing radiation and desiccation, whereas D. radiodurans is a mesophile, which is highly radiation- and desiccation-resistant. Here we present an in-depth comparison of the genomes of these two related but differently adapted bacteria. Results By reconstructing the evolution of Thermus and Deinococcus after the divergence from their common ancestor, we demonstrate a high level of post-divergence gene flux in both lineages. Various aspects of the adaptation to high temperature in Thermus can be attributed to horizontal gene transfer from archaea and thermophilic bacteria; many of the horizontally transferred genes are located on the single megaplasmid of Thermus. In addition, the Thermus lineage has lost a set of genes that are still present in Deinococcus and many other mesophilic bacteria but are not common among thermophiles. By contrast, Deinococcus seems to have acquired numerous genes related to stress response systems from various bacteria. A comparison of the distribution of orthologous genes among the four partitions of the Deinococcus genome and the two partitions of the Thermus genome reveals homology between the Thermus megaplasmid (pTT27) and Deinococcus megaplasmid (DR177). Conclusion After the radiation from their common ancestor, the Thermus and Deinococcus lineages have taken divergent paths toward their distinct lifestyles. In addition to extensive gene loss, Thermus seems to have acquired numerous genes from thermophiles, which likely was the decisive contribution to its thermophilic adaptation. By contrast, Deinococcus lost few genes but seems to have acquired many bacterial genes that apparently enhanced its ability to survive different kinds of environmental stresses. Notwithstanding the accumulation of horizontally transferred genes, we also show that the single megaplasmid of Thermus and the DR177 megaplasmid of Deinococcus are homologous and probably were inherited from the common ancestor of these bacteria.
Collapse
Affiliation(s)
- Marina V Omelchenko
- Department of Pathology, F.E. Hebert School of Medicine, Uniformed Services University of the Health Sciences, Bethesda, MD 20814-4799, USA
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - Yuri I Wolf
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - Elena K Gaidamakova
- Department of Pathology, F.E. Hebert School of Medicine, Uniformed Services University of the Health Sciences, Bethesda, MD 20814-4799, USA
| | - Vera Y Matrosova
- Department of Pathology, F.E. Hebert School of Medicine, Uniformed Services University of the Health Sciences, Bethesda, MD 20814-4799, USA
| | - Alexander Vasilenko
- Department of Pathology, F.E. Hebert School of Medicine, Uniformed Services University of the Health Sciences, Bethesda, MD 20814-4799, USA
| | - Min Zhai
- Department of Pathology, F.E. Hebert School of Medicine, Uniformed Services University of the Health Sciences, Bethesda, MD 20814-4799, USA
| | - Michael J Daly
- Department of Pathology, F.E. Hebert School of Medicine, Uniformed Services University of the Health Sciences, Bethesda, MD 20814-4799, USA
| | - Eugene V Koonin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - Kira S Makarova
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| |
Collapse
|
225
|
Makarova KS, Koonin EV. Evolutionary and functional genomics of the Archaea. Curr Opin Microbiol 2005; 8:586-94. [PMID: 16111915 DOI: 10.1016/j.mib.2005.08.003] [Citation(s) in RCA: 37] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2005] [Accepted: 08/05/2005] [Indexed: 11/20/2022]
Abstract
In the past two years, archaeal genomics has achieved several breakthroughs. On the evolutionary front the most exciting development was the sequencing and analysis of the genome of Nanoarchaeum equitans, a tiny parasitic organism that has only approximately 540 genes. The genome of Nanoarchaeum shows signs of extreme rearrangement including the virtual absence of conserved operons and the presence of several split genes. Nanoarchaeum is distantly related to other archaea, and it has been proposed to represent a deep archaeal branch that is distinct from Euryarchaeota and Crenarchaeota. This would imply that many features of its gene repertoire and genome organization might be ancestral. However, additional genome analysis has provided a more conservative suggestion - that Nanoarchaeum is a highly derived euryarchaeon. Also there have been substantial developments in functional genomics, including the discovery of the elusive aminoacyl-tRNA synthetase that is involved in both the biosynthesis of cysteine and its incorporation into proteins in methanogens, and the first experimental validation of the predicted archaeal exosome.
Collapse
Affiliation(s)
- Kira S Makarova
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | | |
Collapse
|
226
|
Ettema TJG, de Vos WM, van der Oost J. Discovering novel biology by in silico archaeology. Nat Rev Microbiol 2005; 3:859-69. [PMID: 16175172 DOI: 10.1038/nrmicro1268] [Citation(s) in RCA: 17] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Archaea are prokaryotes that evolved in parallel with bacteria. Since the discovery of the distinct status of the Archaea, extensive physiological and biochemical research has been conducted to elucidate the molecular basis of their remarkable lifestyle and their unique biology. Here, we discuss how in-depth comparative genomics has been used to improve the annotation of archaeal genomes. Combined with experimental verification, bioinformatic analysis contributes to the ongoing discovery of novel metabolic conversions and control mechanisms, and as such to a better understanding of the intriguing biology of the Archaea.
Collapse
Affiliation(s)
- Thijs J G Ettema
- Laboratory of Microbiology, Wageningen University, 6703 CT Wageningen, The Netherlands
| | | | | |
Collapse
|
227
|
Cubeddu L, White MF. DNA damage detection by an archaeal single-stranded DNA-binding protein. J Mol Biol 2005; 353:507-16. [PMID: 16181640 DOI: 10.1016/j.jmb.2005.08.050] [Citation(s) in RCA: 47] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2005] [Revised: 08/11/2005] [Accepted: 08/18/2005] [Indexed: 10/25/2022]
Abstract
Archaeal DNA repair pathways are not well defined; in particular, there are no convincing candidate proteins for detection of DNA mismatches or the bulky lesions removed by excision repair pathways. Single-stranded DNA-binding proteins (SSBs) play a central role in DNA replication, recombination and repair. The crenarchaeal SSB is a monomer with a single oligonucleotide-binding fold for single-stranded DNA binding coupled to a flexible C-terminal tail reminiscent of bacterial SSB that mediates interactions with other proteins. We demonstrate that Sulfolobus solfataricus SSB can melt DNA containing a mismatch or DNA lesion specifically in vitro. We suggest that a potential role for SSB in archaea is the detection of DNA damage due to local destabilisation of the DNA double helix, followed by recruitment of specific repair proteins. Proteins interacting specifically with a single-stranded DNA:SSB complex include several known or putative DNA repair proteins and DNA helicases.
Collapse
Affiliation(s)
- Liza Cubeddu
- Centre for Biomolecular Sciences, University of St Andrews, St Andrews KY16 9ST, UK
| | | |
Collapse
|
228
|
Iyer LM, Koonin EV, Leipe DD, Aravind L. Origin and evolution of the archaeo-eukaryotic primase superfamily and related palm-domain proteins: structural insights and new members. Nucleic Acids Res 2005; 33:3875-96. [PMID: 16027112 PMCID: PMC1176014 DOI: 10.1093/nar/gki702] [Citation(s) in RCA: 219] [Impact Index Per Article: 11.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Abstract
We report an in-depth computational study of the protein sequences and structures of the superfamily of archaeo-eukaryotic primases (AEPs). This analysis greatly expands the range of diversity of the AEPs and reveals the unique active site shared by all members of this superfamily. In particular, it is shown that eukaryotic nucleo-cytoplasmic large DNA viruses, including poxviruses, asfarviruses, iridoviruses, phycodnaviruses and the mimivirus, encode AEPs of a distinct family, which also includes the herpesvirus primases whose relationship to AEPs has not been recognized previously. Many eukaryotic genomes, including chordates and plants, encode previously uncharacterized homologs of these predicted viral primases, which might be involved in novel DNA repair pathways. At a deeper level of evolutionary connections, structural comparisons indicate that AEPs, the nucleases involved in the initiation of rolling circle replication in plasmids and viruses, and origin-binding domains of papilloma and polyoma viruses evolved from a common ancestral protein that might have been involved in a protein-priming mechanism of initiation of DNA replication. Contextual analysis of multidomain protein architectures and gene neighborhoods in prokaryotes and viruses reveals remarkable parallels between AEPs and the unrelated DnaG-type primases, in particular, tight associations with the same repertoire of helicases. These observations point to a functional equivalence of the two classes of primases, which seem to have repeatedly displaced each other in various extrachromosomal replicons.
Collapse
Affiliation(s)
| | | | | | - L. Aravind
- To whom correspondence should be addressed. Tel: +1 301 594 2445; Fax: +1 301 480 9241;
| |
Collapse
|
229
|
Kinch LN, Ginalski K, Rychlewski L, Grishin NV. Identification of novel restriction endonuclease-like fold families among hypothetical proteins. Nucleic Acids Res 2005; 33:3598-605. [PMID: 15972856 PMCID: PMC1157100 DOI: 10.1093/nar/gki676] [Citation(s) in RCA: 71] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Restriction endonucleases and other nucleic acid cleaving enzymes form a large and extremely diverse superfamily that display little sequence similarity despite retaining a common core fold responsible for cleavage. The lack of significant sequence similarity between protein families makes homology inference a challenging task and hinders new family identification with traditional sequence-based approaches. Using the consensus fold recognition method Meta-BASIC that combines sequence profiles with predicted protein secondary structure, we identify nine new restriction endonuclease-like fold families among previously uncharacterized proteins and predict these proteins to cleave nucleic acid substrates. Application of transitive searches combined with gene neighborhood analysis allow us to confidently link these unknown families to a number of known restriction endonuclease-like structures and thus assign folds to the uncharacterized proteins. Finally, our method identifies a novel restriction endonuclease-like domain in the C-terminus of RecC that is not detected with structure-based searches of the existing PDB database.
Collapse
Affiliation(s)
- Lisa N Kinch
- Department of Biochemistry, Howard Hughes Medical Institute, University of Texas Southwestern Medical Center, 5323 Harry Hines Boulevard, Dallas, TX 75390-9050, USA.
| | | | | | | |
Collapse
|
230
|
Pourcel C, Salvignol G, Vergnaud G. CRISPR elements in Yersinia pestis acquire new repeats by preferential uptake of bacteriophage DNA, and provide additional tools for evolutionary studies. MICROBIOLOGY-SGM 2005; 151:653-663. [PMID: 15758212 DOI: 10.1099/mic.0.27437-0] [Citation(s) in RCA: 842] [Impact Index Per Article: 44.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
Abstract
The remarkable repetitive elements called CRISPRs (clustered regularly interspaced short palindromic repeats) consist of repeats interspaced with non-repetitive elements or 'spacers'. CRISPRs are present in both archaea and bacteria, in association with genes involved in DNA recombination and repair. In the Yersinia pestis genome, three such elements are found at three distinct loci, one of them being highly polymorphic. The authors have sequenced a total of 109 alleles of the three Y. pestis CRISPRs and they describe 29 new spacers, most being specific to one isolate. In nine strains of Yersinia pseudotuberculosis, 132 spacers were found, of which only three are common to Y. pestis isolates. In Y. pestis of the Orientalis biovar investigated in detail here, deletion of motifs is observed but it appears that addition of new motifs to a common ancestral element is the most frequent event. This takes place at the three different loci, although at a higher rate in one of the loci, and the addition of new motifs is polarized. Interestingly, the most recently acquired spacers were found to have a homologue at another locus in the genome, the majority of these inside an inactive prophage. This is believed to be the first time that the origin of the spacers in CRISPR elements has been explained. The CRISPR structure provides a new and robust identification tool.
Collapse
Affiliation(s)
- C Pourcel
- GPMS, Institut de Génétique et Microbiologie, Université Paris XI, 91405 Orsay cedex, France
| | - G Salvignol
- GPMS, Institut de Génétique et Microbiologie, Université Paris XI, 91405 Orsay cedex, France
| | - G Vergnaud
- Centre d'Etudes du Bouchet, 5 rue Lavoisier, 91710 Vert le Petit, France
- GPMS, Institut de Génétique et Microbiologie, Université Paris XI, 91405 Orsay cedex, France
| |
Collapse
|
231
|
Pavlov AR, Pavlova NV, Kozyavkin SA, Slesarev AI. Recent developments in the optimization of thermostable DNA polymerases for efficient applications. Trends Biotechnol 2005; 22:253-60. [PMID: 15109812 DOI: 10.1016/j.tibtech.2004.02.011] [Citation(s) in RCA: 73] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
Affiliation(s)
- Andrey R Pavlov
- Fidelity Systems Inc., 7961 Cessna Avenue, Gaithersburg, Maryland 20879, USA
| | | | | | | |
Collapse
|
232
|
Abstract
There are many ways to group completed genome sequences in hierarchical patterns (trees) reflecting relationships between their genes. Such groupings help us organize biological information and bear crucially on underlying processes of genome and organismal evolution. Genome trees make use of all comparable genes but can variously weight the contributions of these genes according to similarity, congruent patterns of similarity, or prevalence among genomes. Here we explore such possible weighting strategies, in an analysis of 142 prokaryotic and 5 eukaryotic genomes. We demonstrate that alternate weighting strategies have different advantages, and we propose that each may have its specific uses in systematic or evolutionary biology. Comparisons of results obtained with different methods can provide further clues to major events and processes in genome evolution.
Collapse
Affiliation(s)
- Uri Gophna
- Genome Atlantic and Department of Biochemistry and Molecular Biology, Dalhousie University, Halifax, Nova Scotia
| | | | | |
Collapse
|
233
|
Abstract
For decades, archaea were misclassified as bacteria because of their prokaryotic morphology. Molecular phylogeny eventually revealed that archaea, like bacteria and eukaryotes, are a fundamentally distinct domain of life. Genome analyses have confirmed that archaea share many features with eukaryotes, particularly in information processing, and therefore can serve as streamlined models for understanding eukaryotic biology. Biochemists and structural biologists have embraced the study of archaea but geneticists have been more wary, despite the fact that genetic techniques for archaea are quite sophisticated. It is time for geneticists to start asking fundamental questions about our distant relatives.
Collapse
Affiliation(s)
- Thorsten Allers
- Institute of Genetics, University of Nottingham, Queen's Medical Centre, Nottingham NG7 2UH, UK.
| | | |
Collapse
|
234
|
Guy CP, Majerník AI, Chong JPJ, Bolt EL. A novel nuclease-ATPase (Nar71) from archaea is part of a proposed thermophilic DNA repair system. Nucleic Acids Res 2004; 32:6176-86. [PMID: 15570068 PMCID: PMC535669 DOI: 10.1093/nar/gkh960] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
We have identified a novel structure-specific nuclease in highly fractionated extracts of the thermophilic archaeon Methanothermobacter thermautotrophicus (Mth). The 71 kDa protein product of open reading frame mth1090 is a nuclease with ATPase activity, which we call Nar71 (Nuclease-ATPase in Repair, 71 kDa). The nar71 gene is located in a gene neighbourhood proposed by genomics to encode a novel DNA repair system conserved in thermophiles. The biochemical characterization of Nar71 presented here is the first analysis from within this neighbourhood, and it supports the insight from genomics. Nuclease activity of Nar71 is specific for 3' flaps and flayed duplexes, targeting single-stranded DNA (ssDNA) regions. This activity requires Mg2+ or Mn2+ and is greatly reduced in ATP. In ATP, Nar71 displaces ssDNA, also with high specificity for 3' flap and flayed duplex DNA. Strand displacement is weak compared with nuclease activity, but in ATPS it is abolished, suggesting that Nar71 couples ATP hydrolysis to DNA strand separation. ATPase assays confirmed that Nar71 is stimulated by ssDNA, though not double-stranded DNA. Mutation of Lys-117 in Nar71 abolished ATPase and nuclease activity, and we describe a separation-of-function mutant (K68A) that has lost ATPase activity but retains nuclease activity. A model of possible Nar71 function in DNA repair is presented.
Collapse
Affiliation(s)
- Colin P Guy
- Institute of Genetics, School of Biology, University of Nottingham, Queen's Medical Centre, Nottingham NG7 2UH, UK
| | | | | | | |
Collapse
|
235
|
Arcus VL, Bäckbro K, Roos A, Daniel EL, Baker EN. Distant Structural Homology Leads to the Functional Characterization of an Archaeal PIN Domain as an Exonuclease. J Biol Chem 2004; 279:16471-8. [PMID: 14734548 DOI: 10.1074/jbc.m313833200] [Citation(s) in RCA: 96] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
Genome sequencing projects have focused attention on the problem of discovering the functions of protein domains that are widely distributed throughout living species but which are, as yet, largely uncharacterized. One such example is the PIN domain, found in eukaryotes, bacteria, and Archaea, and with suggested roles in signaling, RNase editing, and/or nucleotide binding. The first reported crystal structure of a PIN domain (open reading frame PAE2754, derived from the crenarchaeon, Pyrobaculum aerophilum) has been determined to 2.5 A resolution and is presented here. Mapping conserved residues from a multiple sequence alignment onto the structure identifies a putative active site. The discovery of distant structural homology with several exonucleases, including T4 phage RNase H and flap endonuclease (FEN1), further suggests a likely function for PIN domains as Mg2+-dependent exonucleases, a hypothesis that we have confirmed in vitro. The tetrameric structure of PAE2754, with the active sites inside a tunnel, suggests a mechanism for selective cleavage of single-stranded overhangs or flap structures. These results indicate likely DNA or RNA editing roles for prokaryotic PIN domains, which are strikingly numerous in thermophiles, and in organisms such as Mycobacterium tuberculosis. They also support previous hypotheses that eukaryotic PIN domains participate in RNAi and nonsense-mediated RNA degradation.
Collapse
Affiliation(s)
- Vickery L Arcus
- School of Biological Sciences, University of Auckland, Private Bag 92019, Auckland, New Zealand.
| | | | | | | | | |
Collapse
|
236
|
Paz A, Mester D, Baca I, Nevo E, Korol A. Adaptive role of increased frequency of polypurine tracts in mRNA sequences of thermophilic prokaryotes. Proc Natl Acad Sci U S A 2004; 101:2951-6. [PMID: 14973185 PMCID: PMC365726 DOI: 10.1073/pnas.0308594100] [Citation(s) in RCA: 63] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
The mechanism of an organism's adaptation to high temperatures has been investigated intensively in recent years. It was suggested that the macromolecules of thermophilic microorganisms (especially proteins) have structural features that enhance their thermostability. We compared mRNA sequences of 72 fully sequenced prokaryotic proteomes (14 thermophilic and 58 mesophilic species). Although the differences between the percentage of adenine plus guanine content of whole mRNAs of different prokaryotic species are much lower than those of guanine plus cytosine content, the thermophile purine-pyrimidine (R/Y) ratio within their mRNAs is significantly higher than that of the mesophiles. The first and third codon positions of both thermophiles and mesophiles are purine-biased, with the bias more pronounced by the thermophiles. Thermophile mRNAs that display the highest R/Y ratio (1.43-1.69) are those of the ribosomal proteins, histone-like proteins, DNA-dependent RNA polymerase subunits, and heat-shock proteins. Within mesophilic prokaryotes and five eukaryotic species, the R/Y ratio of the mRNAs of heat-shock proteins is higher than their average over coding part of the genome. Polypurine tracts (R)(n) (with n > or = 5) are much more abundant within the thermophile mRNAs compared with mesophiles. Between two sequential pure-purinic codons of thermophile mRNAs, there is a rather strong tendency for the occurrence of adenine but not guanine tracts. The data suggest that mixed adenine.guanine and polyadenine tracts in mRNAs increase the thermostability beyond the contribution of amino acids encoded by purine tracts, which highlights the importance of ecological stress in the evolution of genome architecture.
Collapse
Affiliation(s)
- Arnon Paz
- Institute of Evolution, Haifa University, Mount Carmel, Haifa 31905, Israel
| | | | | | | | | |
Collapse
|
237
|
Boucher Y, Douady CJ, Papke RT, Walsh DA, Boudreau MER, Nesbø CL, Case RJ, Doolittle WF. Lateral gene transfer and the origins of prokaryotic groups. Annu Rev Genet 2004; 37:283-328. [PMID: 14616063 DOI: 10.1146/annurev.genet.37.050503.084247] [Citation(s) in RCA: 279] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Lateral gene transfer (LGT) is now known to be a major force in the evolution of prokaryotic genomes. To date, most analyses have focused on either (a) verifying phylogenies of individual genes thought to have been transferred, or (b) estimating the fraction of individual genomes likely to have been introduced by transfer. Neither approach does justice to the ability of LGT to effect massive and complex transformations in basic biology. In some cases, such transformation will be manifested as the patchy distribution of a seemingly fundamental property (such as aerobiosis or nitrogen fixation) among the members of a group classically defined by the sharing of other properties (metabolic, morphological, or molecular, such as small subunit ribosomal RNA sequence). In other cases, the lineage of recipients so transformed may be seen to comprise a new group of high taxonomic rank ("class" or even "phylum"). Here we review evidence for an important role of LGT in the evolution of photosynthesis, aerobic respiration, nitrogen fixation, sulfate reduction, methylotrophy, isoprenoid biosynthesis, quorum sensing, flotation (gas vesicles), thermophily, and halophily. Sometimes transfer of complex gene clusters may have been involved, whereas other times separate exchanges of many genes must be invoked.
Collapse
Affiliation(s)
- Yan Boucher
- Program in Evolutionary Biology, Canadian Institute for Advanced Research, Department of Biochemistry, Sir Charles Tupper Medical Building, 5859 University Avenue, Halifax, Nova Scotia, Canada, B3H 4H7
| | | | | | | | | | | | | | | |
Collapse
|
238
|
Rodrigues JCM, Cabral GB, Dusi DMA, de Mello LV, Rigden DJ, Carneiro VTC. Identification of differentially expressed cDNA sequences in ovaries of sexual and apomictic plants of Brachiaria brizantha. PLANT MOLECULAR BIOLOGY 2003; 53:745-57. [PMID: 15082923 DOI: 10.1023/b:plan.0000023664.21910.bd] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/20/2023]
Abstract
The isolation of genes associated with apomixis would improve understanding of the molecular mechanism of this mode of reproduction in plants as well as open the possibility of transfer of apomixis to sexual plants, enabling cloning of crops through seeds. Brachiaria brizantha is a highly apomictic grass species with 274 tetraploid apomicts accessions and only one diploid sexual. In this study we have compared gene expression in ovaries at megasporogenesis and megagametogenesis of sexual and apomictic accessions of B. brizantha by differential display (DD-PCR), with 60 primer combinations. Specificity of 65 cloned fragments, checked by reverse northern blot analysis, showed that 11 clones were differentially expressed, 6 in apomictic ovaries, 2 in sexual and 3 in apomictic and sexual, but at different stages. Of the 6 sequences isolated that were preferentially expressed in the apomictic accession: one sequence was from ovaries at megasporogenesis stage; three were from megagametogenesis stage; two were from both stages. Of the two sequences isolated from the sexual accessions, one showed expression in ovaries at megagametogenesis, while the other sequence was shown to be specific to both stages. Three sequences were from megasporogenesis stage in apomicts but were also detected at megagametogenesis in sexual plants. Sequence analysis showed that 5 of the 11 clones had no apparent homologues in the protein database. Some of the clones identified as apomictic-specific shared homology with known genes enabling their functional annotation. The relationships of these functions to the generation of the apomictic trait are discussed.
Collapse
Affiliation(s)
- Júlio C M Rodrigues
- Embrapa Genetic Resources and Biotechnology, CP: 02372, CEP: 70770-900 Brasilia-DF, Brazil
| | | | | | | | | | | |
Collapse
|
239
|
Kanugula S, Pegg AE. Alkylation damage repair protein O6-alkylguanine-DNA alkyltransferase from the hyperthermophiles Aquifex aeolicus and Archaeoglobus fulgidus. Biochem J 2003; 375:449-55. [PMID: 12892560 PMCID: PMC1223701 DOI: 10.1042/bj20030809] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2003] [Revised: 07/24/2003] [Accepted: 08/01/2003] [Indexed: 10/27/2022]
Abstract
AGT (O6-alkylguanine DNA alkyltransferase) is an important DNA-repair protein that protects cells from killing and mutagenesis by alkylating agents. The AGT genes from two extremely thermophilic organisms, the bacterium Aquifex aeolicus and the archaeon Archaeoglobus fulgidus were PCR-derived and cloned into an expression vector. The nucleotide sequence of the Aq. aeolicus AGT encodes a 201-amino-acid protein with a molecular mass of 23000 Da and Ar. fulgidus AGT codes for a 147-amino-acid protein with a molecular mass of 16718 Da. The Aq. aeolicus and Ar. fulgidus AGTs were expressed at high levels in Escherichia coli fused to an N-terminal polyhistidine tag that allowed single-step isolation and purification by metal-affinity chromatography. Both AGTs formed inclusion bodies and were not soluble under native purification conditions. Therefore AGT isolation was performed under protein-denaturation conditions in the presence of 8.0 M urea. Soluble AGT was obtained by refolding the AGT in the presence of calf thymus DNA. Both AGTs were active in repairing O6-methylguanine and, at a lower rate, O4-methylthymine in DNA. They exhibited thermostability and optimum activity at high temperature. The thermostable AGTs, particularly that from Aq. aeolicus, were readily inactivated by the low-molecular-mass inhibitor O6-benzylguanine, which is currently in clinical trials to enhance cancer chemotherapy.
Collapse
Affiliation(s)
- Sreenivas Kanugula
- Department of Cellular and Molecular Physiology, Pennsylvania State University College of Medicine, The Milton S. Hershey Medical Center, P.O. Box 850, 500 University Drive, Hershey, PA 17033, USA
| | | |
Collapse
|
240
|
Anantharaman V, Aravind L. Application of comparative genomics in the identification and analysis of novel families of membrane-associated receptors in bacteria. BMC Genomics 2003; 4:34. [PMID: 12914674 PMCID: PMC212514 DOI: 10.1186/1471-2164-4-34] [Citation(s) in RCA: 70] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2003] [Accepted: 08/12/2003] [Indexed: 11/29/2022] Open
Abstract
BACKGROUND A great diversity of multi-pass membrane receptors, typically with 7 transmembrane (TM) helices, is observed in the eukaryote crown group. So far, they are relatively rare in the prokaryotes, and are restricted to the well-characterized sensory rhodopsins of various phototropic prokaryotes. RESULTS Utilizing the currently available wealth of prokaryotic genomic sequences, we set up a computational screen to identify putative 7 (TM) and other multi-pass membrane receptors in prokaryotes. As a result of this procedure we were able to recover two widespread families of 7 TM receptors in bacteria that are distantly related to the eukaryotic 7 TM receptors and prokaryotic rhodopsins. Using sequence profile analysis, we were able to establish that the first members of these receptor families contain one of two distinct N-terminal extracellular globular domains, which are predicted to bind ligands such as carbohydrates. In their intracellular portions they contain fusions to a variety of signaling domains, which suggest that they are likely to transduce signals via cyclic AMP, cyclic diguanylate, histidine phosphorylation, dephosphorylation, and through direct interactions with DNA. The second family of bacterial 7 TM receptors possesses an alpha-helical extracellular domain, and is predicted to transduce a signal via an intracellular HD hydrolase domain. Based on comparative analysis of gene neighborhoods, this receptor is predicted to function as a regulator of the diacylglycerol-kinase-dependent glycerolipid pathway. Additionally, our procedure also recovered other types of putative prokaryotic multi-pass membrane associated receptor domains. Of these, we characterized two widespread, evolutionarily mobile multi-TM domains that are fused to a variety of C-terminal intracellular signaling domains. One of these typified by the Gram-positive LytS protein is predicted to be a potential sensor of murein derivatives, whereas the other one typified by the Escherichia coli UhpB protein is predicted to function as sensor of conformational changes occurring in associated membrane proteins CONCLUSIONS We present evidence for considerable variety in the types of uncharacterized surface receptors in bacteria, and reconstruct the evolutionary processes that model their diversity. The identification of novel receptor families in prokaryotes is likely to aid in the experimental analysis of signal transduction and environmental responses of several bacteria, including pathogens such as Leptospira, Treponema, Corynebacterium, Coxiella, Bacillus anthracis and Cytophaga.
Collapse
Affiliation(s)
- Vivek Anantharaman
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - L Aravind
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| |
Collapse
|
241
|
Makarova KS, Koonin EV. Comparative genomics of Archaea: how much have we learned in six years, and what's next? Genome Biol 2003; 4:115. [PMID: 12914651 PMCID: PMC193635 DOI: 10.1186/gb-2003-4-8-115] [Citation(s) in RCA: 74] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022] Open
Abstract
Archaea comprise one of the three distinct domains of life (with bacteria and eukaryotes). With 16 complete archaeal genomes sequenced to date, comparative genomics has revealed a conserved core of 313 genes that are represented in all sequenced archaeal genomes, plus a variable 'shell' that is prone to lineage-specific gene loss and horizontal gene exchange. The majority of archaeal genes have not been experimentally characterized, but novel functional pathways have been predicted.
Collapse
Affiliation(s)
- Kira S Makarova
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA.
| | | |
Collapse
|
242
|
Boshoff HIM, Reed MB, Barry CE, Mizrahi V. DnaE2 polymerase contributes to in vivo survival and the emergence of drug resistance in Mycobacterium tuberculosis. Cell 2003; 113:183-93. [PMID: 12705867 DOI: 10.1016/s0092-8674(03)00270-8] [Citation(s) in RCA: 313] [Impact Index Per Article: 14.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
The presence of multiple copies of the major replicative DNA polymerase (DnaE) in some organisms, including important pathogens and symbionts, has remained an unresolved enigma. We postulated that one copy might participate in error-prone DNA repair synthesis. We found that UV irradiation of Mycobacterium tuberculosis results in increased mutation frequency in the surviving fraction. We identified dnaE2 as a gene that is upregulated in vitro by several DNA damaging agents, as well as during infection of mice. Loss of this protein reduces both survival of the bacillus after UV irradiation and the virulence of the organism in mice. Our data suggest that DnaE2, and not a member of the Y family of error-prone DNA polymerases, is the primary mediator of survival through inducible mutagenesis and can contribute directly to the emergence of drug resistance in vivo. These results may indicate a potential new target for therapeutic intervention.
Collapse
Affiliation(s)
- Helena I M Boshoff
- Laboratory of Host Defenses, National Institute of Allergy and Infectious Diseases, National Institutes of Health, Twinbrook II, 12441 Parklawn Drive, Rockville, MD 20852, USA
| | | | | | | |
Collapse
|
243
|
Abstract
We searched for genes that could be important for hyperthermophily using a flexible approach to phyletic pattern analysis. We identified 290 clusters of orthologous groups of proteins (COGs) that are preferentially present in archaeal and bacterial hyperthermophiles. Of these, 58 COGs include proteins from at least one bacterium and two archaea, and these were considered to be the best candidates for a specific association with the hyperthermophilic phenotype. Detailed sequence and genome-context analysis of these COGs led to functional predictions for several previously uncharacterized protein families, including a novel group of putative molecular chaperones and a unique transcriptional regulator.
Collapse
Affiliation(s)
- Kira S Makarova
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | | | | |
Collapse
|
244
|
Iyer LM, Koonin EV, Aravind L. Evolutionary connection between the catalytic subunits of DNA-dependent RNA polymerases and eukaryotic RNA-dependent RNA polymerases and the origin of RNA polymerases. BMC STRUCTURAL BIOLOGY 2003; 3:1. [PMID: 12553882 PMCID: PMC151600 DOI: 10.1186/1472-6807-3-1] [Citation(s) in RCA: 165] [Impact Index Per Article: 7.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/18/2003] [Accepted: 01/28/2003] [Indexed: 12/02/2022]
Abstract
BACKGROUND The eukaryotic RNA-dependent RNA polymerase (RDRP) is involved in the amplification of regulatory microRNAs during post-transcriptional gene silencing. This enzyme is highly conserved in most eukaryotes but is missing in archaea and bacteria. No evolutionary relationship between RDRP and other polymerases has been reported so far, hence the origin of this eukaryote-specific polymerase remains a mystery. RESULTS Using extensive sequence profile searches, we identified bacteriophage homologs of the eukaryotic RDRP. The comparison of the eukaryotic RDRP and their homologs from bacteriophages led to the delineation of the conserved portion of these enzymes, which is predicted to harbor the catalytic site. Further, detailed sequence comparison, aided by examination of the crystal structure of the DNA-dependent RNA polymerase (DDRP), showed that the RDRP and the beta' subunit of DDRP (and its orthologs in archaea and eukaryotes) contain a conserved double-psi beta-barrel (DPBB) domain. This DPBB domain contains the signature motif DbDGD (b is a bulky residue), which is conserved in all RDRPs and DDRPs and contributes to catalysis via a coordinated divalent cation. Apart from the DPBB domain, no similarity was detected between RDRP and DDRP, which leaves open two scenarios for the origin of RDRP: i) RDRP evolved at the onset of the evolution of eukaryotes via a duplication of the DDRP beta' subunit followed by dramatic divergence that obliterated the sequence similarity outside the core catalytic domain and ii) the primordial RDRP, which consisted primarily of the DPBB domain, evolved from a common ancestor with the DDRP at a very early stage of evolution, during the RNA world era. The latter hypothesis implies that RDRP had been subsequently eliminated from cellular life forms and might have been reintroduced into the eukaryotic genomes through a bacteriophage. Sequence and structure analysis of the DDRP led to further insights into the evolution of RNA polymerases. In addition to the beta' subunit, beta subunit of DDRP also contains a DPBB domain, which is, however, distorted by large inserts and does not harbor a counterpart of the DbDGD motif. The DPBB domains of the two DDRP subunits together form the catalytic cleft, with the domain from the beta' subunit supplying the metal-coordinating DbDGD motif and the one from the beta subunit providing two lysine residues involved in catalysis. Given that the two DPBB domains of DDRP contribute completely different sets of active residues to the catalytic center, it is hypothesized that the ultimate ancestor of RNA polymerases functioned as a homodimer of a generic, RNA-binding DPBB domain. This ancestral protein probably did not have catalytic activity and served as a cofactor for a ribozyme RNA polymerase. Subsequent evolution of DDRP and RDRP involved accretion of distinct sets of additional domains. In the DDRPs, these included a RNA-binding Zn-ribbon, an AT-hook-like module and a sandwich-barrel hybrid motif (SBHM) domain. Further, lineage-specific accretion of SBHM domains and other, DDRP-specific domains is observed in bacterial DDRPs. In contrast, the orthologs of the beta' subunit in archaea and eukaryotes contains a four-stranded alpha + beta domain that is shared with the alpha-subunit of bacterial DDRP, eukaryotic DDRP subunit RBP11, translation factor eIF1 and type II topoisomerases. The additional domains of the RDRPs remain to be characterized. CONCLUSIONS Eukaryotic RNA-dependent RNA polymerases share the catalytic double-psi beta-barrel domain, containing a signature metal-coordinating motif, with the universally conserved beta' subunit of DNA-dependent RNA polymerases. Beyond this core catalytic domain, the two classes of RNA polymerases do not have common domains, suggesting early divergence from a common ancestor, with subsequent independent domain accretion. The beta-subunit of DDRP contains another, highly diverged DPBB domain. The presence of two distinct DPBB domains in two subunits of DDRP is compatible with the hypothesis that the ith the hypothesis that the ultimate ancestor of RNA polymerases was a RNA-binding DPBB domain that had no catalytic activity but rather functioned as a homodimeric cofactor for a ribozyme polymerase.
Collapse
Affiliation(s)
- Lakshminarayan M Iyer
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - Eugene V Koonin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - L Aravind
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| |
Collapse
|
245
|
Abstract
When replication forks stall or collapse at sites of DNA damage, there are two avenues for fork rescue. Mutagenic translesion synthesis by a special class of DNA polymerases can move a fork past the damage, but can leave behind mutations. The alternative nonmutagenic pathways for fork repair involve cellular recombination systems. In bacteria, nonmutagenic repair of replication forks may occur as often as once per cell per generation, and is the favored path for fork restoration under normal growth conditions. Replication fork repair is almost certainly the major function of bacterial recombination systems, and was probably the impetus for the evolution of recombination systems. Increasingly, the nonmutagenic repair of replication forks is seen as a major function of eukaryotic recombination systems as well.
Collapse
Affiliation(s)
- Michael M Cox
- Department of Biochemistry, University of Wisconsin at Madison, 433 Babcock Drive, Madison, WI 53706-1544, USA.
| |
Collapse
|
246
|
Iyer LM, Aravind L. The catalytic domains of thiamine triphosphatase and CyaB-like adenylyl cyclase define a novel superfamily of domains that bind organic phosphates. BMC Genomics 2002; 3:33. [PMID: 12456267 PMCID: PMC138802 DOI: 10.1186/1471-2164-3-33] [Citation(s) in RCA: 66] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2002] [Accepted: 11/27/2002] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The CyaB protein from Aeromonas hydrophila has been shown to possess adenylyl cyclase activity. While orthologs of this enzyme have been found in some bacteria and archaea, it shows no detectable relationship to the classical nucleotide cyclases. Furthermore, the actual biological functions of these proteins are not clearly understood because they are also present in organisms in which there is no evidence for cyclic nucleotide signaling. RESULTS We show that the CyaB like adenylyl cyclase and the mammalian thiamine triphosphatases define a novel superfamily of catalytic domains called the CYTH domain that is present in all three superkingdoms of life. Using multiple alignments and secondary structure predictions, we define the catalytic core of these enzymes to contain a novel alpha+beta scaffold with 6 conserved acidic residues and 4 basic residues. Using contextual information obtained from the analysis of gene neighborhoods and domain fusions, we predict that members of this superfamily may play a central role in the interface between nucleotide and polyphosphate metabolism. Additionally, based on contextual information, we identify a novel domain (called CHAD) that is predicted to functionally interact with the CYTH domain-containing enzymes in bacteria and archaea. The CHAD is predicted to be an alpha helical domain, and contains conserved histidines that may be critical for its function. CONCLUSIONS The phyletic distribution of the CYTH domain suggests that it is an ancient enzymatic domain that was present in the Last Universal Common Ancestor and was involved in nucleotide or organic phosphate metabolism. Based on the conservation of catalytic residues, we predict that CYTH domains are likely to chelate two divalent cations, and exhibit a reaction mechanism that is dependent on two metal ions, analogous to nucleotide cyclases, polymerases and certain phosphoesterases. Our analysis also suggests that the experimentally characterized members of this superfamily, namely adenylyl cyclase and thiamine triphosphatase, are secondary derivatives of proteins that performed an ancient role in polyphosphate and nucleotide metabolism.
Collapse
Affiliation(s)
- Lakshminarayan M Iyer
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - L Aravind
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| |
Collapse
|
247
|
Constantinesco F, Forterre P, Elie C. NurA, a novel 5'-3' nuclease gene linked to rad50 and mre11 homologs of thermophilic Archaea. EMBO Rep 2002; 3:537-42. [PMID: 12052775 PMCID: PMC1084146 DOI: 10.1093/embo-reports/kvf112] [Citation(s) in RCA: 55] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
We isolated and characterized a new nuclease (NurA) exhibiting both single-stranded endonuclease activity and 5'-3' exonuclease activity on single-stranded and double-stranded DNA from the hyperthermophilic archaeon Sulfolobus acidocaldarius. Nuclease homologs are detected in all thermophilic archaea and, in most species, the nurA gene is organized in an operon-like structure with rad50 and mre11 archaeal homologs. This nuclease might thus act in concert with Rad50 and Mre11 proteins in archaeal recombination/repair. To our knowledge, this is the first report of a 5'-3' nuclease potentially associated with Rad50 and Mre11-like proteins that may lead to the processing of double-stranded breaks in 3' single-stranded tails.
Collapse
Affiliation(s)
- Florence Constantinesco
- Institut de Génétique et Microbiologie, UMR CNRS no 8621, Bâtiment 409, Université Paris-Sud, 91405 Orsay cedex, France
| | | | | |
Collapse
|
248
|
Aravind L, Mazumder R, Vasudevan S, Koonin EV. Trends in protein evolution inferred from sequence and structure analysis. Curr Opin Struct Biol 2002; 12:392-9. [PMID: 12127460 DOI: 10.1016/s0959-440x(02)00334-2] [Citation(s) in RCA: 121] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
Abstract
Complementary developments in comparative genomics, protein structure determination and in-depth comparison of protein sequences and structures have provided a better understanding of the prevailing trends in the emergence and diversification of protein domains. The investigation of deep relationships among different classes of proteins involved in key cellular functions, such as nucleic acid polymerases and other nucleotide-dependent enzymes, indicates that a substantial set of diverse protein domains evolved within the primordial, ribozyme-dominated RNA world.
Collapse
Affiliation(s)
- L Aravind
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | | | | | | |
Collapse
|
249
|
Rogozin IB, Makarova KS, Murvai J, Czabarka E, Wolf YI, Tatusov RL, Szekely LA, Koonin EV. Connected gene neighborhoods in prokaryotic genomes. Nucleic Acids Res 2002; 30:2212-23. [PMID: 12000841 PMCID: PMC115289 DOI: 10.1093/nar/30.10.2212] [Citation(s) in RCA: 130] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
A computational method was developed for delineating connected gene neighborhoods in bacterial and archaeal genomes. These gene neighborhoods are not typically present, in their entirety, in any single genome, but are held together by overlapping, partially conserved gene arrays. The procedure was applied to comparing the orders of orthologous genes, which were extracted from the database of Clusters of Orthologous Groups of proteins (COGs), in 31 prokaryotic genomes and resulted in the identification of 188 clusters of gene arrays, which included 1001 of 2890 COGs. These clusters were projected onto actual genomes to produce extended neighborhoods including additional genes, which are adjacent to the genes from the clusters and are transcribed in the same direction, which resulted in a total of 2387 COGs being included in the neighborhoods. Most of the neighborhoods consist predominantly of genes united by a coherent functional theme, but also include a minority of genes without an obvious functional connection to the main theme. We hypothesize that although some of the latter genes might have unsuspected roles, others are maintained within gene arrays because of the advantage of expression at a level that is typical of the given neighborhood. We designate this phenomenon 'genomic hitchhiking'. The largest neighborhood includes 79 genes (COGs) and consists of overlapping, rearranged ribosomal protein superoperons; apparent genome hitchhiking is particularly typical of this neighborhood and other neighborhoods that consist of genes coding for translation machinery components. Several neighborhoods involve previously undetected connections between genes, allowing new functional predictions. Gene neighborhoods appear to evolve via complex rearrangement, with different combinations of genes from a neighborhood fixed in different lineages.
Collapse
Affiliation(s)
- Igor B Rogozin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | | | | | | | | | | | | | | |
Collapse
|
250
|
Slesarev AI, Mezhevaya KV, Makarova KS, Polushin NN, Shcherbinina OV, Shakhova VV, Belova GI, Aravind L, Natale DA, Rogozin IB, Tatusov RL, Wolf YI, Stetter KO, Malykh AG, Koonin EV, Kozyavkin SA. The complete genome of hyperthermophile Methanopyrus kandleri AV19 and monophyly of archaeal methanogens. Proc Natl Acad Sci U S A 2002; 99:4644-9. [PMID: 11930014 PMCID: PMC123701 DOI: 10.1073/pnas.032671499] [Citation(s) in RCA: 239] [Impact Index Per Article: 10.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2001] [Accepted: 12/14/2001] [Indexed: 11/18/2022] Open
Abstract
We have determined the complete 1,694,969-nt sequence of the GC-rich genome of Methanopyrus kandleri by using a whole direct genome sequencing approach. This approach is based on unlinking of genomic DNA with the ThermoFidelase version of M. kandleri topoisomerase V and cycle sequencing directed by 2'-modified oligonucleotides (Fimers). Sequencing redundancy (3.3x) was sufficient to assemble the genome with less than one error per 40 kb. Using a combination of sequence database searches and coding potential prediction, 1,692 protein-coding genes and 39 genes for structural RNAs were identified. M. kandleri proteins show an unusually high content of negatively charged amino acids, which might be an adaptation to the high intracellular salinity. Previous phylogenetic analysis of 16S RNA suggested that M. kandleri belonged to a very deep branch, close to the root of the archaeal tree. However, genome comparisons indicate that, in both trees constructed using concatenated alignments of ribosomal proteins and trees based on gene content, M. kandleri consistently groups with other archaeal methanogens. M. kandleri shares the set of genes implicated in methanogenesis and, in part, its operon organization with Methanococcus jannaschii and Methanothermobacter thermoautotrophicum. These findings indicate that archaeal methanogens are monophyletic. A distinctive feature of M. kandleri is the paucity of proteins involved in signaling and regulation of gene expression. Also, M. kandleri appears to have fewer genes acquired via lateral transfer than other archaea. These features might reflect the extreme habitat of this organism.
Collapse
|