101
Toffano-Nioche C, Nguyen AN, Kuchly C, Ott A, Gautheret D, Bouloc P, Jacq A. Transcriptomic profiling of the oyster pathogen Vibrio splendidus opens a window on the evolutionary dynamics of the small RNA repertoire in the Vibrio genus. RNA (NEW YORK, N.Y.) 2012; 18:2201-2219. [PMID: 23097430 PMCID: PMC3504672 DOI: 10.1261/rna.033324.112] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.7] [Received: 03/21/2012] [Accepted: 09/08/2012] [Indexed: 06/01/2023]
Abstract
Work in recent years has led to the recognition of the importance of small regulatory RNAs (sRNAs) in bacterial regulation networks. New high-throughput sequencing technologies are paving the way to the exploration of an expanding sRNA world in nonmodel bacteria. In the Vibrio genus, compared to the Enterobacteriaceae, only a limited number of sRNAs have been characterized, mostly in Vibrio cholerae, where they have been shown to be important for virulence, as well as in Vibrio harveyi. In addition, genome-wide approaches in V. cholerae have led to the discovery of hundreds of potential new sRNAs. Vibrio splendidus is an oyster pathogen that has been recently associated with massive mortality episodes in the French oyster growing industry. Here, we report the first RNA-seq study in a Vibrio outside of the V. cholerae species. We have uncovered hundreds of candidate regulatory RNAs, including cis-regulatory elements, antisense RNAs, and trans-encoded sRNAs. Conservation studies showed the majority of them to be specific to V. splendidus. However, several previously unidentified sRNAs are also present in V. cholerae. Finally, we identified 28 trans sRNAs that are conserved in all the Vibrio genus species for which a complete genome sequence is available, possibly forming a Vibrio "sRNA core."
Affiliation(s)
- Claire Toffano-Nioche
- Institut de Génétique et Microbiologie, CNRS/UMR 8621, IFR115, Université Paris-Sud, Bâtiment 400, 91405 Orsay Cedex, France
- An N. Nguyen
- Institut de Génétique et Microbiologie, CNRS/UMR 8621, IFR115, Université Paris-Sud, Bâtiment 400, 91405 Orsay Cedex, France
- Claire Kuchly
- Institut de Génétique et Microbiologie, CNRS/UMR 8621, IFR115, Université Paris-Sud, Bâtiment 400, 91405 Orsay Cedex, France
- Alban Ott
- Institut de Génétique et Microbiologie, CNRS/UMR 8621, IFR115, Université Paris-Sud, Bâtiment 400, 91405 Orsay Cedex, France
- Daniel Gautheret
- Institut de Génétique et Microbiologie, CNRS/UMR 8621, IFR115, Université Paris-Sud, Bâtiment 400, 91405 Orsay Cedex, France
- Philippe Bouloc
- Institut de Génétique et Microbiologie, CNRS/UMR 8621, IFR115, Université Paris-Sud, Bâtiment 400, 91405 Orsay Cedex, France
- Annick Jacq
- Institut de Génétique et Microbiologie, CNRS/UMR 8621, IFR115, Université Paris-Sud, Bâtiment 400, 91405 Orsay Cedex, France
102
Darby AC, Armstrong SD, Bah GS, Kaur G, Hughes MA, Kay SM, Koldkjær P, Rainbow L, Radford AD, Blaxter ML, Tanya VN, Trees AJ, Cordaux R, Wastling JM, Makepeace BL. Analysis of gene expression from the Wolbachia genome of a filarial nematode supports both metabolic and defensive roles within the symbiosis. Genome Res 2012; 22:2467-77. [PMID: 22919073 PMCID: PMC3514676 DOI: 10.1101/gr.138420.112] [Citation(s) in RCA: 141] [Impact Index Per Article: 11.8] [Received: 02/01/2012] [Accepted: 07/27/2012] [Indexed: 02/06/2023]
Abstract
The α-proteobacterium Wolbachia is probably the most prevalent, vertically transmitted symbiont on Earth. In contrast with its wide distribution in arthropods, Wolbachia is restricted to one family of animal-parasitic nematodes, the Onchocercidae. This includes filarial pathogens such as Onchocerca volvulus, the cause of human onchocerciasis, or river blindness. The symbiosis between filariae and Wolbachia is obligate, although the basis of this dependency is not fully understood. Previous studies suggested that Wolbachia may provision metabolites (e.g., haem, riboflavin, and nucleotides) and/or contribute to immune defense. Importantly, Wolbachia is restricted to somatic tissues in adult male worms, whereas females also harbor bacteria in the germline. We sought to characterize the nature of the symbiosis between Wolbachia and O. ochengi, a bovine parasite representing the closest relative of O. volvulus. First, we sequenced the complete genome of Wolbachia strain wOo, which revealed an inability to synthesize riboflavin de novo. Using RNA-seq, we also generated endobacterial transcriptomes from male soma and female germline. In the soma, transcripts for membrane transport and respiration were up-regulated, while the gonad exhibited enrichment for DNA replication and translation. The most abundant Wolbachia proteins, as determined by geLC-MS, included ligands for mammalian Toll-like receptors. Enzymes involved in nucleotide synthesis were dominant among metabolism-related proteins, whereas the haem biosynthetic pathway was poorly represented. We conclude that Wolbachia may have a mitochondrion-like function in the soma, generating ATP for its host. Moreover, the abundance of immunogenic proteins in wOo suggests a role in diverting the immune system toward an ineffective antibacterial response.
Affiliation(s)
- Alistair C. Darby
- Institute of Integrative Biology and the Centre for Genomic Research, Biosciences Building, University of Liverpool, Liverpool, Merseyside L69 7ZB, United Kingdom
- Stuart D. Armstrong
- Institute of Infection & Global Health, Liverpool Science Park IC2, University of Liverpool, Liverpool, Merseyside L3 5RF, United Kingdom
- Germanus S. Bah
- Institute of Infection & Global Health, Liverpool Science Park IC2, University of Liverpool, Liverpool, Merseyside L3 5RF, United Kingdom
- Institut de Recherche Agricole pour le Développement, Regional Centre of Wakwa, Ngaoundéré, BP65 Adamawa Region, Cameroon
- Gaganjot Kaur
- Institute of Evolutionary Biology and the GenePool Genomics Facility, School of Biological Sciences, University of Edinburgh, Edinburgh EH9 3JT, United Kingdom
- Margaret A. Hughes
- Institute of Integrative Biology and the Centre for Genomic Research, Biosciences Building, University of Liverpool, Liverpool, Merseyside L69 7ZB, United Kingdom
- Suzanne M. Kay
- Institute of Integrative Biology and the Centre for Genomic Research, Biosciences Building, University of Liverpool, Liverpool, Merseyside L69 7ZB, United Kingdom
- Pia Koldkjær
- Institute of Integrative Biology and the Centre for Genomic Research, Biosciences Building, University of Liverpool, Liverpool, Merseyside L69 7ZB, United Kingdom
- Lucille Rainbow
- Institute of Integrative Biology and the Centre for Genomic Research, Biosciences Building, University of Liverpool, Liverpool, Merseyside L69 7ZB, United Kingdom
- Alan D. Radford
- Institute of Infection & Global Health, Liverpool Science Park IC2, University of Liverpool, Liverpool, Merseyside L3 5RF, United Kingdom
- Mark L. Blaxter
- Institute of Evolutionary Biology and the GenePool Genomics Facility, School of Biological Sciences, University of Edinburgh, Edinburgh EH9 3JT, United Kingdom
- Vincent N. Tanya
- Institut de Recherche Agricole pour le Développement, Regional Centre of Wakwa, Ngaoundéré, BP65 Adamawa Region, Cameroon
- Alexander J. Trees
- Institute of Infection & Global Health, Liverpool Science Park IC2, University of Liverpool, Liverpool, Merseyside L3 5RF, United Kingdom
- Richard Cordaux
- Laboratoire Ecologie et Biologie des Interactions (UMR CNRS 7267), Equipe Ecologie Evolution Symbiose, Université de Poitiers, 86022 Poitiers CEDEX, France
- Jonathan M. Wastling
- Institute of Infection & Global Health, Liverpool Science Park IC2, University of Liverpool, Liverpool, Merseyside L3 5RF, United Kingdom
- Benjamin L. Makepeace
- Institute of Infection & Global Health, Liverpool Science Park IC2, University of Liverpool, Liverpool, Merseyside L3 5RF, United Kingdom
103
Campanaro S, Pascale FD, Telatin A, Schiavon R, Bartlett DH, Valle G. The transcriptional landscape of the deep-sea bacterium Photobacterium profundum in both a toxR mutant and its parental strain. BMC Genomics 2012; 13:567. [PMID: 23107454 PMCID: PMC3505737 DOI: 10.1186/1471-2164-13-567] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.9] [Received: 08/21/2012] [Accepted: 10/16/2012] [Indexed: 02/08/2023]
Abstract
BACKGROUND The deep-sea bacterium Photobacterium profundum is an established model for studying high pressure adaptation. In this paper we analyse the parental strain DB110 and the toxR mutant TW30 by massively parallel cDNA sequencing (RNA-seq). ToxR is a transmembrane DNA-binding protein first discovered in Vibrio cholerae, where it regulates a considerable number of genes involved in environmental adaptation and virulence. In P. profundum the abundance and activity of this protein are influenced by hydrostatic pressure, and its role is related to the regulation of genes in a pressure-dependent manner. RESULTS To better characterize the ToxR regulon, we compared the expression profiles of wt and toxR strains in response to pressure changes. Our results revealed a complex expression pattern with a group of 22 genes having expression profiles similar to that of OmpH, an outer membrane protein transcribed in response to high hydrostatic pressure. Moreover, RNA-seq allowed a deep characterization of the transcriptional landscape that led to the identification of 460 putative small RNA genes and the detection of 298 previously unknown protein-coding genes. We were also able to perform a genome-wide prediction of operon structure and of transcription start and termination sites, revealing an unexpectedly high number of genes (992) with large 5′-UTRs, long enough to harbour cis-regulatory RNA structures, suggesting a correlation between intergenic region size and UTR length. CONCLUSION This work led to a better understanding of the high-pressure response in P. profundum. Furthermore, the high-resolution RNA-seq analysis revealed several unexpected features of the transcriptional landscape and of the general mechanisms controlling bacterial gene expression.
Affiliation(s)
- Stefano Campanaro
- Department of Biology and CRIBI Biotechnology Centre, University of Padua, Via Ugo Bassi 58/B, Padova 35131, Italy.
104
Patenge N, Billion A, Raasch P, Normann J, Wisniewska-Kucper A, Retey J, Boisguérin V, Hartsch T, Hain T, Kreikemeyer B. Identification of novel growth phase- and media-dependent small non-coding RNAs in Streptococcus pyogenes M49 using intergenic tiling arrays. BMC Genomics 2012; 13:550. [PMID: 23062031 PMCID: PMC3542284 DOI: 10.1186/1471-2164-13-550] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.4] [Received: 07/04/2012] [Accepted: 10/10/2012] [Indexed: 12/27/2022]
Abstract
BACKGROUND Small non-coding RNAs (sRNAs) have attracted attention as a new class of gene regulators in both eukaryotes and bacteria. Genome-wide screening methods have been successfully applied in Gram-negative bacteria to identify sRNA regulators. Many sRNAs are well characterized, including their target mRNAs and mode of action. In comparison, little is known about sRNAs in Gram-positive pathogens. In this study, we identified novel sRNAs in the exclusively human pathogen Streptococcus pyogenes M49 (Group A Streptococcus, GAS M49), employing a whole genome intergenic tiling array approach. GAS is an important pathogen that causes diseases ranging from mild superficial infections of the skin and mucous membranes of the naso-pharynx, to severe toxic and invasive diseases. RESULTS We identified 55 putative sRNAs in GAS M49 that were expressed during growth. Of these, 42 were novel. Some of the newly-identified sRNAs belonged to one of the common non-coding RNA families described in the Rfam database. Comparison of the results of our screen with the outcome of two recently published bioinformatics tools showed a low level of overlap between putative sRNA genes. Previously, 40 potential sRNAs have been reported to be expressed in a GAS M1T1 serotype, as detected by a whole genome intergenic tiling array approach. Our screen detected 12 putative sRNA genes that were expressed in both strains. Twenty sRNA candidates appeared to be regulated in a medium-dependent fashion, while eight sRNA genes were regulated throughout growth in chemically defined medium. Expression of candidate genes was verified by reverse transcriptase-qPCR. For a subset of sRNAs, the transcriptional start was determined by 5' rapid amplification of cDNA ends-PCR (RACE-PCR) analysis. 
CONCLUSIONS In accord with the results of previous studies, we found little overlap between different screening methods, which underlines the fact that a comprehensive analysis of sRNAs expressed by a given organism requires the complementary use of different methods and the investigation of several environmental conditions. Despite a high conservation of sRNA genes within streptococci, the expression of sRNAs appears to be strain specific.
Affiliation(s)
- Nadja Patenge
- Institute of Medical Microbiology and Hospital Hygiene, University of Rostock, Schillingallee 70, 18057, Rostock, Germany
105
The alternative translational profile that underlies the immune-evasive state of persistence in Chlamydiaceae exploits differential tryptophan contents of the protein repertoire. Microbiol Mol Biol Rev 2012; 76:405-43. [PMID: 22688818 DOI: 10.1128/mmbr.05013-11] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.7] [Indexed: 11/20/2022]
Abstract
One form of immune evasion is a developmental state called "persistence" whereby chlamydial pathogens respond to the host-mediated withdrawal of L-tryptophan (Trp). A sophisticated survival mode of reversible quiescence is implemented. A mechanism has evolved which suppresses gene products necessary for rapid pathogen proliferation but allows expression of gene products that underlie the morphological and developmental characteristics of persistence. This switch from one translational profile to an alternative translational profile of newly synthesized proteins is proposed to be accomplished by maximizing the Trp content of some proteins needed for rapid proliferation (e.g., ADP/ATP translocase, hexose-phosphate transporter, phosphoenolpyruvate [PEP] carboxykinase, the Trp transporter, the Pmp protein superfamily for cell adhesion and antigenic variation, and components of the cell division pathway) while minimizing the Trp content of other proteins supporting the state of persistence. The Trp starvation mechanism is best understood in the human-Chlamydia trachomatis relationship, but the similarity of up-Trp and down-Trp proteomic profiles in all of the pathogenic Chlamydiaceae suggests that Trp availability is an underlying cue relied upon by this family of pathogens to trigger developmental transitions. The biochemically expensive pathogen strategy of selectively increased Trp usage to guide the translational profile can be leveraged significantly with minimal overall Trp usage by (i) regional concentration of Trp residue placements, (ii) amplified Trp content of a single protein that is required for expression or maturation of multiple proteins with low Trp content, and (iii) Achilles'-heel vulnerabilities of complex pathways to high Trp content of one or a few enzymes.
106
Kuchipudi SV, Tellabati M, Nelli RK, White GA, Perez BB, Sebastian S, Slomka MJ, Brookes SM, Brown IH, Dunham SP, Chang KC. 18S rRNA is a reliable normalisation gene for real time PCR based on influenza virus infected cells. Virol J 2012; 9:230. [PMID: 23043930 PMCID: PMC3499178 DOI: 10.1186/1743-422x-9-230] [Citation(s) in RCA: 114] [Impact Index Per Article: 9.5] [Received: 04/12/2012] [Accepted: 10/05/2012] [Indexed: 01/04/2023]
Abstract
BACKGROUND One requisite of quantitative reverse transcription PCR (qRT-PCR) is to normalise the data with an internal reference gene that is invariant regardless of treatment, such as virus infection. Several studies have found variability in the expression of commonly used housekeeping genes, such as beta-actin (ACTB) and glyceraldehyde-3-phosphate dehydrogenase (GAPDH), under different experimental settings. However, ACTB and GAPDH remain widely used in the studies of host gene response to virus infections, including influenza viruses. To date no detailed study has been described that compares the suitability of commonly used housekeeping genes in influenza virus infections. The present study evaluated several commonly used housekeeping genes [ACTB, GAPDH, 18S ribosomal RNA (18S rRNA), ATP synthase, H+ transporting, mitochondrial F1 complex, beta polypeptide (ATP5B) and ATP synthase, H+ transporting, mitochondrial Fo complex, subunit C1 (subunit 9) (ATP5G1)] to identify the most stably expressed gene in human, pig, chicken and duck cells infected with a range of influenza A virus subtypes. RESULTS The relative expression stability of commonly used housekeeping genes was determined in primary human bronchial epithelial cells (HBECs), pig tracheal epithelial cells (PTECs), and chicken and duck primary lung-derived cells infected with five influenza A virus subtypes. Analysis of qRT-PCR data from virus and mock infected cells using NormFinder and BestKeeper software programmes found that 18S rRNA was the most stable gene in HBECs, PTECs and avian lung cells. CONCLUSIONS Based on the presented data from cell culture models (HBECs, PTECs, chicken and duck lung cells) infected with a range of influenza viruses, we found that 18S rRNA is the most stable reference gene for normalising qRT-PCR data. Expression levels of the other housekeeping genes evaluated in this study (including ACTB and GAPDH) were highly affected by influenza virus infection and hence are not reliable as reference genes for RNA normalisation.
Affiliation(s)
- Suresh V Kuchipudi
- School of Veterinary Medicine and Science, University of Nottingham, Sutton Bonington Campus, College Road, Loughborough, Leicestershire LE12 5RD, UK.
107
Arakawa K, Tomita M. Measures of compositional strand bias related to replication machinery and its applications. Curr Genomics 2012; 13:4-15. [PMID: 22942671 PMCID: PMC3269016 DOI: 10.2174/138920212799034749] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.3] [Received: 07/23/2011] [Revised: 09/10/2011] [Accepted: 09/20/2011] [Indexed: 11/22/2022]
Abstract
The compositional asymmetry of complementary bases in nucleotide sequences implies the existence of a mutational or selectional bias in the two strands of the DNA duplex, which is commonly shaped by strand-specific mechanisms in transcription or replication. Such strand bias in genomes, frequently visualized by GC skew graphs, is used for the computational prediction of transcription start sites and replication origins, as well as for comparative evolutionary genomics studies. The use of measures of compositional strand bias in order to quantify the degree of strand asymmetry is crucial, as it is the basis for determining the applicability of compositional analysis and comparing the strength of the mutational bias in different biological machineries in various species. Here, we review the measures of strand bias that have been proposed to date, including the ∆GC skew, the B1 index, the predictability score of linear discriminant analysis for gene orientation, the signal-to-noise ratio of the oligonucleotide bias, and the GC skew index. These measures have been predominantly designed for and applied to the analysis of replication-related mutational processes in prokaryotes, but we also give research examples in eukaryotes.
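As a rough illustration of the GC skew graphs mentioned in this abstract, the sketch below computes the commonly used windowed skew, (G − C) / (G + C), and its running sum along a sequence. It is not taken from the reviewed paper: the function names, the window and step sizes, and the toy sequence are all assumptions made for the example.

```python
from typing import List, Tuple


def gc_skew(seq: str, window: int = 1000, step: int = 1000) -> List[Tuple[int, float]]:
    """Return (window_start, skew) pairs, with skew = (G - C) / (G + C)."""
    seq = seq.upper()
    result = []
    # max(..., 1) guarantees one window even when the sequence is shorter than `window`.
    for start in range(0, max(len(seq) - window + 1, 1), step):
        chunk = seq[start:start + window]
        g = chunk.count("G")
        c = chunk.count("C")
        # A window without any G or C has no defined skew; report 0.0 (an assumption).
        skew = (g - c) / (g + c) if (g + c) else 0.0
        result.append((start, skew))
    return result


def cumulative_gc_skew(seq: str, window: int = 1000, step: int = 1000) -> List[Tuple[int, float]]:
    """Return (window_start, running sum of skew) pairs along the sequence."""
    total = 0.0
    result = []
    for start, skew in gc_skew(seq, window, step):
        total += skew
        result.append((start, total))
    return result


if __name__ == "__main__":
    toy = "GGGGCCCC"  # hypothetical toy sequence, not real genomic data
    for start, skew in gc_skew(toy, window=4, step=4):
        print(start, skew)
```

In practice the extremes of the cumulative curve are often inspected as candidate replication origin and terminus positions, but which extreme corresponds to which feature depends on the genome and should be checked against other evidence.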
Affiliation(s)
- Kazuharu Arakawa
- Institute for Advanced Biosciences, Keio University, Fujisawa 252-8520, Japan
108
Lin Q, Cui P, Ding F, Hu S, Yu J. Replication-Associated Mutational Pressure (RMP) Governs Strand-Biased Compositional Asymmetry (SCA) and Gene Organization in Animal Mitochondrial Genomes. Curr Genomics 2012; 13:28-36. [PMID: 22942673 PMCID: PMC3269014 DOI: 10.2174/138920212799034811] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Received: 08/11/2011] [Revised: 10/01/2011] [Accepted: 10/04/2011] [Indexed: 11/30/2022]
Abstract
The nucleotide composition of the light (L-) and heavy (H-) strands of animal mitochondrial genomes is known to exhibit strand-biased compositional asymmetry (SCA). One possible explanation is the existence of a replication-associated mutational pressure (RMP) that may introduce characteristic nucleotide changes among mitochondrial genomes of different animal lineages. Here, we discuss the influence of RMP on nucleotide and amino acid compositions as well as gene organization. Among animal mitochondrial genomes, RMP may represent the major force that compels the evolution of mitochondrial protein-coding genes, coupled with other process-based selective pressures, such as those on components of the translation machinery (tRNAs and their anticodons). Through comparative analyses of sequenced mitochondrial genomes among diverse animal lineages and literature reviews, we suggest a strong RMP effect, observed among invertebrate mitochondrial genes as compared to those of vertebrates, that is either a result of positive selection on the invertebrate or a relaxed selective pressure on the vertebrate mitochondrial genes.
Affiliation(s)
- Qiang Lin
- CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, 100029 Beijing, China
109
Retchless AC, Lawrence JG. Ecological adaptation in bacteria: speciation driven by codon selection. Mol Biol Evol 2012; 29:3669-83. [PMID: 22740635 DOI: 10.1093/molbev/mss171] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.4] [Indexed: 01/01/2023]
Abstract
In bacteria, physiological change may be effected by a single gene acquisition, producing ecological differentiation without genetic isolation. Natural selection acting on such differences can reduce the frequency of genotypes that arise from recombination at these loci. However, gene acquisition can only account for recombination interference in the fraction of the genome that is tightly linked to the integration site. To identify additional loci that contribute to adaptive differences, we examined orthologous genes in species of Enterobacteriaceae to identify significant differences in the degree of codon selection. Significance was assessed using the Adaptive Codon Enrichment metric, which accounts for the variation in codon usage bias that is expected to arise from mutation and drift; large differences in codon usage bias were identified in more genes than would be expected to arise from stochastic processes alone. Genes in the same operon showed parallel differences in codon usage bias, suggesting that changes in the overall levels of gene expression led to changes in the degree of adaptive codon usage. Most significant differences between orthologous operons were found among those involved with specific environmental adaptations, whereas "housekeeping" genes rarely showed significant changes. When considered together, the loci experiencing significant changes in codon selection outnumber potentially adaptive gene acquisition events. The identity of genes under strong codon selection seems to be influenced by the habitat from which the bacteria were isolated. We propose a two-stage model for how adaptation to different selective regimes can drive bacterial speciation. Initially, gene acquisitions catalyze rapid ecological differentiation, which modifies the utilization of genes, thereby changing the strength of codon selection on them. 
Alleles develop fitness variation by substitution, producing recombination interference at these loci in addition to those flanking acquired genes, allowing sequences to diverge across the entire genome and establishing genetic isolation (i.e., protection from frequent homologous recombination).
Affiliation(s)
- Adam C Retchless
- Department of Biological Sciences, University of Pittsburgh, USA
110
Mao X, Zhang H, Yin Y, Xu Y. The percentage of bacterial genes on leading versus lagging strands is influenced by multiple balancing forces. Nucleic Acids Res 2012; 40:8210-8. [PMID: 22735706 PMCID: PMC3458553 DOI: 10.1093/nar/gks605] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.8] [Indexed: 01/29/2023]
Abstract
The majority of bacterial genes are located on the leading strand, and the percentage of such genes varies widely across different bacteria. Although some explanations have been proposed, these are at most partial explanations as they cover only small percentages of the genes and do not even consider the ones biased toward the lagging strand. We have carried out a computational study on 725 bacterial genomes, aiming to elucidate other factors that may have influenced the strand location of genes in a bacterium. Our analyses suggest that (i) genes of some functional categories, such as ribosomal genes, have a higher preference for the leading strand; (ii) genes of some functional categories, such as transcription factors, have a higher preference for the lagging strand; (iii) there is a balancing force that tends to keep genes from all moving to the leading and more efficient strand; and (iv) the percentage of leading-strand genes in a bacterium can be accurately explained based on the numbers of genes in the functional categories outlined in (i) and (ii), genome size and gene density, indicating that these numbers implicitly contain the information about the percentage of genes on the leading versus lagging strand in a genome.
Affiliation(s)
- Xizeng Mao
- Computational Systems Biology Lab, Department of Biochemistry and Molecular Biology and Institute of Bioinformatics, University of Georgia, Athens, GA 30605, USA
111
Preventing replication stress to maintain genome stability: resolving conflicts between replication and transcription. Mol Cell 2012; 45:710-8. [PMID: 22464441 DOI: 10.1016/j.molcel.2012.03.001] [Citation(s) in RCA: 143] [Impact Index Per Article: 11.9] [Received: 12/12/2011] [Revised: 02/15/2012] [Accepted: 03/07/2012] [Indexed: 11/22/2022]
Abstract
DNA and RNA polymerases clash along the genome as they compete for the same DNA template. Cells have evolved specialized strategies to prevent and resolve replication and transcription interference. Here, we review the topology and architecture at sites of replication fork clashes with transcription bubbles as well as the regulatory circuits that control replication fork passage across transcribed genes. In the case of RNA polymerase II-transcribed genes, cotranscriptional processes such as mRNA maturation, splicing, and export influence the integrity of replication forks and transcribed loci. Fork passage likely contributes to reset the epigenetic landscape, influencing gene expression and transcriptional memory. When any of these processes are not properly coordinated, aberrant outcomes such as fork reversal and R-loop formation arise and trigger unscheduled recombinogenic events and genome rearrangements. The evolutionary implications of such conflicts on genome dynamics and their potential impact on oncogenic stress are discussed.
112
[Current status of theoretical studies on essential genes in microbes]. YI CHUAN = HEREDITAS 2012; 34:420-30. [PMID: 22522159 DOI: 10.3724/sp.j.1005.2012.00420] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Indexed: 12/31/2022]
Abstract
Essential genes are indispensable for the survival of an organism under optimal conditions. The study of essential genes has recently become a prominent topic in microbiology, genomics, and bioinformatics. This paper describes the experiments that have determined essential genes in several microbes and reviews theoretical research on essential genes. The main topics are the comparison of essential and non-essential genes based on evolutionary conservation and sequence composition, the in silico prediction of essential genes, and the analysis of the chromosomal distribution of essential genes. Finally, recent progress is summarized and open problems are pointed out.
113
Dutta C, Paul S. Microbial lifestyle and genome signatures. Curr Genomics 2012; 13:153-62. [PMID: 23024607 PMCID: PMC3308326 DOI: 10.2174/138920212799860698] [Citation(s) in RCA: 55] [Impact Index Per Article: 4.6] [Received: 08/18/2011] [Revised: 09/13/2011] [Accepted: 09/28/2011] [Indexed: 12/29/2022]
Abstract
Microbes are known for their unique ability to adapt to varying lifestyles and environments, even extreme or adverse ones. The genomic architecture of a microbe may bear the signatures not only of its phylogenetic position, but also of the kind of lifestyle to which it is adapted. The present review aims to provide an account of the specific genome signatures observed in microbes acclimatized to distinct lifestyles or ecological niches. Niche-specific signatures identified at different levels of microbial genome organization, such as base composition, GC-skew, purine-pyrimidine ratio, dinucleotide abundance, codon bias, and oligonucleotide composition, are discussed. Among the specific cases highlighted in the review are the phenomena of genome shrinkage in obligatory host-restricted microbes, genome expansion in strictly intra-amoebal pathogens, strand-specific codon usage in intracellular species, acquisition of genome islands in pathogenic or symbiotic organisms, discriminatory genomic traits of marine microbes with distinct trophic strategies, and conspicuous sequence features of certain extremophiles like those adapted to high temperature or high salinity.
Affiliation(s)
- Chitra Dutta
- Structural Biology & Bioinformatics Division, CSIR- Indian Institute of Chemical Biology, 4, Raja S. C. Mullick Road, Kolkata 700032, India
114
Khrustalev VV, Barkovsky EV. A blueprint for a mutationist theory of replicative strand asymmetries formation. Curr Genomics 2012; 13:55-64. [PMID: 22942675 PMCID: PMC3269017 DOI: 10.2174/138920212799034730] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Received: 07/01/2011] [Revised: 09/15/2011] [Accepted: 09/29/2011] [Indexed: 11/26/2022]
Abstract
In the present review, we summarized current knowledge on replicative strand asymmetries in prokaryotic genomes. A cornerstone for the creation of a theory of their formation has been overviewed. According to our recent works, the probability of nonsense mutation caused by replication-associated mutational pressure is higher for genes from lagging strands than for genes from leading strands of both bacterial and archaeal genomes. Lower density of open reading frames in lagging strands can be explained by faster rates of nonsense mutations in genes situated on them. According to the asymmetries in nucleotide usage in fourfold and twofold degenerate sites, the direction of replication-associated mutational pressure for genes from lagging strands is usually the same as the direction of transcription-associated mutational pressure. It means that lagging strands should accumulate more 8-oxo-G, uracil and 5-formyl-uracil, respectively. In our opinion, consequences of cytosine deamination (C to T transitions) do not lead to the decrease of cytosine usage in genes from lagging strands because of the consequences of thymine oxidation (T to C transitions), while guanine oxidation (causing G to T transversions) makes the main contribution into the decrease of guanine usage in fourfold degenerate sites of genes from lagging strands. Nucleotide usage asymmetries and bias in density of coding regions can be found in archaeal genomes, although, the percent of "inversed" asymmetries is much higher for them than for bacterial genomes. "Homogenized" and "inversed" replicative strand asymmetries in archaeal genomes can be used as retrospective indexes for detection of OriC translocations and large inversions.
Affiliation(s)
- Vladislav V Khrustalev
- Department of General Chemistry, Belarussian State Medical University, Belarus, Minsk, Dzerzinskogo, 83, Russia
|
115
|
Jha JK, Baek JH, Venkova-Canova T, Chattoraj DK. Chromosome dynamics in multichromosome bacteria. BIOCHIMICA ET BIOPHYSICA ACTA-GENE REGULATORY MECHANISMS 2012; 1819:826-9. [PMID: 22306663 DOI: 10.1016/j.bbagrm.2012.01.012] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.8] [Received: 11/02/2011] [Revised: 01/10/2012] [Accepted: 01/17/2012] [Indexed: 12/23/2022]
Abstract
On the basis of limited information, bacteria were once assumed to have no more than one chromosome. In the era of genomics, it has become clear that some, like eukaryotes, have more than one chromosome. Multichromosome bacteria provide opportunities to investigate how split genomes emerged, whether the individual chromosomes communicate to coordinate their replication and segregation, and what selective advantages split genomes might provide. Our current knowledge of these topics comes mostly from studies in Vibrio cholerae, which has two chromosomes, chr1 and chr2. Chr1 carries out most of the house-keeping functions and is considered the main chromosome, whereas chr2 appears to have originated from a plasmid and has acquired genes of mostly unknown origin and function. Nevertheless, unlike plasmids, chr2 replicates once and only once per cell cycle, like a bona fide chromosome. The two chromosomes replicate and segregate using separate programs, unlike eukaryotic chromosomes. They terminate replication synchronously, suggesting that there might be communication between them. Replication of the chromosomes is affected by segregation genes but in a chromosome specific fashion, a new development in the field of DNA replication control. The split genome allows genome duplication to complete in less time and with fewer replication forks, which could be beneficial for genome maintenance during rapid growth, which is the norm for V. cholerae in broth cultures and in the human host. In the latter, the expression of chr2 genes increases preferentially. Studies of chromosome maintenance in multichromosomal bacteria, although in their infancy, are already broadening our view of chromosome biology. This article is part of a Special Issue entitled: Chromatin in time and space.
Affiliation(s)
- Jyoti K Jha
- Laboratory of Molecular Biology and Biochemistry, Center for Cancer Research, NCI, NIH, Bethesda, MD, USA
|
116
|
Grant MAA, Saggioro C, Ferrari U, Bassetti B, Sclavi B, Cosentino Lagomarsino M. DnaA and the timing of chromosome replication in Escherichia coli as a function of growth rate. BMC SYSTEMS BIOLOGY 2011; 5:201. [PMID: 22189092 PMCID: PMC3309966 DOI: 10.1186/1752-0509-5-201] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.3] [Received: 08/03/2011] [Accepted: 12/21/2011] [Indexed: 11/16/2022]
Abstract
Background: In Escherichia coli, overlapping rounds of DNA replication allow the bacteria to double in faster times than the time required to copy the genome. The precise timing of initiation of DNA replication is determined by a regulatory circuit that depends on the binding of a critical number of ATP-bound DnaA proteins at the origin of replication, resulting in the melting of the DNA and the assembly of the replication complex. The synthesis of DnaA in the cell is controlled by a growth-rate dependent, negatively autoregulated gene found near the origin of replication. Both the regulatory and initiation activity of DnaA depend on its nucleotide bound state and its availability. Results: In order to investigate the contributions of the different regulatory processes to the timing of initiation of DNA replication at varying growth rates, we formulate a minimal quantitative model of the initiator circuit that includes the key ingredients known to regulate the activity of the DnaA protein. This model describes the average-cell oscillations in DnaA-ATP/DNA during the cell cycle, for varying growth rates. We evaluate the conditions under which this ratio attains the same threshold value at the time of initiation, independently of the growth rate. Conclusions: We find that a quantitative description of replication initiation by DnaA must rely on the dependency of the basic parameters on growth rate, in order to account for the timing of initiation of DNA replication at different cell doubling times. We isolate two main possible scenarios for this, depending on the roles of DnaA autoregulation and DnaA ATP-hydrolysis regulatory process. One possibility is that the basal rate of regulatory inactivation by ATP hydrolysis must vary with growth rate. Alternatively, some parameters defining promoter activity need to be a function of the growth rate. In either case, the basal rate of gene expression needs to increase with the growth rate, in accordance with the known characteristics of the dnaA promoter. Furthermore, both inactivation and autorepression reduce the amplitude of the cell-cycle oscillations of DnaA-ATP/DNA.
Affiliation(s)
- Matthew A A Grant
- BSS Group, Department of Physics, University of Cambridge, JJ Thomson Avenue, Cambridge, CB3 0HE, UK
|
117
|
Williams LE, Wernegreen JJ. Purifying selection, sequence composition, and context-specific indel mutations shape intraspecific variation in a bacterial endosymbiont. Genome Biol Evol 2011; 4:44-51. [PMID: 22117087 PMCID: PMC3268670 DOI: 10.1093/gbe/evr128] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.0] [Indexed: 01/25/2023]
Abstract
Comparative genomics of closely related bacterial strains can clarify mutational processes and selective forces that impact genetic variation. Among primary bacterial endosymbionts of insects, such analyses have revealed ongoing genome reduction, raising questions about the ultimate evolutionary fate of these partnerships. Here, we explored genomic variation within Blochmannia vafer, an obligate mutualist of the ant Camponotus vafer. Polymorphism analysis of the Illumina data set used previously for de novo assembly revealed a second Bl. vafer genotype. To determine why a single ant colony contained two symbiont genotypes, we examined polymorphisms in 12 C. vafer mitochondrial sequences assembled from the Illumina data; the spectrum of variants suggests that the colony contained two maternal lineages, each harboring a distinct Bl. vafer genotype. Comparing the two Bl. vafer genotypes revealed that purifying selection purged most indels and nonsynonymous differences from protein-coding genes. We also discovered that indels occur frequently in multimeric simple sequence repeats, which are relatively abundant in Bl. vafer and may play a more substantial role in generating variation in this ant mutualist than in the aphid endosymbiont Buchnera. Finally, we explored how an apparent relocation of the origin of replication in Bl. vafer and the resulting shift in strand-associated mutational pressures may have caused accelerated gene loss and an elevated rate of indel polymorphisms in the region spanning the origin relocation. Combined, these results point to significant impacts of purifying selection on genomic polymorphisms as well as distinct patterns of indels associated with unusual genomic features of Blochmannia.
Affiliation(s)
- Jennifer J. Wernegreen
- Institute for Genome Sciences and Policy, Duke University
- Nicholas School of the Environment, Duke University
|
118
|
CAGO: a software tool for dynamic visual comparison and correlation measurement of genome organization. PLoS One 2011; 6:e27080. [PMID: 22114666 PMCID: PMC3219657 DOI: 10.1371/journal.pone.0027080] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Received: 06/16/2011] [Accepted: 10/10/2011] [Indexed: 11/26/2022]
Abstract
CAGO (Comparative Analysis of Genome Organization) is developed to address two critical shortcomings of conventional genome atlas plotters: lack of dynamic exploratory functions and absence of signal analysis for genomic properties. With dynamic exploratory functions, users can directly manipulate chromosome tracks of a genome atlas and intuitively identify distinct genomic signals by visual comparison. Signal analysis of genomic properties can further detect inconspicuous patterns from noisy genomic properties and calculate correlations between genomic properties across various genomes. To implement dynamic exploratory functions, CAGO presents each genome atlas in Scalable Vector Graphics (SVG) format and allows users to interact with it using a SVG viewer through JavaScript. Signal analysis functions are implemented using R statistical software and a discrete wavelet transformation package waveslim. CAGO is not only a plotter for generating complex genome atlases, but also a platform for exploring genome atlases with dynamic exploratory functions for visual comparison and with signal analysis for comparing genomic properties across multiple organisms. The web-based application of CAGO, its source code, user guides, video demos, and live examples are publicly available and can be accessed at http://cbs.ym.edu.tw/cago.
|
119
|
Takuno S, Kado T, Sugino RP, Nakhleh L, Innan H. Population genomics in bacteria: a case study of Staphylococcus aureus. Mol Biol Evol 2011; 29:797-809. [PMID: 22009061 DOI: 10.1093/molbev/msr249] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.7] [Indexed: 12/30/2022]
Abstract
We analyzed the genome-wide pattern of single nucleotide polymorphisms (SNPs) in a sample with 12 strains of Staphylococcus aureus. Population structure of S. aureus seems to be complex, and the 12 strains were divided into five groups, named A, B, C, D, and E. We conducted a detailed analysis of the topologies of gene genealogies across the genomes and observed a high rate and frequency of tree-shape switching, indicating extensive homologous recombination. Most of the detected recombination occurred in the ancestral population of A, B, and C, whereas there are a number of small regions that exhibit evidence for homologous recombination with a distinct related species. As such regions would contain a number of novel mutations, it is suggested that homologous recombination would play a crucial role to maintain genetic variation within species. In the A-B-C ancestral population, we found multiple lines of evidence that the coalescent pattern is very similar to what is expected in a panmictic population, suggesting that this population is suitable to apply the standard population genetic theories. Our analysis showed that homologous recombination caused a dramatic decay in linkage disequilibrium (LD) and there is almost no LD between SNPs with distance more than 10 kb. Coalescent simulations demonstrated that a high rate of homologous recombination (a relative rate of 0.6 to the mutation rate with an average tract length of about 10 kb) is required to produce patterns similar to those observed in the S. aureus genomes. Our results call for more research into the evolutionary role of homologous recombination in bacterial populations.
Affiliation(s)
- Shohei Takuno
- Graduate University for Advanced Studies, Hayama, Kanagawa, Japan
|
120
|
Guo FB. [Strong strand specific composition bias-a genomic character of some obligate parasites or symbionts]. YI CHUAN = HEREDITAS 2011; 33:1039-1047. [PMID: 21993278 DOI: 10.3724/sp.j.1005.2011.01039] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 05/31/2023]
Abstract
DNA replication is intrinsically asymmetric, being divided between leading and lagging strands. The former is synthesized continuously whereas synthesis of the latter is discontinuous. Such an asymmetric mechanism leads to distinct nucleotide compositions of the two strands. Strand-specific nucleotide composition bias was originally found in the genomes of echinoderm and vertebrate mitochondria and then in several bacterial genomes. With the rapid growth in the number of sequenced genomes, many bacteria and even eukaryotes have been found to show a consistent strand composition bias. In some bacteria, the strand-specific composition bias is so strong that genes on the two replicating strands can be separated according to their codon usage. To date, 11 obligate intracellular bacteria have been found to have separate codon usages according to whether genes are located on the leading or the lagging strand. However, there is still no well-accepted theory that explains why separate codon usages occur in some bacterial genomes and not in others. This paper reviews the related work and points out the open problems.
Affiliation(s)
- Feng-Biao Guo
- University of Electronic Science and Technology of China, Chengdu, China.
|
121
|
Marsolier-Kergoat MC, Goldar A. DNA replication induces compositional biases in yeast. Mol Biol Evol 2011; 29:893-904. [PMID: 21948086 DOI: 10.1093/molbev/msr240] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Indexed: 02/06/2023]
Abstract
Asymmetries intrinsic to the process of DNA replication are expected to cause differences in the substitution patterns of the leading and the lagging strands and to induce compositional biases. These biases have been detected in the majority of eubacterial genomes but rarely in eukaryotes. Only in the human genome, the activity of a minority of replication origins seems to generate compositional biases. In this work, we provide evidence for replication-associated GC and TA skews in the genomes of two yeast species, Saccharomyces cerevisiae and Kluyveromyces lactis, whereas the data for the Schizosaccharomyces pombe genome are less conclusive. In contrast with the genomes of Homo sapiens and of the majority of eubacteria, the leading strand is enriched in cytosine and adenine in both S. cerevisiae and K. lactis. We observed significant variations across the interorigin intervals of several substitution rates in the S. cerevisiae lineage since its divergence from S. paradoxus. We also found that the S. cerevisiae genome is far from compositional equilibrium and that its present compositional biases are due to substitution rates operating before its divergence from S. paradoxus. Finally, we observed that replication and transcription tend to be cooriented in the S. cerevisiae genome, especially for genes encoding subunits of protein complexes. Taken together, our results suggest that replication-related compositional biases may be a feature of many eukaryotic genomes despite the stochastic nature of the firing of replication origins in these genomes.
|
122
|
Bermejo R, Capra T, Jossen R, Colosio A, Frattini C, Carotenuto W, Cocito A, Doksani Y, Klein H, Gómez-González B, Aguilera A, Katou Y, Shirahige K, Foiani M. The replication checkpoint protects fork stability by releasing transcribed genes from nuclear pores. Cell 2011; 146:233-46. [PMID: 21784245 PMCID: PMC3160733 DOI: 10.1016/j.cell.2011.06.033] [Citation(s) in RCA: 179] [Impact Index Per Article: 13.8] [Received: 01/02/2011] [Revised: 04/22/2011] [Accepted: 06/15/2011] [Indexed: 02/08/2023]
Abstract
Transcription hinders replication fork progression and stability, and the Mec1/ATR checkpoint protects fork integrity. Examining checkpoint-dependent mechanisms controlling fork stability, we find that fork reversal and dormant origin firing due to checkpoint defects are rescued in checkpoint mutants lacking THO, TREX-2, or inner-basket nucleoporins. Gene gating tethers transcribed genes to the nuclear periphery and is counteracted by checkpoint kinases through phosphorylation of nucleoporins such as Mlp1. Checkpoint mutants fail to detach transcribed genes from nuclear pores, thus generating topological impediments for incoming forks. Releasing this topological complexity by introducing a double-strand break between a fork and a transcribed unit prevents fork collapse. Mlp1 mutants mimicking constitutive checkpoint-dependent phosphorylation also alleviate checkpoint defects. We propose that the checkpoint assists fork progression and stability at transcribed genes by phosphorylating key nucleoporins and counteracting gene gating, thus neutralizing the topological tension generated at nuclear pore gated genes.
Affiliation(s)
- Rodrigo Bermejo
- Fondazione Istituto FIRC di Oncologia Molecolare (IFOM) at IFOM-IEO Campus, Via Adamello 16, 20139 Milan, Italy.
|
123
|
Abstract
This study reports the essential Caulobacter genome at 8 bp resolution determined by saturated transposon mutagenesis and high-throughput sequencing. This strategy is applicable to full genome essentiality studies in a broad class of bacterial species. The essential Caulobacter genome was determined at 8 bp resolution using hyper-saturated transposon mutagenesis coupled with high-throughput sequencing. Essential protein-coding sequences comprise 90% of the essential genome; the remaining 10% comprising essential non-coding RNA sequences, gene regulatory elements and essential genome replication features. Of the 3876 annotated open reading frames (ORFs), 480 (12.4%) were essential ORFs, 3240 (83.6%) were non-essential ORFs and 156 (4.0%) were ORFs that severely impacted fitness when mutated. The essential elements are preferentially positioned near the origin and terminus of the Caulobacter chromosome. This high-resolution strategy is applicable to high-throughput, full genome essentiality studies and large-scale genetic perturbation experiments in a broad class of bacterial species.
The regulatory events that control polar differentiation and cell-cycle progression in the bacterium Caulobacter crescentus are highly integrated, and they have to occur in the proper order (McAdams and Shapiro, 2011). Components of the core regulatory circuit are largely known. Full discovery of its essential genome, including non-coding, regulatory and coding elements, is a prerequisite for understanding the complete regulatory network of this bacterial cell. We have identified all the essential coding and non-coding elements of the Caulobacter chromosome using a hyper-saturated transposon mutagenesis strategy that is scalable and can be readily extended to obtain rapid and accurate identification of the essential genome elements of any sequenced bacterial species at a resolution of a few base pairs. We engineered a Tn5 derivative transposon (Tn5Pxyl) that carries at one end an inducible outward pointing Pxyl promoter (Christen et al, 2010). We showed that this transposon construct inserts into the genome randomly where it can activate or disrupt transcription at the site of integration, depending on the insertion orientation. DNA from hundreds of thousands of transposon insertion sites reading outward into flanking genomic regions was parallel PCR amplified and sequenced by Illumina paired-end sequencing to locate the insertion site in each mutant strain (Figure 1). A single sequencing run on DNA from a mutagenized cell population yielded 118 million raw sequencing reads. Of these, >90 million (>80%) read outward from the transposon element into adjacent genomic DNA regions and the insertion site could be mapped with single nucleotide resolution. This yielded the location and orientation of 428 735 independent transposon insertions in the 4-Mbp Caulobacter genome. Within non-coding sequences of the Caulobacter genome, we detected 130 non-disruptable DNA segments between 90 and 393 bp long in addition to all essential promoter elements.
Among 27 previously identified and validated sRNAs (Landt et al, 2008), three were contained within non-disruptable DNA segments and another three were partially disruptable, that is, insertions caused a notable growth defect. Two additional small RNAs found to be essential are the transfer-messenger RNA (tmRNA) and the ribozyme RNAseP (Landt et al, 2008). In addition to the 8 non-disruptable sRNAs, 29 out of the 130 intergenic essential non-coding sequences contained non-redundant tRNA genes; duplicated tRNA genes were non-essential. We also identified two non-disruptable DNA segments within the chromosomal origin of replication. Thus, we resolved essential non-coding RNAs, tRNAs and essential replication elements within the origin region of the chromosome. An additional 90 non-disruptable small genome elements of currently unknown function were identified. Eighteen of these are conserved in at least one closely related species. Only 2 could encode a protein of over 50 amino acids. For each of the 3876 annotated open reading frames (ORFs), we analyzed the distribution, orientation, and genetic context of transposon insertions. There are 480 essential ORFs and 3240 non-essential ORFs. In addition, there were 156 ORFs that severely impacted fitness when mutated. The 8-bp resolution allowed a dissection of the essential and non-essential regions of the coding sequences. Sixty ORFs had transposon insertions within a significant portion of their 3′ region but lacked insertions in the essential 5′ coding region, allowing the identification of non-essential protein segments. For example, transposon insertions in the essential cell-cycle regulatory gene divL, a tyrosine kinase, showed that the last 204 C-terminal amino acids did not impact viability, confirming previous reports that the C-terminal ATPase domain of DivL is dispensable for viability (Reisinger et al, 2007; Iniesta et al, 2010). 
In addition, we found that 30 out of 480 (6.3%) of the essential ORFs appear to be shorter than the annotated ORF, suggesting that these are probably mis-annotated. Among the 480 ORFs essential for growth on rich media, there were 10 essential transcriptional regulatory proteins, including 5 previously identified cell-cycle regulators (McAdams and Shapiro, 2003; Holtzendorff et al, 2004; Collier and Shapiro, 2007; Gora et al, 2010; Tan et al, 2010) and 5 uncharacterized predicted transcription factors. In addition, two RNA polymerase sigma factors RpoH and RpoD, as well as the anti-sigma factor ChrR, which mitigates rpoE-dependent stress response under physiological growth conditions (Lourenco and Gomes, 2009), were also found to be essential. Thus, a set of 10 transcription factors, 2 RNA polymerase sigma factors and 1 anti-sigma factor are the core essential transcriptional regulators for growth on rich media. To further characterize the core components of the Caulobacter cell-cycle control network, we identified all essential regulatory sequences and operon transcripts. Altogether, the 480 essential protein-coding and 37 essential RNA-coding Caulobacter genes are organized into operons such that 402 individual promoter regions are sufficient to regulate their expression. Of these 402 essential promoters, the transcription start sites (TSSs) of 105 were previously identified (McGrath et al, 2007). The essential genome features are non-uniformly distributed on the Caulobacter genome and enriched near the origin and the terminus regions. In contrast, the chromosomal positions of the published E. coli essential coding sequences (Rocha, 2004) are preferentially located at either side of the origin (Figure 4A). This indicates that there are selective pressures on chromosomal positioning of some essential elements (Figure 4A). 
The strategy described in this report could be readily extended to quickly determine the essential genome for a large class of bacterial species. Caulobacter crescentus is a model organism for the integrated circuitry that runs a bacterial cell cycle. Full discovery of its essential genome, including non-coding, regulatory and coding elements, is a prerequisite for understanding the complete regulatory network of a bacterial cell. Using hyper-saturated transposon mutagenesis coupled with high-throughput sequencing, we determined the essential Caulobacter genome at 8 bp resolution, including 1012 essential genome features: 480 ORFs, 402 regulatory sequences and 130 non-coding elements, including 90 intergenic segments of unknown function. The essential transcriptional circuitry for growth on rich media includes 10 transcription factors, 2 RNA polymerase sigma factors and 1 anti-sigma factor. We identified all essential promoter elements for the cell cycle-regulated genes. The essential elements are preferentially positioned near the origin and terminus of the chromosome. The high-resolution strategy used here is applicable to high-throughput, full genome essentiality studies and large-scale genetic perturbation experiments in a broad class of bacterial species.
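The counts quoted in this abstract can be cross-checked with simple arithmetic. The sketch below recomputes the ORF percentages and the total number of essential genome features from the figures stated above; the variable and function names are illustrative only and do not come from the study.

```python
# Counts quoted in the abstract above; names are illustrative only.
TOTAL_ORFS = 3876
orf_counts = {
    "essential": 480,
    "non_essential": 3240,
    "fitness_impaired": 156,
}


def percent_shares(counts, total):
    """Percentage of `total` for each category, rounded to one decimal place."""
    return {name: round(100 * n / total, 1) for name, n in counts.items()}


# The three ORF classes should account for every annotated ORF.
assert sum(orf_counts.values()) == TOTAL_ORFS

shares = percent_shares(orf_counts, TOTAL_ORFS)

# Essential genome features: ORFs, regulatory sequences, non-coding elements.
feature_counts = {"orfs": 480, "regulatory": 402, "non_coding": 130}
total_features = sum(feature_counts.values())

if __name__ == "__main__":
    print(shares)
    print(total_features)
```

The recomputed shares match the 12.4%, 83.6% and 4.0% given in the abstract, and the three feature classes sum to the stated 1012 essential genome features.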
|
124
|
Kimbrel JA, Givan SA, Temple TN, Johnson KB, Chang JH. Genome sequencing and comparative analysis of the carrot bacterial blight pathogen, Xanthomonas hortorum pv. carotae M081, for insights into pathogenicity and applications in molecular diagnostics. MOLECULAR PLANT PATHOLOGY 2011; 12:580-94. [PMID: 21722296 PMCID: PMC6640479 DOI: 10.1111/j.1364-3703.2010.00694.x] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Indexed: 05/18/2023]
Abstract
Xanthomonas hortorum pv. carotae (Xhc) is an economically important pathogen of carrots. Its ability to epiphytically colonize foliar surfaces and infect seeds can result in bacterial blight of carrots when grown in warm and humid regions. We used high-throughput sequencing to determine the genome sequence of isolate M081 of Xhc. The short reads were de novo assembled and the resulting contigs were ordered using a syntenic reference genome sequence from X. campestris pv. campestris ATCC 33913. The improved, high-quality draft genome sequence of Xhc M081 is the first for its species. Despite its distance from other sequenced xanthomonads, Xhc M081 still shared a large inventory of orthologous genes, including many clusters of virulence genes common to other foliar pathogenic species of Xanthomonas. We also mined the genome sequence and identified at least 21 candidate type III effector genes. Two were members of the avrBs2 and xopQ families that demonstrably elicit effector-triggered immunity. We showed that expression in planta of these two type III effectors from Xhc M081 was sufficient to elicit resistance gene-mediated hypersensitive responses in heterologous plants, indicating a possibility for resistance gene-mediated control of Xhc. Finally, we identified regions unique to the Xhc M081 genome sequence, and demonstrated their potential in the design of molecular diagnostics for this pathogen.
Affiliation(s)
- Jeffrey A Kimbrel
- Department of Botany and Plant Pathology, Oregon State University, Corvallis, USA
|
125
|
Silby MW, Winstanley C, Godfrey SA, Levy SB, Jackson RW. Pseudomonas genomes: diverse and adaptable. FEMS Microbiol Rev 2011; 35:652-80. [DOI: 10.1111/j.1574-6976.2011.00269.x] [Citation(s) in RCA: 578] [Impact Index Per Article: 44.5] [Indexed: 12/30/2022]
|
126
|
Replication and segregation of an Escherichia coli chromosome with two replication origins. Proc Natl Acad Sci U S A 2011; 108:E243-50. [PMID: 21670292 DOI: 10.1073/pnas.1100874108] [Citation(s) in RCA: 70] [Impact Index Per Article: 5.4] [Indexed: 12/23/2022]
Abstract
Characterized bacteria, unlike eukaryotes and some archaea, initiate replication bidirectionally from a single replication origin contained within a circular or linear chromosome. We constructed Escherichia coli cells with two WT origins separated by 1 Mb in their 4.64-Mb chromosome. Productive bidirectional replication initiated synchronously at both spatially separate origins. Newly replicated DNA from both origins was segregated sequentially as replication progressed, with two temporally and spatially separate replication termination events. Replication initiation occurred at a cell volume identical to that of cells with a single WT origin, showing that initiation control is independent of cellular and chromosomal oriC concentration. Cells containing just the ectopic origin initiated bidirectional replication at the expected cell mass and at the normal cellular location of that region. In all strains, spatial separation of sister loci adjacent to active origins occurred shortly after their replication, independently of whether replication initiated at the normal origin, the ectopic origin, or both origins.
|
127
|
Lyons E, Freeling M, Kustu S, Inwood W. Using genomic sequencing for classical genetics in E. coli K12. PLoS One 2011; 6:e16717. [PMID: 21364914 PMCID: PMC3045373 DOI: 10.1371/journal.pone.0016717] [Citation(s) in RCA: 42] [Impact Index Per Article: 3.2] [Received: 09/30/2010] [Accepted: 12/23/2010] [Indexed: 02/07/2023]
Abstract
We here develop computational methods to facilitate use of 454 whole genome shotgun sequencing to identify mutations in Escherichia coli K12. We had Roche sequence eight related strains derived as spontaneous mutants in a background without a whole genome sequence. They provided difference tables based on assembling each genome to reference strain E. coli MG1655 (NC_000913). Due to the evolutionary distance to MG1655, these contained a large number of both false negatives and positives. By manual analysis of the dataset, we detected all the known mutations (24 at nine locations) and identified and genetically confirmed new mutations necessary and sufficient for the phenotypes we had selected in four strains. We then had Roche assemble contigs de novo, which we further assembled to full-length pseudomolecules based on synteny with MG1655. This hybrid method facilitated detection of insertion mutations and allowed annotation from MG1655. After removing one genome with less than the optimal 20- to 30-fold sequence coverage, we identified 544 putative polymorphisms that included all of the known and selected mutations apart from insertions. Finally, we detected seven new mutations in a total of only 41 candidates by comparing single genomes to composite data for the remaining six and using a ranking system to penalize homopolymer sequencing and misassembly errors. An additional benefit of the analysis is a table of differences between MG1655 and a physiologically robust E. coli wild-type strain NCM3722. Both projects were greatly facilitated by use of comparative genomics tools in the CoGe software package (http://genomevolution.org/).
Affiliation(s)
- Eric Lyons
- Department of Plant and Microbial Biology, University of California, Berkeley, California, United States of America
- Michael Freeling
- Department of Plant and Microbial Biology, University of California, Berkeley, California, United States of America
- Sydney Kustu
- Department of Plant and Microbial Biology, University of California, Berkeley, California, United States of America
- William Inwood
- Department of Plant and Microbial Biology, University of California, Berkeley, California, United States of America
|
128
|
Merrikh H, Machón C, Grainger WH, Grossman AD, Soultanas P. Co-directional replication-transcription conflicts lead to replication restart. Nature 2011; 470:554-7. [PMID: 21350489 PMCID: PMC3059490 DOI: 10.1038/nature09758] [Citation(s) in RCA: 134] [Impact Index Per Article: 10.3] [Received: 02/08/2010] [Accepted: 12/09/2010] [Indexed: 11/23/2022]
Abstract
Head-on encounters between the replication and transcription machineries on the lagging DNA strand can lead to replication fork arrest and genomic instability. To avoid head-on encounters, most genes, especially essential and highly transcribed genes, are encoded on the leading strand such that transcription and replication are co-directional. Virtually all bacteria have the highly expressed ribosomal RNA genes co-directional with replication. In bacteria, co-directional encounters seem inevitable because the rate of replication is about 10-20-fold greater than the rate of transcription. However, these encounters are generally thought to be benign. Biochemical analyses indicate that head-on encounters are more deleterious than co-directional encounters and that in both situations, replication resumes without the need for any auxiliary restart proteins, at least in vitro. Here we show that in vivo, co-directional transcription can disrupt replication, leading to the involvement of replication restart proteins. We found that highly transcribed rRNA genes are hotspots for co-directional conflicts between replication and transcription in rapidly growing Bacillus subtilis cells. We observed a transcription-dependent increase in association of the replicative helicase and replication restart proteins where head-on and co-directional conflicts occur. Our results indicate that there are co-directional conflicts between replication and transcription in vivo. Furthermore, in contrast to the findings in vitro, the replication restart machinery is involved in vivo in resolving potentially deleterious encounters due to head-on and co-directional conflicts. These conflicts probably occur in many organisms and at many chromosomal locations and help to explain the presence of important auxiliary proteins involved in replication restart and in helping to clear a path along the DNA for the replisome.
Affiliation(s)
- Houra Merrikh
- Department of Biology, Building 68-530, M.I.T., Cambridge, MA, 02139, USA
- Cristina Machón
- Centre for Biomolecular Sciences, School of Chemistry, University of Nottingham, University Park, Nottingham NG7 2RD, UK
- William H. Grainger
- Centre for Biomolecular Sciences, School of Chemistry, University of Nottingham, University Park, Nottingham NG7 2RD, UK
- Alan D. Grossman
- Department of Biology, Building 68-530, M.I.T., Cambridge, MA, 02139, USA
- Panos Soultanas
- Centre for Biomolecular Sciences, School of Chemistry, University of Nottingham, University Park, Nottingham NG7 2RD, UK
|
129
|
Abstract
DNA replication fork movement is impeded by collisions with transcription elongation complexes (TEC). We propose that a critical function of transcription termination factors is to prevent TEC from blocking DNA replication and inducing replication fork arrest, one consequence of which is DNA double-strand breaks. We show that inhibition of Rho-dependent transcription termination by bicyclomycin in Escherichia coli induced double-strand breaks. Cells deleted for Rho-cofactors nusA and nusG were hypersensitive to bicyclomycin, and had extensive chromosome fragmentation even in the absence of the drug. An RNA polymerase mutation that destabilizes TEC (rpoB*35) increased bicyclomycin resistance >40-fold. Double-strand break formation depended on DNA replication, and can be explained by replication fork collapse. Deleting recombination genes required for replication fork repair (recB and ruvC) increased sensitivity to bicyclomycin, as did loss of the replication fork reloading helicases rep and priA. We propose that Rho responds to a translocating replisome by releasing obstructing TEC.
|
130
|
Kimbrel JA, Givan SA, Halgren AB, Creason AL, Mills DI, Banowetz GM, Armstrong DJ, Chang JH. An improved, high-quality draft genome sequence of the Germination-Arrest Factor-producing Pseudomonas fluorescens WH6. BMC Genomics 2010; 11:522. [PMID: 20920191 PMCID: PMC2997014 DOI: 10.1186/1471-2164-11-522] [Citation(s) in RCA: 50] [Impact Index Per Article: 3.6] [Received: 04/12/2010] [Accepted: 09/28/2010] [Indexed: 12/20/2022]
Abstract
BACKGROUND Pseudomonas fluorescens is a genetically and physiologically diverse species of bacteria present in many habitats and in association with plants. This species of bacteria produces a large array of secondary metabolites with potential as natural products. P. fluorescens isolate WH6 produces Germination-Arrest Factor (GAF), a predicted small peptide or amino acid analog with herbicidal activity that specifically inhibits germination of seeds of graminaceous species. RESULTS We used a hybrid next-generation sequencing approach to develop a high-quality draft genome sequence for P. fluorescens WH6. We employed automated, manual, and experimental methods to further improve the draft genome sequence. From this assembly of 6.27 megabases, we predicted 5876 genes, of which 3115 were core to P. fluorescens and 1567 were unique to WH6. Comparative genomic studies of WH6 revealed high similarity in synteny and orthology of genes with P. fluorescens SBW25. A phylogenomic study also placed WH6 in the same lineage as SBW25. In a previous non-saturating mutagenesis screen we identified two genes necessary for GAF activity in WH6. Mapping of their flanking sequences revealed genes that encode a candidate anti-sigma factor and an aminotransferase. Finally, we discovered several candidate virulence and host-association mechanisms, one of which appears to be a complete type III secretion system. CONCLUSIONS The improved high-quality draft genome sequence of WH6 contributes towards resolving the P. fluorescens species, providing additional impetus for establishing two separate lineages in P. fluorescens. Despite the high levels of orthology and synteny to SBW25, WH6 still had a substantial number of unique genes and represents another source for the discovery of genes with implications in affecting plant growth and health. Two genes are demonstrably necessary for GAF, and further characterization of their proteins is important for developing natural products as control measures against grassy weeds. Finally, WH6 is the first isolate of P. fluorescens reported to encode a complete T3SS. This gives us the opportunity to explore the role of what has traditionally been thought of as a virulence mechanism for non-pathogenic interactions with plants.
Affiliation(s)
- Jeffrey A Kimbrel
- Department of Botany and Plant Pathology, Oregon State University, Corvallis, OR 97331, USA
|
131
|
Qu H, Wu H, Zhang T, Zhang Z, Hu S, Yu J. Nucleotide compositional asymmetry between the leading and lagging strands of eubacterial genomes. Res Microbiol 2010; 161:838-46. [PMID: 20868744 DOI: 10.1016/j.resmic.2010.09.015] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.1] [Received: 06/27/2010] [Accepted: 08/03/2010] [Indexed: 11/15/2022]
Abstract
Nucleotide compositional asymmetry (NCA) between leading and lagging strands (LeS and LaS) is dynamic and diverse among eubacterial genomes due to different mutation and selection forces. A thorough investigation is needed in order to study the relationship between nucleotide composition dynamics and gene distribution biases. Based on a collection of 364 eubacterial genomes that were grouped according to a DnaE-based scheme (DnaE1-DnaE1, DnaE2-DnaE1, and DnaE3-PolC), we investigated NCA and nucleotide composition gradients at three codon positions and found that there was universal G-enrichment on LeS among all groups. This was due to a strong selection for G-heading (codon position 1, or cp1) codons and mutation pressure that led to more G-ending (cp3) codons. Moreover, a slight T-enrichment of LeS due to the mutation of cytosine deamination at cp3 was universal among DnaE1-DnaE1 and DnaE2-DnaE1 genomes, but was not clearly seen among DnaE3-PolC genomes, in which A-enrichment of LeS was proposed to be the effect of selections unique to PolC and a mutation bias toward A-richness at cp1 that may be a result of transcription-coupled DNA repair mechanisms. Furthermore, strand-biased gene distribution enhances the purine-richness of LeS for DnaE3-PolC genomes and T-richness of LeS for DnaE1-DnaE1 and DnaE2-DnaE1 genomes.
Affiliation(s)
- Hongzhu Qu
- Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100029, China.
|
132
|
Huff J, Czyz A, Landick R, Niederweis M. Taking phage integration to the next level as a genetic tool for mycobacteria. Gene 2010; 468:8-19. [PMID: 20692326 DOI: 10.1016/j.gene.2010.07.012] [Citation(s) in RCA: 48] [Impact Index Per Article: 3.4] [Received: 12/21/2009] [Revised: 07/26/2010] [Accepted: 07/27/2010] [Indexed: 11/25/2022]
Abstract
Genes must be stably integrated into bacterial chromosomes for complementation of gene deletion mutants in animal infection experiments or to express antigens in vaccine strains. However, with currently available vectors it is cumbersome to create multiple, stable, unmarked chromosomal integrations in mycobacteria. Here, we have constructed a novel integration vector for mycobacteria that enables expression of genes from a cassette protected from transcriptional interference by bi-directional transcriptional terminators proven to be highly efficient in in vitro transcription termination assays. Removal of the integrase gene by a site-specific recombinase, easily identifiable by loss of a backbone reporter gene, stabilizes the integration cassette and makes this vector ideally suitable for infection experiments. This integration vector can be easily adapted to different mycobacteriophage attachment sites (attB) due to its modular design. Integration of a gfp expression cassette at the L5, Giles and Ms6 attB sites in the chromosomes of Mycobacterium smegmatis and Mycobacterium tuberculosis yielded identical gfp expression levels, indicating that none of these sites are compromised for gene expression. The copy number of pAL5000-based extrachromosomal plasmids is 23 in M. smegmatis as determined by quantitative real-time PCR and accounts for the previously observed drastic reduction of gene expression upon integration of plasmids into the chromosome of mycobacteria. Gfp expression and fluorescence of M. smegmatis and M. tuberculosis strains with multiple integrations of gfp increased concomitantly with the copy number demonstrating that these vectors can be used to generate stronger phenotypes and/or to analyze several genes simultaneously in vivo.
Affiliation(s)
- Jason Huff
- Department of Microbiology, University of Alabama at Birmingham, 609 Bevill Biomedical Research Building, 845 19th Street South, Birmingham, AL 35294, USA
|
133
|
Supek F, Škunca N, Repar J, Vlahoviček K, Šmuc T. Translational selection is ubiquitous in prokaryotes. PLoS Genet 2010; 6:e1001004. [PMID: 20585573 PMCID: PMC2891978 DOI: 10.1371/journal.pgen.1001004] [Citation(s) in RCA: 69] [Impact Index Per Article: 4.9] [Received: 12/11/2009] [Accepted: 05/26/2010] [Indexed: 11/29/2022]
Abstract
Codon usage bias in prokaryotic genomes is largely a consequence of background substitution patterns in DNA, but highly expressed genes may show a preference towards codons that enable more efficient and/or accurate translation. We introduce a novel approach based on supervised machine learning that detects effects of translational selection on genes, while controlling for local variation in nucleotide substitution patterns represented as sequence composition of intergenic DNA. A cornerstone of our method is a Random Forest classifier that outperformed previous distance measure-based approaches, such as the codon adaptation index, in the task of discerning the (highly expressed) ribosomal protein genes by their codon frequencies. Unlike previous reports, we show evidence that translational selection in prokaryotes is practically universal: in 460 of 461 examined microbial genomes, we find that a subset of genes shows a higher codon usage similarity to the ribosomal proteins than would be expected from the local sequence composition. These genes constitute a substantial part of the genome (between 5% and 33%, depending on genome size) while also exhibiting higher experimentally measured mRNA abundances and tending toward codons that match tRNA anticodons by canonical base pairing. Certain gene functional categories are generally enriched with, or depleted of codon-optimized genes, the trends of enrichment/depletion being conserved between Archaea and Bacteria. Prominent exceptions from these trends might indicate genes with alternative physiological roles; we speculate on specific examples related to detoxication of oxygen radicals and ammonia and to possible misannotations of asparaginyl–tRNA synthetases. Since the presence of codon optimizations on genes is a valid proxy for expression levels in fully sequenced genomes, we provide an example of an “adaptome” by highlighting gene functions with expression levels elevated specifically in thermophilic Bacteria and Archaea.
Synonymous codons are not equally common in genomes. The main causes of unequal codon usage are varying nucleotide substitution patterns, as manifested in the wide range of genomic nucleotide compositions. However, since the first E. coli and yeast genes were sequenced, it became evident that there was also a bias towards codons that can be translated to protein faster and more accurately. This bias was stronger in highly expressed genes, and its driving force was termed translational selection. Researchers sought for effects of translational selection in microbial genomes as they became available, employing a flurry of mathematical approaches which sometimes led to contradictory conclusions. We introduce a sensitive and accurate machine learning-based methodology and find that highly expressed genes have a recognizable codon usage pattern in almost every bacterial and archaeal genome analyzed, even after accounting for large differences in background nucleotide composition. We also show that the gene functional category has a great bearing on whether that gene is subject to translational selection. Since presence of codon optimizations can be used as a purely sequence-derived proxy for expression levels, we can delineate “adaptomes” by relating predicted gene activity to organisms' phenotypes, which we demonstrate on genomes of temperature-resistant Bacteria and Archaea.
Affiliation(s)
- Fran Supek
- Division of Electronics, Rudjer Boskovic Institute, Zagreb, Croatia
- Nives Škunca
- Division of Electronics, Rudjer Boskovic Institute, Zagreb, Croatia
- Jelena Repar
- Division of Molecular Biology, Rudjer Boskovic Institute, Zagreb, Croatia
- Kristian Vlahoviček
- Division of Biology, Faculty of Science, University of Zagreb, Zagreb, Croatia
- Department of Informatics, University of Oslo, Oslo, Norway
- Tomislav Šmuc
- Division of Electronics, Rudjer Boskovic Institute, Zagreb, Croatia
|
134
|
Tehranchi AK, Blankschien MD, Zhang Y, Halliday JA, Srivatsan A, Peng J, Herman C, Wang JD. The transcription factor DksA prevents conflicts between DNA replication and transcription machinery. Cell 2010; 141:595-605. [PMID: 20478253 DOI: 10.1016/j.cell.2010.03.036] [Citation(s) in RCA: 105] [Impact Index Per Article: 7.5] [Received: 09/17/2009] [Revised: 01/06/2010] [Accepted: 03/11/2010] [Indexed: 12/27/2022]
Abstract
Actively dividing cells perform robust and accurate DNA replication during fluctuating nutrient availability, yet factors that prevent disruption of replication remain largely unknown. Here we report that DksA, a nutrient-responsive transcription factor, ensures replication completion in Escherichia coli by removing transcription roadblocks. In the absence of DksA, replication is rapidly arrested upon amino acid starvation. This arrest requires active transcription and is alleviated by RNA polymerase mutants that compensate for DksA activity. This replication arrest occurs independently of exogenous DNA damage, yet it induces the DNA-damage response and recruits the main recombination protein RecA. This function of DksA is independent of its transcription initiation activity but requires its less-studied transcription elongation activity. Finally, GreA/B elongation factors also prevent replication arrest during nutrient stress. We conclude that transcription elongation factors alleviate fundamental conflicts between replication and transcription, thereby protecting replication fork progression and DNA integrity.
Affiliation(s)
- Ashley K Tehranchi
- Department of Molecular and Human Genetics, Baylor College of Medicine, One Baylor Plaza, Houston, TX 77030, USA
|
135
|
Hartman AL, Norais C, Badger JH, Delmas S, Haldenby S, Madupu R, Robinson J, Khouri H, Ren Q, Lowe TM, Maupin-Furlow J, Pohlschroder M, Daniels C, Pfeiffer F, Allers T, Eisen JA. The complete genome sequence of Haloferax volcanii DS2, a model archaeon. PLoS One 2010; 5:e9605. [PMID: 20333302 PMCID: PMC2841640 DOI: 10.1371/journal.pone.0009605] [Citation(s) in RCA: 199] [Impact Index Per Article: 14.2] [Received: 07/24/2009] [Accepted: 02/11/2010] [Indexed: 11/24/2022]
Abstract
BACKGROUND Haloferax volcanii is an easily culturable moderate halophile that grows on simple defined media, is readily transformable, and has a relatively stable genome. This, in combination with its biochemical and genetic tractability, has made Hfx. volcanii a key model organism, not only for the study of halophilicity, but also for archaeal biology in general. METHODOLOGY/PRINCIPAL FINDINGS We report here the sequencing and analysis of the genome of Hfx. volcanii DS2, the type strain of this species. The genome contains a main 2.848 Mb chromosome, three smaller chromosomes pHV1, 3, 4 (85, 438, 636 kb, respectively) and the pHV2 plasmid (6.4 kb). CONCLUSIONS/SIGNIFICANCE The completed genome sequence, presented here, provides an invaluable tool for further in vivo and in vitro studies of Hfx. volcanii.
Affiliation(s)
- Amber L. Hartman
- Department of Biology, Johns Hopkins University, Baltimore, Maryland, United States of America
- The Institute for Genomic Research (J. Craig Venter Institute), Rockville, Maryland, United States of America
- UC Davis Genome Center, University of California Davis, Davis, California, United States of America
- Cédric Norais
- Institut de Génétique et Microbiologie, Université Paris-Sud, Paris, France
- Department of Biochemistry, University of Wisconsin-Madison, Madison, Wisconsin, United States of America
- Jonathan H. Badger
- The Institute for Genomic Research (J. Craig Venter Institute), Rockville, Maryland, United States of America
- Stéphane Delmas
- Institute of Genetics, University of Nottingham, Nottingham, United Kingdom
- Sam Haldenby
- Institute of Genetics, University of Nottingham, Nottingham, United Kingdom
- Ramana Madupu
- The Institute for Genomic Research (J. Craig Venter Institute), Rockville, Maryland, United States of America
- Jeffrey Robinson
- The Institute for Genomic Research (J. Craig Venter Institute), Rockville, Maryland, United States of America
- Hoda Khouri
- The Institute for Genomic Research (J. Craig Venter Institute), Rockville, Maryland, United States of America
- Qinghu Ren
- The Institute for Genomic Research (J. Craig Venter Institute), Rockville, Maryland, United States of America
- Todd M. Lowe
- Department of Biomolecular Engineering, University of California Santa Cruz, Santa Cruz, California, United States of America
- Julie Maupin-Furlow
- Department of Microbiology and Cell Science, University of Florida, Gainesville, Florida, United States of America
- Mecky Pohlschroder
- Department of Biology, University of Pennsylvania, Philadelphia, Pennsylvania, United States of America
- Charles Daniels
- Department of Microbiology, Ohio State University, Columbus, Ohio, United States of America
- Friedhelm Pfeiffer
- Department of Membrane Biochemistry, Max-Planck-Institute of Biochemistry, Martinsried, Germany
- Thorsten Allers
- Institute of Genetics, University of Nottingham, Nottingham, United Kingdom
- Jonathan A. Eisen
- The Institute for Genomic Research (J. Craig Venter Institute), Rockville, Maryland, United States of America
- UC Davis Genome Center, University of California Davis, Davis, California, United States of America
- Department of Medical Microbiology and Immunology, University of California Davis, Davis, California, United States of America
- Department of Evolution and Ecology, University of California Davis, Davis, California, United States of America
|
136
|
Srivatsan A, Tehranchi A, MacAlpine DM, Wang JD. Co-orientation of replication and transcription preserves genome integrity. PLoS Genet 2010; 6:e1000810. [PMID: 20090829 PMCID: PMC2797598 DOI: 10.1371/journal.pgen.1000810] [Citation(s) in RCA: 135] [Impact Index Per Article: 9.6] [Received: 09/21/2009] [Accepted: 12/10/2009] [Indexed: 01/18/2023]
Abstract
In many bacteria, there is a genome-wide bias towards co-orientation of replication and transcription, with essential and/or highly-expressed genes further enriched co-directionally. We previously found that reversing this bias in the bacterium Bacillus subtilis slows replication elongation, and we proposed that this effect contributes to the evolutionary pressure selecting the transcription-replication co-orientation bias. This selection might have been based purely on selection for speedy replication; alternatively, the slowed replication might actually represent an average of individual replication-disruption events, each of which is counter-selected independently because genome integrity is selected. To differentiate these possibilities and define the precise forces driving this aspect of genome organization, we generated new strains with inversions either over ∼1/4 of the chromosome or at ribosomal RNA (rRNA) operons. Applying mathematical analysis to genomic microarray snapshots, we found that replication rates vary dramatically within the inverted genome. Replication is moderately impeded throughout the inverted region, which results in a small but significant competitive disadvantage in minimal medium. Importantly, replication is strongly obstructed at inverted rRNA loci in rich medium. This obstruction results in disruption of DNA replication, activation of DNA damage responses, loss of genome integrity, and cell death. Our results strongly suggest that preservation of genome integrity drives the evolution of co-orientation of replication and transcription, a conserved feature of genome organization.
An important feature of genome organization is that transcription and replication are selectively co-oriented. This feature helps to avoid conflicts between head-on replication and transcription. The precise consequences of the conflict and how it affects genome organization remain to be understood. We previously found that reversing the transcription bias slows replication in the Bacillus subtilis genome. Here we engineered new inversions to avoid changes in other aspects of genome organization. We found that the reversed transcription bias is sufficient to decrease replication speed, and it results in lowered fitness of the inversion strains and a competitive disadvantage relative to wild-type cells in minimal medium. Further, by analyzing genomic copy-number snapshots to obtain replication speed as a function of genome position, we found that inversion of the strongly-transcribed rRNA genes obstructs replication during growth in rich medium. This confers a strong growth disadvantage to cells in rich medium, turns on DNA damage responses, and leads to cell death in a subpopulation of cells, while the surviving cells are more sensitive to genotoxic agents. Our results strongly support the hypothesis that evolution has favored co-orientation of transcription with replication, mainly to avoid these effects.
Affiliation(s)
- Anjana Srivatsan
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, United States of America
- Ashley Tehranchi
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, United States of America
- David M. MacAlpine
- Department of Pharmacology and Cancer Biology, Duke University Medical Center, Durham, North Carolina, United States of America
- Jue D. Wang
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, United States of America
|
137
|
Arakawa K, Suzuki H, Tomita M. Quantitative analysis of replication-related mutation and selection pressures in bacterial chromosomes and plasmids using generalised GC skew index. BMC Genomics 2009; 10:640. [PMID: 20042086 PMCID: PMC2804667 DOI: 10.1186/1471-2164-10-640] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.9] [Received: 06/19/2009] [Accepted: 12/30/2009] [Indexed: 11/10/2022]
Abstract
BACKGROUND Due to their bi-directional replication machinery starting from a single finite origin, bacterial genomes show characteristic nucleotide compositional bias between the two replichores, which can be visualised through GC skew or (C-G)/(C+G). Although this polarisation is used for computational prediction of replication origins in many bacterial genomes, the degree of GC skew visibility varies widely among different species, necessitating a quantitative measurement of GC skew strength in order to provide confidence measures for GC skew-based predictions of replication origins. RESULTS Here we discuss a quantitative index for the measurement of GC skew strength, named the generalised GC skew index (gGCSI), which is applicable to genomes of any length, including bacterial chromosomes and plasmids. We demonstrate that gGCSI is independent of the window size and can thus be used to compare genomes with different sizes, such as bacterial chromosomes and plasmids. It can suggest the existence of different replication mechanisms in archaea and of rolling-circle replication in plasmids. Correlation of gGCSI values between plasmids and their corresponding host chromosomes suggests that within the same strain, these replicons have reproduced using the same replication machinery and thus exhibit similar strengths of replication strand skew. CONCLUSIONS gGCSI can be applied to genomes of any length and thus allows comparative study of replication-related mutation and selection pressures in genomes of different lengths such as bacterial chromosomes and plasmids. Using gGCSI, we showed that replication-related mutation or selection pressure is similar for replicons with similar machinery.
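The windowed GC skew mentioned in this abstract can be illustrated with a minimal sketch. It uses the abstract's sign convention, (C-G)/(C+G), over non-overlapping windows, and looks for the turning point of the cumulative skew on a toy sequence. The function names, the window size and the toy sequence are assumptions made for illustration only; this is not the gGCSI calculation, whose details the abstract does not give.

```python
def gc_skew_windows(seq, window):
    """Return (C - G) / (C + G) for each non-overlapping window of seq."""
    seq = seq.upper()
    skews = []
    for start in range(0, len(seq) - window + 1, window):
        chunk = seq[start:start + window]
        c = chunk.count("C")
        g = chunk.count("G")
        # Windows without any C or G carry no skew information; report 0.0.
        skews.append((c - g) / (c + g) if (c + g) else 0.0)
    return skews


def cumulative(values):
    """Running sum of the per-window skews."""
    total = 0.0
    out = []
    for v in values:
        total += v
        out.append(total)
    return out


# Hypothetical toy sequence: a C-rich half followed by a G-rich half,
# mimicking the switch in strand composition between two replichores.
toy = "CCCCGCCCCG" * 3 + "GGGGCGGGGC" * 3
skews = gc_skew_windows(toy, 10)
cum = cumulative(skews)
# Index of the window where the cumulative skew peaks (the polarity switch).
turning_point = max(range(len(cum)), key=cum.__getitem__)
print(skews, turning_point)
```

In this toy case the sign of the skew flips halfway through, so the cumulative curve peaks at the last C-rich window. On real genomes the signal is noisier, which is the motivation the abstract gives for a quantitative strength index.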
Affiliation(s)
- Kazuharu Arakawa
- Institute for Advanced Biosciences, Keio University, Fujisawa, 252-8520, Japan.
|
138
|
Cadoret JC, Prioleau MN. Genome-wide approaches to determining origin distribution. Chromosome Res 2009; 18:79-89. [DOI: 10.1007/s10577-009-9094-2] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.9] [Indexed: 12/25/2022]
|
139
|
Kong SG, Fan WL, Chen HD, Hsu ZT, Zhou N, Zheng B, Lee HC. Inverse symmetry in complete genomes and whole-genome inverse duplication. PLoS One 2009; 4:e7553. [PMID: 19898631 PMCID: PMC2771390 DOI: 10.1371/journal.pone.0007553] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.1] [Received: 03/20/2009] [Accepted: 07/22/2009] [Indexed: 12/18/2022]
Abstract
The cause of symmetry is usually subtle, and its study often leads to a deeper understanding of the bearer of the symmetry. To gain insight into the dynamics driving the growth and evolution of genomes, we conducted a comprehensive study of textual symmetries in 786 complete chromosomes. We focused on symmetry based on our belief that, in spite of their extreme diversity, genomes must share common dynamical principles and mechanisms that drive their growth and evolution, and that the most robust footprints of such dynamics are symmetry related. We found that while complement and reverse symmetries are essentially absent in genomic sequences, inverse-complement plus reverse-symmetry is prevalent in complex patterns in most chromosomes, a vast majority of which have near maximum global inverse symmetry. We also discovered relations that can quantitatively account for the long observed but unexplained phenomenon of k-mer skews in genomes. Our results suggest segmental and whole-genome inverse duplications are important mechanisms in genome growth and evolution, probably because they are efficient means by which the genome can exploit its double-stranded structure to enrich its code-inventory.
Affiliation(s)
- Sing-Guan Kong
- Graduate Institute of Systems Biology and Bioinformatics, National Central University, Chungli, Taiwan, Republic of China
- Wen-Lang Fan
- Department of Physics, National Central University, Chungli, Taiwan, Republic of China
- Hong-Da Chen
- Department of Physics, National Central University, Chungli, Taiwan, Republic of China
- Zi-Ting Hsu
- Graduate Institute of Systems Biology and Bioinformatics, National Central University, Chungli, Taiwan, Republic of China
- Nengji Zhou
- Institute of Modern Physics, Zhejiang University, Hangzhou, Zhejiang, China
- National Center for Theoretical Science, Shinchu, Taiwan, Republic of China
- Bo Zheng
- Graduate Institute of Systems Biology and Bioinformatics, National Central University, Chungli, Taiwan, Republic of China
- Hoong-Chien Lee
- Graduate Institute of Systems Biology and Bioinformatics, National Central University, Chungli, Taiwan, Republic of China
- Department of Physics, National Central University, Chungli, Taiwan, Republic of China
- Institute of Modern Physics, Zhejiang University, Hangzhou, Zhejiang, China
- National Center for Theoretical Science, Shinchu, Taiwan, Republic of China
|
140
|
Powdel BR, Satapathy SS, Kumar A, Jha PK, Buragohain AK, Borah M, Ray SK. A study in entire chromosomes of violations of the intra-strand parity of complementary nucleotides (Chargaff's second parity rule). DNA Res 2009; 16:325-43. [PMID: 19861381 PMCID: PMC2780954 DOI: 10.1093/dnares/dsp021] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.5] [Indexed: 01/05/2023]
Abstract
Chargaff's rule of intra-strand parity (ISP) between complementary mono/oligonucleotides in chromosomes is well established in the scientific literature. Although a large number of papers have been published citing works and discussions on ISP in the genomic era, scientists are yet to find all the factors responsible for such a universal phenomenon in the chromosomes. In the present work, we have tried to address the issue from a new perspective, which is a parallel feature to ISP. The compositional abundance values of mono/oligonucleotides were determined in all non-overlapping sub-chromosomal regions of specific size. Also the frequency distributions of the mono/oligonucleotides among the regions were compared using the Kolmogorov–Smirnov test. Interestingly, the frequency distributions between the complementary mono/oligonucleotides revealed statistical similarity, which we named intra-strand frequency distribution parity (ISFDP). ISFDP was observed as a general feature in chromosomes of bacteria, archaea and eukaryotes. Violation of ISFDP was also observed in several chromosomes. Chromosomes of different strains belonging to a species in bacteria/archaea (Haemophilus influenzae, Xylella fastidiosa etc.) and chromosomes of a eukaryote are found to differ from each other with respect to ISFDP violation. ISFDP correlates weakly with ISP in chromosomes, suggesting that the latter is not entirely responsible for the former. Asymmetry of replication topography and composition of forward-encoded sequences between the strands in chromosomes are found to be insufficient to explain the ISFDP feature in all chromosomes. This suggests that multiple factors in chromosomes are responsible for establishing ISFDP.
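The windowed comparison described in this abstract can be sketched as follows, assuming mononucleotides only and a fixed window size. The function names are hypothetical, and only the two-sample Kolmogorov–Smirnov statistic (the largest gap between the two empirical distribution functions) is computed; the significance threshold and the window sizes used by Powdel et al. are not reproduced here.

```python
from bisect import bisect_right


def window_counts(seq: str, base: str, window: int) -> list[int]:
    """Count `base` in each non-overlapping window; a trailing partial window is dropped."""
    return [
        seq[start:start + window].count(base)
        for start in range(0, len(seq) - window + 1, window)
    ]


def ks_statistic(sample_a: list[int], sample_b: list[int]) -> float:
    """Two-sample Kolmogorov-Smirnov statistic: the maximum absolute
    difference between the empirical CDFs of the two samples."""
    if not sample_a or not sample_b:
        raise ValueError("both samples must be non-empty")
    a_sorted, b_sorted = sorted(sample_a), sorted(sample_b)
    points = sorted(set(sample_a) | set(sample_b))
    return max(
        abs(bisect_right(a_sorted, x) / len(a_sorted)
            - bisect_right(b_sorted, x) / len(b_sorted))
        for x in points
    )


# Toy sequence (an assumption for illustration, not real genomic data).
toy = "AATTGCAT" * 50
a_per_window = window_counts(toy, "A", 20)
t_per_window = window_counts(toy, "T", 20)
print(ks_statistic(a_per_window, t_per_window))
```

A small statistic for a complementary pair such as A and T would be consistent with the frequency distribution parity the abstract describes, while a large one would indicate a violation; deciding what counts as "large" requires the accompanying significance test.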
Affiliation(s)
- B R Powdel
- 1Department of Mathematical Sciences, Tezpur University, Tezpur, Assam 784 028, India
|
141
|
Bermejo R, Capra T, Gonzalez-Huici V, Fachinetti D, Cocito A, Natoli G, Katou Y, Mori H, Kurokawa K, Shirahige K, Foiani M. Genome-Organizing Factors Top2 and Hmo1 Prevent Chromosome Fragility at Sites of S phase Transcription. Cell 2009; 138:870-84. [DOI: 10.1016/j.cell.2009.06.022] [Citation(s) in RCA: 89] [Impact Index Per Article: 5.9] [Received: 12/17/2008] [Revised: 04/10/2009] [Accepted: 06/10/2009] [Indexed: 12/18/2022]
|
142
|
Poptsova MS, Larionov SA, Ryadchenko EV, Rybalko SD, Zakharov IA, Loskutov A. Hidden chromosome symmetry: in silico transformation reveals symmetry in 2D DNA walk trajectories of 671 chromosomes. PLoS One 2009; 4:e6396. [PMID: 19636424 PMCID: PMC2712679 DOI: 10.1371/journal.pone.0006396] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Received: 04/07/2009] [Accepted: 06/23/2009] [Indexed: 11/18/2022]
Abstract
Maps of 2D DNA walk of 671 examined chromosomes show composition complexity change from symmetrical half-turn in bacteria to pseudo-random trajectories in archaea, fungi and humans. In silico transformation of gene order and strand position returns most of the analyzed chromosomes to a symmetrical bacterial-like state with one transition point. The transformed chromosomal sequences also reveal remarkable segmental compositional symmetry between regions from different strands located equidistantly from the transition point. Despite extensive chromosome rearrangement the relation of gene numbers on opposite strands for chromosomes of different taxa varies in narrow limits around unity with Pearson coefficient r = 0.98. Similar relation is observed for total genes' length (r = 0.86) and cumulative GC (r = 0.95) and AT (r = 0.97) skews. This is also true for human coding sequences (CDS), which comprise only several percent of the entire chromosome length. We found that frequency distributions of the length of gene clusters, continuously located on the same strand, have close values for both strands. Eukaryotic gene distribution is believed to be non-random. Contribution of different subsystems to the noted symmetries and distributions, and evolutionary aspects of symmetry are discussed.
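For readers unfamiliar with 2D DNA walks and cumulative skews, the sketch below traces a sequence as a path on a plane. The step directions (G up, C down, T right, A left) are an assumption chosen for this example and may differ from the mapping used by Poptsova et al.; with this choice the y coordinate is the cumulative GC skew (G minus C) and the x coordinate is a cumulative T minus A count.

```python
# Assumed step directions for each nucleotide; other conventions exist.
STEPS = {"A": (-1, 0), "T": (1, 0), "G": (0, 1), "C": (0, -1)}


def dna_walk(seq: str) -> list[tuple[int, int]]:
    """Return the 2D walk trajectory of a sequence, starting at the origin.
    Characters outside A, C, G, T (for example N) leave the position unchanged."""
    x = y = 0
    path = [(0, 0)]
    for base in seq.upper():
        dx, dy = STEPS.get(base, (0, 0))
        x += dx
        y += dy
        path.append((x, y))
    return path


def cumulative_gc_skew(seq: str) -> list[int]:
    """Cumulative (G - C) count along the sequence: the y coordinate of the walk."""
    return [y for _, y in dna_walk(seq)]


print(dna_walk("GGCA"))
print(cumulative_gc_skew("GGCCC"))
```

Under this convention, a compositional switch between strands at a single transition point shows up as a turn in the trajectory, which is the kind of pattern the abstract describes for bacterial chromosomes.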
Affiliation(s)
- Maria S Poptsova
- University of Connecticut, Storrs, Connecticut, United States of America.
|
143
|
Allelic variation of bile salt hydrolase genes in Lactobacillus salivarius does not determine bile resistance levels. J Bacteriol 2009; 191:5743-57. [PMID: 19592587 DOI: 10.1128/jb.00506-09] [Citation(s) in RCA: 65] [Impact Index Per Article: 4.3] [Indexed: 01/18/2023]
Abstract
Commensal lactobacilli frequently produce bile salt hydrolase (Bsh) enzymes whose roles in intestinal survival are unclear. Twenty-six Lactobacillus salivarius strains from different sources all harbored a bsh1 allele on their respective megaplasmids. This allele was related to the plasmid-borne bsh1 gene of the probiotic strain UCC118. A second locus (bsh2) was found in the chromosomes of two strains that had higher bile resistance levels. Four Bsh1-encoding allele groups were identified, defined by truncations or deletions involving a conserved residue. In vitro analyses showed that this allelic variation was correlated with widely varying bile deconjugation phenotypes. Despite very low activity of the UCC118 Bsh1 enzyme, a mutant lacking this protein had significantly lower bile resistance, both in vitro and during intestinal transit in mice. However, the overall bile resistance phenotype of this and other strains was independent of the bsh1 allele type. Analysis of the L. salivarius transcriptome upon exposure to bile and cholate identified a multiplicity of stress response proteins and putative efflux proteins that appear to broadly compensate for, or mask, the effects of allelic variation of bsh genes. Bsh enzymes with different bile-degrading kinetics, though apparently not the primary determinants of bile resistance in L. salivarius, may have additional biological importance because of varying effects upon bile as a signaling molecule in the host.
|
144
|
Krishnan NM, Rao BJ. A comparative approach to elucidate chloroplast genome replication. BMC Genomics 2009; 10:237. [PMID: 19457260 PMCID: PMC2695485 DOI: 10.1186/1471-2164-10-237] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.1] [Received: 10/22/2008] [Accepted: 05/20/2009] [Indexed: 11/30/2022]
Abstract
BACKGROUND: Electron microscopy analyses of replicating chloroplast molecules earlier predicted bidirectional Cairns replication as the prevalent mechanism, perhaps followed by rounds of a rolling circle mechanism. This standard model is being challenged by the recent proposition of homologous recombination-mediated replication in chloroplasts.
RESULTS: We address this issue in our current study by analyzing nucleotide composition in genome regions between known replication origins, with the aim of revealing any adenine to guanine deamination gradients. These gradual linear gradients typically result from the accumulation of deaminations over the time spent single-stranded by one of the strands of the circular molecule during replication and can, therefore, be used to model the course of replication. Our linear regression analyses on the nucleotide compositions of the non-coding regions and the synonymous third codon position of coding regions, between pairs of replication origins, reveal the existence of significant adenine to guanine deamination gradients in portions overlapping the Small Single Copy (SSC) and the Large Single Copy (LSC) regions between inverted repeats. These gradients increase bi-directionally from the center of each region towards the respective ends, suggesting that both strands were left single-stranded during replication.
CONCLUSION: Single-stranded regions of the genome and gradients in the time that these regions are left single-stranded, as revealed by our nucleotide composition analyses, appear to converge with the original bi-directional dual displacement loop model and restore evidence for its existence as the primary mechanism. Other proposed faster modes such as homologous recombination and rolling circle initiation could exist in addition to this primary mechanism to facilitate homoplasmy among the intra-cellular chloroplast population.
Affiliation(s)
- Neeraja M Krishnan
- B-202, Department of Biological Sciences, Tata Institute of Fundamental Research, 1 Homi Bhabha road, Colaba, Mumbai 400 005, India
- Current address: Molecular Reproduction, Development and Genetics, Indian Institute of Science, Bangalore 560 012, India
- Basuthkar J Rao
- B-202, Department of Biological Sciences, Tata Institute of Fundamental Research, 1 Homi Bhabha road, Colaba, Mumbai 400 005, India
|
145
|
Yin S, Deng W, Hu L, Kong X. The impact of nucleosome positioning on the organization of replication origins in eukaryotes. Biochem Biophys Res Commun 2009; 385:363-8. [PMID: 19463783 DOI: 10.1016/j.bbrc.2009.05.072] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.4] [Received: 05/02/2009] [Accepted: 05/13/2009] [Indexed: 01/01/2023]
Abstract
The distribution of DNA replication origins (ORIs) on eukaryotic chromosomes is nonrandom, but the reasons behind this are not well understood. Previous studies have suggested a prominent role of transcriptional activity in determining the ORI organization. Here, we identify nucleosome occupancy as a likely candidate to set up ORI distribution. Combining genome-wide data on nucleosome positioning and ORI organization in yeast and humans, we demonstrate that open chromatin domains, characterized by nucleosome depletion, are preferentially permissive for replication. However, contrary to prior claims, the impact of transcriptional activity is considerably weaker than previously proposed and could partly be explained by our nucleosome exclusion model. We propose that the ORI organization imposed by nucleosome positioning is phylogenetically widespread in eukaryotes.
Affiliation(s)
- Shanye Yin
- Institute of Health Sciences, Shanghai Institutes for Biological Sciences (SIBS), Chinese Academy of Sciences (CAS) & Shanghai Jiao Tong University School of Medicine (SJTUSM), Shanghai 200025, People's Republic of China
|
146
|
Rudi K. Environmental shaping of ribosomal RNA nucleotide composition. Microb Ecol 2009; 57:469-477. [PMID: 18825450 DOI: 10.1007/s00248-008-9446-z] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Received: 05/20/2008] [Accepted: 08/27/2008] [Indexed: 05/26/2023]
Abstract
Ribosomal RNA (rRNA) is one of the most important macromolecules in the cell. It is well established that high-temperature environmental conditions destabilize rRNA, leading to a selection for G+C-rich stabilizing structures. Our knowledge about the nucleotide composition effect of other environmental conditions, however, is limited. In the present work, I addressed this by correlating the rRNA nucleotide composition to known environmental habitats for bacteria. The bacterial phyla Firmicutes, Actinobacteria, and Proteobacteria were chosen for in-depth analyses due to the abundance of information available in the databases. Major differences in nucleotide composition were identified between these phyla. In addition to the G+C→A+T gradients, a main gradient of G+A→C+T was identified for Firmicutes, while a G+T→A+C gradient was identified for Actinobacteria. With respect to correlation to environmental conditions, the Firmicutes showed a main structure of high G+C being correlated to thermophilic conditions, high A+T to anaerobic conditions, and high C+T to halophilic conditions. The main patterns detected for Firmicutes can be explained by structural stability for high G+C, chemical instability of G under aerobic conditions, and structural stability by purine/pyrimidine skew for halophilic conditions. In contrast, the correlations for Actinobacteria cannot easily be explained by chemical and/or structural stability. This may indicate interference from factors not included in my work. Finally, I found a main correlation between high A+T and endosymbiosis for Proteobacteria. High A+T probably reflects adaptation to cell internal growth. Further support for environmentally driven nucleotide composition shaping was found in that polyphyletic bacteria were associated with the same environment/nucleotide correlations. My conclusion is that environmental conditions and habitats have a major effect on rRNA nucleotide composition but that the effects may differ between the bacterial phyla.
Affiliation(s)
- Knut Rudi
- Hedmark University College, Hamar, Norway.
|
147
|
Duggin IG, Bell SD. Termination structures in the Escherichia coli chromosome replication fork trap. J Mol Biol 2009; 387:532-9. [PMID: 19233209 DOI: 10.1016/j.jmb.2009.02.027] [Citation(s) in RCA: 63] [Impact Index Per Article: 4.2] [Received: 12/02/2008] [Revised: 02/10/2009] [Accepted: 02/12/2009] [Indexed: 11/19/2022]
Abstract
The Escherichia coli chromosome contains two opposed sets of unidirectional DNA replication pause (Ter) sites that, according to the replication fork trap theory, control the termination of chromosome replication by restricting replication fork fusion to the terminus region. In contrast, a recent hypothesis suggested that termination occurs at the dif locus instead. Using two-dimensional agarose gel electrophoresis, we examined DNA replication intermediates at the Ter sites and at dif in wild-type cells. Two definitive signatures of site-specific termination (specific replication fork arrest and converging replication forks) were clearly detected at Ter sites, but not at dif. We also detected a significant pause during the latter stages of replication fork convergence at Ter sites. Quantification of fork pausing at the Ter sites in both their native chromosomal context and the plasmid context further supported the fork trap model.
Affiliation(s)
- Iain G Duggin
- Medical Research Council Cancer Cell Unit, Hutchison-Medical Research Council Research Centre, Hills Road, Cambridge, UK.
|
148
|
Balleza E, López-Bojorquez LN, Martínez-Antonio A, Resendis-Antonio O, Lozada-Chávez I, Balderas-Martínez YI, Encarnación S, Collado-Vides J. Regulation by transcription factors in bacteria: beyond description. FEMS Microbiol Rev 2009; 33:133-51. [PMID: 19076632 PMCID: PMC2704942 DOI: 10.1111/j.1574-6976.2008.00145.x] [Citation(s) in RCA: 133] [Impact Index Per Article: 8.9] [Indexed: 01/07/2023]
Abstract
Transcription is an essential step in gene expression and its understanding has been one of the major interests in molecular and cellular biology. By precisely tuning gene expression, transcriptional regulation determines the molecular machinery for developmental plasticity, homeostasis and adaptation. In this review, we convey the main ideas or concepts behind regulation by transcription factors and give just enough examples to sustain these main ideas, thus avoiding a classical enumeration of facts. We review recent concepts and developments: cis elements and trans regulatory factors, chromosome organization and structure, transcriptional regulatory networks (TRNs) and transcriptomics. We also summarize new important discoveries that will probably affect the direction of research in gene regulation: epigenetics and stochasticity in transcriptional regulation, synthetic circuits and plasticity and evolution of TRNs. Many of the new discoveries in gene regulation are not extensively tested with wetlab approaches. Consequently, we review this broad area in Inference of TRNs and Dynamical Models of TRNs. Finally, we have stepped backwards to trace the origins of these modern concepts, synthesizing their history in a timeline schema.
Affiliation(s)
- Enrique Balleza
- Programa de Genómica Computacional, Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, Cuernavaca, Morelos, Mexico
|
149
|
Koonin EV. Evolution of genome architecture. Int J Biochem Cell Biol 2009; 41:298-306. [PMID: 18929678 PMCID: PMC3272702 DOI: 10.1016/j.biocel.2008.09.015] [Citation(s) in RCA: 143] [Impact Index Per Article: 9.5] [Received: 07/11/2008] [Revised: 09/16/2008] [Accepted: 09/16/2008] [Indexed: 11/26/2022]
Abstract
Charles Darwin believed that all traits of organisms have been honed to near perfection by natural selection. The empirical basis underlying Darwin's conclusions consisted of numerous observations made by him and other naturalists on the exquisite adaptations of animals and plants to their natural habitats and on the impressive results of artificial selection. Darwin fully appreciated the importance of heredity but was unaware of the nature and, in fact, the very existence of genomes. A century and a half after the publication of the "Origin", we have the opportunity to draw conclusions from the comparisons of hundreds of genome sequences from all walks of life. These comparisons suggest that the dominant mode of genome evolution is quite different from that of the phenotypic evolution. The genomes of vertebrates, those purported paragons of biological perfection, turned out to be veritable junkyards of selfish genetic elements where only a small fraction of the genetic material is dedicated to encoding biologically relevant information. In sharp contrast, genomes of microbes and viruses are incomparably more compact, with most of the genetic material assigned to distinct biological functions. However, even in these genomes, the specific genome organization (gene order) is poorly conserved. The results of comparative genomics lead to the conclusion that the genome architecture is not a straightforward result of continuous adaptation but rather is determined by the balance between the selection pressure, that is itself dependent on the effective population size and mutation rate, the level of recombination, and the activity of selfish elements. Although genes and, in many cases, multigene regions of genomes possess elaborate architectures that ensure regulation of expression, these arrangements are evolutionarily volatile and typically change substantially even on short evolutionary scales when gene sequences diverge minimally. 
Thus, the observed genome architectures are, mostly, products of neutral processes or epiphenomena of more general selective processes, such as selection for genome streamlining in successful lineages with large populations. Selection for specific gene arrangements (elements of genome architecture) seems only to modulate the results of these processes.
Affiliation(s)
- Eugene V Koonin
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, 8600 Rockville Pike, Bethesda, MD 20894, USA.
|
150
|
Necsulea A, Guillet C, Cadoret JC, Prioleau MN, Duret L. The relationship between DNA replication and human genome organization. Mol Biol Evol 2009; 26:729-41. [PMID: 19126867 DOI: 10.1093/molbev/msn303] [Citation(s) in RCA: 41] [Impact Index Per Article: 2.7] [Indexed: 12/23/2022]
Abstract
Assessment of the impact of DNA replication on genome architecture in Eukaryotes has long been hampered by the scarcity of experimental data. Recent work, relying on computational predictions of origins of replication, suggested that replication might be a major determinant of gene organization in human (Huvet et al. 2007. Human gene organization driven by the coordination of replication and transcription. Genome Res. 17:1278-1285). Here, we address this question by analyzing the first large-scale data set of experimentally determined origins of replication in human: 283 origins identified in HeLa cells, in 1% of the genome covered by ENCODE regions (Cadoret et al. 2008. Genome-wide studies highlight indirect links between human replication origins and gene regulation. Proc Natl Acad Sci USA. 105:15837-15842). We show that origins of replication are not randomly distributed as they display significant overlap with promoter regions and CpG islands. The hypothesis of a selective pressure to avoid frontal collisions between replication and transcription polymerases is not supported by experimental data as we find no evidence for gene orientation bias in the proximity of origins of replication. The lack of a significant orientation bias remains manifest even when considering only genes expressed at a high rate, or in a wide number of tissues, and is not affected by the regional replication timing. Gene expression breadth does not appear to be correlated with the distance from the origins of replication. We conclude that the impact of DNA replication on human genome organization is considerably weaker than previously proposed.
|