1
|
Ageorges V, Monteiro R, Leroy S, Burgess CM, Pizza M, Chaucheyras-Durand F, Desvaux M. Molecular determinants of surface colonisation in diarrhoeagenic Escherichia coli (DEC): from bacterial adhesion to biofilm formation. FEMS Microbiol Rev 2021; 44:314-350. [PMID: 32239203 DOI: 10.1093/femsre/fuaa008] [Citation(s) in RCA: 27] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2019] [Accepted: 03/31/2020] [Indexed: 12/11/2022] Open
Abstract
Escherichia coli is primarily known as a commensal colonising the gastrointestinal tract of infants very early in life but some strains being responsible for diarrhoea, which can be especially severe in young children. Intestinal pathogenic E. coli include six pathotypes of diarrhoeagenic E. coli (DEC), namely, the (i) enterotoxigenic E. coli, (ii) enteroaggregative E. coli, (iii) enteropathogenic E. coli, (iv) enterohemorragic E. coli, (v) enteroinvasive E. coli and (vi) diffusely adherent E. coli. Prior to human infection, DEC can be found in natural environments, animal reservoirs, food processing environments and contaminated food matrices. From an ecophysiological point of view, DEC thus deal with very different biotopes and biocoenoses all along the food chain. In this context, this review focuses on the wide range of surface molecular determinants acting as surface colonisation factors (SCFs) in DEC. In the first instance, SCFs can be broadly discriminated into (i) extracellular polysaccharides, (ii) extracellular DNA and (iii) surface proteins. Surface proteins constitute the most diverse group of SCFs broadly discriminated into (i) monomeric SCFs, such as autotransporter (AT) adhesins, inverted ATs, heat-resistant agglutinins or some moonlighting proteins, (ii) oligomeric SCFs, namely, the trimeric ATs and (iii) supramolecular SCFs, including flagella and numerous pili, e.g. the injectisome, type 4 pili, curli chaperone-usher pili or conjugative pili. This review also details the gene regulatory network of these numerous SCFs at the various stages as it occurs from pre-transcriptional to post-translocational levels, which remains to be fully elucidated in many cases.
Collapse
Affiliation(s)
- Valentin Ageorges
- Université Clermont Auvergne, INRAE, MEDiS, F-63000 Clermont-Ferrand, France
| | - Ricardo Monteiro
- Université Clermont Auvergne, INRAE, MEDiS, F-63000 Clermont-Ferrand, France.,GSK, Via Fiorentina 1, 53100 Siena, Italy
| | - Sabine Leroy
- Université Clermont Auvergne, INRAE, MEDiS, F-63000 Clermont-Ferrand, France
| | - Catherine M Burgess
- Food Safety Department, Teagasc Food Research Centre, Ashtown, Dublin 15, Ireland
| | | | - Frédérique Chaucheyras-Durand
- Université Clermont Auvergne, INRAE, MEDiS, F-63000 Clermont-Ferrand, France.,Lallemand Animal Nutrition SAS, F-31702 Blagnac Cedex, France
| | - Mickaël Desvaux
- Université Clermont Auvergne, INRAE, MEDiS, F-63000 Clermont-Ferrand, France
| |
Collapse
|
2
|
Retchless AC, Lawrence JG. Quantification of codon selection for comparative bacterial genomics. BMC Genomics 2011; 12:374. [PMID: 21787402 PMCID: PMC3162537 DOI: 10.1186/1471-2164-12-374] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2011] [Accepted: 07/25/2011] [Indexed: 11/16/2022] Open
Abstract
Background Statistics measuring codon selection seek to compare genes by their sensitivity to selection for translational efficiency, but existing statistics lack a model for testing the significance of differences between genes. Here, we introduce a new statistic for measuring codon selection, the Adaptive Codon Enrichment (ACE). Results This statistic represents codon usage bias in terms of a probabilistic distribution, quantifying the extent that preferred codons are over-represented in the gene of interest relative to the mean and variance that would result from stochastic sampling of codons. Expected codon frequencies are derived from the observed codon usage frequencies of a broad set of genes, such that they are likely to reflect nonselective, genome wide influences on codon usage (e.g. mutational biases). The relative adaptiveness of synonymous codons is deduced from the frequency of codon usage in a pre-selected set of genes relative to the expected frequency. The ACE can predict both transcript abundance during rapid growth and the rate of synonymous substitutions, with accuracy comparable to or greater than existing metrics. We further examine how the composition of reference gene sets affects the accuracy of the statistic, and suggest methods for selecting appropriate reference sets for any genome, including bacteriophages. Finally, we demonstrate that the ACE may naturally be extended to quantify the genome-wide influence of codon selection in a manner that is sensitive to a large fraction of codons in the genome. This reveals substantial variation among genomes, correlated with the tRNA gene number, even among groups of bacteria where previously proposed whole-genome measures show little variation. Conclusions The statistical framework of the ACE allows rigorous comparison of the level of codon selection acting on genes, both within a genome and between genomes.
Collapse
Affiliation(s)
- Adam C Retchless
- Department of Biological Sciences, University of Pittsburgh, Pittsburgh, PA 15260, USA
| | | |
Collapse
|
3
|
Abstract
Across all kingdoms of biological life, protein-coding genes exhibit unequal usage of synonymous codons. Although alternative theories abound, translational selection has been accepted as an important mechanism that shapes the patterns of codon usage in prokaryotes and simple eukaryotes. Here we analyze patterns of codon usage across 74 diverse bacteriophages that infect E. coli, P. aeruginosa, and L. lactis as their primary host. We use the concept of a “genome landscape,” which helps reveal non-trivial, long-range patterns in codon usage across a genome. We develop a series of randomization tests that allow us to interrogate the significance of one aspect of codon usage, such as GC content, while controlling for another aspect, such as adaptation to host-preferred codons. We find that 33 phage genomes exhibit highly non-random patterns in their GC3-content, use of host-preferred codons, or both. We show that the head and tail proteins of these phages exhibit significant bias towards host-preferred codons, relative to the non-structural phage proteins. Our results support the hypothesis of translational selection on viral genes for host-preferred codons, over a broad range of bacteriophages. Any protein can be encoded by multiple, synonymous spellings. But organisms typically prefer one spelling over another—a phenomenon known as codon bias. Codon bias is generally understood to result from selection for synonymous spellings that increase the rate and accuracy of protein translation. In this work, we have examined the complete genomes of all sequenced viruses that infect the bacteria E. coli, P. aeruginosa, and L. lactis, and have found that many of these viral genomes also exhibit codon bias. Moreover, the degree of codon bias varies across the viral genome, as visualized using a technique called a “genome landscape.” By comparing the observed genomes to randomly drawn genomes, we demonstrate that the regions of high codon bias in these viral genomes often coincide with regions encoding structural proteins. Thus, the proteins that a virus needs to produce in high copy number utilize the same encoding as its host organism does for highly expressed proteins. Our results extend the translational theory of codon bias to the viral kingdom: parts of the viral genome are selected to obey the preferences of its host.
Collapse
|
4
|
Plotkin JB, Dushoff J. Codon bias and frequency-dependent selection on the hemagglutinin epitopes of influenza A virus. Proc Natl Acad Sci U S A 2003; 100:7152-7. [PMID: 12748378 PMCID: PMC165845 DOI: 10.1073/pnas.1132114100] [Citation(s) in RCA: 140] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022] Open
Abstract
Although the surface proteins of human influenza A virus evolve rapidly and continually produce antigenic variants, the internal viral genes acquire mutations very gradually. In this paper, we analyze the sequence evolution of three influenza A genes over the past two decades. We study codon usage as a discriminating signature of gene- and even residue-specific diversifying and purifying selection. Nonrandom codon choice can increase or decrease the effective local substitution rate. We demonstrate that the codons of hemagglutinin, particularly those in the antibody-combining regions, are significantly biased toward substitutional point mutations relative to the codons of other influenza virus genes. We discuss the evolutionary interpretation and implications of these biases for hemagglutinin's antigenic evolution. We also introduce information-theoretic methods that use sequence data to detect regions of recent positive selection and potential protein conformational changes.
Collapse
Affiliation(s)
- Joshua B Plotkin
- Institute for Advanced Study, Olden Lane, Princeton, NJ 08540, USA
| | | |
Collapse
|
5
|
Eyre-Walker A. The close proximity of Escherichia coli genes: consequences for stop codon and synonymous codon use. J Mol Evol 1996; 42:73-8. [PMID: 8919857 DOI: 10.1007/bf02198830] [Citation(s) in RCA: 34] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/03/2023]
Abstract
It is shown that synonymous codon usage is less biased in favor of those codons preferred by highly expressed genes at the end of Escherichia coli genes than in the middle. This appears to be due to the close proximity of many E. coli genes. It is shown that a substantial number of genes overlap either the Shine-Dalgarno sequence or the coding sequence of the next gene on the chromosome and that the codons that overlap have lower synonymous codon bias than those which do not. It is also shown that there is an increase in the frequency of A-ending codons, and a decrease in the frequency of G-ending codons at the end of E. coli genes that lie close to another gene. It is suggested that these trends in composition could be associated with selection against the formation of mRNA secondary structure near the start of the next gene on the chromosome. Stop codon use is also affected by the close proximity of genes; many genes are forced to use TGA and TAG stop codons because they terminate either within the Shine-Dalgarno or coding sequence of the next gene on the chromosome. The implications these results have for the evolution of synonymous codon use are discussed.
Collapse
MESH Headings
- Base Sequence
- Binding Sites
- Codon, Terminator/genetics
- DNA, Bacterial/genetics
- DNA, Bacterial/metabolism
- Escherichia coli/genetics
- Escherichia coli/metabolism
- Evolution, Molecular
- Gene Expression
- Genes, Bacterial
- Molecular Sequence Data
- RNA, Bacterial/genetics
- RNA, Bacterial/metabolism
- RNA, Ribosomal, 16S/genetics
- RNA, Ribosomal, 16S/metabolism
- Ribosomes/metabolism
Collapse
Affiliation(s)
- A Eyre-Walker
- Department of Biological Sciences, Rutgers University, Piscataway, NJ 08855-1059, USA
| |
Collapse
|
6
|
Bisercić M, Ochman H. The ancestry of insertion sequences common to Escherichia coli and Salmonella typhimurium. J Bacteriol 1993; 175:7863-8. [PMID: 8253675 PMCID: PMC206963 DOI: 10.1128/jb.175.24.7863-7868.1993] [Citation(s) in RCA: 38] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023] Open
Abstract
Despite very restricted gene exchange between Escherichia coli and Salmonella typhimurium, both species harbor several of the same classes of insertion sequences. To determine whether the present-day distribution of these transposable elements is due to common ancestry or to horizontal transfer, we determined the sequences of IS1 and IS200 from natural isolates of S. typhimurium and E. coli. One strain of S. typhimurium harbored an IS1 element identical to that originally recovered from E. coli, suggesting that the element was recently transferred between these two species. The level of sequence divergence between copies of IS200 from E. coli and S. typhimurium ranged from 9.5 to 10.7%, indicating that IS200, unlike IS1, has not been repeatedly transferred between these enteric species since E. coli and S. typhimurium diverged from a common ancestor. Levels of variability in IS1 and IS200 for strains of E. coli and S. typhimurium show that each class of insertion sequence has a characteristic pattern of transposition within and among host genomes.
Collapse
Affiliation(s)
- M Bisercić
- Department of Biology, University of Rochester, New York 14627
| | | |
Collapse
|
7
|
Fennoy SL, Bailey-Serres J. Synonymous codon usage in Zea mays L. nuclear genes is varied by levels of C and G-ending codons. Nucleic Acids Res 1993; 21:5294-300. [PMID: 8265340 PMCID: PMC310561 DOI: 10.1093/nar/21.23.5294] [Citation(s) in RCA: 67] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023] Open
Abstract
A multivariate statistical method called correspondence analysis was used to examine the codon usage of one-hundred-and-one nuclear genes of maize (Zea mays L.). Forty percent of the variation in codon usage was due to bias toward G or C-ending versus A or U-ending codons. Differences in levels of G-ending codons showed the weakest correlation with the major codon usage bias. The bias toward C or U versus A or G in the silent third nucleotide position of synonymous codons accounted for approximately 10% of the variation in codon usage. The G+C content of the silent third nucleotide position of coding regions was not strongly correlated with G+C content of introns. Codon usage was strongly biased toward codons ending in G or C for a number of highly expressed genes including most light-regulated chloroplast proteins, ABA-induced proteins, histones, and anthocyanin biosynthetic enzymes. Codon usage of genes encoding storage proteins and regulatory proteins, such as transposases, kinases, phosphatases and transcription factors, was more random than that of genes encoding cytosolic enzymes with similar bias toward G or C-ending codons. Codon usage in maize may reflect both regional bias on nucleotide composition and selection on the silent third nucleotide position.
Collapse
Affiliation(s)
- S L Fennoy
- Department of Botany and Plant Sciences, University of California, Riverside 92521-0124
| | | |
Collapse
|
8
|
Argentine JA, James AA. Codon preference of Aedes aegypti and Aedes albopictus. INSECT MOLECULAR BIOLOGY 1993; 1:189-194. [PMID: 8269097 DOI: 10.1111/j.1365-2583.1993.tb00091.x] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/22/2023]
Abstract
The codon bias of two Aedes mosquito species was examined using a sign test. In general, there appeared to be some preference for C + G at the third base position, although this was not statistically significant. While amino acids such as phenylalanine and tyrosine clearly displayed biases, others such as valine and serine appeared to have little or no bias for any particular codon. Three homologous genes of Aedes aegypti and Drosophila melanogaster were compared using the chi-square test and the codon bias of the two species compared. Drosophila melanogaster was found to have a much stronger bias for C + G at the third base position compared to Aedes. The implications and usefulness of the codon bias are discussed.
Collapse
Affiliation(s)
- J A Argentine
- Department of Molecular Biology and Biochemistry, University of California, Irvine 92717
| | | |
Collapse
|
9
|
Hartl DL, Lozovskaya ER, Lawrence JG. Nonautonomous transposable elements in prokaryotes and eukaryotes. Genetica 1992; 86:47-53. [PMID: 1334917 DOI: 10.1007/bf00133710] [Citation(s) in RCA: 25] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022]
Abstract
Defective (nonautonomous) copies of transposable elements are relatively common in the genomes of eukaryotes but less common in the genomes of prokaryotes. With regard to transposable elements that exist exclusively in the form of DNA (nonretroviral transposable elements), nonautonomous elements may play a role in the regulation of transposition. In prokaryotes, plasmid-mediated horizontal transmission probably imposes a selection against nonautonomous elements, since nonautonomous elements are incapable of mobilizing themselves. The lower relative frequency of nonautonomous elements in prokaryotes may also reflect the coupling of transcription and translation, which may bias toward the cis activation of transposition. The cis bias we suggest need not be absolute in order to militate against the long-term maintenance of prokaryotic elements unable to transpose on their own. Furthermore, any cis bias in transposition would also decrease the opportunity for trans repression of transposition by nonautonomous elements.
Collapse
Affiliation(s)
- D L Hartl
- Department of Genetics, Washington University School of Medicine, St. Louis, MO 63110
| | | | | |
Collapse
|
10
|
Abstract
Synonymous and nonsynonymous substitution rates at the loci encoding glyceraldehyde-3-phosphate dehydrogenase (gap) and outer membrane protein 3A (ompA) were examined in 12 species of enteric bacteria. By examining homologous sequences in species of varying degrees of relatedness and of known phylogenetic relationships, we analyzed the patterns of synonymous and nonsynonymous substitutions within and among these genes. Although both loci accumulate synonymous substitutions at reduced rates due to codon usage bias, portions of the gap and ompA reading frames show significant deviation in synonymous substitution rates not attributable to local codon bias. A paucity of synonymous substitutions in portions of the ompA gene may reflect selection for a novel mRNA secondary structure. In addition, these studies allow comparisons of homologous protein-coding sequences (gap) in plants, animals, and bacteria, revealing differences in evolutionary constraints on this glycolytic enzyme in these lineages.
Collapse
Affiliation(s)
- J G Lawrence
- Department of Genetics, Washington University School of Medicine, St. Louis, MO 63110
| | | | | |
Collapse
|