51
|
Murray GGR, Weinert LA, Rhule EL, Welch JJ. The Phylogeny of Rickettsia Using Different Evolutionary Signatures: How Tree-Like is Bacterial Evolution? Syst Biol 2015; 65:265-79. [PMID: 26559010 PMCID: PMC4748751 DOI: 10.1093/sysbio/syv084] [Citation(s) in RCA: 56] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2015] [Accepted: 11/04/2015] [Indexed: 11/14/2022] Open
Abstract
Rickettsia is a genus of intracellular bacteria whose hosts and transmission strategies are both impressively diverse, and this is reflected in a highly dynamic genome. Some previous studies have described the evolutionary history of Rickettsia as non-tree-like, due to incongruity between phylogenetic reconstructions using different portions of the genome. Here, we reconstruct the Rickettsia phylogeny using whole-genome data, including two new genomes from previously unsampled host groups. We find that a single topology, which is supported by multiple sources of phylogenetic signal, well describes the evolutionary history of the core genome. We do observe extensive incongruence between individual gene trees, but analyses of simulations over a single topology and interspersed partitions of sites show that this is more plausibly attributed to systematic error than to horizontal gene transfer. Some conflicting placements also result from phylogenetic analyses of accessory genome content (i.e., gene presence/absence), but we argue that these are also due to systematic error, stemming from convergent genome reduction, which cannot be accommodated by existing phylogenetic methods. Our results show that, even within a single genus, tests for gene exchange based on phylogenetic incongruence may be susceptible to false positives.
Collapse
Affiliation(s)
- Gemma G R Murray
- Department of Genetics, University of Cambridge, Downing Street, Cambridge CB2 3EH, UK; and
| | - Lucy A Weinert
- Department of Veterinary Medicine, University of Cambridge, Madingley Road, Cambridge CB3 0ES, UK
| | - Emma L Rhule
- Department of Genetics, University of Cambridge, Downing Street, Cambridge CB2 3EH, UK; and
| | - John J Welch
- Department of Genetics, University of Cambridge, Downing Street, Cambridge CB2 3EH, UK; and
| |
Collapse
|
52
|
Xu Y, Liu B, Gröndahl-Yli-Hannuksila K, Tan Y, Feng L, Kallonen T, Wang L, Peng D, He Q, Wang L, Zhang S. Whole-genome sequencing reveals the effect of vaccination on the evolution of Bordetella pertussis. Sci Rep 2015; 5:12888. [PMID: 26283022 PMCID: PMC4539551 DOI: 10.1038/srep12888] [Citation(s) in RCA: 39] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2015] [Accepted: 07/10/2015] [Indexed: 12/11/2022] Open
Abstract
Herd immunity can potentially induce a change of circulating viruses. However, it remains largely unknown that how bacterial pathogens adapt to vaccination. In this study, Bordetella pertussis, the causative agent of whooping cough, was selected as an example to explore possible effect of vaccination on the bacterial pathogen. We sequenced and analysed the complete genomes of 40 B. pertussis strains from Finland and China, as well as 11 previously sequenced strains from the Netherlands, where different vaccination strategies have been used over the past 50 years. The results showed that the molecular clock moved at different rates in these countries and in distinct periods, which suggested that evolution of the B. pertussis population was closely associated with the country vaccination coverage. Comparative whole-genome analyses indicated that evolution in this human-restricted pathogen was mainly characterised by ongoing genetic shift and gene loss. Furthermore, 116 SNPs were specifically detected in currently circulating ptxP3-containing strains. The finding might explain the successful emergence of this lineage and its spread worldwide. Collectively, our results suggest that the immune pressure of vaccination is one major driving force for the evolution of B. pertussis, which facilitates further exploration of the pathogenicity of B. pertussis.
Collapse
Affiliation(s)
- Yinghua Xu
- Key Laboratory of the Ministry of Health for Research on Quality and Standardization of Biotech Products, National Institutes of Food and Drug Control, Beijing 100050, P. R. China
| | - Bin Liu
- 1] TEDA School of Biological Sciences and Biotechnology, Nankai University, Tianjin 300457, P.R. China [2] Key Laboratory of Molecular Microbiology and Technology, Ministry of Education, 23 Hongda Street, Tianjin 300457, P. R. China
| | | | - Yajun Tan
- Key Laboratory of the Ministry of Health for Research on Quality and Standardization of Biotech Products, National Institutes of Food and Drug Control, Beijing 100050, P. R. China
| | - Lu Feng
- 1] TEDA School of Biological Sciences and Biotechnology, Nankai University, Tianjin 300457, P.R. China [2] Key Laboratory of Molecular Microbiology and Technology, Ministry of Education, 23 Hongda Street, Tianjin 300457, P. R. China
| | - Teemu Kallonen
- Department of Medical Microbiology and Immunology, Turku University, Turku 20520, Finland
| | - Lichan Wang
- Key Laboratory of the Ministry of Health for Research on Quality and Standardization of Biotech Products, National Institutes of Food and Drug Control, Beijing 100050, P. R. China
| | - Ding Peng
- 1] TEDA School of Biological Sciences and Biotechnology, Nankai University, Tianjin 300457, P.R. China [2] Key Laboratory of Molecular Microbiology and Technology, Ministry of Education, 23 Hongda Street, Tianjin 300457, P. R. China
| | - Qiushui He
- 1] Department of Medical Microbiology and Immunology, Turku University, Turku 20520, Finland [2] Department of Infectious Disease Surveillance and Control, National Institute for Health and Welfare, Turku 20520, Finland [3] Department of Medical Microbiology, Capital Medical University, Beijing 100069, P. R. China
| | - Lei Wang
- 1] TEDA School of Biological Sciences and Biotechnology, Nankai University, Tianjin 300457, P.R. China [2] Key Laboratory of Molecular Microbiology and Technology, Ministry of Education, 23 Hongda Street, Tianjin 300457, P. R. China [3] State Key Laboratory of Medicinal Chemical Biology, Nankai University 300457, Tianjin, P. R. China
| | - Shumin Zhang
- Key Laboratory of the Ministry of Health for Research on Quality and Standardization of Biotech Products, National Institutes of Food and Drug Control, Beijing 100050, P. R. China
| |
Collapse
|
53
|
Bolotin E, Hershberg R. Gene Loss Dominates As a Source of Genetic Variation within Clonal Pathogenic Bacterial Species. Genome Biol Evol 2015; 7:2173-87. [PMID: 26163675 PMCID: PMC4558853 DOI: 10.1093/gbe/evv135] [Citation(s) in RCA: 50] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/14/2023] Open
Abstract
Some of the most dangerous pathogens such as Mycobacterium tuberculosis and Yersinia pestis evolve clonally. This means that little or no recombination occurs between strains belonging to these species. Paradoxically, although different members of these species show extreme sequence similarity of orthologous genes, some show considerable intraspecies phenotypic variation, the source of which remains elusive. To examine the possible sources of phenotypic variation within clonal pathogenic bacterial species, we carried out an extensive genomic and pan-genomic analysis of the sources of genetic variation available to a large collection of clonal and nonclonal pathogenic bacterial species. We show that while nonclonal species diversify through a combination of changes to gene sequences, gene loss and gene gain, gene loss completely dominates as a source of genetic variation within clonal species. Indeed, gene loss is so prevalent within clonal species as to lead to levels of gene content variation comparable to those found in some nonclonal species that are much more diverged in their gene sequences and that acquire a substantial number of genes horizontally. Gene loss therefore needs to be taken into account as a potential dominant source of phenotypic variation within clonal bacterial species.
Collapse
Affiliation(s)
- Evgeni Bolotin
- Rachel & Menachem Mendelovitch Evolutionary Processes of Mutation & Natural Selection Research Laboratory, Department of Genetics and Developmental Biology, The Ruth and Bruce Rappaport Faculty of Medicine, Technion-Israel Institute of Technology, Haifa, Israel
| | - Ruth Hershberg
- Rachel & Menachem Mendelovitch Evolutionary Processes of Mutation & Natural Selection Research Laboratory, Department of Genetics and Developmental Biology, The Ruth and Bruce Rappaport Faculty of Medicine, Technion-Israel Institute of Technology, Haifa, Israel
| |
Collapse
|
54
|
da Silva SM, Amaral C, Neves SS, Santos C, Pimentel C, Rodrigues-Pousada C. An HcpR paralog of Desulfovibrio gigas provides protection against nitrosative stress. FEBS Open Bio 2015; 5:594-604. [PMID: 26273559 PMCID: PMC4534486 DOI: 10.1016/j.fob.2015.07.001] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2015] [Accepted: 07/01/2015] [Indexed: 01/12/2023] Open
Abstract
Desulfovibrio gigas genome encodes two HcpR paralogs, HcpR1 and HcpR2. Cells lacking HcpR1 are less tolerant to NO. HcpR1 regulates the expression of several genes related to nitrogen metabolism. Phylogenetic analyses indicate that the presence of HcpR paralogs is a common finding among Desulfovibrio species.
Desulfovibrio gigas belongs to the group of sulfate reducing bacteria (SRB). These ubiquitous and metabolically versatile microorganisms are often exposed to reactive nitrogen species (RNS). Nonetheless, the mechanisms and regulatory elements involved in nitrosative stress protection are still poorly understood. The transcription factor HcpR has emerged as a putative regulator of nitrosative stress response among anaerobic bacteria. HcpR is known to orchestrate the expression of the hybrid cluster protein gene, hcp, proposed to be involved in cellular defense against RNS. According to phylogenetic analyses, the occurrence of hcpR paralog genes is a common feature among several Desulfovibrio species. Within the D. gigas genome we have identified two HcpR-related sequences. One of these sequences, hcpR1, was found in the close vicinity of the hcp gene and this finding prompted us to proceed with its functional characterization. We observed that the growth of a D. gigas strain lacking hcpR1 is severely impaired under nitrosative stress. An in silico search revealed several putative targets of HcpR1 that were experimentally validated. The fact that HcpR1 regulates several genes encoding proteins involved in nitrite and nitrate metabolism, together with the sensitive growth phenotype to NO displayed by an hcpR1 mutant strain, strongly supports a relevant role of this factor under nitrosative stress. Moreover, the finding that several Desulfovibrio species possess HcpR paralogs, which have been transmitted vertically in the evolution and diversification of the genus, suggests that these sequences may confer adaptive or survival advantage to these organisms, possibly by increasing their tolerance to nitrosative stress.
Collapse
Key Words
- BI, Bayesian inference
- BS, bootstrap
- CRP/FNR, cAMP receptor protein/fumarate and nitrate reductase regulatory protein
- Desulfovibrio
- Frdx, ferredoxin
- GSNO, S-nitrosoglutathione
- HGT, horizontal gene transfer
- Hcp, hybrid cluster protein
- HcpR
- ML, maximum likelihood
- MP, maximum parsimony
- Molecular phylogeny
- NO, nitric oxide
- Nitrosative stress
- PP, posterior probability
- RNS, reactive nitrogen species
- ROO, rubredoxin oxygen reductase
- SRB, sulfate reducing bacteria
- Sulfate reducing bacteria
- Transcription regulation
Collapse
Affiliation(s)
- Sofia M da Silva
- Instituto de Tecnologia Química e Biológica António Xavier, Universidade Nova de Lisboa, Av. da República, 2780-157 Oeiras, Portugal
| | - Catarina Amaral
- Instituto de Tecnologia Química e Biológica António Xavier, Universidade Nova de Lisboa, Av. da República, 2780-157 Oeiras, Portugal
| | - Susana S Neves
- Instituto de Tecnologia Química e Biológica António Xavier, Universidade Nova de Lisboa, Av. da República, 2780-157 Oeiras, Portugal
| | - Cátia Santos
- Instituto de Tecnologia Química e Biológica António Xavier, Universidade Nova de Lisboa, Av. da República, 2780-157 Oeiras, Portugal
| | - Catarina Pimentel
- Instituto de Tecnologia Química e Biológica António Xavier, Universidade Nova de Lisboa, Av. da República, 2780-157 Oeiras, Portugal
| | - Claudina Rodrigues-Pousada
- Instituto de Tecnologia Química e Biológica António Xavier, Universidade Nova de Lisboa, Av. da República, 2780-157 Oeiras, Portugal
| |
Collapse
|
55
|
Rao Q, Rollat-Farnier PA, Zhu DT, Santos-Garcia D, Silva FJ, Moya A, Latorre A, Klein CC, Vavre F, Sagot MF, Liu SS, Mouton L, Wang XW. Genome reduction and potential metabolic complementation of the dual endosymbionts in the whitefly Bemisia tabaci. BMC Genomics 2015; 16:226. [PMID: 25887812 PMCID: PMC4438442 DOI: 10.1186/s12864-015-1379-6] [Citation(s) in RCA: 71] [Impact Index Per Article: 7.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2014] [Accepted: 02/21/2015] [Indexed: 02/07/2023] Open
Abstract
BACKGROUND The whitefly Bemisia tabaci is an important agricultural pest with global distribution. This phloem-sap feeder harbors a primary symbiont, "Candidatus Portiera aleyrodidarum", which compensates for the deficient nutritional composition of its food sources, and a variety of secondary symbionts. Interestingly, all of these secondary symbionts are found in co-localization with the primary symbiont within the same bacteriocytes, which should favor the evolution of strong interactions between symbionts. RESULTS In this paper, we analyzed the genome sequences of the primary symbiont Portiera and of the secondary symbiont Hamiltonella in the B. tabaci Mediterranean (MED) species in order to gain insight into the metabolic role of each symbiont in the biology of their host. The genome sequences of the uncultured symbionts Portiera and Hamiltonella were obtained from one single bacteriocyte of MED B. tabaci. As already reported, the genome of Portiera is highly reduced (357 kb), but has kept a number of genes encoding most essential amino-acids and carotenoids. On the other hand, Portiera lacks almost all the genes involved in the synthesis of vitamins and cofactors. Moreover, some pathways are incomplete, notably those involved in the synthesis of some essential amino-acids. Interestingly, the genome of Hamiltonella revealed that this secondary symbiont can not only provide vitamins and cofactors, but also complete the missing steps of some of the pathways of Portiera. In addition, some critical amino-acid biosynthetic genes are missing in the two symbiotic genomes, but analysis of whitefly transcriptome suggests that the missing steps may be performed by the whitefly itself or its microbiota. CONCLUSIONS These data suggest that Portiera and Hamiltonella are not only complementary but could also be mutually dependent to provide a full complement of nutrients to their host. Altogether, these results illustrate how functional redundancies can lead to gene losses in the genomes of the different symbiotic partners, reinforcing their inter-dependency.
Collapse
Affiliation(s)
- Qiong Rao
- Ministry of Agriculture Key Laboratory of Agricultural Entomology, Institute of Insect Sciences, Zhejiang University, 310058, Hangzhou, China.
- School of Agriculture and Food Science, Zhejiang A & F University, 311300, Lin'an, Zhejiang, China.
| | - Pierre-Antoine Rollat-Farnier
- Université de Lyon, Université Lyon1, Laboratoire de Biométrie et Biologie Evolutive, UMR CNRS 558, 69622, Villeurbanne, Cedex, France.
- Inria Grenoble Rhône-Alpes, Grenoble, France.
| | - Dan-Tong Zhu
- Ministry of Agriculture Key Laboratory of Agricultural Entomology, Institute of Insect Sciences, Zhejiang University, 310058, Hangzhou, China.
| | - Diego Santos-Garcia
- Institut Cavanilles de Biodiversitat i Biologia Evolutiva, Universitat de València, Valencia, Spain.
| | - Francisco J Silva
- Institut Cavanilles de Biodiversitat i Biologia Evolutiva, Universitat de València, Valencia, Spain.
- Unidad Mixta de Investigación en Genómica y Salud FISABIO-Salud Pública and Universitat de València, Valencia, Spain.
| | - Andrés Moya
- Institut Cavanilles de Biodiversitat i Biologia Evolutiva, Universitat de València, Valencia, Spain.
- Unidad Mixta de Investigación en Genómica y Salud FISABIO-Salud Pública and Universitat de València, Valencia, Spain.
| | - Amparo Latorre
- Institut Cavanilles de Biodiversitat i Biologia Evolutiva, Universitat de València, Valencia, Spain.
- Unidad Mixta de Investigación en Genómica y Salud FISABIO-Salud Pública and Universitat de València, Valencia, Spain.
| | - Cecilia C Klein
- Université de Lyon, Université Lyon1, Laboratoire de Biométrie et Biologie Evolutive, UMR CNRS 558, 69622, Villeurbanne, Cedex, France.
- Inria Grenoble Rhône-Alpes, Grenoble, France.
| | - Fabrice Vavre
- Université de Lyon, Université Lyon1, Laboratoire de Biométrie et Biologie Evolutive, UMR CNRS 558, 69622, Villeurbanne, Cedex, France.
| | - Marie-France Sagot
- Université de Lyon, Université Lyon1, Laboratoire de Biométrie et Biologie Evolutive, UMR CNRS 558, 69622, Villeurbanne, Cedex, France.
- Inria Grenoble Rhône-Alpes, Grenoble, France.
| | - Shu-Sheng Liu
- Ministry of Agriculture Key Laboratory of Agricultural Entomology, Institute of Insect Sciences, Zhejiang University, 310058, Hangzhou, China.
| | - Laurence Mouton
- Université de Lyon, Université Lyon1, Laboratoire de Biométrie et Biologie Evolutive, UMR CNRS 558, 69622, Villeurbanne, Cedex, France.
| | - Xiao-Wei Wang
- Ministry of Agriculture Key Laboratory of Agricultural Entomology, Institute of Insect Sciences, Zhejiang University, 310058, Hangzhou, China.
| |
Collapse
|
56
|
Rollat-Farnier PA, Santos-Garcia D, Rao Q, Sagot MF, Silva FJ, Henri H, Zchori-Fein E, Latorre A, Moya A, Barbe V, Liu SS, Wang XW, Vavre F, Mouton L. Two host clades, two bacterial arsenals: evolution through gene losses in facultative endosymbionts. Genome Biol Evol 2015; 7:839-55. [PMID: 25714744 PMCID: PMC5322557 DOI: 10.1093/gbe/evv030] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023] Open
Abstract
Bacterial endosymbiosis is an important evolutionary process in insects, which can harbor both obligate and facultative symbionts. The evolution of these symbionts is driven by evolutionary convergence, and they exhibit among the tiniest genomes in prokaryotes. The large host spectrum of facultative symbionts and the high diversity of strategies they use to infect new hosts probably impact the evolution of their genome and explain why they undergo less severe genomic erosion than obligate symbionts. Candidatus Hamiltonella defensa is suitable for the investigation of the genomic evolution of facultative symbionts because the bacteria are engaged in specific relationships in two clades of insects. In aphids, H. defensa is found in several species with an intermediate prevalence and confers protection against parasitoids. In whiteflies, H. defensa is almost fixed in some species of Bemisia tabaci, which suggests an important role of and a transition toward obligate symbiosis. In this study, comparisons of the genome of H. defensa present in two B. tabaci species (Middle East Asia Minor 1 and Mediterranean) and in the aphid Acyrthosiphon pisum revealed that they belong to two distinct clades and underwent specific gene losses. In aphids, it contains highly virulent factors that could allow protection and horizontal transfers. In whiteflies, the genome lost these factors and seems to have a limited ability to acquire genes. However it contains genes that could be involved in the production of essential nutrients, which is consistent with a primordial role for this symbiont. In conclusion, although both lineages of H. defensa have mutualistic interactions with their hosts, their genomes follow distinct evolutionary trajectories that reflect their phenotype and could have important consequences on their evolvability.
Collapse
Affiliation(s)
- Pierre-Antoine Rollat-Farnier
- Laboratoire de Biométrie et Biologie Evolutive, UMR CNRS 5558, Université de Lyon, Université Lyon1, Villeurbanne, France BAMBOO Research Team, INRIA Grenoble, Rhône-Alpes, France
| | - Diego Santos-Garcia
- Institut Cavanilles de Biodiversitat i Biologia Evolutiva, Universitat de València, Spain
| | - Qiong Rao
- Ministry of Agriculture Key Laboratory of Agricultural Entomology, Institute of Insect Sciences, Zhejiang University, Hangzhou, China School of Agriculture and Food Science, Zhejiang Agriculture and Forestry University, Lin'an, Hangzhou, China
| | - Marie-France Sagot
- Laboratoire de Biométrie et Biologie Evolutive, UMR CNRS 5558, Université de Lyon, Université Lyon1, Villeurbanne, France BAMBOO Research Team, INRIA Grenoble, Rhône-Alpes, France
| | - Francisco J Silva
- Institut Cavanilles de Biodiversitat i Biologia Evolutiva, Universitat de València, Spain Unidad Mixta de Investigación en Genómica y Salud de la Fundación para el Fomento de la Investigación Sanitaria y Biomédica de la Comunidad Valenciana (FISABIO-Salud Pública) y el Instituto Cavanilles de Biodiversitad y Biología Evolutiva (Universitat de València), Valencia, Spain
| | - Hélène Henri
- Laboratoire de Biométrie et Biologie Evolutive, UMR CNRS 5558, Université de Lyon, Université Lyon1, Villeurbanne, France
| | - Einat Zchori-Fein
- Department of Entomology, NeweYa'ar Research Center, Agricultural Research Organization, Ramat Yishay, Israel
| | - Amparo Latorre
- Institut Cavanilles de Biodiversitat i Biologia Evolutiva, Universitat de València, Spain Unidad Mixta de Investigación en Genómica y Salud de la Fundación para el Fomento de la Investigación Sanitaria y Biomédica de la Comunidad Valenciana (FISABIO-Salud Pública) y el Instituto Cavanilles de Biodiversitad y Biología Evolutiva (Universitat de València), Valencia, Spain
| | - Andrés Moya
- Institut Cavanilles de Biodiversitat i Biologia Evolutiva, Universitat de València, Spain Unidad Mixta de Investigación en Genómica y Salud de la Fundación para el Fomento de la Investigación Sanitaria y Biomédica de la Comunidad Valenciana (FISABIO-Salud Pública) y el Instituto Cavanilles de Biodiversitad y Biología Evolutiva (Universitat de València), Valencia, Spain
| | - Valérie Barbe
- CEA/DSV/IG/Genoscope, 2 rue Gaston Cremieux, Evry, France
| | - Shu-Sheng Liu
- Ministry of Agriculture Key Laboratory of Agricultural Entomology, Institute of Insect Sciences, Zhejiang University, Hangzhou, China
| | - Xiao-Wei Wang
- Ministry of Agriculture Key Laboratory of Agricultural Entomology, Institute of Insect Sciences, Zhejiang University, Hangzhou, China
| | - Fabrice Vavre
- Laboratoire de Biométrie et Biologie Evolutive, UMR CNRS 5558, Université de Lyon, Université Lyon1, Villeurbanne, France BAMBOO Research Team, INRIA Grenoble, Rhône-Alpes, France
| | - Laurence Mouton
- Laboratoire de Biométrie et Biologie Evolutive, UMR CNRS 5558, Université de Lyon, Université Lyon1, Villeurbanne, France
| |
Collapse
|
57
|
diCenzo GC, Finan TM. Genetic redundancy is prevalent within the 6.7 Mb Sinorhizobium meliloti genome. Mol Genet Genomics 2015; 290:1345-56. [PMID: 25638282 DOI: 10.1007/s00438-015-0998-6] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2014] [Accepted: 01/17/2015] [Indexed: 01/09/2023]
Abstract
Biological pathways are frequently identified via a genetic loss-of-function approach. While this approach has proven to be powerful, it is imperfect as illustrated by well-studied pathways continuing to have missing steps. One potential limiting factor is the masking of phenotypes through genetic redundancy. The prevalence of genetic redundancy in bacterial species has received little attention, although isolated examples of functionally redundant gene pairs exist. Here, we made use of a strain of Sinorhizobium meliloti whose genome was reduced by 45 % through the complete removal of a megaplasmid and a chromid (3 Mb of the 6.7 Mb genome was removed) to begin quantifying the level of genetic redundancy within a large bacterial genome. A mutagenesis of the strain with the reduced genome identified a set of transposon insertions precluding growth of this strain on minimal medium. Transfer of these mutations to the wild-type background revealed that 10-15 % of these chromosomal mutations were located within duplicated genes, as they did not prevent growth of cells with the full genome. The functionally redundant genes were involved in a variety of metabolic pathways, including central carbon metabolism, transport, and amino acid biosynthesis. These results indicate that genetic redundancy may be prevalent within large bacterial genomes. Failing to account for redundantly encoded functions in loss-of-function studies will impair our understanding of a broad range of biological processes and limit our ability to use synthetic biology in the construction of designer cell factories.
Collapse
Affiliation(s)
- George C diCenzo
- Department of Biology, McMaster University, 1280 Main St. W., Hamilton, ON, L8S 4K1, Canada
| | | |
Collapse
|
58
|
Challacombe JF, Stubben CJ, Klimko CP, Welkos SL, Kern SJ, Bozue JA, Worsham PL, Cote CK, Wolfe DN. Interrogation of the Burkholderia pseudomallei genome to address differential virulence among isolates. PLoS One 2014; 9:e115951. [PMID: 25536074 PMCID: PMC4275268 DOI: 10.1371/journal.pone.0115951] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2014] [Accepted: 12/01/2014] [Indexed: 11/18/2022] Open
Abstract
Infection by the Gram-negative pathogen Burkholderia pseudomallei results in the disease melioidosis, acquired from the environment in parts of southeast Asia and northern Australia. Clinical symptoms of melioidosis range from acute (fever, pneumonia, septicemia, and localized infection) to chronic (abscesses in various organs and tissues, most commonly occurring in the lungs, liver, spleen, kidney, prostate and skeletal muscle), and persistent infections in humans are difficult to cure. Understanding the basic biology and genomics of B. pseudomallei is imperative for the development of new vaccines and therapeutic interventions. This formidable task is becoming more tractable due to the increasing number of B. pseudomallei genomes that are being sequenced and compared. Here, we compared three B. pseudomallei genomes, from strains MSHR668, K96243 and 1106a, to identify features that might explain why MSHR668 is more virulent than K96243 and 1106a in a mouse model of B. pseudomallei infection. Our analyses focused on metabolic, virulence and regulatory genes that were present in MSHR668 but absent from both K96243 and 1106a. We also noted features present in K96243 and 1106a but absent from MSHR668, and identified genomic differences that may contribute to variations in virulence noted among the three B. pseudomallei isolates. While this work contributes to our understanding of B. pseudomallei genomics, more detailed experiments are necessary to characterize the relevance of specific genomic features to B. pseudomallei metabolism and virulence. Functional analyses of metabolic networks, virulence and regulation shows promise for examining the effects of B. pseudomallei on host cell metabolism and will lay a foundation for future prediction of the virulence of emerging strains. Continued emphasis in this area will be critical for protection against melioidosis, as a better understanding of what constitutes a fully virulent Burkholderia isolate may provide for better diagnostic and medical countermeasure strategies.
Collapse
Affiliation(s)
- Jean F. Challacombe
- Los Alamos National Laboratory, Bioscience Division, Los Alamos, NM, United States of America
- * E-mail:
| | - Chris J. Stubben
- Los Alamos National Laboratory, Bioscience Division, Los Alamos, NM, United States of America
| | - Christopher P. Klimko
- US Army Medical Research Institute of Infectious Diseases, Bacteriology Division, Fort Detrick, MD, United States of America
| | - Susan L. Welkos
- US Army Medical Research Institute of Infectious Diseases, Bacteriology Division, Fort Detrick, MD, United States of America
| | - Steven J. Kern
- US Army Medical Research Institute of Infectious Diseases, Biostatistics Division, Fort Detrick, MD, United States of America
| | - Joel A. Bozue
- US Army Medical Research Institute of Infectious Diseases, Bacteriology Division, Fort Detrick, MD, United States of America
| | - Patricia L. Worsham
- US Army Medical Research Institute of Infectious Diseases, Bacteriology Division, Fort Detrick, MD, United States of America
| | - Christopher K. Cote
- US Army Medical Research Institute of Infectious Diseases, Bacteriology Division, Fort Detrick, MD, United States of America
| | - Daniel N. Wolfe
- Defense Threat Reduction Agency, Chemical and Biological Technologies Department, Fort Belvoir, VA, United States of America
| |
Collapse
|
59
|
Goodhead I, Darby AC. Taking the pseudo out of pseudogenes. Curr Opin Microbiol 2014; 23:102-9. [PMID: 25461580 DOI: 10.1016/j.mib.2014.11.012] [Citation(s) in RCA: 51] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2014] [Revised: 11/11/2014] [Accepted: 11/12/2014] [Indexed: 12/23/2022]
Abstract
Pseudogenes are defined as fragments of once-functional genes that have been silenced by one or more nonsense, frameshift or missense mutations. Despite continuing increases in the speed of sequencing and annotating bacterial genomes, the identification and categorisation of pseudogenes remains problematic. Even when identified, pseudogenes are considered to be rare and tend to be ignored. On the contrary, pseudogenes are surprisingly prevalent and can persist for long evolutionary time periods, representing a record of once-functional genetic characteristics. Most importantly, pseudogenes provide an insight into prokaryotic evolutionary history as a record of phenotypic traits that have been lost. Focusing on the intracellular and symbiotic bacteria in which pseudogenes predominate, this review discusses the importance of identifying pseudogenes to fully understand the abilities of bacteria, and to understand prokaryotes within their evolutionary context.
Collapse
Affiliation(s)
- Ian Goodhead
- Functional and Comparative Genomics, School of Biological Sciences, University of Liverpool, Crown St., Liverpool L69 7ZB, UK
| | - Alistair C Darby
- Functional and Comparative Genomics, School of Biological Sciences, University of Liverpool, Crown St., Liverpool L69 7ZB, UK
| |
Collapse
|
60
|
Genome sequence of Candidatus Riesia pediculischaeffi, endosymbiont of chimpanzee lice, and genomic comparison of recently acquired endosymbionts from human and chimpanzee lice. G3-GENES GENOMES GENETICS 2014; 4:2189-95. [PMID: 25213693 PMCID: PMC4232544 DOI: 10.1534/g3.114.012567] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
The obligate-heritable endosymbionts of insects possess some of the smallest known bacterial genomes. This is likely due to loss of genomic material during symbiosis. The mode and rate of this erosion may change over evolutionary time: faster in newly formed associations and slower in long-established ones. The endosymbionts of human and anthropoid primate lice present a unique opportunity to study genome erosion in newly established (or young) symbionts. This is because we have a detailed phylogenetic history of these endosymbionts with divergence dates for closely related species. This allows for genome evolution to be studied in detail and rates of change to be estimated in a phylogenetic framework. Here, we sequenced the genome of the chimpanzee louse endosymbiont (Candidatus Riesia pediculischaeffi) and compared it with the closely related genome of the human body louse endosymbiont. From this comparison, we found evidence for recent genome erosion leading to gene loss in these endosymbionts. Although gene loss was detected, it was not significantly greater than in older endosymbionts from aphids and ants. Additionally, we searched for genes associated with B-vitamin synthesis in the two louse endosymbiont genomes because these endosymbionts are believed to synthesize essential B vitamins absent in the louse’s diet. All of the expected genes were present, except those involved in thiamin synthesis. We failed to find genes encoding for proteins involved in the biosynthesis of thiamin or any complete exogenous means of salvaging thiamin, suggesting there is an undescribed mechanism for the salvage of thiamin. Finally, genes encoding for the pantothenate de novo biosynthesis pathway were located on a plasmid in both taxa along with a heat shock protein. Movement of these genes onto a plasmid may be functionally and evolutionarily significant, potentially increasing production and guarding against the deleterious effects of mutation. These data add to a growing resource of obligate endosymbiont genomes and to our understanding of the rate and mode of genome erosion in obligate animal-associated bacteria. Ultimately sequencing additional louse p-endosymbiont genomes will provide a model system for studying genome evolution in obligate host associated bacteria.
Collapse
|
61
|
Feng S, Powell SM, Wilson R, Bowman JP. Extensive gene acquisition in the extremely psychrophilic bacterial species Psychroflexus torquis and the link to sea-ice ecosystem specialism. Genome Biol Evol 2014; 6:133-48. [PMID: 24391155 PMCID: PMC3914696 DOI: 10.1093/gbe/evt209] [Citation(s) in RCA: 52] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2023] Open
Abstract
Sea ice is a highly dynamic and productive environment that includes a diverse array of psychrophilic prokaryotic and eukaryotic taxa distinct from the underlying water column. Because sea ice has only been extensive on Earth since the mid-Eocene, it has been hypothesized that bacteria highly adapted to inhabit sea ice have traits that have been acquired through horizontal gene transfer (HGT). Here we compared the genomes of the psychrophilic bacterium Psychroflexus torquis ATCC 700755T, associated with both Antarctic and Arctic sea ice, and its closely related nonpsychrophilic sister species, P. gondwanensis ACAM 44T. Results show that HGT has occurred much more extensively in P. torquis in comparison to P. gondwanensis. Genetic features that can be linked to the psychrophilic and sea ice-specific lifestyle of P. torquis include genes for exopolysaccharide (EPS) and polyunsaturated fatty acid (PUFA) biosynthesis, numerous specific modes of nutrient acquisition, and proteins putatively associated with ice-binding, light-sensing (bacteriophytochromes), and programmed cell death (metacaspases). Proteomic analysis showed that several genes associated with these traits are highly translated, especially those involved with EPS and PUFA production. Because most of the genes relating to the ability of P. torquis to dwell in sea-ice ecosystems occur on genomic islands that are absent in closely related P. gondwanensis, its adaptation to the sea-ice environment appears driven mainly by HGT. The genomic islands are rich in pseudogenes, insertional elements, and addiction modules, suggesting that gene acquisition is being followed by a process of genome reduction potentially indicative of evolving ecosystem specialism.
Collapse
Affiliation(s)
- Shi Feng
- Food Safety Centre, Tasmanian Institute of Agriculture, University of Tasmania, Australia
| | | | | | | |
Collapse
|
62
|
Abstract
Pseudogenes are defined as non-functional relatives of genes whose protein-coding abilities are lost and are no longer expressed within cells. They are an outcome of accumulation of mutations within a gene whose end product is not essential for survival. Proper investigation of the procedure of pseudogenization is relevant for estimating occurrence of duplications in genomes. Frankineae houses an interesting group of microorganisms, carving a niche in the microbial world. This study was undertaken with the objective of determining the abundance of pseudogenes, understanding strength of purifying selection, investigating evidence of pseudogene expression, and analysing their molecular nature, their origin, evolution and deterioration patterns amongst domain families. Investigation revealed the occurrence of 956 core pFAM families sharing common characteristics indicating co-evolution. WD40, Rve_3, DDE_Tnp_IS240 and phage integrase core domains are larger families, having more pseudogenes, signifying a probability of harmful foreign genes being disabled within transposable elements. High selective pressure depicted that gene families rapidly duplicating and evolving undoubtedly facilitated creation of a number of pseudogenes in Frankineae. Codon usage analysis between protein-coding genes and pseudogenes indicated a wide degree of variation with respect to different factors. Moreover, the majority of pseudogenes were under the effect of purifying selection. Frankineae pseudogenes were under stronger selective constraints, indicating that they were functional for a very long time and became pseudogenes abruptly. The origin and deterioration of pseudogenes has been attributed to selection and mutational pressure acting upon sequences for adapting to stressed soil environments.
Collapse
|
63
|
Fernandes GDC, Trarbach LJ, de Campos SB, Beneduzi A, Passaglia LMP. Alternative nitrogenase and pseudogenes: unique features of the Paenibacillus riograndensis nitrogen fixation system. Res Microbiol 2014; 165:571-80. [PMID: 24956360 DOI: 10.1016/j.resmic.2014.06.002] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2014] [Revised: 06/13/2014] [Accepted: 06/14/2014] [Indexed: 10/25/2022]
Abstract
Biological nitrogen fixation (BNF) is a tightly regulated process that is carried out by diazotrophic microorganisms. The regulatory mechanisms of BNF-related genes are well characterized in Gram-negative models, but they are poorly understood in Gram-positive bacteria. Paenibacillus riograndensis SBR5(T) is a Gram-positive, endospore-forming facultative aerobic diazotroph. Three clusters of BNF-related genes with dissimilar phylogenetic histories were identified in the P. riograndensis genome, and no regulatory genes were recognized. P. riograndensis nifH2 was considered inactive based on transcript and promoter analyses, whereas transcripts of nifH1 and anfH were induced upon nitrogen-limited conditions. The functionality of the alternative nitrogenase system was also validated by enzymatic activity analysis. Fragments upstream of the two active clusters seem to harbor primary functional promoter sequences, producing a constitutive expression pattern in Escherichia coli. Sequences upstream of the anf genes were not recognized by this heterologous host, indicating a very distinct promoter pattern. These results shed light upon the evolutionary history of nitrogen fixation genes in this Gram-positive bacterium and highlight the presence of novel regulatory elements that are yet to be described.
Collapse
Affiliation(s)
- Gabriela de C Fernandes
- Departamento de Genética, Instituto de Biociências, Universidade Federal do Rio Grande do Sul. Av. Bento Gonçalves, 9500, Caixa Postal 15.053, 91501-970 Porto Alegre, RS, Brazil.
| | - Laura J Trarbach
- Departamento de Genética, Instituto de Biociências, Universidade Federal do Rio Grande do Sul. Av. Bento Gonçalves, 9500, Caixa Postal 15.053, 91501-970 Porto Alegre, RS, Brazil.
| | - Samanta B de Campos
- Department of Genetics of Prokaryotes, Bielefeld University, Universitätsstraße 25, 33594 Bielefeld, Germany.
| | - Anelise Beneduzi
- Fundação Estadual de Pesquisa Agropecuária (FEPAGRO), Rua Gonçalves Dias 570, 90130-060, Porto Alegre, RS, Brazil.
| | - Luciane M P Passaglia
- Departamento de Genética, Instituto de Biociências, Universidade Federal do Rio Grande do Sul. Av. Bento Gonçalves, 9500, Caixa Postal 15.053, 91501-970 Porto Alegre, RS, Brazil.
| |
Collapse
|
64
|
Abstract
The number of complete genome sequences explodes more and more with each passing year. Thus, methods for genome annotation need to be honed constantly to handle the deluge of information. Annotation of pseudogenes (i.e., gene copies that appear not to make a functional protein) in genomes is a persistent problem; here, we overview pseudogene annotation methods that are based on the detection of sequence homology in genomic DNA.
Collapse
Affiliation(s)
- Paul M Harrison
- Department of Biology, McGill University, Stewart Biology Building, 1205 Doctor Penfield Avenue, Montreal, QC, Canada, H3A 1B1,
| |
Collapse
|
65
|
Olson AB, Kent H, Sibley CD, Grinwis ME, Mabon P, Ouellette C, Tyson S, Graham M, Tyler SD, Van Domselaar G, Surette MG, Corbett CR. Phylogenetic relationship and virulence inference of Streptococcus Anginosus Group: curated annotation and whole-genome comparative analysis support distinct species designation. BMC Genomics 2013; 14:895. [PMID: 24341328 PMCID: PMC3897883 DOI: 10.1186/1471-2164-14-895] [Citation(s) in RCA: 45] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2013] [Accepted: 12/09/2013] [Indexed: 12/30/2022] Open
Abstract
Background The Streptococcus Anginosus Group (SAG) represents three closely related species of the viridans group streptococci recognized as commensal bacteria of the oral, gastrointestinal and urogenital tracts. The SAG also cause severe invasive infections, and are pathogens during cystic fibrosis (CF) pulmonary exacerbation. Little genomic information or description of virulence mechanisms is currently available for SAG. We conducted intra and inter species whole-genome comparative analyses with 59 publically available Streptococcus genomes and seven in-house closed high quality finished SAG genomes; S. constellatus (3), S. intermedius (2), and S. anginosus (2). For each SAG species, we sequenced at least one numerically dominant strain from CF airways recovered during acute exacerbation and an invasive, non-lung isolate. We also evaluated microevolution that occurred within two isolates that were cultured from one individual one year apart. Results The SAG genomes were most closely related to S. gordonii and S. sanguinis, based on shared orthologs and harbor a similar number of proteins within each COG category as other Streptococcus species. Numerous characterized streptococcus virulence factor homologs were identified within the SAG genomes including; adherence, invasion, spreading factors, LPxTG cell wall proteins, and two component histidine kinases known to be involved in virulence gene regulation. Mobile elements, primarily integrative conjugative elements and bacteriophage, account for greater than 10% of the SAG genomes. S. anginosus was the most variable species sequenced in this study, yielding both the smallest and the largest SAG genomes containing multiple genomic rearrangements, insertions and deletions. In contrast, within the S. constellatus and S. intermedius species, there was extensive continuous synteny, with only slight differences in genome size between strains. Within S. constellatus we were able to determine important SNPs and changes in VNTR numbers that occurred over the course of one year. Conclusions The comparative genomic analysis of the SAG clarifies the phylogenetics of these bacteria and supports the distinct species classification. Numerous potential virulence determinants were identified and provide a foundation for further studies into SAG pathogenesis. Furthermore, the data may be used to enable the development of rapid diagnostic assays and therapeutics for these pathogens.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | | | | | | | - Michael G Surette
- National Microbiology Laboratory, Public Health Agency of Canada, Winnipeg, MB, Canada.
| | | |
Collapse
|
66
|
Bacterial endosymbiosis in a chordate host: long-term co-evolution and conservation of secondary metabolism. PLoS One 2013; 8:e80822. [PMID: 24324632 PMCID: PMC3851785 DOI: 10.1371/journal.pone.0080822] [Citation(s) in RCA: 48] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2013] [Accepted: 10/16/2013] [Indexed: 11/19/2022] Open
Abstract
Intracellular symbiosis is known to be widespread in insects, but there are few described examples in other types of host. These symbionts carry out useful activities such as synthesizing nutrients and conferring resistance against adverse events such as parasitism. Such symbionts persist through host speciation events, being passed down through vertical transmission. Due to various evolutionary forces, symbionts go through a process of genome reduction, eventually resulting in tiny genomes where only those genes essential to immediate survival and those beneficial to the host remain. In the marine environment, invertebrates such as tunicates are known to harbor complex microbiomes implicated in the production of natural products that are toxic and probably serve a defensive function. Here, we show that the intracellular symbiont Candidatus Endolissoclinum faulkneri is a long-standing symbiont of the tunicate Lissoclinum patella, that has persisted through cryptic speciation of the host. In contrast to the known examples of insect symbionts, which tend to be either relatively recent or ancient relationships, the genome of Ca. E. faulkneri has a very low coding density but very few recognizable pseudogenes. The almost complete degradation of intergenic regions and stable gene inventory of extant strains of Ca. E. faulkneri show that further degradation and deletion is happening very slowly. This is a novel stage of genome reduction and provides insight into how tiny genomes are formed. The ptz pathway, which produces the defensive patellazoles, is shown to date to before the divergence of Ca. E. faulkneri strains, reinforcing its importance in this symbiotic relationship. Lastly, as in insects we show that stable symbionts can be lost, as we describe an L. patella animal where Ca. E. faulkneri is displaced by a likely intracellular pathogen. Our results suggest that intracellular symbionts may be an important source of ecologically significant natural products in animals.
Collapse
|
67
|
Demidenok OI, Goncharenko AV. Bacterial toxin-antitoxin systems and perspectives for their application in medicine. APPL BIOCHEM MICRO+ 2013. [DOI: 10.1134/s0003683813060070] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
68
|
Pang S, Octavia S, Feng L, Liu B, Reeves PR, Lan R, Wang L. Genomic diversity and adaptation of Salmonella enterica serovar Typhimurium from analysis of six genomes of different phage types. BMC Genomics 2013; 14:718. [PMID: 24138507 PMCID: PMC3853940 DOI: 10.1186/1471-2164-14-718] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2012] [Accepted: 10/11/2013] [Indexed: 12/28/2022] Open
Abstract
Background Salmonella enterica serovar Typhimurium (or simply Typhimurium) is the most common serovar in both human infections and farm animals in Australia and many other countries. Typhimurium is a broad host range serovar but has also evolved into host-adapted variants (i.e. isolated from a particular host such as pigeons). Six Typhimurium strains of different phage types (defined by patterns of susceptibility to lysis by a set of bacteriophages) were analysed using Illumina high-throughput genome sequencing. Results Variations between strains were mainly due to single nucleotide polymorphisms (SNPs) with an average of 611 SNPs per strain, ranging from 391 SNPs to 922 SNPs. There were seven insertions/deletions (indels) involving whole or partial gene deletions, four inactivation events due to IS200 insertion and 15 pseudogenes due to early termination. Four of these inactivated or deleted genes may be virulence related. Nine prophage or prophage remnants were identified in the six strains. Gifsy-1, Gifsy-2 and the sopE2 and sspH2 phage remnants were present in all six genomes while Fels-1, Fels-2, ST64B, ST104 and CP4-57 were variably present. Four strains carried the 90-kb plasmid pSLT which contains several known virulence genes. However, two strains were found to lack the plasmid. In addition, one strain had a novel plasmid similar to Typhi strain CT18 plasmid pHCM2. Conclusion The genome data suggest that variations between strains were mainly due to accumulation of SNPs, some of which resulted in gene inactivation. Unique genetic elements that were common between host-adapted phage types were not found. This study advanced our understanding on the evolution and adaptation of Typhimurium at genomic level.
Collapse
Affiliation(s)
| | | | | | | | | | - Ruiting Lan
- School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, New South Wales 2052, Australia.
| | | |
Collapse
|
69
|
Spoor LE, McAdam PR, Weinert LA, Rambaut A, Hasman H, Aarestrup FM, Kearns AM, Larsen AR, Skov RL, Fitzgerald JR. Livestock origin for a human pandemic clone of community-associated methicillin-resistant Staphylococcus aureus. mBio 2013; 4:e00356-13. [PMID: 23943757 PMCID: PMC3747577 DOI: 10.1128/mbio.00356-13%0a] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2013] [Accepted: 06/14/2013] [Indexed: 11/19/2023] Open
Abstract
UNLABELLED The importance of livestock as a source of bacterial pathogens with the potential for epidemic spread in human populations is unclear. In recent years, there has been a global increase in community-associated methicillin-resistant Staphylococcus aureus (CA-MRSA) infections of healthy humans, but an understanding of the different evolutionary origins of CA-MRSA clones and the basis for their recent expansion is lacking. Here, using a high-resolution phylogenetic approach, we report the discovery of two emergent clones of human epidemic CA-MRSA which resulted from independent livestock-to-human host jumps by the major bovine S. aureus complex, CC97. Of note, one of the new clones was isolated from human infections on four continents, demonstrating its global dissemination since the host jump occurred over 40 years ago. The emergence of both human S. aureus clones coincided with the independent acquisition of mobile genetic elements encoding antimicrobial resistance and human-specific mediators of immune evasion, consistent with an important role for these genetic events in the capacity to survive and transmit among human populations. In conclusion, we provide evidence that livestock represent a reservoir for the emergence of new human-pathogenic S. aureus clones with the capacity for pandemic spread. These findings have major public health implications highlighting the importance of surveillance for early identification of emergent clones and improved transmission control measures at the human-livestock interface. IMPORTANCE Animals are the major source of new pathogens affecting humans. However, the potential for pathogenic bacteria that originally were found in animals to switch hosts and become widely established in human populations is not clear. Here, we report the discovery of emergent clones of methicillin-resistant Staphylococcus aureus (MRSA) that originated in livestock and switched to humans, followed by host-adaptive evolution and epidemic spread in global human populations. Our findings demonstrate that livestock can act as a reservoir for the emergence of new human bacterial clones with potential for pandemic spread, highlighting the potential role of surveillance and biosecurity measures in the agricultural setting for preventing the emergence of new human pathogens.
Collapse
Affiliation(s)
- Laura E. Spoor
- The Roslin Institute and Edinburgh Infectious Diseases, University of Edinburgh, Easter Bush, Midlothian, United Kingdom
| | - Paul R. McAdam
- The Roslin Institute and Edinburgh Infectious Diseases, University of Edinburgh, Easter Bush, Midlothian, United Kingdom
| | - Lucy A. Weinert
- University of Cambridge, Department of Veterinary Medicine, Cambridge, United Kingdom
| | - Andrew Rambaut
- Institute of Evolutionary Biology, Ashworth Laboratories, University of Edinburgh, Edinburgh, United Kingdom
| | - Henrik Hasman
- National Food Institute, Technical University of Denmark (DTU), Lyngby, Denmark
| | - Frank M. Aarestrup
- National Food Institute, Technical University of Denmark (DTU), Lyngby, Denmark
| | - Angela M. Kearns
- Microbiology Services, Colindale, Health Protection Agency, London, United Kingdom
| | - Anders R. Larsen
- Department of Antimicrobial Surveillance and Research, Statens Serum Institute, Copenhagen, Denmark
| | - Robert L. Skov
- Department of Antimicrobial Surveillance and Research, Statens Serum Institute, Copenhagen, Denmark
| | - J. Ross Fitzgerald
- The Roslin Institute and Edinburgh Infectious Diseases, University of Edinburgh, Easter Bush, Midlothian, United Kingdom
| |
Collapse
|
70
|
Livestock origin for a human pandemic clone of community-associated methicillin-resistant Staphylococcus aureus. mBio 2013; 4:mBio.00356-13. [PMID: 23943757 PMCID: PMC3747577 DOI: 10.1128/mbio.00356-13] [Citation(s) in RCA: 152] [Impact Index Per Article: 13.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/01/2022] Open
Abstract
The importance of livestock as a source of bacterial pathogens with the potential for epidemic spread in human populations is unclear. In recent years, there has been a global increase in community-associated methicillin-resistant Staphylococcus aureus (CA-MRSA) infections of healthy humans, but an understanding of the different evolutionary origins of CA-MRSA clones and the basis for their recent expansion is lacking. Here, using a high-resolution phylogenetic approach, we report the discovery of two emergent clones of human epidemic CA-MRSA which resulted from independent livestock-to-human host jumps by the major bovine S. aureus complex, CC97. Of note, one of the new clones was isolated from human infections on four continents, demonstrating its global dissemination since the host jump occurred over 40 years ago. The emergence of both human S. aureus clones coincided with the independent acquisition of mobile genetic elements encoding antimicrobial resistance and human-specific mediators of immune evasion, consistent with an important role for these genetic events in the capacity to survive and transmit among human populations. In conclusion, we provide evidence that livestock represent a reservoir for the emergence of new human-pathogenic S. aureus clones with the capacity for pandemic spread. These findings have major public health implications highlighting the importance of surveillance for early identification of emergent clones and improved transmission control measures at the human-livestock interface. Animals are the major source of new pathogens affecting humans. However, the potential for pathogenic bacteria that originally were found in animals to switch hosts and become widely established in human populations is not clear. Here, we report the discovery of emergent clones of methicillin-resistant Staphylococcus aureus (MRSA) that originated in livestock and switched to humans, followed by host-adaptive evolution and epidemic spread in global human populations. Our findings demonstrate that livestock can act as a reservoir for the emergence of new human bacterial clones with potential for pandemic spread, highlighting the potential role of surveillance and biosecurity measures in the agricultural setting for preventing the emergence of new human pathogens.
Collapse
|
71
|
Hottes AK, Freddolino PL, Khare A, Donnell ZN, Liu JC, Tavazoie S. Bacterial adaptation through loss of function. PLoS Genet 2013; 9:e1003617. [PMID: 23874220 PMCID: PMC3708842 DOI: 10.1371/journal.pgen.1003617] [Citation(s) in RCA: 153] [Impact Index Per Article: 13.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2013] [Accepted: 05/23/2013] [Indexed: 11/18/2022] Open
Abstract
The metabolic capabilities and regulatory networks of bacteria have been optimized by evolution in response to selective pressures present in each species' native ecological niche. In a new environment, however, the same bacteria may grow poorly due to regulatory constraints or biochemical deficiencies. Adaptation to such conditions can proceed through the acquisition of new cellular functionality due to gain of function mutations or via modulation of cellular networks. Using selection experiments on transposon-mutagenized libraries of bacteria, we illustrate that even under conditions of extreme nutrient limitation, substantial adaptation can be achieved solely through loss of function mutations, which rewire the metabolism of the cell without gain of enzymatic or sensory function. A systematic analysis of similar experiments under more than 100 conditions reveals that adaptive loss of function mutations exist for many environmental challenges. Drawing on a wealth of examples from published articles, we detail the range of mechanisms through which loss-of-function mutations can generate such beneficial regulatory changes, without the need for rare, specific mutations to fine-tune enzymatic activities or network connections. The high rate at which loss-of-function mutations occur suggests that null mutations play an underappreciated role in the early stages of adaption of bacterial populations to new environments.
Collapse
Affiliation(s)
- Alison K. Hottes
- Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, New Jersey, United States of America
| | - Peter L. Freddolino
- Joint Centers for Systems Biology, Columbia University, New York, New York, United States of America
| | - Anupama Khare
- Joint Centers for Systems Biology, Columbia University, New York, New York, United States of America
| | - Zachary N. Donnell
- Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, New Jersey, United States of America
| | - Julia C. Liu
- Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, New Jersey, United States of America
| | - Saeed Tavazoie
- Joint Centers for Systems Biology, Columbia University, New York, New York, United States of America
- Department of Biochemistry and Molecular Biology, Columbia University, New York, New York, United States of America
- * E-mail:
| |
Collapse
|
72
|
Ji C, Huang A, Liu W, Pan G, Wang G. Identification and bioinformatics analysis of pseudogenes from whole genome sequence of Phaeodactylum tricornutum. CHINESE SCIENCE BULLETIN-CHINESE 2013. [DOI: 10.1007/s11434-012-5174-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
|
73
|
Brito PH, Rocha EPC, Xavier KB, Gordo I. Natural genome diversity of AI-2 quorum sensing in Escherichia coli: conserved signal production but labile signal reception. Genome Biol Evol 2013; 5:16-30. [PMID: 23246794 PMCID: PMC3595036 DOI: 10.1093/gbe/evs122] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
Abstract
Quorum sensing (QS) regulates the onset of bacterial social responses in function to cell density having an important impact in virulence. Autoinducer-2 (AI-2) is a signal that has the peculiarity of mediating both intra- and interspecies bacterial QS. We analyzed the diversity of all components of AI-2 QS across 44 complete genomes of Escherichia coli and Shigella strains. We used phylogenetic tools to study its evolution and determined the phenotypes of single-deletion mutants to predict phenotypes of natural strains. Our analysis revealed many likely adaptive polymorphisms both in gene content and in nucleotide sequence. We show that all natural strains possess the signal emitter (the luxS gene), but many lack a functional signal receptor (complete lsr operon) and the ability to regulate extracellular signal concentrations. This result is in striking contrast with the canonical species-specific QS systems where one often finds orphan receptors, without a cognate synthase, but not orphan emitters. Our analysis indicates that selection actively maintains a balanced polymorphism for the presence/absence of a functional lsr operon suggesting diversifying selection on the regulation of signal accumulation and recognition. These results can be explained either by niche-specific adaptation or by selection for a coercive behavior where signal-blind emitters benefit from forcing other individuals in the population to haste in cooperative behaviors.
Collapse
|
74
|
Jimenez-Lopez JC, Gachomo EW, Sharma S, Kotchoni SO. Genome sequencing and next-generation sequence data analysis: A comprehensive compilation of bioinformatics tools and databases. ACTA ACUST UNITED AC 2013. [DOI: 10.4236/ajmb.2013.32016] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
|
75
|
Factors behind junk DNA in bacteria. Genes (Basel) 2012; 3:634-50. [PMID: 24705080 PMCID: PMC3899985 DOI: 10.3390/genes3040634] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2012] [Revised: 09/11/2012] [Accepted: 09/25/2012] [Indexed: 11/16/2022] Open
Abstract
Although bacterial genomes have been traditionally viewed as being very compact, with relatively low amounts of repetitive and non-coding DNA, this view has dramatically changed in recent years. The increase of available complete bacterial genomes has revealed that many species present abundant repetitive DNA (i.e., insertion sequences, prophages or paralogous genes) and that many of these sequences are not functional but can have evolutionary consequences as concerns the adaptation to specialized host-related ecological niches. Comparative genomics analyses with close relatives that live in non-specialized environments reveal the nature and fate of this bacterial junk DNA. In addition, the number of insertion sequences and pseudogenes, as well as the size of the intergenic regions, can be used as markers of the evolutionary stage of a genome.
Collapse
|
76
|
Turner CE, Sommerlad M, McGregor K, Davies FJ, Pichon B, Chong DLW, Farzaneh L, Holden MTG, Spratt BG, Efstratiou A, Sriskandan S. Superantigenic activity of emm3 Streptococcus pyogenes is abrogated by a conserved, naturally occurring smeZ mutation. PLoS One 2012; 7:e46376. [PMID: 23049698 PMCID: PMC3462185 DOI: 10.1371/journal.pone.0046376] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2012] [Accepted: 08/29/2012] [Indexed: 11/18/2022] Open
Abstract
Streptococcus pyogenes M/emm3 strains have been epidemiologically linked with enhanced infection severity and risk of streptococcal toxic shock syndrome (STSS), a syndrome triggered by superantigenic stimulation of T cells. Comparison of S. pyogenes strains causing STSS demonstrated that emm3 strains were surprisingly less mitogenic than other emm-types (emm1, emm12, emm18, emm28, emm87, emm89) both in vitro and in vivo, indicating poor superantigenic activity. We identified a 13 bp deletion in the superantigen smeZ gene of all emm3 strains tested. The deletion led to a premature stop codon in smeZ, and was not present in other major emm-types tested. Expression of a functional non-M3-smeZ gene successfully enhanced mitogenic activity in emm3 S. pyogenes and also restored mitogenic activity to emm1 and emm89 S. pyogenes strains where the smeZ gene had been disrupted. In contrast, the M3-smeZ gene with the 13 bp deletion could not enhance or restore mitogenicity in any of these S. pyogenes strains, confirming that M3-smeZ is non-functional regardless of strain background. The mutation in M3-smeZ reduced the potential for M3 S. pyogenes to induce cytokines in human tonsil, but not during invasive infection of superantigen-sensitive mice. Notwithstanding epidemiological associations with STSS and disease severity, emm3 strains have inherently poor superantigenicity that is explained by a conserved mutation in smeZ.
Collapse
Affiliation(s)
- Claire E. Turner
- Department of Infectious Diseases & Immunity, Imperial College London, London, United Kingdom
| | - Mary Sommerlad
- Department of Infectious Diseases & Immunity, Imperial College London, London, United Kingdom
| | - Karen McGregor
- Department of Infectious Disease Epidemiology, Imperial College London, London, United Kingdom
| | - Frances J. Davies
- Department of Infectious Diseases & Immunity, Imperial College London, London, United Kingdom
| | - Bruno Pichon
- Respiratory and Systemic Infection Laboratory, The Health Protection Agency Centre for Infections, London, United Kingdom
| | - Deborah L. W. Chong
- Department of Infectious Diseases & Immunity, Imperial College London, London, United Kingdom
| | - Leili Farzaneh
- Department of Infectious Diseases & Immunity, Imperial College London, London, United Kingdom
| | - Matthew T. G. Holden
- Pathogen Genomics, The Wellcome Trust Sanger Institute, Cambridge, United Kingdom
| | - Brian G. Spratt
- Department of Infectious Disease Epidemiology, Imperial College London, London, United Kingdom
| | - Androulla Efstratiou
- Respiratory and Systemic Infection Laboratory, The Health Protection Agency Centre for Infections, London, United Kingdom
| | - Shiranee Sriskandan
- Department of Infectious Diseases & Immunity, Imperial College London, London, United Kingdom
- * E-mail:
| |
Collapse
|
77
|
Zheng J, Liu L, Wei C, Leng W, Yang J, Li W, Wang J, Jin Q. A comprehensive proteomic analysis of Mycobacterium bovis bacillus Calmette-Guérin using high resolution Fourier transform mass spectrometry. J Proteomics 2012; 77:357-71. [PMID: 23000594 DOI: 10.1016/j.jprot.2012.09.010] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2012] [Revised: 08/13/2012] [Accepted: 09/08/2012] [Indexed: 12/14/2022]
Abstract
Since 1921, Mycobacterium bovis bacillus Calmette-Guérin (BCG) has been recognized as an important vaccine to prevent tuberculosis worldwide. Nonetheless, a global analysis of BCG proteome has not been clearly investigated. In this study, we performed an in-depth proteomic analysis of BCG under an in vitro cultivation condition using SDS-PAGE and high resolution Fourier transform mass spectrometry. In total, 3434 proteins (35,259 unique peptides) including 512 transmembrane proteins were identified, covering ~87% of the predicted BCG proteome. Seven pseudogene protein products were also obtained and validated by RT-PCR at gene transcript level. Additionally, translational start sites of 832 proteins were confirmed and 186 were extended using N-terminus-derived peptides. The physicochemical characteristics of all identified proteins were determined. Some predominant proteins, including PE and PPE family proteins, lipoproteins, heat shock proteins, transport proteins and low molecular weight protein antigens, are discussed, which represent potential prominent antigens in the humoral and cellular immune response. This study represents the most comprehensive BCG proteome to date, which will likely facilitate the design of vaccination and immunodiagnostic strategies against TB.
Collapse
Affiliation(s)
- Jianhua Zheng
- MOH Key Laboratory of Systems Biology of Pathogens, Institute of Pathogen Biology, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, PR China
| | | | | | | | | | | | | | | |
Collapse
|
78
|
Fielder D, Vernes K, Alacs E, Georges A. Mitochondrial variation among Australian freshwater turtles (genus Myuchelys), with special reference to the Endangered M. bellii. ENDANGER SPECIES RES 2012. [DOI: 10.3354/esr00417] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
|
79
|
Schrimpe-Rutledge AC, Jones MB, Chauhan S, Purvine SO, Sanford JA, Monroe ME, Brewer HM, Payne SH, Ansong C, Frank BC, Smith RD, Peterson SN, Motin VL, Adkins JN. Comparative omics-driven genome annotation refinement: application across Yersiniae. PLoS One 2012; 7:e33903. [PMID: 22479471 PMCID: PMC3313959 DOI: 10.1371/journal.pone.0033903] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2011] [Accepted: 02/19/2012] [Indexed: 02/03/2023] Open
Abstract
Genome sequencing continues to be a rapidly evolving technology, yet most downstream aspects of genome annotation pipelines remain relatively stable or are even being abandoned. The annotation process is now performed almost exclusively in an automated fashion to balance the large number of sequences generated. One possible way of reducing errors inherent to automated computational annotations is to apply data from omics measurements (i.e. transcriptional and proteomic) to the un-annotated genome with a proteogenomic-based approach. Here, the concept of annotation refinement has been extended to include a comparative assessment of genomes across closely related species. Transcriptomic and proteomic data derived from highly similar pathogenic Yersiniae (Y. pestis CO92, Y. pestis Pestoides F, and Y. pseudotuberculosis PB1/+) was used to demonstrate a comprehensive comparative omic-based annotation methodology. Peptide and oligo measurements experimentally validated the expression of nearly 40% of each strain's predicted proteome and revealed the identification of 28 novel and 68 incorrect (i.e., observed frameshifts, extended start sites, and translated pseudogenes) protein-coding sequences within the three current genome annotations. Gene loss is presumed to play a major role in Y. pestis acquiring its niche as a virulent pathogen, thus the discovery of many translated pseudogenes, including the insertion-ablated argD, underscores a need for functional analyses to investigate hypotheses related to divergence. Refinements included the discovery of a seemingly essential ribosomal protein, several virulence-associated factors, a transcriptional regulator, and many hypothetical proteins that were missed during annotation.
Collapse
Affiliation(s)
| | - Marcus B. Jones
- J. Craig Venter Institute, Rockville, Maryland, United States of America
| | - Sadhana Chauhan
- University of Texas Medical Branch, Galveston, Texas, United States of America
| | - Samuel O. Purvine
- Environmental Molecular Sciences Laboratory, Pacific Northwest National Laboratory, Richland, Washington, United States of America
| | - James A. Sanford
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington, United States of America
| | - Matthew E. Monroe
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington, United States of America
| | - Heather M. Brewer
- Environmental Molecular Sciences Laboratory, Pacific Northwest National Laboratory, Richland, Washington, United States of America
| | - Samuel H. Payne
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington, United States of America
| | - Charles Ansong
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington, United States of America
| | - Bryan C. Frank
- J. Craig Venter Institute, Rockville, Maryland, United States of America
| | - Richard D. Smith
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington, United States of America
| | - Scott N. Peterson
- J. Craig Venter Institute, Rockville, Maryland, United States of America
| | - Vladimir L. Motin
- University of Texas Medical Branch, Galveston, Texas, United States of America
| | - Joshua N. Adkins
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington, United States of America
- * E-mail:
| |
Collapse
|
80
|
Suntsov VV. Origin of the plague microbe Yersinia pestis: Structure of the process of speciation. BIOL BULL+ 2012. [DOI: 10.1134/s1062359012010104] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
81
|
Youm J, Saier MH. Comparative analyses of transport proteins encoded within the genomes of Mycobacterium tuberculosis and Mycobacterium leprae. BIOCHIMICA ET BIOPHYSICA ACTA-BIOMEMBRANES 2011; 1818:776-97. [PMID: 22179038 DOI: 10.1016/j.bbamem.2011.11.015] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/08/2011] [Revised: 11/18/2011] [Accepted: 11/18/2011] [Indexed: 10/15/2022]
Abstract
The co-emergence of multidrug resistant pathogenic bacterial strains and the Human Immunodeficiency Virus pandemic has made tuberculosis a leading public health threat. The causative agent is Mycobacterium tuberculosis (Mtu), a facultative intracellular parasite. Mycobacterium leprae (Mle), a related organism that causes leprosy, is an obligate intracellular parasite. Given that different transporters are required for bacterial growth and persistence under a variety of growth conditions, we conducted comparative analyses of transport proteins encoded within the genomes of these two organisms. A minimal set of genes required for intracellular and extracellular life was identified. Drug efflux systems utilizing primary active transport mechanisms have been preferentially retained in Mle and still others preferentially lost. Transporters associated with environmental adaptation found in Mtu were mostly lost in Mle. These findings provide starting points for experimental studies that may elucidate the dependencies of pathogenesis on transport for these two pathogenic mycobacteria. They also lead to suggestions regarding transporters that function in intra- versus extra-cellular growth.
Collapse
Affiliation(s)
- Jiwon Youm
- University of California, La Jolla, CA, USA
| | | |
Collapse
|
82
|
Zhao L, Liu L, Leng W, Wei C, Jin Q. A proteogenomic analysis of Shigella flexneri using 2D LC-MALDI TOF/TOF. BMC Genomics 2011; 12:528. [PMID: 22032405 PMCID: PMC3219829 DOI: 10.1186/1471-2164-12-528] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2011] [Accepted: 10/28/2011] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND New strategies for high-throughput sequencing are constantly appearing, leading to a great increase in the number of completely sequenced genomes. Unfortunately, computational genome annotation is out of step with this progress. Thus, the accurate annotation of these genomes has become a bottleneck of knowledge acquisition. RESULTS We exploited a proteogenomic approach to improve conventional genome annotation by integrating proteomic data with genomic information. Using Shigella flexneri 2a as a model, we identified total 823 proteins, including 187 hypothetical proteins. Among them, three annotated ORFs were extended upstream through comprehensive analysis against an in-house N-terminal extension database. Two genes, which could not be translated to their full length because of stop codon 'mutations' induced by genome sequencing errors, were revised and annotated as fully functional genes. Above all, seven new ORFs were discovered, which were not predicted in S. flexneri 2a str.301 by any other annotation approaches. The transcripts of four novel ORFs were confirmed by RT-PCR assay. Additionally, most of these novel ORFs were overlapping genes, some even nested within the coding region of other known genes. CONCLUSIONS Our findings demonstrate that current Shigella genome annotation methods are not perfect and need to be improved. Apart from the validation of predicted genes at the protein level, the additional features of proteogenomic tools include revision of annotation errors and discovery of novel ORFs. The complementary dataset could provide more targets for those interested in Shigella to perform functional studies.
Collapse
Affiliation(s)
- Lina Zhao
- State Key Laboratory for Molecular Virology and Genetic Engineering, Institute of Pathogen Biology, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, PR China
| | | | | | | | | |
Collapse
|
83
|
Wang Y, Li X, Mao Y, Blaschek HP. Single-nucleotide resolution analysis of the transcriptome structure of Clostridium beijerinckii NCIMB 8052 using RNA-Seq. BMC Genomics 2011; 12:479. [PMID: 21962126 PMCID: PMC3271303 DOI: 10.1186/1471-2164-12-479] [Citation(s) in RCA: 62] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2011] [Accepted: 09/30/2011] [Indexed: 12/02/2022] Open
Abstract
Background Clostridium beijerinckii is an important solvent producing microorganism. The genome of C. beijerinckii NCIMB 8052 has recently been sequenced. Although transcriptome structure is important in order to reveal the functional and regulatory architecture of the genome, the physical structure of transcriptome for this strain, such as the operon linkages and transcript boundaries are not well understood. Results In this study, we conducted a single-nucleotide resolution analysis of the C. beijerinckii NCIMB 8052 transcriptome using high-throughput RNA-Seq technology. We identified the transcription start sites and operon structure throughout the genome. We confirmed the structure of important gene operons involved in metabolic pathways for acid and solvent production in C. beijerinckii 8052, including pta-ack, ptb-buk, hbd-etfA-etfB-crt (bcs) and ald-ctfA-ctfB-adc (sol) operons; we also defined important operons related to chemotaxis/motility, transcriptional regulation, stress response and fatty acids biosynthesis along with others. We discovered 20 previously non-annotated regions with significant transcriptional activities and 15 genes whose translation start codons were likely mis-annotated. As a consequence, the accuracy of existing genome annotation was significantly enhanced. Furthermore, we identified 78 putative silent genes and 177 putative housekeeping genes based on normalized transcription measurement with the sequence data. We also observed that more than 30% of pseudogenes had significant transcriptional activities during the fermentation process. Strong correlations exist between the expression values derived from RNA-Seq analysis and microarray data or qRT-PCR results. Conclusions Transcriptome structural profiling in this research provided important supplemental information on the accuracy of genome annotation, and revealed additional gene functions and regulation in C. beijerinckii.
Collapse
Affiliation(s)
- Yi Wang
- Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
| | | | | | | |
Collapse
|
84
|
Abstract
Galactose is important for the survival and virulence of bacteria. In Escherichia coli, galactose is utilized by the Leloir pathway, which is controlled by a complex network. To shed light on the potential functions the galactose network could perform, we performed bioinformatical analysis of reference genome sequences belonging to the Enterobacteriaceae family. We found that several genomes have reduced numbers of components compared to the E. coli galactose system, suggesting that the network can be optimized for different environments. Typically, genes are removed by deletions; however, in Yersinia pestis, the galactose mutarotase (galM) gene is inactivated by a single-base-pair deletion. Lack of GalM activity indicates that the two anomers of d-galactose are used for different purposes, α-d-galactose as a carbon source and β-d-galactose for induction of UDP-galactose synthesis for biosynthetic glycosylation. We demonstrate that activity of the galM gene can be restored by different single-base-pair insertions. During the evolution of Y. pestis to become a vector-transmitted systemic pathogen, many genes were converted to pseudogenes. It is not clear whether pseudogenes are present to maintain meiotrophism or are in the process of elimination. Our results suggest that the galM pseudogene has not been deleted because its reactivation may be beneficial in certain environments. Evolution of bacteria to populate a new environment necessarily involves reengineering of their molecular network. Members of the Enterobacteriaceae family of bacteria have diverse lifestyles and can function in a wide range of environments. In this study we performed bioinformatical analysis of 34 reference genome sequences belonging to the Enterobacteriaceae family to gain insight into the natural diversity of the d-galactose utilization network. Our bioinformatical analysis shows that in several species, some genes of the network are completely missing or are inactivated by large deletions. The only exception is the galactose mutarotase (galM) gene of Yersinia pestis, which is converted to a pseudogene by a single-base-pair deletion. In this paper, we discuss the possible consequences of galM inactivation on network function. We suggest that galM was converted to a pseudogene rather than being deleted in evolution because its reactivation can be beneficial in certain environments.
Collapse
|
85
|
Strong PCR, Hinchliffe SJ, Patrick H, Atkinson S, Champion OL, Wren BW. Identification and characterisation of a novel adhesin Ifp in Yersinia pseudotuberculosis. BMC Microbiol 2011; 11:85. [PMID: 21527009 PMCID: PMC3102037 DOI: 10.1186/1471-2180-11-85] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2011] [Accepted: 04/28/2011] [Indexed: 11/23/2022] Open
Abstract
Background In order to identify new virulence determinants in Y. pseudotuberculosis a comparison between its genome and that of Yersinia pestis was undertaken. This reveals dozens of pseudogenes in Y. pestis, which are still putatively functional in Y. pseudotuberculosis and may be important in the enteric lifestyle. One such gene, YPTB1572 in the Y. pseudotuberculosis IP32953 genome sequence, encodes a protein with similarity to invasin, a classic adhesion/invasion protein, and to intimin, the attaching and effacing protein from enteropathogenic (EPEC) and enterohaemorraghic (EHEC) Escherichia coli. Results We termed YPTB1572 Ifp (Intimin family protein) and show that it is able to bind directly to human HEp-2 epithelial cells. Cysteine and tryptophan residues in the C-terminal region of intimin that are essential for function in EPEC and EHEC are conserved in Ifp. Protein binding occurred at distinct foci on the HEp-2 cell surface and can be disrupted by mutation of a single cysteine residue at the C-terminus of the protein. Temporal expression analysis using lux reporter constructs revealed that ifp is expressed at late log phase at 37°C in contrast to invasin, suggesting that Ifp is a late stage adhesin. An ifp defined mutant showed a reduction in adhesion to HEp-2 cells and was attenuated in the Galleria mellonella infection model. Conclusion A new Y. pseudotuberculosis adhesin has been identified and characterised. This Ifp is a new member in the family of invasin/intimin outer membrane adhesins.
Collapse
Affiliation(s)
- Philippa C R Strong
- Pathogen Molecular Biology Unit, Department of Infectious and Tropical Diseases, London School of Hygiene and Tropical Medicine, London, WC1E 7HT, UK
| | | | | | | | | | | |
Collapse
|
86
|
Advances in Research on Pseudogenes. PROG BIOCHEM BIOPHYS 2011. [DOI: 10.3724/sp.j.1206.2010.00215] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|
87
|
Muro EM, Mah N, Moreno-Hagelsieb G, Andrade-Navarro MA. The pseudogenes of Mycobacterium leprae reveal the functional relevance of gene order within operons. Nucleic Acids Res 2010; 39:1732-8. [PMID: 21051341 PMCID: PMC3061063 DOI: 10.1093/nar/gkq1067] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Almost 50 years following the discovery of the prokaryotic operon, the functional relevance of gene order within operons remains unclear. In this work, we take advantage of the eroded genome of Mycobacterium leprae to add evidence supporting the notion that functionally less important genes have a tendency to be located at the end of its operons. M. leprae’s genome includes 1133 pseudogenes and 1614 protein-coding genes and can be compared with the close genome of M. tuberculosis. Assuming M. leprae’s pseudogenes to represent dispensable genes, we have studied the position of these pseudogenes in the operons of M. leprae and of their orthologs in M. tuberculosis. We observed that both tend to be located in the 3′ (downstream) half of the operon (P-values of 0.03 and 0.18, respectively). Analysis of pseudogenes in all available prokaryotic genomes confirms this trend (P-value of 7.1 × 10−7). In a complementary analysis, we found a significant tendency for essential genes to be located at the 5′ (upstream) half of the operon (P-value of 0.006). Our work provides an indication that, in prokarya, functionally less important genes have a tendency to be located at the end of operons, while more relevant genes tend to be located toward operon starts.
Collapse
Affiliation(s)
- Enrique M Muro
- Computational Biology and Data Mining Group, Max Delbrück Center for Molecular Medicine, Robert-Rössle Strasse 10, 13125, Berlin, Germany.
| | | | | | | |
Collapse
|
88
|
Kuo CH, Ochman H. The extinction dynamics of bacterial pseudogenes. PLoS Genet 2010; 6. [PMID: 20700439 PMCID: PMC2916853 DOI: 10.1371/journal.pgen.1001050] [Citation(s) in RCA: 116] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2010] [Accepted: 07/06/2010] [Indexed: 01/02/2023] Open
Abstract
Pseudogenes are usually considered to be completely neutral sequences whose evolution is shaped by random mutations and chance events. It is possible, however, for disrupted genes to generate products that are deleterious due either to the energetic costs of their transcription and translation or to the formation of toxic proteins. We found that after their initial formation, the youngest pseudogenes in Salmonella genomes have a very high likelihood of being removed by deletional processes and are eliminated too rapidly to be governed by a strictly neutral model of stochastic loss. Those few highly degraded pseudogenes that have persisted in Salmonella genomes correspond to genes with low expression levels and low connectivity in gene networks, such that their inactivation and any initial deleterious effects associated with their inactivation are buffered. Although pseudogenes have long been considered the paradigm of neutral evolution, the distribution of pseudogenes among Salmonella strains indicates that removal of many of these apparently functionless regions is attributable to positive selection.
Collapse
Affiliation(s)
- Chih-Horng Kuo
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, Arizona, United States of America
| | - Howard Ochman
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, Arizona, United States of America
- * E-mail:
| |
Collapse
|
89
|
Belda E, Moya A, Bentley S, Silva FJ. Mobile genetic element proliferation and gene inactivation impact over the genome structure and metabolic capabilities of Sodalis glossinidius, the secondary endosymbiont of tsetse flies. BMC Genomics 2010; 11:449. [PMID: 20649993 PMCID: PMC3091646 DOI: 10.1186/1471-2164-11-449] [Citation(s) in RCA: 57] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2010] [Accepted: 07/22/2010] [Indexed: 02/07/2023] Open
Abstract
BACKGROUND Genome reduction is a common evolutionary process in symbiotic and pathogenic bacteria. This process has been extensively characterized in bacterial endosymbionts of insects, where primary mutualistic bacteria represent the most extreme cases of genome reduction consequence of a massive process of gene inactivation and loss during their evolution from free-living ancestors. Sodalis glossinidius, the secondary endosymbiont of tsetse flies, contains one of the few complete genomes of bacteria at the very beginning of the symbiotic association, allowing to evaluate the relative impact of mobile genetic element proliferation and gene inactivation over the structure and functional capabilities of this bacterial endosymbiont during the transition to a host dependent lifestyle. RESULTS A detailed characterization of mobile genetic elements and pseudogenes reveals a massive presence of different types of prophage elements together with five different families of IS elements that have proliferated across the genome of Sodalis glossinidius at different levels. In addition, a detailed survey of intergenic regions allowed the characterization of 1501 pseudogenes, a much higher number than the 972 pseudogenes described in the original annotation. Pseudogene structure reveals a minor impact of mobile genetic element proliferation in the process of gene inactivation, with most of pseudogenes originated by multiple frameshift mutations and premature stop codons. The comparison of metabolic profiles of Sodalis glossinidius and tsetse fly primary endosymbiont Wiglesworthia glossinidia based on their whole gene and pseudogene repertoires revealed a novel case of pathway inactivation, the arginine biosynthesis, in Sodalis glossinidius together with a possible case of metabolic complementation with Wigglesworthia glossinidia for thiamine biosynthesis. CONCLUSIONS The complete re-analysis of the genome sequence of Sodalis glossinidius reveals novel insights in the evolutionary transition from a free-living ancestor to a host-dependent lifestyle, with a massive proliferation of mobile genetic elements mainly of phage origin although with minor impact in the process of gene inactivation that is taking place in this bacterial genome. The metabolic analysis of the whole endosymbiotic consortia of tsetse flies have revealed a possible phenomenon of metabolic complementation between primary and secondary endosymbionts that can contribute to explain the co-existence of both bacterial endosymbionts in the context of the tsetse host.
Collapse
Affiliation(s)
- Eugeni Belda
- Institut Cavanilles de Biodiversitat i Biologia Evolutiva, Universitat de València. Apartat 22085, València E-46071, Spain
| | - Andrés Moya
- Institut Cavanilles de Biodiversitat i Biologia Evolutiva, Universitat de València. Apartat 22085, València E-46071, Spain
- CIBER en Epidemiología y Salud Pública (CIBEResp), Barcelona, Spain
- Unidad Mixta de Investigación de Genómica y Salud (Centro Superior de Investigación en Salud Pública, CSISP/Institut Cavanilles, Universitat de València, Spain
| | | | - Francisco J Silva
- Institut Cavanilles de Biodiversitat i Biologia Evolutiva, Universitat de València. Apartat 22085, València E-46071, Spain
- CIBER en Epidemiología y Salud Pública (CIBEResp), Barcelona, Spain
- Unidad Mixta de Investigación de Genómica y Salud (Centro Superior de Investigación en Salud Pública, CSISP/Institut Cavanilles, Universitat de València, Spain
| |
Collapse
|
90
|
Abstract
Bacterial gene content variation during the course of evolution has been widely acknowledged and its pattern has been actively modeled in recent years. Gene truncation or gene pseudogenization also plays an important role in shaping bacterial genome content. Truncated genes could also arise from small-scale lateral gene transfer events. Unfortunately, the information of truncated genes has not been considered in any existing mathematical models on gene content variation. In this study, we developed a model to incorporate truncated genes. Maximum-likelihood estimates (MLEs) of the new model reveal fast rates of gene insertions/deletions on recent branches, suggesting a fast turnover of many recently transferred genes. The estimates also suggest that many truncated genes are in the process of being eliminated from the genome. Furthermore, we demonstrate that the ignorance of truncated genes in the estimation does not lead to a systematic bias but rather has a more complicated effect. Analysis using the new model not only provides more accurate estimates on gene gains/losses (or insertions/deletions), but also reduces any concern of a systematic bias from applying simplified models to bacterial genome evolution. Although not a primary purpose, the model incorporating truncated genes could be potentially used for phylogeny reconstruction using gene family content.
Collapse
|
91
|
Perry SC, Beiko RG. Distinguishing microbial genome fragments based on their composition: evolutionary and comparative genomic perspectives. Genome Biol Evol 2010; 2:117-31. [PMID: 20333228 PMCID: PMC2839357 DOI: 10.1093/gbe/evq004] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 01/19/2010] [Indexed: 01/23/2023] Open
Abstract
It is well known that patterns of nucleotide composition vary within and among
genomes, although the reasons why these variations exist are not completely
understood. Between-genome compositional variation has been exploited to assign
environmental shotgun sequences to their most likely originating genomes,
whereas within-genome variation has been used to identify recently acquired
genetic material such as pathogenicity islands. Recent sequence assignment
techniques have achieved high levels of accuracy on artificial data sets, but
the relative difficulty of distinguishing lineages with varying degrees of
relatedness, and different types of genomic sequence, has not been examined in
depth. We investigated the compositional differences in a set of 774 sequenced
microbial genomes, finding rapid divergence among closely related genomes, but
also convergence of compositional patterns among genomes with similar habitats.
Support vector machines were then used to distinguish all pairs of genomes based
on genome fragments 500 nucleotides in length. The nearly 300,000 accuracy
scores obtained from these trials were used to construct general models of
distinguishability versus taxonomic and compositional indices of genomic
divergence. Unusual genome pairs were evident from their large residuals
relative to the fitted model, and we identified several factors including genome
reduction, putative lateral genetic transfer, and habitat convergence that
influence the distinguishability of genomes. The positional, compositional, and
functional context of a fragment within a genome has a strong influence on its
likelihood of correct classification, but in a way that depends on the taxonomic
and ecological similarity of the comparator genome.
Collapse
Affiliation(s)
- Scott C Perry
- Faculty of Computer Science, Dalhousie University, Halifax, Nova Scotia, Canada
| | | |
Collapse
|
92
|
Recent human-to-poultry host jump, adaptation, and pandemic spread of Staphylococcus aureus. Proc Natl Acad Sci U S A 2009; 106:19545-50. [PMID: 19884497 DOI: 10.1073/pnas.0909285106] [Citation(s) in RCA: 300] [Impact Index Per Article: 20.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022] Open
Abstract
The impact of globalization on the emergence and spread of pathogens is an important veterinary and public health issue. Staphylococcus aureus is a notorious human pathogen associated with serious nosocomial and community-acquired infections. In addition, S. aureus is a major cause of animal diseases including skeletal infections of poultry, which are a large economic burden on the global broiler chicken industry. Here, we provide evidence that the majority of S. aureus isolates from broiler chickens are the descendants of a single human-to-poultry host jump that occurred approximately 38 years ago (range, 30 to 63 years ago) by a subtype of the worldwide human ST5 clonal lineage unique to Poland. In contrast to human subtypes of the ST5 radiation, which demonstrate strong geographic clustering, the poultry ST5 clade was distributed in different continents, consistent with wide dissemination via the global poultry industry distribution network. The poultry ST5 clade has undergone genetic diversification from its human progenitor strain by acquisition of novel mobile genetic elements from an avian-specific accessory gene pool, and by the inactivation of several proteins important for human disease pathogenesis. These genetic events have resulted in enhanced resistance to killing by chicken heterophils, reflecting avian host-adaptive evolution. Taken together, we have determined the evolutionary history of a major new animal pathogen that has undergone rapid avian host adaptation and intercontinental dissemination. These data provide a new paradigm for the impact of human activities on the emergence of animal pathogens.
Collapse
|
93
|
Liu YJ, Zheng D, Balasubramanian S, Carriero N, Khurana E, Robilotto R, Gerstein MB. Comprehensive analysis of the pseudogenes of glycolytic enzymes in vertebrates: the anomalously high number of GAPDH pseudogenes highlights a recent burst of retrotrans-positional activity. BMC Genomics 2009; 10:480. [PMID: 19835609 PMCID: PMC2770531 DOI: 10.1186/1471-2164-10-480] [Citation(s) in RCA: 44] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2009] [Accepted: 10/16/2009] [Indexed: 11/30/2022] Open
Abstract
BACKGROUND Pseudogenes provide a record of the molecular evolution of genes. As glycolysis is such a highly conserved and fundamental metabolic pathway, the pseudogenes of glycolytic enzymes comprise a standardized genomic measuring stick and an ideal platform for studying molecular evolution. One of the glycolytic enzymes, glyceraldehyde-3-phosphate dehydrogenase (GAPDH), has already been noted to have one of the largest numbers of associated pseudogenes, among all proteins. RESULTS We assembled the first comprehensive catalog of the processed and duplicated pseudogenes of glycolytic enzymes in many vertebrate model-organism genomes, including human, chimpanzee, mouse, rat, chicken, zebrafish, pufferfish, fruitfly, and worm (available at http://pseudogene.org/glycolysis/). We found that glycolytic pseudogenes are predominantly processed, i.e. retrotransposed from the mRNA of their parent genes. Although each glycolytic enzyme plays a unique role, GAPDH has by far the most pseudogenes, perhaps reflecting its large number of non-glycolytic functions or its possession of a particularly retrotranspositionally active sub-sequence. Furthermore, the number of GAPDH pseudogenes varies significantly among the genomes we studied: none in zebrafish, pufferfish, fruitfly, and worm, 1 in chicken, 50 in chimpanzee, 62 in human, 331 in mouse, and 364 in rat. Next, we developed a simple method of identifying conserved syntenic blocks (consistently applicable to the wide range of organisms in the study) by using orthologous genes as anchors delimiting a conserved block between a pair of genomes. This approach showed that few glycolytic pseudogenes are shared between primate and rodent lineages. Finally, by estimating pseudogene ages using Kimura's two-parameter model of nucleotide substitution, we found evidence for bursts of retrotranspositional activity approximately 42, 36, and 26 million years ago in the human, mouse, and rat lineages, respectively. CONCLUSION Overall, we performed a consistent analysis of one group of pseudogenes across multiple genomes, finding evidence that most of them were created within the last 50 million years, subsequent to the divergence of rodent and primate lineages.
Collapse
Affiliation(s)
- Yuen-Jong Liu
- Department of Surgery, Beth Israel Deaconess Medical Center, Harvard Medical School, 110 Francis Street, Boston, MA, USA
- Department of Molecular Biophysics and Biochemistry, P.O. Box 208114, Yale University, New Haven, CT 06520, USA
| | - Deyou Zheng
- Albert Einstein College of Medicine of Yeshiva University, Department of Neurology, Rose F. Kennedy Center, 1410 Pelham Parkway South, Room 915B, Bronx, NY 10461, USA
| | - Suganthi Balasubramanian
- Department of Molecular Biophysics and Biochemistry, P.O. Box 208114, Yale University, New Haven, CT 06520, USA
| | - Nicholas Carriero
- Department of Molecular Biophysics and Biochemistry, P.O. Box 208114, Yale University, New Haven, CT 06520, USA
| | - Ekta Khurana
- Department of Molecular Biophysics and Biochemistry, P.O. Box 208114, Yale University, New Haven, CT 06520, USA
| | - Rebecca Robilotto
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT 06520, USA
| | - Mark B Gerstein
- Department of Molecular Biophysics and Biochemistry, P.O. Box 208114, Yale University, New Haven, CT 06520, USA
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT 06520, USA
- Department of Computer Science, Yale University, Bass 432, 266 Whitney Avenue, New Haven, CT 06520, USA
| |
Collapse
|
94
|
de Souza GA, Søfteland T, Koehler CJ, Thiede B, Wiker HG. Validating divergent ORF annotation of the Mycobacterium leprae genome through a full translation data set and peptide identification by tandem mass spectrometry. Proteomics 2009; 9:3233-43. [PMID: 19562797 DOI: 10.1002/pmic.200800955] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/21/2023]
Abstract
Mycobacterium leprae has undergone extensive degenerative evolution, with a large number of pseudogenes. It is also the organism with the greatest divergence between gene annotations from independent institutes. Therefore, M. leprae is a good model to verify the currently predicted coding sequence regions between different annotations, to identify new ones and to investigate the expression of pseudogenes. We submitted a total extract of the bacteria isolated from Armadillo to Gel-LC-MS/MS using a linear quadrupole ion trap-Orbitrap mass spectrometer. Spectra were analyzed using the Leproma (1614 genes and 1133 pseudogenes) and TIGR (5446 genes) databases and a database containing the full genome translation. We identified a total of 1046 proteins, including five proteins encoded by previously predicted pseudogenes, which upon closer inspection appeared to be proper genes. Only 11 of the additional annotations by TIGR were verified. We also identified six tryptic peptides from five proteins from regions not considered to be coding sequences, in addition to peptides from two unannotated gene candidates that overlap with other genes. Our data show that the Leproma annotation of M. leprae is quite accurate, and there were no peptide observations corresponding to true pseudogenes, except for a new gene candidate, overlapping with an essential enolase on the complementary strand.
Collapse
Affiliation(s)
- Gustavo A de Souza
- The Gade Institute, Section for Microbiology and Immunology, University of Bergen, Norway
| | | | | | | | | |
Collapse
|
95
|
Williams DL, Slayden RA, Amin A, Martinez AN, Pittman TL, Mira A, Mitra A, Nagaraja V, Morrison NE, Moraes M, Gillis TP. Implications of high level pseudogene transcription in Mycobacterium leprae. BMC Genomics 2009; 10:397. [PMID: 19706172 PMCID: PMC2753549 DOI: 10.1186/1471-2164-10-397] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2008] [Accepted: 08/25/2009] [Indexed: 11/24/2022] Open
Abstract
BACKGROUND The Mycobacterium leprae genome has less than 50% coding capacity and 1,133 pseudogenes. Preliminary evidence suggests that some pseudogenes are expressed. Therefore, defining pseudogene transcriptional and translational potentials of this genome should increase our understanding of their impact on M. leprae physiology. RESULTS Gene expression analysis identified transcripts from 49% of all M. leprae genes including 57% of all ORFs and 43% of all pseudogenes in the genome. Transcribed pseudogenes were randomly distributed throughout the chromosome. Factors resulting in pseudogene transcription included: 1) co-orientation of transcribed pseudogenes with transcribed ORFs within or exclusive of operon-like structures; 2) the paucity of intrinsic stem-loop transcriptional terminators between transcribed ORFs and downstream pseudogenes; and 3) predicted pseudogene promoters. Mechanisms for translational "silencing" of pseudogene transcripts included the lack of both translational start codons and strong Shine-Dalgarno (SD) sequences. Transcribed pseudogenes also contained multiple "in-frame" stop codons and high Ka/Ks ratios, compared to that of homologs in M. tuberculosis and ORFs in M. leprae. A pseudogene transcript containing an active promoter, strong SD site, a start codon, but containing two in frame stop codons yielded a protein product when expressed in E. coli. CONCLUSION Approximately half of M. leprae's transcriptome consists of inactive gene products consuming energy and resources without potential benefit to M. leprae. Presently it is unclear what additional detrimental affect(s) this large number of inactive mRNAs has on the functional capability of this organism. Translation of these pseudogenes may play an important role in overall energy consumption and resultant pathophysiological characteristics of M. leprae. However, this study also demonstrated that multiple translational "silencing" mechanisms are present, reducing additional energy and resource expenditure required for protein production from the vast majority of these transcripts.
Collapse
Affiliation(s)
- Diana L Williams
- HRSA, BPHC, Division of National Hansen's Disease Programs, Laboratory Research Branch, Molecular Biology Research Department @ School of Veterinary Medicine, Louisiana State University, Baton Rouge, LA, USA
| | - Richard A Slayden
- Rocky Mountain Regional Center of Excellence, Department of Microbiology, Immunology & Pathology, Colorado State University, Fort Collins, CO, USA
| | - Amol Amin
- Rocky Mountain Regional Center of Excellence, Department of Microbiology, Immunology & Pathology, Colorado State University, Fort Collins, CO, USA
| | - Alejandra N Martinez
- HRSA, BPHC, Division of National Hansen's Disease Programs, Laboratory Research Branch, Molecular Biology Research Department @ School of Veterinary Medicine, Louisiana State University, Baton Rouge, LA, USA
- Leprosy Laboratory, Department, Tropical Medicine Institute Oswaldo Cruz-FIOCRUZ, Rio de Janeiro, RJ, Brazil
| | - Tana L Pittman
- HRSA, BPHC, Division of National Hansen's Disease Programs, Laboratory Research Branch, Molecular Biology Research Department @ School of Veterinary Medicine, Louisiana State University, Baton Rouge, LA, USA
| | - Alex Mira
- Center for Advanced Research in Public Health, CSISP, Area de Genomica y Salud, Valencia, Spain
| | - Anirban Mitra
- Department of Microbiology and Cell Biology, Indian Institute of Science, Bangalore, India
| | - Valakunja Nagaraja
- Department of Microbiology and Cell Biology, Indian Institute of Science, Bangalore, India
| | - Norman E Morrison
- Center for Tuberculosis Research, Department of Medicine, Division of Infectious Diseases, Johns Hopkins School of Medicine, Baltimore, MD, USA
| | - Milton Moraes
- Leprosy Laboratory, Department, Tropical Medicine Institute Oswaldo Cruz-FIOCRUZ, Rio de Janeiro, RJ, Brazil
| | - Thomas P Gillis
- HRSA, BPHC, Division of National Hansen's Disease Programs, Laboratory Research Branch, Molecular Biology Research Department @ School of Veterinary Medicine, Louisiana State University, Baton Rouge, LA, USA
| |
Collapse
|
96
|
Nakamura K, Akama T, Bang PD, Sekimura S, Tanigawa K, Wu H, Kawashima A, Hayashi M, Suzuki K, Ishii N. Detection of RNA expression from pseudogenes and non-coding genomic regions of Mycobacterium leprae. Microb Pathog 2009; 47:183-7. [PMID: 19555754 DOI: 10.1016/j.micpath.2009.06.006] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2009] [Revised: 05/20/2009] [Accepted: 06/12/2009] [Indexed: 10/20/2022]
Abstract
We have previously reported that some pseudogenes are expressed in Mycobacterium leprae (M. leprae), the causative agent of leprosy, and that their expression levels alter upon infection of macrophages. We attempted to further examine the expression of pseudogene and non-coding genomic region in M. leprae, in this study. 19 Pseudogenes, 17 non-coding genomic regions, and 21 coding genes expression in M. leprae maintained in the footpads of the hypertensive nude rat (SHR/NCrj-rnu) were examined by reverse transcriptase polymerase chain reaction (RT-PCR). The expression of some of these pseudogenes, non-coding genomic regions and coding genes were also examined in M. leprae from skin smear specimens obtained from patients with lepromatous leprosy by RT-PCR. Transcripts from pseudogenes, non-coding genomic regions and coding genes examined in this study were clearly observed in M. leprae. The expression patterns of some of these transcripts vary greatly among different leprosy patients. These results indicate that some of pseudogenes and non-coding genomic regions are transcribed in M. leprae and analysis of RNA expression patterns including pseudogene and non-coding genomic region in M. leprae may be useful in understanding the pathological states of infected patients.
Collapse
Affiliation(s)
- Kazuaki Nakamura
- Department of Bioregulation, Leprosy Research Center, National Institute of Infectious Diseases, Aoba-cho, Higashimurayama, Tokyo, Japan
| | | | | | | | | | | | | | | | | | | |
Collapse
|
97
|
Ward PN, Holden MTG, Leigh JA, Lennard N, Bignell A, Barron A, Clark L, Quail MA, Woodward J, Barrell BG, Egan SA, Field TR, Maskell D, Kehoe M, Dowson CG, Chanter N, Whatmore AM, Bentley SD, Parkhill J. Evidence for niche adaptation in the genome of the bovine pathogen Streptococcus uberis. BMC Genomics 2009; 10:54. [PMID: 19175920 PMCID: PMC2657157 DOI: 10.1186/1471-2164-10-54] [Citation(s) in RCA: 75] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2008] [Accepted: 01/28/2009] [Indexed: 01/28/2023] Open
Abstract
BACKGROUND Streptococcus uberis, a Gram positive bacterial pathogen responsible for a significant proportion of bovine mastitis in commercial dairy herds, colonises multiple body sites of the cow including the gut, genital tract and mammary gland. Comparative analysis of the complete genome sequence of S. uberis strain 0140J was undertaken to help elucidate the biology of this effective bovine pathogen. RESULTS The genome revealed 1,825 predicted coding sequences (CDSs) of which 62 were identified as pseudogenes or gene fragments. Comparisons with related pyogenic streptococci identified a conserved core (40%) of orthologous CDSs. Intriguingly, S. uberis 0140J displayed a lower number of mobile genetic elements when compared with other pyogenic streptococci, however bacteriophage-derived islands and a putative genomic island were identified. Comparative genomics analysis revealed most similarity to the genomes of Streptococcus agalactiae and Streptococcus equi subsp. zooepidemicus. In contrast, streptococcal orthologs were not identified for 11% of the CDSs, indicating either unique retention of ancestral sequence, or acquisition of sequence from alternative sources. Functions including transport, catabolism, regulation and CDSs encoding cell envelope proteins were over-represented in this unique gene set; a limited array of putative virulence CDSs were identified. CONCLUSION S. uberis utilises nutritional flexibility derived from a diversity of metabolic options to successfully occupy a discrete ecological niche. The features observed in S. uberis are strongly suggestive of an opportunistic pathogen adapted to challenging and changing environmental parameters.
Collapse
Affiliation(s)
- Philip N Ward
- Nuffield Department of Clinical Laboratory Sciences, Oxford University, John Radcliffe Hospital, Headington, Oxford, OX3 9DU, UK
| | - Matthew TG Holden
- The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | - James A Leigh
- The School of Veterinary Medicine and Science, The University of Nottingham, Sutton Bonington Campus, Sutton Bonington, Leicestershire, LE12 5RD, UK
| | - Nicola Lennard
- The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | - Alexandra Bignell
- The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | - Andy Barron
- The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | - Louise Clark
- The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | - Michael A Quail
- The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | - John Woodward
- The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | - Bart G Barrell
- The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | - Sharon A Egan
- The School of Veterinary Medicine and Science, The University of Nottingham, Sutton Bonington Campus, Sutton Bonington, Leicestershire, LE12 5RD, UK
| | - Terence R Field
- Institute for Animal Health, Compton Laboratory, Compton, Newbury, Berks, RG20 7NN, UK
| | - Duncan Maskell
- Dept. of Veterinary Medicine, The University of Cambridge, Cambridge, CB3 0ES, UK
| | - Michael Kehoe
- Institute for Cell and Molecular Biosciences, The Medical School, University of Newcastle upon Tyne, Framlington Place, Newcastle upon Tyne, NE2 4HH, UK
| | | | - Neil Chanter
- Centre for Preventative Medicine, Animal Health Trust, Newmarket, Suffolk, CB8 7UU, UK
| | - Adrian M Whatmore
- Department of Biological Sciences, University of Warwick, Coventry, CV4 7AL, UK
- Veterinary Laboratories Agency, Weybridge, UK
| | - Stephen D Bentley
- The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| | - Julian Parkhill
- The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SA, UK
| |
Collapse
|
98
|
Touchon M, Hoede C, Tenaillon O, Barbe V, Baeriswyl S, Bidet P, Bingen E, Bonacorsi S, Bouchier C, Bouvet O, Calteau A, Chiapello H, Clermont O, Cruveiller S, Danchin A, Diard M, Dossat C, Karoui ME, Frapy E, Garry L, Ghigo JM, Gilles AM, Johnson J, Le Bouguénec C, Lescat M, Mangenot S, Martinez-Jéhanne V, Matic I, Nassif X, Oztas S, Petit MA, Pichon C, Rouy Z, Ruf CS, Schneider D, Tourret J, Vacherie B, Vallenet D, Médigue C, Rocha EPC, Denamur E. Organised genome dynamics in the Escherichia coli species results in highly diverse adaptive paths. PLoS Genet 2009; 5:e1000344. [PMID: 19165319 PMCID: PMC2617782 DOI: 10.1371/journal.pgen.1000344] [Citation(s) in RCA: 778] [Impact Index Per Article: 51.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2008] [Accepted: 12/16/2008] [Indexed: 01/01/2023] Open
Abstract
The Escherichia coli species represents one of the best-studied model organisms, but also encompasses a variety of commensal and pathogenic strains that diversify by high rates of genetic change. We uniformly (re-) annotated the genomes of 20 commensal and pathogenic E. coli strains and one strain of E. fergusonii (the closest E. coli related species), including seven that we sequenced to completion. Within the ∼18,000 families of orthologous genes, we found ∼2,000 common to all strains. Although recombination rates are much higher than mutation rates, we show, both theoretically and using phylogenetic inference, that this does not obscure the phylogenetic signal, which places the B2 phylogenetic group and one group D strain at the basal position. Based on this phylogeny, we inferred past evolutionary events of gain and loss of genes, identifying functional classes under opposite selection pressures. We found an important adaptive role for metabolism diversification within group B2 and Shigella strains, but identified few or no extraintestinal virulence-specific genes, which could render difficult the development of a vaccine against extraintestinal infections. Genome flux in E. coli is confined to a small number of conserved positions in the chromosome, which most often are not associated with integrases or tRNA genes. Core genes flanking some of these regions show higher rates of recombination, suggesting that a gene, once acquired by a strain, spreads within the species by homologous recombination at the flanking genes. Finally, the genome's long-scale structure of recombination indicates lower recombination rates, but not higher mutation rates, at the terminus of replication. The ensuing effect of background selection and biased gene conversion may thus explain why this region is A+T-rich and shows high sequence divergence but low sequence polymorphism. Overall, despite a very high gene flow, genes co-exist in an organised genome. Although abundant knowledge has been accumulated regarding the E. coli laboratory strain K-12, little is known about the evolutionary trajectories that have driven the high diversity observed among natural isolates of the species, which encompass both commensal and highly virulent intestinal and extraintestinal pathogenic strains. We have annotated or re-annotated the genomes of 20 commensal and pathogenic E. coli strains and one strain of E. fergusonii (the closest E. coli related species), including seven that we sequenced to completion. Although recombination rates are much higher than mutation rates, we were able to reconstruct a robust phylogeny based on the ∼2,000 genes common to all strains. Based on this phylogeny, we established the evolutionary scenario of gains and losses of thousands of specific genes, identifying functional classes under opposite selection pressures. This genome flux is confined to very few positions in the chromosome, which are the same for every genome. Notably, we identified few or no extraintestinal virulence-specific genes. We also defined a long-scale structure of recombination in the genome with lower recombination rates at the terminus of replication. These findings demonstrate that, despite a very high gene flow, genes can co-exist in an organised genome.
Collapse
Affiliation(s)
- Marie Touchon
- Atelier de BioInformatique, Université Pierre et Marie Curie - Paris 6 (UPMC), Paris, France
- Microbial Evolutionary Genomics, Institut Pasteur, CNRS URA2171, Paris, France
| | - Claire Hoede
- Faculté de Médecine, Université Paris 7 Denis Diderot, INSERM U722, Site Xavier Bichat, Paris, France
| | - Olivier Tenaillon
- Faculté de Médecine, Université Paris 7 Denis Diderot, INSERM U722, Site Xavier Bichat, Paris, France
| | | | - Simon Baeriswyl
- Faculté de Médecine, Université Paris 5 René Descartes, INSERM U571, Paris, France
| | - Philippe Bidet
- Université Paris 7 Denis Diderot, Hôpital Robert Debré (APHP), EA 3105, Paris, France
| | - Edouard Bingen
- Université Paris 7 Denis Diderot, Hôpital Robert Debré (APHP), EA 3105, Paris, France
| | - Stéphane Bonacorsi
- Université Paris 7 Denis Diderot, Hôpital Robert Debré (APHP), EA 3105, Paris, France
| | | | - Odile Bouvet
- Faculté de Médecine, Université Paris 7 Denis Diderot, INSERM U722, Site Xavier Bichat, Paris, France
| | - Alexandra Calteau
- Laboratoire de Génomique Comparative, CNRS UMR8030, Institut de Génomique, CEA, Génoscope, Evry, France
| | - Hélène Chiapello
- UR1077 Mathématique, Informatique, et Génome, INRA, Jouy en Josas, France
| | - Olivier Clermont
- Faculté de Médecine, Université Paris 7 Denis Diderot, INSERM U722, Site Xavier Bichat, Paris, France
| | - Stéphane Cruveiller
- Laboratoire de Génomique Comparative, CNRS UMR8030, Institut de Génomique, CEA, Génoscope, Evry, France
| | - Antoine Danchin
- Unité de Génétique des Génomes Bactériens, Institut Pasteur, CNRS URA2171, Paris, France
| | - Médéric Diard
- Faculté de Médecine, Université Paris 5 René Descartes, INSERM U571, Paris, France
| | | | - Meriem El Karoui
- UR888 Unité des Bactéries Lactiques et Pathogènes Opportunistes, INRA, Jouy en Josas, France
| | - Eric Frapy
- Faculté de Médecine, Université Paris 5 René Descartes, INSERM U570, Paris, France
| | - Louis Garry
- Faculté de Médecine, Université Paris 7 Denis Diderot, INSERM U722, Site Xavier Bichat, Paris, France
| | - Jean Marc Ghigo
- Unité de Génétique des Biofilms, Institut Pasteur, CNRS URA2172, Paris, France
| | - Anne Marie Gilles
- Unité de Génétique des Génomes Bactériens, Institut Pasteur, CNRS URA2171, Paris, France
| | - James Johnson
- Veterans Affairs Medical Center, Minneapolis, Minnesota, United States of America
- Department of Medicine, University of Minnesota, Minneapolis, Minnesota, United States of America
| | | | - Mathilde Lescat
- Faculté de Médecine, Université Paris 7 Denis Diderot, INSERM U722, Site Xavier Bichat, Paris, France
| | | | | | - Ivan Matic
- Faculté de Médecine, Université Paris 5 René Descartes, INSERM U571, Paris, France
| | - Xavier Nassif
- Faculté de Médecine, Université Paris 5 René Descartes, INSERM U570, Paris, France
| | - Sophie Oztas
- Génoscope, Institut de Génomique, CEA, Evry, France
| | - Marie Agnès Petit
- UR888 Unité des Bactéries Lactiques et Pathogènes Opportunistes, INRA, Jouy en Josas, France
| | - Christophe Pichon
- Pathogénie Bactérienne des Muqueuses, Institut Pasteur, Paris, France
| | - Zoé Rouy
- Laboratoire de Génomique Comparative, CNRS UMR8030, Institut de Génomique, CEA, Génoscope, Evry, France
| | - Claude Saint Ruf
- Faculté de Médecine, Université Paris 5 René Descartes, INSERM U571, Paris, France
| | | | - Jérôme Tourret
- Faculté de Médecine, Université Paris 7 Denis Diderot, INSERM U722, Site Xavier Bichat, Paris, France
| | | | - David Vallenet
- Laboratoire de Génomique Comparative, CNRS UMR8030, Institut de Génomique, CEA, Génoscope, Evry, France
| | - Claudine Médigue
- Laboratoire de Génomique Comparative, CNRS UMR8030, Institut de Génomique, CEA, Génoscope, Evry, France
- * E-mail: (CM); (EPCR); (ED)
| | - Eduardo P. C. Rocha
- Atelier de BioInformatique, Université Pierre et Marie Curie - Paris 6 (UPMC), Paris, France
- Microbial Evolutionary Genomics, Institut Pasteur, CNRS URA2171, Paris, France
- * E-mail: (CM); (EPCR); (ED)
| | - Erick Denamur
- Faculté de Médecine, Université Paris 7 Denis Diderot, INSERM U722, Site Xavier Bichat, Paris, France
- * E-mail: (CM); (EPCR); (ED)
| |
Collapse
|
99
|
The genome of Thermosipho africanus TCF52B: lateral genetic connections to the Firmicutes and Archaea. J Bacteriol 2009; 191:1974-8. [PMID: 19124572 DOI: 10.1128/jb.01448-08] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Lateral gene transfers (LGT) (also called horizontal gene transfers) have been a major force shaping the Thermosipho africanus TCF52B genome, whose sequence we describe here. Firmicutes emerge as the principal LGT partner. Twenty-six percent of phylogenetic trees suggest LGT with this group, while 13% of the open reading frames indicate LGT with Archaea.
Collapse
|
100
|
Abstract
Haemophilus parasuis is the causative agent of Glässer's disease, which produces big losses in swine populations worldwide. H. parasuis SH0165, belonging to the dominant serovar 5 in China, is a clinically isolated strain with high-level virulence. Here, we report the first completed genome sequence of this species.
Collapse
|