1. Yusuf B, Wang S, Alam MS, Zhang J, Liu Z, Lu Z, Ding J, Chiwala G, Gao Y, Fang C, Khan SA, Tian X, Islam MM, Hameed HMA, Maslov DA, Zhong N, Hu J, Zhang T. Investigating the role of MAB_1915 in intrinsic resistance to multiple drugs in Mycobacterium abscessus. Microbiol Spectr 2024; 12:e0397423. [PMID: 39162545; PMCID: PMC11448072; DOI: 10.1128/spectrum.03974-23] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/18/2023] [Accepted: 06/25/2024] [Indexed: 08/21/2024] [Open access]
Abstract
Mycobacterium abscessus is of increasing clinical significance because of its innate high-level, broad-spectrum resistance to antibiotics, and it is rapidly emerging as an important human pathogen. This warrants the identification of novel targets to aid the discovery of new drugs or drug combinations to treat M. abscessus infections. This study was inspired by the drug-hypersensitive profile of an M. abscessus mutant (U14) carrying a transposon insertion in MAB_1915. We validated the role of MAB_1915 in intrinsic drug resistance in M. abscessus by constructing a selectable marker-free in-frame deletion in MAB_1915, complementing the mutant with the same or an extended version of the gene, and then performing drug susceptibility testing. Given the putative function of MAB_1915, cell envelope permeability was studied by an ethidium bromide accumulation assay and by susceptibility testing against dyes and detergents. We established genetic evidence for the role of MAB_1915 in intrinsic resistance to rifampicin, rifabutin, linezolid, clarithromycin, vancomycin, and bedaquiline. Disruption of MAB_1915 also caused a significant increase in cell envelope permeability in M. abscessus. Restoration of resistance depended on at least 27 base pairs upstream of the coding DNA sequence of MAB_1915. MAB_1915 could therefore be associated with cell envelope permeability, which would explain its role in intrinsic resistance to multiple drugs in M. abscessus and presents it as a novel target for the future development of effective antimicrobials to overcome intrinsic drug resistance in M. abscessus.
IMPORTANCE: This study reports the role of a putative fadD (MAB_1915) in the innate resistance of M. abscessus to multiple drugs, identifying MAB_1915 as a valuable target and providing a baseline for further mechanistic studies and the development of effective antimicrobials to counter the high level of intrinsic resistance in this pathogen.
Affiliation(s)
- Buhari Yusuf
- State Key Laboratory of Respiratory Disease, Guangzhou Institutes of Biomedicine and Health, Chinese Academy of Sciences, Guangzhou, China
- Guangdong-Hong Kong-Macao Joint Laboratory of Respiratory Infectious Diseases, Guangzhou, China
- China-New Zealand Joint Laboratory on Biomedicine and Health, Guangzhou Institutes of Biomedicine and Health, Chinese Academy of Sciences, Guangzhou, China
- University of Chinese Academy of Sciences, Beijing, China
- Shuai Wang
- State Key Laboratory of Respiratory Disease, Guangzhou Institutes of Biomedicine and Health, Chinese Academy of Sciences, Guangzhou, China
- Guangdong-Hong Kong-Macao Joint Laboratory of Respiratory Infectious Diseases, Guangzhou, China
- China-New Zealand Joint Laboratory on Biomedicine and Health, Guangzhou Institutes of Biomedicine and Health, Chinese Academy of Sciences, Guangzhou, China
- Md Shah Alam
- State Key Laboratory of Respiratory Disease, Guangzhou Institutes of Biomedicine and Health, Chinese Academy of Sciences, Guangzhou, China
- Guangdong-Hong Kong-Macao Joint Laboratory of Respiratory Infectious Diseases, Guangzhou, China
- China-New Zealand Joint Laboratory on Biomedicine and Health, Guangzhou Institutes of Biomedicine and Health, Chinese Academy of Sciences, Guangzhou, China
- University of Chinese Academy of Sciences, Beijing, China
- Jingran Zhang
- State Key Laboratory of Respiratory Disease, Guangzhou Institutes of Biomedicine and Health, Chinese Academy of Sciences, Guangzhou, China
- Guangdong-Hong Kong-Macao Joint Laboratory of Respiratory Infectious Diseases, Guangzhou, China
- China-New Zealand Joint Laboratory on Biomedicine and Health, Guangzhou Institutes of Biomedicine and Health, Chinese Academy of Sciences, Guangzhou, China
- School of Life Sciences, University of Science and Technology of China, Hefei, China
- Zhiyong Liu
- State Key Laboratory of Respiratory Disease, Guangzhou Institutes of Biomedicine and Health, Chinese Academy of Sciences, Guangzhou, China
- Guangdong-Hong Kong-Macao Joint Laboratory of Respiratory Infectious Diseases, Guangzhou, China
- China-New Zealand Joint Laboratory on Biomedicine and Health, Guangzhou Institutes of Biomedicine and Health, Chinese Academy of Sciences, Guangzhou, China
- Guangzhou Medical University, Guangzhou, China
- Guangzhou National Laboratory, Guangzhou, China
- Ziwen Lu
- State Key Laboratory of Respiratory Disease, Guangzhou Institutes of Biomedicine and Health, Chinese Academy of Sciences, Guangzhou, China
- Guangdong-Hong Kong-Macao Joint Laboratory of Respiratory Infectious Diseases, Guangzhou, China
- China-New Zealand Joint Laboratory on Biomedicine and Health, Guangzhou Institutes of Biomedicine and Health, Chinese Academy of Sciences, Guangzhou, China
- University of Chinese Academy of Sciences, Beijing, China
- Jie Ding
- State Key Laboratory of Respiratory Disease, Guangzhou Institutes of Biomedicine and Health, Chinese Academy of Sciences, Guangzhou, China
- Guangdong-Hong Kong-Macao Joint Laboratory of Respiratory Infectious Diseases, Guangzhou, China
- China-New Zealand Joint Laboratory on Biomedicine and Health, Guangzhou Institutes of Biomedicine and Health, Chinese Academy of Sciences, Guangzhou, China
- Institutes of Physical Science and Information Technology, Anhui University, Hefei, China
- Gift Chiwala
- Malawi Liverpool Wellcome Clinical Research Programme, Blantyre, Malawi
- Yamin Gao
- State Key Laboratory of Respiratory Disease, Guangzhou Institutes of Biomedicine and Health, Chinese Academy of Sciences, Guangzhou, China
- Guangdong-Hong Kong-Macao Joint Laboratory of Respiratory Infectious Diseases, Guangzhou, China
- China-New Zealand Joint Laboratory on Biomedicine and Health, Guangzhou Institutes of Biomedicine and Health, Chinese Academy of Sciences, Guangzhou, China
- Cuiting Fang
- State Key Laboratory of Respiratory Disease, Guangzhou Institutes of Biomedicine and Health, Chinese Academy of Sciences, Guangzhou, China
- Guangdong-Hong Kong-Macao Joint Laboratory of Respiratory Infectious Diseases, Guangzhou, China
- China-New Zealand Joint Laboratory on Biomedicine and Health, Guangzhou Institutes of Biomedicine and Health, Chinese Academy of Sciences, Guangzhou, China
- University of Chinese Academy of Sciences, Beijing, China
- Shahzad Akbar Khan
- State Key Laboratory of Respiratory Disease, Guangzhou Institutes of Biomedicine and Health, Chinese Academy of Sciences, Guangzhou, China
- Guangdong-Hong Kong-Macao Joint Laboratory of Respiratory Infectious Diseases, Guangzhou, China
- China-New Zealand Joint Laboratory on Biomedicine and Health, Guangzhou Institutes of Biomedicine and Health, Chinese Academy of Sciences, Guangzhou, China
- Laboratory of Pathology, Department of Pathobiology, University of Poonch Rawalakot Azad Kashmir, Rawalakot, Pakistan
- Xirong Tian
- State Key Laboratory of Respiratory Disease, Guangzhou Institutes of Biomedicine and Health, Chinese Academy of Sciences, Guangzhou, China
- Guangdong-Hong Kong-Macao Joint Laboratory of Respiratory Infectious Diseases, Guangzhou, China
- China-New Zealand Joint Laboratory on Biomedicine and Health, Guangzhou Institutes of Biomedicine and Health, Chinese Academy of Sciences, Guangzhou, China
- University of Chinese Academy of Sciences, Beijing, China
- Md Mahmudul Islam
- State Key Laboratory of Respiratory Disease, Guangzhou Institutes of Biomedicine and Health, Chinese Academy of Sciences, Guangzhou, China
- Guangdong-Hong Kong-Macao Joint Laboratory of Respiratory Infectious Diseases, Guangzhou, China
- China-New Zealand Joint Laboratory on Biomedicine and Health, Guangzhou Institutes of Biomedicine and Health, Chinese Academy of Sciences, Guangzhou, China
- H M Adnan Hameed
- State Key Laboratory of Respiratory Disease, Guangzhou Institutes of Biomedicine and Health, Chinese Academy of Sciences, Guangzhou, China
- Guangdong-Hong Kong-Macao Joint Laboratory of Respiratory Infectious Diseases, Guangzhou, China
- China-New Zealand Joint Laboratory on Biomedicine and Health, Guangzhou Institutes of Biomedicine and Health, Chinese Academy of Sciences, Guangzhou, China
- Dmitry A Maslov
- Division of Gastroenterology and Hepatology, Department of Medicine, Stanford University School of Medicine, Stanford, California, USA
- Nanshan Zhong
- Guangdong-Hong Kong-Macao Joint Laboratory of Respiratory Infectious Diseases, Guangzhou, China
- Guangzhou Medical University, Guangzhou, China
- Guangzhou National Laboratory, Guangzhou, China
- State Key Laboratory of Respiratory Disease, National Clinical Research Center for Respiratory Disease, The National Center for Respiratory Medicine, The First Affiliated Hospital of Guangzhou Medical University, Guangzhou, China
- Jinxing Hu
- Guangzhou National Laboratory, Guangzhou, China
- State Key Laboratory of Respiratory Disease, Guangzhou Chest Hospital, Guangzhou, China
- Tianyu Zhang
- State Key Laboratory of Respiratory Disease, Guangzhou Institutes of Biomedicine and Health, Chinese Academy of Sciences, Guangzhou, China
- Guangdong-Hong Kong-Macao Joint Laboratory of Respiratory Infectious Diseases, Guangzhou, China
- China-New Zealand Joint Laboratory on Biomedicine and Health, Guangzhou Institutes of Biomedicine and Health, Chinese Academy of Sciences, Guangzhou, China
- University of Chinese Academy of Sciences, Beijing, China
- School of Life Sciences, University of Science and Technology of China, Hefei, China
- Guangzhou Medical University, Guangzhou, China
- Guangzhou National Laboratory, Guangzhou, China
- Institutes of Physical Science and Information Technology, Anhui University, Hefei, China
- State Key Laboratory of Respiratory Disease, Guangzhou Chest Hospital, Guangzhou, China

2. Colombatti Olivieri MA, Fresia P, Graña M, Cuerda MX, Nagel A, Alvarado Pinedo F, Romano MI, Caimi K, Berná L, Santangelo MP. Genomic comparison of two strains of Mycobacterium avium subsp. paratuberculosis with contrasting pathogenic phenotype. Tuberculosis (Edinb) 2023; 138:102299. [PMID: 36587510; DOI: 10.1016/j.tube.2022.102299] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/30/2022] [Revised: 11/28/2022] [Accepted: 12/19/2022] [Indexed: 12/24/2022]
Abstract
In a previous study, we used a murine model to evaluate the virulence of Mycobacterium avium subsp. paratuberculosis (Map) strains isolated from cattle in Argentina. This assay allowed us to differentiate between the high-virulence strain MapARG1347 and the low-virulence strain MapARG1543. To determine whether the differences in virulence could be attributed to genetic differences between the strains, we performed whole-genome sequencing, compared their genomes and gene content, and determined their differences relative to the reference strain MapK10. We found 233 SNPs/INDELs in one or both strains relative to MapK10. The two strains share most of the variations, but we found 15 mutations present in only one of the strains. Considering the NS-SNPs/INDELs that produced a severe effect on the coding sequence, we focused the analysis on four predicted proteins putatively related to virulence. Survival of the MapARG1347 strain in bMDM was higher than that of MapARG1543, and MapARG1347 was more resistant to acidic pH and H2O2 stresses than MapK10. The genomic differences between the two strains found in genes MAP1203 (a putative peptidoglycan hydrolase), MAP0403 (a putative serine protease), MAP1003c (a member of the PE-PPE family), and MAP4152 (a putative mycofactocin binding protein) could help explain the contrasting phenotypes previously observed in mouse models.
Affiliation(s)
- M A Colombatti Olivieri
- Instituto de Agrobiotecnología y Biología Molecular (IABIMO), INTA-CONICET, Dr. Nicolás Repetto y De Los Reseros S/Nº B1686IGC, Hurlingham, Buenos Aires, Argentina.
- P Fresia
- Unidad Mixta Pasteur+INIA, Institut Pasteur de Montevideo, Mataojo 2020, CP11400, Montevideo, Uruguay.
- M Graña
- Unidad de Bioinformática, Institut Pasteur de Montevideo, Mataojo 2020, CP11400, Montevideo, Uruguay.
- M X Cuerda
- Instituto de Agrobiotecnología y Biología Molecular (IABIMO), INTA-CONICET, Dr. Nicolás Repetto y De Los Reseros S/Nº B1686IGC, Hurlingham, Buenos Aires, Argentina.
- A Nagel
- Instituto de Agrobiotecnología y Biología Molecular (IABIMO), INTA-CONICET, Dr. Nicolás Repetto y De Los Reseros S/Nº B1686IGC, Hurlingham, Buenos Aires, Argentina.
- F Alvarado Pinedo
- Centro de Diagnóstico e Investigaciones Veterinarias (CEDIVE), Facultad de Ciencias Veterinarias - Universidad de La Plata (UNLP), Chascomus, Buenos Aires, Argentina.
- M I Romano
- Instituto de Agrobiotecnología y Biología Molecular (IABIMO), INTA-CONICET, Dr. Nicolás Repetto y De Los Reseros S/Nº B1686IGC, Hurlingham, Buenos Aires, Argentina.
- K Caimi
- Instituto de Agrobiotecnología y Biología Molecular (IABIMO), INTA-CONICET, Dr. Nicolás Repetto y De Los Reseros S/Nº B1686IGC, Hurlingham, Buenos Aires, Argentina.
- L Berná
- Unidad de Biología Molecular, Institut Pasteur de Montevideo, Mataojo 2020, CP 11400, Montevideo, Uruguay.
- M P Santangelo
- Instituto de Agrobiotecnología y Biología Molecular (IABIMO), INTA-CONICET, Dr. Nicolás Repetto y De Los Reseros S/Nº B1686IGC, Hurlingham, Buenos Aires, Argentina.

3. Mycobacterium smegmatis does not display functional redundancy in nitrate reductase enzymes. PLoS One 2021; 16:e0245745. [PMID: 33471823; PMCID: PMC7816997; DOI: 10.1371/journal.pone.0245745] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/08/2020] [Accepted: 01/06/2021] [Indexed: 12/04/2022] [Open access]
Abstract
Reduction of nitrate to nitrite in bacteria is an essential step in the nitrogen cycle, catalysed by a variety of nitrate reductase (NR) enzymes. The soil dweller, Mycobacterium smegmatis is able to assimilate nitrate and herein we set out to confirm the genetic basis for this by probing NR activity in mutants defective for putative nitrate reductase (NR) encoding genes. In addition to the annotated narB and narGHJI, bioinformatics identified three other putative NR-encoding genes: MSMEG_4206, MSMEG_2237 and MSMEG_6816. To assess the relative contribution of each, the corresponding gene loci were deleted using two-step allelic replacement, individually and in combination. The resulting strains were tested for their ability to assimilate nitrate and reduce nitrate under aerobic and anaerobic conditions, using nitrate assimilation and modified Griess assays. We demonstrated that narB, narGHJI, MSMEG_2237 and MSMEG_6816 were individually dispensable for nitrate assimilation and for nitrate reductase activity under aerobic and anaerobic conditions. Only deletion of MSMEG_4206 resulted in significant reduction in nitrate assimilation under aerobic conditions. These data confirm that in M. smegmatis, narB, narGHJI, MSMEG_2237 and MSMEG_6816 are not required for nitrate reduction as MSMEG_4206 serves as the sole assimilatory NR.

4. Integrating multi-omics data to investigate pseudogene expression in Mycolicibacterium smegmatis. Gene X 2020; 755:144908. [DOI: 10.1016/j.gene.2020.144908] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/03/2020] [Revised: 06/05/2020] [Accepted: 06/17/2020] [Indexed: 11/21/2022] [Open access]

5. Ndah E, Jonckheere V, Giess A, Valen E, Menschaert G, Van Damme P. REPARATION: ribosome profiling assisted (re-)annotation of bacterial genomes. Nucleic Acids Res 2017; 45:e168. [PMID: 28977509; PMCID: PMC5714196; DOI: 10.1093/nar/gkx758] [Citation(s) in RCA: 33] [Impact Index Per Article: 4.7] [Received: 04/22/2017] [Accepted: 08/17/2017] [Indexed: 12/13/2022] [Open access]
Abstract
Prokaryotic genome annotation is highly dependent on automated methods, as manual curation cannot keep up with the exponential growth of sequenced genomes. Current automated methods depend heavily on sequence composition and often underestimate the complexity of the proteome. We developed RibosomeE Profiling Assisted (re-)AnnotaTION (REPARATION), a de novo machine learning algorithm that takes advantage of experimental protein synthesis evidence from ribosome profiling (Ribo-seq) to delineate translated open reading frames (ORFs) in bacteria, independent of genome annotation (https://github.com/Biobix/REPARATION). REPARATION evaluates all possible ORFs in the genome and estimates minimum thresholds based on a growth curve model to screen for spurious ORFs. We applied REPARATION to three annotated bacterial species to obtain a more comprehensive mapping of their translation landscape in support of experimental data. In all cases, we identified hundreds of novel (small) ORFs including variants of previously annotated ORFs and >70% of all (variants of) annotated protein coding ORFs were predicted by REPARATION to be translated. Our predictions are supported by matching mass spectrometry proteomics data, sequence composition and conservation analysis. REPARATION is unique in that it makes use of experimental translation evidence to intrinsically perform a de novo ORF delineation in bacterial genomes irrespective of the sequence features linked to open reading frames.
Affiliation(s)
- Elvis Ndah
- VIB-UGent Center for Medical Biotechnology, B-9000 Ghent, Belgium; Department of Biochemistry, Ghent University, B-9000 Ghent, Belgium; Lab of Bioinformatics and Computational Genomics, Department of Mathematical Modelling, Statistics and Bioinformatics, Faculty of Bioscience Engineering, Ghent University, B-9000 Ghent, Belgium
- Veronique Jonckheere
- VIB-UGent Center for Medical Biotechnology, B-9000 Ghent, Belgium; Department of Biochemistry, Ghent University, B-9000 Ghent, Belgium
- Adam Giess
- Computational Biology Unit, Department of Informatics, University of Bergen, Bergen 5020, Norway
- Eivind Valen
- Computational Biology Unit, Department of Informatics, University of Bergen, Bergen 5020, Norway; Sars International Centre for Marine Molecular Biology, University of Bergen, 5008 Bergen, Norway
- Gerben Menschaert
- Lab of Bioinformatics and Computational Genomics, Department of Mathematical Modelling, Statistics and Bioinformatics, Faculty of Bioscience Engineering, Ghent University, B-9000 Ghent, Belgium
- Petra Van Damme
- VIB-UGent Center for Medical Biotechnology, B-9000 Ghent, Belgium; Department of Biochemistry, Ghent University, B-9000 Ghent, Belgium

6. Li H, Cowie A, Johnson JA, Webster D, Martyniuk CJ, Gray CA. Determining the mode of action of anti-mycobacterial C17 diyne natural products using expression profiling: evidence for fatty acid biosynthesis inhibition. BMC Genomics 2016; 17:621. [PMID: 27514659; PMCID: PMC4981992; DOI: 10.1186/s12864-016-2949-y] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Received: 04/28/2016] [Accepted: 07/18/2016] [Indexed: 11/10/2022] [Open access]
Abstract
Background: The treatment of microbial infections is becoming increasingly challenging because of limited therapeutic options and the growing number of pathogenic strains that are resistant to current antibiotics. There is an urgent need to identify molecules with novel modes of action to facilitate the development of new and more effective therapeutic agents. The anti-mycobacterial activity of the C17 diyne natural products falcarinol and panaxydol has been described previously; however, their mode of action remains largely undetermined in microbes. Gene expression profiling was therefore used to determine the transcriptomic response of Mycobacterium smegmatis upon treatment with falcarinol and panaxydol to better characterize the mode of action of these C17 diynes.
Results: Our analyses identified 704 and 907 transcripts that were differentially expressed in M. smegmatis after treatment with falcarinol and panaxydol respectively. Principal component analysis suggested that the C17 diynes exhibit a mode of action that is distinct to commonly used antimycobacterial drugs. Functional enrichment analysis and pathway enrichment analysis revealed that cell processes such as ectoine biosynthesis and cyclopropane-fatty-acyl-phospholipid synthesis were responsive to falcarinol and panaxydol treatment at the transcriptome level in M. smegmatis. The modes of action of the two C17 diynes were also predicted through Prediction of Activity Spectra of Substances (PASS). Based upon convergence of these three independent analyses, we hypothesize that the C17 diynes inhibit fatty acid biosynthesis, specifically phospholipid synthesis, in mycobacteria.
Conclusion: Based on transcriptomic responses, it is suggested that the C17 diynes act differently than other anti-mycobacterial compounds in M. smegmatis, and do so by inhibiting phospholipid biosynthesis.
Electronic supplementary material: The online version of this article (doi:10.1186/s12864-016-2949-y) contains supplementary material, which is available to authorized users.
Affiliation(s)
- Haoxin Li
- Department of Biological Sciences, University of New Brunswick, PO Box 5050, 100 Tucker Park Road, E2L 4L5, Saint John, NB, Canada
- Andrew Cowie
- Department of Biological Sciences, University of New Brunswick, PO Box 5050, 100 Tucker Park Road, E2L 4L5, Saint John, NB, Canada
- John A Johnson
- Department of Biological Sciences, University of New Brunswick, PO Box 5050, 100 Tucker Park Road, E2L 4L5, Saint John, NB, Canada
- Duncan Webster
- Department of Medicine, Division of Infectious Diseases, Saint John Regional Hospital, 400 University Ave, E2L 4L4, Saint John, NB, Canada
- Christopher J Martyniuk
- Department of Biological Sciences, University of New Brunswick, PO Box 5050, 100 Tucker Park Road, E2L 4L5, Saint John, NB, Canada; Present address: Center for Environmental and Human Toxicology & Department of Physiological Sciences, UF Genetics Institute, College of Veterinary Medicine, University of Florida, 1333 Center Drive, 32610-0144, Gainesville, FL, USA
- Christopher A Gray
- Department of Biological Sciences, University of New Brunswick, PO Box 5050, 100 Tucker Park Road, E2L 4L5, Saint John, NB, Canada; Department of Chemistry, University of New Brunswick, PO Box 4400, 30 Dineen Drive, E3B 5A3, Fredericton, NB, Canada

7. Potgieter MG, Nakedi KC, Ambler JM, Nel AJM, Garnett S, Soares NC, Mulder N, Blackburn JM. Proteogenomic Analysis of Mycobacterium smegmatis Using High Resolution Mass Spectrometry. Front Microbiol 2016; 7:427. [PMID: 27092112; PMCID: PMC4821088; DOI: 10.3389/fmicb.2016.00427] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.1] [Received: 10/25/2015] [Accepted: 03/16/2016] [Indexed: 11/30/2022] [Open access]
Abstract
Biochemical evidence is vital for accurate genome annotation. The integration of experimental data collected at the proteome level using high resolution mass spectrometry allows for improvements in genome annotation by providing evidence for novel gene models, while validating or modifying others. Here, we report the results of a proteogenomic analysis of a reference strain of Mycobacterium smegmatis (mc2155), a fast growing model organism for the pathogenic Mycobacterium tuberculosis—the causative agent for Tuberculosis. By integrating high throughput LC/MS/MS proteomic data with genomic six frame translation and ab initio gene prediction databases, a total of 2887 ORFs were identified, including 2810 ORFs annotated to a Reference protein, and 63 ORFs not previously annotated to a Reference protein. Further, the translational start site (TSS) was validated for 558 Reference proteome gene models, while upstream translational evidence was identified for 81. In addition, N-terminus derived peptide identifications allowed for downstream TSS modification of a further 24 gene models. We validated the existence of six previously described interrupted coding sequences at the peptide level, and provide evidence for four novel frameshift positions. Analysis of peptide posterior error probability (PEP) scores indicates high-confidence novel peptide identifications and shows that the genome of M. smegmatis mc2155 is not yet fully annotated. Data are available via ProteomeXchange with identifier PXD003500.
Affiliation(s)
- Matthys G Potgieter
- Computational Biology Division, Department of Integrative Biomedical Sciences, IDM, University of Cape Town, Cape Town, South Africa
- Kehilwe C Nakedi
- Division of Chemical and Systems Biology, Department of Integrative Biomedical Sciences, IDM, University of Cape Town, Cape Town, South Africa
- Jon M Ambler
- Computational Biology Division, Department of Integrative Biomedical Sciences, IDM, University of Cape Town, Cape Town, South Africa
- Andrew J M Nel
- Division of Chemical and Systems Biology, Department of Integrative Biomedical Sciences, IDM, University of Cape Town, Cape Town, South Africa
- Shaun Garnett
- Division of Chemical and Systems Biology, Department of Integrative Biomedical Sciences, IDM, University of Cape Town, Cape Town, South Africa
- Nelson C Soares
- Division of Chemical and Systems Biology, Department of Integrative Biomedical Sciences, IDM, University of Cape Town, Cape Town, South Africa
- Nicola Mulder
- Computational Biology Division, Department of Integrative Biomedical Sciences, IDM, University of Cape Town, Cape Town, South Africa
- Jonathan M Blackburn
- Division of Chemical and Systems Biology, Department of Integrative Biomedical Sciences, IDM, University of Cape Town, Cape Town, South Africa

8. Kumar D, Mondal AK, Kutum R, Dash D. Proteogenomics of rare taxonomic phyla: A prospective treasure trove of protein coding genes. Proteomics 2015; 16:226-40. [PMID: 26773550; DOI: 10.1002/pmic.201500263] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Received: 06/30/2015] [Revised: 09/18/2015] [Accepted: 09/28/2015] [Indexed: 01/04/2023]
Abstract
Sustainable innovations in sequencing technologies have resulted in a torrent of microbial genome sequencing projects. However, the prokaryotic genomes sequenced so far are unequally distributed along their phylogenetic tree; a few phyla contain the majority, while the rest have only a few representatives. Accurate genome annotation lags far behind genome sequencing. While automated computational prediction, aided by comparative genomics, remains a popular choice for genome annotation, a substantial fraction of these annotations are erroneous. Proteogenomics utilizes protein-level experimental observations to annotate protein coding genes on a genome-wide scale. Benefits of proteogenomics include discovery and correction of gene annotations regardless of their phylogenetic conservation. This allows not only the detection of common, conserved proteins but also the discovery of protein products of rare genes that may be horizontally transferred or taxonomy specific. The chances of encountering such genes are higher in rare phyla that comprise a small number of complete genome sequences. We collated all bacterial and archaeal proteogenomic studies carried out to date and reviewed them in the context of genome sequencing projects. Here, we present a comprehensive list of microbial proteogenomic studies and their taxonomic distribution, and urge targeted proteogenomics of underexplored taxa to build an extensive reference of protein coding genes.
Affiliation(s)
- Dhirendra Kumar
- G. N. Ramachandran Knowledge Center of Genome Informatics, CSIR-Institute of Genomics and Integrative Biology, South Campus, Sukhdev Vihar, Delhi, India
- Anupam Kumar Mondal
- G. N. Ramachandran Knowledge Center of Genome Informatics, CSIR-Institute of Genomics and Integrative Biology, South Campus, Sukhdev Vihar, Delhi, India
- Rintu Kutum
- G. N. Ramachandran Knowledge Center of Genome Informatics, CSIR-Institute of Genomics and Integrative Biology, South Campus, Sukhdev Vihar, Delhi, India
- Debasis Dash
- G. N. Ramachandran Knowledge Center of Genome Informatics, CSIR-Institute of Genomics and Integrative Biology, South Campus, Sukhdev Vihar, Delhi, India

9. Viswanathan G, Joshi SV, Sridhar A, Dutta S, Raghunand TR. Identifying novel mycobacterial stress associated genes using a random mutagenesis screen in Mycobacterium smegmatis. Gene 2015. [PMID: 26211627; DOI: 10.1016/j.gene.2015.07.063] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Indexed: 10/23/2022]
Abstract
Cell envelope associated components of Mycobacterium tuberculosis (M.tb) have been implicated in stress response, immune modulation and in vivo survival of the pathogen. Although many such factors have been identified, there is a large disparity between the number of genes predicted to be involved in functions linked to the envelope and those described in the literature. To identify and characterise novel stress related factors associated with the mycobacterial cell envelope, we isolated colony morphotype mutants of Mycobacterium smegmatis (M. smegmatis), based on the hypothesis that mutants with unusual colony morphology may have defects in the biosynthesis of cell envelope components. On testing their susceptibility to stress conditions relevant to M.tb physiology, multiple mutants were found to be sensitive to Isoniazid, Diamide and H2O2, indicative of altered permeability due to changes in cell envelope composition. Two mutants showed defects in biofilm formation implying possible roles for the target genes in antibiotic tolerance and/or virulence. These assays identified novel stress associated roles for several mycobacterial genes including sahH, tatB and aceE. Complementation analysis of selected mutants with the M. smegmatis genes and their M.tb homologues showed phenotypic restoration, validating their link to the observed phenotypes. A mutant carrying an insertion in fhaA encoding a forkhead associated domain containing protein, showed reduced survival in THP-1 macrophages, providing in vivo validation to this screen. Taken together, these results suggest that the M.tb homologues of a majority of the identified genes may play significant roles in the pathogenesis of tuberculosis.
Affiliation(s)
- Shrilaxmi V Joshi
- CSIR - Centre for Cellular and Molecular Biology, Uppal Road, Hyderabad, India
- Aditi Sridhar
- CSIR - Centre for Cellular and Molecular Biology, Uppal Road, Hyderabad, India
- Sayantanee Dutta
- CSIR - Centre for Cellular and Molecular Biology, Uppal Road, Hyderabad, India
|
10
|
Regulation of homocysteine metabolism by Mycobacterium tuberculosis S-adenosylhomocysteine hydrolase. Sci Rep 2014; 3:2264. [PMID: 23877358 PMCID: PMC3719076 DOI: 10.1038/srep02264] [Citation(s) in RCA: 37] [Impact Index Per Article: 3.7] [Received: 04/22/2013] [Accepted: 07/08/2013] [Indexed: 12/28/2022] Open
Abstract
Mycobacterium tuberculosis modulates expression of various metabolism-related genes to adapt in the adverse host environment. The gene coding for M. tuberculosis S-adenosylhomocysteine hydrolase (Mtb-SahH) is essential for optimal growth and the protein product is involved in intermediary metabolism. However, the relevance of SahH in mycobacterial physiology is unknown. In this study, we analyze the role of Mtb-SahH in regulating homocysteine concentration in surrogate host Mycobacterium smegmatis. Mtb-SahH catalyzes reversible hydrolysis of S-adenosylhomocysteine to homocysteine and adenosine and we demonstrate that the conserved His363 residue is critical for bi-directional catalysis. Mtb-SahH is regulated by serine/threonine phosphorylation of multiple residues by M. tuberculosis PknB. Major phosphorylation events occur at contiguous residues Thr219, Thr220 and Thr221, which make pivotal contacts with cofactor NAD+. Consequently, phosphorylation negatively modulates affinity of enzyme towards NAD+ as well as SAH-synthesis. Thr219, Thr220 and Thr221 are essential for enzyme activity, and therefore, responsible for SahH-mediated regulation of homocysteine.
|
11
|
Armengaud J, Hartmann EM, Bland C. Proteogenomics for environmental microbiology. Proteomics 2013; 13:2731-42. [PMID: 23636904 DOI: 10.1002/pmic.201200576] [Citation(s) in RCA: 53] [Impact Index Per Article: 4.8] [Received: 12/19/2012] [Revised: 03/06/2013] [Accepted: 04/09/2013] [Indexed: 11/09/2022]
Abstract
Proteogenomics sensu stricto refers to the use of proteomic data to refine the annotation of genomes from model organisms. Because of the limitations of automatic annotation pipelines, a relatively high number of errors occur during the structural annotation of genes coding for proteins. Whether putative orphan sequences or short genes encoding low-molecular-weight proteins really exist is still frequently a mystery. Whether start codons are well defined is also an open debate. These problems are exacerbated for genomes of microorganisms belonging to poorly documented genera, as related sequences are not always available for homology-guided annotation. The functional annotation of a significant proportion of genes is also another well-known issue when annotating environmental microorganisms. High-throughput shotgun proteomics has recently greatly evolved, allowing the exploration of the proteome from any microorganism at an unprecedented depth. The structural and functional annotation process may be usefully complemented with experimental data. Indeed, proteogenomic mapping has been successfully performed for a wide variety of organisms. Specific approaches devoted to systematically establishing the N-termini of a large set of proteins are being developed. N-terminomics is giving rise to datasets of experimentally proven translational start codons as well as validated peptide signals for secreted proteins. By extension, combining genomic and proteomic data is becoming routine in many research projects. The proteomic analysis of organisms with unfinished genome sequences, the so-called composite proteomics, and the search for microbial biomarkers by bottom-up and top-down combined approaches are some examples of proteogenomic-flavored studies. They illustrate the advent of a new era of environmental microbiology where proteomics and genomics are intimately integrated to answer key biological questions.
Affiliation(s)
- Jean Armengaud
- CEA, DSV, IBEB, Lab Biochim System Perturb, Bagnols-sur-Cèze, France
|
12
|
Antonov I, Baranov P, Borodovsky M. GeneTack database: genes with frameshifts in prokaryotic genomes and eukaryotic mRNA sequences. Nucleic Acids Res 2012; 41:D152-6. [PMID: 23161689 PMCID: PMC3531167 DOI: 10.1093/nar/gks1062] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.9] [Indexed: 11/14/2022] Open
Abstract
Database annotations of prokaryotic genomes and eukaryotic mRNA sequences pay relatively low attention to frame transitions that disrupt protein-coding genes. Frame transitions (frameshifts) could be caused by sequencing errors or indel mutations inside protein-coding regions. Other observed frameshifts are related to recoding events (that evolved to control expression of some genes). Earlier, we have developed an algorithm and software program GeneTack for ab initio frameshift finding in intronless genes. Here, we describe a database (freely available at http://topaz.gatech.edu/GeneTack/db.html) containing genes with frameshifts (fs-genes) predicted by GeneTack. The database includes 206 991 fs-genes from 1106 complete prokaryotic genomes and 45 295 frameshifts predicted in mRNA sequences from 100 eukaryotic genomes. The whole set of fs-genes was grouped into clusters based on sequence similarity between fs-proteins (conceptually translated fs-genes), conservation of the frameshift position and frameshift direction (−1, +1). The fs-genes can be retrieved by similarity search to a given query sequence via a web interface, by fs-gene cluster browsing, etc. Clusters of fs-genes are characterized with respect to their likely origin, such as pseudogenization, phase variation, etc. The largest clusters contain fs-genes with programed frameshifts (related to recoding events).
Affiliation(s)
- Ivan Antonov
- School of Computational Science and Engineering, Georgia Institute of Technology, Atlanta, GA 30332, USA
|
13
|
Vats A, Singh AK, Mukherjee R, Chopra T, Ravindran MS, Mohanty D, Chatterji D, Reyrat JM, Gokhale RS. Retrobiosynthetic approach delineates the biosynthetic pathway and the structure of the acyl chain of mycobacterial glycopeptidolipids. J Biol Chem 2012; 287:30677-87. [PMID: 22798073 DOI: 10.1074/jbc.m112.384966] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.3] [Indexed: 11/06/2022] Open
Abstract
Glycopeptidolipids (GPLs) are dominant cell surface molecules present in several non-tuberculous and opportunistic mycobacterial species. GPLs from Mycobacterium smegmatis are composed of a lipopeptide core unit consisting of a modified C26-C34 fatty acyl chain that is linked to a tetrapeptide (Phe-Thr-Ala-alaninol). The hydroxyl groups of threonine and terminal alaninol are further modified by glycosylations. Although chemical structures have been reported for 16 GPLs from diverse mycobacteria, there is still ambiguity in identifying the exact position of the hydroxyl group on the fatty acyl chain. Moreover, the enzymes involved in the biosynthesis of the fatty acyl component are unknown. In this study we show that a bimodular polyketide synthase in conjunction with a fatty acyl-AMP ligase dictates the synthesis of fatty acyl chain of GPL. Based on genetic, biochemical, and structural investigations, we determine that the hydroxyl group is present at the C-5 position of the fatty acyl component. Our retrobiosynthetic approach has provided a means to understand the biosynthesis of GPLs and also resolve the long-standing debate on the accurate structure of mycobacterial GPLs.
Affiliation(s)
- Archana Vats
- CSIR-Institute of Genomics and Integrative Biology, Mall Road, Delhi 110007, India
|
14
|
Christie-Oleza JA, Miotello G, Armengaud J. High-throughput proteogenomics of Ruegeria pomeroyi: seeding a better genomic annotation for the whole marine Roseobacter clade. BMC Genomics 2012; 13:73. [PMID: 22336032 PMCID: PMC3305630 DOI: 10.1186/1471-2164-13-73] [Citation(s) in RCA: 37] [Impact Index Per Article: 3.1] [Received: 07/25/2011] [Accepted: 02/15/2012] [Indexed: 11/10/2022] Open
Abstract
Background The structural and functional annotation of genomes is now heavily based on data obtained using automated pipeline systems. The key for an accurate structural annotation consists of blending similarities between closely related genomes with biochemical evidence of the genome interpretation. In this work we applied high-throughput proteogenomics to Ruegeria pomeroyi, a member of the Roseobacter clade, an abundant group of marine bacteria, as a seed for the annotation of the whole clade. Results A large dataset of peptides from R. pomeroyi was obtained after searching over 1.1 million MS/MS spectra against a six-frame translated genome database. We identified 2006 polypeptides, of which thirty-four were encoded by open reading frames (ORFs) that had not previously been annotated. From the pool of 'one-hit-wonders', i.e. those ORFs specified by only one peptide detected by tandem mass spectrometry, we could confirm the probable existence of five additional new genes after proving that the corresponding RNAs were transcribed. We also identified the most-N-terminal peptide of 486 polypeptides, of which sixty-four had originally been wrongly annotated. Conclusions By extending these re-annotations to the other thirty-six Roseobacter isolates sequenced to date (twenty different genera), we propose the correction of the assigned start codons of 1082 homologous genes in the clade. In addition, we also report the presence of novel genes within operons encoding determinants of the important tricarboxylic acid cycle, a feature that seems to be characteristic of some Roseobacter genomes. The detection of their corresponding products in large amounts raises the question of their function. Their discoveries point to a possible theory for protein evolution that will rely on high expression of orphans in bacteria: their putative poor efficiency could be counterbalanced by a higher level of expression. 
Our proteogenomic analysis will increase the reliability of the future annotation of marine bacterial genomes.
|
15
|
Sharma V, Firth AE, Antonov I, Fayet O, Atkins JF, Borodovsky M, Baranov PV. A pilot study of bacterial genes with disrupted ORFs reveals a surprising profusion of protein sequence recoding mediated by ribosomal frameshifting and transcriptional realignment. Mol Biol Evol 2011; 28:3195-211. [PMID: 21673094 DOI: 10.1093/molbev/msr155] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.0] [Indexed: 11/14/2022] Open
Abstract
Bacterial genome annotations contain a number of coding sequences (CDSs) that, in spite of reading frame disruptions, encode a single continuous polypeptide. Such disruptions have different origins: sequencing errors, frameshift, or stop codon mutations, as well as instances of utilization of nontriplet decoding. We have extracted over 1,000 CDSs with annotated disruptions and found that about 75% of them can be clustered into 64 groups based on sequence similarity. Analysis of the clusters revealed deep phylogenetic conservation of open reading frame organization as well as the presence of conserved sequence patterns that indicate likely utilization of the nonstandard decoding mechanisms: programmed ribosomal frameshifting (PRF) and programmed transcriptional realignment (PTR). Further enrichment of these clusters with additional homologous nucleotide sequences revealed over 6,000 candidate genes utilizing PRF or PTR. Analysis of the patterns of conservation apparently associated with nontriplet decoding revealed the presence of both previously characterized frameshift-prone sequences and a few novel ones. Since the starting point of our analysis was a set of genes with already annotated disruptions, it is highly plausible that in this study, we have identified only a fraction of all bacterial genes that utilize PRF or PTR. In addition to the identification of a large number of recoded genes, a surprising observation is that nearly half of them are expressed via PTR-a mechanism that, in contrast to PRF, has not yet received substantial attention.
Affiliation(s)
- Virag Sharma
- Department of Biochemistry, University College Cork, Cork, Ireland
|
16
|
Identification of the monooxygenase gene clusters responsible for the regioselective oxidation of phenol to hydroquinone in mycobacteria. Appl Environ Microbiol 2010; 77:1214-20. [PMID: 21183637 DOI: 10.1128/aem.02316-10] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.1] [Indexed: 11/20/2022] Open
Abstract
Mycobacterium goodii strain 12523 is an actinomycete that is able to oxidize phenol regioselectively at the para position to produce hydroquinone. In this study, we investigated the genes responsible for this unique regioselective oxidation. On the basis of the fact that the oxidation activity of M. goodii strain 12523 toward phenol is induced in the presence of acetone, we first identified acetone-induced proteins in this microorganism by two-dimensional electrophoretic analysis. The N-terminal amino acid sequence of one of these acetone-induced proteins shares 100% identity with that of the protein encoded by the open reading frame Msmeg_1971 in Mycobacterium smegmatis strain mc²155, whose genome sequence has been determined. Since Msmeg_1971, Msmeg_1972, Msmeg_1973, and Msmeg_1974 constitute a putative binuclear iron monooxygenase gene cluster, we cloned this gene cluster of M. smegmatis strain mc²155 and its homologous gene cluster found in M. goodii strain 12523. Sequence analysis of these binuclear iron monooxygenase gene clusters revealed the presence of four genes designated mimABCD, which encode an oxygenase large subunit, a reductase, an oxygenase small subunit, and a coupling protein, respectively. When the mimA gene (Msmeg_1971) of M. smegmatis strain mc²155, which was also found to be able to oxidize phenol to hydroquinone, was deleted, this mutant lost the oxidation ability. This ability was restored by introduction of the mimA gene of M. smegmatis strain mc²155 or of M. goodii strain 12523 into this mutant. Interestingly, we found that these gene clusters also play essential roles in propane and acetone metabolism in these mycobacteria.
|
17
|
Lamontagne J, Béland M, Forest A, Côté-Martin A, Nassif N, Tomaki F, Moriyón I, Moreno E, Paramithiotis E. Proteomics-based confirmation of protein expression and correction of annotation errors in the Brucella abortus genome. BMC Genomics 2010; 11:300. [PMID: 20462421 PMCID: PMC2877026 DOI: 10.1186/1471-2164-11-300] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.4] [Received: 11/18/2009] [Accepted: 05/12/2010] [Indexed: 12/23/2022] Open
Abstract
Background Brucellosis is a major bacterial zoonosis affecting domestic livestock and wild mammals, as well as humans around the globe. While conducting proteomics studies to better understand Brucella abortus virulence, we consolidated the proteomic data collected and compared it to publicly available genomic data. Results The proteomic data was compiled from several independent comparative studies of Brucella abortus that used either outer membrane blebs, cytosols, or whole bacteria grown in media, as well as intracellular bacteria recovered at different times following macrophage infection. We identified a total of 621 bacterial proteins that were differentially expressed in a condition-specific manner. For 305 of these proteins we provide the first experimental evidence of their expression. Using a custom-built protein sequence database, we uncovered 7 annotation errors. We provide experimental evidence of expression of 5 genes that were originally annotated as non-expressed pseudogenes, as well as start site annotation errors for 2 other genes. Conclusions An essential element for ensuring correct functional studies is the correspondence between reported genome sequences and subsequent proteomics studies. In this study, we have used proteomics evidence to confirm expression of multiple proteins previously considered to be putative, as well as correct annotation errors in the genome of Brucella abortus strain 2308.
Affiliation(s)
- Julie Lamontagne
- Caprion Proteomics Inc, 7150 Alexander-Fleming, Montreal, Quebec, Canada
|
18
|
Armengaud J. Proteogenomics and systems biology: quest for the ultimate missing parts. Expert Rev Proteomics 2010; 7:65-77. [DOI: 10.1586/epr.09.104] [Citation(s) in RCA: 38] [Impact Index Per Article: 2.7] [Indexed: 08/30/2023]
|
19
|
Baudet M, Ortet P, Gaillard JC, Fernandez B, Guérin P, Enjalbal C, Subra G, de Groot A, Barakat M, Dedieu A, Armengaud J. Proteomics-based refinement of Deinococcus deserti genome annotation reveals an unwonted use of non-canonical translation initiation codons. Mol Cell Proteomics 2009; 9:415-26. [PMID: 19875382 PMCID: PMC2830850 DOI: 10.1074/mcp.m900359-mcp200] [Citation(s) in RCA: 79] [Impact Index Per Article: 5.3] [Indexed: 01/28/2023] Open
Abstract
Deinococcaceae are a family of extremely radiation-tolerant bacteria that are currently subjected to numerous studies aimed at understanding the molecular mechanisms for such radiotolerance. To achieve a comprehensive and accurate annotation of the Deinococcus deserti genome, we performed an N terminus-oriented characterization of its proteome. For this, we used a labeling reagent, N-tris(2,4,6-trimethoxyphenyl)phosphonium acetyl succinimide, to selectively derivatize protein N termini. The large scale identification of N-tris(2,4,6-trimethoxyphenyl)phosphonium acetyl succinimide-modified N-terminal-most peptides by shotgun liquid chromatography-tandem mass spectrometry analysis led to the validation of 278 and the correction of 73 translation initiation codons in the D. deserti genome. In addition, four new genes were detected, three located on the main chromosome and one on plasmid P3. We also analyzed signal peptide cleavages on a genome-wide scale. Based on comparative proteogenomics analysis, we propose a set of 137 corrections to improve Deinococcus radiodurans and Deinococcus geothermalis gene annotations. Some of these corrections affect important genes involved in DNA repair mechanisms such as polA, ligA, and ddrB. Surprisingly, experimental evidence was obtained indicating that DnaA (the protein involved in the DNA replication initiation process) and RpsL (the S12 ribosomal conserved protein) translation is initiated in Deinococcaceae from non-canonical codons (ATC and CTG, respectively). Such use may be the basis of specific regulation mechanisms affecting replication and translation. We also report the use of non-conventional translation initiation codons for two other genes: Deide_03051 and infC. Whether such use of non-canonical translation initiation codons is much more frequent than for other previously reported bacterial phyla or restricted to Deinococcaceae remains to be investigated.
Our results demonstrate that predicting translation initiation codons is still difficult for some bacteria and that proteomics-based refinement of genome annotations may be helpful in such cases.
Affiliation(s)
- Mathieu Baudet
- Laboratoire de Biochimie des Systèmes Perturbés, Service de Biochimie et Toxicologie Nucléaire, Institut de Biologie Environnementale et Biotechnologie (iBEB), Direction des Sciences du Vivant (DSV), Commissariat à l'Energie Atomique et aux Energies Alternatives (CEA), F-30207 Bagnols-sur-Cèze, France
|
20
|
Abstract
This unit gives background information on Mycobacterium smegmatis, a mycobacterial model system, and covers all the laboratory maintenance for this species including growth in liquid and on solid medium. It also contains recommendations concerning long-term strain storage. Although M. smegmatis is a Biosafety Level 1 organism, some rare infections in humans have been reported, and, thus all of the required safety measures are discussed here.
|
21
|
Armengaud J. A perfect genome annotation is within reach with the proteomics and genomics alliance. Curr Opin Microbiol 2009; 12:292-300. [PMID: 19410500 DOI: 10.1016/j.mib.2009.03.005] [Citation(s) in RCA: 79] [Impact Index Per Article: 5.3] [Received: 01/12/2009] [Revised: 03/26/2009] [Accepted: 03/26/2009] [Indexed: 11/17/2022]
Abstract
High-throughput identification of proteins and their accurate partial sequencing by shotgun nanoLC-MS/MS are now feasible for any cellular model at a full genomic scale. Proteogenomics is the integration of these data with the genome. Mining microbial proteomes allows validation of predicted orphan genes and correction of genome annotation errors such as discovery of unannotated genes, reversal of reading frames and identification of translational start sites, stop codon read-throughs or programmed frameshifts. Recent advances have been achieved in database searches, N-terminal oriented proteomics and homology-driven proteogenomics. From now on, proteogenomics on newly sequenced model genomes can be carried out at the earliest stage of the genome project as already exemplified by Mycoplasma mobile and Deinococcus deserti genomes. The proteomics and genomics alliance produces almost complete and accurate gene catalogues for small microbial genomes, a comprehensiveness which is essential for efficient systems biology.
Affiliation(s)
- Jean Armengaud
- CEA, DSV, IBEB, Lab Biochim System Perturb, Bagnols-sur-Cèze, France.
|
22
|
Wu CW, Schramm TM, Zhou S, Schwartz DC, Talaat AM. Optical mapping of the Mycobacterium avium subspecies paratuberculosis genome. BMC Genomics 2009; 10:25. [PMID: 19146697 PMCID: PMC2633350 DOI: 10.1186/1471-2164-10-25] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.1] [Received: 08/13/2008] [Accepted: 01/15/2009] [Indexed: 01/27/2023] Open
Abstract
BACKGROUND Infection of cattle with Mycobacterium avium subspecies paratuberculosis (M. ap) causes severe economic losses to the dairy industry in the USA and worldwide. In an effort to better examine diversity among M. ap strains, we used optical mapping to profile genomic variations between strains of M. ap K-10 (sequenced strain) and M. ap ATCC 19698 (type strain). RESULTS The assembled physical restriction map of M. ap ATCC 19698 showed a genome size of 4,839 kb compared to the sequenced K-10 genome of 4,830 kb. Interestingly, alignment of the optical map of the M. ap ATCC 19698 genome to the complete M. ap K-10 genome sequence revealed a 648-kb inversion around the origin of replication. However, Southern blotting, PCR amplification and sequencing analyses of the inverted region revealed that the genome of M. ap K-10 differs from the published sequence in the region starting from 4,197,080 bp to 11,150 bp, spanning the origin of replication. Additionally, two new copies of coding sequences more than 99.8% identical to the MAP0849c and MAP0850c genes were identified, located immediately downstream of the MAP3758c gene. CONCLUSION The optical map of M. ap ATCC 19698 clearly indicated the misassembly of the sequenced genome of M. ap K-10. Moreover, it identified 2 new genes in the M. ap K-10 genome. This analysis strongly advocates for the utility of physical mapping protocols to complement genome sequencing projects.
Affiliation(s)
- Chia-wei Wu
- The Laboratory of Bacterial Genomics, Department of Pathobiological Sciences, University of Wisconsin-Madison, WI, USA.
|
23
|
Gallien S, Perrodou E, Carapito C, Deshayes C, Reyrat JM, Van Dorsselaer A, Poch O, Schaeffer C, Lecompte O. Ortho-proteogenomics: multiple proteomes investigation through orthology and a new MS-based protocol. Genome Res 2008; 19:128-35. [PMID: 18955433 DOI: 10.1101/gr.081901.108] [Citation(s) in RCA: 92] [Impact Index Per Article: 5.8] [Indexed: 11/25/2022]
Abstract
The progress in sequencing technologies irrigates biology with an ever-increasing number of genome sequences. In most cases, the gene repertoire is predicted in silico and conceptually translated into proteins. As recently highlighted, the predicted genes exhibit frequent errors, particularly in start codons, with a serious impact on subsequent biological studies. A new "ortho-proteogenomic" approach is presented here for the annotation refinement of multiple genomes at once. It combines comparative genomics with an original proteomic protocol that allows the characterization of both N-terminal and internal peptides in a single experiment. This strategy was applied to the Mycobacterium genus with Mycobacterium smegmatis as the reference, and identified 946 distinct proteins, including 443 characterized N termini. These experimental data allowed the correction of 19% of the characterized start codons, the identification of 29 proteins missed during the annotation process, and the curation, thanks to comparative genomics, of 4328 sequences of 16 other Mycobacterium proteomes.
Affiliation(s)
- Sébastien Gallien
- Laboratoire de Spectrométrie de Masse Bio-Organique, IPHC-DSA, ULP, CNRS, UMR7178, 67 087 Strasbourg, France.
|
24
|
de Souza GA, Målen H, Søfteland T, Saelensminde G, Prasad S, Jonassen I, Wiker HG. High accuracy mass spectrometry analysis as a tool to verify and improve gene annotation using Mycobacterium tuberculosis as an example. BMC Genomics 2008; 9:316. [PMID: 18597682 PMCID: PMC2483986 DOI: 10.1186/1471-2164-9-316] [Citation(s) in RCA: 60] [Impact Index Per Article: 3.8] [Received: 02/25/2008] [Accepted: 07/02/2008] [Indexed: 01/23/2023] Open
Abstract
Background While the genomic annotations of diverse lineages of the Mycobacterium tuberculosis complex are available, divergences between gene prediction methods are still a challenge for unbiased protein dataset generation. M. tuberculosis gene annotation is an example, where the most used datasets from two independent institutions (Sanger Institute and Institute of Genomic Research-TIGR) differ up to 12% in the number of annotated open reading frames, and 46% of the genes contained in both annotations have different start codons. Such differences emphasize the importance of the identification of the sequence of protein products to validate each gene annotation including its sequence coding area. Results With this objective, we submitted a culture filtrate sample from M. tuberculosis to a high-accuracy LTQ-Orbitrap mass spectrometer analysis and applied refined N-terminal prediction to perform comparison of two gene annotations. From a total of 449 proteins identified from the MS data, we validated 35 tryptic peptides that were specific to one of the two datasets, representing 24 different proteins. From those, 5 proteins were only annotated in the Sanger database. In the remaining proteins, the observed differences were due to differences in annotation of transcriptional start sites. Conclusion Our results indicate that, even in a less complex sample likely to represent only 10% of the bacterial proteome, we were still able to detect major differences between different gene annotation approaches. This gives hope that high-throughput proteomics techniques can be used to improve and validate gene annotations, and in particular for verification of high-throughput, automatic gene annotations.
Affiliation(s)
- Gustavo A de Souza
- Section for Microbiology and Immunology, The Gade Institute, University of Bergen, Bergen, Norway.
|
25
|
Mandel MJ, Stabb EV, Ruby EG. Comparative genomics-based investigation of resequencing targets in Vibrio fischeri: focus on point miscalls and artefactual expansions. BMC Genomics 2008; 9:138. [PMID: 18366731 PMCID: PMC2330054 DOI: 10.1186/1471-2164-9-138] [Citation(s) in RCA: 64] [Impact Index Per Article: 4.0] [Received: 12/05/2007] [Accepted: 03/25/2008] [Indexed: 01/19/2023] Open
Abstract
Background Sequence closure often represents the end-point of a genome project, without a system in place for subsequent improvement and refinement. Building on the genome project of Vibrio fischeri ES114, we used a comparative approach to identify and investigate genes that had a high likelihood of sequence error. Results Comparison of the V. fischeri ES114 genome with that of conspecific strain MJ11 identified 82 target loci in ES114 as containing likely errors, and thus of high-priority for resequencing. Analysis of the targets identified 75 loci in which an error had occurred, resulting in the correction of 10,457 base pairs to generate the new ES114 genomic sequence. A majority of the inaccurate loci involved frameshift errors, correction of which fused adjacent ORFs. Although insertions/deletions are thought to be rare in microbial genome assemblies, fourteen of the loci contained extraneous sequence of over 300 bp, likely due to imperfect contig ends that were misassembled in tandem rather than as overlapping segments. Additionally we updated the entire genome annotation with 113 new features including previously uncalled protein-coding genes, regulatory RNA genes and operon leader peptides, and we analyzed the transcriptional apparatus encoded by ES114. Conclusion We demonstrate that errors in microbial genome sequences, thought to largely be confined to point mutations, may also consist of other prevalent large-scale rearrangements such as insertions. Ongoing genome quality control and annotation programs are necessary to accompany technological advancements in data generation. These updates further advance V. fischeri as an important model for understanding intercellular communication and colonization of animal tissue.
Affiliation(s)
- Mark J Mandel
- Department of Medical Microbiology and Immunology, University of Wisconsin School of Medicine and Public Health, 1550 Linden Drive, Madison WI 53706-1521, USA.
|
26
|
Deshayes C, Perrodou E, Euphrasie D, Frapy E, Poch O, Bifani P, Lecompte O, Reyrat JM. Detecting the molecular scars of evolution in the Mycobacterium tuberculosis complex by analyzing interrupted coding sequences. BMC Evol Biol 2008; 8:78. [PMID: 18325090 PMCID: PMC2277376 DOI: 10.1186/1471-2148-8-78] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.1] [Received: 08/23/2007] [Accepted: 03/06/2008] [Indexed: 11/30/2022] Open
Abstract
Background Computer-assisted analyses have shown that all bacterial genomes contain a small percentage of open reading frames with a frameshift or in-frame stop codon. We report here a comparative analysis of these interrupted coding sequences (ICDSs) in six isolates of M. tuberculosis, two of M. bovis and one of M. africanum and question their phenotypic impact and evolutionary significance. Results ICDSs were classified as "common to all strains" or "strain-specific". Common ICDSs are believed to result from mutations acquired before the divergence of the species, whereas strain-specific ICDSs were acquired after this divergence. Comparative analyses of these ICDSs therefore define the molecular signature of a particular strain, phylogenetic lineage or species, which may be useful for inferring phenotypic traits such as virulence and molecular relationships. For instance, in silico analysis shows that the W-Beijing lineage of M. tuberculosis, an emergent family involved in several outbreaks, is readily distinguishable from other phyla by its smaller number of common ICDSs, including at least one known to be associated with virulence. Our observation was confirmed through the sequencing analysis of ICDSs in a panel of 21 clinical M. tuberculosis strains. This analysis further illustrates the divergence of the W-Beijing lineage from other phyla in terms of the number of full-length ORFs not containing a frameshift. We further show that ICDS formation is not associated with the presence of a mutated promoter, and suggest that promoter extinction is not the main cause of pseudogene formation. Conclusion The correlation between ICDSs, function and phenotypes could have important evolutionary implications. This study provides population geneticists with a list of targets, which could undergo selective pressure and thus alter relationships between the various lineages of M. tuberculosis strains and their host.
This approach could be applied to any closely related bacterial strains or species for which several genome sequences are available.
Affiliation(s)
- Caroline Deshayes
- Université Paris Descartes, Faculté de Médecine René Descartes, Paris Cedex 15, F-75730, France.
|
27
|
Diemer H, Elias M, Renault F, Rochu D, Contreras-Martel C, Schaeffer C, Van Dorsselaer A, Chabriere E. Tandem use of X-ray crystallography and mass spectrometry to obtain ab initio the complete and exact amino acids sequence of HPBP, a human 38-kDa apolipoprotein. Proteins 2007; 71:1708-20. [DOI: 10.1002/prot.21866] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.0] [Indexed: 11/06/2022]
|