Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Schrimpe-Rutledge AC, Jones MB, Chauhan S, Purvine SO, Sanford JA, Monroe ME, Brewer HM, Payne SH, Ansong C, Frank BC, Smith RD, Peterson SN, Motin VL, Adkins JN. Comparative omics-driven genome annotation refinement: application across Yersiniae. PLoS One 2012;7:e33903. [PMID: 22479471 PMCID: PMC3313959 DOI: 10.1371/journal.pone.0033903] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2011] [Accepted: 02/19/2012] [Indexed: 02/03/2023] Open

For:	Schrimpe-Rutledge AC, Jones MB, Chauhan S, Purvine SO, Sanford JA, Monroe ME, Brewer HM, Payne SH, Ansong C, Frank BC, Smith RD, Peterson SN, Motin VL, Adkins JN. Comparative omics-driven genome annotation refinement: application across Yersiniae. PLoS One 2012;7:e33903. [PMID: 22479471 PMCID: PMC3313959 DOI: 10.1371/journal.pone.0033903] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2011] [Accepted: 02/19/2012] [Indexed: 02/03/2023] Open

Number

Cited by Other Article(s)

Lê-Bury P, Druart K, Savin C, Lechat P, Mas Fiol G, Matondo M, Bécavin C, Dussurget O, Pizarro-Cerdá J. Yersiniomics, a Multi-Omics Interactive Database for Yersinia Species. Microbiol Spectr 2023;11:e0382622. [PMID: 36847572 PMCID: PMC10100798 DOI: 10.1128/spectrum.03826-22] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2022] [Accepted: 01/26/2023] [Indexed: 03/01/2023] Open

Abstract

The genus Yersinia includes a large variety of nonpathogenic and life-threatening pathogenic bacteria, which cause a broad spectrum of diseases in humans and animals, such as plague, enteritis, Far East scarlet-like fever (FESLF), and enteric redmouth disease. Like most clinically relevant microorganisms, Yersinia spp. are currently subjected to intense multi-omics investigations whose numbers have increased extensively in recent years, generating massive amounts of data useful for diagnostic and therapeutic developments. The lack of a simple and centralized way to exploit these data led us to design Yersiniomics, a web-based platform allowing straightforward analysis of Yersinia omics data. Yersiniomics contains a curated multi-omics database at its core, gathering 200 genomic, 317 transcriptomic, and 62 proteomic data sets for Yersinia species. It integrates genomic, transcriptomic, and proteomic browsers, a genome viewer, and a heatmap viewer to navigate within genomes and experimental conditions. For streamlined access to structural and functional properties, it directly links each gene to GenBank, the Kyoto Encyclopedia of Genes and Genomes (KEGG), UniProt, InterPro, IntAct, and the Search Tool for the Retrieval of Interacting Genes/Proteins (STRING) and each experiment to Gene Expression Omnibus (GEO), the European Nucleotide Archive (ENA), or the Proteomics Identifications Database (PRIDE). Yersiniomics provides a powerful tool for microbiologists to assist with investigations ranging from specific gene studies to systems biology studies. IMPORTANCE The expanding genus Yersinia is composed of multiple nonpathogenic species and a few pathogenic species, including the deadly etiologic agent of plague, Yersinia pestis. In 2 decades, the number of genomic, transcriptomic, and proteomic studies on Yersinia grew massively, delivering a wealth of data. We developed Yersiniomics, an interactive web-based platform, to centralize and analyze omics data sets on Yersinia species. The platform allows user-friendly navigation between genomic data, expression data, and experimental conditions. Yersiniomics will be a valuable tool to microbiologists.

Collapse

Wanichthanarak K, Nookaew I, Pasookhush P, Wongsurawat T, Jenjaroenpun P, Leeratsuwan N, Wattanachaisaereekul S, Visessanguan W, Sirivatanauksorn Y, Nuntasaen N, Kuhakarn C, Reutrakul V, Ajawatanawong P, Khoomrung S. Revisiting chloroplast genomic landscape and annotation towards comparative chloroplast genomes of Rhamnaceae. BMC PLANT BIOLOGY 2023;23:59. [PMID: 36707785 PMCID: PMC9883906 DOI: 10.1186/s12870-023-04074-5] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/12/2022] [Accepted: 01/18/2023] [Indexed: 06/18/2023]

Abstract

BACKGROUND

Massive parallel sequencing technologies have enabled the elucidation of plant phylogenetic relationships from chloroplast genomes at a high pace. These include members of the family Rhamnaceae. The current Rhamnaceae phylogenetic tree is from 13 out of 24 Rhamnaceae chloroplast genomes, and only one chloroplast genome of the genus Ventilago is available. Hence, the phylogenetic relationships in Rhamnaceae remain incomplete, and more representative species are needed.

RESULTS

The complete chloroplast genome of Ventilago harmandiana Pierre was outlined using a hybrid assembly of long- and short-read technologies. The accuracy and validity of the final genome were confirmed with PCR amplifications and investigation of coverage depth. Sanger sequencing was used to correct for differences in lengths and nucleotide bases between inverted repeats because of the homopolymers. The phylogenetic trees reconstructed using prevalent methods for phylogenetic inference were topologically similar. The clustering based on codon usage was congruent with the molecular phylogenetic tree. The groups of genera in each tribe were in accordance with tribal classification based on molecular markers. We resolved the phylogenetic relationships among six Hovenia species, three Rhamnus species, and two Ventilago species. Our reconstructed tree provides the most complete and reliable low-level taxonomy to date for the family Rhamnaceae. Similar to other higher plants, the RNA editing mostly resulted in converting serine to leucine. Besides, most genes were subjected to purifying selection. Annotation anomalies, including indel calling errors, unaligned open reading frames of the same gene, inconsistent prediction of intergenic regions, and misannotated genes, were identified in the published chloroplast genomes used in this study. These could be a result of the usual imperfections in computational tools, and/or existing errors in reference genomes. Importantly, these are points of concern with regards to utilizing published chloroplast genomes for comparative genomic analysis.

CONCLUSIONS

In summary, we successfully demonstrated the use of comprehensive genomic data, including DNA and amino acid sequences, to build a reliable and high-resolution phylogenetic tree for the family Rhamnaceae. Additionally, our study indicates that the revision of genome annotation before comparative genomic analyses is necessary to prevent the propagation of errors and complications in downstream analysis and interpretation.

Collapse

Affiliation(s)

Kwanjeera Wanichthanarak Metabolomics and Systems Biology, Department of Biochemistry, Faculty of Medicine Siriraj Hospital, Mahidol University, Bangkok, 10700, Thailand Siriraj Metabolomics and Phenomics Center, Faculty of Medicine Siriraj Hospital, Mahidol University, Bangkok, 10700, Thailand
Intawat Nookaew Department of Biomedical Informatics, College of Medicine, University of Arkansas for Medical Sciences, Little Rock, AR, 72205, USA
Phongthana Pasookhush Division of Bioinformatics and Data Management for Research, Faculty of Medicine Siriraj Hospital, Mahidol University, Bangkok, 10700, Thailand
Thidathip Wongsurawat Division of Bioinformatics and Data Management for Research, Faculty of Medicine Siriraj Hospital, Mahidol University, Bangkok, 10700, Thailand
Piroon Jenjaroenpun Division of Bioinformatics and Data Management for Research, Faculty of Medicine Siriraj Hospital, Mahidol University, Bangkok, 10700, Thailand
Namkhang Leeratsuwan Department of Biology, Faculty of Science, Mahidol University, Bangkok, 10400, Thailand
Songsak Wattanachaisaereekul School of Food industry, King Mongkut's Institute of Technology Ladkrabang, Bangkok, 10520, Thailand
Wonnop Visessanguan Functional Ingredients and Food Biotechnology Research Unit, National Center for Genetic Engineering and Biotechnology (BIOTEC), Phathumthani, 12120, Thailand
Yongyut Sirivatanauksorn Siriraj Metabolomics and Phenomics Center, Faculty of Medicine Siriraj Hospital, Mahidol University, Bangkok, 10700, Thailand
Narong Nuntasaen Department of Chemistry and Center of Excellence for Innovation in Chemistry (PERCH-CIC), Faculty of Science, Mahidol University, Bangkok, 10400, Thailand Department of National Parks, Wildlife and Plant Conservation, Ministry of Natural Resources and Environment, Bangkok, 10900, Thailand
Chutima Kuhakarn Department of Chemistry and Center of Excellence for Innovation in Chemistry (PERCH-CIC), Faculty of Science, Mahidol University, Bangkok, 10400, Thailand
Vichai Reutrakul Department of Chemistry and Center of Excellence for Innovation in Chemistry (PERCH-CIC), Faculty of Science, Mahidol University, Bangkok, 10400, Thailand
Pravech Ajawatanawong Division of Bioinformatics and Data Management for Research, Faculty of Medicine Siriraj Hospital, Mahidol University, Bangkok, 10700, Thailand.
Sakda Khoomrung Metabolomics and Systems Biology, Department of Biochemistry, Faculty of Medicine Siriraj Hospital, Mahidol University, Bangkok, 10700, Thailand. Siriraj Metabolomics and Phenomics Center, Faculty of Medicine Siriraj Hospital, Mahidol University, Bangkok, 10700, Thailand. Department of Chemistry and Center of Excellence for Innovation in Chemistry (PERCH-CIC), Faculty of Science, Mahidol University, Bangkok, 10400, Thailand.

Collapse

Feng Y, Wang Z, Chien KY, Chen HL, Liang YH, Hua X, Chiu CH. "Pseudo-pseudogenes" in bacterial genomes: Proteogenomics reveals a wide but low protein expression of pseudogenes in Salmonella enterica. Nucleic Acids Res 2022;50:5158-5170. [PMID: 35489061 PMCID: PMC9122581 DOI: 10.1093/nar/gkac302] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2021] [Revised: 04/11/2022] [Accepted: 04/14/2022] [Indexed: 12/03/2022] Open

Belinky F, Ganguly I, Poliakov E, Yurchenko V, Rogozin IB. Analysis of Stop Codons within Prokaryotic Protein-Coding Genes Suggests Frequent Readthrough Events. Int J Mol Sci 2021;22:ijms22041876. [PMID: 33672790 PMCID: PMC7918605 DOI: 10.3390/ijms22041876] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2021] [Revised: 02/05/2021] [Accepted: 02/09/2021] [Indexed: 02/07/2023] Open

Koch L, Poyot T, Schnetterle M, Guillier S, Soulé E, Nolent F, Gorgé O, Neulat-Ripoll F, Valade E, Sebbane F, Biot F. Transcriptomic studies and assessment of Yersinia pestis reference genes in various conditions. Sci Rep 2019;9:2501. [PMID: 30792499 PMCID: PMC6385181 DOI: 10.1038/s41598-019-39072-x] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2018] [Accepted: 12/14/2018] [Indexed: 12/27/2022] Open

Herrera CM, Henderson JC, Crofts AA, Trent MS. Novel coordination of lipopolysaccharide modifications in Vibrio cholerae promotes CAMP resistance. Mol Microbiol 2017;106:582-596. [PMID: 28906060 DOI: 10.1111/mmi.13835] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/10/2017] [Indexed: 01/02/2023]

Merkley ED, Sego LH, Lin A, Leiser OP, Kaiser BLD, Adkins JN, Keim PS, Wagner DM, Kreuzer HW. Protein abundances can distinguish between naturally-occurring and laboratory strains of Yersinia pestis, the causative agent of plague. PLoS One 2017;12:e0183478. [PMID: 28854255 PMCID: PMC5576697 DOI: 10.1371/journal.pone.0183478] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2016] [Accepted: 08/05/2017] [Indexed: 11/19/2022] Open

Abstract

The rapid pace of bacterial evolution enables organisms to adapt to the laboratory environment with repeated passage and thus diverge from naturally-occurring environmental ("wild") strains. Distinguishing wild and laboratory strains is clearly important for biodefense and bioforensics; however, DNA sequence data alone has thus far not provided a clear signature, perhaps due to lack of understanding of how diverse genome changes lead to convergent phenotypes, difficulty in detecting certain types of mutations, or perhaps because some adaptive modifications are epigenetic. Monitoring protein abundance, a molecular measure of phenotype, can overcome some of these difficulties. We have assembled a collection of Yersinia pestis proteomics datasets from our own published and unpublished work, and from a proteomics data archive, and demonstrated that protein abundance data can clearly distinguish laboratory-adapted from wild. We developed a lasso logistic regression classifier that uses binary (presence/absence) or quantitative protein abundance measures to predict whether a sample is laboratory-adapted or wild that proved to be ~98% accurate, as judged by replicated 10-fold cross-validation. Protein features selected by the classifier accord well with our previous study of laboratory adaptation in Y. pestis. The input data was derived from a variety of unrelated experiments and contained significant confounding variables. We show that the classifier is robust with respect to these variables. The methodology is able to discover signatures for laboratory facility and culture medium that are largely independent of the signature of laboratory adaptation. Going beyond our previous laboratory evolution study, this work suggests that proteomic differences between laboratory-adapted and wild Y. pestis are general, potentially pointing to a process that could apply to other species as well. Additionally, we show that proteomics datasets (even archived data collected for different purposes) contain the information necessary to distinguish wild and laboratory samples. This work has clear applications in biomarker detection as well as biodefense.

Collapse

Mao Y, Yang X, Liu Y, Yan Y, Du Z, Han Y, Song Y, Zhou L, Cui Y, Yang R. Reannotation of Yersinia pestis Strain 91001 Based on Omics Data. Am J Trop Med Hyg 2016;95:562-70. [PMID: 27382076 DOI: 10.4269/ajtmh.16-0215] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2016] [Accepted: 05/17/2016] [Indexed: 12/16/2022] Open

Alves G, Yu YK. Confidence assignment for mass spectrometry based peptide identifications via the extreme value distribution. Bioinformatics 2016;32:2642-9. [PMID: 27153659 DOI: 10.1093/bioinformatics/btw225] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2015] [Accepted: 04/16/2016] [Indexed: 11/14/2022] Open

Alves G, Wang G, Ogurtsov AY, Drake SK, Gucek M, Suffredini AF, Sacks DB, Yu YK. Identification of Microorganisms by High Resolution Tandem Mass Spectrometry with Accurate Statistical Significance. JOURNAL OF THE AMERICAN SOCIETY FOR MASS SPECTROMETRY 2016;27:194-210. [PMID: 26510657 PMCID: PMC4723618 DOI: 10.1007/s13361-015-1271-2] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/29/2015] [Revised: 09/04/2015] [Accepted: 09/05/2015] [Indexed: 05/13/2023]

Locard-Paulet M, Pible O, Gonzalez de Peredo A, Alpha-Bazin B, Almunia C, Burlet-Schiltz O, Armengaud J. Clinical implications of recent advances in proteogenomics. Expert Rev Proteomics 2016;13:185-99. [DOI: 10.1586/14789450.2016.1132169] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023]

Yang R, Motin VL. Yersinia pestis in the Age of Big Data. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2016;918:257-272. [PMID: 27722866 DOI: 10.1007/978-94-024-0890-4_9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/28/2023]

Kumar D, Mondal AK, Kutum R, Dash D. Proteogenomics of rare taxonomic phyla: A prospective treasure trove of protein coding genes. Proteomics 2015;16:226-40. [PMID: 26773550 DOI: 10.1002/pmic.201500263] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2015] [Revised: 09/18/2015] [Accepted: 09/28/2015] [Indexed: 01/04/2023]

Zimbler DL, Schroeder JA, Eddy JL, Lathem WW. Early emergence of Yersinia pestis as a severe respiratory pathogen. Nat Commun 2015;6:7487. [PMID: 26123398 PMCID: PMC4491175 DOI: 10.1038/ncomms8487] [Citation(s) in RCA: 57] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2015] [Accepted: 05/12/2015] [Indexed: 11/09/2022] Open

Kucharova V, Wiker HG. Proteogenomics in microbiology: taking the right turn at the junction of genomics and proteomics. Proteomics 2014;14:2360-675. [PMID: 25263021 DOI: 10.1002/pmic.201400168] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2014] [Revised: 08/18/2014] [Accepted: 09/23/2014] [Indexed: 12/14/2022]

Schellenberg JJ, Verbeke TJ, McQueen P, Krokhin OV, Zhang X, Alvare G, Fristensky B, Thallinger GG, Henrissat B, Wilkins JA, Levin DB, Sparling R. Enhanced whole genome sequence and annotation of Clostridium stercorarium DSM8532T using RNA-seq transcriptomics and high-throughput proteomics. BMC Genomics 2014;15:567. [PMID: 24998381 PMCID: PMC4102724 DOI: 10.1186/1471-2164-15-567] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2013] [Accepted: 06/26/2014] [Indexed: 01/04/2023] Open

Abstract

BACKGROUND

Growing interest in cellulolytic clostridia with potential for consolidated biofuels production is mitigated by low conversion of raw substrates to desired end products. Strategies to improve conversion are likely to benefit from emerging techniques to define molecular systems biology of these organisms. Clostridium stercorarium DSM8532T is an anaerobic thermophile with demonstrated high ethanol production on cellulose and hemicellulose. Although several lignocellulolytic enzymes in this organism have been well-characterized, details concerning carbohydrate transporters and central metabolism have not been described. Therefore, the goal of this study is to define an improved whole genome sequence (WGS) for this organism using in-depth molecular profiling by RNA-seq transcriptomics and tandem mass spectrometry-based proteomics.

RESULTS

A paired-end Roche/454 WGS assembly was closed through application of an in silico algorithm designed to resolve repetitive sequence regions, resulting in a circular replicon with one gap and a region of 2 kilobases with 10 ambiguous bases. RNA-seq transcriptomics resulted in nearly complete coverage of the genome, identifying errors in homopolymer length attributable to 454 sequencing. Peptide sequences resulting from high-throughput tandem mass spectrometry of trypsin-digested protein extracts were mapped to 1,755 annotated proteins (68% of all protein-coding regions). Proteogenomic analysis confirmed the quality of annotation and improvement pipelines, identifying a missing gene and an alternative reading frame. Peptide coverage of genes hypothetically involved in substrate hydrolysis, transport and utilization confirmed multiple pathways for glycolysis, pyruvate conversion and recycling of intermediates. No sequences homologous to transaldolase, a central enzyme in the pentose phosphate pathway, were observed by any method, despite demonstrated growth of this organism on xylose and xylan hemicellulose.

CONCLUSIONS

Complementary omics techniques confirm the quality of genome sequence assembly, annotation and error-reporting. Nearly complete genome coverage by RNA-seq likely indicates background DNA in RNA extracts, however these preps resulted in WGS enhancement and transcriptome profiling in a single Illumina run. No detection of transaldolase by any method despite xylose utilization by this organism indicates an alternative pathway for sedoheptulose-7-phosphate degradation. This report combines next-generation omics techniques to elucidate previously undefined features of substrate transport and central metabolism for this organism and its potential for consolidated biofuels production from lignocellulose.

Collapse

Aryal UK, Callister SJ, McMahon BH, McCue LA, Brown J, Stöckel J, Liberton M, Mishra S, Zhang X, Nicora CD, Angel TE, Koppenaal DW, Smith RD, Pakrasi HB, Sherman LA. Proteomic Profiles of Five Strains of Oxygenic Photosynthetic Cyanobacteria of the Genus Cyanothece. J Proteome Res 2014;13:3262-76. [DOI: 10.1021/pr5000889] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]

Hilker R, Stadermann KB, Doppmeier D, Kalinowski J, Stoye J, Straube J, Winnebald J, Goesmann A. ReadXplorer--visualization and analysis of mapped sequences. Bioinformatics 2014;30:2247-54. [PMID: 24790157 PMCID: PMC4217279 DOI: 10.1093/bioinformatics/btu205] [Citation(s) in RCA: 92] [Impact Index Per Article: 9.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022] Open

Affiliation(s)

Rolf Hilker Institute of Medical Microbiology, Justus-Liebig-University, 35392 Giessen, Germany, Faculty of Biology, Institute for Bioinformatics, Center for Biotechnology, Computational Genomics, Center for Biotechnology, Technology Platform Genomics, Center for Biotechnology, Genome Informatics, Faculty of Technology, Bielefeld University, 33615 Bielefeld, Germany and Bioinformatics and Systems Biology, Faculty of Biology and Chemistry, Justus-Liebig-University, 35392 Giessen, Germany
Kai Bernd Stadermann Institute of Medical Microbiology, Justus-Liebig-University, 35392 Giessen, Germany, Faculty of Biology, Institute for Bioinformatics, Center for Biotechnology, Computational Genomics, Center for Biotechnology, Technology Platform Genomics, Center for Biotechnology, Genome Informatics, Faculty of Technology, Bielefeld University, 33615 Bielefeld, Germany and Bioinformatics and Systems Biology, Faculty of Biology and Chemistry, Justus-Liebig-University, 35392 Giessen, GermanyInstitute of Medical Microbiology, Justus-Liebig-University, 35392 Giessen, Germany, Faculty of Biology, Institute for Bioinformatics, Center for Biotechnology, Computational Genomics, Center for Biotechnology, Technology Platform Genomics, Center for Biotechnology, Genome Informatics, Faculty of Technology, Bielefeld University, 33615 Bielefeld, Germany and Bioinformatics and Systems Biology, Faculty of Biology and Chemistry, Justus-Liebig-University, 35392 Giessen, Germany
Daniel Doppmeier Institute of Medical Microbiology, Justus-Liebig-University, 35392 Giessen, Germany, Faculty of Biology, Institute for Bioinformatics, Center for Biotechnology, Computational Genomics, Center for Biotechnology, Technology Platform Genomics, Center for Biotechnology, Genome Informatics, Faculty of Technology, Bielefeld University, 33615 Bielefeld, Germany and Bioinformatics and Systems Biology, Faculty of Biology and Chemistry, Justus-Liebig-University, 35392 Giessen, Germany
Jörn Kalinowski Institute of Medical Microbiology, Justus-Liebig-University, 35392 Giessen, Germany, Faculty of Biology, Institute for Bioinformatics, Center for Biotechnology, Computational Genomics, Center for Biotechnology, Technology Platform Genomics, Center for Biotechnology, Genome Informatics, Faculty of Technology, Bielefeld University, 33615 Bielefeld, Germany and Bioinformatics and Systems Biology, Faculty of Biology and Chemistry, Justus-Liebig-University, 35392 Giessen, Germany
Jens Stoye Institute of Medical Microbiology, Justus-Liebig-University, 35392 Giessen, Germany, Faculty of Biology, Institute for Bioinformatics, Center for Biotechnology, Computational Genomics, Center for Biotechnology, Technology Platform Genomics, Center for Biotechnology, Genome Informatics, Faculty of Technology, Bielefeld University, 33615 Bielefeld, Germany and Bioinformatics and Systems Biology, Faculty of Biology and Chemistry, Justus-Liebig-University, 35392 Giessen, GermanyInstitute of Medical Microbiology, Justus-Liebig-University, 35392 Giessen, Germany, Faculty of Biology, Institute for Bioinformatics, Center for Biotechnology, Computational Genomics, Center for Biotechnology, Technology Platform Genomics, Center for Biotechnology, Genome Informatics, Faculty of Technology, Bielefeld University, 33615 Bielefeld, Germany and Bioinformatics and Systems Biology, Faculty of Biology and Chemistry, Justus-Liebig-University, 35392 Giessen, Germany
Jasmin Straube Institute of Medical Microbiology, Justus-Liebig-University, 35392 Giessen, Germany, Faculty of Biology, Institute for Bioinformatics, Center for Biotechnology, Computational Genomics, Center for Biotechnology, Technology Platform Genomics, Center for Biotechnology, Genome Informatics, Faculty of Technology, Bielefeld University, 33615 Bielefeld, Germany and Bioinformatics and Systems Biology, Faculty of Biology and Chemistry, Justus-Liebig-University, 35392 Giessen, Germany
Jörn Winnebald Institute of Medical Microbiology, Justus-Liebig-University, 35392 Giessen, Germany, Faculty of Biology, Institute for Bioinformatics, Center for Biotechnology, Computational Genomics, Center for Biotechnology, Technology Platform Genomics, Center for Biotechnology, Genome Informatics, Faculty of Technology, Bielefeld University, 33615 Bielefeld, Germany and Bioinformatics and Systems Biology, Faculty of Biology and Chemistry, Justus-Liebig-University, 35392 Giessen, Germany
Alexander Goesmann Institute of Medical Microbiology, Justus-Liebig-University, 35392 Giessen, Germany, Faculty of Biology, Institute for Bioinformatics, Center for Biotechnology, Computational Genomics, Center for Biotechnology, Technology Platform Genomics, Center for Biotechnology, Genome Informatics, Faculty of Technology, Bielefeld University, 33615 Bielefeld, Germany and Bioinformatics and Systems Biology, Faculty of Biology and Chemistry, Justus-Liebig-University, 35392 Giessen, Germany

Collapse

Ucciferri N, Rocchiccioli S. Proteomics techniques for the detection of translated pseudogenes. Methods Mol Biol 2014;1167:187-95. [PMID: 24823778 DOI: 10.1007/978-1-4939-0835-6_12] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]

Carlier AL, Omasits U, Ahrens CH, Eberl L. Proteomics analysis of Psychotria leaf nodule symbiosis: improved genome annotation and metabolic predictions. MOLECULAR PLANT-MICROBE INTERACTIONS : MPMI 2013;26:1325-1333. [PMID: 23902262 DOI: 10.1094/mpmi-05-13-0152-r] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/02/2023]

Zickmann F, Lindner MS, Renard BY. GIIRA--RNA-Seq driven gene finding incorporating ambiguous reads. ACTA ACUST UNITED AC 2013;30:606-13. [PMID: 24123675 DOI: 10.1093/bioinformatics/btt577] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]

Armengaud J, Hartmann EM, Bland C. Proteogenomics for environmental microbiology. Proteomics 2013;13:2731-42. [PMID: 23636904 DOI: 10.1002/pmic.201200576] [Citation(s) in RCA: 53] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2012] [Revised: 03/06/2013] [Accepted: 04/09/2013] [Indexed: 11/09/2022]

Abstract

Proteogenomics sensu stricto refers to the use of proteomic data to refine the annotation of genomes from model organisms. Because of the limitations of automatic annotation pipelines, a relatively high number of errors occur during the structural annotation of genes coding for proteins. Whether putative orphan sequences or short genes encoding low-molecular-weight proteins really exist is still frequently a mystery. Whether start codons are well defined is also an open debate. These problems are exacerbated for genomes of microorganisms belonging to poorly documented genera, as related sequences are not always available for homology-guided annotation. The functional annotation of a significant proportion of genes is also another well-known issue when annotating environmental microorganisms. High-throughput shotgun proteomics has recently greatly evolved, allowing the exploration of the proteome from any microorganism at an unprecedented depth. The structural and functional annotation process may be usefully complemented with experimental data. Indeed, proteogenomic mapping has been successfully performed for a wide variety of organisms. Specific approaches devoted to systematically establishing the N-termini of a large set of proteins are being developed. N-terminomics is giving rise to datasets of experimentally proven translational start codons as well as validated peptide signals for secreted proteins. By extension, combining genomic and proteomic data is becoming routine in many research projects. The proteomic analysis of organisms with unfinished genome sequences, the so-called composite proteomics, and the search for microbial biomarkers by bottom-up and top-down combined approaches are some examples of proteogenomic-flavored studies. They illustrate the advent of a new era of environmental microbiology where proteomics and genomics are intimately integrated to answer key biological questions.

Collapse

Bertaccini D, Vaca S, Carapito C, Arsène-Ploetze F, Van Dorsselaer A, Schaeffer-Reiss C. An Improved Stable Isotope N-Terminal Labeling Approach with Light/Heavy TMPP To Automate Proteogenomics Data Validation: dN-TOP. J Proteome Res 2013;12:3063-70. [DOI: 10.1021/pr4002993] [Citation(s) in RCA: 40] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]

The genome organization of Thermotoga maritima reflects its lifestyle. PLoS Genet 2013;9:e1003485. [PMID: 23637642 PMCID: PMC3636130 DOI: 10.1371/journal.pgen.1003485] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2012] [Accepted: 03/13/2013] [Indexed: 01/01/2023] Open

Abstract

The generation of genome-scale data is becoming more routine, yet the subsequent analysis of omics data remains a significant challenge. Here, an approach that integrates multiple omics datasets with bioinformatics tools was developed that produces a detailed annotation of several microbial genomic features. This methodology was used to characterize the genome of Thermotoga maritima—a phylogenetically deep-branching, hyperthermophilic bacterium. Experimental data were generated for whole-genome resequencing, transcription start site (TSS) determination, transcriptome profiling, and proteome profiling. These datasets, analyzed in combination with bioinformatics tools, served as a basis for the improvement of gene annotation, the elucidation of transcription units (TUs), the identification of putative non-coding RNAs (ncRNAs), and the determination of promoters and ribosome binding sites. This revealed many distinctive properties of the T. maritima genome organization relative to other bacteria. This genome has a high number of genes per TU (3.3), a paucity of putative ncRNAs (12), and few TUs with multiple TSSs (3.7%). Quantitative analysis of promoters and ribosome binding sites showed increased sequence conservation relative to other bacteria. The 5′UTRs follow an atypical bimodal length distribution comprised of “Short” 5′UTRs (11–17 nt) and “Common” 5′UTRs (26–32 nt). Transcriptional regulation is limited by a lack of intergenic space for the majority of TUs. Lastly, a high fraction of annotated genes are expressed independent of growth state and a linear correlation of mRNA/protein is observed (Pearson r = 0.63, p<2.2×10⁻¹⁶ t-test). These distinctive properties are hypothesized to be a reflection of this organism's hyperthermophilic lifestyle and could yield novel insights into the evolutionary trajectory of microbial life on earth.

Genomic studies have greatly benefited from the advent of high-throughput technologies and bioinformatics tools. Here, a methodology integrating genome-scale data and bioinformatics tools is developed to characterize the genome organization of the hyperthermophilic, phylogenetically deep-branching bacterium Thermotoga maritima. This approach elucidates several features of the genome organization and enables comparative analysis of these features across diverse taxa. Our results suggest that the genome of T. maritima is reflective of its hyperthermophilic lifestyle. Ultimately, constraints imposed on the genome have negative impacts on regulatory complexity and phenotypic diversity. Investigating the genome organization of Thermotogae species will help resolve various causal factors contributing to the genome organization such as phylogeny and environment. Applying a similar analysis of the genome organization to numerous taxa will likely provide insights into microbial evolution.

Collapse

Ansong C, Deatherage BL, Hyduke D, Schmidt B, McDermott JE, Jones MB, Chauhan S, Charusanti P, Kim YM, Nakayasu ES, Li J, Kidwai A, Niemann G, Brown RN, Metz TO, McAteer K, Heffron F, Peterson SN, Motin V, Palsson BO, Smith RD, Adkins JN. Studying Salmonellae and Yersiniae host-pathogen interactions using integrated 'omics and modeling. Curr Top Microbiol Immunol 2013;363:21-41. [PMID: 22886542 DOI: 10.1007/82_2012_247] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/28/2022]

Yang R, Du Z, Han Y, Zhou L, Song Y, Zhou D, Cui Y. Omics strategies for revealing Yersinia pestis virulence. Front Cell Infect Microbiol 2012;2:157. [PMID: 23248778 PMCID: PMC3521224 DOI: 10.3389/fcimb.2012.00157] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2012] [Accepted: 11/27/2012] [Indexed: 01/12/2023] Open

Ansong C, Schrimpe-Rutledge AC, Mitchell HD, Chauhan S, Jones MB, Kim YM, McAteer K, Deatherage Kaiser BL, Dubois JL, Brewer HM, Frank BC, McDermott JE, Metz TO, Peterson SN, Smith RD, Motin VL, Adkins JN. A multi-omic systems approach to elucidating Yersinia virulence mechanisms. MOLECULAR BIOSYSTEMS 2012;9:44-54. [PMID: 23147219 DOI: 10.1039/c2mb25287b] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]

Peterson ES, McCue LA, Schrimpe-Rutledge AC, Jensen JL, Walker H, Kobold MA, Webb SR, Payne SH, Ansong C, Adkins JN, Cannon WR, Webb-Robertson BJM. VESPA: software to facilitate genomic annotation of prokaryotic organisms through integration of proteomic and transcriptomic data. BMC Genomics 2012;13:131. [PMID: 22480257 PMCID: PMC3364912 DOI: 10.1186/1471-2164-13-131] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/27/2011] [Accepted: 04/05/2012] [Indexed: 11/10/2022] Open

Abstract

Background

The procedural aspects of genome sequencing and assembly have become relatively inexpensive, yet the full, accurate structural annotation of these genomes remains a challenge. Next-generation sequencing transcriptomics (RNA-Seq), global microarrays, and tandem mass spectrometry (MS/MS)-based proteomics have demonstrated immense value to genome curators as individual sources of information, however, integrating these data types to validate and improve structural annotation remains a major challenge. Current visual and statistical analytic tools are focused on a single data type, or existing software tools are retrofitted to analyze new data forms. We present Visual Exploration and Statistics to Promote Annotation (VESPA) is a new interactive visual analysis software tool focused on assisting scientists with the annotation of prokaryotic genomes though the integration of proteomics and transcriptomics data with current genome location coordinates.

Results

VESPA is a desktop Java™ application that integrates high-throughput proteomics data (peptide-centric) and transcriptomics (probe or RNA-Seq) data into a genomic context, all of which can be visualized at three levels of genomic resolution. Data is interrogated via searches linked to the genome visualizations to find regions with high likelihood of mis-annotation. Search results are linked to exports for further validation outside of VESPA or potential coding-regions can be analyzed concurrently with the software through interaction with BLAST. VESPA is demonstrated on two use cases (Yersinia pestis Pestoides F and Synechococcus sp. PCC 7002) to demonstrate the rapid manner in which mis-annotations can be found and explored in VESPA using either proteomics data alone, or in combination with transcriptomic data.

Conclusions

VESPA is an interactive visual analytics tool that integrates high-throughput data into a genomic context to facilitate the discovery of structural mis-annotations in prokaryotic genomes. Data is evaluated via visual analysis across multiple levels of genomic resolution, linked searches and interaction with existing bioinformatics tools. We highlight the novel functionality of VESPA and core programming requirements for visualization of these large heterogeneous datasets for a client-side application. The software is freely available at https://www.biopilot.org/docs/Software/Vespa.php.

Collapse