1
|
Kotimoole CN, Ramya VK, Kaur P, Reiling N, Shandil RK, Narayanan S, Flo TH, Prasad TSK. Discovery of Species-Specific Proteotypic Peptides To Establish a Spectral Library Platform for Identification of Nontuberculosis Mycobacteria from Mass Spectrometry-Based Proteomics. J Proteome Res 2024; 23:1102-1117. [PMID: 38358903 DOI: 10.1021/acs.jproteome.3c00850] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/17/2024]
Abstract
Nontuberculous mycobacteria are opportunistic bacteria pulmonary and extra-pulmonary infections in humans that closely resemble Mycobacterium tuberculosis. Although genome sequencing strategies helped determine NTMs, a common assay for the detection of coinfection by multiple NTMs with M. tuberculosis in the primary attempt of diagnosis is still elusive. Such a lack of efficiency leads to delayed therapy, an inappropriate choice of drugs, drug resistance, disease complications, morbidity, and mortality. Although a high-resolution LC-MS/MS-based multiprotein panel assay can be developed due to its specificity and sensitivity, it needs a library of species-specific peptides as a platform. Toward this, we performed an analysis of proteomes of 9 NTM species with more than 20 million peptide spectrum matches gathered from 26 proteome data sets. Our metaproteomic analyses determined 48,172 species-specific proteotypic peptides across 9 NTMs. Notably, M. smegmatis (26,008), M. abscessus (12,442), M. vaccae (6487), M. fortuitum (1623), M. avium subsp. paratuberculosis (844), M. avium subsp. hominissuis (580), and M. marinum (112) displayed >100 species-specific proteotypic peptides. Finally, these peptides and corresponding spectra have been compiled into a spectral library, FASTA, and JSON formats for future reference and validation in clinical cohorts by the biomedical community for further translation.
Collapse
Affiliation(s)
- Chinmaya Narayana Kotimoole
- Center for Systems Biology and Molecular Medicine, Yenepoya Research Centre, Yenepoya (Deemed to be University), Mangalore 575018, India
| | - Vadageri Krishnamurthy Ramya
- Foundation for Neglected Disease Research, 20A, KIADB Industrial Area, Veerapura Village, Doddaballapur, Bengaluru 561203, India
| | - Parvinder Kaur
- Foundation for Neglected Disease Research, 20A, KIADB Industrial Area, Veerapura Village, Doddaballapur, Bengaluru 561203, India
| | - Norbert Reiling
- Microbial Interface Biology, Research Center Borstel, Leibniz Lung Center, Parkallee 22, D-23845 Borstel, Germany
- German Center for Infection Research (DZIF), Site Hamburg-Lübeck-Borstel-Riems, 23845 Borstel, Germany
| | - Radha Krishan Shandil
- Foundation for Neglected Disease Research, 20A, KIADB Industrial Area, Veerapura Village, Doddaballapur, Bengaluru 561203, India
| | - Shridhar Narayanan
- Foundation for Neglected Disease Research, 20A, KIADB Industrial Area, Veerapura Village, Doddaballapur, Bengaluru 561203, India
| | - Trude Helen Flo
- Centre of Molecular Inflammation Research, Department of Clinical and Molecular Medicine Faculty of Medicine and Health Sciences, Norwegian University of Science and Technology, Kunnskapssenteret, Øya 424.04.035, Norway
| | | |
Collapse
|
2
|
Rosyada ZNA, Pardede BP, Kaiin EM, Gunawan M, Maulana T, Said S, Tumbelaka LITA, Solihin DD, Ulum MF, Purwantara B. A proteomic approach to identifying spermatozoa proteins in Indonesian native Madura bulls. Front Vet Sci 2023; 10:1287676. [PMID: 38111731 PMCID: PMC10725959 DOI: 10.3389/fvets.2023.1287676] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2023] [Accepted: 11/06/2023] [Indexed: 12/20/2023] Open
Abstract
Proteins assist sperm mature, transit the female reproductive tract, and recognise sperm oocytes. Indigenous Indonesian bulls, Madura bulls, have not been studied for reproductive proteomics. As local Indonesian beef livestock, Madura cattle assist in achieving food security; hence, their number must be improved. Thus, the identification of molecular proteomics-based bull fertility biomarkers is needed. This study aimed to characterise the sperm fertility function of the superior Madura bull (Bos indicus × Bos Javanicus) spermatozoa proteome. Frozen semen from eight Madura superior bulls (Bos indicus × Bos javanicus) aged 4-8 years was obtained from the artificial insemination centre (AIC) in Singosari and Lembang. Madura superior bulls are those that have passed the bull breeding soundness evaluation. Frozen sperm were thawed and centrifuged at 3000 × g for 30 min. Proteins in sperm were characterised through proteomic analysis using liquid chromatography-tandem mass spectrometry (LC-MS/MS). The resulting gene symbols for each protein were then subjected to bioinformatics tools, including UniProt, DAVID, and STRING databases. Regarding sperm fertility, the analysis revealed that 15 proteins were identified in the sperm of Madura bulls. Amongst the identified proteins, the superior Madura bull sperm contained several motilities, energy-related proteins, and chaperone proteins. A substantial portion of characterised proteins are linked to metabolic pathways and the tricarboxylic acid (TCA) cycle, contributing to sperm energy production. In conclusion, the first in-depth proteome identification of sperm related to sperm quality and bull fertility of a unique indigenous Madura breed of Indonesia was performed using the LC-MS/MS proteomic method. These findings may serve as a reference point for further studies related to the functions of bovine sperm and biomarkers of fertility and sperm quality.
Collapse
Affiliation(s)
- Zulfi Nur Amrina Rosyada
- Division of Reproduction and Obstetrics, School of Veterinary Medicine and Biomedical Sciences, IPB University, Bogor, Indonesia
- Research Center for Applied Zoology, National Research and Innovation Agency (BRIN), Bogor, Indonesia
- Division of Veterinary Anatomy, Faculty of Veterinary Medicine, Universitas Airlangga, Surabaya, Indonesia
| | - Berlin Pandapotan Pardede
- Division of Reproduction and Obstetrics, School of Veterinary Medicine and Biomedical Sciences, IPB University, Bogor, Indonesia
- Research Center for Applied Zoology, National Research and Innovation Agency (BRIN), Bogor, Indonesia
| | - Ekayanti Mulyawati Kaiin
- Research Center for Applied Zoology, National Research and Innovation Agency (BRIN), Bogor, Indonesia
| | - Muhammad Gunawan
- Research Center for Applied Zoology, National Research and Innovation Agency (BRIN), Bogor, Indonesia
| | - Tulus Maulana
- Research Center for Applied Zoology, National Research and Innovation Agency (BRIN), Bogor, Indonesia
| | - Syahruddin Said
- Research Center for Applied Zoology, National Research and Innovation Agency (BRIN), Bogor, Indonesia
| | - Ligaya I. T. A Tumbelaka
- Division of Reproduction and Obstetrics, School of Veterinary Medicine and Biomedical Sciences, IPB University, Bogor, Indonesia
| | | | - Mokhamad Fakhrul Ulum
- Division of Reproduction and Obstetrics, School of Veterinary Medicine and Biomedical Sciences, IPB University, Bogor, Indonesia
| | - Bambang Purwantara
- Division of Reproduction and Obstetrics, School of Veterinary Medicine and Biomedical Sciences, IPB University, Bogor, Indonesia
| |
Collapse
|
3
|
Kotimoole C, Antil N, Kasaragod S, Behera S, Arvind A, Reiling N, Flo T, Prasad T. Development of a spectral library for the discovery of altered genomic events in Mycobacterium avium associated with virulence using mass spectrometry-based proteogenomic analysis. Mol Cell Proteomics 2023; 22:100533. [PMID: 36948415 PMCID: PMC10149365 DOI: 10.1016/j.mcpro.2023.100533] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2022] [Revised: 02/02/2023] [Accepted: 03/16/2023] [Indexed: 03/24/2023] Open
Abstract
Mycobacterium avium is one of the prominent disease-causing bacteria in humans. It causes lymphadenitis, chronic and extrapulmonary, and disseminated infections in adults, children, and immunocompromised patients. M. avium has ∼4,500 predicted protein-coding regions on average, which can help discover several variants at the proteome level. Many of them are potentially associated with virulence; thus, identifying such proteins can be a helpful feature in developing panel-based theranostics. In line with such a long-term goal, we carried out an in-depth proteomic analysis of M. avium with both data-dependent and data-independent acquisition methods. Further, a set of proteogenomic investigations were carried out using i) a protein database for Mycobacterium tuberculosis, ii) a M. avium genome six-frame translated database, and iii) a variant protein database of M. avium. A search of mass spectrometry data against M. avium protein database resulted in identifying 2,954 proteins. Further, proteogenomic analyses aided in identifying 1,301 novel peptide sequences and correcting translation start sites for 15 proteins. Ultimately, we created a spectral library of M. avium proteins, including novel genome search-specific peptides and variant peptides detected in this study. We validated the spectral library by a data-independent acquisition of the M. avium proteome. Thus, we present an M. avium spectral library of 29,033 peptide precursors supported by 0.4 million fragment ions for further use by the biomedical community.
Collapse
Affiliation(s)
- ChinmayaNarayana Kotimoole
- Center for Systems Biology and Molecular Medicine, Yenepoya Research Centre, Yenepoya (Deemed to be University), Mangalore, 575018, India
| | - Neelam Antil
- Institute of Bioinformatics, International Technology Park, Bangalore, India
| | - Sandeep Kasaragod
- Center for Systems Biology and Molecular Medicine, Yenepoya Research Centre, Yenepoya (Deemed to be University), Mangalore, 575018, India
| | - SantoshKumar Behera
- Center for Systems Biology and Molecular Medicine, Yenepoya Research Centre, Yenepoya (Deemed to be University), Mangalore, 575018, India
| | - Anjana Arvind
- Center for Systems Biology and Molecular Medicine, Yenepoya Research Centre, Yenepoya (Deemed to be University), Mangalore, 575018, India
| | - Norbert Reiling
- Microbial Interface Biology, Research Center Borstel, Leibniz Lung Center, Parkallee 22, D-23845 Borstel, Germany; German Center for Infection Research (DZIF), Site Hamburg-Lübeck-Borstel-Riems, 23845 Borstel, Germany
| | - TrudeHelen Flo
- Centre of Molecular Inflammation Research, Department of Clinical and Molecular Medicine Faculty of Medicine and Health Sciences, Norwegian University of Science and Technology, Kunnskapssenteret, 424.04.035, Øya, Norway
| | | |
Collapse
|
4
|
Rex DB, Patil AH, Modi PK, Kandiyil MK, Kasaragod S, Pinto SM, Tanneru N, Sijwali PS, Prasad TSK. Dissecting Plasmodium yoelii Pathobiology: Proteomic Approaches for Decoding Novel Translational and Post-Translational Modifications. ACS OMEGA 2022; 7:8246-8257. [PMID: 35309442 PMCID: PMC8928344 DOI: 10.1021/acsomega.1c03892] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/01/2021] [Accepted: 02/21/2022] [Indexed: 06/14/2023]
Abstract
Malaria is a vector-borne disease. It is caused by Plasmodium parasites. Plasmodium yoelii is a rodent model parasite, primarily used for studying parasite development in liver cells and vectors. To better understand parasite biology, we carried out a high-throughput-based proteomic analysis of P. yoelii. From the same mass spectrometry (MS)/MS data set, we also captured several post-translational modified peptides by following a bioinformatics analysis without any prior enrichment. Further, we carried out a proteogenomic analysis, which resulted in improvements to some of the existing gene models along with the identification of several novel genes. Analysis of proteome and post-translational modifications (PTMs) together resulted in the identification of 3124 proteins. The identified PTMs were found to be enriched in mitochondrial metabolic pathways. Subsequent bioinformatics analysis provided an insight into proteins associated with metabolic regulatory mechanisms. Among these, the tricarboxylic acid (TCA) cycle and the isoprenoid synthesis pathway are found to be essential for parasite survival and drug resistance. The proteogenomic analysis discovered 43 novel protein-coding genes. The availability of an in-depth proteomic landscape of a malaria pathogen model will likely facilitate further molecular-level investigations on pre-erythrocytic stages of malaria.
Collapse
Affiliation(s)
- Devasahayam
Arokia Balaya Rex
- Center
for Systems Biology and Molecular Medicine, Yenepoya Research Centre, Yenepoya (Deemed to be University), Mangalore 575018, India
| | - Arun H. Patil
- Center
for Systems Biology and Molecular Medicine, Yenepoya Research Centre, Yenepoya (Deemed to be University), Mangalore 575018, India
| | - Prashant Kumar Modi
- Center
for Systems Biology and Molecular Medicine, Yenepoya Research Centre, Yenepoya (Deemed to be University), Mangalore 575018, India
| | - Mrudula Kinarulla Kandiyil
- Center
for Systems Biology and Molecular Medicine, Yenepoya Research Centre, Yenepoya (Deemed to be University), Mangalore 575018, India
| | - Sandeep Kasaragod
- Center
for Systems Biology and Molecular Medicine, Yenepoya Research Centre, Yenepoya (Deemed to be University), Mangalore 575018, India
| | - Sneha M. Pinto
- Center
for Systems Biology and Molecular Medicine, Yenepoya Research Centre, Yenepoya (Deemed to be University), Mangalore 575018, India
| | - Nandita Tanneru
- CSIR-Centre
for Cellular and Molecular Biology, Hyderabad 500007, Telangana, India
| | - Puran Singh Sijwali
- CSIR-Centre
for Cellular and Molecular Biology, Hyderabad 500007, Telangana, India
- Academy
of Scientific and Innovative Research, Ghaziabad 201002, Uttar Pradesh, India
| | | |
Collapse
|
5
|
Antil N, Kumar M, Behera SK, Arefian M, Kotimoole CN, Rex DAB, Prasad TSK. Unraveling Toxoplasma gondii GT1 Strain Virulence and New Protein-Coding Genes with Proteogenomic Analyses. OMICS-A JOURNAL OF INTEGRATIVE BIOLOGY 2021; 25:591-604. [PMID: 34468217 DOI: 10.1089/omi.2021.0082] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Abstract
Toxoplasma gondii is one of the most widespread parasites of great relevance to planetary health. It infects approximately one-third of the world population. T. gondii establishes itself in warm-blooded animals and causes adverse health outcomes, particularly in immunocompromised patients. T. gondii is also widely used as a model organism to study other related apicomplexan parasites, which requires a deeper understanding of its molecular biology. Type I strains (GT1 and RH) of T. gondii are considered the most virulent forms. The whole-genome sequencing of T. gondii annotated 8460 predicted gene models in the parasite. To this end, the proteogenomics technology allows harnessing of mass spectrometry (MS)-derived proteomic data to unravel new protein-coding genes, not to mention validation and correction of the existing gene models. In this study using the proteogenomic approach, we report the identification of 31 novel protein-coding genes while reannotating 88 existing gene models. Notably, the genome annotations were corrected for genes, such as SAG5C, GRA6, ROP4, ROP5, and ROP26. The associated proteins are known to play important roles in host-parasite interactions, particularly in relation to parasite virulence, suppression of host immune response, and distinctively pertinent for the survival of the parasite inside the host system. These new findings offer new insights, informing planetary health broadly and the knowledge base on T. gondii virulence specifically. The proteogenomics approach also provides a concrete example to study related apicomplexan organisms of relevance to planetary health, and so as to develop new diagnostics and therapeutics against toxoplasmosis and related diseases.
Collapse
Affiliation(s)
- Neelam Antil
- Institute of Bioinformatics, International Technology Park, Bangalore, India.,Centre for Systems Biology and Molecular Medicine, Yenepoya Research Center, Yenepoya (Deemed to be University), Mangalore, India.,Amrita School of Biotechnology, Amrita Vishwa Vidyapeetham, Kollam, India
| | - Manish Kumar
- Institute of Bioinformatics, International Technology Park, Bangalore, India.,Manipal Academy of Higher Education, Manipal, India
| | - Santosh Kumar Behera
- Centre for Systems Biology and Molecular Medicine, Yenepoya Research Center, Yenepoya (Deemed to be University), Mangalore, India
| | - Mohammad Arefian
- Centre for Systems Biology and Molecular Medicine, Yenepoya Research Center, Yenepoya (Deemed to be University), Mangalore, India
| | - Chinmaya Narayana Kotimoole
- Centre for Systems Biology and Molecular Medicine, Yenepoya Research Center, Yenepoya (Deemed to be University), Mangalore, India
| | - Devasahayam Arokia Balaya Rex
- Centre for Systems Biology and Molecular Medicine, Yenepoya Research Center, Yenepoya (Deemed to be University), Mangalore, India
| | | |
Collapse
|
6
|
Tian F, Shi J, Li Y, Gao H, Chang L, Zhang Y, Gao L, Xu P, Tang S. Proteogenomics Study of Blastobotrys adeninivorans TMCC 70007-A Dominant Yeast in the Fermentation Process of Pu-erh Tea. J Proteome Res 2021; 20:3290-3304. [PMID: 34008989 DOI: 10.1021/acs.jproteome.1c00205] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Blastobotrys adeninivorans plays an essential role in pile-fermenting of Pu-erh tea. Its ability to assimilate various carbon and nitrogen sources makes it available for application in a wide range of industry sectors. The genome of B. adeninivorans TMCC 70007 isolated from pile-fermented Pu-erh tea was sequenced and assembled. Proteomics analysis indicated that 4900 proteins in TMCC 70007 were expressed under various culture conditions. Proteogenomics mapping revealed 48 previously unknown genes and corrected 118 gene models predicted by GeneMark-ES. Ortho-proteogenomics analysis identified 17 previously unidentified genes in B. adeninivorans LS3, the first strain with a sequenced genome among the genus Blastobotrys as well. More importantly, five species specific genes were identified from TMCC 70007, which could serve as a barcode for strain typing and were applicable for fermentation process protection of this industrial species. The datasets generated from tea aqueous extract culture not only increased the proteome coverage and accuracy but also contributed to the identification of proteins related to polyphenols and caffeine, which were considered to change greatly during the microbial fermentation of Pu-erh tea. This study provides a proteome perspective on TMCC 70007, which was considered to be an important strain in the production of Pu-erh tea. The systematic proteogenomics analysis not only made a better annotation on the genome of B. adeninivorans TMCC 70007 as previous proteogenomics study but also provided solution for fermentation process protection on valuable industrial species with species specific genes uniquely identified from proteogenomics study.
Collapse
Affiliation(s)
- Fei Tian
- Key Laboratory of Microbial Diversity in Southwest China, Ministry of Education, and Laboratory for Conservation and Utilization of Bio-resources, Yunnan Institute of Microbiology, School of Life Sciences, Yunnan University, Kunming 650091, China.,State Key Laboratory of Proteomics, Beijing Proteome Research Center, Research Unit of Proteomics & Research and Development of New Drug, Chinese Academy of Medical Sciences, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing 102206, China
| | - Jiahui Shi
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, Research Unit of Proteomics & Research and Development of New Drug, Chinese Academy of Medical Sciences, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing 102206, China.,Hebei Province Key Lab of Research and Application on Microbial Diversity, College of Life Sciences, Hebei University, Baoding 071002, China
| | - Yanchang Li
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, Research Unit of Proteomics & Research and Development of New Drug, Chinese Academy of Medical Sciences, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing 102206, China
| | - Huiying Gao
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, Research Unit of Proteomics & Research and Development of New Drug, Chinese Academy of Medical Sciences, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing 102206, China
| | - Lei Chang
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, Research Unit of Proteomics & Research and Development of New Drug, Chinese Academy of Medical Sciences, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing 102206, China
| | - Yao Zhang
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, Research Unit of Proteomics & Research and Development of New Drug, Chinese Academy of Medical Sciences, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing 102206, China
| | - Linrui Gao
- Yunnan Pu-erh Tea Fermentation Engineering Research Center, Yunnan TAETEA Microbial Technology Co., Ltd., Kunming 650217, China
| | - Ping Xu
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, Research Unit of Proteomics & Research and Development of New Drug, Chinese Academy of Medical Sciences, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing 102206, China.,Hebei Province Key Lab of Research and Application on Microbial Diversity, College of Life Sciences, Hebei University, Baoding 071002, China
| | - Shukun Tang
- Key Laboratory of Microbial Diversity in Southwest China, Ministry of Education, and Laboratory for Conservation and Utilization of Bio-resources, Yunnan Institute of Microbiology, School of Life Sciences, Yunnan University, Kunming 650091, China.,Yunnan Pu-erh Tea Fermentation Engineering Research Center, Yunnan TAETEA Microbial Technology Co., Ltd., Kunming 650217, China
| |
Collapse
|
7
|
Thoduvayil S, Dhandapani G, Brahma R, Devasahayam Arokia Balaya R, Mangalaparthi KK, Patel K, Kumar M, Tennyson J, Satheeshkumar PK, Kulkarni MJ, Pinto SM, Prasad TSK, Madanan MG. Triton X-114 Fractionated Subcellular Proteome of Leptospira interrogans Shows Selective Enrichment of Pathogenic and Outer Membrane Proteins in the Detergent Fraction. Proteomics 2020; 20:e2000170. [PMID: 32846045 DOI: 10.1002/pmic.202000170] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2020] [Revised: 07/30/2020] [Indexed: 12/28/2022]
Abstract
The Triton X-114-based solubilization and temperature-dependent phase separation of proteins is used for subcellular fractionation where, aqueous, detergent, and pellet fractions represents cytoplasmic, outer membrane (OM), and inner membrane proteins, respectively. Mass spectrometry-based proteomic analysis of Triton X-114 fractions of proteomic analysis of Leptospira interrogans identified 2957 unique proteins distributed across the fractions. The results are compared with bioinformatics predictions on their subcellular localization and pathogenic nature. Analysis of the distribution of proteins across the Triton X-114 fractions with the predicted characteristics is performed based on "number" of unique type of proteins, and "quantity" which represents the amount of unique protein. The highest number of predicted outer membrane proteins (OMPs) and pathogenic proteins are found in aqueous and pellet fractions, whereas detergent fraction representing the OM has the highest quantity of OMPs and pathogenic proteins though lower in number than the aqueous and pellet fractions. This leaves the possibility of an upsurge in pathogenic proteins and OMPs on the OM under pathogenic conditions suggesting their potential use to combat leptospirosis. Further, the Triton X-114 subcellular fractions are more correlated to enrichment of pathogenic proteins predicted by MP3 software than predicted localization.
Collapse
Affiliation(s)
- Sikha Thoduvayil
- Indian Council of Medical Research, Regional Medical Research Centre Port Blair, Dollygunj, Port Blair, 744103, India.,Department of Pathology, Jawaharlal Institute of Postgraduate Medical Education and Research, Puducherry, 605006, India
| | - Gunasekaran Dhandapani
- Indian Council of Medical Research, Regional Medical Research Centre Port Blair, Dollygunj, Port Blair, 744103, India.,Department of Chemical Sciences, Ariel University, Ariel, 70400, Israel
| | - Rahul Brahma
- Indian Council of Medical Research, Regional Medical Research Centre Port Blair, Dollygunj, Port Blair, 744103, India
| | - Rex Devasahayam Arokia Balaya
- Center for Systems Biology and Molecular Medicine, Yenepoya Research Centre, Yenepoya (Deemed to be University), Mangaluru, 575018, India
| | - Kiran K Mangalaparthi
- Institute of Bioinformatics, International Technology Park, Bengaluru, 560066, India.,NIMHANS-IOB Proteomics and Bioinformatics Laboratory, Neurobiology Research Centre, National Institute of Mental Health and Neurosciences, Bengaluru, 560029, India
| | - Krishna Patel
- Institute of Bioinformatics, International Technology Park, Bengaluru, 560066, India.,Amrita School of Biotechnology, Amrita Vishwa Vidyapeetham, Kollam, 690525, India
| | - Manish Kumar
- Institute of Bioinformatics, International Technology Park, Bengaluru, 560066, India.,Manipal Academy of Higher Education, Manipal, 576104, India
| | - Jebasingh Tennyson
- School of Biological Sciences, Madurai Kamaraj University, Madurai, 625021, India
| | - P K Satheeshkumar
- Department of Botany, Institute of Science, Banaras Hindu University, Varanasi, 221005, India
| | - Mahesh J Kulkarni
- Biochemical Sciences Division, CSIR-National Chemical Laboratory, Pune, 411008, India
| | - Sneha M Pinto
- Center for Systems Biology and Molecular Medicine, Yenepoya Research Centre, Yenepoya (Deemed to be University), Mangaluru, 575018, India.,Institute of Bioinformatics, International Technology Park, Bengaluru, 560066, India
| | - T S Keshava Prasad
- Center for Systems Biology and Molecular Medicine, Yenepoya Research Centre, Yenepoya (Deemed to be University), Mangaluru, 575018, India.,Institute of Bioinformatics, International Technology Park, Bengaluru, 560066, India.,NIMHANS-IOB Proteomics and Bioinformatics Laboratory, Neurobiology Research Centre, National Institute of Mental Health and Neurosciences, Bengaluru, 560029, India
| | - Madathiparambil G Madanan
- Indian Council of Medical Research, Regional Medical Research Centre Port Blair, Dollygunj, Port Blair, 744103, India
| |
Collapse
|
8
|
Ramesha KP, Mol P, Kannegundla U, Thota LN, Gopalakrishnan L, Rana E, Azharuddin N, Mangalaparthi KK, Kumar M, Dey G, Patil A, Saravanan K, Behera SK, Jeyakumar S, Kumaresan A, Kataktalware MA, Prasad TSK. Deep Proteome Profiling of Semen of Indian Indigenous Malnad Gidda (Bos indicus) Cattle. J Proteome Res 2020; 19:3364-3376. [DOI: 10.1021/acs.jproteome.0c00237] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Affiliation(s)
- Kerekoppa P. Ramesha
- Southern Regional Station, ICAR-National Dairy Research Institute, Bangalore 560030, India
| | - Praseeda Mol
- Institute of Bioinformatics, International Technology Park, Bangalore 560066, India
- Amrita School of Biotechnology, Amrita Vishwa Vidyapeetham, Kollam, Kerala 690525, India
| | - Uday Kannegundla
- Southern Regional Station, ICAR-National Dairy Research Institute, Bangalore 560030, India
| | | | - Lathika Gopalakrishnan
- Institute of Bioinformatics, International Technology Park, Bangalore 560066, India
- Centre for Systems Biology and Molecular Medicine, Yenepoya Research Centre, Yenepoya (Deemed to be University), Mangalore 575018, India
- Manipal Academy of Higher Education, Madhav Nagar, Manipal 576104, India
| | - Ekta Rana
- Southern Regional Station, ICAR-National Dairy Research Institute, Bangalore 560030, India
| | - Nizamuddin Azharuddin
- Southern Regional Station, ICAR-National Dairy Research Institute, Bangalore 560030, India
| | - Kiran K Mangalaparthi
- Institute of Bioinformatics, International Technology Park, Bangalore 560066, India
- Amrita School of Biotechnology, Amrita Vishwa Vidyapeetham, Kollam, Kerala 690525, India
| | - Manish Kumar
- Institute of Bioinformatics, International Technology Park, Bangalore 560066, India
| | - Gourav Dey
- Institute of Bioinformatics, International Technology Park, Bangalore 560066, India
| | - Arun Patil
- Centre for Systems Biology and Molecular Medicine, Yenepoya Research Centre, Yenepoya (Deemed to be University), Mangalore 575018, India
| | - Kumar Saravanan
- Proteomics Facility, Thermo Fisher Scientific India Pvt. Ltd., Bangalore 560066, India
| | - Santosh Kumar Behera
- Centre for Systems Biology and Molecular Medicine, Yenepoya Research Centre, Yenepoya (Deemed to be University), Mangalore 575018, India
| | - Sakthivel Jeyakumar
- Southern Regional Station, ICAR-National Dairy Research Institute, Bangalore 560030, India
| | - Arumugam Kumaresan
- Southern Regional Station, ICAR-National Dairy Research Institute, Bangalore 560030, India
| | - Mukund A. Kataktalware
- Southern Regional Station, ICAR-National Dairy Research Institute, Bangalore 560030, India
| | | |
Collapse
|
9
|
Abstract
![]()
Secretory proteins are key modulators of host–pathogen interaction.
The human opportunistic fungal pathogen Candida glabrata lacks secreted proteolytic activity but possesses 11 glycosylphosphatidylinositol-anchored
aspartyl proteases, also referred to as Yapsins (CgYps1–11),
that are essential for its virulence. To delineate the role of CgYapsins
in interaction with host cells, we have profiled, through liquid chromatography-tandem
mass spectrometry (LC-MS/MS) approach, the total secretome of wild-type and Cgyps1-11Δ mutant.
The wild-type secretome consisted of 119 proteins
which were primarily involved in cell wall organization, carbohydrate
metabolism, proteolysis, and translation processes. Of eight CgYapsins
identified in the secretome, the release of two major CgYapsins, CgYps1
and CgYps7, to the medium was confirmed by Western analysis. Further,
comparative analysis revealed 20 common proteins, probably signifying
the core fungal secretome, among C. glabrata, Saccharomyces cerevisiae, and Candida albicans secretomes. Strikingly, the Cgyps1-11Δ secretome was 4.6-fold larger, and contained
65 differentially abundant proteins, as revealed by label-free quantitative
profiling, with 49 and 16 being high- and low-abundant proteins, respectively,
compared to the wild-type secretome. Importantly,
the CgMsb2 mucin, a putative CgYapsins’ substrate, was six-fold
underrepresented in the mutant secretome. Altogether, we demonstrate
for the first time that CgYapsins are both bona fide constituents
and key modulators of the C. glabrata secretome.
Collapse
Affiliation(s)
- Mubashshir Rasheed
- Laboratory of Fungal Pathogenesis , Centre for DNA Fingerprinting and Diagnostics , Hyderabad , Telangana 500039 , India
| | - Naveen Kumar
- Laboratory of Fungal Pathogenesis , Centre for DNA Fingerprinting and Diagnostics , Hyderabad , Telangana 500039 , India
| | - Rupinder Kaur
- Laboratory of Fungal Pathogenesis , Centre for DNA Fingerprinting and Diagnostics , Hyderabad , Telangana 500039 , India
| |
Collapse
|
10
|
Agrawal A, Ravikumar R, Varun CN, Kumar M, Chatterjee O, Advani J, Gopalakrishnan L, Nagaraj S, Mohanty V, Patil AH, Sreeramulu B, Malik A, Pinto SM, Prasad TSK. Global Proteome Profiling Reveals Drug-Resistant Traits in Elizabethkingia meningoseptica: An Opportunistic Nosocomial Pathogen. OMICS-A JOURNAL OF INTEGRATIVE BIOLOGY 2019; 23:318-326. [PMID: 31120389 DOI: 10.1089/omi.2019.0039] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/26/2023]
Abstract
Elizabethkingia meningoseptica is Gram-negative, rod-shaped opportunistic bacterial pathogen increasingly reported in hospital-acquired outbreaks. This bacterium is well known to thrive in the hospital environment. One of the leading causes of meningitis in pediatric and immune-compromised patients, E. meningoseptica has been noted as a "pathogen of interest" in the context of nosocomial diseases associated with device-related infections in particular. This pathogen's multidrug-resistant phenotype and attendant lack of adequate molecular mechanistic data limit the current approaches for its effective management in hospitals and public health settings. This study provides the global proteome of E. meningoseptica. The reference strain E. meningoseptica ATCC 13253 was used for proteomic analysis using high-resolution Fourier transform mass spectrometry. The study provided translational evidence for 2506 proteins of E. meningoseptica. We identified multiple metallo-β-lactamases, transcriptional regulators, and efflux transporter proteins associated with multidrug resistance. A protein Car D, which is an enzyme of the carbapenem synthesis pathway, was also discovered in E. meningoseptica. Further, the proteomics data were harnessed for refining the genome annotation. We discovered 39 novel protein-coding genes and corrected four existing translations using proteogenomic workflow. Novel translations reported in this study enhance the molecular data on this organism, thus improving current databases. We believe that the in-depth proteomic data presented in this study offer a platform for accelerated research on this pathogen. The identification of multiple proteins, particularly those involved in drug resistance, offers new future opportunities to design novel and specific antibiotics against infections caused by E. meningoseptica.
Collapse
Affiliation(s)
- Archana Agrawal
- 1 Department of Neuromicrobiology, National Institute of Mental Health and Neurosciences, Bangalore, India
| | - Raju Ravikumar
- 1 Department of Neuromicrobiology, National Institute of Mental Health and Neurosciences, Bangalore, India
| | - Chakrakodi N Varun
- 1 Department of Neuromicrobiology, National Institute of Mental Health and Neurosciences, Bangalore, India
| | - Manish Kumar
- 2 Institute of Bioinformatics, International Technology Park, Bangalore, India
| | - Oishi Chatterjee
- 2 Institute of Bioinformatics, International Technology Park, Bangalore, India.,3 Center for Systems Biology and Molecular Medicine, Yenepoya Research Centre, Yenepoya (Deemed to be University), Mangalore, India.,4 School of Biotechnology, Amrita Vishwa Vidyapeetham, Kollam, India
| | - Jayshree Advani
- 2 Institute of Bioinformatics, International Technology Park, Bangalore, India
| | - Lathika Gopalakrishnan
- 2 Institute of Bioinformatics, International Technology Park, Bangalore, India.,3 Center for Systems Biology and Molecular Medicine, Yenepoya Research Centre, Yenepoya (Deemed to be University), Mangalore, India.,5 Manipal Academy of Higher Education, Manipal, India
| | - Sowmya Nagaraj
- 1 Department of Neuromicrobiology, National Institute of Mental Health and Neurosciences, Bangalore, India
| | - Varshasnata Mohanty
- 3 Center for Systems Biology and Molecular Medicine, Yenepoya Research Centre, Yenepoya (Deemed to be University), Mangalore, India
| | - Arun H Patil
- 2 Institute of Bioinformatics, International Technology Park, Bangalore, India.,3 Center for Systems Biology and Molecular Medicine, Yenepoya Research Centre, Yenepoya (Deemed to be University), Mangalore, India.,6 School of Biotechnology, KIIT (Deemed to be University), Bhubaneswar, India
| | | | - Aubid Malik
- 8 CSIR-Indian Institute of Integrative Medicine, Jammu, India
| | - Sneha M Pinto
- 3 Center for Systems Biology and Molecular Medicine, Yenepoya Research Centre, Yenepoya (Deemed to be University), Mangalore, India
| | - Thottethodi Subrahmanya Keshava Prasad
- 2 Institute of Bioinformatics, International Technology Park, Bangalore, India.,3 Center for Systems Biology and Molecular Medicine, Yenepoya Research Centre, Yenepoya (Deemed to be University), Mangalore, India
| |
Collapse
|
11
|
Datta KK, Patil AH, Patel K, Dey G, Madugundu AK, Renuse S, Kaviyil JE, Sekhar R, Arunima A, Daswani B, Kaur I, Mohanty J, Sinha R, Jaiswal S, Sivapriya S, Sonnathi Y, Chattoo BB, Gowda H, Ravikumar R, Prasad TSK. Proteogenomics of Candida tropicalis--An Opportunistic Pathogen with Importance for Global Health. OMICS-A JOURNAL OF INTEGRATIVE BIOLOGY 2017; 20:239-47. [PMID: 27093108 DOI: 10.1089/omi.2015.0197] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
The frequency of Candida infections is currently rising, and thus adversely impacting global health. The situation is exacerbated by azole resistance developed by fungal pathogens. Candida tropicalis is an opportunistic pathogen that causes candidiasis, for example, in immune-compromised individuals, cancer patients, and those who undergo organ transplantation. It is a member of the non-albicans group of Candida that are known to be azole-resistant, and is frequently seen in individuals being treated for cancers, HIV-infection, and those who underwent bone marrow transplantation. Although the genome of C. tropicalis was sequenced in 2009, the genome annotation has not been supported by experimental validation. In the present study, we have carried out proteomics profiling of C. tropicalis using high-resolution Fourier transform mass spectrometry. We identified 2743 proteins, thus mapping nearly 44% of the computationally predicted protein-coding genes with peptide level evidence. In addition to identifying 2591 proteins in the cell lysate of this yeast, we also analyzed the proteome of the conditioned media of C. tropicalis culture and identified several unique secreted proteins among a total of 780 proteins. By subjecting the mass spectrometry data derived from cell lysate and conditioned media to proteogenomic analysis, we identified 86 novel genes, 12 novel exons, and corrected 49 computationally-predicted gene models. To our knowledge, this is the first high-throughput proteomics study of C. tropicalis validating predicted protein coding genes and refining the current genome annotation. The findings may prove useful in future global health efforts to fight against Candida infections.
Collapse
Affiliation(s)
- Keshava K Datta
- 1 Institute of Bioinformatics , International Technology Park, Bangalore, India.,2 School of Biotechnology, KIIT University , Bhubaneswar, India
| | - Arun H Patil
- 1 Institute of Bioinformatics , International Technology Park, Bangalore, India.,2 School of Biotechnology, KIIT University , Bhubaneswar, India
| | - Krishna Patel
- 1 Institute of Bioinformatics , International Technology Park, Bangalore, India.,3 Amrita School of Biotechnology, Amrita Vishwa Vidyapeetham , Kollam, India
| | - Gourav Dey
- 1 Institute of Bioinformatics , International Technology Park, Bangalore, India.,4 Manipal University , Madhav Nagar, Manipal, India
| | - Anil K Madugundu
- 1 Institute of Bioinformatics , International Technology Park, Bangalore, India.,5 Centre for Bioinformatics, School of Life Sciences, Pondicherry University , Puducherry, India
| | - Santosh Renuse
- 1 Institute of Bioinformatics , International Technology Park, Bangalore, India.,3 Amrita School of Biotechnology, Amrita Vishwa Vidyapeetham , Kollam, India
| | - Jyothi E Kaviyil
- 6 Department of Neuromicrobiology, Neurobiology Research Centre, National Institute of Mental Health and Neurosciences , Bangalore, India
| | - Raja Sekhar
- 1 Institute of Bioinformatics , International Technology Park, Bangalore, India.,5 Centre for Bioinformatics, School of Life Sciences, Pondicherry University , Puducherry, India
| | | | - Bhavna Daswani
- 7 National Institute for Research in Reproductive Health (ICMR) , Parel, Mumbai, India
| | - Inderjeet Kaur
- 8 Malaria Research Group, International Center for Genetic Engineering and Biotechnology (ICGEB) , New Delhi, India
| | - Jyotirmaya Mohanty
- 9 ICAR-Central Institute of Freshwater Aquaculture , Kausalyaganga, Bhubaneswar, India
| | | | | | - S Sivapriya
- 11 Department of Ocular Pathology, Vision Research Foundation , Chennai, India
| | | | - Bharat B Chattoo
- 13 Centre for Genome Research, Department of Microbiology and Biotechnology Centre, Faculty of Science, The M. S. University of Baroda , Vadodara, India
| | - Harsha Gowda
- 1 Institute of Bioinformatics , International Technology Park, Bangalore, India.,2 School of Biotechnology, KIIT University , Bhubaneswar, India .,14 YU-IOB Center for Systems Biology and Molecular Medicine, Yenepoya University , Mangalore, India
| | - Raju Ravikumar
- 6 Department of Neuromicrobiology, Neurobiology Research Centre, National Institute of Mental Health and Neurosciences , Bangalore, India
| | - T S Keshava Prasad
- 1 Institute of Bioinformatics , International Technology Park, Bangalore, India.,14 YU-IOB Center for Systems Biology and Molecular Medicine, Yenepoya University , Mangalore, India .,15 NIMHANS-IOB Proteomics and Bioinformatics Laboratory, Neurobiology Research Centre, National Institute of Mental Health and Neurosciences , Bangalore, India
| |
Collapse
|
12
|
Abstract
Recent advances in high resolution tandem mass spectrometry (MS) has resulted in the accumulation of high quality data. Paralleled with these advances in instrumentation, bioinformatics software have been developed to analyze such quality datasets. In spite of these advances, data analysis in mass spectrometry still remains critical for protein identification. In addition, the complexity of the generated MS/MS spectra, unpredictable nature of peptide fragmentation, sequence annotation errors, and posttranslational modifications has impeded the protein identification process. In a typical MS data analysis, about 60 % of the MS/MS spectra remains unassigned. While some of these could attribute to the low quality of the MS/MS spectra, a proportion can be classified as high quality. Further analysis may reveal how much of the unassigned MS spectra attribute to search space, sequence annotation errors, mutations, and/or posttranslational modifications. In this chapter, the tools used to identify proteins and ways to assign unassigned tandem MS spectra are discussed.
Collapse
Affiliation(s)
- Mohashin Pathan
- Department of Biochemistry and Genetics, La Trobe Institute for Molecular Science, La Trobe University, Melbourne, VIC, 3086, Australia
| | - Monisha Samuel
- Department of Physiology, Anatomy and Microbiology, La Trobe University, Bundoora, Melbourne, VIC, 3086, Australia
| | - Shivakumar Keerthikumar
- Department of Biochemistry and Genetics, La Trobe Institute for Molecular Science, La Trobe University, Melbourne, VIC, 3086, Australia
| | - Suresh Mathivanan
- Department of Biochemistry and Genetics, La Trobe Institute for Molecular Science, La Trobe University, Melbourne, VIC, 3086, Australia.
| |
Collapse
|
13
|
Zhu X, Xie S, Armengaud J, Xie W, Guo Z, Kang S, Wu Q, Wang S, Xia J, He R, Zhang Y. Tissue-specific Proteogenomic Analysis of Plutella xylostella Larval Midgut Using a Multialgorithm Pipeline. Mol Cell Proteomics 2016; 15:1791-807. [PMID: 26902207 PMCID: PMC5083088 DOI: 10.1074/mcp.m115.050989] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2015] [Revised: 02/04/2016] [Indexed: 11/06/2022] Open
Abstract
The diamondback moth, Plutella xylostella (L.), is the major cosmopolitan pest of brassica and other cruciferous crops. Its larval midgut is a dynamic tissue that interfaces with a wide variety of toxicological and physiological processes. The draft sequence of the P. xylostella genome was recently released, but its annotation remains challenging because of the low sequence coverage of this branch of life and the poor description of exon/intron splicing rules for these insects. Peptide sequencing by computational assignment of tandem mass spectra to genome sequence information provides an experimental independent approach for confirming or refuting protein predictions, a concept that has been termed proteogenomics. In this study, we carried out an in-depth proteogenomic analysis to complement genome annotation of P. xylostella larval midgut based on shotgun HPLC-ESI-MS/MS data by means of a multialgorithm pipeline. A total of 876,341 tandem mass spectra were searched against the predicted P. xylostella protein sequences and a whole-genome six-frame translation database. Based on a data set comprising 2694 novel genome search specific peptides, we discovered 439 novel protein-coding genes and corrected 128 existing gene models. To get the most accurate data to seed further insect genome annotation, more than half of the novel protein-coding genes, i.e. 235 over 439, were further validated after RT-PCR amplification and sequencing of the corresponding transcripts. Furthermore, we validated 53 novel alternative splicings. Finally, a total of 6764 proteins were identified, resulting in one of the most comprehensive proteogenomic study of a nonmodel animal. As the first tissue-specific proteogenomics analysis of P. xylostella, this study provides the fundamental basis for high-throughput proteomics and functional genomics approaches aimed at deciphering the molecular mechanisms of resistance and controlling this pest.
Collapse
Affiliation(s)
- Xun Zhu
- From the ‡Department of Plant Protection, Institute of Vegetables and Flowers, Chinese Academy of Agricultural Sciences, Beijing, 100081, China
| | | | - Jean Armengaud
- ¶CEA-Marcoule, DSV/IBITEC-S/SPI/Li2D, Laboratory, BP 17171, F-30200, Bagnols-sur-Cèze, F-30207, France
| | - Wen Xie
- From the ‡Department of Plant Protection, Institute of Vegetables and Flowers, Chinese Academy of Agricultural Sciences, Beijing, 100081, China
| | - Zhaojiang Guo
- From the ‡Department of Plant Protection, Institute of Vegetables and Flowers, Chinese Academy of Agricultural Sciences, Beijing, 100081, China
| | - Shi Kang
- From the ‡Department of Plant Protection, Institute of Vegetables and Flowers, Chinese Academy of Agricultural Sciences, Beijing, 100081, China
| | - Qingjun Wu
- From the ‡Department of Plant Protection, Institute of Vegetables and Flowers, Chinese Academy of Agricultural Sciences, Beijing, 100081, China
| | - Shaoli Wang
- From the ‡Department of Plant Protection, Institute of Vegetables and Flowers, Chinese Academy of Agricultural Sciences, Beijing, 100081, China
| | - Jixing Xia
- From the ‡Department of Plant Protection, Institute of Vegetables and Flowers, Chinese Academy of Agricultural Sciences, Beijing, 100081, China
| | - Rongjun He
- From the ‡Department of Plant Protection, Institute of Vegetables and Flowers, Chinese Academy of Agricultural Sciences, Beijing, 100081, China
| | - Youjun Zhang
- From the ‡Department of Plant Protection, Institute of Vegetables and Flowers, Chinese Academy of Agricultural Sciences, Beijing, 100081, China;
| |
Collapse
|
14
|
Subbannayya Y, Pinto SM, Gowda H, Prasad TSK. Proteogenomics for understanding oncology: recent advances and future prospects. Expert Rev Proteomics 2016; 13:297-308. [PMID: 26697917 DOI: 10.1586/14789450.2016.1136217] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023]
Abstract
The concept of proteogenomics has emerged rapidly as a valuable approach to integrate mass spectrometry-derived proteomic data with genomic and transcriptomic data. It is used to harness the full potential of the former dataset in the discovery of potential biomarkers, therapeutic targets and novel proteins associated with various biological processes including diseases. Proteogenomic strategies have been successfully utilized to identify novel genes and redefine annotation of existing gene models in various genomes. In recent years, this approach has been extended to the field of cancer biology to unravel complexities in the tumor genomes and proteomes. Standard proteomics workflows employing translated cancer genomes and transcriptomes can potentially identify peptides from mutant proteins, splice variants and fusion proteins in the tumor proteome, which in addition to the currently available biomarker panels can serve as potential diagnostic and prognostic biomarkers, besides having therapeutic utility. This review focuses on the role of proteogenomics to understand cancer biology.
Collapse
Affiliation(s)
- Yashwanth Subbannayya
- a YU-IOB Center for Systems Biology and Molecular Medicine , Yenepoya University , Mangalore, India.,b Institute of Bioinformatics , Bangalore , India
| | - Sneha M Pinto
- a YU-IOB Center for Systems Biology and Molecular Medicine , Yenepoya University , Mangalore, India.,b Institute of Bioinformatics , Bangalore , India
| | - Harsha Gowda
- a YU-IOB Center for Systems Biology and Molecular Medicine , Yenepoya University , Mangalore, India.,b Institute of Bioinformatics , Bangalore , India
| | - T S Keshava Prasad
- a YU-IOB Center for Systems Biology and Molecular Medicine , Yenepoya University , Mangalore, India.,b Institute of Bioinformatics , Bangalore , India.,c NIMHANS-IOB Proteomics and Bioinformatics Laboratory, Neurobiology Research Centre , National Institute of Mental Health and Neurosciences , Bangalore , India
| |
Collapse
|
15
|
Locard-Paulet M, Pible O, Gonzalez de Peredo A, Alpha-Bazin B, Almunia C, Burlet-Schiltz O, Armengaud J. Clinical implications of recent advances in proteogenomics. Expert Rev Proteomics 2016; 13:185-99. [DOI: 10.1586/14789450.2016.1132169] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023]
|
16
|
Abstract
We are presenting a quantitative proteomics tally of the most commonly expressed conserved fungal proteins of the cytosol, the cell wall, and the secretome. It was our goal to identify fungi-typical proteins that do not share significant homology with human proteins. Such fungal proteins are of interest to the development of vaccines or drug targets. Protein samples were derived from 13 fungal species, cultured in rich or in minimal media; these included clinical isolates of Aspergillus, Candida, Mucor, Cryptococcus, and Coccidioides species. Proteomes were analyzed by quantitative MSE (Mass Spectrometry-Elevated Collision Energy). Several thousand proteins were identified and quantified in total across all fractions and culture conditions. The 42 most abundant proteins identified in fungal cell walls or supernatants shared no to very little homology with human proteins. In contrast, all but five of the 50 most abundant cytosolic proteins had human homologs with sequence identity averaging 59%. Proteomic comparisons of the secreted or surface localized fungal proteins highlighted conserved homologs of the Aspergillus fumigatus proteins 1,3-β-glucanosyltransferases (Bgt1, Gel1-4), Crf1, Ecm33, EglC, and others. The fact that Crf1 and Gel1 were previously shown to be promising vaccine candidates, underlines the value of the proteomics data presented here.
Collapse
|
17
|
Abstract
Annotation of protein coding genes in sequenced genomes has been routinely carried out using gene prediction programs guided by available transcript data. The advent of mass spectrometry has enabled the identification of proteins in a high-throughput manner. In addition to searching proteins annotated in public databases, mass spectrometry data can also be searched against conceptually translated genome as well as transcriptome to identify novel protein coding regions. This proteogenomics approach has resulted in the identification of novel protein coding regions in both prokaryotic and eukaryotic genomes. These studies have also revealed that some of the annotated noncoding RNAs and pseudogenes code for proteins. This approach is likely to become a part of most genome annotation workflows in the future. Here we describe a general methodology and approach that can be used for proteogenomics.
Collapse
Affiliation(s)
- Keshava K Datta
- Institute of Bioinformatics, International Technology Park, Bangalore, 560066, India
- School of Biotechnology, KIIT University, Bhubaneswar, 751024, Odisha, India
| | - Anil K Madugundu
- Institute of Bioinformatics, International Technology Park, Bangalore, 560066, India
- Centre for Bioinformatics, School of Life Sciences, Pondicherry University, Puducherry, 605014, India
| | - Harsha Gowda
- Institute of Bioinformatics, International Technology Park, Bangalore, 560066, India.
- School of Biotechnology, KIIT University, Bhubaneswar, 751024, Odisha, India.
| |
Collapse
|
18
|
Pawar H, Chavan S, Mahale K, Khobragade S, Kulkarni A, Patil A, Chaphekar D, Varriar P, Sudeep A, Pai K, Prasad T, Gowda H, Patole MS. A proteomic map of the unsequenced kala-azar vector Phlebotomus papatasi using cell line. Acta Trop 2015; 152:80-89. [PMID: 26307495 DOI: 10.1016/j.actatropica.2015.08.012] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2015] [Revised: 07/16/2015] [Accepted: 08/18/2015] [Indexed: 11/25/2022]
Abstract
The debilitating disease kala-azar or visceral leishmaniasis is caused by the kinetoplastid protozoan parasite Leishmania donovani. The parasite is transmitted by the hematophagous sand fly vector of the genus Phlebotomus in the old world and Lutzomyia in the new world. The predominant Phlebotomine species associated with the transmission of kala-azar are Phlebotomus papatasi and Phlebotomus argentipes. Understanding the molecular interaction of the sand fly and Leishmania, during the development of parasite within the sand fly gut is crucial to the understanding of the parasite life cycle. The complete genome sequences of sand flies (Phlebotomus and Lutzomyia) are currently not available and this hinders identification of proteins in the sand fly vector. The current study utilizes a three frame translated transcriptomic data of P. papatasi in the absence of genomic sequences to analyze the mass spectrometry data of P. papatasi cell line using a proteogenomic approach. Additionally, we have carried out the proteogenomic analysis of P. papatasi by comparative homology-based searches using related sequenced dipteran protein data. This study resulted in the identification of 1313 proteins from P. papatasi based on homology. Our study demonstrates the power of proteogenomic approaches in mapping the proteomes of unsequenced organisms.
Collapse
|
19
|
Xu X, Liu T, Ren X, Liu B, Yang J, Chen L, Wei C, Zheng J, Dong J, Sun L, Zhu Y, Jin Q. Proteogenomic Analysis of Trichophyton rubrum Aided by RNA Sequencing. J Proteome Res 2015; 14:2207-18. [PMID: 25868943 DOI: 10.1021/acs.jproteome.5b00009] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Infections caused by dermatophytes, Trichophyton rubrum in particular, are among the most common diseases in humans. In this study, we present a proteogenomic analysis of T. rubrum based on whole-genome proteomics and RNA-Seq studies. We confirmed 4291 expressed proteins in T. rubrum and validated their annotated gene structures based on 35 874 supporting peptides. In addition, we identified 323 novel peptides (not present in the current annotated protein database of T. rubrum) that can be used to enhance current T. rubrum annotations. A total of 104 predicted genes supported by novel peptides were identified, and 127 gene models suggested by the novel peptides that conflicted with existing annotations were manually assigned based on transcriptomic evidence. RNA-Seq confirmed the validity of 95% of the total peptides. Our study provides evidence that confirms and improves the genome annotation of T. rubrum and represents the first survey of T. rubrum genome annotations based on experimental evidence. Additionally, our integrated proteomics and multisourced transcriptomics approach provides stronger evidence for annotation refinement than proteomic data alone, which helps to address the dilemma of one-hit wonders (uncertainties supported by only one peptide).
Collapse
|
20
|
Reddy PJ, Atak A, Ghantasala S, Kumar S, Gupta S, Prasad TSK, Zingde SM, Srivastava S. Proteomics research in India: an update. J Proteomics 2015; 127:7-17. [PMID: 25868663 DOI: 10.1016/j.jprot.2015.04.002] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2015] [Accepted: 04/06/2015] [Indexed: 02/04/2023]
Abstract
After a successful completion of the Human Genome Project, deciphering the mystery surrounding the human proteome posed a major challenge. Despite not being largely involved in the Human Genome Project, the Indian scientific community contributed towards proteomic research along with the global community. Currently, more than 76 research/academic institutes and nearly 145 research labs are involved in core proteomic research across India. The Indian researchers have been major contributors in drafting the "human proteome map" along with international efforts. In addition to this, virtual proteomics labs, proteomics courses and remote triggered proteomics labs have helped to overcome the limitations of proteomics education posed due to expensive lab infrastructure. The establishment of Proteomics Society, India (PSI) has created a platform for the Indian proteomic researchers to share ideas, research collaborations and conduct annual conferences and workshops. Indian proteomic research is really moving forward with the global proteomics community in a quest to solve the mysteries of proteomics. A draft map of the human proteome enhances the enthusiasm among intellectuals to promote proteomic research in India to the world.This article is part of a Special Issue entitled: Proteomics in India.
Collapse
Affiliation(s)
- Panga Jaipal Reddy
- Department of Biosciences and Bioengineering, Indian Institute of Technology Bombay, Powai, Mumbai 400076, India
| | - Apurva Atak
- Department of Biosciences and Bioengineering, Indian Institute of Technology Bombay, Powai, Mumbai 400076, India
| | - Saicharan Ghantasala
- Department of Biosciences and Bioengineering, Indian Institute of Technology Bombay, Powai, Mumbai 400076, India
| | - Saurabh Kumar
- Department of Biosciences and Bioengineering, Indian Institute of Technology Bombay, Powai, Mumbai 400076, India
| | - Shabarni Gupta
- Department of Biosciences and Bioengineering, Indian Institute of Technology Bombay, Powai, Mumbai 400076, India
| | - T S Keshava Prasad
- Institute of Bioinformatics, International Tech Park, Whitefield, Bangalore 560066, India
| | - Surekha M Zingde
- CH3-53 Kendriya Vihar, Kharghar, Navi Mumbai, 410210, India. http://www.psindia.org
| | - Sanjeeva Srivastava
- Department of Biosciences and Bioengineering, Indian Institute of Technology Bombay, Powai, Mumbai 400076, India.
| |
Collapse
|
21
|
A comprehensive proteomic analysis of totarol induced alterations in Bacillus subtilis by multipronged quantitative proteomics. J Proteomics 2015; 114:247-62. [DOI: 10.1016/j.jprot.2014.10.025] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2014] [Revised: 09/28/2014] [Accepted: 10/20/2014] [Indexed: 12/25/2022]
|
22
|
Renuse S, Madugundu AK, Kumar P, Nair BG, Gowda H, Prasad TSK, Pandey A. Proteomic analysis and genome annotation ofPichia pastoris, a recombinant protein expression host. Proteomics 2014; 14:2769-79. [DOI: 10.1002/pmic.201400267] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2014] [Revised: 10/01/2014] [Accepted: 10/20/2014] [Indexed: 12/12/2022]
Affiliation(s)
- Santosh Renuse
- Institute of Bioinformatics; International Technology Park; Bangalore India
- Amrita School of Biotechnology; Amrita Vishwa Vidyapeetham; Kollam India
| | - Anil K. Madugundu
- Institute of Bioinformatics; International Technology Park; Bangalore India
- Centre of Excellence in Bioinformatics, School of Life Sciences; Pondicherry University; Puducherry India
| | - Praveen Kumar
- Institute of Bioinformatics; International Technology Park; Bangalore India
| | - Bipin G. Nair
- Amrita School of Biotechnology; Amrita Vishwa Vidyapeetham; Kollam India
| | - Harsha Gowda
- Institute of Bioinformatics; International Technology Park; Bangalore India
| | - T. S. Keshava Prasad
- Institute of Bioinformatics; International Technology Park; Bangalore India
- Amrita School of Biotechnology; Amrita Vishwa Vidyapeetham; Kollam India
- Centre of Excellence in Bioinformatics, School of Life Sciences; Pondicherry University; Puducherry India
| | - Akhilesh Pandey
- McKusick-Nathans Institute of Genetic Medicine and Departments of Biological Chemistry, Oncology and Pathology; Johns Hopkins University School of Medicine; Baltimore MD USA
| |
Collapse
|
23
|
Kucharova V, Wiker HG. Proteogenomics in microbiology: taking the right turn at the junction of genomics and proteomics. Proteomics 2014; 14:2360-675. [PMID: 25263021 DOI: 10.1002/pmic.201400168] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2014] [Revised: 08/18/2014] [Accepted: 09/23/2014] [Indexed: 12/14/2022]
Abstract
High-accuracy and high-throughput proteomic methods have completely changed the way we can identify and characterize proteins. MS-based proteomics can now provide a unique supplement to genomic data and add a new level of information to the interpretation of genomic sequences. Proteomics-driven genome annotation has become especially relevant in microbiology where genomes are sequenced on a daily basis and limitations of an in silico driven annotation process are well recognized. In this review paper, we outline different strategies on how one can design a proteogenomic experiment, for example on genome-sequenced (synonymous proteogenomics) versus unsequenced organisms (ortho-proteogenomics) or with the aid of other "omic" data such as RNA-seq. We touch upon many challenges that are encountered during a typical proteogenomic study, mostly concerning bioinformatics methods and downstream data analysis, but also related to creation and use of sequence databases. A large list of proteogenomic case studies of different microorganisms is provided to illustrate the mapping of MS/MS-derived peptide spectra to genomic DNA sequences. These investigations have led to accurate determination of translational initiation sites, pointed out eventual read-throughs or programmed frameshifts, detected signal peptide processing or other protein maturation events, removed questionable annotation assignments, and provided evidence for predicted hypothetical proteins.
Collapse
Affiliation(s)
- Veronika Kucharova
- Department of Clinical Science, The Gade Research Group for Infection and Immunity, University of Bergen, Norway
| | | |
Collapse
|
24
|
Pawar H, Renuse S, Khobragade SN, Chavan S, Sathe G, Kumar P, Mahale KN, Gore K, Kulkarni A, Dixit T, Raju R, Prasad TSK, Harsha HC, Patole MS, Pandey A. Neglected Tropical Diseases and Omics Science: Proteogenomics Analysis of the Promastigote Stage ofLeishmania majorParasite. OMICS-A JOURNAL OF INTEGRATIVE BIOLOGY 2014; 18:499-512. [DOI: 10.1089/omi.2013.0159] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Affiliation(s)
- Harsh Pawar
- Institute of Bioinformatics, International Technology Park, Bangalore, India
- Rajiv Gandhi University of Health Sciences, Bangalore, India
| | - Santosh Renuse
- Institute of Bioinformatics, International Technology Park, Bangalore, India
- Department of Biotechnology, Amrita Vishwa Vidyapeetham, Kollam, India
| | | | - Sandip Chavan
- Institute of Bioinformatics, International Technology Park, Bangalore, India
- Manipal University, Madhav Nagar, Manipal, India
| | - Gajanan Sathe
- Institute of Bioinformatics, International Technology Park, Bangalore, India
- Manipal University, Madhav Nagar, Manipal, India
| | - Praveen Kumar
- Institute of Bioinformatics, International Technology Park, Bangalore, India
| | | | | | | | - Tanwi Dixit
- National Centre for Cell Sciences, Pune, India
| | - Rajesh Raju
- Institute of Bioinformatics, International Technology Park, Bangalore, India
| | | | - H. C. Harsha
- Institute of Bioinformatics, International Technology Park, Bangalore, India
| | | | - Akhilesh Pandey
- McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins University School of Medicine, Baltimore, Maryland
- Department of Biological Chemistry, Johns Hopkins University School of Medicine, Baltimore, Maryland
- Department of Oncology, Johns Hopkins University School of Medicine, Baltimore, Maryland
- Department of Pathology, Johns Hopkins University School of Medicine, Baltimore, Maryland
| |
Collapse
|
25
|
Kelkar DS, Provost E, Chaerkady R, Muthusamy B, Manda SS, Subbannayya T, Selvan LDN, Wang CH, Datta KK, Woo S, Dwivedi SB, Renuse S, Getnet D, Huang TC, Kim MS, Pinto SM, Mitchell CJ, Madugundu AK, Kumar P, Sharma J, Advani J, Dey G, Balakrishnan L, Syed N, Nanjappa V, Subbannayya Y, Goel R, Prasad TSK, Bafna V, Sirdeshmukh R, Gowda H, Wang C, Leach SD, Pandey A. Annotation of the zebrafish genome through an integrated transcriptomic and proteomic analysis. Mol Cell Proteomics 2014; 13:3184-98. [PMID: 25060758 DOI: 10.1074/mcp.m114.038299] [Citation(s) in RCA: 46] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
Accurate annotation of protein-coding genes is one of the primary tasks upon the completion of whole genome sequencing of any organism. In this study, we used an integrated transcriptomic and proteomic strategy to validate and improve the existing zebrafish genome annotation. We undertook high-resolution mass-spectrometry-based proteomic profiling of 10 adult organs, whole adult fish body, and two developmental stages of zebrafish (SAT line), in addition to transcriptomic profiling of six organs. More than 7,000 proteins were identified from proteomic analyses, and ∼ 69,000 high-confidence transcripts were assembled from the RNA sequencing data. Approximately 15% of the transcripts mapped to intergenic regions, the majority of which are likely long non-coding RNAs. These high-quality transcriptomic and proteomic data were used to manually reannotate the zebrafish genome. We report the identification of 157 novel protein-coding genes. In addition, our data led to modification of existing gene structures including novel exons, changes in exon coordinates, changes in frame of translation, translation in annotated UTRs, and joining of genes. Finally, we discovered four instances of genome assembly errors that were supported by both proteomic and transcriptomic data. Our study shows how an integrative analysis of the transcriptome and the proteome can extend our understanding of even well-annotated genomes.
Collapse
Affiliation(s)
- Dhanashree S Kelkar
- From the *Institute of Bioinformatics, International Technology Park, Bangalore 560 066, India; ‡Amrita School of Biotechnology, Amrita University, Kollam 690 525, India
| | - Elayne Provost
- §Department of Surgery, Johns Hopkins University, Baltimore, Maryland 21205
| | - Raghothama Chaerkady
- ¶McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins University, Baltimore, Maryland 21205
| | - Babylakshmi Muthusamy
- From the *Institute of Bioinformatics, International Technology Park, Bangalore 560 066, India; ‖Centre of Excellence in Bioinformatics, School of Life Sciences, Pondicherry University, Puducherry 605014, India
| | - Srikanth S Manda
- From the *Institute of Bioinformatics, International Technology Park, Bangalore 560 066, India; ‖Centre of Excellence in Bioinformatics, School of Life Sciences, Pondicherry University, Puducherry 605014, India; **Departments of Biological Chemistry, Johns Hopkins University School of Medicine, Baltimore, Maryland 21205
| | - Tejaswini Subbannayya
- From the *Institute of Bioinformatics, International Technology Park, Bangalore 560 066, India; ‡Amrita School of Biotechnology, Amrita University, Kollam 690 525, India
| | - Lakshmi Dhevi N Selvan
- From the *Institute of Bioinformatics, International Technology Park, Bangalore 560 066, India; ‡Amrita School of Biotechnology, Amrita University, Kollam 690 525, India
| | - Chieh-Huei Wang
- ¶McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins University, Baltimore, Maryland 21205
| | - Keshava K Datta
- From the *Institute of Bioinformatics, International Technology Park, Bangalore 560 066, India; ‡‡School of Biotechnology, KIIT University, Bhubaneswar, Odisha 751024, India
| | - Sunghee Woo
- §§Department of Computer Science, University of California, San Diego, California 92093
| | - Sutopa B Dwivedi
- From the *Institute of Bioinformatics, International Technology Park, Bangalore 560 066, India; ‡Amrita School of Biotechnology, Amrita University, Kollam 690 525, India
| | - Santosh Renuse
- From the *Institute of Bioinformatics, International Technology Park, Bangalore 560 066, India; ‡Amrita School of Biotechnology, Amrita University, Kollam 690 525, India
| | - Derese Getnet
- ¶McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins University, Baltimore, Maryland 21205
| | - Tai-Chung Huang
- ¶McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins University, Baltimore, Maryland 21205
| | - Min-Sik Kim
- ¶McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins University, Baltimore, Maryland 21205; **Departments of Biological Chemistry, Johns Hopkins University School of Medicine, Baltimore, Maryland 21205
| | - Sneha M Pinto
- From the *Institute of Bioinformatics, International Technology Park, Bangalore 560 066, India; ¶McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins University, Baltimore, Maryland 21205; ¶¶Manipal University, Madhav Nagar, Manipal, Karnataka 576104, India
| | - Christopher J Mitchell
- ¶McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins University, Baltimore, Maryland 21205
| | - Anil K Madugundu
- From the *Institute of Bioinformatics, International Technology Park, Bangalore 560 066, India
| | - Praveen Kumar
- From the *Institute of Bioinformatics, International Technology Park, Bangalore 560 066, India
| | - Jyoti Sharma
- From the *Institute of Bioinformatics, International Technology Park, Bangalore 560 066, India; ¶¶Manipal University, Madhav Nagar, Manipal, Karnataka 576104, India
| | - Jayshree Advani
- From the *Institute of Bioinformatics, International Technology Park, Bangalore 560 066, India
| | - Gourav Dey
- From the *Institute of Bioinformatics, International Technology Park, Bangalore 560 066, India; ¶¶Manipal University, Madhav Nagar, Manipal, Karnataka 576104, India
| | - Lavanya Balakrishnan
- From the *Institute of Bioinformatics, International Technology Park, Bangalore 560 066, India; ‖‖Department of Biotechnology, Kuvempu University, Shimoga 577 451, India
| | - Nazia Syed
- From the *Institute of Bioinformatics, International Technology Park, Bangalore 560 066, India; Department of Biochemistry and Molecular Biology, School of Life Sciences, Pondicherry University, Puducherry 605 014, India
| | - Vishalakshi Nanjappa
- From the *Institute of Bioinformatics, International Technology Park, Bangalore 560 066, India; ‡Amrita School of Biotechnology, Amrita University, Kollam 690 525, India
| | - Yashwanth Subbannayya
- From the *Institute of Bioinformatics, International Technology Park, Bangalore 560 066, India
| | - Renu Goel
- From the *Institute of Bioinformatics, International Technology Park, Bangalore 560 066, India
| | - T S Keshava Prasad
- From the *Institute of Bioinformatics, International Technology Park, Bangalore 560 066, India; ‡Amrita School of Biotechnology, Amrita University, Kollam 690 525, India; ‖Centre of Excellence in Bioinformatics, School of Life Sciences, Pondicherry University, Puducherry 605014, India; ¶¶Manipal University, Madhav Nagar, Manipal, Karnataka 576104, India
| | - Vineet Bafna
- §§Department of Computer Science, University of California, San Diego, California 92093
| | - Ravi Sirdeshmukh
- From the *Institute of Bioinformatics, International Technology Park, Bangalore 560 066, India
| | - Harsha Gowda
- From the *Institute of Bioinformatics, International Technology Park, Bangalore 560 066, India
| | - Charles Wang
- The Center for Genomics and Division of Microbiology & Molecular Genetics, School of Medicine, Loma Linda University, Loma Linda, California 92350;
| | - Steven D Leach
- §Department of Surgery, Johns Hopkins University, Baltimore, Maryland 21205; ¶McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins University, Baltimore, Maryland 21205;
| | - Akhilesh Pandey
- From the *Institute of Bioinformatics, International Technology Park, Bangalore 560 066, India; ¶McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins University, Baltimore, Maryland 21205; **Departments of Biological Chemistry, Johns Hopkins University School of Medicine, Baltimore, Maryland 21205; Sol Goldman Pancreatic Cancer Research Center, Department of Pathology, Johns Hopkins University School of Medicine, Baltimore, Maryland 21205; Department of Oncology, Johns Hopkins University School of Medicine, Baltimore, Maryland 21205
| |
Collapse
|
26
|
Manda SS, Nirujogi RS, Pinto SM, Kim MS, Datta KK, Sirdeshmukh R, Prasad TSK, Thongboonkerd V, Pandey A, Gowda H. Identification and Characterization of Proteins Encoded by Chromosome 12 as Part of Chromosome-centric Human Proteome Project. J Proteome Res 2014; 13:3166-77. [DOI: 10.1021/pr401123v] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]
Affiliation(s)
- Srikanth Srinivas Manda
- Institute
of Bioinformatics, International Technology Park, Bangalore 560066, India
- Centre
of Excellence in Bioinformatics, Bioinformatics Centre, School of
Life Sciences, Pondicherry University, Puducherry 605014, India
| | - Raja Sekhar Nirujogi
- Institute
of Bioinformatics, International Technology Park, Bangalore 560066, India
- Centre
of Excellence in Bioinformatics, Bioinformatics Centre, School of
Life Sciences, Pondicherry University, Puducherry 605014, India
| | - Sneha Maria Pinto
- Institute
of Bioinformatics, International Technology Park, Bangalore 560066, India
- Manipal University, Madhav Nagar, Manipal 576104, India
| | | | - Keshava K. Datta
- Institute
of Bioinformatics, International Technology Park, Bangalore 560066, India
- School of
Biotechnology, KIIT University, Bhubaneswar, Odisha 751024, India
| | - Ravi Sirdeshmukh
- Institute
of Bioinformatics, International Technology Park, Bangalore 560066, India
| | - T. S. Keshava Prasad
- Institute
of Bioinformatics, International Technology Park, Bangalore 560066, India
| | - Visith Thongboonkerd
- Medical
Proteomics Unit, Office for Research and Development, Faculty of Medicine
Siriraj Hospital, and Center for Research in Complex Systems Science, Mahidol University, Bangkok 10700, Thailand
| | - Akhilesh Pandey
- Institute
of Bioinformatics, International Technology Park, Bangalore 560066, India
| | - Harsha Gowda
- Institute
of Bioinformatics, International Technology Park, Bangalore 560066, India
| |
Collapse
|
27
|
Dwivedi SB, Muthusamy B, Kumar P, Kim MS, Nirujogi RS, Getnet D, Ahiakonu P, De G, Nair B, Gowda H, Prasad TSK, Kumar N, Pandey A, Okulate M. Brain proteomics of Anopheles gambiae. OMICS-A JOURNAL OF INTEGRATIVE BIOLOGY 2014; 18:421-37. [PMID: 24937107 DOI: 10.1089/omi.2014.0007] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Abstract
Anopheles gambiae has a well-adapted system for host localization, feeding, and mating behavior, which are all governed by neuronal processes in the brain. However, there are no published reports characterizing the brain proteome to elucidate neuronal signaling mechanisms in the vector. To this end, a large-scale mapping of the brain proteome of An. gambiae was carried out using high resolution tandem mass spectrometry, revealing a repertoire of >1800 proteins, of which 15% could not be assigned any function. A large proportion of the identified proteins were predicted to be involved in diverse biological processes including metabolism, transport, protein synthesis, and olfaction. This study also led to the identification of 10 GPCR classes of proteins, which could govern sensory pathways in mosquitoes. Proteins involved in metabolic and neural processes, chromatin modeling, and synaptic vesicle transport associated with neuronal transmission were predominantly expressed in the brain. Proteogenomic analysis expanded our findings with the identification of 15 novel genes and 71 cases of gene refinements, a subset of which were validated by RT-PCR and sequencing. Overall, our study offers valuable insights into the brain physiology of the vector that could possibly open avenues for intervention strategies for malaria in the future.
Collapse
Affiliation(s)
- Sutopa B Dwivedi
- 1 Institute of Bioinformatics , International Technology Park, Bangalore, Karnataka, India
| | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
28
|
Boja ES, Rodriguez H. Proteogenomic convergence for understanding cancer pathways and networks. Clin Proteomics 2014; 11:22. [PMID: 24994965 PMCID: PMC4067069 DOI: 10.1186/1559-0275-11-22] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/02/2014] [Accepted: 03/31/2014] [Indexed: 11/21/2022] Open
Abstract
During the past several decades, the understanding of cancer at the molecular level has been primarily focused on mechanisms on how signaling molecules transform homeostatically balanced cells into malignant ones within an individual pathway. However, it is becoming more apparent that pathways are dynamic and crosstalk at different control points of the signaling cascades, making the traditional linear signaling models inadequate to interpret complex biological systems. Recent technological advances in high throughput, deep sequencing for the human genomes and proteomic technologies to comprehensively characterize the human proteomes in conjunction with multiplexed targeted proteomic assays to measure panels of proteins involved in biologically relevant pathways have made significant progress in understanding cancer at the molecular level. It is undeniable that proteomic profiling of differentially expressed proteins under many perturbation conditions, or between normal and "diseased" states is important to capture a first glance at the overall proteomic landscape, which has been a main focus of proteomics research during the past 15-20 years. However, the research community is gradually shifting its heavy focus from that initial discovery step to protein target verification using multiplexed quantitative proteomic assays, capable of measuring changes in proteins and their interacting partners, isoforms, and post-translational modifications (PTMs) in response to stimuli in the context of signaling pathways and protein networks. With a critical link to genotypes (i.e., high throughput genomics and transcriptomics data), new and complementary information can be gleaned from multi-dimensional omics data to (1) assess the effect of genomic and transcriptomic aberrations on such complex molecular machinery in the context of cell signaling architectures associated with pathological diseases such as cancer (i.e., from genotype to proteotype to phenotype); and (2) target pathway- and network-driven changes and map the fluctuations of these functional units (proteins) responsible for cellular activities in response to perturbation in a spatiotemporal fashion to better understand cancer biology as a whole system.
Collapse
Affiliation(s)
- Emily S Boja
- Office of Cancer Clinical Proteomics Research, National Cancer Institute, National Institutes of Health, 31 Center Drive, MSC 2580, 20892 Bethesda, MD, USA
| | - Henry Rodriguez
- Office of Cancer Clinical Proteomics Research, National Cancer Institute, National Institutes of Health, 31 Center Drive, MSC 2580, 20892 Bethesda, MD, USA
| |
Collapse
|
29
|
Ahmad KM, Kokošar J, Guo X, Gu Z, Ishchuk OP, Piškur J. Genome structure and dynamics of the yeast pathogen Candida glabrata. FEMS Yeast Res 2014; 14:529-35. [PMID: 24528571 PMCID: PMC4320752 DOI: 10.1111/1567-1364.12145] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2013] [Revised: 02/07/2014] [Accepted: 02/08/2014] [Indexed: 01/09/2023] Open
Abstract
The yeast pathogen Candida glabrata is the second most frequent cause of Candida infections. However, from the phylogenetic point of view, C. glabrata is much closer to Saccharomyces cerevisiae than to Candida albicans. Apparently, this yeast has relatively recently changed its life style and become a successful opportunistic pathogen. Recently, several C. glabrata sister species, among them clinical and environmental isolates, have had their genomes characterized. Also, hundreds of C. glabrata clinical isolates have been characterized for their genomes. These isolates display enormous genomic plasticity. The number and size of chromosomes vary drastically, as well as intra- and interchromosomal segmental duplications occur frequently. The observed genome alterations could affect phenotypic properties and thus help to adapt to the highly variable and harsh habitats this yeast finds in different human patients and their tissues. Further genome sequencing of pathogenic isolates will provide a valuable tool to understand the mechanisms behind genome dynamics and help to elucidate the genes contributing to the virulence potential.
Collapse
|
30
|
Bland C, Hartmann EM, Christie-Oleza JA, Fernandez B, Armengaud J. N-Terminal-oriented proteogenomics of the marine bacterium roseobacter denitrificans Och114 using N-Succinimidyloxycarbonylmethyl)tris(2,4,6-trimethoxyphenyl)phosphonium bromide (TMPP) labeling and diagonal chromatography. Mol Cell Proteomics 2014; 13:1369-81. [PMID: 24536027 DOI: 10.1074/mcp.o113.032854] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
Given the ease of whole genome sequencing with next-generation sequencers, structural and functional gene annotation is now purely based on automated prediction. However, errors in gene structure are frequent, the correct determination of start codons being one of the main concerns. Here, we combine protein N termini derivatization using (N-Succinimidyloxycarbonylmethyl)tris(2,4,6-trimethoxyphenyl)phosphonium bromide (TMPP Ac-OSu) as a labeling reagent with the COmbined FRActional DIagonal Chromatography (COFRADIC) sorting method to enrich labeled N-terminal peptides for mass spectrometry detection. Protein digestion was performed in parallel with three proteases to obtain a reliable automatic validation of protein N termini. The analysis of these N-terminal enriched fractions by high-resolution tandem mass spectrometry allowed the annotation refinement of 534 proteins of the model marine bacterium Roseobacter denitrificans OCh114. This study is especially efficient regarding mass spectrometry analytical time. From the 534 validated N termini, 480 confirmed existing gene annotations, 41 highlighted erroneous start codon annotations, five revealed totally new mis-annotated genes; the mass spectrometry data also suggested the existence of multiple start sites for eight different genes, a result that challenges the current view of protein translation initiation. Finally, we identified several proteins for which classical genome homology-driven annotation was inconsistent, questioning the validity of automatic annotation pipelines and emphasizing the need for complementary proteomic data. All data have been deposited to the ProteomeXchange with identifier PXD000337.
Collapse
Affiliation(s)
- Céline Bland
- CEA, DSV, IBEB, Lab Biochim System Perturb, Bagnols-sur-Cèze, F-30207, France
| | | | | | | | | |
Collapse
|
31
|
Nagarajha Selvan LD, Kaviyil JE, Nirujogi RS, Muthusamy B, Puttamallesh VN, Subbannayya T, Syed N, Radhakrishnan A, Kelkar DS, Ahmad S, Pinto SM, Kumar P, Madugundu AK, Nair B, Chatterjee A, Pandey A, Ravikumar R, Gowda H, Prasad TSK. Proteogenomic analysis of pathogenic yeast Cryptococcus neoformans using high resolution mass spectrometry. Clin Proteomics 2014; 11:5. [PMID: 24484775 PMCID: PMC3915034 DOI: 10.1186/1559-0275-11-5] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2013] [Accepted: 12/17/2013] [Indexed: 12/30/2022] Open
Abstract
Background Cryptococcus neoformans, a basidiomycetous fungus of universal occurrence, is a significant opportunistic human pathogen causing meningitis. Owing to an increase in the number of immunosuppressed individuals along with emergence of drug-resistant strains, C. neoformans is gaining importance as a pathogen. Although, whole genome sequencing of three varieties of C. neoformans has been completed recently, no global proteomic studies have yet been reported. Results We performed a comprehensive proteomic analysis of C. neoformans var. grubii (Serotype A), which is the most virulent variety, in order to provide protein-level evidence for computationally predicted gene models and to refine the existing annotations. We confirmed the protein-coding potential of 3,674 genes from a total of 6,980 predicted protein-coding genes. We also identified 4 novel genes and corrected 104 predicted gene models. In addition, our studies led to the correction of translational start site, splice junctions and reading frame used for translation in a number of proteins. Finally, we validated a subset of our novel findings by RT-PCR and sequencing. Conclusions Proteogenomic investigation described here facilitated the validation and refinement of computationally derived gene models in the intron-rich genome of C. neoformans, an important fungal pathogen in humans.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | - Harsha Gowda
- Institute of Bioinformatics, International Technology Park, Bangalore 560 066, India.
| | | |
Collapse
|
32
|
Nirujogi RS, Pawar H, Renuse S, Kumar P, Chavan S, Sathe G, Sharma J, Khobragade S, Pande J, Modak B, Prasad TSK, Harsha HC, Patole MS, Pandey A. Moving from unsequenced to sequenced genome: reanalysis of the proteome of Leishmania donovani. J Proteomics 2014; 97:48-61. [PMID: 23665000 PMCID: PMC4710096 DOI: 10.1016/j.jprot.2013.04.021] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2012] [Revised: 04/02/2013] [Accepted: 04/11/2013] [Indexed: 10/26/2022]
Abstract
The kinetoplastid protozoan parasite, Leishmania donovani, is the causative agent of kala azar or visceral leishmaniasis. Kala azar is a severe form of leishmaniasis that is fatal in the majority of untreated cases. Studies on proteomic analysis of L. donovani thus far have been carried out using homology-based identification based on related Leishmania species (L. infantum, L. major and L. braziliensis) whose genomes have been sequenced. Recently, the genome of L. donovani was fully sequenced and the data became publicly available. We took advantage of the availability of its genomic sequence to carry out a more accurate proteogenomic analysis of L. donovani proteome using our previously generated dataset. This resulted in identification of 17,504 unique peptides upon database-dependent search against the annotated proteins in L. donovani. These peptides were assigned to 3999 unique proteins in L. donovani. 2296 proteins were identified in both the life stages of L. donovani, while 613 and 1090 proteins were identified only from amastigote and promastigote stages, respectively. The proteomic data was also searched against six-frame translated L. donovani genome, which led to 255 genome search-specific peptides (GSSPs) resulting in identification of 20 novel genes and correction of 40 existing gene models in L. donovani. BIOLOGICAL SIGNIFICANCE Leishmania donovani genome sequencing was recently completed, which permitted us to use a proteogenomic approach to map its proteome and to carry out annotation of it genome. This resulted in mapping of 50% (3999 proteins) of L. donovani proteome. Our study identified 20 novel genes previously not predicted from the L. donovani genome in addition to correcting annotations of 40 existing gene models. The identified proteins may help in better understanding of stage-specific protein expression profiles in L. donovani and to identify novel stage-specific drug targets in L. donovani which could be used in the treatment of leishmaniasis. This article is part of a Special Issue entitled: Trends in Microbial Proteomics.
Collapse
Affiliation(s)
- Raja Sekhar Nirujogi
- Institute of Bioinformatics, International Technology Park, Bangalore 560066, India; Bioinformatics Centre, School of Life Sciences, Pondicherry University, Puducherry 605014, India
| | - Harsh Pawar
- Institute of Bioinformatics, International Technology Park, Bangalore 560066, India; Rajiv Gandhi University of Health Sciences, Bangalore 560041, India
| | - Santosh Renuse
- Institute of Bioinformatics, International Technology Park, Bangalore 560066, India; Department of Biotechnology, Amrita Vishwa Vidyapeetham, Kollam 690525, India
| | - Praveen Kumar
- Institute of Bioinformatics, International Technology Park, Bangalore 560066, India
| | - Sandip Chavan
- Institute of Bioinformatics, International Technology Park, Bangalore 560066, India; Manipal University, Madhav Nagar, Manipal 576104, India
| | - Gajanan Sathe
- Institute of Bioinformatics, International Technology Park, Bangalore 560066, India; Manipal University, Madhav Nagar, Manipal 576104, India
| | - Jyoti Sharma
- Institute of Bioinformatics, International Technology Park, Bangalore 560066, India; Manipal University, Madhav Nagar, Manipal 576104, India
| | | | | | - Bhakti Modak
- National Centre for Cell Sciences, Pune 411007, India
| | - T S Keshava Prasad
- Institute of Bioinformatics, International Technology Park, Bangalore 560066, India; Bioinformatics Centre, School of Life Sciences, Pondicherry University, Puducherry 605014, India; Manipal University, Madhav Nagar, Manipal 576104, India
| | - H C Harsha
- Institute of Bioinformatics, International Technology Park, Bangalore 560066, India
| | | | - Akhilesh Pandey
- McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins University School of Medicine, Baltimore 21205, MD, USA; Department of Biological Chemistry, Johns Hopkins University School of Medicine, Baltimore 21205, MD, USA; Department of Oncology, Johns Hopkins University School of Medicine, Baltimore 21205, MD, USA; Department of Pathology, Johns Hopkins University School of Medicine, Baltimore 21205, MD, USA.
| |
Collapse
|
33
|
Tanca A, Palomba A, Deligios M, Cubeddu T, Fraumene C, Biosa G, Pagnozzi D, Addis MF, Uzzau S. Evaluating the impact of different sequence databases on metaproteome analysis: insights from a lab-assembled microbial mixture. PLoS One 2013; 8:e82981. [PMID: 24349410 PMCID: PMC3857319 DOI: 10.1371/journal.pone.0082981] [Citation(s) in RCA: 83] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2013] [Accepted: 10/30/2013] [Indexed: 01/10/2023] Open
Abstract
Metaproteomics enables the investigation of the protein repertoire expressed by complex microbial communities. However, to unleash its full potential, refinements in bioinformatic approaches for data analysis are still needed. In this context, sequence databases selection represents a major challenge. This work assessed the impact of different databases in metaproteomic investigations by using a mock microbial mixture including nine diverse bacterial and eukaryotic species, which was subjected to shotgun metaproteomic analysis. Then, both the microbial mixture and the single microorganisms were subjected to next generation sequencing to obtain experimental metagenomic- and genomic-derived databases, which were used along with public databases (namely, NCBI, UniProtKB/SwissProt and UniProtKB/TrEMBL, parsed at different taxonomic levels) to analyze the metaproteomic dataset. First, a quantitative comparison in terms of number and overlap of peptide identifications was carried out among all databases. As a result, only 35% of peptides were common to all database classes; moreover, genus/species-specific databases provided up to 17% more identifications compared to databases with generic taxonomy, while the metagenomic database enabled a slight increment in respect to public databases. Then, database behavior in terms of false discovery rate and peptide degeneracy was critically evaluated. Public databases with generic taxonomy exhibited a markedly different trend compared to the counterparts. Finally, the reliability of taxonomic attribution according to the lowest common ancestor approach (using MEGAN and Unipept software) was assessed. The level of misassignments varied among the different databases, and specific thresholds based on the number of taxon-specific peptides were established to minimize false positives. This study confirms that database selection has a significant impact in metaproteomics, and provides critical indications for improving depth and reliability of metaproteomic results. Specifically, the use of iterative searches and of suitable filters for taxonomic assignments is proposed with the aim of increasing coverage and trustworthiness of metaproteomic data.
Collapse
Affiliation(s)
- Alessandro Tanca
- Porto Conte Ricerche Srl, Tramariglio, Alghero, Italy
- Dipartimento di Scienze Biomediche, Università di Sassari, Sassari, Italy
| | - Antonio Palomba
- Dipartimento di Scienze Biomediche, Università di Sassari, Sassari, Italy
| | - Massimo Deligios
- Porto Conte Ricerche Srl, Tramariglio, Alghero, Italy
- Dipartimento di Scienze Biomediche, Università di Sassari, Sassari, Italy
| | | | | | - Grazia Biosa
- Porto Conte Ricerche Srl, Tramariglio, Alghero, Italy
| | | | - Maria Filippa Addis
- Porto Conte Ricerche Srl, Tramariglio, Alghero, Italy
- Dipartimento di Scienze Biomediche, Università di Sassari, Sassari, Italy
- * E-mail: (MFA); (SU)
| | - Sergio Uzzau
- Porto Conte Ricerche Srl, Tramariglio, Alghero, Italy
- Dipartimento di Scienze Biomediche, Università di Sassari, Sassari, Italy
- * E-mail: (MFA); (SU)
| |
Collapse
|
34
|
Blakeley P, Overton IM, Hubbard SJ. Addressing statistical biases in nucleotide-derived protein databases for proteogenomic search strategies. J Proteome Res 2012; 11:5221-34. [PMID: 23025403 PMCID: PMC3703792 DOI: 10.1021/pr300411q] [Citation(s) in RCA: 68] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Proteogenomics has the potential to advance genome annotation through high quality peptide identifications derived from mass spectrometry experiments, which demonstrate a given gene or isoform is expressed and translated at the protein level. This can advance our understanding of genome function, discovering novel genes and gene structure that have not yet been identified or validated. Because of the high-throughput shotgun nature of most proteomics experiments, it is essential to carefully control for false positives and prevent any potential misannotation. A number of statistical procedures to deal with this are in wide use in proteomics, calculating false discovery rate (FDR) and posterior error probability (PEP) values for groups and individual peptide spectrum matches (PSMs). These methods control for multiple testing and exploit decoy databases to estimate statistical significance. Here, we show that database choice has a major effect on these confidence estimates leading to significant differences in the number of PSMs reported. We note that standard target:decoy approaches using six-frame translations of nucleotide sequences, such as assembled transcriptome data, apparently underestimate the confidence assigned to the PSMs. The source of this error stems from the inflated and unusual nature of the six-frame database, where for every target sequence there exists five "incorrect" targets that are unlikely to code for protein. The attendant FDR and PEP estimates lead to fewer accepted PSMs at fixed thresholds, and we show that this effect is a product of the database and statistical modeling and not the search engine. A variety of approaches to limit database size and remove noncoding target sequences are examined and discussed in terms of the altered statistical estimates generated and PSMs reported. These results are of importance to groups carrying out proteogenomics, aiming to maximize the validation and discovery of gene structure in sequenced genomes, while still controlling for false positives.
Collapse
Affiliation(s)
- Paul Blakeley
- Faculty of Life Sciences, The University of Manchester, Manchester M13 9PT, UK
| | | | | |
Collapse
|
35
|
Abstract
High-throughput identification of proteins with the latest generation of hybrid high-resolution mass spectrometers is opening new perspectives in microbiology. I present, here, an overview of tandem mass spectrometry technology and bioinformatics for shotgun proteomics that make 2D-PAGE approaches obsolete. Non-labelling quantitative approaches have become more popular than labelling techniques on most proteomic platforms because they are easier to carry out while their quantitative outcome is rather robust. Parameters for recording mass spectrometry data, however, need to be chosen carefully and statistics to assess the confidence of the results should not be neglected. Interestingly, next-generation sequencing methodologies make any microbial model quickly amenable to proteomics, leading to the documentation of a wide range of organisms from diverse environments. Some recent discoveries made using microbial proteomics have challenged some biological dogma, such as: (i) initiation of the translation does not occur predominantly from ATG codons in some microorganisms, (ii) non-canonical initiation codons are used to regulate the production of specific but important proteins and (iii) a gene may code for multiple polypeptide species, heterogeneous in terms of sequences. Microbial diversity and microbial physiology can now be revisited by means of exhaustive comparative proteomic surveys where thousands of proteins are detected and quantified. Proteogenomics, consisting of better annotating of genomes with the help of proteomic evidence, is paving the way for integrated multi-omic approaches in microbiology. Finally, meta-proteomic tools and approaches are emerging for tackling the high complexity of the microbial world as a whole, opening new perspectives for assessing how microbial communities function.
Collapse
Affiliation(s)
- Jean Armengaud
- CEA, DSV, IBEB, Lab Biochim System Perturb, F-30207 Bagnols-sur-Cèze, France.
| |
Collapse
|
36
|
Schrimpe-Rutledge AC, Jones MB, Chauhan S, Purvine SO, Sanford JA, Monroe ME, Brewer HM, Payne SH, Ansong C, Frank BC, Smith RD, Peterson SN, Motin VL, Adkins JN. Comparative omics-driven genome annotation refinement: application across Yersiniae. PLoS One 2012; 7:e33903. [PMID: 22479471 PMCID: PMC3313959 DOI: 10.1371/journal.pone.0033903] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2011] [Accepted: 02/19/2012] [Indexed: 02/03/2023] Open
Abstract
Genome sequencing continues to be a rapidly evolving technology, yet most downstream aspects of genome annotation pipelines remain relatively stable or are even being abandoned. The annotation process is now performed almost exclusively in an automated fashion to balance the large number of sequences generated. One possible way of reducing errors inherent to automated computational annotations is to apply data from omics measurements (i.e. transcriptional and proteomic) to the un-annotated genome with a proteogenomic-based approach. Here, the concept of annotation refinement has been extended to include a comparative assessment of genomes across closely related species. Transcriptomic and proteomic data derived from highly similar pathogenic Yersiniae (Y. pestis CO92, Y. pestis Pestoides F, and Y. pseudotuberculosis PB1/+) was used to demonstrate a comprehensive comparative omic-based annotation methodology. Peptide and oligo measurements experimentally validated the expression of nearly 40% of each strain's predicted proteome and revealed the identification of 28 novel and 68 incorrect (i.e., observed frameshifts, extended start sites, and translated pseudogenes) protein-coding sequences within the three current genome annotations. Gene loss is presumed to play a major role in Y. pestis acquiring its niche as a virulent pathogen, thus the discovery of many translated pseudogenes, including the insertion-ablated argD, underscores a need for functional analyses to investigate hypotheses related to divergence. Refinements included the discovery of a seemingly essential ribosomal protein, several virulence-associated factors, a transcriptional regulator, and many hypothetical proteins that were missed during annotation.
Collapse
Affiliation(s)
| | - Marcus B. Jones
- J. Craig Venter Institute, Rockville, Maryland, United States of America
| | - Sadhana Chauhan
- University of Texas Medical Branch, Galveston, Texas, United States of America
| | - Samuel O. Purvine
- Environmental Molecular Sciences Laboratory, Pacific Northwest National Laboratory, Richland, Washington, United States of America
| | - James A. Sanford
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington, United States of America
| | - Matthew E. Monroe
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington, United States of America
| | - Heather M. Brewer
- Environmental Molecular Sciences Laboratory, Pacific Northwest National Laboratory, Richland, Washington, United States of America
| | - Samuel H. Payne
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington, United States of America
| | - Charles Ansong
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington, United States of America
| | - Bryan C. Frank
- J. Craig Venter Institute, Rockville, Maryland, United States of America
| | - Richard D. Smith
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington, United States of America
| | - Scott N. Peterson
- J. Craig Venter Institute, Rockville, Maryland, United States of America
| | - Vladimir L. Motin
- University of Texas Medical Branch, Galveston, Texas, United States of America
| | - Joshua N. Adkins
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington, United States of America
- * E-mail:
| |
Collapse
|