1
|
Aryal S, Bonanno K, Song B, Mani DR, Keshishian H, Carr SA, Sheng M, Dejanovic B. Deep proteomics identifies shared molecular pathway alterations in synapses of patients with schizophrenia and bipolar disorder and mouse model. Cell Rep 2023; 42:112497. [PMID: 37171958 DOI: 10.1016/j.celrep.2023.112497] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2022] [Revised: 03/10/2023] [Accepted: 04/26/2023] [Indexed: 05/14/2023] Open
Abstract
Synaptic dysfunction is implicated in the pathophysiology of schizophrenia (SCZ) and bipolar disorder (BP). We use quantitative mass spectrometry to carry out deep, unbiased proteomic profiling of synapses purified from the dorsolateral prefrontal cortex of 35 cases of SCZ, 35 cases of BP, and 35 controls. Compared with controls, SCZ and BP synapses show substantial and similar proteomic alterations. Network analyses reveal upregulation of proteins associated with autophagy and certain vesicle transport pathways and downregulation of proteins related to synaptic, mitochondrial, and ribosomal function in the synapses of individuals with SCZ or BP. Some of the same pathways are similarly dysregulated in the synaptic proteome of mutant mice deficient in Akap11, a recently discovered shared risk gene for SCZ and BP. Our work provides biological insights into molecular dysfunction at the synapse in SCZ and BP and serves as a resource for understanding the pathophysiology of these disorders.
Collapse
Affiliation(s)
- Sameer Aryal
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Kevin Bonanno
- The Proteomics Platform, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Bryan Song
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - D R Mani
- The Proteomics Platform, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Hasmik Keshishian
- The Proteomics Platform, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Steven A Carr
- The Proteomics Platform, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Morgan Sheng
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA; Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, MA 02142, USA.
| | - Borislav Dejanovic
- Stanley Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA.
| |
Collapse
|
2
|
Aggarwal S, Raj A, Kumar D, Dash D, Yadav AK. False discovery rate: the Achilles' heel of proteogenomics. Brief Bioinform 2022; 23:6582880. [PMID: 35534181 DOI: 10.1093/bib/bbac163] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2021] [Revised: 03/14/2022] [Accepted: 04/12/2022] [Indexed: 12/25/2022] Open
Abstract
Proteogenomics refers to the integrated analysis of the genome and proteome that leverages mass-spectrometry (MS)-based proteomics data to improve genome annotations, understand gene expression control through proteoforms and find sequence variants to develop novel insights for disease classification and therapeutic strategies. However, proteogenomic studies often suffer from reduced sensitivity and specificity due to inflated database size. To control the error rates, proteogenomics depends on the target-decoy search strategy, the de-facto method for false discovery rate (FDR) estimation in proteomics. The proteogenomic databases constructed from three- or six-frame nucleotide database translation not only increase the search space and compute-time but also violate the equivalence of target and decoy databases. These searches result in poorer separation between target and decoy scores, leading to stringent FDR thresholds. Understanding these factors and applying modified strategies such as two-pass database search or peptide-class-specific FDR can result in a better interpretation of MS data without introducing additional statistical biases. Based on these considerations, a user can interpret the proteogenomics results appropriately and control false positives and negatives in a more informed manner. In this review, first, we briefly discuss the proteogenomic workflows and limitations in database construction, followed by various considerations that can influence potential novel discoveries in a proteogenomic study. We conclude with suggestions to counter these challenges for better proteogenomic data interpretation.
Collapse
Affiliation(s)
- Suruchi Aggarwal
- Translational Health Science and Technology Institute, NCR Biotech Science Cluster, 3rd milestone, PO Box No. 04, Faridabad-Gurgaon Expressway, Faridabad-121001, Haryana, India
| | - Anurag Raj
- GN Ramachandran Knowledge Centre for Genome Informatics, CSIR-Institute of Genomics & Integrative Biology, South Campus, Mathura Road, New Delhi 110025, India.,Academy of Scientific and Innovative Research (AcSIR), Ghaziabad-201002, India
| | - Dhirendra Kumar
- GN Ramachandran Knowledge Centre for Genome Informatics, CSIR-Institute of Genomics & Integrative Biology, South Campus, Mathura Road, New Delhi 110025, India
| | - Debasis Dash
- GN Ramachandran Knowledge Centre for Genome Informatics, CSIR-Institute of Genomics & Integrative Biology, South Campus, Mathura Road, New Delhi 110025, India.,Academy of Scientific and Innovative Research (AcSIR), Ghaziabad-201002, India
| | - Amit Kumar Yadav
- Translational Health Science and Technology Institute, NCR Biotech Science Cluster, 3rd milestone, PO Box No. 04, Faridabad-Gurgaon Expressway, Faridabad-121001, Haryana, India
| |
Collapse
|
3
|
Khadka P, Reitman ZJ, Lu S, Buchan G, Gionet G, Dubois F, Carvalho DM, Shih J, Zhang S, Greenwald NF, Zack T, Shapira O, Pelton K, Hartley R, Bear H, Georgis Y, Jarmale S, Melanson R, Bonanno K, Schoolcraft K, Miller PG, Condurat AL, Gonzalez EM, Qian K, Morin E, Langhnoja J, Lupien LE, Rendo V, Digiacomo J, Wang D, Zhou K, Kumbhani R, Guerra Garcia ME, Sinai CE, Becker S, Schneider R, Vogelzang J, Krug K, Goodale A, Abid T, Kalani Z, Piccioni F, Beroukhim R, Persky NS, Root DE, Carcaboso AM, Ebert BL, Fuller C, Babur O, Kieran MW, Jones C, Keshishian H, Ligon KL, Carr SA, Phoenix TN, Bandopadhayay P. PPM1D mutations are oncogenic drivers of de novo diffuse midline glioma formation. Nat Commun 2022; 13:604. [PMID: 35105861 PMCID: PMC8807747 DOI: 10.1038/s41467-022-28198-8] [Citation(s) in RCA: 24] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2020] [Accepted: 01/07/2022] [Indexed: 12/23/2022] Open
Abstract
The role of PPM1D mutations in de novo gliomagenesis has not been systematically explored. Here we analyze whole genome sequences of 170 pediatric high-grade gliomas and find that truncating mutations in PPM1D that increase the stability of its phosphatase are clonal driver events in 11% of Diffuse Midline Gliomas (DMGs) and are enriched in primary pontine tumors. Through the development of DMG mouse models, we show that PPM1D mutations potentiate gliomagenesis and that PPM1D phosphatase activity is required for in vivo oncogenesis. Finally, we apply integrative phosphoproteomic and functional genomics assays and find that oncogenic effects of PPM1D truncation converge on regulators of cell cycle, DNA damage response, and p53 pathways, revealing therapeutic vulnerabilities including MDM2 inhibition.
Collapse
Affiliation(s)
- Prasidda Khadka
- Department of Cancer Biology, Dana Farber Cancer Institute, Boston, MA, 02215, USA
- Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
- Harvard Biological and Biomedical Sciences PhD Program, Harvard University, Cambridge, MA, 02138, USA
| | - Zachary J Reitman
- Department of Radiation Oncology, Duke University, Durham, NC, 27710, USA
- Duke Cancer Institute, Duke University, Durham, NC, 27710, USA
- The Preston Robert Tisch Brain Tumor Center at Duke, Duke University, Durham, NC, 27710, USA
| | - Sophie Lu
- Dana-Farber/Boston Children's Cancer and Blood Disorders Center, Boston, MA, 02215, USA
| | - Graham Buchan
- Dana-Farber/Boston Children's Cancer and Blood Disorders Center, Boston, MA, 02215, USA
| | - Gabrielle Gionet
- Dana-Farber/Boston Children's Cancer and Blood Disorders Center, Boston, MA, 02215, USA
| | - Frank Dubois
- Department of Cancer Biology, Dana Farber Cancer Institute, Boston, MA, 02215, USA
- Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
| | - Diana M Carvalho
- Division of Molecular Pathology, Institute of Cancer Research, London, UK
| | - Juliann Shih
- Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
| | - Shu Zhang
- Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
| | - Noah F Greenwald
- Department of Cancer Biology, Dana Farber Cancer Institute, Boston, MA, 02215, USA
| | - Travis Zack
- Department of Cancer Biology, Dana Farber Cancer Institute, Boston, MA, 02215, USA
| | - Ofer Shapira
- Department of Cancer Biology, Dana Farber Cancer Institute, Boston, MA, 02215, USA
| | - Kristine Pelton
- Department of Oncologic Pathology, Dana Farber Cancer Institute, Boston, MA, 02215, USA
| | - Rachel Hartley
- Division of Pharmaceutical Sciences, James L. Winkle College of Pharmacy, University of Cincinnati, Cincinnati, OH, 45267, USA
| | - Heather Bear
- Research in Patient Services, Cincinnati Children's Hospital Medical Center, Cincinnati, OH, 45267, USA
| | - Yohanna Georgis
- Dana-Farber/Boston Children's Cancer and Blood Disorders Center, Boston, MA, 02215, USA
| | - Spandana Jarmale
- Dana-Farber/Boston Children's Cancer and Blood Disorders Center, Boston, MA, 02215, USA
| | - Randy Melanson
- Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
| | - Kevin Bonanno
- Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
| | - Kathleen Schoolcraft
- Department of Oncologic Pathology, Dana Farber Cancer Institute, Boston, MA, 02215, USA
| | - Peter G Miller
- Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
- Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, 02215, USA
| | - Alexandra L Condurat
- Dana-Farber/Boston Children's Cancer and Blood Disorders Center, Boston, MA, 02215, USA
| | - Elizabeth M Gonzalez
- Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
- Dana-Farber/Boston Children's Cancer and Blood Disorders Center, Boston, MA, 02215, USA
| | - Kenin Qian
- Dana-Farber/Boston Children's Cancer and Blood Disorders Center, Boston, MA, 02215, USA
| | - Eric Morin
- Dana-Farber/Boston Children's Cancer and Blood Disorders Center, Boston, MA, 02215, USA
| | - Jaldeep Langhnoja
- Division of Pharmaceutical Sciences, James L. Winkle College of Pharmacy, University of Cincinnati, Cincinnati, OH, 45267, USA
| | - Leslie E Lupien
- Dana-Farber/Boston Children's Cancer and Blood Disorders Center, Boston, MA, 02215, USA
| | - Veronica Rendo
- Department of Cancer Biology, Dana Farber Cancer Institute, Boston, MA, 02215, USA
| | - Jeromy Digiacomo
- Dana-Farber/Boston Children's Cancer and Blood Disorders Center, Boston, MA, 02215, USA
| | - Dayle Wang
- Dana-Farber/Boston Children's Cancer and Blood Disorders Center, Boston, MA, 02215, USA
| | - Kevin Zhou
- Dana-Farber/Boston Children's Cancer and Blood Disorders Center, Boston, MA, 02215, USA
| | - Rushil Kumbhani
- Dana-Farber/Boston Children's Cancer and Blood Disorders Center, Boston, MA, 02215, USA
| | | | - Claire E Sinai
- Department of Oncologic Pathology, Dana Farber Cancer Institute, Boston, MA, 02215, USA
| | - Sarah Becker
- Department of Oncologic Pathology, Dana Farber Cancer Institute, Boston, MA, 02215, USA
| | - Rachel Schneider
- Department of Oncologic Pathology, Dana Farber Cancer Institute, Boston, MA, 02215, USA
| | - Jayne Vogelzang
- Department of Oncologic Pathology, Dana Farber Cancer Institute, Boston, MA, 02215, USA
| | - Karsten Krug
- Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
| | - Amy Goodale
- Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
| | - Tanaz Abid
- Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
| | - Zohra Kalani
- Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
| | | | - Rameen Beroukhim
- Department of Cancer Biology, Dana Farber Cancer Institute, Boston, MA, 02215, USA
- Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
| | - Nicole S Persky
- Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
| | - David E Root
- Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
| | - Angel M Carcaboso
- Department of Pediatric Hematology and Oncology, Hospital Sant Joan de Deu, Institut de Recerca Sant Joan de Deu, Barcelona, 08950, Spain
| | - Benjamin L Ebert
- Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
- Department of Medical Oncology, Dana-Farber Cancer Institute, Boston, MA, 02215, USA
- Howard Hughes Medical Institute, Chevy Chase, MD, 20815, USA
| | - Christine Fuller
- Department of Pathology, Cincinnati Children's Hospital Medical Center, Cincinnati, OH, 45267, USA
| | - Ozgun Babur
- College of Science and Mathematics, University of Massachusetts Boston, Boston, MA, 02125, USA
| | - Mark W Kieran
- Dana-Farber/Boston Children's Cancer and Blood Disorders Center, Boston, MA, 02215, USA
- Bristol Myers Squibb, Boston, Devens, MA, 01434, USA
| | - Chris Jones
- Division of Molecular Pathology, Institute of Cancer Research, London, UK
| | | | - Keith L Ligon
- Department of Oncologic Pathology, Dana Farber Cancer Institute, Boston, MA, 02215, USA
| | - Steven A Carr
- Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
| | - Timothy N Phoenix
- Division of Pharmaceutical Sciences, James L. Winkle College of Pharmacy, University of Cincinnati, Cincinnati, OH, 45267, USA.
- Research in Patient Services, Cincinnati Children's Hospital Medical Center, Cincinnati, OH, 45267, USA.
| | - Pratiti Bandopadhayay
- Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA.
- Dana-Farber/Boston Children's Cancer and Blood Disorders Center, Boston, MA, 02215, USA.
- Department of Pediatrics, Harvard Medical School, Boston, MA, 02215, USA.
| |
Collapse
|
4
|
Fishbein SRS, Tomasi FG, Wolf ID, Dulberger CL, Wang A, Keshishian H, Wallace L, Carr SA, Ioerger TR, Rego EH, Rubin EJ. The conserved translation factor LepA is required for optimal synthesis of a porin family in Mycobacterium smegmatis. J Bacteriol 2020; 203:JB.00604-20. [PMID: 33361193 PMCID: PMC8095456 DOI: 10.1128/jb.00604-20] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2020] [Accepted: 12/15/2020] [Indexed: 11/20/2022] Open
Abstract
The recalcitrance of mycobacteria to antibiotic therapy is in part due to its ability to build proteins into a multi-layer cell wall. Proper synthesis of both cell wall constituents and associated proteins is crucial to maintaining cell integrity, and intimately tied to antibiotic susceptibility. How mycobacteria properly synthesize the membrane-associated proteome, however, remains poorly understood. Recently, we found that loss of lepA in Mycobacterium smegmatis (Msm) altered tolerance to rifampin, a drug that targets a non-ribosomal cellular process. LepA is a ribosome-associated GTPase found in bacteria, mitochondria, and chloroplasts, yet its physiological contribution to cellular processes is not clear. To uncover the determinants of LepA-mediated drug tolerance, we characterized the whole-cell proteomes and transcriptomes of a lepA deletion mutant relative to strains with lepA We find that LepA is important for the steady-state abundance of a number of membrane-associated proteins, including an outer membrane porin, MspA, which is integral to nutrient uptake and drug susceptibility. Loss of LepA leads to a decreased amount of porin in the membrane which leads to the drug tolerance phenotype of the lepA mutant. In mycobacteria, the translation factor LepA modulates mycobacterial membrane homeostasis, which in turn affects antibiotic tolerance.ImportanceThe mycobacterial cell wall is a promising target for new antibiotics due to the abundance of important membrane-associated proteins. Defining mechanisms of synthesis of the membrane proteome will be critical to uncovering and validating drug targets. We found that LepA, a universally conserved translation factor, controls the synthesis of a number of major membrane proteins in M. smegmatis LepA primarily controls synthesis of the major porin MspA. Loss of LepA results in decreased permeability through the loss of this porin, including permeability to antibiotics like rifampin and vancomycin. In mycobacteria, regulation from the ribosome is critical for the maintenance of membrane homeostasis and, importantly, antibiotic susceptibility.
Collapse
Affiliation(s)
- Skye R S Fishbein
- Department of Immunology and Infectious Disease, Harvard TH Chan School of Public Health, Boston, Massachusetts, 02115, United States
| | - Francesca G Tomasi
- Department of Immunology and Infectious Disease, Harvard TH Chan School of Public Health, Boston, Massachusetts, 02115, United States
| | - Ian D Wolf
- Department of Immunology and Infectious Disease, Harvard TH Chan School of Public Health, Boston, Massachusetts, 02115, United States
| | - Charles L Dulberger
- Department of Immunology and Infectious Disease, Harvard TH Chan School of Public Health, Boston, Massachusetts, 02115, United States
| | - Albert Wang
- Department of Immunology and Infectious Disease, Harvard TH Chan School of Public Health, Boston, Massachusetts, 02115, United States
| | | | - Luke Wallace
- Broad Institute of MIT and Harvard, Cambridge, 02142, United States
| | - Steven A Carr
- Broad Institute of MIT and Harvard, Cambridge, 02142, United States
| | - Thomas R Ioerger
- Department of Computer Science and Engineering, Texas A&M University, Texas, 77843, United States
| | - E Hesper Rego
- Department of Microbial Pathogenesis, Yale University School of Medicine, New Haven, Connecticut, 06510, United States
| | - Eric J Rubin
- Department of Immunology and Infectious Disease, Harvard TH Chan School of Public Health, Boston, Massachusetts, 02115, United States
| |
Collapse
|
5
|
Yilmaz MT, Kesmen Z, Baykal B, Sagdic O, Kulen O, Kacar O, Yetim H, Baykal AT. A novel method to differentiate bovine and porcine gelatins in food products: nanoUPLC-ESI-Q-TOF-MS(E) based data independent acquisition technique to detect marker peptides in gelatin. Food Chem 2013; 141:2450-8. [PMID: 23870980 DOI: 10.1016/j.foodchem.2013.05.096] [Citation(s) in RCA: 55] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/26/2011] [Revised: 09/22/2012] [Accepted: 05/20/2013] [Indexed: 11/17/2022]
Abstract
We presented a novel nanoUPLC-MS(E) workflow method that has potential to identify origin of gelatin in some dairy products; yoghurt, cheese and ice cream. In this study, the method was performed in two steps. In the first step, gelatin was extracted from these products before the MS-sample preparation. In the second step, tryptic gelatin peptides were separated and analyzed with ultra-performance liquid chromatography and electrospray ionization quadrupole time-of-flight mass spectrometry (nanoUPLC-ESI-q-TOF-MS(E)). The novelty of this setup was that it functioned in a data independent acquisition mode and that alternate low and elevated collision energy was applied to acquire precursor and product ion information. This enabled accurate mass acquisition on the peptide level to identify the gelatin peptides. The marker peptides specific for porcine and bovine could be successfully detected in the gelatin added to the dairy products analyzed, revealing that the detection of marker peptides in the digested gelatin samples using nanoUPLC-ESI-q-TOF-MS(E) could be an effective method to differentiate porcine and bovine gelatin in the dairy products.
Collapse
Affiliation(s)
- Mustafa Tahsin Yilmaz
- Yıldız Technical University, Chemical and Metallurgical Engineering Faculty, Food Engineering Department, 34210 İstanbul, Turkey.
| | | | | | | | | | | | | | | |
Collapse
|
6
|
Eastwood DC, Floudas D, Binder M, Majcherczyk A, Schneider P, Aerts A, Asiegbu FO, Baker SE, Barry K, Bendiksby M, Blumentritt M, Coutinho PM, Cullen D, de Vries RP, Gathman A, Goodell B, Henrissat B, Ihrmark K, Kauserud H, Kohler A, LaButti K, Lapidus A, Lavin JL, Lee YH, Lindquist E, Lilly W, Lucas S, Morin E, Murat C, Oguiza JA, Park J, Pisabarro AG, Riley R, Rosling A, Salamov A, Schmidt O, Schmutz J, Skrede I, Stenlid J, Wiebenga A, Xie X, Kües U, Hibbett DS, Hoffmeister D, Högberg N, Martin F, Grigoriev IV, Watkinson SC. The plant cell wall-decomposing machinery underlies the functional diversity of forest fungi. Science 2011; 333:762-5. [PMID: 21764756 DOI: 10.1126/science.1205411] [Citation(s) in RCA: 354] [Impact Index Per Article: 27.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
Abstract
Brown rot decay removes cellulose and hemicellulose from wood--residual lignin contributing up to 30% of forest soil carbon--and is derived from an ancestral white rot saprotrophy in which both lignin and cellulose are decomposed. Comparative and functional genomics of the "dry rot" fungus Serpula lacrymans, derived from forest ancestors, demonstrated that the evolution of both ectomycorrhizal biotrophy and brown rot saprotrophy were accompanied by reductions and losses in specific protein families, suggesting adaptation to an intercellular interaction with plant tissue. Transcriptome and proteome analysis also identified differences in wood decomposition in S. lacrymans relative to the brown rot Postia placenta. Furthermore, fungal nutritional mode diversification suggests that the boreal forest biome originated via genetic coevolution of above- and below-ground biota.
Collapse
Affiliation(s)
- Daniel C Eastwood
- College of Science, University of Swansea, Singleton Park, Swansea SA2 8PP, UK.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
7
|
Cham Mead JA, Bianco L, Bessant C. Free computational resources for designing selected reaction monitoring transitions. Proteomics 2010; 10:1106-26. [DOI: 10.1002/pmic.200900396] [Citation(s) in RCA: 54] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
|
8
|
Abstract
The peptide identification problem lies at the heart of modern proteomic methodology, from which the presence of a particular protein or proteins in a sample may be inferred. The challenge is to find the most likely amino acid sequence, which corresponds to each tandem mass spectrum that has been collected, and produce some kind of score and associated statistical measure that the putative identification is correct. This approach assumes that the peptide (and parent protein) sequence in question is known and is present in the database which is to be searched, as opposed to de novo methods, which seek to identify the peptide ab initio. This chapter will provide an overview of the methods that common, popular software tools employ to search protein sequence databases to provide the non-expert reader with sufficient background to appreciate the choices they can make. This will cover the approaches used to compare experimental and theoretical spectra and some of the methods used to validate and provide higher confidence in the assignments.
Collapse
Affiliation(s)
- Simon J Hubbard
- Faculty of Life Sciences, University of Manchester, Michael Smith Building, Manchester, UK.
| |
Collapse
|
9
|
Abstract
Multiple reaction monitoring (MRM) of peptides is a popular proteomics technique that employs tandem mass spectrometry to quantify selected proteins of interest, such as those previously identified in differential protein identification studies. Using this technique, the specificity of precursor to product transitions is exploited to determine the absolute quantity of multiple proteins in a single sample. Selection of suitable transitions is critical for the success of MRM experiments, but accurate theoretical prediction of fragmentation patterns and peptide signal intensity is currently not possible. A recently proposed solution to this problem is to combine knowledge of the preferred properties of transitions for MRM, taken from expert practitioners, with MS/MS evidence extracted from a proteomics data repository. In addition, by predicting retention time for each peptide candidate, it allows selection of several compatible transition candidates that can be monitored simultaneously, permitting MRM. In this chapter, we explain how to go about designing transitions using the web-based transition design tool, MRMaid, which leverages high quality MS/MS evidence from the Genome Annotating Proteomic Pipeline (GAPP).
Collapse
|
10
|
Mass spectrometric detection of marker peptides in tryptic digests of gelatin: A new method to differentiate between bovine and porcine gelatin. Food Hydrocoll 2009. [DOI: 10.1016/j.foodhyd.2009.03.010] [Citation(s) in RCA: 80] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
|
11
|
Jones AR, Siepen JA, Hubbard SJ, Paton NW. Improving sensitivity in proteome studies by analysis of false discovery rates for multiple search engines. Proteomics 2009; 9:1220-9. [PMID: 19253293 DOI: 10.1002/pmic.200800473] [Citation(s) in RCA: 80] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]
Abstract
LC-MS experiments can generate large quantities of data, for which a variety of database search engines are available to make peptide and protein identifications. Decoy databases are becoming widely used to place statistical confidence in result sets, allowing the false discovery rate (FDR) to be estimated. Different search engines produce different identification sets so employing more than one search engine could result in an increased number of peptides (and proteins) being identified, if an appropriate mechanism for combining data can be defined. We have developed a search engine independent score, based on FDR, which allows peptide identifications from different search engines to be combined, called the FDR Score. The results demonstrate that the observed FDR is significantly different when analysing the set of identifications made by all three search engines, by each pair of search engines or by a single search engine. Our algorithm assigns identifications to groups according to the set of search engines that have made the identification, and re-assigns the score (combined FDR Score). The combined FDR Score can differentiate between correct and incorrect peptide identifications with high accuracy, allowing on average 35% more peptide identifications to be made at a fixed FDR than using a single search engine.
Collapse
Affiliation(s)
- Andrew R Jones
- Department of Preclinical Veterinary Science, Faculty of Veterinary Science, University of Liverpool, Liverpool, UK.
| | | | | | | |
Collapse
|
12
|
Bianco L, Mead JA, Bessant C. Comparison of Novel Decoy Database Designs for Optimizing Protein Identification Searches Using ABRF sPRG2006 Standard MS/MS Data Sets. J Proteome Res 2009; 8:1782-91. [DOI: 10.1021/pr800792z] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Affiliation(s)
- Luca Bianco
- Bioinformatics Group, Building 63, Cranfield University, Cranfield, Bedfordshire, United Kingdom MK43 0AL
| | - Jennifer A. Mead
- Bioinformatics Group, Building 63, Cranfield University, Cranfield, Bedfordshire, United Kingdom MK43 0AL
| | - Conrad Bessant
- Bioinformatics Group, Building 63, Cranfield University, Cranfield, Bedfordshire, United Kingdom MK43 0AL
| |
Collapse
|
13
|
Wright JC, Sugden D, Francis-McIntyre S, Riba-Garcia I, Gaskell SJ, Grigoriev IV, Baker SE, Beynon RJ, Hubbard SJ. Exploiting proteomic data for genome annotation and gene model validation in Aspergillus niger. BMC Genomics 2009; 10:61. [PMID: 19193216 PMCID: PMC2644712 DOI: 10.1186/1471-2164-10-61] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2008] [Accepted: 02/04/2009] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Proteomic data is a potentially rich, but arguably unexploited, data source for genome annotation. Peptide identifications from tandem mass spectrometry provide prima facie evidence for gene predictions and can discriminate over a set of candidate gene models. Here we apply this to the recently sequenced Aspergillus niger fungal genome from the Joint Genome Institutes (JGI) and another predicted protein set from another A.niger sequence. Tandem mass spectra (MS/MS) were acquired from 1d gel electrophoresis bands and searched against all available gene models using Average Peptide Scoring (APS) and reverse database searching to produce confident identifications at an acceptable false discovery rate (FDR). RESULTS 405 identified peptide sequences were mapped to 214 different A.niger genomic loci to which 4093 predicted gene models clustered, 2872 of which contained the mapped peptides. Interestingly, 13 (6%) of these loci either had no preferred predicted gene model or the genome annotators' chosen "best" model for that genomic locus was not found to be the most parsimonious match to the identified peptides. The peptides identified also boosted confidence in predicted gene structures spanning 54 introns from different gene models. CONCLUSION This work highlights the potential of integrating experimental proteomics data into genomic annotation pipelines much as expressed sequence tag (EST) data has been. A comparison of the published genome from another strain of A.niger sequenced by DSM showed that a number of the gene models or proteins with proteomics evidence did not occur in both genomes, further highlighting the utility of the method.
Collapse
Affiliation(s)
- James C Wright
- Dept Veterinary Preclinical Sciences, University of Liverpool, Liverpool, UK.
| | | | | | | | | | | | | | | | | |
Collapse
|
14
|
YUN D, LU H, WANG H, ZHANG Y, CHENG G, JIN H, YU Y, XU Y, YANG P, HE F. Iterative Non- m/ z-sharing Rule for Confident and Sensitive Protein Identification of Non-shotgun Proteomics. CHINESE J CHEM 2009. [DOI: 10.1002/cjoc.200990053] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
|
15
|
Mead JA, Bianco L, Ottone V, Barton C, Kay RG, Lilley KS, Bond NJ, Bessant C. MRMaid, the web-based tool for designing multiple reaction monitoring (MRM) transitions. Mol Cell Proteomics 2008; 8:696-705. [PMID: 19011259 DOI: 10.1074/mcp.m800192-mcp200] [Citation(s) in RCA: 86] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
Multiple reaction monitoring (MRM) of peptides uses tandem mass spectrometry to quantify selected proteins of interest, such as those previously identified in differential studies. Using this technique, the specificity of precursor to product transitions is harnessed for quantitative analysis of multiple proteins in a single sample. The design of transitions is critical for the success of MRM experiments, but predicting signal intensity of peptides and fragmentation patterns ab initio is challenging given existing methods. The tool presented here, MRMaid (pronounced "mermaid") offers a novel alternative for rapid design of MRM transitions for the proteomics researcher. The program uses a combination of knowledge of the properties of optimal MRM transitions taken from expert practitioners and literature with MS/MS evidence derived from interrogation of a database of peptide identifications and their associated mass spectra. The tool also predicts retention time using a published model, allowing ordering of transition candidates. By exploiting available knowledge and resources to generate the most reliable transitions, this approach negates the need for theoretical prediction of fragmentation and the need to undertake prior "discovery" MS studies. MRMaid is a modular tool built around the Genome Annotating Proteomic Pipeline framework, providing a web-based solution with both descriptive and graphical visualizations of transitions. Predicted transition candidates are ranked based on a novel transition scoring system, and users may filter the results by selecting optional stringency criteria, such as omitting frequently modified residues, constraining the length of peptides, or omitting missed cleavages. Comparison with published transitions showed that MRMaid successfully predicted the peptide and product ion pairs in the majority of cases with appropriate retention time estimates. As the data content of the Genome Annotating Proteomic Pipeline repository increases, the coverage and reliability of MRMaid are set to increase further. MRMaid is freely available over the internet as an executable web-based service at www.mrmaid.info.
Collapse
Affiliation(s)
- Jennifer A Mead
- Bioinformatics Group, Cranfield Health, Building 63, Cranfield University, College Road, Cranfield, Bedfordshire MK43 0AL, United Kingdom
| | | | | | | | | | | | | | | |
Collapse
|
16
|
Sanderson SJ, Xia D, Prieto H, Yates J, Heiges M, Kissinger JC, Bromley E, Lal K, Sinden RE, Tomley F, Wastling JM. Determining the protein repertoire of Cryptosporidium parvum sporozoites. Proteomics 2008; 8:1398-414. [PMID: 18306179 DOI: 10.1002/pmic.200700804] [Citation(s) in RCA: 69] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
The genome of the intracellular parasite Cryptosporidium parvum has recently been sequenced, but protein expression data for the invasive stages of this important zoonotic gastrointestinal pathogen are limited. In this paper a comprehensive analysis of the expressed protein repertoire of an excysted oocyst/sporozoite preparation of C. parvum is presented. Three independent proteome platforms were employed which yielded more than 4800 individual protein identifications representing 1237 nonredundant proteins, corresponding to approximately 30% of the predicted proteome. Peptide data were mapped to the corresponding locations on the C. parvum genome and a publicly accessible interface for proteome data was developed for data-mining and visualisation at CryptoDB (http://cryptodb.org). These data provide a timely and valuable resource for improved annotation of the genome, verification of predicted hypothetical proteins and identification of proteins not predicted by current gene models. The data indicated the expression of proteins likely to be important to the invasion and intracellular establishment of the parasite, including surface proteins, constituents of the remnant mitochondrion and apical organelles. Comparison of the expressed proteome with existing transcriptional data indicated only a weak correlation. For approximately half the proteome there was limited functional and structural information, highlighting the limitations in the current understanding of Cryptosporidium biology.
Collapse
Affiliation(s)
- Sanya J Sanderson
- Departments of Pre-clinical Veterinary Science and Veterinary Pathology, Faculty of Veterinary Science, University of Liverpool, Liverpool, UK
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
17
|
Vasilescu J, Smith JC, Zweitzig DR, Denis NJ, Haines DS, Figeys D. Systematic determination of ion score cutoffs based on calculated false positive rates: application for identifying ubiquitinated proteins by tandem mass spectrometry. JOURNAL OF MASS SPECTROMETRY : JMS 2008; 43:296-304. [PMID: 17957819 DOI: 10.1002/jms.1297] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/25/2023]
Abstract
We report a simple approach for determining ion score cutoffs that permit the confident identification of ubiquitinated proteins by tandem mass spectrometry (MS/MS). Initial experiments involving the analysis of gel bands containing multi-Ubiquitin chains with quadrupole time-of-flight and quadrupole ion trap mass spectrometers revealed that standard ion score cutoffs used for database searching were not sufficiently stringent. We also found that false positive and false negative rates (FPR and FNR) varied significantly depending on the cutoff scores used and that appropriate cutoffs could only be determined following a systematic evaluation of false positive rates. When standard cutoff scores were used for the analysis of complex mixtures of ubiquitinated proteins, unacceptably high FPR were observed. Finally, we found that FPR for ubiquitinated proteins are affected by the size of the protein database that is searched. These observations may be applicable for the study of other post-translational modifications.
Collapse
Affiliation(s)
- Julian Vasilescu
- Ottawa Institute of Systems Biology, University of Ottawa, 451 Smyth Road, Ottawa, ON, K1H 8M5, Canada
| | | | | | | | | | | |
Collapse
|
18
|
Mead JA, Shadforth IP, Bessant C. Public proteomic MS repositories and pipelines: available tools and biological applications. Proteomics 2007; 7:2769-86. [PMID: 17654461 DOI: 10.1002/pmic.200700152] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023]
Abstract
As proteomic MS has increased in throughput, so has the demand to catalogue the increasing number of peptides and proteins observed by the community using this technique. As in other 'omics' fields, this brings obvious scientific benefits such as sharing of results and prevention of unnecessary repetition, but also provides technical insights, such as the ability to compare proteome coverage between different laboratories, or between different proteomic platforms. Journals are also moving towards mandating that proteomics data be submitted to public repositories upon publication. In response to these demands, several web-based repositories have been established to store protein and peptide identifications derived from MS data, and a similar number of peptide identification software pipelines have emerged to deliver identifications to these repositories. This paper reviews the latest developments in public domain peptide and protein identification databases and describes the analysis pipelines that feed them. Recent applications of the tools to pertinent biological problems are examined, and through comparing and contrasting the capabilities of each system, the issues facing research users of web-based repositories are explored. Future developments and mechanisms to enhance system functionality and user-interfacing opportunities are also suggested.
Collapse
Affiliation(s)
- Jennifer A Mead
- Cranfield Health, Cranfield University, Silsoe, Bedfordshire, UK
| | | | | |
Collapse
|
19
|
Lubec G, Afjehi-Sadat L. Limitations and pitfalls in protein identification by mass spectrometry. Chem Rev 2007; 107:3568-84. [PMID: 17645314 DOI: 10.1021/cr068213f] [Citation(s) in RCA: 84] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]
Affiliation(s)
- Gert Lubec
- Medical University of Vienna, Department of Pediatrics, Waehringer Guertel 18, A-1090 Vienna, Austria.
| | | |
Collapse
|
20
|
Allmer J, Naumann B, Markert C, Zhang M, Hippler M. Mass spectrometric genomic data mining: Novel insights into bioenergetic pathways in Chlamydomonas reinhardtii. Proteomics 2007; 6:6207-20. [PMID: 17078018 DOI: 10.1002/pmic.200600208] [Citation(s) in RCA: 56] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
A new high-throughput computational strategy was established that improves genomic data mining from MS experiments. The MS/MS data were analyzed by the SEQUEST search algorithm and a combination of de novo amino acid sequencing in conjunction with an error-tolerant database search tool, operating on a 256 processor computer cluster. The error-tolerant search tool, previously established as GenomicPeptideFinder (GPF), enables detection of intron-split and/or alternatively spliced peptides from MS/MS data when deduced from genomic DNA. Isolated thylakoid membranes from the eukaryotic green alga Chlamydomonas reinhardtii were separated by 1-D SDS gel electrophoresis, protein bands were excised from the gel, digested in-gel with trypsin and analyzed by coupling nano-flow LC with MS/MS. The concerted action of SEQUEST and GPF allowed identification of 2622 distinct peptides. In total 448 peptides were identified by GPF analysis alone, including 98 intron-split peptides, resulting in the identification of novel proteins, improved annotation of gene models, and evidence of alternative splicing.
Collapse
Affiliation(s)
- Jens Allmer
- Plant Science Institute, Department of Biology, University of Pennsylvania, Philadelphia, PA, USA
| | | | | | | | | |
Collapse
|
21
|
Abstract
Proteomics based on tandem mass spectrometry is a powerful tool for identifying novel biomarkers and drug targets. Previously, a major bottleneck in high-throughput proteomics has been that the computational techniques needed to reliably identify proteins from proteomic data lagged behind the ability to collect the immense quantity of data generated. This is no longer the case, as fully automated pipelines for peptide and protein identification exist, and these are publicly and privately accessible. Such pipelines can automatically and rapidly generate high-confidence protein identifications from large datasets in a searchable format covering multiple experimental runs. However, the main challenge for the community now is to use these resources as they are, by taking full advantage of the pooling of information, so that the next barrier in our understanding of biology may be broken. There are currently two pipelines in the public domain that provide such potential: PeptideAtlas and the Genome Annotating Proteomic Pipeline. This review will introduce their features in the context of high-throughput proteomics, and provide indicative results as to their usefulness and usability through a side-by-side comparison of results obtained when processing a set of human plasma samples.
Collapse
Affiliation(s)
- Ian Shadforth
- Cranfield University, Cranfield Health, Silsoe, MK45 4DT, UK.
| | | |
Collapse
|
22
|
Sadowski PG, Dunkley TPJ, Shadforth IP, Dupree P, Bessant C, Griffin JL, Lilley KS. Quantitative proteomic approach to study subcellular localization of membrane proteins. Nat Protoc 2006; 1:1778-89. [PMID: 17487160 DOI: 10.1038/nprot.2006.254] [Citation(s) in RCA: 73] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
As proteins within cells are spatially organized according to their role, knowledge about protein localization gives insight into protein function. Here, we describe the LOPIT technique (localization of organelle proteins by isotope tagging) developed for the simultaneous and confident determination of the steady-state distribution of hundreds of integral membrane proteins within organelles. The technique uses a partial membrane fractionation strategy in conjunction with quantitative proteomics. Localization of proteins is achieved by measuring their distribution pattern across the density gradient using amine-reactive isotope tagging and comparing these patterns with those of known organelle residents. LOPIT relies on the assumption that proteins belonging to the same organelle will co-fractionate. Multivariate statistical tools are then used to group proteins according to the similarities in their distributions, and hence localization without complete centrifugal separation is achieved. The protocol requires approximately 3 weeks to complete and can be applied in a high-throughput manner to material from many varied sources.
Collapse
Affiliation(s)
- Pawel G Sadowski
- Cambridge Centre for Proteomics, Department of Biochemistry, University of Cambridge, Downing Site, Cambridge CB2 1QW, UK
| | | | | | | | | | | | | |
Collapse
|
23
|
Shadforth I, Xu W, Crowther D, Bessant C. GAPP: A Fully Automated Software for the Confident Identification of Human Peptides from Tandem Mass Spectra. J Proteome Res 2006; 5:2849-52. [PMID: 17022656 DOI: 10.1021/pr060205s] [Citation(s) in RCA: 37] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
This paper introduces the genome annotating proteomic pipeline (GAPP), a totally automated publicly available software pipeline for the identification of peptides and proteins from human proteomic tandem mass spectrometry data. The pipeline takes as its input a series of MS/MS peak lists from a given experimental sample and produces a series of database entries corresponding to the peptides observed within the sample, along with related confidence scores. The pipeline is capable of finding any peptides expected, including those that cross intron-exon boundaries, and those due to single nucleotide polymorphisms (SNPs), alternate splicing, and post-translational modifications (PTMs). GAPP can therefore be used to re-annotate genomes, and this is supported through the inclusion of a Distributed Annotation System (DAS) server, which allows the peptides identified by the pipeline to be displayed in their genomic context within the Ensembl genome browser. GAPP is freely available via the web, at www. gapp.info.
Collapse
Affiliation(s)
- Ian Shadforth
- Cranfield University, Silsoe, Bedfordshire MK45 4DT, United Kingdom
| | | | | | | |
Collapse
|