51
|
Singhal A, Virmani R, Naz S, Arora G, Gaur M, Kundu P, Sajid A, Misra R, Dabla A, Kumar S, Nellissery J, Molle V, Gerth U, Swaroop A, Sharma K, Nandicoori VK, Singh Y. Methylation of two-component response regulator MtrA in mycobacteria negatively modulates its DNA binding and transcriptional activation. Biochem J 2020; 477:4473-4489. [PMID: 33175092 PMCID: PMC11374129 DOI: 10.1042/bcj20200455] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2020] [Revised: 11/05/2020] [Accepted: 11/11/2020] [Indexed: 12/23/2022]
Abstract
Post-translational modifications such as phosphorylation, nitrosylation, and pupylation modulate multiple cellular processes in Mycobacterium tuberculosis. While protein methylation at lysine and arginine residues is widespread in eukaryotes, to date only two methylated proteins in Mtb have been identified. Here, we report the identification of methylation at lysine and/or arginine residues in nine mycobacterial proteins. Among the proteins identified, we chose MtrA, an essential response regulator of a two-component signaling system, which gets methylated on multiple lysine and arginine residues to examine the functional consequences of methylation. While methylation of K207 confers a marginal decrease in the DNA-binding ability of MtrA, methylation of R122 or K204 significantly reduces the interaction with the DNA. Overexpression of S-adenosyl homocysteine hydrolase (SahH), an enzyme that modulates the levels of S-adenosyl methionine in mycobacteria decreases the extent of MtrA methylation. Most importantly, we show that decreased MtrA methylation results in transcriptional activation of mtrA and sahH promoters. Collectively, we identify novel methylated proteins, expand the list of modifications in mycobacteria by adding arginine methylation, and show that methylation regulates MtrA activity. We propose that protein methylation could be a more prevalent modification in mycobacterial proteins.
Collapse
Affiliation(s)
- Anshika Singhal
- CSIR-Institute of Genomics and Integrative Biology, Delhi 110007, India
| | - Richa Virmani
- Department of Zoology, University of Delhi, Delhi 110007, India
- CSIR-Institute of Genomics and Integrative Biology, Delhi 110007, India
| | - Saba Naz
- Department of Zoology, University of Delhi, Delhi 110007, India
- National Institute of Immunology, Aruna Asaf Ali Marg, New Delhi 110067, India
| | - Gunjan Arora
- CSIR-Institute of Genomics and Integrative Biology, Delhi 110007, India
| | - Mohita Gaur
- Department of Zoology, University of Delhi, Delhi 110007, India
- CSIR-Institute of Genomics and Integrative Biology, Delhi 110007, India
| | - Parijat Kundu
- CSIR-Institute of Genomics and Integrative Biology, Delhi 110007, India
| | - Andaleeb Sajid
- CSIR-Institute of Genomics and Integrative Biology, Delhi 110007, India
| | - Richa Misra
- CSIR-Institute of Genomics and Integrative Biology, Delhi 110007, India
| | - Ankita Dabla
- National Institute of Immunology, Aruna Asaf Ali Marg, New Delhi 110067, India
| | - Suresh Kumar
- National Institute of Immunology, Aruna Asaf Ali Marg, New Delhi 110067, India
| | - Jacob Nellissery
- Neurobiology-Neurodegeneration and Repair Laboratory, National Eye Institute, National Institutes of Health, Bethesda, MD 20892, U.S.A
| | - Virginie Molle
- DIMNP, CNRS, University of Montpellier, Montpellier, France
| | - Ulf Gerth
- Institute of Microbiology, Ernst-Moritz-Arndt-University Greifswald, D-17487 Greifswald, Germany
| | - Anand Swaroop
- Neurobiology-Neurodegeneration and Repair Laboratory, National Eye Institute, National Institutes of Health, Bethesda, MD 20892, U.S.A
| | - Kirti Sharma
- Department of Proteomics and Signal Transduction, Max-Planck-Institute of Biochemistry, Am Klopferspitz 18, D-82152 Martinsried, Germany
| | - Vinay K Nandicoori
- National Institute of Immunology, Aruna Asaf Ali Marg, New Delhi 110067, India
| | - Yogendra Singh
- Department of Zoology, University of Delhi, Delhi 110007, India
| |
Collapse
|
52
|
Smolikova G, Gorbach D, Lukasheva E, Mavropolo-Stolyarenko G, Bilova T, Soboleva A, Tsarev A, Romanovskaya E, Podolskaya E, Zhukov V, Tikhonovich I, Medvedev S, Hoehenwarter W, Frolov A. Bringing New Methods to the Seed Proteomics Platform: Challenges and Perspectives. Int J Mol Sci 2020; 21:E9162. [PMID: 33271881 PMCID: PMC7729594 DOI: 10.3390/ijms21239162] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2020] [Revised: 11/26/2020] [Accepted: 11/27/2020] [Indexed: 12/14/2022] Open
Abstract
For centuries, crop plants have represented the basis of the daily human diet. Among them, cereals and legumes, accumulating oils, proteins, and carbohydrates in their seeds, distinctly dominate modern agriculture, thus play an essential role in food industry and fuel production. Therefore, seeds of crop plants are intensively studied by food chemists, biologists, biochemists, and nutritional physiologists. Accordingly, seed development and germination as well as age- and stress-related alterations in seed vigor, longevity, nutritional value, and safety can be addressed by a broad panel of analytical, biochemical, and physiological methods. Currently, functional genomics is one of the most powerful tools, giving direct access to characteristic metabolic changes accompanying plant development, senescence, and response to biotic or abiotic stress. Among individual post-genomic methodological platforms, proteomics represents one of the most effective ones, giving access to cellular metabolism at the level of proteins. During the recent decades, multiple methodological advances were introduced in different branches of life science, although only some of them were established in seed proteomics so far. Therefore, here we discuss main methodological approaches already employed in seed proteomics, as well as those still waiting for implementation in this field of plant research, with a special emphasis on sample preparation, data acquisition, processing, and post-processing. Thereby, the overall goal of this review is to bring new methodologies emerging in different areas of proteomics research (clinical, food, ecological, microbial, and plant proteomics) to the broad society of seed biologists.
Collapse
Affiliation(s)
- Galina Smolikova
- Department of Plant Physiology and Biochemistry, St. Petersburg State University; 199034 St. Petersburg, Russia; (G.S.); (T.B.); (S.M.)
| | - Daria Gorbach
- Department of Biochemistry, St. Petersburg State University; 199178 St. Petersburg, Russia; (D.G.); (E.L.); (G.M.-S.); (A.S.); (A.T.); (E.R.)
| | - Elena Lukasheva
- Department of Biochemistry, St. Petersburg State University; 199178 St. Petersburg, Russia; (D.G.); (E.L.); (G.M.-S.); (A.S.); (A.T.); (E.R.)
| | - Gregory Mavropolo-Stolyarenko
- Department of Biochemistry, St. Petersburg State University; 199178 St. Petersburg, Russia; (D.G.); (E.L.); (G.M.-S.); (A.S.); (A.T.); (E.R.)
| | - Tatiana Bilova
- Department of Plant Physiology and Biochemistry, St. Petersburg State University; 199034 St. Petersburg, Russia; (G.S.); (T.B.); (S.M.)
- Department of Bioorganic Chemistry, Leibniz Institute of Plant Biochemistry; 06120 Halle (Saale), Germany
| | - Alena Soboleva
- Department of Biochemistry, St. Petersburg State University; 199178 St. Petersburg, Russia; (D.G.); (E.L.); (G.M.-S.); (A.S.); (A.T.); (E.R.)
- Department of Bioorganic Chemistry, Leibniz Institute of Plant Biochemistry; 06120 Halle (Saale), Germany
| | - Alexander Tsarev
- Department of Biochemistry, St. Petersburg State University; 199178 St. Petersburg, Russia; (D.G.); (E.L.); (G.M.-S.); (A.S.); (A.T.); (E.R.)
- Department of Bioorganic Chemistry, Leibniz Institute of Plant Biochemistry; 06120 Halle (Saale), Germany
| | - Ekaterina Romanovskaya
- Department of Biochemistry, St. Petersburg State University; 199178 St. Petersburg, Russia; (D.G.); (E.L.); (G.M.-S.); (A.S.); (A.T.); (E.R.)
| | - Ekaterina Podolskaya
- Institute of Analytical Instrumentation, Russian Academy of Science; 190103 St. Petersburg, Russia;
- Institute of Toxicology, Russian Federal Medical Agency; 192019 St. Petersburg, Russia
| | - Vladimir Zhukov
- All-Russia Research Institute for Agricultural Microbiology; 196608 St. Petersburg, Russia; (V.Z.); (I.T.)
| | - Igor Tikhonovich
- All-Russia Research Institute for Agricultural Microbiology; 196608 St. Petersburg, Russia; (V.Z.); (I.T.)
- Department of Genetics and Biotechnology, St. Petersburg State University; 199034 St. Petersburg, Russia
| | - Sergei Medvedev
- Department of Plant Physiology and Biochemistry, St. Petersburg State University; 199034 St. Petersburg, Russia; (G.S.); (T.B.); (S.M.)
| | - Wolfgang Hoehenwarter
- Proteome Analytics Research Group, Leibniz Institute of Plant Biochemistry, 06120 Halle (Saale), Germany;
| | - Andrej Frolov
- Department of Biochemistry, St. Petersburg State University; 199178 St. Petersburg, Russia; (D.G.); (E.L.); (G.M.-S.); (A.S.); (A.T.); (E.R.)
- Department of Bioorganic Chemistry, Leibniz Institute of Plant Biochemistry; 06120 Halle (Saale), Germany
| |
Collapse
|
53
|
Azémard C, Dufour E, Zazzo A, Wheeler JC, Goepfert N, Marie A, Zirah S. Untangling the fibre ball: Proteomic characterization of South American camelid hair fibres by untargeted multivariate analysis and molecular networking. J Proteomics 2020; 231:104040. [PMID: 33152504 DOI: 10.1016/j.jprot.2020.104040] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2020] [Revised: 08/27/2020] [Accepted: 10/29/2020] [Indexed: 12/24/2022]
Abstract
The proteomic analysis of hairs, yarns or textiles has emerged as a powerful method to determine species of origin, mainly used in archaeozoological research and fraud control. Differentiation between the South American camelid (SAC) species (the wild guanaco and vicuña and their respective domesticates the llama and alpaca) is particularly challenging due to poor database information and significant hybridization between species. In this study, we analysed 41 modern and 4 archaeological samples from the four SACs species. Despite strong similarities with Old World Camelidae, we identified 7 peptides specific to SACs assigned to keratin K86 and the keratin-associated proteins KAP13-1 and KAP11-1. Untargeted multivariate analysis of the LC-MS data permitted to distinguish SAC species and propose discriminant features. MS/MS-based molecular networking combined with database-assisted de novo sequencing permitted to identify 5 new taxonomic peptides assigned to K33a, K81 and/or K83 keratins and KAP19-1. These peptides differentiate the two wild species, guanaco and vicuña. These results show the value of combining database search and untargeted metabolomic approaches for paleoproteomics, and reveal for the first time the potential of molecular networks to highlight deamidation related to diagenesis and cluster highly similar peptides related to interchain homologies or intra- or inter-specific polymorphism. SIGNIFICANCE: This study used an innovative approach combining multivariate analysis of LC-MS data together with molecular networking and database-assisted de novo sequencing to identify taxonomic peptides in palaeoproteomics. It constitutes the first attempt to differentiate between hair fibres from the four South American camelids (SACs) based on proteomic analysis of modern and archaeological samples. It provides different proteomic signatures for each of the four SAC species and proposes new SAC taxonomic peptides of interest in archaeozoology and fraud control. SACs have been extensively exploited since human colonization of South America but have not been studied to the extent of their economic, cultural and heritage importance. Applied to the analysis of ancient Andean textiles, our results should permit a better understanding of cultural and pastoral practices in South America. The wild SACs are endangered by poaching and black-market sale of their fibre. For the first time, our results provide discriminant features for the determination of species of origin of contraband fibre.
Collapse
Affiliation(s)
- Clara Azémard
- Unité Molécules de Communication et Adaptations des Microorganismes (MCAM), Muséum National d'Histoire Naturelle, CNRS, CP 54, 63 rue Buffon, 75005 Paris, France; Archéozoologie, Archéobotanique: Sociétés, Pratiques et Environnements (AASPE), Muséum National d'Histoire Naturelle, CNRS, CP 56, 55 rue Buffon, 75005 Paris, France
| | - Elise Dufour
- Archéozoologie, Archéobotanique: Sociétés, Pratiques et Environnements (AASPE), Muséum National d'Histoire Naturelle, CNRS, CP 56, 55 rue Buffon, 75005 Paris, France
| | - Antoine Zazzo
- Archéozoologie, Archéobotanique: Sociétés, Pratiques et Environnements (AASPE), Muséum National d'Histoire Naturelle, CNRS, CP 56, 55 rue Buffon, 75005 Paris, France
| | - Jane C Wheeler
- CONOPA - Instituto de Investigación y Desarrollo de Camélidos Sudamericanos, Av. Reusche M4, Pachacamac, Lima 19, Peru
| | - Nicolas Goepfert
- Archéologie des Amériques, UMR 8096, CNRS - Université Paris 1 Panthéon-Sorbonne, MSH Mondes, 21 allée de l'université, 92023 Nanterre, France
| | - Arul Marie
- Unité Molécules de Communication et Adaptations des Microorganismes (MCAM), Muséum National d'Histoire Naturelle, CNRS, CP 54, 63 rue Buffon, 75005 Paris, France
| | - Séverine Zirah
- Unité Molécules de Communication et Adaptations des Microorganismes (MCAM), Muséum National d'Histoire Naturelle, CNRS, CP 54, 63 rue Buffon, 75005 Paris, France.
| |
Collapse
|
54
|
Abraham-Juárez MJ, Schrager-Lavelle A, Man J, Whipple C, Handakumbura P, Babbitt C, Bartlett M. Evolutionary Variation in MADS Box Dimerization Affects Floral Development and Protein Abundance in Maize. THE PLANT CELL 2020; 32:3408-3424. [PMID: 32873631 PMCID: PMC7610293 DOI: 10.1105/tpc.20.00300] [Citation(s) in RCA: 25] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/16/2020] [Revised: 08/19/2020] [Accepted: 09/01/2020] [Indexed: 05/19/2023]
Abstract
Interactions between MADS box transcription factors are critical in the regulation of floral development, and shifting MADS box protein-protein interactions are predicted to have influenced floral evolution. However, precisely how evolutionary variation in protein-protein interactions affects MADS box protein function remains unknown. To assess the impact of changing MADS box protein-protein interactions on transcription factor function, we turned to the grasses, where interactions between B-class MADS box proteins vary. We tested the functional consequences of this evolutionary variability using maize (Zea mays) as an experimental system. We found that differential B-class dimerization was associated with subtle, quantitative differences in stamen shape. In contrast, differential dimerization resulted in large-scale changes to downstream gene expression. Differential dimerization also affected B-class complex composition and abundance, independent of transcript levels. This indicates that differential B-class dimerization affects protein degradation, revealing an important consequence for evolutionary variability in MADS box interactions. Our results highlight complexity in the evolution of developmental gene networks: changing protein-protein interactions could affect not only the composition of transcription factor complexes but also their degradation and persistence in developing flowers. Our results also show how coding change in a pleiotropic master regulator could have small, quantitative effects on development.
Collapse
Affiliation(s)
- María Jazmín Abraham-Juárez
- Biology Department, University of Massachusetts, Amherst, 01003 Massachusetts
- CONACYT-Instituto Potosino de Investigación Científica y Tecnológica A.C., 78216 San Luis Potosi, Mexico
| | - Amanda Schrager-Lavelle
- Biology Department, University of Massachusetts, Amherst, 01003 Massachusetts
- Biology Department, Colorado Mesa University, Grand Junction, 81501 Colorado
| | - Jarrett Man
- Biology Department, University of Massachusetts, Amherst, 01003 Massachusetts
| | - Clinton Whipple
- Biology Department, Brigham Young University, Provo, 84602 Utah
| | - Pubudu Handakumbura
- Biology Department, University of Massachusetts, Amherst, 01003 Massachusetts
- Pacific Northwest National Laboratory, Richland, 99354 Washington
| | - Courtney Babbitt
- Biology Department, University of Massachusetts, Amherst, 01003 Massachusetts
| | - Madelaine Bartlett
- Biology Department, University of Massachusetts, Amherst, 01003 Massachusetts
| |
Collapse
|
55
|
Schwartz TS. The Promises and the Challenges of Integrating Multi-Omics and Systems Biology in Comparative Stress Biology. Integr Comp Biol 2020; 60:89-97. [PMID: 32386307 DOI: 10.1093/icb/icaa026] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
Abstract
Comparative stress biology is inherently a systems biology approach with the goal of integrating the molecular, cellular, and physiological responses with fitness outcomes. In this way, the systems biology approach is expected to provide a holistic understanding of how different stressors result in different fitness outcomes, and how different individuals (or populations or species) respond to stressors differently. In this perceptive article, I focus on the use of multiple types of -omics data in stress biology. Targeting students and those researchers who are considering integrating -omics approaches in their comparative stress biology studies, I discuss the promise of the integration of these measures for furthering our holistic understanding of how organisms respond to different stressors. I also discuss the logistical and conceptual challenges encountered when working with -omics data and the current hurdles to fully utilize these data in studies of stress biology in non-model organisms.
Collapse
Affiliation(s)
- Tonia S Schwartz
- Department of Biological Sciences, Auburn University, Auburn, AL, USA
| |
Collapse
|
56
|
Shamsaei B, Chojnacki S, Pilarczyk M, Najafabadi M, Niu W, Chen C, Ross K, Matlock A, Muhlich J, Chutipongtanate S, Zheng J, Turner J, Vidović D, Jaffe J, MacCoss M, Wu C, Pillai A, Ma'ayan A, Schürer S, Kouril M, Medvedovic M, Meller J. piNET: a versatile web platform for downstream analysis and visualization of proteomics data. Nucleic Acids Res 2020; 48:W85-W93. [PMID: 32469073 PMCID: PMC7319557 DOI: 10.1093/nar/gkaa436] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2020] [Revised: 04/29/2020] [Accepted: 05/27/2020] [Indexed: 02/03/2023] Open
Abstract
Rapid progress in proteomics and large-scale profiling of biological systems at the protein level necessitates the continued development of efficient computational tools for the analysis and interpretation of proteomics data. Here, we present the piNET server that facilitates integrated annotation, analysis and visualization of quantitative proteomics data, with emphasis on PTM networks and integration with the LINCS library of chemical and genetic perturbation signatures in order to provide further mechanistic and functional insights. The primary input for the server consists of a set of peptides or proteins, optionally with PTM sites, and their corresponding abundance values. Several interconnected workflows can be used to generate: (i) interactive graphs and tables providing comprehensive annotation and mapping between peptides and proteins with PTM sites; (ii) high resolution and interactive visualization for enzyme-substrate networks, including kinases and their phospho-peptide targets; (iii) mapping and visualization of LINCS signature connectivity for chemical inhibitors or genetic knockdown of enzymes upstream of their target PTM sites. piNET has been built using a modular Spring-Boot JAVA platform as a fast, versatile and easy to use tool. The Apache Lucene indexing is used for fast mapping of peptides into UniProt entries for the human, mouse and other commonly used model organism proteomes. PTM-centric network analyses combine PhosphoSitePlus, iPTMnet and SIGNOR databases of validated enzyme-substrate relationships, for kinase networks augmented by DeepPhos predictions and sequence-based mapping of PhosphoSitePlus consensus motifs. Concordant LINCS signatures are mapped using iLINCS. For each workflow, a RESTful API counterpart can be used to generate the results programmatically in the json format. The server is available at http://pinet-server.org, and it is free and open to all users without login requirement.
Collapse
Affiliation(s)
- Behrouz Shamsaei
- Department of Environmental and Public Health Sciences, University of Cincinnati College of Medicine, USA
| | - Szymon Chojnacki
- Department of Environmental and Public Health Sciences, University of Cincinnati College of Medicine, USA
| | - Marcin Pilarczyk
- Department of Environmental and Public Health Sciences, University of Cincinnati College of Medicine, USA
| | - Mehdi Najafabadi
- Department of Environmental and Public Health Sciences, University of Cincinnati College of Medicine, USA
| | - Wen Niu
- Department of Environmental and Public Health Sciences, University of Cincinnati College of Medicine, USA
| | - Chuming Chen
- Center for Bioinformatics & Computational Biology; University of Delaware, USA
| | - Karen Ross
- Department of Biochemistry and Molecular & Cellular Biology, Georgetown University Medical Center, USA
| | - Andrea Matlock
- Department of Biomedical Sciences, Cedars-Sinai Medical Center, USA
| | - Jeremy Muhlich
- Department of Systems Biology, Harvard Medical School, USA
| | - Somchai Chutipongtanate
- Department of Cancer Biology, University of Cincinnati College of Medicine, USA.,Department of Pediatrics, Faculty of Medicine Ramathibodi Hospital, Mahidol University, Thailand
| | - Jie Zheng
- Department of Genetics, University of Pennsylvania Perelman School of Medicine, USA
| | - John Turner
- Department of Pharmacology, Miller School of Medicine, Sylvester Comprehensive Cancer Center, Center for Computational Science, University of Miami, Miami, USA
| | - Dušica Vidović
- Department of Pharmacology, Miller School of Medicine, Sylvester Comprehensive Cancer Center, Center for Computational Science, University of Miami, Miami, USA
| | - Jake Jaffe
- Broad Institute of MIT and Harvard & Inzen Therapeutics, USA
| | - Michael MacCoss
- Department of Genome Sciences, University of Washington, USA
| | - Cathy Wu
- Center for Bioinformatics & Computational Biology; University of Delaware, USA.,Department of Biochemistry and Molecular & Cellular Biology, Georgetown University Medical Center, USA
| | - Ajay Pillai
- Human Genome Research Institute, National Institutes of Health, Bethesda, USA
| | - Avi Ma'ayan
- Mount Sinai Center for Bioinformatics, Department of Pharmacological Sciences, Icahn School of Medicine at Mount Sinai, USA
| | - Stephan Schürer
- Department of Pharmacology, Miller School of Medicine, Sylvester Comprehensive Cancer Center, Center for Computational Science, University of Miami, Miami, USA
| | - Michal Kouril
- Division of Biomedical Informatics, Cincinnati Children's Hospital Medical Center, USA
| | - Mario Medvedovic
- Department of Environmental and Public Health Sciences, University of Cincinnati College of Medicine, USA.,Department of Biomedical Informatics, University of Cincinnati College of Medicine, USA
| | - Jarek Meller
- Department of Environmental and Public Health Sciences, University of Cincinnati College of Medicine, USA.,Division of Biomedical Informatics, Cincinnati Children's Hospital Medical Center, USA.,Department of Electrical Engineering and Computer Science, University of Cincinnati, USA
| |
Collapse
|
57
|
Aguilan JT, Kulej K, Sidoli S. Guide for protein fold change and p-value calculation for non-experts in proteomics. Mol Omics 2020; 16:573-582. [PMID: 32968743 DOI: 10.1039/d0mo00087f] [Citation(s) in RCA: 64] [Impact Index Per Article: 16.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
Proteomics studies generate tables with thousands of entries. A significant component of being a proteomics scientist is the ability to process these tables to identify regulated proteins. Many bioinformatics tools are freely available for the community, some of which within reach for scientists with limited or no background in programming and statistics. However, proteomics has become popular in most other biological and biomedical disciplines, resulting in more and more studies where data processing is delegated to specialists that are not lead authors of the scientific project. This creates a risk or at least a limiting factor, as the biological interpretation of a dataset is contingent of a third-party specialist transforming data without the input of the project leader. We acknowledge in advance that dedicated scripts and software have a higher level of sophistication; but we hereby claim that the approach we describe makes proteomics data processing immediately accessible to every scientist. In this paper, we describe key steps of the typical data transformation, normalization and statistics in proteomics data analysis using a simple spreadsheet. This manuscript aims to demonstrate to those who are not familiar with the math and statistics behind these workflows that a proteomics dataset can be processed, simplified and interpreted in software like Microsoft Excel. With this, we aim to reach the community of non-specialists in proteomics to find a common language and illustrate the basic steps of -omics data processing.
Collapse
Affiliation(s)
- Jennifer T Aguilan
- Laboratory for Macromolecular Analysis and Proteomics Facility, Albert Einstein College of Medicine, NY 10461, USA.
| | | | | |
Collapse
|
58
|
Yu SH, Kyriakidou P, Cox J. Isobaric Matching between Runs and Novel PSM-Level Normalization in MaxQuant Strongly Improve Reporter Ion-Based Quantification. J Proteome Res 2020; 19:3945-3954. [PMID: 32892627 PMCID: PMC7586393 DOI: 10.1021/acs.jproteome.0c00209] [Citation(s) in RCA: 36] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
Abstract
![]()
Isobaric
labeling has the promise of combining high sample multiplexing
with precise quantification. However, normalization issues and the
missing value problem of complete n-plexes hamper
quantification across more than one n-plex. Here,
we introduce two novel algorithms implemented in MaxQuant that substantially
improve the data analysis with multiple n-plexes.
First, isobaric matching between runs makes use of the three-dimensional
MS1 features to transfer identifications from identified to unidentified
MS/MS spectra between liquid chromatography–mass spectrometry
runs in order to utilize reporter ion intensities in unidentified
spectra for quantification. On typical datasets, we observe a significant
gain in MS/MS spectra that can be used for quantification. Second,
we introduce a novel PSM-level normalization, applicable to data with
and without the common reference channel. It is a weighted median-based
method, in which the weights reflect the number of ions that were
used for fragmentation. On a typical dataset, we observe complete
removal of batch effects and dominance of the biological sample grouping
after normalization. Furthermore, we provide many novel processing
and normalization options in Perseus, the companion software for the
downstream analysis of quantitative proteomics results. All novel
tools and algorithms are available with the regular MaxQuant and Perseus
releases, which are downloadable at http://maxquant.org.
Collapse
Affiliation(s)
- Sung-Huan Yu
- Computational Systems Biochemistry, Max-Planck Institute of Biochemistry, Am Klopferspitz 18, Martinsried 82152, Germany
| | - Pelagia Kyriakidou
- Computational Systems Biochemistry, Max-Planck Institute of Biochemistry, Am Klopferspitz 18, Martinsried 82152, Germany
| | - Jürgen Cox
- Computational Systems Biochemistry, Max-Planck Institute of Biochemistry, Am Klopferspitz 18, Martinsried 82152, Germany.,Department of Biological and Medical Psychology, University of Bergen, Jonas Liesvei 91, Bergen 5009, Norway
| |
Collapse
|
59
|
Different Research Approaches in Unraveling the Venom Proteome of Naja ashei. Biomolecules 2020; 10:biom10091282. [PMID: 32899462 PMCID: PMC7566006 DOI: 10.3390/biom10091282] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2020] [Revised: 08/29/2020] [Accepted: 09/03/2020] [Indexed: 12/12/2022] Open
Abstract
The dynamic development of venomics in recent years has resulted in a significant increase in publicly available proteomic data. The information contained therein is often used for comparisons between different datasets and to draw biological conclusions therefrom. In this article, we aimed to show the possible differences that can arise, in the final results of the proteomic experiment, while using different research workflows. We applied two software solutions (PeptideShaker and MaxQuant) to process data from shotgun LC-MS/MS analysis of Naja ashei venom and collate it with the previous report concerning this species. We were able to provide new information regarding the protein composition of this venom but also present the qualitative and quantitative limitations of currently used proteomic methods. Moreover, we reported a rapid and straightforward technique for the separation of the fraction of proteins from the three-finger toxin family. Our results underline the necessary caution in the interpretation of data based on a comparative analysis of data derived from different studies.
Collapse
|
60
|
Cicaloni V, Pecorelli A, Tinti L, Rossi M, Benedusi M, Cervellati C, Spiga O, Santucci A, Hayek J, Salvini L, Tinti C, Valacchi G. Proteomic profiling reveals mitochondrial alterations in Rett syndrome. Free Radic Biol Med 2020; 155:37-48. [PMID: 32445864 DOI: 10.1016/j.freeradbiomed.2020.05.014] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/24/2020] [Revised: 05/14/2020] [Accepted: 05/16/2020] [Indexed: 12/11/2022]
Abstract
Rett syndrome (RTT) is a pervasive neurodevelopmental disorder associated with mutation in MECP2 gene. Despite a well-defined genetic cause, there is a growing consensus that a metabolic component could play a pivotal role in RTT pathophysiology. Indeed, perturbed redox homeostasis and inflammation, i.e. oxinflammation, with mitochondria dysfunction as the central hub between the two phenomena, appear as possible key contributing factors to RTT pathogenesis and its clinical features. While these RTT-related changes have been widely documented by transcriptomic profiling, proteomics studies supporting these evidences are still limited. Here, using primary dermal fibroblasts from control and patients, we perform a large-scale proteomic analysis that, together with data mining approaches, allow us to carry out the first comprehensive characterization of RTT cellular proteome, showing mainly changes in expression of proteins involved in the mitochondrial network. These findings parallel with an altered expression of key mediators of mitochondrial dynamics and mitophagy associated with abnormal mitochondrial morphology. In conclusion, our proteomic analysis confirms the pathological relevance of mitochondrial dysfunction in RTT pathogenesis and progression.
Collapse
Affiliation(s)
- Vittoria Cicaloni
- Toscana Life Science Foundation, Via Fiorentina 1, 53100, Siena, Italy
| | - Alessandra Pecorelli
- Plants for Human Health Institute, Animal Science Dept., NC Research Campus, NC State University, 600 Laureate Way, Kannapolis, NC, 28081, USA
| | - Laura Tinti
- Toscana Life Science Foundation, Via Fiorentina 1, 53100, Siena, Italy
| | - Marco Rossi
- Toscana Life Science Foundation, Via Fiorentina 1, 53100, Siena, Italy
| | - Mascia Benedusi
- Department of Biomedical and Specialist Surgical Sciences, Section of Medical Biochemistry, Molecular Biology and Genetics, University of Ferrara, Ferrara, Italy
| | - Carlo Cervellati
- Department of Morphology and Experimental Medicine University of Ferrara, via Borsari 46, 44121, Ferrara, Italy
| | - Ottavia Spiga
- Department of Biotechnology, Chemistry and Pharmacy, Via Aldo Moro 2, University of Siena, Siena, Italy
| | - Annalisa Santucci
- Department of Biotechnology, Chemistry and Pharmacy, Via Aldo Moro 2, University of Siena, Siena, Italy
| | - Joussef Hayek
- Child Neuropsychiatry Unit, University General Hospital, Azienda Ospedaliera Universitaria Senese, Viale M. Bracci 16, 53100, Siena, Italy
| | - Laura Salvini
- Toscana Life Science Foundation, Via Fiorentina 1, 53100, Siena, Italy
| | - Cristina Tinti
- Toscana Life Science Foundation, Via Fiorentina 1, 53100, Siena, Italy
| | - Giuseppe Valacchi
- Plants for Human Health Institute, Animal Science Dept., NC Research Campus, NC State University, 600 Laureate Way, Kannapolis, NC, 28081, USA; Department of Biomedical and Specialist Surgical Sciences, Section of Medical Biochemistry, Molecular Biology and Genetics, University of Ferrara, Ferrara, Italy; Kyung Hee University, Department of Food and Nutrition, Seoul, South Korea.
| |
Collapse
|
61
|
Preston GW, Yang L, Phillips DH, Maier CS. Visualisation tools for dependent peptide searches to support the exploration of in vitro protein modifications. PLoS One 2020; 15:e0235263. [PMID: 32639981 PMCID: PMC7343161 DOI: 10.1371/journal.pone.0235263] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2020] [Accepted: 06/11/2020] [Indexed: 01/16/2023] Open
Abstract
Dependent peptide searching is a method for discovering covalently-modified peptides-and therefore proteins-in mass-spectrometry-based proteomics experiments. Being more permissive than standard search methods, it has the potential to discover novel modifications (e.g., post-translational modifications occurring in vivo, or modifications introduced in vitro). However, few studies have explored dependent peptide search results in an untargeted way. In the present study, we sought to evaluate dependent peptide searching as a means of characterising proteins that have been modified in vitro. We generated a model data set by analysing N-ethylmaleimide-treated bovine serum albumin, and performed dependent peptide searches using the popular MaxQuant software. To facilitate interpretation of the search results (hundreds of dependent peptides), we developed a series of visualisation tools (R scripts). We used the tools to assess the diversity of putative modifications in the albumin, and to pinpoint hypothesised modifications. We went on to explore the tools' generality via analyses of public data from studies of rat and human proteomes. Of 19 expected sites of modification (one in rat cofilin-1 and 18 across six different human plasma proteins), eight were found and correctly localised. Apparently, some sites went undetected because chemical enrichment had depleted necessary analytes (potential 'base' peptides). Our results demonstrate (i) the ability of the tools to provide accurate and informative visualisations, and (ii) the usefulness of dependent peptide searching for characterising in vitro protein modifications. Our model data are available via PRIDE/ProteomeXchange (accession number PXD013040).
Collapse
Affiliation(s)
- George W. Preston
- Department of Analytical, MRC-PHE Centre for Environment & Health, Environmental & Forensic Sciences, School of Population Health & Environmental Sciences, Faculty of Life Sciences & Medicine, King’s College London, London, England, United Kingdom
- Department of Chemistry, Oregon State University, Corvallis, OR, United States of America
| | - Liping Yang
- Department of Chemistry, Oregon State University, Corvallis, OR, United States of America
| | - David H. Phillips
- Department of Analytical, MRC-PHE Centre for Environment & Health, Environmental & Forensic Sciences, School of Population Health & Environmental Sciences, Faculty of Life Sciences & Medicine, King’s College London, London, England, United Kingdom
| | - Claudia S. Maier
- Department of Chemistry, Oregon State University, Corvallis, OR, United States of America
| |
Collapse
|
62
|
Slavov N. Single-cell protein analysis by mass spectrometry. Curr Opin Chem Biol 2020; 60:1-9. [PMID: 32599342 DOI: 10.1016/j.cbpa.2020.04.018] [Citation(s) in RCA: 88] [Impact Index Per Article: 22.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2020] [Revised: 04/23/2020] [Accepted: 04/27/2020] [Indexed: 10/24/2022]
Abstract
Human physiology and pathology arise from the coordinated interactions of diverse single cells. However, analyzing single cells has been limited by the low sensitivity and throughput of analytical methods. DNA sequencing has recently made such analysis feasible for nucleic acids but single-cell protein analysis remains limited. Mass spectrometry is the most powerful method for protein analysis, but its application to single cells faces three major challenges: efficiently delivering proteins/peptides to mass spectrometry detectors, identifying their sequences, and scaling the analysis to many thousands of single cells. These challenges have motivated corresponding solutions, including SCoPE design multiplexing and clean, automated, and miniaturized sample preparation. Synergistically applied, these solutions enable quantifying thousands of proteins across many single cells and establish a solid foundation for further advances. Building upon this foundation, the SCoPE concept will enable analyzing subcellular organelles and posttranslational modifications, while increases in multiplexing capabilities will increase the throughput and decrease cost.
Collapse
Affiliation(s)
- Nikolai Slavov
- Department of Bioengineering, Northeastern University, Boston, MA, 02115, USA; Barnett Institute, Northeastern University, Boston, MA, 02115, USA; Department of Biology, Northeastern University, Boston, MA, 02115, USA.
| |
Collapse
|
63
|
Prianichnikov N, Koch H, Koch S, Lubeck M, Heilig R, Brehmer S, Fischer R, Cox J. MaxQuant Software for Ion Mobility Enhanced Shotgun Proteomics. Mol Cell Proteomics 2020; 19:1058-1069. [PMID: 32156793 PMCID: PMC7261821 DOI: 10.1074/mcp.tir119.001720] [Citation(s) in RCA: 106] [Impact Index Per Article: 26.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2019] [Revised: 01/31/2020] [Indexed: 01/08/2023] Open
Abstract
Ion mobility can add a dimension to LC-MS based shotgun proteomics which has the potential to boost proteome coverage, quantification accuracy and dynamic range. Required for this is suitable software that extracts the information contained in the four-dimensional (4D) data space spanned by m/z, retention time, ion mobility and signal intensity. Here we describe the ion mobility enhanced MaxQuant software, which utilizes the added data dimension. It offers an end to end computational workflow for the identification and quantification of peptides and proteins in LC-IMS-MS/MS shotgun proteomics data. We apply it to trapped ion mobility spectrometry (TIMS) coupled to a quadrupole time-of-flight (QTOF) analyzer. A highly parallelizable 4D feature detection algorithm extracts peaks which are assembled to isotope patterns. Masses are recalibrated with a non-linear m/z, retention time, ion mobility and signal intensity dependent model, based on peptides from the sample. A new matching between runs (MBR) algorithm that utilizes collisional cross section (CCS) values of MS1 features in the matching process significantly gains specificity from the extra dimension. Prerequisite for using CCS values in MBR is a relative alignment of the ion mobility values between the runs. The missing value problem in protein quantification over many samples is greatly reduced by CCS aware MBR.MS1 level label-free quantification is also implemented which proves to be highly precise and accurate on a benchmark dataset with known ground truth. MaxQuant for LC-IMS-MS/MS is part of the basic MaxQuant release and can be downloaded from http://maxquant.org.
Collapse
Affiliation(s)
- Nikita Prianichnikov
- Computational Systems Biochemistry Research Group, Max-Planck Institute of Biochemistry, Am Klopferspitz 18, 82152 Martinsried, Germany
| | - Heiner Koch
- Bruker Daltonik GmbH, Farenheitstr. 4, 28359 Bremen, Germany
| | - Scarlet Koch
- Bruker Daltonik GmbH, Farenheitstr. 4, 28359 Bremen, Germany
| | - Markus Lubeck
- Bruker Daltonik GmbH, Farenheitstr. 4, 28359 Bremen, Germany
| | - Raphael Heilig
- Nuffield Department of Medicine, Target Discovery Institute, University of Oxford, Oxford, United Kingdom
| | - Sven Brehmer
- Bruker Daltonik GmbH, Farenheitstr. 4, 28359 Bremen, Germany
| | - Roman Fischer
- Nuffield Department of Medicine, Target Discovery Institute, University of Oxford, Oxford, United Kingdom
| | - Jürgen Cox
- Computational Systems Biochemistry Research Group, Max-Planck Institute of Biochemistry, Am Klopferspitz 18, 82152 Martinsried, Germany; Department of Biological and Medical Psychology, University of Bergen, Bergen, Norway.
| |
Collapse
|
64
|
Proteomic analysis reveals the damaging role of low redox laccase from Yersinia enterocolitica strain 8081 in the midgut of Helicoverpa armigera. Biotechnol Lett 2020; 42:2189-2210. [PMID: 32472187 DOI: 10.1007/s10529-020-02925-x] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2020] [Accepted: 05/25/2020] [Indexed: 01/25/2023]
Abstract
OBJECTIVE Earlier, we have found that the enteropathogenic Yersinia enterocolitica have evolved the survival mechanisms that regulate the expression of laccase-encoding genes in the gut. The present study aims to characterize the purified recombinant laccase from Y. enterocolitica strain 8081 biovar 1B and understand its effect on the midgut of cotton bollworm, Helicoverpa armigera (Hübner) larvae. RESULTS The recombinant laccase protein showed high purity fold and low molecular mass (~ 43 kDa). H. armigera larvae fed with laccase protein showed a significant decrease in body weight and damage in the midgut. Further, transmission electron microscopy (TEM) studies revealed the negative effect of laccase protein on trachea, malpighian tubules, and villi of the insect. The proteome comparison between control and laccase-fed larvae of cotton bollworm showed significant expression of proteolytic enzymes, oxidoreductases, cytoskeletal proteins, ribosomal proteins; and proteins for citrate (TCA cycle) cycle, glycolysis, stress response, cell redox homeostasis, xenobiotic degradation, and insect defence. Moreover, it also resulted in the reduction of antioxidants, increased melanization (insect innate immune response), and enhanced free radical generation. CONCLUSIONS All these data collectively suggest that H. armigera (Hübner) larvae can be used to study the effect of microbes and their metabolites on the host physiology, anatomy, and survival.
Collapse
|
65
|
Chang HY, Kong AT, da Veiga Leprevost F, Avtonomov DM, Haynes SE, Nesvizhskii AI. Crystal-C: A Computational Tool for Refinement of Open Search Results. J Proteome Res 2020; 19:2511-2515. [PMID: 32338005 DOI: 10.1021/acs.jproteome.0c00119] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/16/2023]
Abstract
Shotgun proteomics using liquid chromatography coupled to mass spectrometry (LC-MS) is commonly used to identify peptides containing post-translational modifications. With the emergence of fast database search tools such as MSFragger, the approach of enlarging precursor mass tolerances during the search (termed "open search") has been increasingly used for comprehensive characterization of post-translational and chemical modifications of protein samples. However, not all mass shifts detected using the open search strategy represent true modifications, as artifacts exist from sources such as unaccounted missed cleavages or peptide co-fragmentation (chimeric MS/MS spectra). Here, we present Crystal-C, a computational tool that detects and removes such artifacts from open search results. Our analysis using Crystal-C shows that, in a typical shotgun proteomics data set, the number of such observations is relatively small. Nevertheless, removing these artifacts helps to simplify the interpretation of the mass shift histograms, which in turn should improve the ability of open search-based tools to detect potentially interesting mass shifts for follow-up investigation.
Collapse
Affiliation(s)
- Hui-Yin Chang
- Department of Pathology, University of Michigan, Ann Arbor, Michigan 48109, United States
| | - Andy T Kong
- Department of Pathology, University of Michigan, Ann Arbor, Michigan 48109, United States
| | | | - Dmitry M Avtonomov
- Department of Pathology, University of Michigan, Ann Arbor, Michigan 48109, United States
| | - Sarah E Haynes
- Department of Pathology, University of Michigan, Ann Arbor, Michigan 48109, United States
| | - Alexey I Nesvizhskii
- Department of Pathology, University of Michigan, Ann Arbor, Michigan 48109, United States.,Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, Michigan 48109, United States
| |
Collapse
|
66
|
Shi Q, Chen W, Huang S, Wang Y, Xue Z. Deep learning for mining protein data. Brief Bioinform 2019; 22:194-218. [PMID: 31867611 DOI: 10.1093/bib/bbz156] [Citation(s) in RCA: 31] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2019] [Revised: 10/21/2019] [Accepted: 11/07/2019] [Indexed: 01/16/2023] Open
Abstract
The recent emergence of deep learning to characterize complex patterns of protein big data reveals its potential to address the classic challenges in the field of protein data mining. Much research has revealed the promise of deep learning as a powerful tool to transform protein big data into valuable knowledge, leading to scientific discoveries and practical solutions. In this review, we summarize recent publications on deep learning predictive approaches in the field of mining protein data. The application architectures of these methods include multilayer perceptrons, stacked autoencoders, deep belief networks, two- or three-dimensional convolutional neural networks, recurrent neural networks, graph neural networks, and complex neural networks and are described from five perspectives: residue-level prediction, sequence-level prediction, three-dimensional structural analysis, interaction prediction, and mass spectrometry data mining. The advantages and deficiencies of these architectures are presented in relation to various tasks in protein data mining. Additionally, some practical issues and their future directions are discussed, such as robust deep learning for protein noisy data, architecture optimization for specific tasks, efficient deep learning for limited protein data, multimodal deep learning for heterogeneous protein data, and interpretable deep learning for protein understanding. This review provides comprehensive perspectives on general deep learning techniques for protein data analysis.
Collapse
Affiliation(s)
- Qiang Shi
- School of Software Engineering, Huazhong University of Science and Technology. His main interests cover machine learning especially deep learning, protein data analysis, and big data mining
| | - Weiya Chen
- School of Software Engineering, Huazhong University of Science & Technology, Wuhan, China. His research interests cover bioinformatics, virtual reality, and data visualization
| | - Siqi Huang
- Software Engineering at Huazhong University of science and technology, focusing on Machine learning and data mining
| | - Yan Wang
- School of life, University of Science & Technology; her main interests cover protein structure and function prediction and big data mining
| | - Zhidong Xue
- School of Software Engineering, Huazhong University of Science & Technology, Wuhan, China. His research interests cover bioinformatics, machine learning, and image processing
| |
Collapse
|
67
|
Callahan N, Tullman J, Kelman Z, Marino J. Strategies for Development of a Next-Generation Protein Sequencing Platform. Trends Biochem Sci 2019; 45:76-89. [PMID: 31676211 DOI: 10.1016/j.tibs.2019.09.005] [Citation(s) in RCA: 38] [Impact Index Per Article: 7.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2019] [Revised: 09/11/2019] [Accepted: 09/17/2019] [Indexed: 02/08/2023]
Abstract
Proteomic analysis can be a critical bottleneck in cellular characterization. The current paradigm relies primarily on mass spectrometry of peptides and affinity reagents (i.e., antibodies), both of which require a priori knowledge of the sample. An unbiased protein sequencing method, with a dynamic range that covers the full range of protein concentrations in proteomes, would revolutionize the field of proteomics, allowing a more facile characterization of novel gene products and subcellular complexes. To this end, several new platforms based on single-molecule protein-sequencing approaches have been proposed. This review summarizes four of these approaches, highlighting advantages, limitations, and challenges for each method towards advancing as a core technology for next-generation protein sequencing.
Collapse
Affiliation(s)
- Nicholas Callahan
- Institute for Bioscience and Biotechnology Research, National Institute of Standards and Technology, and University of Maryland, Rockville, MD 20850, USA.
| | - Jennifer Tullman
- Institute for Bioscience and Biotechnology Research, National Institute of Standards and Technology, and University of Maryland, Rockville, MD 20850, USA
| | - Zvi Kelman
- Institute for Bioscience and Biotechnology Research, National Institute of Standards and Technology, and University of Maryland, Rockville, MD 20850, USA; Biomolecular Labeling Laboratory, Institute for Bioscience and Biotechnology Research, Rockville, MD 20850, USA
| | - John Marino
- Institute for Bioscience and Biotechnology Research, National Institute of Standards and Technology, and University of Maryland, Rockville, MD 20850, USA
| |
Collapse
|
68
|
Chasing Intracellular Zika Virus Using Proteomics. Viruses 2019; 11:v11090878. [PMID: 31546825 PMCID: PMC6783930 DOI: 10.3390/v11090878] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2019] [Revised: 09/11/2019] [Accepted: 09/17/2019] [Indexed: 12/11/2022] Open
Abstract
Flaviviruses are the most medically relevant group of arboviruses causing a wide range of diseases in humans and are associated with high mortality and morbidity, as such posing a major health concern. Viruses belonging to this family can be endemic (e.g., dengue virus), but can also cause fulminant outbreaks (e.g., West Nile virus, Japanese encephalitis virus and Zika virus). Intense research efforts in the past decades uncovered shared fundamental strategies used by flaviviruses to successfully replicate in their respective hosts. However, the distinct features contributing to the specific host and tissue tropism as well as the pathological outcomes unique to each individual flavivirus are still largely elusive. The profound footprint of individual viruses on their respective hosts can be investigated using novel technologies in the field of proteomics that have rapidly developed over the last decade. An unprecedented sensitivity and throughput of mass spectrometers, combined with the development of new sample preparation and bioinformatics analysis methods, have made the systematic investigation of virus-host interactions possible. Furthermore, the ability to assess dynamic alterations in protein abundances, protein turnover rates and post-translational modifications occurring in infected cells now offer the unique possibility to unravel complex viral perturbations induced in the infected host. In this review, we discuss the most recent contributions of mass spectrometry-based proteomic approaches in flavivirus biology with a special focus on Zika virus, and their basic and translational potential and implications in understanding and characterizing host responses to arboviral infections.
Collapse
|
69
|
Elpa DP, Prabhu GRD, Wu SP, Tay KS, Urban PL. Automation of mass spectrometric detection of analytes and related workflows: A review. Talanta 2019; 208:120304. [PMID: 31816721 DOI: 10.1016/j.talanta.2019.120304] [Citation(s) in RCA: 26] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2019] [Revised: 08/26/2019] [Accepted: 08/28/2019] [Indexed: 12/13/2022]
Abstract
The developments in mass spectrometry (MS) in the past few decades reveal the power and versatility of this technology. MS methods are utilized in routine analyses as well as research activities involving a broad range of analytes (elements and molecules) and countless matrices. However, manual MS analysis is gradually becoming a thing of the past. In this article, the available MS automation strategies are critically evaluated. Automation of analytical workflows culminating with MS detection encompasses involvement of automated operations in any of the steps related to sample handling/treatment before MS detection, sample introduction, MS data acquisition, and MS data processing. Automated MS workflows help to overcome the intrinsic limitations of MS methodology regarding reproducibility, throughput, and the expertise required to operate MS instruments. Such workflows often comprise automated off-line and on-line steps such as sampling, extraction, derivatization, and separation. The most common instrumental tools include autosamplers, multi-axis robots, flow injection systems, and lab-on-a-chip. Prototyping customized automated MS systems is a way to introduce non-standard automated features to MS workflows. The review highlights the enabling role of automated MS procedures in various sectors of academic research and industry. Examples include applications of automated MS workflows in bioscience, environmental studies, and exploration of the outer space.
Collapse
Affiliation(s)
- Decibel P Elpa
- Department of Applied Chemistry, National Chiao Tung University, 1001 University Rd., Hsinchu, 300, Taiwan; Department of Chemistry, National Tsing Hua University, 101, Section 2, Kuang-Fu Rd., Hsinchu, 30013, Taiwan
| | - Gurpur Rakesh D Prabhu
- Department of Applied Chemistry, National Chiao Tung University, 1001 University Rd., Hsinchu, 300, Taiwan; Department of Chemistry, National Tsing Hua University, 101, Section 2, Kuang-Fu Rd., Hsinchu, 30013, Taiwan
| | - Shu-Pao Wu
- Department of Applied Chemistry, National Chiao Tung University, 1001 University Rd., Hsinchu, 300, Taiwan.
| | - Kheng Soo Tay
- Department of Chemistry, Faculty of Science, University of Malaya, 50603 Kuala Lumpur, Malaysia
| | - Pawel L Urban
- Department of Chemistry, National Tsing Hua University, 101, Section 2, Kuang-Fu Rd., Hsinchu, 30013, Taiwan; Frontier Research Center on Fundamental and Applied Sciences of Matters, National Tsing Hua University, 101, Section 2, Kuang-Fu Rd., Hsinchu, 30013, Taiwan.
| |
Collapse
|
70
|
Brademan DR, Riley NM, Kwiecien NW, Coon JJ. Interactive Peptide Spectral Annotator: A Versatile Web-based Tool for Proteomic Applications. Mol Cell Proteomics 2019; 18:S193-S201. [PMID: 31088857 PMCID: PMC6692776 DOI: 10.1074/mcp.tir118.001209] [Citation(s) in RCA: 99] [Impact Index Per Article: 19.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2018] [Revised: 03/21/2019] [Indexed: 11/06/2022] Open
Abstract
Here we present IPSA, an innovative web-based spectrum annotator that visualizes and characterizes peptide tandem mass spectra. A tool for the scientific community, IPSA can visualize peptides collected using a wide variety of experimental and instrumental configurations. Annotated spectra are customizable via a selection of interactive features and can be exported as editable scalable vector graphics to aid in the production of publication-quality figures. Single spectra can be analyzed through provided web forms, whereas data for multiple peptide spectral matches can be uploaded using the Proteomics Standards Initiative file formats mzTab, mzIdentML, and mzML. Alternatively, peptide identifications and spectral data can be provided using generic file formats. IPSA provides supports for annotating spectra collecting using negative-mode ionization and facilitates the characterization of experimental MS/MS performance through the optional export of fragment ion statistics from one to many peptide spectral matches. This resource is made freely accessible at http://interactivepeptidespectralannotator.com, whereas the source code and user guides are available at https://github.com/coongroup/IPSA for private hosting or custom implementations.
Collapse
Affiliation(s)
- Dain R Brademan
- Department of Chemistry, University of Wisconsin-Madison, Madison, WI 53706; Genome Center of Wisconsin, Madison, WI 53706
| | - Nicholas M Riley
- Department of Chemistry, University of Wisconsin-Madison, Madison, WI 53706; Genome Center of Wisconsin, Madison, WI 53706
| | - Nicholas W Kwiecien
- Department of Chemistry, University of Wisconsin-Madison, Madison, WI 53706; Genome Center of Wisconsin, Madison, WI 53706
| | - Joshua J Coon
- Department of Chemistry, University of Wisconsin-Madison, Madison, WI 53706; Morgridge Institute for Research, Madison, WI 53715; Genome Center of Wisconsin, Madison, WI 53706; Department of Biomolecular Chemistry, University of Wisconsin-Madison, Madison, WI 53706.
| |
Collapse
|
71
|
Saleh S, Staes A, Deborggraeve S, Gevaert K. Targeted Proteomics for Studying Pathogenic Bacteria. Proteomics 2019; 19:e1800435. [DOI: 10.1002/pmic.201800435] [Citation(s) in RCA: 25] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2019] [Revised: 06/04/2019] [Indexed: 02/04/2023]
Affiliation(s)
- Sara Saleh
- Department of Biomedical SciencesInstitute of Tropical Medicine B‐2000 Antwerp Belgium
- VIB Center for Medical Biotechnology B‐9000 Ghent Belgium
- Department of Biomolecular MedicineGhent University B‐9000 Ghent Belgium
| | - An Staes
- VIB Center for Medical Biotechnology B‐9000 Ghent Belgium
- Department of Biomolecular MedicineGhent University B‐9000 Ghent Belgium
| | - Stijn Deborggraeve
- Department of Biomedical SciencesInstitute of Tropical Medicine B‐2000 Antwerp Belgium
| | - Kris Gevaert
- VIB Center for Medical Biotechnology B‐9000 Ghent Belgium
- Department of Biomolecular MedicineGhent University B‐9000 Ghent Belgium
| |
Collapse
|
72
|
Tiwary S, Levy R, Gutenbrunner P, Salinas Soto F, Palaniappan KK, Deming L, Berndl M, Brant A, Cimermancic P, Cox J. High-quality MS/MS spectrum prediction for data-dependent and data-independent acquisition data analysis. Nat Methods 2019; 16:519-525. [PMID: 31133761 DOI: 10.1038/s41592-019-0427-6] [Citation(s) in RCA: 150] [Impact Index Per Article: 30.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2018] [Accepted: 04/19/2019] [Indexed: 02/05/2023]
Abstract
Peptide fragmentation spectra are routinely predicted in the interpretation of mass-spectrometry-based proteomics data. However, the generation of fragment ions has not been understood well enough for scientists to estimate fragment ion intensities accurately. Here, we demonstrate that machine learning can predict peptide fragmentation patterns in mass spectrometers with accuracy within the uncertainty of measurement. Moreover, analysis of our models reveals that peptide fragmentation depends on long-range interactions within a peptide sequence. We illustrate the utility of our models by applying them to the analysis of both data-dependent and data-independent acquisition datasets. In the former case, we observe a q-value-dependent increase in the total number of peptide identifications. In the latter case, we confirm that the use of predicted tandem mass spectrometry spectra is nearly equivalent to the use of spectra from experimental libraries.
Collapse
Affiliation(s)
- Shivani Tiwary
- Computational Systems Biochemistry Research Group, Max Planck Institute of Biochemistry, Martinsried, Germany
| | - Roie Levy
- Verily Life Sciences, South San Francisco, CA, USA
| | - Petra Gutenbrunner
- Computational Systems Biochemistry Research Group, Max Planck Institute of Biochemistry, Martinsried, Germany
| | - Favio Salinas Soto
- Computational Systems Biochemistry Research Group, Max Planck Institute of Biochemistry, Martinsried, Germany
| | | | | | | | - Arthur Brant
- Verily Life Sciences, South San Francisco, CA, USA
| | | | - Jürgen Cox
- Computational Systems Biochemistry Research Group, Max Planck Institute of Biochemistry, Martinsried, Germany. .,Department of Biological and Medical Psychology, University of Bergen, Bergen, Norway.
| |
Collapse
|
73
|
Prosit: proteome-wide prediction of peptide tandem mass spectra by deep learning. Nat Methods 2019; 16:509-518. [DOI: 10.1038/s41592-019-0426-7] [Citation(s) in RCA: 340] [Impact Index Per Article: 68.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2018] [Accepted: 04/18/2019] [Indexed: 11/08/2022]
|
74
|
Rudolph JD, Cox J. A Network Module for the Perseus Software for Computational Proteomics Facilitates Proteome Interaction Graph Analysis. J Proteome Res 2019; 18:2052-2064. [PMID: 30931570 PMCID: PMC6578358 DOI: 10.1021/acs.jproteome.8b00927] [Citation(s) in RCA: 37] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
Abstract
Proteomics data analysis strongly benefits from not studying single proteins in isolation but taking their multivariate interdependence into account. We introduce PerseusNet, the new Perseus network module for the biological analysis of proteomics data. Proteomics is commonly used to generate networks, e.g., with affinity purification experiments, but networks are also used to explore proteomics data. PerseusNet supports the biomedical researcher for both modes of data analysis with a multitude of activities. For affinity purification, a volcano-plot-based statistical analysis method for network generation is featured which is scalable to large numbers of baits. For posttranslational modifications of proteins, such as phosphorylation, a collection of dedicated network analysis tools helps in elucidating cellular signaling events. Co-expression network analysis of proteomics data adopts established tools from transcriptome co-expression analysis. PerseusNet is extensible through a plugin architecture in a multi-lingual way, integrating analyses in C#, Python, and R, and is freely available at http://www.perseus-framework.org .
Collapse
Affiliation(s)
- Jan Daniel Rudolph
- Computational Systems Biochemistry , Max-Planck Institute of Biochemistry , Am Klopferspitz 18 , 82152 Martinsried , Germany
| | - Jürgen Cox
- Computational Systems Biochemistry , Max-Planck Institute of Biochemistry , Am Klopferspitz 18 , 82152 Martinsried , Germany.,Department of Biological and Medical Psychology , University of Bergen , Jonas Liesvei 91 , 5009 Bergen , Norway
| |
Collapse
|
75
|
Saito MA, Bertrand EM, Duffy ME, Gaylord DA, Held NA, Hervey WJ, Hettich RL, Jagtap PD, Janech MG, Kinkade DB, Leary DH, McIlvin MR, Moore EK, Morris RM, Neely BA, Nunn BL, Saunders JK, Shepherd AI, Symmonds NI, Walsh DA. Progress and Challenges in Ocean Metaproteomics and Proposed Best Practices for Data Sharing. J Proteome Res 2019; 18:1461-1476. [PMID: 30702898 DOI: 10.1021/acs.jproteome.8b00761] [Citation(s) in RCA: 38] [Impact Index Per Article: 7.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Ocean metaproteomics is an emerging field enabling discoveries about marine microbial communities and their impact on global biogeochemical processes. Recent ocean metaproteomic studies have provided insight into microbial nutrient transport, colimitation of carbon fixation, the metabolism of microbial biofilms, and dynamics of carbon flux in marine ecosystems. Future methodological developments could provide new capabilities such as characterizing long-term ecosystem changes, biogeochemical reaction rates, and in situ stoichiometries. Yet challenges remain for ocean metaproteomics due to the great biological diversity that produces highly complex mass spectra, as well as the difficulty in obtaining and working with environmental samples. This review summarizes the progress and challenges facing ocean metaproteomic scientists and proposes best practices for data sharing of ocean metaproteomic data sets, including the data types and metadata needed to enable intercomparisons of protein distributions and annotations that could foster global ocean metaproteomic capabilities.
Collapse
Affiliation(s)
- Mak A Saito
- Woods Hole Oceanographic Institution , Woods Hole , Massachusetts 02543 , United States
| | - Erin M Bertrand
- Department of Biology , Dalhousie University , Halifax , Nova Scotia B3H 4R2 , Canada
| | - Megan E Duffy
- School of Oceanography , University of Washington , Seattle , Washington 98195-7940 , United States
| | - David A Gaylord
- Woods Hole Oceanographic Institution , Woods Hole , Massachusetts 02543 , United States
| | - Noelle A Held
- Woods Hole Oceanographic Institution , Woods Hole , Massachusetts 02543 , United States
| | | | - Robert L Hettich
- Oak Ridge National Laboratory and Microbiology Department , University of Tennessee , Knoxville , Tennessee 37996 , United States
| | - Pratik D Jagtap
- Department of Biochemistry, Molecular Biology and Biophysics , University of Minnesota , Saint Paul , Minnesota 55108 , United States
| | - Michael G Janech
- College of Charleston , Charleston , South Carolina 29424 , United States
| | - Danie B Kinkade
- Woods Hole Oceanographic Institution , Woods Hole , Massachusetts 02543 , United States
| | - Dagmar H Leary
- U.S. Naval Research Laboratory , Washington , D.C. 20375 , United States
| | - Matthew R McIlvin
- Woods Hole Oceanographic Institution , Woods Hole , Massachusetts 02543 , United States
| | - Eli K Moore
- Department of Environmental Science , Rowan University , Glassboro , New Jersey 08028 , United States
| | - Robert M Morris
- School of Oceanography , University of Washington , Seattle , Washington 98195-7940 , United States
| | - Benjamin A Neely
- National Institute of Standards and Technology , Charleston , South Carolina 29412 , United States
| | - Brook L Nunn
- Department of Genome Sciences , University of Washington , Seattle , Washington 98195 , United States
| | - Jaclyn K Saunders
- Woods Hole Oceanographic Institution , Woods Hole , Massachusetts 02543 , United States.,School of Oceanography , University of Washington , Seattle , Washington 98195-7940 , United States
| | - Adam I Shepherd
- Woods Hole Oceanographic Institution , Woods Hole , Massachusetts 02543 , United States
| | - Nicholas I Symmonds
- Woods Hole Oceanographic Institution , Woods Hole , Massachusetts 02543 , United States
| | - David A Walsh
- Department of Biology , Concordia University , Montreal , Quebec H4B 1R6 , Canada
| |
Collapse
|
76
|
Doll S, Gnad F, Mann M. The Case for Proteomics and Phospho-Proteomics in Personalized Cancer Medicine. Proteomics Clin Appl 2019; 13:e1800113. [PMID: 30790462 PMCID: PMC6519247 DOI: 10.1002/prca.201800113] [Citation(s) in RCA: 66] [Impact Index Per Article: 13.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2018] [Revised: 02/01/2019] [Indexed: 02/06/2023]
Abstract
The concept of personalized medicine is predominantly been pursued through genomic and transcriptomic technologies, leading to the identification of multiple mutations in a large variety of cancers. However, it has proven challenging to distinguish driver and passenger mutations and to deal with tumor heterogeneity and resistant clonal populations. More generally, these heterogeneous mutation patterns do not in themselves predict the tumor phenotype. Analysis of the expressed proteins in a tumor and their modification states reveals if and how these mutations are translated to the functional level. It is already known that proteomic changes including posttranslational modifications are crucial drivers of oncogenesis, but proteomics technology has only recently become comparable in depth and accuracy to RNAseq. These advances also allow the rapid and highly sensitive analysis of formalin-fixed and paraffin-embedded biobank tissues, on both the proteome and phosphoproteome levels. In this perspective, pioneering mass spectrometry-based proteomic studies are highlighted that pave the way toward clinical implementation. It is argued that proteomics and phosphoproteomics could provide the missing link to make omics analysis actionable in the clinic.
Collapse
Affiliation(s)
- Sophia Doll
- Department of Proteomics and Signal TransductionMax Planck Institute of Biochemistry82152MartinsriedGermany
- NNF Center for Protein ResearchFaculty of Health SciencesUniversity of CopenhagenCopenhagenDenmark
| | - Florian Gnad
- Department of Bioinformatics and Computational BiologyCell Signaling Technology Inc01923DanversMAUSA
| | - Matthias Mann
- Department of Proteomics and Signal TransductionMax Planck Institute of Biochemistry82152MartinsriedGermany
- NNF Center for Protein ResearchFaculty of Health SciencesUniversity of CopenhagenCopenhagenDenmark
| |
Collapse
|
77
|
Emmott E, Jovanovic M, Slavov N. Approaches for Studying Ribosome Specialization. Trends Biochem Sci 2019; 44:478-479. [PMID: 30792028 DOI: 10.1016/j.tibs.2019.01.008] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2019] [Accepted: 01/23/2019] [Indexed: 12/14/2022]
Abstract
Contrary to the textbook model, recent measurements demonstrated unexpected diversity in ribosomal composition that likely enables specialized translational functions. Methods based on liquid chromatography coupled to tandem mass-spectrometry (LC-MS/MS) enable direct quantification of ribosomal proteins with high specificity, accuracy, and throughput. LC-MS/MS can be 'top-down', analyzing intact proteins, or more commonly 'bottom-up', where proteins are digested to peptides prior to analysis. Changes to rRNA can be examined using either LC-MS/MS or sequencing-based approaches. The regulation of protein synthesis by specialized ribosomes can be examined by multiple methods. These include the popular 'Ribo-Seq' method for analyzing ribosome density on a given mRNA, as well as LC-MS/MS approaches incorporating pulse-labelling with stable isotopes (SILAC) to monitor protein synthesis and degradation.
Collapse
Affiliation(s)
- Edward Emmott
- Department of Bioengineering, Northeastern University, Boston, MA, USA; Barnett Institute for Chemical and Biological Analysis, Northeastern University, Boston, MA, USA
| | - Marko Jovanovic
- Department of Biological Sciences, Columbia University, New York, NY, USA
| | - Nikolai Slavov
- Department of Bioengineering, Northeastern University, Boston, MA, USA; Barnett Institute for Chemical and Biological Analysis, Northeastern University, Boston, MA, USA; Department of Biology, Northeastern University, Boston, MA, USA.
| |
Collapse
|
78
|
Wichmann C, Meier F, Virreira Winter S, Brunner AD, Cox J, Mann M. MaxQuant.Live Enables Global Targeting of More Than 25,000 Peptides. Mol Cell Proteomics 2019; 18:982-994. [PMID: 30755466 PMCID: PMC6495250 DOI: 10.1074/mcp.tir118.001131] [Citation(s) in RCA: 78] [Impact Index Per Article: 15.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2018] [Revised: 02/04/2019] [Accepted: 02/04/2019] [Indexed: 11/06/2022] Open
Abstract
Mass spectrometry (MS)-based proteomics is often performed in a shotgun format, in which as many peptide precursors as possible are selected from full or MS1 scans so that their fragment spectra can be recorded in MS2 scans. Although achieving great proteome depths, shotgun proteomics cannot guarantee that each precursor will be fragmented in each run. In contrast, targeted proteomics aims to reproducibly and sensitively record a restricted number of precursor/fragment combinations in each run, based on prescheduled mass-to-charge and retention time windows. Here we set out to unify these two concepts by a global targeting approach in which an arbitrary number of precursors of interest are detected in real-time, followed by standard fragmentation or advanced peptide-specific analyses. We made use of a fast application programming interface to a quadrupole Orbitrap instrument and real-time recalibration in mass, retention time and intensity dimensions to predict precursor identity. MaxQuant.Live is freely available (www.maxquant.live) and has a graphical user interface to specify many predefined data acquisition strategies. Acquisition speed is as fast as with the vendor software and the power of our approach is demonstrated with the acquisition of breakdown curves for hundreds of precursors of interest. We also uncover precursors that are not even visible in MS1 scans, using elution time prediction based on the auto-adjusted retention time alone. Finally, we successfully recognized and targeted more than 25,000 peptides in single LC-MS runs. Global targeting combines the advantages of two classical approaches in MS-based proteomics, whereas greatly expanding the analytical toolbox.
Collapse
Affiliation(s)
- Christoph Wichmann
- From the ‡Computational Systems Biochemistry, Max-Planck Institute of Biochemistry, Am Klopferspitz 18, 82152 Martinsried, Germany
| | - Florian Meier
- §Proteomics and Signal Transduction, Max-Planck Institute of Biochemistry, Am Klopferspitz 18, 82152 Martinsried, Germany.,¶NNF Center for Protein Research, Faculty of Health Sciences, University of Copenhagen, Blegdamsvej 3B, 2200 Copenhagen, Denmark
| | - Sebastian Virreira Winter
- §Proteomics and Signal Transduction, Max-Planck Institute of Biochemistry, Am Klopferspitz 18, 82152 Martinsried, Germany
| | - Andreas-David Brunner
- §Proteomics and Signal Transduction, Max-Planck Institute of Biochemistry, Am Klopferspitz 18, 82152 Martinsried, Germany
| | - Jürgen Cox
- From the ‡Computational Systems Biochemistry, Max-Planck Institute of Biochemistry, Am Klopferspitz 18, 82152 Martinsried, Germany;
| | - Matthias Mann
- §Proteomics and Signal Transduction, Max-Planck Institute of Biochemistry, Am Klopferspitz 18, 82152 Martinsried, Germany; .,¶NNF Center for Protein Research, Faculty of Health Sciences, University of Copenhagen, Blegdamsvej 3B, 2200 Copenhagen, Denmark
| |
Collapse
|
79
|
Emmott E, Jovanovic M, Slavov N. Ribosome Stoichiometry: From Form to Function. Trends Biochem Sci 2019; 44:95-109. [PMID: 30473427 PMCID: PMC6340777 DOI: 10.1016/j.tibs.2018.10.009] [Citation(s) in RCA: 57] [Impact Index Per Article: 11.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2018] [Revised: 08/27/2018] [Accepted: 10/20/2018] [Indexed: 12/11/2022]
Abstract
The existence of eukaryotic ribosomes with distinct ribosomal protein (RP) stoichiometry and regulatory roles in protein synthesis has been speculated for over 60 years. Recent advances in mass spectrometry (MS) and high-throughput analysis have begun to identify and characterize distinct ribosome stoichiometry in yeast and mammalian systems. In addition to RP stoichiometry, ribosomes host a vast array of protein modifications, effectively expanding the number of human RPs from 80 to many thousands of distinct proteoforms. Is it possible that these proteoforms combine to function as a 'ribosome code' to tune protein synthesis? We outline the specific benefits that translational regulation by specialized ribosomes can offer and discuss the means and methodologies available to correlate and characterize RP stoichiometry with function. We highlight previous research with a focus on formulating hypotheses that can guide future experiments and crack the ribosome code.
Collapse
Affiliation(s)
- Edward Emmott
- Department of Bioengineering, Northeastern University, Boston, MA, USA
| | - Marko Jovanovic
- Department of Biological Sciences, Columbia University, New York, NY, USA
| | - Nikolai Slavov
- Department of Bioengineering, Northeastern University, Boston, MA, USA; Department of Biology, Northeastern University, Boston, MA, USA.
| |
Collapse
|
80
|
Malioutov D, Chen T, Airoldi E, Jaffe J, Budnik B, Slavov N. Quantifying Homologous Proteins and Proteoforms. Mol Cell Proteomics 2019; 18:162-168. [PMID: 30282776 PMCID: PMC6317479 DOI: 10.1074/mcp.tir118.000947] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2018] [Indexed: 01/30/2023] Open
Abstract
Many proteoforms-arising from alternative splicing, post-translational modifications (PTM), or paralogous genes-have distinct biological functions, such as histone PTM proteoforms. However, their quantification by existing bottom-up mass-spectrometry (MS) methods is undermined by peptide-specific biases. To avoid these biases, we developed and implemented a first-principles model (HIquant) for quantifying proteoform stoichiometries. We characterized when MS data allow inferring proteoform stoichiometries by HIquant and derived an algorithm for optimal inference. We applied this algorithm to infer proteoform stoichiometries in two experimental systems that supported rigorous bench-marking: alkylated proteoforms spiked-in at known ratios and endogenous histone 3 PTM proteoforms quantified relative to internal heavy standards. When compared with the benchmarks, the proteoform stoichiometries interfered by HIquant without using external standards had relative error of 5-15% for simple proteoforms and 20-30% for complex proteoforms. A HIquant server is implemented at: https://web.northeastern.edu/slavov/2014HIquant/.
Collapse
Affiliation(s)
- Dmitry Malioutov
- From the ‡T. J. Watson IBM Research Center, 1101 Kitchawan Road, Yorktown Heights, NY 10598
| | - Tianchi Chen
- §Department of Bioengineering, Northeastern University, Boston, MA 02115
| | - Edoardo Airoldi
- ‖Department of Statistics, Harvard University, Cambridge, MA 02138
| | - Jacob Jaffe
- ¶Proteomics Platform, Broad Institute of MIT and Harvard, Cambridge, MA 02142
| | - Bogdan Budnik
- *MSPRL, FAS Division of Science, Harvard University, Cambridge, MA 02138
| | - Nikolai Slavov
- §Department of Bioengineering, Northeastern University, Boston, MA 02115;.
| |
Collapse
|
81
|
Kumar P, Panigrahi P, Johnson J, Weber WJ, Mehta S, Sajulga R, Easterly C, Crooker BA, Heydarian M, Anamika K, Griffin TJ, Jagtap PD. QuanTP: A Software Resource for Quantitative Proteo-Transcriptomic Comparative Data Analysis and Informatics. J Proteome Res 2018; 18:782-790. [DOI: 10.1021/acs.jproteome.8b00727] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Affiliation(s)
- Praveen Kumar
- Bioinformatics and Computational Biology Program, University of Minnesota-Rochester, Rochester, Minnesota 55904, United States
- Department of Biochemistry, Molecular Biology, and Biophysics, University of Minnesota, Minneapolis, Minnesota 55455, United States
| | | | - James Johnson
- Minnesota Supercomputing Institute, University of Minnesota, Minneapolis, Minnesota 55455, United States
| | - Wanda J. Weber
- Department of Animal Science, University of Minnesota, St. Paul, Minnesota 55108, United States
| | - Subina Mehta
- Department of Biochemistry, Molecular Biology, and Biophysics, University of Minnesota, Minneapolis, Minnesota 55455, United States
| | - Ray Sajulga
- Department of Biochemistry, Molecular Biology, and Biophysics, University of Minnesota, Minneapolis, Minnesota 55455, United States
| | - Caleb Easterly
- Department of Biochemistry, Molecular Biology, and Biophysics, University of Minnesota, Minneapolis, Minnesota 55455, United States
| | - Brian A. Crooker
- Department of Animal Science, University of Minnesota, St. Paul, Minnesota 55108, United States
| | - Mohammad Heydarian
- Department of Biology, Johns Hopkins University, Baltimore, Maryland 21218, United States
| | - Krishanpal Anamika
- LABS, Persistent Systems, Aryabhata-Pingala, Erandwane, Pune 411004, India
| | - Timothy J. Griffin
- Department of Biochemistry, Molecular Biology, and Biophysics, University of Minnesota, Minneapolis, Minnesota 55455, United States
| | - Pratik D. Jagtap
- Department of Biochemistry, Molecular Biology, and Biophysics, University of Minnesota, Minneapolis, Minnesota 55455, United States
| |
Collapse
|
82
|
Pascovici D, Wu JX, McKay MJ, Joseph C, Noor Z, Kamath K, Wu Y, Ranganathan S, Gupta V, Mirzaei M. Clinically Relevant Post-Translational Modification Analyses-Maturing Workflows and Bioinformatics Tools. Int J Mol Sci 2018; 20:E16. [PMID: 30577541 PMCID: PMC6337699 DOI: 10.3390/ijms20010016] [Citation(s) in RCA: 31] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2018] [Revised: 12/09/2018] [Accepted: 12/17/2018] [Indexed: 01/04/2023] Open
Abstract
Post-translational modifications (PTMs) can occur soon after translation or at any stage in the lifecycle of a given protein, and they may help regulate protein folding, stability, cellular localisation, activity, or the interactions proteins have with other proteins or biomolecular species. PTMs are crucial to our functional understanding of biology, and new quantitative mass spectrometry (MS) and bioinformatics workflows are maturing both in labelled multiplexed and label-free techniques, offering increasing coverage and new opportunities to study human health and disease. Techniques such as Data Independent Acquisition (DIA) are emerging as promising approaches due to their re-mining capability. Many bioinformatics tools have been developed to support the analysis of PTMs by mass spectrometry, from prediction and identifying PTM site assignment, open searches enabling better mining of unassigned mass spectra-many of which likely harbour PTMs-through to understanding PTM associations and interactions. The remaining challenge lies in extracting functional information from clinically relevant PTM studies. This review focuses on canvassing the options and progress of PTM analysis for large quantitative studies, from choosing the platform, through to data analysis, with an emphasis on clinically relevant samples such as plasma and other body fluids, and well-established tools and options for data interpretation.
Collapse
Affiliation(s)
- Dana Pascovici
- Department of Molecular Sciences, Macquarie University, Sydney, NSW 2109, Australia.
- Australian Proteome Analysis Facility, Macquarie University, Sydney, NSW 2109, Australia.
| | - Jemma X Wu
- Department of Molecular Sciences, Macquarie University, Sydney, NSW 2109, Australia.
- Australian Proteome Analysis Facility, Macquarie University, Sydney, NSW 2109, Australia.
| | - Matthew J McKay
- Department of Molecular Sciences, Macquarie University, Sydney, NSW 2109, Australia.
- Australian Proteome Analysis Facility, Macquarie University, Sydney, NSW 2109, Australia.
| | - Chitra Joseph
- Department of Clinical Medicine, Macquarie University, Sydney, NSW 2109, Australia.
| | - Zainab Noor
- Department of Molecular Sciences, Macquarie University, Sydney, NSW 2109, Australia.
| | - Karthik Kamath
- Department of Molecular Sciences, Macquarie University, Sydney, NSW 2109, Australia.
- Australian Proteome Analysis Facility, Macquarie University, Sydney, NSW 2109, Australia.
| | - Yunqi Wu
- Department of Molecular Sciences, Macquarie University, Sydney, NSW 2109, Australia.
- Australian Proteome Analysis Facility, Macquarie University, Sydney, NSW 2109, Australia.
| | - Shoba Ranganathan
- Department of Molecular Sciences, Macquarie University, Sydney, NSW 2109, Australia.
| | - Vivek Gupta
- Department of Clinical Medicine, Macquarie University, Sydney, NSW 2109, Australia.
| | - Mehdi Mirzaei
- Department of Molecular Sciences, Macquarie University, Sydney, NSW 2109, Australia.
- Australian Proteome Analysis Facility, Macquarie University, Sydney, NSW 2109, Australia.
- Department of Clinical Medicine, Macquarie University, Sydney, NSW 2109, Australia.
| |
Collapse
|
83
|
Fan Z, Kong F, Zhou Y, Chen Y, Dai Y. Intelligence Algorithms for Protein Classification by Mass Spectrometry. BIOMED RESEARCH INTERNATIONAL 2018; 2018:2862458. [PMID: 30534555 PMCID: PMC6252195 DOI: 10.1155/2018/2862458] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Received: 06/26/2018] [Revised: 09/27/2018] [Accepted: 10/29/2018] [Indexed: 11/17/2022]
Abstract
Mass spectrometry (MS) is an important technique in protein research. Effective classification methods by MS data could contribute to early and less-invasive diagnosis and also facilitate developments in the bioinformatics field. As MS data is featured by high dimension, appropriate methods which can effectively deal with the large amount of MS data have been widely studied. In this paper, the applications of methods based on intelligence algorithms have been investigated. Firstly, classification and biomarker analysis methods using typical machine learning approaches have been discussed. Then those are followed by the Ensemble strategy algorithms. Clearly, simple and basic machine learning algorithms hardly addressed the various needs of protein MS classification. Preprocessing algorithms have been also studied, as these methods are useful for feature selection or feature extraction to improve classification performance. Protein MS data growing with data volume becomes complicated and large; improvements in classification methods in terms of classifier selection and combinations of different algorithms and preprocessing algorithms are more emphasized in further work.
Collapse
Affiliation(s)
- Zichuan Fan
- School of Computer and Information Science, Southwest University, Chongqing 400715, China
| | - Fanchen Kong
- School of Computer and Information Science, Southwest University, Chongqing 400715, China
| | - Yang Zhou
- School of Computer and Information Science, Southwest University, Chongqing 400715, China
| | - Yiqing Chen
- School of Computer and Information Science, Southwest University, Chongqing 400715, China
| | - Yalan Dai
- School of Computer and Information Science, Southwest University, Chongqing 400715, China
| |
Collapse
|
84
|
Budnik B, Levy E, Harmange G, Slavov N. SCoPE-MS: mass spectrometry of single mammalian cells quantifies proteome heterogeneity during cell differentiation. Genome Biol 2018; 19:161. [PMID: 30343672 PMCID: PMC6196420 DOI: 10.1186/s13059-018-1547-5] [Citation(s) in RCA: 517] [Impact Index Per Article: 86.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2018] [Accepted: 09/19/2018] [Indexed: 11/16/2022] Open
Abstract
Some exciting biological questions require quantifying thousands of proteins in single cells. To achieve this goal, we develop Single Cell ProtEomics by Mass Spectrometry (SCoPE-MS) and validate its ability to identify distinct human cancer cell types based on their proteomes. We use SCoPE-MS to quantify over a thousand proteins in differentiating mouse embryonic stem cells. The single-cell proteomes enable us to deconstruct cell populations and infer protein abundance relationships. Comparison between single-cell proteomes and transcriptomes indicates coordinated mRNA and protein covariation, yet many genes exhibit functionally concerted and distinct regulatory patterns at the mRNA and the protein level.
Collapse
Affiliation(s)
- Bogdan Budnik
- MSPRL, FAS Division of Science, Harvard University, Cambridge, MA, 02138, USA.
| | - Ezra Levy
- Department of Biology, Northeastern University, Boston, MA, 02115, USA
| | | | - Nikolai Slavov
- Department of Biology, Northeastern University, Boston, MA, 02115, USA.
- Department of Bioengineering, Northeastern University, Boston, MA, 02115, USA.
| |
Collapse
|
85
|
Abstract
Many pressing medical challenges, such as diagnosing disease, enhancing directed stem-cell differentiation, and classifying cancers, have long been hindered by limitations in our ability to quantify proteins in single cells. Mass spectrometry (MS) is poised to transcend these limitations by developing powerful methods to routinely quantify thousands of proteins and proteoforms across many thousands of single cells. We outline specific technological developments and ideas that can increase the sensitivity and throughput of single-cell MS by orders of magnitude and usher in this new age. These advances will transform medicine and ultimately contribute to understanding biological systems on an entirely new level.
Collapse
Affiliation(s)
- Harrison Specht
- Department of Bioengineering, Northeastern University, Boston, Massachusetts 02115, United States
| | - Nikolai Slavov
- Department of Bioengineering, Northeastern University, Boston, Massachusetts 02115, United States
- Department of Biology, Northeastern University, Boston, Massachusetts 02115, United States
| |
Collapse
|
86
|
Lehmann R, Schmidt A, Pastuschek J, Müller MM, Fritzsche A, Dieterle S, Greb RR, Markert UR, Slevogt H. Comparison of sample preparation techniques and data analysis for the LC-MS/MS-based identification of proteins in human follicular fluid. Am J Reprod Immunol 2018; 80:e12994. [PMID: 29938851 DOI: 10.1111/aji.12994] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2018] [Accepted: 05/23/2018] [Indexed: 12/20/2022] Open
Abstract
The proteomic analysis of complex body fluids by liquid chromatography tandem mass spectrometry (LC-MS/MS) analysis requires the selection of suitable sample preparation techniques and optimal parameter settings in data analysis software packages to obtain reliable results. Proteomic analysis of follicular fluid, as a representative of a complex body fluid similar to serum or plasma, is difficult as it contains a vast amount of high abundant proteins and a variety of proteins with different concentrations. However, the accessibility of this complex body fluid for LC-MS/MS analysis is an opportunity to gain insights into the status, the composition of fertility-relevant proteins including immunological factors or for the discovery of new diagnostic and prognostic markers for, for example, the treatment of infertility. In this study, we compared different sample preparation methods (FASP, eFASP and in-solution digestion) and three different data analysis software packages (Proteome Discoverer with SEQUEST, Mascot and MaxQuant with Andromeda) combined with semi- and full-tryptic databank search options to obtain a maximum coverage of the follicular fluid proteome. We found that the most comprehensive proteome coverage is achieved by the eFASP sample preparation method using SDS in the initial denaturing step and the SEQUEST-based semi-tryptic data analysis. In conclusion, we have developed a fractionation-free methodical workflow for in depth LC-MS/MS-based analysis for the standardized investigation of human follicle fluid as an important representative of a complex body fluid. Taken together, we were able to identify a total of 1392 proteins in follicular fluid.
Collapse
Affiliation(s)
- Roland Lehmann
- Host Septomics Research Group, Jena University Hospital, Jena, Germany
| | - André Schmidt
- Department of Obstetrics, Placenta Lab, Jena University Hospital, Jena, Germany
| | - Jana Pastuschek
- Department of Obstetrics, Placenta Lab, Jena University Hospital, Jena, Germany
| | - Mario M Müller
- Host Septomics Research Group, Jena University Hospital, Jena, Germany
| | | | | | | | - Udo R Markert
- Department of Obstetrics, Placenta Lab, Jena University Hospital, Jena, Germany
| | - Hortense Slevogt
- Host Septomics Research Group, Jena University Hospital, Jena, Germany
| |
Collapse
|