1
|
On the utility of ultrafast MS1-only proteomics in drug target discovery studies based on thermal proteome profiling method. Anal Bioanal Chem 2024:10.1007/s00216-024-05330-9. [PMID: 38744720 DOI: 10.1007/s00216-024-05330-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2024] [Revised: 05/03/2024] [Accepted: 05/06/2024] [Indexed: 05/16/2024]
Abstract
Advances in high-throughput high-resolution mass spectrometry and the development of thermal proteome profiling approach (TPP) have made it possible to accelerate a drug target search. Since its introduction in 2014, TPP quickly became a method of choice in chemical proteomics for identifying drug-to-protein interactions on a proteome-wide scale and mapping the pathways of these interactions, thus further elucidating the unknown mechanisms of action of a drug under study. However, the current TPP implementations based on tandem mass spectrometry (MS/MS), associated with employing lengthy peptide separation protocols and expensive labeling techniques for sample multiplexing, limit the scaling of this approach for the ever growing variety of drug-to-proteomes. A variety of ultrafast proteomics methods have been developed in the last couple of years. Among them, DirectMS1 provides MS/MS-free quantitative proteome-wide analysis in 5-min time scale, thus opening the way for sample-hungry applications, such as TPP. In this work, we demonstrate the first implementation of the TPP approach using the ultrafast proteome-wide analysis based on DirectMS1. Using a drug topotecan, which is a known topoisomerase I (TOP1) inhibitor, the feasibility of the method for identifying drug targets at the whole proteome level was demonstrated for an ovarian cancer cell line.
Collapse
|
2
|
IQMMA: Efficient MS1 Intensity Extraction Pipeline Using Multiple Feature Detection Algorithms for DDA Proteomics. J Proteome Res 2023; 22:2827-2835. [PMID: 37579078 DOI: 10.1021/acs.jproteome.3c00075] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/16/2023]
Abstract
One of the key steps in data dependent acquisition (DDA) proteomics is detection of peptide isotopic clusters, also called "features", in MS1 spectra and matching them to MS/MS-based peptide identifications. A number of peptide feature detection tools became available in recent years, each relying on its own matching algorithm. Here, we provide an integrated solution, the intensity-based Quantitative Mix and Match Approach (IQMMA), which integrates a number of untargeted peptide feature detection algorithms and returns the most probable intensity values for the MS/MS-based identifications. IQMMA was tested using available proteomic data acquired for both well-characterized (ground truth) and real-world biological samples, including a mix of Yeast and E. coli digests spiked at different concentrations into the Human K562 digest used as a background, and a set of glioblastoma cell lines. Three open-source feature detection algorithms were integrated: Dinosaur, biosaur2, and OpenMS FeatureFinder. None of them was found optimal when applied individually to all the data sets employed in this work; however, their combined use in IQMMA improved efficiency of subsequent protein quantitation. The software implementing IQMMA is freely available at https://github.com/PostoenkoVI/IQMMA under Apache 2.0 license.
Collapse
|
3
|
Massive Proteogenomic Reanalysis of Publicly Available Proteomic Datasets of Human Tissues in Search for Protein Recoding via Adenosine-to-Inosine RNA Editing. J Proteome Res 2023. [PMID: 37158322 DOI: 10.1021/acs.jproteome.2c00740] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/10/2023]
Abstract
The proteogenomic search pipeline developed in this work has been applied for reanalysis of 40 publicly available shotgun proteomic datasets from various human tissues comprising more than 8000 individual LC-MS/MS runs, of which 5442 .raw data files were processed in total. This reanalysis was focused on searching for ADAR-mediated RNA editing events, their clustering across samples of different origins, and classification. In total, 33 recoded protein sites were identified in 21 datasets. Of those, 18 sites were detected in at least two datasets, representing the core human protein editome. In agreement with prior artworks, neural and cancer tissues were found to be enriched with recoded proteins. Quantitative analysis indicated that recoding the rate of specific sites did not directly depend on the levels of ADAR enzymes or targeted proteins themselves, rather it was governed by differential and yet undescribed regulation of interaction of enzymes with mRNA. Nine recoding sites conservative between humans and rodents were validated by targeted proteomics using stable isotope standards in the murine brain cortex and cerebellum, and an additional one was validated in human cerebrospinal fluid. In addition to previous data of the same type from cancer proteomes, we provide a comprehensive catalog of recoding events caused by ADAR RNA editing in the human proteome.
Collapse
|
4
|
Proteomics-based scoring of cellular response to stimuli for improved characterization of signaling pathway activity. Proteomics 2023; 23:e2200275. [PMID: 36478387 DOI: 10.1002/pmic.202200275] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2022] [Revised: 11/04/2022] [Accepted: 11/25/2022] [Indexed: 12/12/2022]
Abstract
Omics technologies focus on uncovering the complex nature of molecular mechanisms in cells and organisms, including biomarkers and drug targets discovery. Aiming at these tasks, we see that information extracted from omics data is still underused. In particular, characteristics of differentially regulated molecules can be combined in a single score to quantify the signaling pathway activity. Such a metric can be useful for comprehensive data interpretation to follow: (1) developing molecular responses in time; (2) potency of a drug on a certain cell culture; (3) ranking the signaling pathway activity in stimulated cells; and (4) integration of the omics data and assay-based measurements of cell viability, cytotoxicity, and proliferation. With recent advances in ultrafast mass spectrometry for quantitative proteomics allowing data collection in a few minutes, proteomics score for cellular response to stimuli can become a fast, accurate, and informative complement to bioassays. Here, we utilized an interquartile-based selection of differentially regulated features and a variety of schemes for quantifying cellular responses to come up with the quantitative metric for total cellular response and pathway activity. Validation was performed using antiproliferative and virus assays and label-free proteomics data collected for cancer cells subjected to drug stimulation.
Collapse
|
5
|
On the Feasibility of Using an Ultra-Fast DirectMS1 Method of Proteome-Wide Analysis for Searching Drug Targets in Chemical Proteomics. BIOCHEMISTRY. BIOKHIMIIA 2022; 87:1342-1353. [PMID: 36509723 DOI: 10.1134/s000629792211013x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Protein quantitation in tissue cells or physiological fluids based on liquid chromatography/mass spectrometry is one of the key sources of information on the mechanisms of cell functioning during chemotherapeutic treatment. Information on significant changes in protein expression upon treatment can be obtained by chemical proteomics and requires analysis of the cellular proteomes, as well as development of experimental and bioinformatic methods for identification of the drug targets. Low throughput of whole proteome analysis based on liquid chromatography and tandem mass spectrometry is one of the main factors limiting the scale of these studies. The method of direct mass spectrometric identification of proteins, DirectMS1, is one of the approaches developed in recent years allowing ultrafast proteome-wide analyses employing minute-scale gradients for separation of proteolytic mixtures. Aim of this work was evaluation of both possibilities and limitations of the method for identification of drug targets at the level of whole proteome and for revealing cellular processes activated by the treatment. Particularly, the available literature data on chemical proteomics obtained earlier for a large set of onco-pharmaceuticals using multiplex quantitative proteome profiling were analyzed. The results obtained were further compared with the proteome-wide data acquired by the DirectMS1 method using ultrashort separation gradients to evaluate efficiency of the method in identifying known drug targets. Using ovarian cancer cell line A2780 as an example, a whole-proteome comparison of two cell lysis techniques was performed, including the freeze-thaw lysis commonly employed in chemical proteomics and the one based on ultrasonication for cell disruption, which is the widely accepted as a standard in proteomic studies. Also, the proteome-wide profiling was performed using ultrafast DirectMS1 method for A2780 cell line treated with lonidamine, followed by gene ontology analyses to evaluate capabilities of the method in revealing regulation of proteins in the cellular processes associated with drug treatment.
Collapse
|
6
|
DirectMS1Quant: Ultrafast Quantitative Proteomics with MS/MS-Free Mass Spectrometry. Anal Chem 2022; 94:13068-13075. [PMID: 36094425 DOI: 10.1021/acs.analchem.2c02255] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Recently, we presented the DirectMS1 method of ultrafast proteome-wide analysis based on minute-long LC gradients and MS1-only mass spectra acquisition. Currently, the method provides the depth of human cell proteome coverage of 2500 proteins at a 1% false discovery rate (FDR) when using 5 min LC gradients and 7.3 min runtime in total. While the standard MS/MS approaches provide 4000-5000 protein identifications within a couple of hours of instrumentation time, we advocate here that the higher number of identified proteins does not always translate into better quantitation quality of the proteome analysis. To further elaborate on this issue, we performed a one-on-one comparison of quantitation results obtained using DirectMS1 with three popular MS/MS-based quantitation methods: label-free (LFQ) and tandem mass tag quantitation (TMT), both based on data-dependent acquisition (DDA) and data-independent acquisition (DIA). For comparison, we performed a series of proteome-wide analyses of well-characterized (ground truth) and biologically relevant samples, including a mix of UPS1 proteins spiked at different concentrations into an Echerichia coli digest used as a background and a set of glioblastoma cell lines. MS1-only data was analyzed using a novel quantitation workflow called DirectMS1Quant developed in this work. The results obtained in this study demonstrated comparable quantitation efficiency of 5 min DirectMS1 with both TMT and DIA methods, yet the latter two utilized a 10-20-fold longer instrumentation time.
Collapse
|
7
|
A Comprehensive Evaluation of Consensus Spectrum Generation Methods in Proteomics. J Proteome Res 2022; 21:1566-1574. [PMID: 35549218 PMCID: PMC9171829 DOI: 10.1021/acs.jproteome.2c00069] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
![]()
Spectrum clustering
is a powerful strategy to minimize redundant
mass spectra by grouping them based on similarity, with the aim of
forming groups of mass spectra from the same repeatedly measured analytes.
Each such group of near-identical spectra can be represented by its
so-called consensus spectrum for downstream processing. Although several
algorithms for spectrum clustering have been adequately benchmarked
and tested, the influence of the consensus spectrum generation step
is rarely evaluated. Here, we present an implementation and benchmark
of common consensus spectrum algorithms, including spectrum averaging,
spectrum binning, the most similar spectrum, and the best-identified
spectrum. We have analyzed diverse public data sets using two different
clustering algorithms (spectra-cluster and MaRaCluster) to evaluate
how the consensus spectrum generation procedure influences downstream
peptide identification. The BEST and BIN methods were found the most
reliable methods for consensus spectrum generation, including for
data sets with post-translational modifications (PTM) such as phosphorylation.
All source code and data of the present study are freely available
on GitHub at https://github.com/statisticalbiotechnology/representative-spectra-benchmark.
Collapse
|
8
|
Multi-Omics Analysis of Glioblastoma Cells' Sensitivity to Oncolytic Viruses. Cancers (Basel) 2021; 13:cancers13215268. [PMID: 34771433 PMCID: PMC8582528 DOI: 10.3390/cancers13215268] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2021] [Revised: 10/11/2021] [Accepted: 10/15/2021] [Indexed: 12/28/2022] Open
Abstract
Simple Summary This study aims to uncover the contribution of interferon-dependent antiviral mechanisms preserved in tumor cells to the resistance of glioblastoma multiforme cells to oncolytic viruses. To characterize the functionality of interferon signaling, we used omics profiling and titration-based measurements of cell sensitivity to a panel of viruses of diverse oncolytic potential. This study shows why patient-derived glioblastoma cultures can acquire increased resistance to oncolytic viruses in the presence of interferons and suggests an approach to ranking glioblastoma cells by the acquired resistance. Our findings are important for monitoring the oncolytic potential of viruses to overcome IFN-induced resistance of tumor cells and contribute to successful therapy. Abstract Oncolytic viruses have gained momentum in the last decades as a promising tool for cancer treatment. Despite the progress, only a fraction of patients show a positive response to viral therapy. One of the key variable factors contributing to therapy outcomes is interferon-dependent antiviral mechanisms in tumor cells. Here, we evaluated this factor using patient-derived glioblastoma multiforme (GBM) cultures. Cell response to the type I interferons’ (IFNs) stimulation was characterized at mRNA and protein levels. Omics analysis revealed that GBM cells overexpress interferon-stimulated genes (ISGs) and upregulate their proteins, similar to the normal cells. A conserved molecular pattern unambiguously differentiates between the preserved and defective responses. Comparing ISGs’ portraits with titration-based measurements of cell sensitivity to a panel of viruses, the “strength” of IFN-induced resistance acquired by GBM cells was ranked. The study demonstrates that suppressing a single ISG and encoding an essential antiviral protein, does not necessarily increase sensitivity to viruses. Conversely, silencing IFIT3 and PLSCR1 genes in tumor cells can negatively affect the internalization of vesicular stomatitis and Newcastle disease viruses. We present evidence of a complex relationship between the interferon response genes and other factors affecting the sensitivity of tumor cells to viruses.
Collapse
|
9
|
AA_stat: Intelligent profiling of in vivo and in vitro modifications from open search results. J Proteomics 2021; 248:104350. [PMID: 34389500 DOI: 10.1016/j.jprot.2021.104350] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2021] [Revised: 07/21/2021] [Accepted: 08/04/2021] [Indexed: 11/18/2022]
Abstract
Characterization of post-translational modifications is among the most challenging tasks in tandem mass spectrometry-based proteomics which has yet to find an efficient solution. The ultra-tolerant (open) database search attempts to meet this challenge. However, interpretation of the mass shifts observed in open search still requires an effective and automated solution. We have previously introduced the AA_stat tool for analysis of amino acid frequencies at different mass shifts and generation of hypotheses on unaccounted in vitro modifications. Here, we report on the new version of AA_stat, which now complements amino acid frequency statistics with a number of new features: (1) MS/MS-based localization of mass shifts and localization scoring, including shifts which are the sum of modifications; (2) inferring fixed modifications to increase method sensitivity; (3) inferring monoisotopic peak assignment errors and variable modifications based on abundant mass shift localizations to increase the yield of closed search; (4) new mass calibration algorithm to account for partial systematic shifts; (5) interactive integration of all results and a rated list of possible mass shift interpretations. With these options, we improve interpretation of open search results and demonstrate the utility of AA_stat for profiling of abundant and rare amino acid modifications. AA_stat is implemented in Python as an open-source command-line tool available at https://github.com/SimpleNumber/aa_stat. SIGNIFICANCE: Mass spectrometry-based PTM characterization has a long history, yet most of the methods rely on a priori knowledge of modifications of interest and do not provide a whole proteome modification landscape in a blind manner. The open database search is an efficient attempt to address this challenge by identifying peptides with mass shifts corresponding to possible modifications. Then, interpreting these mass shifts is required. Therefore, development of bioinformatics software for post-processing of the open search results, which is capable of detection and accurate annotation of new or unexpected modifications, from characterization of sample preparation efficiency and quality control to discovery of rare post-translational modifications, is of high importance.
Collapse
|
10
|
Improving the Protein Inference from Bottom-Up Proteomic Data Using Identifications from MS1 Spectra. JOURNAL OF THE AMERICAN SOCIETY FOR MASS SPECTROMETRY 2021; 32:1258-1262. [PMID: 33900766 DOI: 10.1021/jasms.1c00061] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
Protein inference is one of the crucial steps in proteome characterization using a bottom-up approach. Multiple algorithms to solve the problem are focused on extensive analysis of shared peptides identified from fragmentation mass spectra (MS/MS). However, many protein homologues with a similar amino acid sequence typically have identical lists of identified peptides due to the problem of proteome undersampling in a bottom-up approach and, thus, cannot be distinguished by existing protein inference methods. Here, we propose the use of peptide feature information extracted from precursor mass spectra to assist in identification of proteins otherwise indistinguishable from MS/MS. The proposed method was integrated with a protein inference algorithm based on the parsimony principle and built-in in the postsearch utility Scavager. The results demonstrate increasing accuracy and efficiency of homologous protein identifications for the well characterized data sets including the one with known protein sequences from iPRG-2016 study.
Collapse
|
11
|
Boosting MS1-only Proteomics with Machine Learning Allows 2000 Protein Identifications in Single-Shot Human Proteome Analysis Using 5 min HPLC Gradient. J Proteome Res 2021; 20:1864-1873. [PMID: 33720732 DOI: 10.1021/acs.jproteome.0c00863] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
Abstract
Proteome-wide analyses rely on tandem mass spectrometry and the extensive separation of proteolytic mixtures. This imposes considerable instrumental time consumption, which is one of the main obstacles in the broader acceptance of proteomics in biomedical and clinical research. Recently, we presented a fast proteomic method termed DirectMS1 based on ultrashort LC gradients as well as MS1-only mass spectra acquisition and data processing. The method allows significant reduction of the proteome-wide analysis time to a few minutes at the depth of quantitative proteome coverage of 1000 proteins at 1% false discovery rate (FDR). In this work, to further increase the capabilities of the DirectMS1 method, we explored the opportunities presented by the recent progress in the machine-learning area and applied the LightGBM decision tree boosting algorithm to the scoring of peptide feature matches when processing MS1 spectra. Furthermore, we integrated the peptide feature identification algorithm of DirectMS1 with the recently introduced peptide retention time prediction utility, DeepLC. Additional approaches to improve the performance of the DirectMS1 method are discussed and demonstrated, such as using FAIMS for gas-phase ion separation. As a result of all improvements to DirectMS1, we succeeded in identifying more than 2000 proteins at 1% FDR from the HeLa cell line in a 5 min gradient LC-FAIMS/MS1 analysis. The data sets generated and analyzed during the current study have been deposited to the ProteomeXchange Consortium via the PRIDE partner repository with the data set identifier PXD023977.
Collapse
|
12
|
Biosaur: An open-source Python software for liquid chromatography-mass spectrometry peptide feature detection with ion mobility support. RAPID COMMUNICATIONS IN MASS SPECTROMETRY : RCM 2021:e9045. [PMID: 33450063 DOI: 10.1002/rcm.9045] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/01/2020] [Revised: 12/20/2020] [Accepted: 01/04/2021] [Indexed: 06/12/2023]
Abstract
RATIONALE One of the important steps in initial data processing of peptide mass spectra is the detection of peptide features in full-range mass spectra. Ion mobility offers advantages over previous methods performing this detection by providing an additional structure-specific separation dimension. However, there is a lack of open-source software that utilizes these advantages and detects peptide features in mass spectra acquired along with ion mobility data using new instruments such as timsTOF and/or FAIMS-Orbitrap. METHODS Recently, a utility called Dinosaur was presented, which provides an efficient way for feature detection in peptide ion mass spectra. In this work we extended its functionality by developing Biosaur software to fully employ the additional information provided by ion mobility data. Biosaur was developed using the Python 3.8 programming language. RESULTS Biosaur supports the processing of data acquired using mass spectrometers with ion mobility capabilities, specifically timsTOF and FAIMS. In addition, it processes mass spectra obtained in negative ion mode and reports cosine correlation table for peptide features which is useful for differentiation between in-source fragments and semi-tryptic peptides. CONCLUSIONS Biosaur is a utility for detecting peptide features in liquid chromatography-mass spectra with ion mobility and negative ion supports. The software is distributed with an open-source APACHE 2.0 license and is freely available on Github: https://github.com/abdrakhimov1/Biosaur.
Collapse
|
13
|
Cysteine alkylation methods in shotgun proteomics and their possible effects on methionine residues. J Proteomics 2020; 231:104022. [PMID: 33096305 DOI: 10.1016/j.jprot.2020.104022] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2020] [Revised: 10/06/2020] [Accepted: 10/15/2020] [Indexed: 01/06/2023]
Abstract
In order to optimize sample preparation for shotgun proteomics, we compared four cysteine alkylating agents: iodoacetamide, chloroacetamide, 4-vinylpyridine and methyl methanethiosulfonate, and estimated their effects on the results of proteome analysis. Because alkylation may result in methionine modification in vitro, proteomics data were searched for methionine to isothreonine conversions, which may mimic genomic methionine to threonine substitutions found in proteogenomic analyses. We found that chloroacetamide was superior to the other reagents in terms of the number of identified peptides and undesirable off-site reactions. Among the reagents evaluated, iodoacetamide increased the rate of methionine-to-isothreonine conversion, especially if the sample was prepared in gel. The presence of proline following methionine in a protein sequence increased the modification rate as well. Generally, the methionine-to-isothreonine conversion events were relatively rare, but should be taken into account in proteogenomic studies when searching for single nucleotide polymorphism events at the protein level. Additionally, we have evaluated other methionine modifications, such as oxidation and carbamidomethylation. We found that carbamidomethylation may affect up to 80% of peptides containing methionine under the condition of iodoacetamide alkylation. In this case, carbamidomethylation of methionine is more common than oxidation and should be accounted for as a variable modification during proteomic search. SIGNIFICANCE: One of the most trending questions in bottom-up proteomics is the depth of proteome profiling, in other words, the coverage of proteins by identified tryptic peptides. In proteogenomics, where the identification of a single peptide, e.g. bearing an amino acid substitution, may be of interest, high sequence coverage is especially important. Chemical modifications during sample preparation may mimic biologically significant coding mutations at the proteome level. A typical example of such modification is methionine to isothreonine conversion during alkylation, which mimics methionine to threonine substitution in protein sequences due to respective genomic mutations. Therefore, the studies on the proper selection of alkylating reagents which balance the cysteine alkylation efficiency and the extent of methionine conversion upon conventional proteomic sample preparation workflow are crucial for the outcome of proteogenomic analyses and should present a general interest for the proteomic community.
Collapse
|
14
|
PhosphoShield: Improving Trypsin Digestion of Phosphoproteins by Shielding the Negatively Charged Phosphate Moiety. JOURNAL OF THE AMERICAN SOCIETY FOR MASS SPECTROMETRY 2020; 31:2053-2060. [PMID: 32840367 DOI: 10.1021/jasms.0c00171] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
Protein phosphorylation is a post-translational modification that is essential to cellular signaling, cellular function, and associated disease progression. Bottom-up proteomics based on enzymatic digestion is the most widely used approach for identifying and quantifying phosphoproteins in complex biological samples. Researchers have largely optimized the experimental conditions for trypsin digestion, and it is now a routine procedure. However, trypsin digestion is impaired by the presence of phosphorylated residues in the protein sequence. This impairment arises from the fact that there are commonly salt bridges between a negatively charged phosphate group and the side chain of protonated arginine or lysine. On average, 55% of all phosphopeptides have their phosphosites located less than three amino acid residues from a cleavage site. Salt bridges reduce the cleavage accessibility for trypsin by masking the basic site chain groups of arginine and lysine. Thus, there are frequent missed cleavages in the vicinity of phosphorylation sites, thereby lessening both the depth of proteome coverage and the quantification accuracy of phosphoproteomics. In this work, we propose a method termed PhosphoShield to mitigate salt bridge formation by adding a digallium complex that exhibits a high binding affinity to the phosphate group. We tested our method using quantitative mass spectrometry analysis of the phosphoproteome of human liver cancer cells (HepG2). PhosphoShield enhances the cleavage frequency of at least 17% of tryptic phosphopeptides having cleavage sites close to the phosphate group.
Collapse
|
15
|
HUMOS: How to Understand My Orbitrap Spectrum?-An Interactive Web-Based Tool to Teach the Basics of Mass-Spectrometry-Based Proteomics. J Proteome Res 2020; 19:3910-3918. [PMID: 32813527 DOI: 10.1021/acs.jproteome.0c00395] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
The Orbitrap mass analyzer can provide high mass accuracy and throughput, which has significantly improved proteome research and made this type of instrumentation one of the most frequently applied in proteomics. The efficient use of Orbitrap mass spectrometers requires training. Students in the field of proteomics can benefit from a deeper understanding of the Orbitrap technology to comprehend mass spectral interpretation, troubleshooting, and judgment of experimental settings. Unfortunately, the cost of high-end mass spectrometers limits the implementation of this type of equipment in educational laboratories. Guided by these concerns, we developed an eLearning web application called HUMOS aimed to help teach Orbitrap mass spectrometry. Although a typical proteomics experiment includes the use of several different technologies, such as liquid chromatography, mass spectrometry, and bioinformatics, the learning objectives of HUMOS are focused on mass spectrometry. HUMOS models a mass spectrum of a peptide mixture, allowing us to investigate the influence of mass spectral acquisition parameters. By changing the parameters and observing the differences, students can learn more about the mass spectral resolution, duty cycle, throughput of the analysis, ion accumulation, and spectral dynamic range and get familiar with advanced spectral acquisition methods, such as BoxCar. HUMOS is an open-source software published under the Apache license; the live installation is available at http://humos.bmb.sdu.dk.
Collapse
|
16
|
DirectMS1: MS/MS-Free Identification of 1000 Proteins of Cellular Proteomes in 5 Minutes. Anal Chem 2020; 92:4326-4333. [DOI: 10.1021/acs.analchem.9b05095] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]
|
17
|
Protein corona formed on silver nanoparticles in blood plasma is highly selective and resistant to physicochemical changes of the solution. ENVIRONMENTAL SCIENCE. NANO 2019; 6:1089-1098. [PMID: 31304020 PMCID: PMC6592156 DOI: 10.1039/c8en01054d] [Citation(s) in RCA: 31] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/21/2018] [Accepted: 01/22/2019] [Indexed: 05/18/2023]
Abstract
Nanoparticles (NPs) in contact with protein-containing media such as biological fluids rapidly acquire a surface layer of proteins, known as the protein corona. The protein composition and structural properties of the protein corona are crucial for NP interactions with living cells. Although much has been learned about the protein corona phenomenon, further elucidation could benefit from extensive quantitative proteomics analysis. Herein we report a comprehensive quantitative characterization (>350 proteins) of the corona that formed on 60 nm silver NPs via interaction with human blood plasma, as a function of pH and temperature. By varying the pH and temperature one can access different conformational spaces and charge localizations of the plasma proteins, which in turn provide knowledge pertinent to how the proteome corresponds to binding affinity. Thirty-eight percent of the quantified proteins bind at all temperatures, 47% at all pH values, and of these most persistent proteins, approximately 60% do not significantly change in abundance within the protein corona. Evaluation of 544 protein properties (present in the Kyoto databank) suggests that binding of these proteins to NPs is determined by the extent of hydrophobicity, β-sheet propensity, α-helical structure (and turns), and amino acid composition. Protein binding is promoted by a larger amount of β-sheets, higher hydrophobicity, and a smaller amount of α-helices. Our work enhances researchers' knowledge of a long-standing, vexing aspect of the nano-bio interface.
Collapse
|
18
|
Scavager: A Versatile Postsearch Validation Algorithm for Shotgun Proteomics Based on Gradient Boosting. Proteomics 2018; 19:e1800280. [DOI: 10.1002/pmic.201800280] [Citation(s) in RCA: 30] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2018] [Revised: 11/30/2018] [Indexed: 11/06/2022]
|
19
|
Validation of Peptide Identification Results in Proteomics Using Amino Acid Counting. Proteomics 2018; 18:e1800117. [PMID: 30307114 DOI: 10.1002/pmic.201800117] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2018] [Revised: 09/12/2018] [Indexed: 01/11/2023]
Abstract
The efficiency of proteome analysis depends strongly on the configuration parameters of the search engine. One of the murkiest and nontrivial among them is the list of amino acid modifications included for the search. Here, an approach called AA_stat is presented for uncovering the unexpected modifications of amino acid residues in the protein sequences, as well as possible artifacts of data acquisition or processing, in the results of proteome analyses. The approach is based on comparing the amino acid frequencies of different mass shifts observed using the open search method introduced recently. In this work, the proposed approach is applied to publicly available proteomic data is applied and its feasibility for discovering unaccounted modifications or possible pitfalls of the identification workflow is demonstrated. The algorithm is implemented in Python as an open-source command-line tool available at https://bitbucket.org/J_Bale/aa_stat/.
Collapse
|
20
|
Abstract
We present an open-source, extensible search engine for shotgun proteomics. Implemented in Python programming language, IdentiPy shows competitive processing speed and sensitivity compared with the state-of-the-art search engines. It is equipped with a user-friendly web interface, IdentiPy Server, enabling the use of a single server installation accessed from multiple workstations. Using a simplified version of X!Tandem scoring algorithm and its novel "autotune" feature, IdentiPy outperforms the popular alternatives on high-resolution data sets. Autotune adjusts the search parameters for the particular data set, resulting in improved search efficiency and simplifying the user experience. IdentiPy with the autotune feature shows higher sensitivity compared with the evaluated search engines. IdentiPy Server has built-in postprocessing and protein inference procedures and provides graphic visualization of the statistical properties of the data set and the search results. It is open-source and can be freely extended to use third-party scoring functions or processing algorithms and allows customization of the search workflow for specialized applications.
Collapse
|
21
|
Comparative proteomics as a tool for identifying specific alterations within interferon response pathways in human glioblastoma multiforme cells. Oncotarget 2018; 9:1785-1802. [PMID: 29416731 PMCID: PMC5788599 DOI: 10.18632/oncotarget.22751] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2017] [Accepted: 10/27/2017] [Indexed: 12/13/2022] Open
Abstract
An acquisition of increased sensitivity of cancer cells to viruses is a common outcome of malignant progression that justifies the development of oncolytic viruses as anticancer therapeutics. Studying molecular changes that underlie the sensitivity to viruses would help to identify cases where oncolytic virus therapy would be most effective. We quantified changes in protein abundances in two glioblastoma multiforme (GBM) cell lines that differ in the ability to induce resistance to vesicular stomatitis virus (VSV) infection in response to type I interferon (IFN) treatment. In IFN-treated samples we observed an up-regulation of protein products of some IFN-regulated genes (IRGs). In total, the proteome analysis revealed up to 20% more proteins encoded by IRGs in the glioblastoma cell line, which develops resistance to VSV infection after pre-treatment with IFN. In both cell lines protein-protein interaction and signaling pathway analyses have revealed a significant stimulation of processes related to type I IFN signaling and defense responses to viruses. However, we observed a deficiency in STAT2 protein in the VSV-sensitive cell line that suggests a de-regulation of the JAK/STAT/IRF9 signaling. The study has shown that the up-regulation of IRG proteins induced by the IFNα treatment of GBM cells can be detected at the proteome level. Similar analyses could be applied for revealing functional alterations within the antiviral mechanisms in glioblastoma samples, accompanying by acquisition of sensitivity to oncolytic viruses. The approach can be useful for discovering the biomarkers that predict a potential sensitivity of individual glioblastoma tumors to oncolytic virus therapy.
Collapse
|
22
|
MS/MS-Free Protein Identification in Complex Mixtures Using Multiple Enzymes with Complementary Specificity. J Proteome Res 2017; 16:3989-3999. [DOI: 10.1021/acs.jproteome.7b00365] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022]
|
23
|
Comparative evaluation of label-free quantification methods for shotgun proteomics. RAPID COMMUNICATIONS IN MASS SPECTROMETRY : RCM 2017; 31:606-612. [PMID: 28097710 DOI: 10.1002/rcm.7829] [Citation(s) in RCA: 31] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/18/2016] [Revised: 12/24/2016] [Accepted: 01/15/2017] [Indexed: 06/06/2023]
Abstract
RATIONALE Label-free quantification (LFQ) is a popular strategy for shotgun proteomics. A variety of LFQ algorithms have been developed recently. However, a comprehensive comparison of the most commonly used LFQ methods is still rare, in part due to a lack of clear metrics for their evaluation and an annotated and quantitatively well-characterized data set. METHODS Five LFQ methods were compared: spectral counting based algorithms SIN , emPAI, and NSAF, and approaches relying on the extracted ion chromatogram (XIC) intensities, MaxLFQ and Quanti. We used three criteria for performance evaluation: coefficient of variation (CV) of protein abundances between replicates; analysis of variance (ANOVA); and the root-mean-square error of logarithmized calculated concentration ratios, referred to as standard quantification error (SQE). Comparison was performed using a quantitatively annotated publicly available data set. RESULTS The best results in terms of inter-replicate reproducibility were observed for MaxLFQ and NSAF, although they exhibited larger standard quantification errors. Using NSAF, all quantitatively annotated proteins were correctly identified in the Bonferronni-corrected results of the ANOVA test. SIN was found to be the most accurate in terms of SQE. Finally, the current implementations of XIC-based LFQ methods did not outperform the methods based on spectral counting for the data set used in this study. CONCLUSIONS Surprisingly, the performances of XIC-based approaches measured using three independent metrics were found to be comparable with more straightforward and simple MS/MS-based spectral counting approaches. The study revealed no clear leader among the latter. Copyright © 2017 John Wiley & Sons, Ltd.
Collapse
|
24
|
Temporal manipulation of ejaculate components by newly fertilized Drosophila melanogaster females. Anim Behav 1998; 55:1637-45. [PMID: 9642007 DOI: 10.1006/anbe.1997.0723] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Observations of newly mated Drosophila melanogaster females suggest that females control the times at which components of the ejaculate cause behavioural and physiological changes characteristic of fertilized females. Females that were assayed immediately after they mated elicited as much courtship as they did when they were virgins, but were unreceptive to copulation. Within a few minutes of when they disengaged from copulation, most females performed ovipositor extrusion, which has been classified as a rejection behaviour, in response to courting males or males that had previously performed courtship. Most females that were assayed immediately after mating had already ovulated. The females, however, do not begin to lay eggs until 4-6 h after mating, at which time they elicit very little courtship (Scott & Richmond 1985, Anim. Behav., 33, 817-824). Our observations suggest that neither ovipositor extrusion nor male-synthesized pheromones that are transferred to females' cuticles during copulation inhibit males' courtship of newly fertilized females. Thus, males cannot determine that newly fertilized females are unreceptive to copulation. These observations also indicate that the failure of newly fertilized females to mate with males is not a consequence of the females' inability to elicit vigorous courtship. Copyright 1998 The Association for the Study of Animal Behaviour. Copyright 1998 The Association for the Study of Animal Behaviour.
Collapse
|