1
|
von Gerichten J, Saunders K, Bailey MJ, Gethings LA, Onoja A, Geifman N, Spick M. Challenges in Lipidomics Biomarker Identification: Avoiding the Pitfalls and Improving Reproducibility. Metabolites 2024; 14:461. [PMID: 39195557 DOI: 10.3390/metabo14080461] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2024] [Revised: 08/14/2024] [Accepted: 08/15/2024] [Indexed: 08/29/2024] Open
Abstract
Identification of features with high levels of confidence in liquid chromatography-mass spectrometry (LC-MS) lipidomics research is an essential part of biomarker discovery, but existing software platforms can give inconsistent results, even from identical spectral data. This poses a clear challenge for reproducibility in biomarker identification. In this work, we illustrate the reproducibility gap for two open-access lipidomics platforms, MS DIAL and Lipostar, finding just 14.0% identification agreement when analyzing identical LC-MS spectra using default settings. Whilst the software platforms performed more consistently using fragmentation data, agreement was still only 36.1% for MS2 spectra. This highlights the critical importance of validation across positive and negative LC-MS modes, as well as the manual curation of spectra and lipidomics software outputs, in order to reduce identification errors caused by closely related lipids and co-elution issues. This curation process can be supplemented by data-driven outlier detection in assessing spectral outputs, which is demonstrated here using a novel machine learning approach based on support vector machine regression combined with leave-one-out cross-validation. These steps are essential to reduce the frequency of false positive identifications and close the reproducibility gap, including between software platforms, which, for downstream users such as bioinformaticians and clinicians, can be an underappreciated source of biomarker identification errors.
Collapse
Affiliation(s)
- Johanna von Gerichten
- School of Chemistry and Chemical Engineering, Faculty of Engineering and Physical Sciences, University of Surrey, Guildford GU2 7XH, UK
| | - Kyle Saunders
- School of Chemistry and Chemical Engineering, Faculty of Engineering and Physical Sciences, University of Surrey, Guildford GU2 7XH, UK
| | - Melanie J Bailey
- School of Chemistry and Chemical Engineering, Faculty of Engineering and Physical Sciences, University of Surrey, Guildford GU2 7XH, UK
| | | | - Anthony Onoja
- School of Health Sciences, Faculty of Health and Medical Sciences, University of Surrey, Guildford GU2 7XH, UK
| | - Nophar Geifman
- School of Health Sciences, Faculty of Health and Medical Sciences, University of Surrey, Guildford GU2 7XH, UK
| | - Matt Spick
- School of Health Sciences, Faculty of Health and Medical Sciences, University of Surrey, Guildford GU2 7XH, UK
| |
Collapse
|
2
|
Kalhor M, Lapin J, Picciani M, Wilhelm M. Rescoring Peptide Spectrum Matches: Boosting Proteomics Performance by Integrating Peptide Property Predictors Into Peptide Identification. Mol Cell Proteomics 2024; 23:100798. [PMID: 38871251 PMCID: PMC11269915 DOI: 10.1016/j.mcpro.2024.100798] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2024] [Revised: 05/26/2024] [Accepted: 06/09/2024] [Indexed: 06/15/2024] Open
Abstract
Rescoring of peptide spectrum matches originating from database search engines enabled by peptide property predictors is exceeding the performance of peptide identification from traditional database search engines. In contrast to the peptide spectrum match scores calculated by traditional database search engines, rescoring peptide spectrum matches generates scores based on comparing observed and predicted peptide properties, such as fragment ion intensities and retention times. These newly generated scores enable a more efficient discrimination between correct and incorrect peptide spectrum matches. This approach was shown to lead to substantial improvements in the number of confidently identified peptides, facilitating the analysis of challenging datasets in various fields such as immunopeptidomics, metaproteomics, proteogenomics, and single-cell proteomics. In this review, we summarize the key elements leading up to the recent introduction of multiple data-driven rescoring pipelines. We provide an overview of relevant post-processing rescoring tools, introduce prominent data-driven rescoring pipelines for various applications, and highlight limitations, opportunities, and future perspectives of this approach and its impact on mass spectrometry-based proteomics.
Collapse
Affiliation(s)
- Mostafa Kalhor
- Computational Mass Spectrometry, TUM School of Life Sciences, Technical University of Munich, Freising, Germany
| | - Joel Lapin
- Computational Mass Spectrometry, TUM School of Life Sciences, Technical University of Munich, Freising, Germany
| | - Mario Picciani
- Computational Mass Spectrometry, TUM School of Life Sciences, Technical University of Munich, Freising, Germany
| | - Mathias Wilhelm
- Computational Mass Spectrometry, TUM School of Life Sciences, Technical University of Munich, Freising, Germany; Munich Data Science Institute, Technical University of Munich, Garching, Germany.
| |
Collapse
|
3
|
Lu H, Wang B, Liu Y, Wang D, Fields L, Zhang H, Li M, Shi X, Zetterberg H, Li L. DiLeu Isobaric Labeling Coupled with Limited Proteolysis Mass Spectrometry for High-Throughput Profiling of Protein Structural Changes in Alzheimer's Disease. Anal Chem 2023; 95:9746-9753. [PMID: 37307028 PMCID: PMC10330787 DOI: 10.1021/acs.analchem.2c05731] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
High-throughput quantitative analysis of protein conformational changes has a profound impact on our understanding of the pathological mechanisms of Alzheimer's disease (AD). To establish an effective workflow enabling quantitative analysis of changes in protein conformation within multiple samples simultaneously, here we report the combination of N,N-dimethyl leucine (DiLeu) isobaric tag labeling with limited proteolysis mass spectrometry (DiLeu-LiP-MS) for high-throughput structural protein quantitation in serum samples collected from AD patients and control donors. Twenty-three proteins were discovered to undergo structural changes, mapping to 35 unique conformotypic peptides with significant changes between the AD group and the control group. Seven out of 23 proteins, including CO3, CO9, C4BPA, APOA1, APOA4, C1R, and APOA, exhibited a potential correlation with AD. Moreover, we found that complement proteins (e.g., CO3, CO9, and C4BPA) related to AD exhibited elevated levels in the AD group compared to those in the control group. These results provide evidence that the established DiLeu-LiP-MS method can be used for high-throughput structural protein quantitation, which also showed great potential in achieving large-scale and in-depth quantitative analysis of protein conformational changes in other biological systems.
Collapse
Affiliation(s)
- Haiyan Lu
- School of Pharmacy, University of Wisconsin-Madison, Madison, WI, 53705, USA
| | - Bin Wang
- School of Pharmacy, University of Wisconsin-Madison, Madison, WI, 53705, USA
| | - Yuan Liu
- School of Pharmacy, University of Wisconsin-Madison, Madison, WI, 53705, USA
| | - Danqing Wang
- Department of Chemistry, University of Wisconsin-Madison, Madison, WI, 53706, USA
| | - Lauren Fields
- Department of Chemistry, University of Wisconsin-Madison, Madison, WI, 53706, USA
| | - Hua Zhang
- School of Pharmacy, University of Wisconsin-Madison, Madison, WI, 53705, USA
| | - Miyang Li
- Department of Chemistry, University of Wisconsin-Madison, Madison, WI, 53706, USA
| | - Xudong Shi
- Division of Otolaryngology, Department of Surgery, School of Medicine and Public Health, University of Wisconsin-Madison, Madison, WI, 53792, USA
| | - Henrik Zetterberg
- Institute of Neuroscience and Physiology, Sahlgrenska Academy, University of Gothenburg, 43141, Gothenburg, Sweden
- Clinical Neurochemistry Laboratory, Sahlgrenska University Hospital, Mölndal, 43130, Sweden
- Department of Molecular Neuroscience, UCL Institute of Neurology, London, WC1N 3BG, UK
- UK Dementia Research Institute at UCL, London, WC1N 3BG, UK
- Hong Kong Center for Neurodegenerative Diseases, Clear Water Bay, Hong Kong, 999077, China
| | - Lingjun Li
- School of Pharmacy, University of Wisconsin-Madison, Madison, WI, 53705, USA
- Department of Chemistry, University of Wisconsin-Madison, Madison, WI, 53706, USA
- Lachman Institute for Pharmaceutical Development, School of Pharmacy, University of Wisconsin-Madison, Madison, WI 53705, USA
| |
Collapse
|
4
|
Hollebrands B, Hageman JA, van de Sande JW, Albada B, Janssen HG. Improved LC-MS identification of short homologous peptides using sequence-specific retention time predictors. Anal Bioanal Chem 2023; 415:2715-2726. [PMID: 37000211 PMCID: PMC10185643 DOI: 10.1007/s00216-023-04670-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2022] [Revised: 03/17/2023] [Accepted: 03/21/2023] [Indexed: 04/01/2023]
Abstract
Peptides are an important group of compounds contributing to the desired, as well as the undesired taste of a food product. Their taste impressions can include aspects of sweetness, bitterness, savoury, umami and many other impressions depending on the amino acids present as well as their sequence. Identification of short peptides in foods is challenging. We developed a method to assign identities to short peptides including homologous structures, i.e. peptides containing the same amino acids with a different sequence order, by accurate prediction of the retention times during reversed phase separation. To train the method, a large set of well-defined short peptides with systematic variations in the amino acid sequence was prepared by a novel synthesis strategy called 'swapped-sequence synthesis'. Additionally, several proteins were enzymatically digested to yield short peptides. Experimental retention times were determined after reversed phase separation and peptide MS2 data was acquired using a high-resolution mass spectrometer operated in data-dependent acquisition mode (DDA). A support vector regression model was trained using a combination of existing sequence-independent peptide descriptors and a newly derived set of selected amino acid index derived sequence-specific peptide (ASP) descriptors. The model was trained and validated using the experimental retention times of the 713 small food-relevant peptides prepared. Whilst selecting the most useful ASP descriptors for our model, special attention was given to predict the retention time differences between homologous peptide structures. Inclusion of ASP descriptors greatly improved the ability to accurately predict retention times, including retention time differences between 157 homologous peptide pairs. The final prediction model had a goodness-of-fit (Q2) of 0.94; moreover for 93% of the short peptides, the elution order was correctly predicted.
Collapse
Affiliation(s)
- Boudewijn Hollebrands
- Unilever Foods Innovation Centre - Hive, Bronland 14, 6708 WH, Wageningen, the Netherlands.
- Laboratory of Organic Chemistry, Wageningen University & Research, Stippeneng 4, 6708 WE, Wageningen, the Netherlands.
| | - Jos A Hageman
- Wageningen University & Research, Biometris, P.O. Box 16, 6700 AA, Wageningen, the Netherlands
| | - Jasper W van de Sande
- Laboratory of Organic Chemistry, Wageningen University & Research, Stippeneng 4, 6708 WE, Wageningen, the Netherlands
| | - Bauke Albada
- Laboratory of Organic Chemistry, Wageningen University & Research, Stippeneng 4, 6708 WE, Wageningen, the Netherlands
| | - Hans-Gerd Janssen
- Unilever Foods Innovation Centre - Hive, Bronland 14, 6708 WH, Wageningen, the Netherlands
- Laboratory of Organic Chemistry, Wageningen University & Research, Stippeneng 4, 6708 WE, Wageningen, the Netherlands
| |
Collapse
|
5
|
Prediction of surface excess adsorption and retention factors in reversed-phase liquid chromatography from molecular dynamics simulations. J Chromatogr A 2022; 1685:463627. [DOI: 10.1016/j.chroma.2022.463627] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2022] [Revised: 10/27/2022] [Accepted: 10/29/2022] [Indexed: 11/06/2022]
|
6
|
A Comprehensive Study of Gradient Conditions for Deep Proteome Discovery in a Complex Protein Matrix. Int J Mol Sci 2022; 23:ijms231911714. [PMID: 36233016 PMCID: PMC9569591 DOI: 10.3390/ijms231911714] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2022] [Revised: 09/26/2022] [Accepted: 09/27/2022] [Indexed: 11/28/2022] Open
Abstract
Bottom–up mass-spectrometry-based proteomics is a well-developed technology based on complex peptide mixtures from proteolytic cleavage of proteins and is widely applied in protein identification, characterization, and quantitation. A tims-ToF mass spectrometer is an excellent platform for bottom–up proteomics studies due to its rapid acquisition with high sensitivity. It remains challenging for bottom–up proteomics approaches to achieve 100% proteome coverage. Liquid chromatography (LC) is commonly used prior to mass spectrometry (MS) analysis to fractionate peptide mixtures, and the LC gradient can affect the peptide fractionation and proteome coverage. We investigated the effects of gradient type and time duration to find optimal gradient conditions. Five gradient types (linear, logarithm-like, exponent-like, stepwise, and step-linear), three different gradient lengths (22 min, 44 min, and 66 min), two sample loading amounts (100 ng and 200 ng), and two loading conditions (the use of trap column and no trap column) were studied. The effect of these chromatography variables on protein groups, peptides, and spectral counts using HeLa cell digests was explored. The results indicate that (1) a step-linear gradient performs best among the five gradient types studied; (2) the optimal gradient duration depends on protein sample loading amount; (3) the use of a trap column helps to enhance protein identification, especially low-abundance proteins; (4) MSFragger and PEAKS Studio have high similarity in protein group identification; (5) MSFragger identified more protein groups among the different gradient conditions compared to PEAKS Studio; and (6) combining results from both database search engines can expand identified protein groups by 9–11%.
Collapse
|
7
|
Borkar MR, Coutinho E. Amalgamation of comparative protein modeling with quantitative structure-retention relationship for prediction of the chromatographic behavior of peptides. J Chromatogr A 2022; 1669:462967. [DOI: 10.1016/j.chroma.2022.462967] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2021] [Revised: 03/09/2022] [Accepted: 03/11/2022] [Indexed: 10/18/2022]
|
8
|
Liapikos T, Zisi C, Kodra D, Kademoglou K, Diamantidou D, Begou O, Pappa-Louisi A, Theodoridis G. Quantitative Structure Retention Relationship (QSRR) Modelling for Analytes’ Retention Prediction in LC-HRMS by Applying Different Machine Learning Algorithms and Evaluating Their Performance. J Chromatogr B Analyt Technol Biomed Life Sci 2022; 1191:123132. [DOI: 10.1016/j.jchromb.2022.123132] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2021] [Revised: 01/12/2022] [Accepted: 01/16/2022] [Indexed: 12/26/2022]
|
9
|
Bo W, Chen L, Qin D, Geng S, Li J, Mei H, Li B, Liang G. Application of quantitative structure-activity relationship to food-derived peptides: Methods, situations, challenges and prospects. Trends Food Sci Technol 2021. [DOI: 10.1016/j.tifs.2021.05.031] [Citation(s) in RCA: 21] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|
10
|
Gritti F. Perspective on the Future Approaches to Predict Retention in Liquid Chromatography. Anal Chem 2021; 93:5653-5664. [PMID: 33797872 DOI: 10.1021/acs.analchem.0c05078] [Citation(s) in RCA: 31] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
The demand for rapid column screening, computer-assisted method development and method transfer, and unambiguous compound identification by LC/MS analyses has pushed analysts to adopt experimental protocols and software for the accurate prediction of the retention time in liquid chromatography (LC). This Perspective discusses the classical approaches used to predict retention times in LC over the last three decades and proposes future requirements to increase their accuracy. First, inverse methods for retention prediction are essentially applied during screening and gradient method optimization: a minimum number of experiments or design of experiments (DoE) is run to train and calibrate a model (either purely statistical or based on the principles and fundamentals of liquid chromatography) by a mere fitting process. They do not require the accurate knowledge of the true column hold-up volume V0, system dwell volume Vdwell (in gradient elution), and the retention behavior (k versus the content of strong solvent φ, temperature T, pH, and ionic strength I) of the analytes. Their relative accuracy is often excellent below a few percent. Statistical methods are expected to be the most attractive to handle very complex retention behavior such as in mixed-mode chromatography (MMC). Fundamentally correct retention models accounting for the simultaneous impact of φ, I, pH, and T in MMC are needed for method development based on chromatography principles. Second, direct methods for retention prediction are ideally suited for accurate method transfer from one column/system configuration to another: these quality by design (QbD) methods are based on the fundamentals and principles of solid-liquid adsorption and gradient chromatography. No model calibration is necessary; however, they require universal conventions for the accurate determination of true retention factors (for 1 < k < 30) as a function of the experimental variables (φ, T, pH, and I) and of the true column/system parameters (V0, Vdwell, dispersion volume, σ, and relaxation volume, τ, of the programmed gradient profile at the column inlet and gradient distortion at the column outlet). Finally, when the molecular structure of the analytes is either known or assumed, retention prediction has essentially been made on the basis of statistical approaches such as the linear solvation energy relationships (LSERs) and the quantitative structure retention relationships (QSRRs): their ability to accurately predict the retention remains limited within 10-30%. They have been combined with molecular similarity approaches (where the retention model is calibrated with compounds having structures similar to that of the targeted analytes) and artificial intelligence algorithms to further improve their accuracy below 10%. In this Perspective, it is proposed to adopt a more rigorous and fundamental approach by considering the very details of the solid-liquid adsorption process: Monte Carlo (MC) or molecular dynamics (MD) simulations are promising tools to explain and interpret retention data that are too complex to be described by either empirical or statistical retention models.
Collapse
Affiliation(s)
- Fabrice Gritti
- Waters Corporation, 34 Maple Street, Milford, Massachusetts 01757, United States
| |
Collapse
|
11
|
Krmar J, Vukićević M, Kovačević A, Protić A, Zečević M, Otašević B. Performance comparison of nonlinear and linear regression algorithms coupled with different attribute selection methods for quantitative structure - retention relationships modelling in micellar liquid chromatography. J Chromatogr A 2020; 1623:461146. [PMID: 32505269 DOI: 10.1016/j.chroma.2020.461146] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2019] [Revised: 04/16/2020] [Accepted: 04/18/2020] [Indexed: 01/30/2023]
Abstract
In micellar liquid chromatography (MLC), the addition of a surfactant to the mobile phase in excess is accompanied by an alteration of its solubilising capacity and a change in the stationary phase's properties. As an implication, the prediction of the analytes' retention in MLC mode becomes a challenging task. Mixed Quantitative Structure - Retention Relationships (QSRR) modelling represents a powerful tool for estimating the analytes' retention. This study compares 48 successfully developed mixed QSRR models with respect to their ability to predict retention of aripiprazole and its five impurities from molecular structures and factors that describe the Brij - acetonitrile system. The development of the models was based on an automatic combining of six attribute (feature) selection methods with eight predictive algorithms and the optimization of hyper-parameters. The feature selection methods included Principal Component Analysis (PCA), Non-negative Matrix Factorization (NMF), ReliefF, Multiple Linear Regression (MLR), Mutual Info and F-Regression. The series of investigated predictive algorithms comprised Linear Regressions (LR), Ridge Regression, Lasso Regression, Artificial Neural Networks (ANN), Support Vector Regression (SVR), Random Forest (RF), Gradient Boosted Trees (GBT) and K-Nearest neighbourhood (k-NN). A sufficient amount of data for building the model (78 cases in total) was provided by conducting 13 experiments for each of the 6 analytes and collecting the target responses afterwards. Different experimental settings were established by varying the values of the concentration of Brij L23, pH of the aqueous phase and acetonitrile content in the mobile phase according to the Box-Behnken design. In addition to the chromatographic parameters, the pool of independent variables was expanded by 27 molecular descriptors from all major groups (physicochemical, quantum chemical, topological and spatial structural descriptors). The best model was chosen by taking into consideration the Root Mean Square Error (RMSE) and cross-validation (CV) correlation coefficient (Q2) values. Interestingly, the comparative analysis indicated that a change in the set of input variables had a minor impact on the performance of the final models. On the other hand, different regression algorithms showed great diversity in the ability to learn patterns conserved in the data. In this regard, testing many regression algorithms is necessary in order to find the most suitable technique for model building. In the specific case, GBT-based models have demonstrated the best ability to predict the retention factor in the MLC mode. Steric factors and dipole-dipole interactions have proven to be relevant to the observed retention behaviour. This study, although being of a smaller scale, is a most promising starting point for comprehensive MLC retention prediction.
Collapse
Affiliation(s)
- Jovana Krmar
- Department of Drug Analysis, University of Belgrade - Faculty of Pharmacy, Vojvode Stepe 450, 11221 Belgrade, Serbia
| | - Milan Vukićević
- Center for business decision making, University of Belgrade - Faculty of Organizational Sciences, 154 Jove Ilića, 11000 Belgrade, Serbia
| | - Ana Kovačević
- Center for business decision making, University of Belgrade - Faculty of Organizational Sciences, 154 Jove Ilića, 11000 Belgrade, Serbia; Saga D.O.O, Bulevar Zorana Đinđića 64a, 11000 Belgrade, Serbia
| | - Ana Protić
- Department of Drug Analysis, University of Belgrade - Faculty of Pharmacy, Vojvode Stepe 450, 11221 Belgrade, Serbia
| | - Mira Zečević
- Department of Drug Analysis, University of Belgrade - Faculty of Pharmacy, Vojvode Stepe 450, 11221 Belgrade, Serbia
| | - Biljana Otašević
- Department of Drug Analysis, University of Belgrade - Faculty of Pharmacy, Vojvode Stepe 450, 11221 Belgrade, Serbia.
| |
Collapse
|
12
|
Ivanov MV, Bubis JA, Gorshkov V, Tarasova IA, Levitsky LI, Lobas AA, Solovyeva EM, Pridatchenko ML, Kjeldsen F, Gorshkov MV. DirectMS1: MS/MS-Free Identification of 1000 Proteins of Cellular Proteomes in 5 Minutes. Anal Chem 2020; 92:4326-4333. [DOI: 10.1021/acs.analchem.9b05095] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]
Affiliation(s)
- Mark V. Ivanov
- V. L. Talrose Institute for Energy Problems of Chemical Physics, N. N. Semenov Federal Research Center for Chemical Physics, Russian Academy of Sciences, 119334 Moscow, Russia
| | - Julia A. Bubis
- V. L. Talrose Institute for Energy Problems of Chemical Physics, N. N. Semenov Federal Research Center for Chemical Physics, Russian Academy of Sciences, 119334 Moscow, Russia
| | - Vladimir Gorshkov
- Department of Biochemistry and Molecular Biology, University of Southern Denmark, Odense M DK-5230, Denmark
| | - Irina A. Tarasova
- V. L. Talrose Institute for Energy Problems of Chemical Physics, N. N. Semenov Federal Research Center for Chemical Physics, Russian Academy of Sciences, 119334 Moscow, Russia
| | - Lev I. Levitsky
- V. L. Talrose Institute for Energy Problems of Chemical Physics, N. N. Semenov Federal Research Center for Chemical Physics, Russian Academy of Sciences, 119334 Moscow, Russia
| | - Anna A. Lobas
- V. L. Talrose Institute for Energy Problems of Chemical Physics, N. N. Semenov Federal Research Center for Chemical Physics, Russian Academy of Sciences, 119334 Moscow, Russia
| | - Elizaveta M. Solovyeva
- V. L. Talrose Institute for Energy Problems of Chemical Physics, N. N. Semenov Federal Research Center for Chemical Physics, Russian Academy of Sciences, 119334 Moscow, Russia
| | - Marina L. Pridatchenko
- V. L. Talrose Institute for Energy Problems of Chemical Physics, N. N. Semenov Federal Research Center for Chemical Physics, Russian Academy of Sciences, 119334 Moscow, Russia
| | - Frank Kjeldsen
- Department of Biochemistry and Molecular Biology, University of Southern Denmark, Odense M DK-5230, Denmark
| | - Mikhail V. Gorshkov
- V. L. Talrose Institute for Energy Problems of Chemical Physics, N. N. Semenov Federal Research Center for Chemical Physics, Russian Academy of Sciences, 119334 Moscow, Russia
- Moscow Institute of Physics and Technology (State University), 141700 Dolgoprudny, Russia
| |
Collapse
|
13
|
Tarasova IA, Masselon CD, Gorshkov AV, Gorshkov MV. Predictive chromatography of peptides and proteins as a complementary tool for proteomics. Analyst 2018; 141:4816-4832. [PMID: 27419248 DOI: 10.1039/c6an00919k] [Citation(s) in RCA: 30] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/25/2023]
Abstract
In the last couple of decades, considerable effort has been focused on developing methods for quantitative and qualitative proteome characterization. The method of choice in this characterization is mass spectrometry used in combination with sample separation. One of the most widely used separation techniques at the front end of a mass spectrometer is high performance liquid chromatography (HPLC). A unique feature of HPLC is its specificity to the amino acid sequence of separated peptides and proteins. This specificity may provide additional information about the peptides or proteins under study which is complementary to the mass spectrometry data. The value of this information for proteomics has been recognized in the past few decades, which has stimulated significant effort in the development and implementation of computational and theoretical models for the prediction of peptide retention time for a given sequence. Here we review the advances in this area and the utility of predicted retention times for proteomic applications.
Collapse
Affiliation(s)
- Irina A Tarasova
- Institute for Energy Problems of Chemical Physics, Russian Academy of Sciences, Moscow 119334, Russia.
| | - Christophe D Masselon
- CEA, iRTSV-BGE, Laboratoire d'Etude de la Dynamique des Protéomes, Grenoble, F-38000, France and INSERM, U1038-BGE, F-38000, Grenoble, France
| | - Alexander V Gorshkov
- N.N. Semenov Institute of Chemical Physics, Russian Academy of Sciences, Moscow 119991, Russia
| | - Mikhail V Gorshkov
- Institute for Energy Problems of Chemical Physics, Russian Academy of Sciences, Moscow 119334, Russia. and Moscow Institute of Physics and Technology (State University), Dolgoprudny, Moscow region 141700, Russia
| |
Collapse
|
14
|
Chen H, Shi P, Fan F, Tu M, Xu Z, Xu X, Du M. Complementation of UPLC-Q-TOF-MS and CESI-Q-TOF-MS on identification and determination of peptides from bovine lactoferrin. J Chromatogr B Analyt Technol Biomed Life Sci 2018; 1084:150-157. [DOI: 10.1016/j.jchromb.2018.03.022] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2018] [Revised: 03/05/2018] [Accepted: 03/10/2018] [Indexed: 12/27/2022]
|
15
|
Lobas AA, Levitsky LI, Fichtenbaum A, Surin AK, Pridatchenko ML, Mitulovic G, Gorshkov AV, Gorshkov MV. Predictive Liquid Chromatography of Peptides Based on Hydrophilic Interactions for Mass Spectrometry-Based Proteomics. JOURNAL OF ANALYTICAL CHEMISTRY 2018. [DOI: 10.1134/s1061934817140076] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
16
|
Parr MK, Schmidt AH. Life cycle management of analytical methods. J Pharm Biomed Anal 2018; 147:506-517. [DOI: 10.1016/j.jpba.2017.06.020] [Citation(s) in RCA: 53] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2017] [Revised: 06/10/2017] [Accepted: 06/12/2017] [Indexed: 11/30/2022]
|
17
|
Taraji M, Haddad PR, Amos RIJ, Talebi M, Szucs R, Dolan JW, Pohl CA. Chemometric-assisted method development in hydrophilic interaction liquid chromatography: A review. Anal Chim Acta 2017; 1000:20-40. [PMID: 29289311 DOI: 10.1016/j.aca.2017.09.041] [Citation(s) in RCA: 62] [Impact Index Per Article: 8.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2017] [Revised: 09/22/2017] [Accepted: 09/24/2017] [Indexed: 02/09/2023]
Abstract
With an enormous growth in the application of hydrophilic interaction liquid chromatography (HILIC), there has also been significant progress in HILIC method development. HILIC is a chromatographic method that utilises hydro-organic mobile phases with a high organic content, and a hydrophilic stationary phase. It has been applied predominantly in the determination of small polar compounds. Theoretical studies in computer-aided modelling tools, most importantly the predictive, quantitative structure retention relationship (QSRR) modelling methods, have attracted the attention of researchers and these approaches greatly assist the method development process. This review focuses on the application of computer-aided modelling tools in understanding the retention mechanism, the classification of HILIC stationary phases, prediction of retention times in HILIC systems, optimisation of chromatographic conditions, and description of the interaction effects of the chromatographic factors in HILIC separations. Additionally, what has been achieved in the potential application of QSRR methodology in combination with experimental design philosophy in the optimisation of chromatographic separation conditions in the HILIC method development process is communicated. Developing robust predictive QSRR models will undoubtedly facilitate more application of this chromatographic mode in a broader variety of research areas, significantly minimising cost and time of the experimental work.
Collapse
Affiliation(s)
- Maryam Taraji
- Australian Centre for Research on Separation Science (ACROSS), School of Physical Sciences-Chemistry, University of Tasmania, Private Bag 75, Hobart 7001, Australia
| | - Paul R Haddad
- Australian Centre for Research on Separation Science (ACROSS), School of Physical Sciences-Chemistry, University of Tasmania, Private Bag 75, Hobart 7001, Australia.
| | - Ruth I J Amos
- Australian Centre for Research on Separation Science (ACROSS), School of Physical Sciences-Chemistry, University of Tasmania, Private Bag 75, Hobart 7001, Australia
| | - Mohammad Talebi
- Australian Centre for Research on Separation Science (ACROSS), School of Physical Sciences-Chemistry, University of Tasmania, Private Bag 75, Hobart 7001, Australia
| | - Roman Szucs
- Pfizer Global Research and Development, CT13 9NJ, Sandwich, UK
| | - John W Dolan
- LC Resources, 1795 NW Wallace Rd., McMinnville, OR 97128, USA
| | | |
Collapse
|
18
|
Ivanov MV, Tarasova IA, Levitsky LI, Solovyeva EM, Pridatchenko ML, Lobas AA, Bubis JA, Gorshkov MV. MS/MS-Free Protein Identification in Complex Mixtures Using Multiple Enzymes with Complementary Specificity. J Proteome Res 2017; 16:3989-3999. [DOI: 10.1021/acs.jproteome.7b00365] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022]
Affiliation(s)
- Mark V. Ivanov
- V.L.
Talrose Institute for Energy Problems of Chemical Physics, Russian Academy of Sciences, 38 Leninsky Pr., Bld. 2, Moscow 119334, Russia
- Moscow
Institute
of Physics and Technology (State University), 9 Institutsky Per. Dolgoprudny, Moscow 141700, Russia
| | - Irina A. Tarasova
- V.L.
Talrose Institute for Energy Problems of Chemical Physics, Russian Academy of Sciences, 38 Leninsky Pr., Bld. 2, Moscow 119334, Russia
| | - Lev I. Levitsky
- V.L.
Talrose Institute for Energy Problems of Chemical Physics, Russian Academy of Sciences, 38 Leninsky Pr., Bld. 2, Moscow 119334, Russia
- Moscow
Institute
of Physics and Technology (State University), 9 Institutsky Per. Dolgoprudny, Moscow 141700, Russia
| | - Elizaveta M. Solovyeva
- V.L.
Talrose Institute for Energy Problems of Chemical Physics, Russian Academy of Sciences, 38 Leninsky Pr., Bld. 2, Moscow 119334, Russia
- Moscow
Institute
of Physics and Technology (State University), 9 Institutsky Per. Dolgoprudny, Moscow 141700, Russia
| | - Marina L. Pridatchenko
- V.L.
Talrose Institute for Energy Problems of Chemical Physics, Russian Academy of Sciences, 38 Leninsky Pr., Bld. 2, Moscow 119334, Russia
| | - Anna A. Lobas
- V.L.
Talrose Institute for Energy Problems of Chemical Physics, Russian Academy of Sciences, 38 Leninsky Pr., Bld. 2, Moscow 119334, Russia
- Moscow
Institute
of Physics and Technology (State University), 9 Institutsky Per. Dolgoprudny, Moscow 141700, Russia
| | - Julia A. Bubis
- V.L.
Talrose Institute for Energy Problems of Chemical Physics, Russian Academy of Sciences, 38 Leninsky Pr., Bld. 2, Moscow 119334, Russia
- Moscow
Institute
of Physics and Technology (State University), 9 Institutsky Per. Dolgoprudny, Moscow 141700, Russia
| | - Mikhail V. Gorshkov
- V.L.
Talrose Institute for Energy Problems of Chemical Physics, Russian Academy of Sciences, 38 Leninsky Pr., Bld. 2, Moscow 119334, Russia
- Moscow
Institute
of Physics and Technology (State University), 9 Institutsky Per. Dolgoprudny, Moscow 141700, Russia
| |
Collapse
|
19
|
Use of dual-filtering to create training sets leading to improved accuracy in quantitative structure-retention relationships modelling for hydrophilic interaction liquid chromatographic systems. J Chromatogr A 2017; 1507:53-62. [DOI: 10.1016/j.chroma.2017.05.044] [Citation(s) in RCA: 24] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2017] [Revised: 05/17/2017] [Accepted: 05/18/2017] [Indexed: 01/31/2023]
|
20
|
Swanson RK, Xu R, Nettleton DS, Glatz CE. Accounting for host cell protein behavior in anion-exchange chromatography. Biotechnol Prog 2016; 32:1453-1463. [PMID: 27556579 DOI: 10.1002/btpr.2342] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2016] [Revised: 07/27/2016] [Indexed: 11/11/2022]
Abstract
Host cell proteins (HCP) are a problematic set of impurities in downstream processing (DSP) as they behave most similarly to the target protein during separation. Approaching DSP with the knowledge of HCP separation behavior would be beneficial for the production of high purity recombinant biologics. Therefore, this work was aimed at characterizing the separation behavior of complex mixtures of HCP during a commonly used method: anion-exchange chromatography (AEX). An additional goal was to evaluate the performance of a statistical methodology, based on the characterization data, as a tool for predicting protein separation behavior. Aqueous two-phase partitioning followed by two-dimensional electrophoresis provided data on the three physicochemical properties most commonly exploited during DSP for each HCP: pI (isoelectric point), molecular weight, and surface hydrophobicity. The protein separation behaviors of two alternative expression host extracts (corn germ and E. coli) were characterized. A multivariate random forest (MVRF) statistical methodology was then applied to the database of characterized proteins creating a tool for predicting the AEX behavior of a mixture of proteins. The accuracy of the MVRF method was determined by calculating a root mean squared error value for each database. This measure never exceeded a value of 0.045 (fraction of protein populating each of the multiple separation fractions) for AEX. © 2016 American Institute of Chemical Engineers Biotechnol. Prog., 32:1453-1463, 2016.
Collapse
Affiliation(s)
- Ryan K Swanson
- Dept. of Chemical and Biological Engineering, Iowa State University, Ames, IA, 50011
| | - Ruo Xu
- Dept. of Statistics, Iowa State University, Ames, IA, 50011
| | | | - Charles E Glatz
- Dept. of Chemical and Biological Engineering, Iowa State University, Ames, IA, 50011
| |
Collapse
|
21
|
Falchi F, Bertozzi SM, Ottonello G, Ruda GF, Colombano G, Fiorelli C, Martucci C, Bertorelli R, Scarpelli R, Cavalli A, Bandiera T, Armirotti A. Kernel-Based, Partial Least Squares Quantitative Structure-Retention Relationship Model for UPLC Retention Time Prediction: A Useful Tool for Metabolite Identification. Anal Chem 2016; 88:9510-9517. [DOI: 10.1021/acs.analchem.6b02075] [Citation(s) in RCA: 32] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Affiliation(s)
- Federico Falchi
- Drug
Discovery and Development Department, Fondazione Istituto Italiano di Tecnologia, Via Morego 30, 16163 Genova, Italy
| | - Sine Mandrup Bertozzi
- Drug
Discovery and Development Department, Fondazione Istituto Italiano di Tecnologia, Via Morego 30, 16163 Genova, Italy
| | - Giuliana Ottonello
- Drug
Discovery and Development Department, Fondazione Istituto Italiano di Tecnologia, Via Morego 30, 16163 Genova, Italy
| | - Gian Filippo Ruda
- Drug
Discovery and Development Department, Fondazione Istituto Italiano di Tecnologia, Via Morego 30, 16163 Genova, Italy
| | - Giampiero Colombano
- Drug
Discovery and Development Department, Fondazione Istituto Italiano di Tecnologia, Via Morego 30, 16163 Genova, Italy
| | - Claudio Fiorelli
- Drug
Discovery and Development Department, Fondazione Istituto Italiano di Tecnologia, Via Morego 30, 16163 Genova, Italy
| | - Cataldo Martucci
- Drug
Discovery and Development Department, Fondazione Istituto Italiano di Tecnologia, Via Morego 30, 16163 Genova, Italy
| | - Rosalia Bertorelli
- Drug
Discovery and Development Department, Fondazione Istituto Italiano di Tecnologia, Via Morego 30, 16163 Genova, Italy
| | - Rita Scarpelli
- Drug
Discovery and Development Department, Fondazione Istituto Italiano di Tecnologia, Via Morego 30, 16163 Genova, Italy
| | - Andrea Cavalli
- Drug
Discovery and Development Department, Fondazione Istituto Italiano di Tecnologia, Via Morego 30, 16163 Genova, Italy
- Department
of Pharmacy and Biotechnology, University of Bologna, Via Belmeloro
6, 40126 Bologna, Italy
| | - Tiziano Bandiera
- Drug
Discovery and Development Department, Fondazione Istituto Italiano di Tecnologia, Via Morego 30, 16163 Genova, Italy
| | - Andrea Armirotti
- Drug
Discovery and Development Department, Fondazione Istituto Italiano di Tecnologia, Via Morego 30, 16163 Genova, Italy
| |
Collapse
|
22
|
Žuvela P, Macur K, Jay Liu J, Bączek T. Exploiting non-linear relationships between retention time and molecular structure of peptides originating from proteomes and comparing three multivariate approaches. J Pharm Biomed Anal 2016; 127:94-100. [DOI: 10.1016/j.jpba.2016.01.055] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2015] [Revised: 01/11/2016] [Accepted: 01/23/2016] [Indexed: 12/21/2022]
|
23
|
Maes E, Kelchtermans P, Bittremieux W, De Grave K, Degroeve S, Hooyberghs J, Mertens I, Baggerman G, Ramon J, Laukens K, Martens L, Valkenborg D. Designing biomedical proteomics experiments: state-of-the-art and future perspectives. Expert Rev Proteomics 2016; 13:495-511. [PMID: 27031651 DOI: 10.1586/14789450.2016.1172967] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
With the current expanded technical capabilities to perform mass spectrometry-based biomedical proteomics experiments, an improved focus on the design of experiments is crucial. As it is clear that ignoring the importance of a good design leads to an unprecedented rate of false discoveries which would poison our results, more and more tools are developed to help researchers designing proteomic experiments. In this review, we apply statistical thinking to go through the entire proteomics workflow for biomarker discovery and validation and relate the considerations that should be made at the level of hypothesis building, technology selection, experimental design and the optimization of the experimental parameters.
Collapse
Affiliation(s)
- Evelyne Maes
- a Applied Bio & molecular systems , VITO , Mol , Belgium.,b CFP , University of Antwerp , Antwerp , Belgium
| | - Pieter Kelchtermans
- b CFP , University of Antwerp , Antwerp , Belgium.,c Medical Biotechnology Center , VIB , Ghent , Belgium.,d Department of Biochemistry , Ghent University , Ghent , Belgium.,e Bioinformatics Institute Ghent , Ghent University , Ghent , Belgium
| | - Wout Bittremieux
- f Department of Mathematics and Computer Science , University of Antwerp , Antwerp , Belgium.,g Biomedical Informatics Research Center Antwerp (biomina) , University of Antwerp/Antwerp University Hospital , Antwerp , Belgium
| | - Kurt De Grave
- h Department of Computer Science , KU Leuven , Leuven , Belgium
| | - Sven Degroeve
- c Medical Biotechnology Center , VIB , Ghent , Belgium.,d Department of Biochemistry , Ghent University , Ghent , Belgium.,e Bioinformatics Institute Ghent , Ghent University , Ghent , Belgium
| | - Jef Hooyberghs
- a Applied Bio & molecular systems , VITO , Mol , Belgium
| | - Inge Mertens
- a Applied Bio & molecular systems , VITO , Mol , Belgium.,b CFP , University of Antwerp , Antwerp , Belgium
| | - Geert Baggerman
- a Applied Bio & molecular systems , VITO , Mol , Belgium.,b CFP , University of Antwerp , Antwerp , Belgium
| | - Jan Ramon
- h Department of Computer Science , KU Leuven , Leuven , Belgium.,i INRIA , Lille , France
| | - Kris Laukens
- f Department of Mathematics and Computer Science , University of Antwerp , Antwerp , Belgium.,g Biomedical Informatics Research Center Antwerp (biomina) , University of Antwerp/Antwerp University Hospital , Antwerp , Belgium
| | - Lennart Martens
- c Medical Biotechnology Center , VIB , Ghent , Belgium.,d Department of Biochemistry , Ghent University , Ghent , Belgium.,e Bioinformatics Institute Ghent , Ghent University , Ghent , Belgium
| | - Dirk Valkenborg
- a Applied Bio & molecular systems , VITO , Mol , Belgium.,b CFP , University of Antwerp , Antwerp , Belgium.,j Interuniversity Institute for Biostatistics and statistical Bioinformatics , Hasselt University , Hasselt , Belgium
| |
Collapse
|
24
|
Bischoff R, Permentier H, Guryev V, Horvatovich P. Genomic variability and protein species — Improving sequence coverage for proteogenomics. J Proteomics 2016; 134:25-36. [DOI: 10.1016/j.jprot.2015.09.021] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2015] [Revised: 09/06/2015] [Accepted: 09/14/2015] [Indexed: 12/30/2022]
|
25
|
Hou S, Wang J, Li Z, Wang Y, Wang Y, Yang S, Xu J, Zhu W. Five-descriptor model to predict the chromatographic sequence of natural compounds. J Sep Sci 2016; 39:864-72. [PMID: 26718117 DOI: 10.1002/jssc.201501016] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2015] [Revised: 11/18/2015] [Accepted: 12/17/2015] [Indexed: 02/02/2023]
Abstract
Despite the recent introduction of mass detection techniques, ultraviolet detection is still widely applied in the field of the chromatographic analysis of natural medicines. Here, a neural network cascade model consisting of nine small artificial neural network units was innovatively developed to predict the chromatographic sequence of natural compounds by integrating five molecular descriptors as the input. A total of 117 compounds of known structure were collected for model building. The order of appearance of each compound was determined in gradient chromatography. Strong linear correlation was found between the predicted and actual chromatographic position orders (Spearman's rho = 0.883, p < 0.0001). Application of the model to the external validation set of nine natural compounds was shown to dramatically increase the prediction accuracy of the real chromatographic order of multiple compounds. A case study shows that chromatographic sequence prediction based on a neural network cascade facilitated compound identification in the chromatographic fingerprint of Radix Salvia miltiorrhiza. For natural medicines of known compound composition, our method provides a feasible means for identifying the constituents of interest when only ultraviolet detection is available.
Collapse
Affiliation(s)
- Shuying Hou
- Department of Pharmacy Intravenous Admixture Service, the First Affiliated Hospital of Harbin Medical University, Harbin, P. R., China
| | - Jinhua Wang
- Department of Pharmacy Intravenous Admixture Service, the First Affiliated Hospital of Harbin Medical University, Harbin, P. R., China
| | - Zhangming Li
- Department of Pharmacy Administration, Harbin Medical University, Harbin, P. R., China
| | - Yang Wang
- Department of Pharmacy Intravenous Admixture Service, the First Affiliated Hospital of Harbin Medical University, Harbin, P. R., China
| | - Ying Wang
- Department of Pharmacy Intravenous Admixture Service, the First Affiliated Hospital of Harbin Medical University, Harbin, P. R., China
| | - Songling Yang
- Department of Biology Pharmacy, Heilongjiang Vocational College of Biology Science and Technology, Harbin, P. R., China
| | - Jia Xu
- Department of Nephrology, the Fourth Affiliated Hospital, Harbin Medical University, Harbin, P. R., China
| | - Wenliang Zhu
- Institute of Clinical Pharmacology, the Second Affiliated Hospital of Harbin Medical University, Harbin, P. R., China
| |
Collapse
|
26
|
Žuvela P, Liu JJ, Macur K, Bączek T. Molecular Descriptor Subset Selection in Theoretical Peptide Quantitative Structure–Retention Relationship Model Development Using Nature-Inspired Optimization Algorithms. Anal Chem 2015; 87:9876-83. [DOI: 10.1021/acs.analchem.5b02349] [Citation(s) in RCA: 35] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
Affiliation(s)
- Petar Žuvela
- Department
of Chemical Engineering, Pukyong National University, 365 Sinseon-ro, 608-739 Busan, Korea
| | - J. Jay Liu
- Department
of Chemical Engineering, Pukyong National University, 365 Sinseon-ro, 608-739 Busan, Korea
| | - Katarzyna Macur
- Laboratory
of Mass Spectrometry, Intercollegiate Faculty of Biotechnology, University of Gdańsk and Medical University of Gdańsk, Kładki
24, 80-822 Gdańsk, Poland
| | - Tomasz Bączek
- Department
of Pharmaceutical Chemistry, Medical University of Gdańsk, Hallera
107, 80-416 Gdańsk, Poland
| |
Collapse
|
27
|
Toward greener analytical techniques for the absolute quantification of peptides in pharmaceutical and biological samples. J Pharm Biomed Anal 2015; 113:181-8. [DOI: 10.1016/j.jpba.2015.03.023] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2015] [Revised: 03/19/2015] [Accepted: 03/23/2015] [Indexed: 11/22/2022]
|
28
|
Horvatovich P, Lundberg EK, Chen YJ, Sung TY, He F, Nice EC, Goode RJ, Yu S, Ranganathan S, Baker MS, Domont GB, Velasquez E, Li D, Liu S, Wang Q, He QY, Menon R, Guan Y, Corrales FJ, Segura V, Casal JI, Pascual-Montano A, Albar JP, Fuentes M, Gonzalez-Gonzalez M, Diez P, Ibarrola N, Degano RM, Mohammed Y, Borchers CH, Urbani A, Soggiu A, Yamamoto T, Salekdeh GH, Archakov A, Ponomarenko E, Lisitsa A, Lichti CF, Mostovenko E, Kroes RA, Rezeli M, Végvári Á, Fehniger TE, Bischoff R, Vizcaíno JA, Deutsch EW, Lane L, Nilsson CL, Marko-Varga G, Omenn GS, Jeong SK, Lim JS, Paik YK, Hancock WS. Quest for Missing Proteins: Update 2015 on Chromosome-Centric Human Proteome Project. J Proteome Res 2015; 14:3415-31. [DOI: 10.1021/pr5013009] [Citation(s) in RCA: 52] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Affiliation(s)
- Péter Horvatovich
- Analytical
Biochemistry, Department of Pharmacy, University of Groningen, A. Deusinglaan
1, 9713 AV Groningen, The Netherlands
| | - Emma K. Lundberg
- Science
for Life Laboratory, KTH - Royal Institute of Technology, SE-171 21 Stockholm, Sweden
| | - Yu-Ju Chen
- Institute
of Chemistry, Academia Sinica, 128 Academia Road Sec. 2, Taipei 115, Taiwan
| | - Ting-Yi Sung
- Institute
of Information Science, Academia Sinica, 128 Academia Road Sec. 2, Taipei 115, Taiwan
| | - Fuchu He
- The State Key Laboratory of Proteomics, Beijing Proteome Research Center, Beijing Institute of Radiation Medicine, No. 27 Taiping Road, Haidian District, Beijing 100850, China
| | - Edouard C. Nice
- Department
of Biochemistry and Molecular Biology, Monash University, Clayton, Victoria 3800, Australia
| | - Robert J. Goode
- Department
of Biochemistry and Molecular Biology, Monash University, Clayton, Victoria 3800, Australia
| | - Simon Yu
- Department
of Biochemistry and Molecular Biology, Monash University, Clayton, Victoria 3800, Australia
| | - Shoba Ranganathan
- Department
of Chemistry and Biomolecular Sciences and ARC Centre of Excellence
in Bioinformatics, Macquarie University, Sydney, New South Wales 2109, Australia
| | - Mark S. Baker
- Australian
School of Advanced Medicine, Macquarie University, Sydney, NSW 2109, Australia
| | - Gilberto B. Domont
- Proteomics Unit, Institute of Chemistry, Federal University of Rio de Janeiro, Cidade Universitária, Av Athos da Silveira Ramos 149, CT-A542, 21941-909 Rio de Janeriro, Rj, Brazil
| | - Erika Velasquez
- Proteomics Unit, Institute of Chemistry, Federal University of Rio de Janeiro, Cidade Universitária, Av Athos da Silveira Ramos 149, CT-A542, 21941-909 Rio de Janeriro, Rj, Brazil
| | - Dong Li
- The State Key Laboratory of Proteomics, Beijing Proteome Research Center, Beijing Institute of Radiation Medicine, No. 27 Taiping Road, Haidian District, Beijing 100850, China
| | - Siqi Liu
- Beijing Institute of Genomics and BGI Shenzhen, No. 1 Beichen West Road, Chaoyang District, Beijing 100101, China
- BGI Shenzhen, Beishan Road, Yantian District, Shenzhen, 518083, China
| | - Quanhui Wang
- Beijing Institute of Genomics and BGI Shenzhen, No. 1 Beichen West Road, Chaoyang District, Beijing 100101, China
| | - Qing-Yu He
- Key Laboratory of Functional Protein
Research of Guangdong
Higher Education Institutes, College of Life Science and Technology, Jinan University, Guangzhou 510632, China
| | - Rajasree Menon
- Department of Computational Medicine & Bioinformatics, University of Michigan, 100 Washtenaw Avenue, Ann Arbor, Michigan 48109-2218, United States
| | - Yuanfang Guan
- Departments of Computational Medicine & Bioinformatics and Computer Sciences, University of Michigan, 100 Washtenaw Avenue, Ann Arbor, Michigan 48109-2218, United States
| | - Fernando J. Corrales
- ProteoRed-ISCIII,
Biomolecular and Bioinformatics Resources Platform (PRB2), Spanish
Consortium of C-HPP (Chr-16), CIMA, University of Navarra, 31008 Pamplona, Spain
- Chr16 SpHPP Consortium, CIMA, University of Navarra, 31008 Pamplona, Spain
| | - Victor Segura
- ProteoRed-ISCIII,
Biomolecular and Bioinformatics Resources Platform (PRB2), Spanish
Consortium of C-HPP (Chr-16), CIMA, University of Navarra, 31008 Pamplona, Spain
- Chr16 SpHPP Consortium, CIMA, University of Navarra, 31008 Pamplona, Spain
| | - J. Ignacio Casal
- Department
of Cellular and Molecular Medicine, Centro de Investigaciones Biológicas (CIB-CSIC), 28040 Madrid, Spain
| | | | - Juan P. Albar
- Centro Nacional de Biotecnologia (CNB-CSIC), Cantoblanco, 28049 Madrid, Spain
| | - Manuel Fuentes
- Cancer
Research Center. Proteomics Unit and General Service of Cytometry,
Department of Medicine, University of Salmanca-CSIC, IBSAL, Campus Miguel de Unamuno
s/n, 37007 Salamanca, Spain
| | - Maria Gonzalez-Gonzalez
- Cancer
Research Center. Proteomics Unit and General Service of Cytometry,
Department of Medicine, University of Salmanca-CSIC, IBSAL, Campus Miguel de Unamuno
s/n, 37007 Salamanca, Spain
| | - Paula Diez
- Cancer
Research Center. Proteomics Unit and General Service of Cytometry,
Department of Medicine, University of Salmanca-CSIC, IBSAL, Campus Miguel de Unamuno
s/n, 37007 Salamanca, Spain
| | - Nieves Ibarrola
- Cancer
Research Center. Proteomics Unit and General Service of Cytometry,
Department of Medicine, University of Salmanca-CSIC, IBSAL, Campus Miguel de Unamuno
s/n, 37007 Salamanca, Spain
| | - Rosa M. Degano
- Cancer
Research Center. Proteomics Unit and General Service of Cytometry,
Department of Medicine, University of Salmanca-CSIC, IBSAL, Campus Miguel de Unamuno
s/n, 37007 Salamanca, Spain
| | - Yassene Mohammed
- University of Victoria-Genome British Columbia Proteomics
Centre, Vancouver Island
Technology Park, #3101−4464 Markham Street, Victoria, British Columbia V8Z 7X8, Canada
- Center
for Proteomics and Metabolomics, Leiden University Medical Center, 2333 ZA Leiden, The Netherlands
| | - Christoph H. Borchers
- University of Victoria-Genome British Columbia Proteomics
Centre, Vancouver Island
Technology Park, #3101−4464 Markham Street, Victoria, British Columbia V8Z 7X8, Canada
| | - Andrea Urbani
- Proteomics
and Metabonomic, Laboratory, Fondazione Santa Lucia, Rome, Italy
- Department
of Experimental Medicine and Surgery, University of Rome “Tor Vergata”, Rome, Italy
| | - Alessio Soggiu
- Department
of Veterinary Science and Public Health (DIVET), University of Milano, via Celoria 10, 20133 Milano, Italy
| | - Tadashi Yamamoto
- Institute
of Nephrology, Graduate School of Medical and Dental Sciences, Niigata University, Niigata, Japan
| | - Ghasem Hosseini Salekdeh
- Department of Molecular Systems Biology at Cell Science Research Center, Royan Institute for Stem Cell Biology and Technology, ACECR, Tehran, Iran
- Department of Systems Biology, Agricultural Biotechnology Research Institute of Iran, Karaj, Iran
| | | | | | - Andrey Lisitsa
- Orechovich Institute of Biomedical Chemistry, Moscow, Russia
| | - Cheryl F. Lichti
- Department
of Pharmacology and Toxicology, The University of Texas Medical Branch, Galveston, Texas 77555-0617, United States
| | - Ekaterina Mostovenko
- Department
of Pharmacology and Toxicology, The University of Texas Medical Branch, Galveston, Texas 77555-0617, United States
| | - Roger A. Kroes
- Falk Center for Molecular Therapeutics, Department of Biomedical Engineering, Northwestern University, 1801 Maple Ave., Suite 4300, Evanston, Illinois 60201, United States
| | - Melinda Rezeli
- Clinical Protein Science & Imaging, Department of Biomedical Engineering, Lund University, BMC D13, 221 84 Lund, Sweden
| | - Ákos Végvári
- Clinical Protein Science & Imaging, Department of Biomedical Engineering, Lund University, BMC D13, 221 84 Lund, Sweden
| | - Thomas E. Fehniger
- Clinical Protein Science & Imaging, Department of Biomedical Engineering, Lund University, BMC D13, 221 84 Lund, Sweden
| | - Rainer Bischoff
- Analytical
Biochemistry, Department of Pharmacy, University of Groningen, A. Deusinglaan
1, 9713 AV Groningen, The Netherlands
| | - Juan Antonio Vizcaíno
- European Molecular
Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, CB10 1SD, Hinxton, Cambridge, United Kingdom
| | - Eric W. Deutsch
- Institute for Systems Biology, 401 Terry Avenue North, Seattle, Washington 98109, United States
| | - Lydie Lane
- SIB Swiss Institute of Bioinformatics, Geneva, Switzerland
- Department
of Human Protein Science, Faculty of Medicine, University of Geneva, Geneva, Switzerland
| | - Carol L. Nilsson
- Department
of Pharmacology and Toxicology, The University of Texas Medical Branch, Galveston, Texas 77555-0617, United States
| | - György Marko-Varga
- Clinical Protein Science & Imaging, Department of Biomedical Engineering, Lund University, BMC D13, 221 84 Lund, Sweden
| | - Gilbert S. Omenn
- Departments of Computational Medicine & Bioinformatics, Internal Medicine, Human Genetics and School of Public Health, University of Michigan, 100 Washtenaw Avenue, Ann Arbor, Michigan 48109-2218, United States
| | - Seul-Ki Jeong
- Departments of Integrated Omics for Biomedical Science & Biochemistry, College of Life Science and Technology, Yonsei Proteome Research Center, Yonsei University, Seoul, 120-749, Korea
| | - Jong-Sun Lim
- Departments of Integrated Omics for Biomedical Science & Biochemistry, College of Life Science and Technology, Yonsei Proteome Research Center, Yonsei University, Seoul, 120-749, Korea
| | - Young-Ki Paik
- Departments of Integrated Omics for Biomedical Science & Biochemistry, College of Life Science and Technology, Yonsei Proteome Research Center, Yonsei University, Seoul, 120-749, Korea
| | - William S. Hancock
- The
Barnett Institute of Chemical and Biological Analysis, Northeastern University, 140 The Fenway, Boston, Massachusetts 02115, United States
| |
Collapse
|
29
|
Iwaniak A, Minkiewicz P, Darewicz M, Protasiewicz M, Mogut D. Chemometrics and cheminformatics in the analysis of biologically active peptides from food sources. J Funct Foods 2015. [DOI: 10.1016/j.jff.2015.04.038] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022] Open
|
30
|
Wolfender JL, Marti G, Thomas A, Bertrand S. Current approaches and challenges for the metabolite profiling of complex natural extracts. J Chromatogr A 2015; 1382:136-64. [DOI: 10.1016/j.chroma.2014.10.091] [Citation(s) in RCA: 352] [Impact Index Per Article: 39.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2014] [Revised: 10/23/2014] [Accepted: 10/26/2014] [Indexed: 12/11/2022]
|
31
|
Support Vector Regression Based QSPR for the Prediction of Retention Time of Peptides in Reversed-Phase Liquid Chromatography. Chromatographia 2014. [DOI: 10.1007/s10337-014-2819-1] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
|
32
|
Applications of Peptide Retention Time in Proteomic Data Analysis. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2014; 845:67-75. [DOI: 10.1007/978-94-017-9523-4_7] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/18/2023]
|
33
|
Kelchtermans P, Bittremieux W, De Grave K, Degroeve S, Ramon J, Laukens K, Valkenborg D, Barsnes H, Martens L. Machine learning applications in proteomics research: how the past can boost the future. Proteomics 2014; 14:353-66. [PMID: 24323524 DOI: 10.1002/pmic.201300289] [Citation(s) in RCA: 46] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2013] [Revised: 09/24/2013] [Accepted: 10/14/2013] [Indexed: 01/22/2023]
Abstract
Machine learning is a subdiscipline within artificial intelligence that focuses on algorithms that allow computers to learn solving a (complex) problem from existing data. This ability can be used to generate a solution to a particularly intractable problem, given that enough data are available to train and subsequently evaluate an algorithm on. Since MS-based proteomics has no shortage of complex problems, and since publicly available data are becoming available in ever growing amounts, machine learning is fast becoming a very popular tool in the field. We here therefore present an overview of the different applications of machine learning in proteomics that together cover nearly the entire wet- and dry-lab workflow, and that address key bottlenecks in experiment planning and design, as well as in data processing and analysis.
Collapse
Affiliation(s)
- Pieter Kelchtermans
- Department of Medical Protein Research, VIB, Ghent, Belgium; Faculty of Medicine and Health Sciences, Department of Biochemistry, Ghent University, Ghent, Belgium; Flemish Institute for Technological Research (VITO), Boeretang, Mol, Belgium
| | | | | | | | | | | | | | | | | |
Collapse
|
34
|
Influence of sample and mobile phase composition on peptide retention behaviour and sensitivity in reversed-phase liquid chromatography/mass spectrometry. J Chromatogr A 2013; 1314:199-207. [DOI: 10.1016/j.chroma.2013.09.036] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2013] [Revised: 09/09/2013] [Accepted: 09/10/2013] [Indexed: 11/20/2022]
|
35
|
LC–MS and chemometrics for steroid biomarker profiles in view of the future diagnostics of diseases? Bioanalysis 2013; 5:1347-51. [DOI: 10.4155/bio.13.63] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
|
36
|
Lobas AA, Verenchikov AN, Goloborodko AA, Levitsky LI, Gorshkov MV. Combination of Edman degradation of peptides with liquid chromatography/mass spectrometry workflow for peptide identification in bottom-up proteomics. RAPID COMMUNICATIONS IN MASS SPECTROMETRY : RCM 2013; 27:391-400. [PMID: 23280970 DOI: 10.1002/rcm.6462] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/27/2012] [Revised: 11/01/2012] [Accepted: 11/02/2012] [Indexed: 06/01/2023]
Abstract
RATIONALE High-throughput methods of proteomics are essential for identification of proteins in a cell or tissue under certain conditions. Most of these methods require tandem mass spectrometry (MS/MS). A multidimensional approach including predictive chromatography and partial chemical degradation could be a valuable alternative and/or addition to MS/MS. METHODS In the proposed strategy peptides are identified in a three-dimensional (3D) search space consisting of retention time (RT), mass, and reduced mass after one-step partial Edman degradation. The strategy was evaluated in silico for two databases: baker's yeast and human proteins. Rates of unambiguous identifications were estimated for mass accuracies from 0.001 to 0.05 Da and RT prediction accuracies from 0.1 to 5 min. Rates of Edman reactions were measured for test peptides. RESULTS A 3D description of proteolytic peptides allowing unambiguous identification without employing MS/MS of up to 95% and 80% of tryptic peptides from the yeast and human proteomes, respectively, was considered. Further extension of the search space to a four-dimensional one by incorporating the second N-terminal amino acid residue as the fourth dimension was also considered and was shown to result in up to 90% of human peptides being identified unambiguously. CONCLUSIONS The proposed 3D search space can be a useful alternative to MS/MS-based peptide identification approach. Experimental implementations of the proposed method within the on-line liquid chromatography/mass spectrometry (LC/MS) and off-line matrix-assisted laser desorption/ionization (MALDI) workflows are in progress.
Collapse
Affiliation(s)
- Anna A Lobas
- Institute for Energy Problems of Chemical Physics, Russian Academy of Sciences, Moscow, Russia
| | | | | | | | | |
Collapse
|
37
|
Silvestre DD, Zoppis I, Brambilla F, Bellettato V, Mauri G, Mauri P. Availability of MudPIT data for classification of biological samples. J Clin Bioinforma 2013; 3:1. [PMID: 23317455 PMCID: PMC3563498 DOI: 10.1186/2043-9113-3-1] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2012] [Accepted: 01/07/2013] [Indexed: 01/18/2023] Open
Abstract
Background Mass spectrometry is an important analytical tool for clinical proteomics. Primarily employed for biomarker discovery, it is increasingly used for developing methods which may help to provide unambiguous diagnosis of biological samples. In this context, we investigated the classification of phenotypes by applying support vector machine (SVM) on experimental data obtained by MudPIT approach. In particular, we compared the performance capabilities of SVM by using two independent collection of complex samples and different data-types, such as mass spectra (m/z), peptides and proteins. Results Globally, protein and peptide data allowed a better discriminant informative content than experimental mass spectra (overall accuracy higher than 87% in both collection 1 and 2). These results indicate that sequencing of peptides and proteins reduces the experimental noise affecting the raw mass spectra, and allows the extraction of more informative features available for the effective classification of samples. In addition, proteins and peptides features selected by SVM matched for 80% with the differentially expressed proteins identified by the MAProMa software. Conclusions These findings confirm the availability of the most label-free quantitative methods based on processing of spectral count and SEQUEST-based SCORE values. On the other hand, it stresses the usefulness of MudPIT data for a correct grouping of sample phenotypes, by applying both supervised and unsupervised learning algorithms. This capacity permit the evaluation of actual samples and it is a good starting point to translate proteomic methodology to clinical application.
Collapse
Affiliation(s)
- Dario Di Silvestre
- , Institute for Biomedical Technologies (ITB-CNR), via F.lli Cervi 93, Segrate (Milan), Italy
| | - Italo Zoppis
- Department of Informatics, Systems and Communication, Viale Sarca 336, University of Milano-Bicocca, Milan, Italy
| | - Francesca Brambilla
- , Institute for Biomedical Technologies (ITB-CNR), via F.lli Cervi 93, Segrate (Milan), Italy
| | - Valeria Bellettato
- , Institute for Biomedical Technologies (ITB-CNR), via F.lli Cervi 93, Segrate (Milan), Italy
| | - Giancarlo Mauri
- Department of Informatics, Systems and Communication, Viale Sarca 336, University of Milano-Bicocca, Milan, Italy
| | - Pierluigi Mauri
- , Institute for Biomedical Technologies (ITB-CNR), via F.lli Cervi 93, Segrate (Milan), Italy
| |
Collapse
|
38
|
Moskovets E, Goloborodko AA, Gorshkov AV, Gorshkov MV. Limitation of predictive 2-D liquid chromatography in reducing the database search space in shotgun proteomics: in silico studies. J Sep Sci 2012; 35:1771-8. [PMID: 22807359 DOI: 10.1002/jssc.201100798] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
Abstract
A two-dimensional (2-D) liquid chromatography (LC) separation of complex peptide mixtures that combines a normal phase utilizing hydrophilic interactions and a reversed phase offers reportedly the highest level of 2-D LC orthogonality by providing an even spread of peptides across multiple LC fractions. Matching experimental peptide retention times to those predicted by empirical models describing chromatographic separation in each LC dimension leads to a significant reduction in a database search space. In this work, we calculated the retention times of tryptic peptides separated in the C18 reversed phase at different separation conditions (pH 2 and pH 10) and in TSK gel Amide-80 normal phase. We show that retention times calculated for different 2-D LC separation schemes utilizing these phases start to correlate once the mass range of peptides under analysis becomes progressively narrow. This effect is explained by high degree of correlation between retention coefficients in the considered phases.
Collapse
|
39
|
Abstract
Selected reaction monitoring (SRM) has a long history of use in the area of quantitative MS. In recent years, the approach has seen increased application to quantitative proteomics, facilitating multiplexed relative and absolute quantification studies in a variety of organisms. This article discusses SRM, after introducing the context of quantitative proteomics (specifically primarily absolute quantification) where it finds most application, and considers topics such as the theory and advantages of SRM, the selection of peptide surrogates for protein quantification, the design of optimal SRM co-ordinates and the handling of SRM data. A number of published studies are also discussed to demonstrate the impact that SRM has had on the field of quantitative proteomics.
Collapse
|
40
|
Krokhin O. Peptide retention prediction in reversed-phase chromatography: proteomic applications. Expert Rev Proteomics 2012; 9:1-4. [PMID: 22292816 DOI: 10.1586/epr.11.79] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
|
41
|
Darewicz M, Dziuba B, Minkiewicz P, Dziuba J. The Preventive Potential of Milk and Colostrum Proteins and Protein Fragments. FOOD REVIEWS INTERNATIONAL 2011. [DOI: 10.1080/87559129.2011.563396] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
|
42
|
On the utility of predictive chromatography to complement mass spectrometry based intact protein identification. Anal Bioanal Chem 2011; 402:2521-9. [PMID: 21901462 DOI: 10.1007/s00216-011-5350-3] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2011] [Revised: 07/22/2011] [Accepted: 08/19/2011] [Indexed: 10/17/2022]
Abstract
The amino acid sequence determines the individual protein three-dimensional structure and its functioning in an organism. Therefore, "reading" a protein sequence and determining its changes due to mutations or post-translational modifications is one of the objectives of proteomic experiments. The commonly utilized approach is gradient high-performance liquid chromatography (HPLC) in combination with tandem mass spectrometry. While serving as a way to simplify the protein mixture, the liquid chromatography may be an additional analytical tool providing complementary information about the protein structure. Previous attempts to develop "predictive" HPLC for large biomacromolecules were limited by empirically derived equations based purely on the adsorption mechanisms of the retention and applicable to relatively small polypeptide molecules. A mechanism of the large biomacromolecule retention in reversed-phase gradient HPLC was described recently in thermodynamics terms by the analytical model of liquid chromatography at critical conditions (BioLCCC). In this work, we applied the BioLCCC model to predict retention of the intact proteins as well as their large proteolytic peptides separated under different HPLC conditions. The specific aim of these proof-of-principle studies was to demonstrate the feasibility of using "predictive" HPLC as a complementary tool to support the analysis of identified intact proteins in top-down, middle-down, and/or targeted selected reaction monitoring (SRM)-based proteomic experiments.
Collapse
|
43
|
Easy and accurate high-performance liquid chromatography retention prediction with different gradients, flow rates, and instruments by back-calculation of gradient and flow rate profiles. J Chromatogr A 2011; 1218:6742-9. [DOI: 10.1016/j.chroma.2011.07.070] [Citation(s) in RCA: 40] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2011] [Revised: 07/09/2011] [Accepted: 07/21/2011] [Indexed: 11/19/2022]
|
44
|
Boswell PG, Schellenberg JR, Carr PW, Cohen JD, Hegeman AD. A study on retention “projection” as a supplementary means for compound identification by liquid chromatography–mass spectrometry capable of predicting retention with different gradients, flow rates, and instruments. J Chromatogr A 2011; 1218:6732-41. [DOI: 10.1016/j.chroma.2011.07.105] [Citation(s) in RCA: 45] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2011] [Revised: 07/09/2011] [Accepted: 07/21/2011] [Indexed: 11/16/2022]
|
45
|
Sargaeva NP, Goloborodko AA, O'Connor PB, Moskovets E, Gorshkov MV. Sequence-specific predictive chromatography to assist mass spectrometric analysis of asparagine deamidation and aspartate isomerization in peptides. Electrophoresis 2011; 32:1962-9. [DOI: 10.1002/elps.201000507] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2010] [Revised: 12/21/2010] [Accepted: 12/30/2010] [Indexed: 11/08/2022]
|
46
|
Fæste CK, Rønning HT, Christians U, Granum PE. Liquid chromatography and mass spectrometry in food allergen detection. J Food Prot 2011; 74:316-45. [PMID: 21333155 DOI: 10.4315/0362-028x.jfp-10-336] [Citation(s) in RCA: 57] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Abstract
Food allergy is an important issue in the field of food safety because of the hazards for affected persons and the hygiene requirements and legal regulations imposed on the food industry. Consumer protection and law enforcement require suitable analytical techniques for the detection of allergens in foods. Immunological methods are currently preferred; however, confirmatory alternatives are needed. The determination of allergenic proteins by liquid chromatography and mass spectrometry has greatly advanced in recent years, and gel-free allergenomics is becoming a routinely used approach for the identification and quantitation of food allergens. The present review provides a brief overview of the principles of proteomic procedures, various chromatographic set ups, and mass spectrometry instrumentation used in allergenomics. A compendium of published liquid chromatography methods, proteomic analyses, typical marker peptides, and quantitative assays for 14 main allergy-causing foods is also included.
Collapse
Affiliation(s)
- Christiane Kruse Fæste
- Section of Chemistry, Department of Feed and Food Safety, National Veterinary Institute, P.O. Box 750 Sentrum, Oslo N-0106, Norway.
| | | | | | | |
Collapse
|
47
|
Data processing pipelines for comprehensive profiling of proteomics samples by label-free LC–MS for biomarker discovery. Talanta 2011; 83:1209-24. [DOI: 10.1016/j.talanta.2010.10.029] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2010] [Revised: 10/18/2010] [Accepted: 10/21/2010] [Indexed: 01/30/2023]
|
48
|
Bochet P, Rügheimer F, Guina T, Brooks P, Goodlett D, Clote P, Schwikowski B. Fragmentation-free LC-MS can identify hundreds of proteins. Proteomics 2010; 11:22-32. [DOI: 10.1002/pmic.200900765] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2009] [Revised: 09/02/2010] [Accepted: 09/20/2010] [Indexed: 11/09/2022]
|
49
|
Macur K, Temporini C, Massolini G, Grzenkowicz-Wydra J, Obuchowski M, Bączek T. Proteomic analysis of small acid soluble proteins in the spore core of Bacillus subtilis ΔprpE and 168 strains with predictions of peptides liquid chromatography retention times as an additional tool in protein identification. Proteome Sci 2010; 8:60. [PMID: 21092197 PMCID: PMC3003637 DOI: 10.1186/1477-5956-8-60] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2010] [Accepted: 11/22/2010] [Indexed: 12/03/2022] Open
Abstract
Background Sporulation, characteristic for some bacteria such as Bacillus subtilis, has not been entirely defined yet. Protein phosphatase E (PrpE) and small, acid soluble spore proteins (SASPs) influence this process. Nevertheless, direct result of PrpE interaction on SASPs content in spore coat of B. subtilis has not been evidenced so far. As proteomic approach enables global analysis of occurring proteins, therefore it was chosen in this experiment to compare SASPs occurrence in two strains of B. subtilis, standard 168 and ΔprpE, lacking PrpE phosphatase. Proteomic analysis is still a challenge, and despite of big approach in mass spectrometry (MS) field, the identification reliability remains unsatisfactory. Therefore there is a rising interest in new methods, particularly bioinformatic tools that would harden protein identification. Most of currently applied algorithms are based on MS-data. Information from separation steps is not still in routine usage, even though they also provide valuable facts about analyzed structures. The aim of this research was to apply a model for peptides retention times prediction, based on quantitative structure-retention relationships (QSRR) in SASPs analysis, obtained from two strains of B. subtilis proteome digests after separation and identification of the peptides by LC-ESI-MS/MS. The QSRR approach was applied as the additional constraint in proteomic research verifying results of MS/MS ion search and confirming the correctness of the peptides identifications along with the indication of the potential false positives and false negatives. Results In both strains of B. subtilis, peptides characteristic for SASPs were found, however their identification confidence varied. According to the MS identity parameter Xcorr and difference between predicted and experimental retention times (ΔtR) four groups could be distinguished: correctly and incorrectly identified, potential false positives and false negatives. The ΔprpE strain was characterized by much higher amount of SASPs peptides than standard 168 and their identification confidence was, mostly for alpha- and beta-type SASP, satisfactory. Conclusions The QSRR-based model for predicting retention times of the peptides, was a useful additional to MS tool, enhancing protein identification. Higher content of SASPs in strain lacking PrpE phosphatase suggests that this enzyme may influence their occurrence in the spores, lowering levels of these proteins.
Collapse
Affiliation(s)
- Katarzyna Macur
- Department of Pharmaceutical Chemistry, Faculty of Pharmacy, Medical University of Gdańsk, Hallera 107, 80-416 Gdańsk, Poland.
| | | | | | | | | | | |
Collapse
|
50
|
Nesvizhskii AI. A survey of computational methods and error rate estimation procedures for peptide and protein identification in shotgun proteomics. J Proteomics 2010; 73:2092-123. [PMID: 20816881 DOI: 10.1016/j.jprot.2010.08.009] [Citation(s) in RCA: 370] [Impact Index Per Article: 26.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2010] [Revised: 08/25/2010] [Accepted: 08/25/2010] [Indexed: 12/18/2022]
Abstract
This manuscript provides a comprehensive review of the peptide and protein identification process using tandem mass spectrometry (MS/MS) data generated in shotgun proteomic experiments. The commonly used methods for assigning peptide sequences to MS/MS spectra are critically discussed and compared, from basic strategies to advanced multi-stage approaches. A particular attention is paid to the problem of false-positive identifications. Existing statistical approaches for assessing the significance of peptide to spectrum matches are surveyed, ranging from single-spectrum approaches such as expectation values to global error rate estimation procedures such as false discovery rates and posterior probabilities. The importance of using auxiliary discriminant information (mass accuracy, peptide separation coordinates, digestion properties, and etc.) is discussed, and advanced computational approaches for joint modeling of multiple sources of information are presented. This review also includes a detailed analysis of the issues affecting the interpretation of data at the protein level, including the amplification of error rates when going from peptide to protein level, and the ambiguities in inferring the identifies of sample proteins in the presence of shared peptides. Commonly used methods for computing protein-level confidence scores are discussed in detail. The review concludes with a discussion of several outstanding computational issues.
Collapse
|