1
|
Golba B, Zhong Z, Romio M, Almey R, Deforce D, Dhaenens M, Sanders NN, Benetti EM, De Geest BG. Cyclic Poly(2-methyl-2-oxazoline)-Lipid Conjugates Are Good Alternatives to Poly(ethylene glycol)-Lipids for Lipid Nanopartcile mRNA Formulation. Biomacromolecules 2025. [PMID: 39970327 DOI: 10.1021/acs.biomac.4c01587] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/21/2025]
Abstract
Lipid nanoparticles (LNPs) with ionizable cationic lipids have revolutionized RNA drug delivery, playing a key role in the success of mRNA-based therapeutics, such as COVID-19 vaccines. A vital component of these LNPs is the poly(ethylene glycol) (PEG)-lipid conjugate, which enhances colloidal stability but may trigger the production of anti-PEG antibodies, resulting in accelerated blood clearance (ABC) and diminished therapeutic efficacy. In this study, we explored poly(2-methyl-2-oxazoline) (PMOXA) as an alternative stabilizing agent for mRNA LNPs. We synthesized both cyclic and linear PMOXA, conjugated them to dialkyl lipids, and created lipid-polymer amphiphiles. We systematically evaluated how polymer topology influenced the physicochemical properties of LNPs, including in vitro cellular uptake, transfection efficiency, and protein corona formation, and directly compared these properties with those of PEG-stabilized counterparts. In vivo experiments in mice further assessed the biodistribution and protein translation efficiency of these LNPs following intravenous administration. Our results showed that cyclic PMOXA conjugates not only matched but potentially surpassed the performance of PEG-based analogues, highlighting their promise as a superior alternative in mRNA-LNP formulations.
Collapse
Affiliation(s)
- Bianka Golba
- Department of Pharmaceutics, Ghent University, Ottergemsesteenweg 460, Ghent 9000, Belgium
| | - Zifu Zhong
- Department of Pharmaceutics, Ghent University, Ottergemsesteenweg 460, Ghent 9000, Belgium
| | - Matteo Romio
- Department of Chemical Sciences (DiSC), Laboratory for Macromolecular and Organic Chemistry, University of Padova, Via F. Marzolo 1, Padova 35131, Italy
| | - Ruben Almey
- ProGenTomics, Laboratory of Pharmaceutical Biotechnology, Department of Pharmaceutics, Ghent University, Ottergemsesteenweg 460, Ghent 9000, Belgium
| | - Dieter Deforce
- ProGenTomics, Laboratory of Pharmaceutical Biotechnology, Department of Pharmaceutics, Ghent University, Ottergemsesteenweg 460, Ghent 9000, Belgium
| | - Maarten Dhaenens
- ProGenTomics, Laboratory of Pharmaceutical Biotechnology, Department of Pharmaceutics, Ghent University, Ottergemsesteenweg 460, Ghent 9000, Belgium
| | - Niek N Sanders
- Laboratory of Gene Therapy, Faculty of Veterinary Medicine, Ghent University, 9 Heidestraat 19, 820, Merelbeke, Ghent 9000, Belgium
| | - Edmondo M Benetti
- Department of Chemical Sciences (DiSC), Laboratory for Macromolecular and Organic Chemistry, University of Padova, Via F. Marzolo 1, Padova 35131, Italy
| | - Bruno G De Geest
- Department of Pharmaceutics, Ghent University, Ottergemsesteenweg 460, Ghent 9000, Belgium
| |
Collapse
|
2
|
Wolski WE, Grossmann J, Schwarz L, Leary P, Türker C, Nanni P, Schlapbach R, Panse C. prolfquapp ─ A User-Friendly Command-Line Tool Simplifying Differential Expression Analysis in Quantitative Proteomics. J Proteome Res 2025; 24:955-965. [PMID: 39849819 PMCID: PMC11812002 DOI: 10.1021/acs.jproteome.4c00911] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2024] [Revised: 12/09/2024] [Accepted: 01/14/2025] [Indexed: 01/25/2025]
Abstract
Mass spectrometry is a cornerstone of quantitative proteomics, enabling relative protein quantification and differential expression analysis (DEA) of proteins. As experiments grow in complexity, involving more samples, groups, and identified proteins, interactive differential expression analysis tools become impractical. The prolfquapp addresses this challenge by providing a command-line interface that simplifies DEA, making it accessible to nonprogrammers and seamlessly integrating it into workflow management systems. Prolfquapp streamlines data processing and result visualization by generating dynamic HTML reports that facilitate the exploration of differential expression results. These reports allow for investigating complex experiments, such as those involving repeated measurements or multiple explanatory variables. Additionally, prolfquapp supports various output formats, including XLSX files, SummarizedExperiment objects and rank files, for further interactive analysis using spreadsheet software, the exploreDE Shiny application, or gene set enrichment analysis software, respectively. By leveraging advanced statistical models from the prolfqua R package, prolfquapp offers a user-friendly, integrated solution for large-scale quantitative proteomics studies, combining efficient data processing with insightful, publication-ready outputs.
Collapse
Affiliation(s)
- Witold E. Wolski
- Functional
Genomics Center Zurich (FGCZ) - University of Zurich/ETH Zurich, Winterthurerstrasse 190, CH-8057 Zurich, Switzerland
- Swiss
Institute of Bioinformatics (SIB) Quartier Sorge - Batiment Amphipole, 1015 Lausanne, Switzerland
| | - Jonas Grossmann
- Functional
Genomics Center Zurich (FGCZ) - University of Zurich/ETH Zurich, Winterthurerstrasse 190, CH-8057 Zurich, Switzerland
- Swiss
Institute of Bioinformatics (SIB) Quartier Sorge - Batiment Amphipole, 1015 Lausanne, Switzerland
| | - Leonardo Schwarz
- Functional
Genomics Center Zurich (FGCZ) - University of Zurich/ETH Zurich, Winterthurerstrasse 190, CH-8057 Zurich, Switzerland
- Swiss
Institute of Bioinformatics (SIB) Quartier Sorge - Batiment Amphipole, 1015 Lausanne, Switzerland
| | - Peter Leary
- Functional
Genomics Center Zurich (FGCZ) - University of Zurich/ETH Zurich, Winterthurerstrasse 190, CH-8057 Zurich, Switzerland
- Swiss
Institute of Bioinformatics (SIB) Quartier Sorge - Batiment Amphipole, 1015 Lausanne, Switzerland
| | - Can Türker
- Functional
Genomics Center Zurich (FGCZ) - University of Zurich/ETH Zurich, Winterthurerstrasse 190, CH-8057 Zurich, Switzerland
| | - Paolo Nanni
- Functional
Genomics Center Zurich (FGCZ) - University of Zurich/ETH Zurich, Winterthurerstrasse 190, CH-8057 Zurich, Switzerland
| | - Ralph Schlapbach
- Functional
Genomics Center Zurich (FGCZ) - University of Zurich/ETH Zurich, Winterthurerstrasse 190, CH-8057 Zurich, Switzerland
| | - Christian Panse
- Functional
Genomics Center Zurich (FGCZ) - University of Zurich/ETH Zurich, Winterthurerstrasse 190, CH-8057 Zurich, Switzerland
- Swiss
Institute of Bioinformatics (SIB) Quartier Sorge - Batiment Amphipole, 1015 Lausanne, Switzerland
| |
Collapse
|
3
|
Rider MH, Vertommen D, Johanns M. How mass spectrometry can be exploited to study AMPK. Essays Biochem 2024; 68:283-294. [PMID: 39056150 DOI: 10.1042/ebc20240009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2024] [Revised: 07/12/2024] [Accepted: 07/17/2024] [Indexed: 07/28/2024]
Abstract
AMP-activated protein kinase (AMPK) is a key regulator of metabolism and a recognised target for the treatment of metabolic diseases such as Type 2 diabetes (T2D). Here, we review how mass spectrometry (MS) can be used to study short-term control by AMPK via protein phosphorylation and long-term control due to changes in protein expression. We discuss how MS can quantify AMPK subunit levels in tissues from different species. We propose hydrogen-deuterium exchange (HDX)-MS to investigate molecular mechanisms of AMPK activation and thermoproteomic profiling (TPP) to assess off-target effects of pharmacological AMPK activators/inhibitors. Lastly, because large MS data sets are generated, we consider different approaches that can be used for their interpretation.
Collapse
Affiliation(s)
- Mark H Rider
- Protein Phosphorylation (PHOS) laboratory, Université catholique de Louvain and de Duve Institute, Avenue Hippocrate 75, B-1200 Brussels, Belgium
| | - Didier Vertommen
- Protein Phosphorylation (PHOS) laboratory, Université catholique de Louvain and de Duve Institute, Avenue Hippocrate 75, B-1200 Brussels, Belgium
| | - Manuel Johanns
- Protein Phosphorylation (PHOS) laboratory, Université catholique de Louvain and de Duve Institute, Avenue Hippocrate 75, B-1200 Brussels, Belgium
| |
Collapse
|
4
|
Kohler D, Staniak M, Yu F, Nesvizhskii AI, Vitek O. An MSstats workflow for detecting differentially abundant proteins in large-scale data-independent acquisition mass spectrometry experiments with FragPipe processing. Nat Protoc 2024; 19:2915-2938. [PMID: 38769142 DOI: 10.1038/s41596-024-01000-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2023] [Accepted: 03/11/2024] [Indexed: 05/22/2024]
Abstract
Technological advances in mass spectrometry and proteomics have made it possible to perform larger-scale and more-complex experiments. The volume and complexity of the resulting data create major challenges for downstream analysis. In particular, next-generation data-independent acquisition (DIA) experiments enable wider proteome coverage than more traditional targeted approaches but require computational workflows that can manage much larger datasets and identify peptide sequences from complex and overlapping spectral features. Data-processing tools such as FragPipe, DIA-NN and Spectronaut have undergone substantial improvements to process spectral features in a reasonable time. Statistical analysis tools are needed to draw meaningful comparisons between experimental samples, but these tools were also originally designed with smaller datasets in mind. This protocol describes an updated version of MSstats that has been adapted to be compatible with large-scale DIA experiments. A very large DIA experiment, processed with FragPipe, is used as an example to demonstrate different MSstats workflows. The choice of workflow depends on the user's computational resources. For datasets that are too large to fit into a standard computer's memory, we demonstrate the use of MSstatsBig, a companion R package to MSstats. The protocol also highlights key decisions that have a major effect on both the results and the processing time of the analysis. The MSstats processing can be expected to take 1-3 h depending on the usage of MSstatsBig. The protocol can be run in the point-and-click graphical user interface MSstatsShiny or implemented with minimal coding expertise in R.
Collapse
Affiliation(s)
- Devon Kohler
- Khoury College of Computer Sciences, Northeastern University, Boston, MA, USA
- Barnett Institute for Chemical and Biological Analysis, Northeastern University, Boston, MA, USA
| | | | - Fengchao Yu
- Department of Pathology, University of Michigan, Ann Arbor, MI, USA
| | - Alexey I Nesvizhskii
- Department of Pathology, University of Michigan, Ann Arbor, MI, USA
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA
| | - Olga Vitek
- Khoury College of Computer Sciences, Northeastern University, Boston, MA, USA.
- Barnett Institute for Chemical and Biological Analysis, Northeastern University, Boston, MA, USA.
| |
Collapse
|
5
|
Tian P, Koudis NM, Morais MRPT, Pickard A, Fresquet M, Adamson A, Derby B, Lennon R. Collagen IV assembly is influenced by fluid flow in kidney cell-derived matrices. Cells Dev 2024; 179:203923. [PMID: 38670459 DOI: 10.1016/j.cdev.2024.203923] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2023] [Revised: 01/30/2024] [Accepted: 04/22/2024] [Indexed: 04/28/2024]
Abstract
Kidney podocytes and endothelial cells assemble a complex and dynamic basement membrane that is essential for kidney filtration. Whilst many components of this specialised matrix are known, the influence of fluid flow on its assembly and organisation remains poorly understood. Using the coculture of podocytes and glomerular endothelial cells in a low-shear stress, high-flow bioreactor, we investigated the effect of laminar fluid flow on the composition and assembly of cell-derived matrix. With immunofluorescence and matrix image analysis we found flow-mediated remodelling of collagen IV. Using proteomic analysis of the cell-derived matrix we identified changes in both abundance and composition of matrix proteins under flow, including the collagen-modifying enzyme, prolyl 4-hydroxylase (P4HA1). To track collagen IV assembly, we used CRISPR-Cas9 to knock in the luminescent marker HiBiT to the endogenous COL4A2 gene in podocytes. With this system, we found that collagen IV was secreted and accumulated consistently under both static and flow conditions. However knockdown of P4HA1 in podocytes led to a reduction in the secretion of collagen IV and this was more pronounced under flow. Together, this work demonstrates the effect of fluid flow on the composition, modification, and organisation of kidney cell-derived matrix and provides an in vitro system for investigating flow-induced matrix alteration in the context of kidney development and disease.
Collapse
Affiliation(s)
- Pinyuan Tian
- Wellcome Centre for Cell-Matrix Research, School of Biological Science, Faculty of Biology, Medicine and Health, University of Manchester, UK.
| | - Nikki-Maria Koudis
- Wellcome Centre for Cell-Matrix Research, School of Biological Science, Faculty of Biology, Medicine and Health, University of Manchester, UK
| | - Mychel R P T Morais
- Wellcome Centre for Cell-Matrix Research, School of Biological Science, Faculty of Biology, Medicine and Health, University of Manchester, UK.
| | - Adam Pickard
- Wellcome Centre for Cell-Matrix Research, School of Biological Science, Faculty of Biology, Medicine and Health, University of Manchester, UK
| | - Maryline Fresquet
- Wellcome Centre for Cell-Matrix Research, School of Biological Science, Faculty of Biology, Medicine and Health, University of Manchester, UK.
| | - Antony Adamson
- Genome Editing Unit Core Facility, Faculty of Biology, Medicine and Health, University of Manchester, UK.
| | - Brian Derby
- School of Materials, University of Manchester, UK.
| | - Rachel Lennon
- Wellcome Centre for Cell-Matrix Research, School of Biological Science, Faculty of Biology, Medicine and Health, University of Manchester, UK; Royal Manchester Children's Hospital, Manchester, UK.
| |
Collapse
|
6
|
Stockhammer A, Spalt C, Klemt A, Benz LS, Harel S, Natalia V, Wiench L, Freund C, Kuropka B, Bottanelli F. When less is more - a fast TurboID knock-in approach for high-sensitivity endogenous interactome mapping. J Cell Sci 2024; 137:jcs261952. [PMID: 39056144 PMCID: PMC11385326 DOI: 10.1242/jcs.261952] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2024] [Accepted: 07/10/2024] [Indexed: 07/28/2024] Open
Abstract
In recent years, proximity labeling has established itself as an unbiased and powerful approach to map the interactome of specific proteins. Although physiological expression of labeling enzymes is beneficial for the mapping of interactors, generation of the desired cell lines remains time-consuming and challenging. Using our established pipeline for rapid generation of C- and N-terminal CRISPR-Cas9 knock-ins (KIs) based on antibiotic selection, we were able to compare the performance of commonly used labeling enzymes when endogenously expressed. Endogenous tagging of the µ subunit of the adaptor protein (AP)-1 complex with TurboID allowed identification of known interactors and cargo proteins that simple overexpression of a labeling enzyme fusion protein could not reveal. We used the KI strategy to compare the interactome of the different AP complexes and clathrin and were able to assemble lists of potential interactors and cargo proteins that are specific for each sorting pathway. Our approach greatly simplifies the execution of proximity labeling experiments for proteins in their native cellular environment and allows going from CRISPR transfection to mass spectrometry analysis and interactome data in just over a month.
Collapse
Affiliation(s)
- Alexander Stockhammer
- Membrane Trafficking Laboratory, Institute for Chemistry and Biochemistry, Freie Universität Berlin, Thielallee 63, 14195 Berlin, Germany
| | - Carissa Spalt
- Membrane Trafficking Laboratory, Institute for Chemistry and Biochemistry, Freie Universität Berlin, Thielallee 63, 14195 Berlin, Germany
| | - Antonia Klemt
- Membrane Trafficking Laboratory, Institute for Chemistry and Biochemistry, Freie Universität Berlin, Thielallee 63, 14195 Berlin, Germany
| | - Laila S Benz
- Membrane Trafficking Laboratory, Institute for Chemistry and Biochemistry, Freie Universität Berlin, Thielallee 63, 14195 Berlin, Germany
| | - Shelly Harel
- Membrane Trafficking Laboratory, Institute for Chemistry and Biochemistry, Freie Universität Berlin, Thielallee 63, 14195 Berlin, Germany
| | - Vini Natalia
- Membrane Trafficking Laboratory, Institute for Chemistry and Biochemistry, Freie Universität Berlin, Thielallee 63, 14195 Berlin, Germany
| | - Lukas Wiench
- Membrane Trafficking Laboratory, Institute for Chemistry and Biochemistry, Freie Universität Berlin, Thielallee 63, 14195 Berlin, Germany
| | - Christian Freund
- Laboratory of Protein Biochemistry, Institute for Chemistry and Biochemistry, Freie Universität Berlin, Thielallee 63, 14195 Berlin, Germany
| | - Benno Kuropka
- Laboratory of Protein Biochemistry, Institute for Chemistry and Biochemistry, Freie Universität Berlin, Thielallee 63, 14195 Berlin, Germany
| | - Francesca Bottanelli
- Membrane Trafficking Laboratory, Institute for Chemistry and Biochemistry, Freie Universität Berlin, Thielallee 63, 14195 Berlin, Germany
| |
Collapse
|
7
|
Peng H, Wang H, Kong W, Li J, Goh WWB. Optimizing differential expression analysis for proteomics data via high-performing rules and ensemble inference. Nat Commun 2024; 15:3922. [PMID: 38724498 PMCID: PMC11082229 DOI: 10.1038/s41467-024-47899-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2023] [Accepted: 04/16/2024] [Indexed: 05/12/2024] Open
Abstract
Identification of differentially expressed proteins in a proteomics workflow typically encompasses five key steps: raw data quantification, expression matrix construction, matrix normalization, missing value imputation (MVI), and differential expression analysis. The plethora of options in each step makes it challenging to identify optimal workflows that maximize the identification of differentially expressed proteins. To identify optimal workflows and their common properties, we conduct an extensive study involving 34,576 combinatoric experiments on 24 gold standard spike-in datasets. Applying frequent pattern mining techniques to top-ranked workflows, we uncover high-performing rules that demonstrate optimality has conserved properties. Via machine learning, we confirm optimal workflows are indeed predictable, with average cross-validation F1 scores and Matthew's correlation coefficients surpassing 0.84. We introduce an ensemble inference to integrate results from individual top-performing workflows for expanding differential proteome coverage and resolve inconsistencies. Ensemble inference provides gains in pAUC (up to 4.61%) and G-mean (up to 11.14%) and facilitates effective aggregation of information across varied quantification approaches such as topN, directLFQ, MaxLFQ intensities, and spectral counts. However, further development and evaluation are needed to establish acceptable frameworks for conducting ensemble inference on multiple proteomics workflows.
Collapse
Affiliation(s)
- Hui Peng
- Lee Kong Chian School of Medicine, Nanyang Technological University, Singapore, Singapore
- School of Biological Sciences, Nanyang Technological University, Singapore, Singapore
| | - He Wang
- Lee Kong Chian School of Medicine, Nanyang Technological University, Singapore, Singapore
- School of Biological Sciences, Nanyang Technological University, Singapore, Singapore
| | - Weijia Kong
- Lee Kong Chian School of Medicine, Nanyang Technological University, Singapore, Singapore
- School of Biological Sciences, Nanyang Technological University, Singapore, Singapore
| | - Jinyan Li
- Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China.
| | - Wilson Wen Bin Goh
- Lee Kong Chian School of Medicine, Nanyang Technological University, Singapore, Singapore.
- School of Biological Sciences, Nanyang Technological University, Singapore, Singapore.
- Center for Biomedical Informatics, Nanyang Technological University, Singapore, Singapore.
- Center of AI in Medicine, Nanyang Technological University, Singapore, Singapore.
- Division of Neurology, Department of Brain Sciences, Faculty of Medicine, Imperial College London, London, UK.
| |
Collapse
|
8
|
Chew C, Brand OJ, Yamamura T, Lawless C, Morais MRPT, Zeef L, Lin IH, Howell G, Lui S, Lausecker F, Jagger C, Shaw TN, Krishnan S, McClure FA, Bridgeman H, Wemyss K, Konkel JE, Hussell T, Lennon R. Kidney resident macrophages have distinct subsets and multifunctional roles. Matrix Biol 2024; 127:23-37. [PMID: 38331051 DOI: 10.1016/j.matbio.2024.02.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2023] [Revised: 02/02/2024] [Accepted: 02/05/2024] [Indexed: 02/10/2024]
Abstract
BACKGROUND The kidney contains distinct glomerular and tubulointerstitial compartments with diverse cell types and extracellular matrix components. The role of immune cells in glomerular environment is crucial for dampening inflammation and maintaining homeostasis. Macrophages are innate immune cells that are influenced by their tissue microenvironment. However, the multifunctional role of kidney macrophages remains unclear. METHODS Flow and imaging cytometry were used to determine the relative expression of CD81 and CX3CR1 (C-X3-C motif chemokine receptor 1) in kidney macrophages. Monocyte replenishment was assessed in Cx3cr1CreER X R26-yfp-reporter and shielded chimeric mice. Bulk RNA-sequencing and mass spectrometry-based proteomics were performed on isolated kidney macrophages from wild type and Col4a5-/- (Alport) mice. RNAscope was used to visualize transcripts and macrophage purity in bulk RNA assessed by CIBERSORTx analyses. RESULTS In wild type mice we identified three distinct kidney macrophage subsets using CD81 and CX3CR1 and these subsets showed dependence on monocyte replenishment. In addition to their immune function, bulk RNA-sequencing of macrophages showed enrichment of biological processes associated with extracellular matrix. Proteomics identified collagen IV and laminins in kidney macrophages from wild type mice whilst other extracellular matrix proteins including cathepsins, ANXA2 and LAMP2 were enriched in Col4a5-/- (Alport) mice. A subset of kidney macrophages co-expressed matrix and macrophage transcripts. CONCLUSIONS We identified CD81 and CX3CR1 positive kidney macrophage subsets with distinct dependence for monocyte replenishment. Multiomic analysis demonstrated that these cells have diverse functions that underscore the importance of macrophages in kidney health and disease.
Collapse
Affiliation(s)
- Christine Chew
- Lydia Becker Institute for Immunology and Inflammation, Division of Infection, Immunity and Respiratory Medicine, School of Biological Sciences, Faculty of Biology Medicine and Health, The University of Manchester, Manchester M13 9PL, United Kingdom; Wellcome Centre for Cell-Matrix Research, Division of Cell-Matrix Biology and Regenerative Medicine, School of Biological Sciences, Faculty of Biology Medicine and Health, The University of Manchester, Manchester Academic Health Science Centre, Manchester M13 9PT, United Kingdom
| | - Oliver J Brand
- Lydia Becker Institute for Immunology and Inflammation, Division of Infection, Immunity and Respiratory Medicine, School of Biological Sciences, Faculty of Biology Medicine and Health, The University of Manchester, Manchester M13 9PL, United Kingdom
| | - Tomohiko Yamamura
- Wellcome Centre for Cell-Matrix Research, Division of Cell-Matrix Biology and Regenerative Medicine, School of Biological Sciences, Faculty of Biology Medicine and Health, The University of Manchester, Manchester Academic Health Science Centre, Manchester M13 9PT, United Kingdom
| | - Craig Lawless
- Wellcome Centre for Cell-Matrix Research, Division of Cell-Matrix Biology and Regenerative Medicine, School of Biological Sciences, Faculty of Biology Medicine and Health, The University of Manchester, Manchester Academic Health Science Centre, Manchester M13 9PT, United Kingdom
| | - Mychel Raony Paiva Teixeira Morais
- Wellcome Centre for Cell-Matrix Research, Division of Cell-Matrix Biology and Regenerative Medicine, School of Biological Sciences, Faculty of Biology Medicine and Health, The University of Manchester, Manchester Academic Health Science Centre, Manchester M13 9PT, United Kingdom
| | - Leo Zeef
- Bioinformatics Core Facility, Faculty of Biology Medicine and Health, The University of Manchester, Manchester M13 9PL, United Kingdom
| | - I-Hsuan Lin
- Bioinformatics Core Facility, Faculty of Biology Medicine and Health, The University of Manchester, Manchester M13 9PL, United Kingdom
| | - Gareth Howell
- Lydia Becker Institute for Immunology and Inflammation, Division of Infection, Immunity and Respiratory Medicine, School of Biological Sciences, Faculty of Biology Medicine and Health, The University of Manchester, Manchester M13 9PL, United Kingdom
| | - Sylvia Lui
- Lydia Becker Institute for Immunology and Inflammation, Division of Infection, Immunity and Respiratory Medicine, School of Biological Sciences, Faculty of Biology Medicine and Health, The University of Manchester, Manchester M13 9PL, United Kingdom
| | - Franziska Lausecker
- Wellcome Centre for Cell-Matrix Research, Division of Cell-Matrix Biology and Regenerative Medicine, School of Biological Sciences, Faculty of Biology Medicine and Health, The University of Manchester, Manchester Academic Health Science Centre, Manchester M13 9PT, United Kingdom
| | - Christopher Jagger
- Lydia Becker Institute for Immunology and Inflammation, Division of Infection, Immunity and Respiratory Medicine, School of Biological Sciences, Faculty of Biology Medicine and Health, The University of Manchester, Manchester M13 9PL, United Kingdom
| | - Tovah N Shaw
- Institute of Immunology and Infection Research, School of Biological Sciences, University of Edinburgh, Ashworth Laboratories, Edinburgh EH9 3FL, United Kingdom
| | - Siddharth Krishnan
- Lydia Becker Institute for Immunology and Inflammation, Division of Infection, Immunity and Respiratory Medicine, School of Biological Sciences, Faculty of Biology Medicine and Health, The University of Manchester, Manchester M13 9PL, United Kingdom
| | - Flora A McClure
- Lydia Becker Institute for Immunology and Inflammation, Division of Infection, Immunity and Respiratory Medicine, School of Biological Sciences, Faculty of Biology Medicine and Health, The University of Manchester, Manchester M13 9PL, United Kingdom
| | - Hayley Bridgeman
- Lydia Becker Institute for Immunology and Inflammation, Division of Infection, Immunity and Respiratory Medicine, School of Biological Sciences, Faculty of Biology Medicine and Health, The University of Manchester, Manchester M13 9PL, United Kingdom
| | - Kelly Wemyss
- Lydia Becker Institute for Immunology and Inflammation, Division of Infection, Immunity and Respiratory Medicine, School of Biological Sciences, Faculty of Biology Medicine and Health, The University of Manchester, Manchester M13 9PL, United Kingdom
| | - Joanne E Konkel
- Lydia Becker Institute for Immunology and Inflammation, Division of Infection, Immunity and Respiratory Medicine, School of Biological Sciences, Faculty of Biology Medicine and Health, The University of Manchester, Manchester M13 9PL, United Kingdom
| | - Tracy Hussell
- Lydia Becker Institute for Immunology and Inflammation, Division of Infection, Immunity and Respiratory Medicine, School of Biological Sciences, Faculty of Biology Medicine and Health, The University of Manchester, Manchester M13 9PL, United Kingdom.
| | - Rachel Lennon
- Wellcome Centre for Cell-Matrix Research, Division of Cell-Matrix Biology and Regenerative Medicine, School of Biological Sciences, Faculty of Biology Medicine and Health, The University of Manchester, Manchester Academic Health Science Centre, Manchester M13 9PT, United Kingdom; Department of Paediatric Nephrology, Royal Manchester Children's Hospital, Manchester University Hospitals NHS Foundation Trust, Manchester Academic Health Science Centre, Manchester M13 9WL, United Kingdom.
| |
Collapse
|
9
|
Demeulemeester N, Gébelin M, Caldi Gomes L, Lingor P, Carapito C, Martens L, Clement L. msqrob2PTM: Differential Abundance and Differential Usage Analysis of MS-Based Proteomics Data at the Posttranslational Modification and Peptidoform Level. Mol Cell Proteomics 2024; 23:100708. [PMID: 38154689 PMCID: PMC10875266 DOI: 10.1016/j.mcpro.2023.100708] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2023] [Revised: 12/19/2023] [Accepted: 12/24/2023] [Indexed: 12/30/2023] Open
Abstract
In the era of open-modification search engines, more posttranslational modifications than ever can be detected by LC-MS/MS-based proteomics. This development can switch proteomics research into a higher gear, as PTMs are key in many cellular pathways important in cell proliferation, migration, metastasis, and aging. However, despite these advances in modification identification, statistical methods for PTM-level quantification and differential analysis have yet to catch up. This absence can partly be explained by statistical challenges inherent to the data, such as the confounding of PTM intensities with its parent protein abundance. Therefore, we have developed msqrob2PTM, a new workflow in the msqrob2 universe capable of differential abundance analysis at the PTM and at the peptidoform level. The latter is important for validating PTMs found as significantly differential. Indeed, as our method can deal with multiple PTMs per peptidoform, there is a possibility that significant PTMs stem from one significant peptidoform carrying another PTM, hinting that it might be the other PTM driving the perceived differential abundance. Our workflows can flag both differential peptidoform abundance (DPA) and differential peptidoform usage (DPU). This enables a distinction between direct assessment of differential abundance of peptidoforms (DPA) and differences in the relative usage of peptidoforms corrected for corresponding protein abundances (DPU). For DPA, we directly model the log2-transformed peptidoform intensities, while for DPU, we correct for parent protein abundance by an intermediate normalization step which calculates the log2-ratio of the peptidoform intensities to their summarized parent protein intensities. We demonstrated the utility and performance of msqrob2PTM by applying it to datasets with known ground truth, as well as to biological PTM-rich datasets. Our results show that msqrob2PTM is on par with, or surpassing the performance of, the current state-of-the-art methods. Moreover, msqrob2PTM is currently unique in providing output at the peptidoform level.
Collapse
Affiliation(s)
- Nina Demeulemeester
- VIB-UGent Center for Medical Biotechnology, VIB, Ghent, Belgium; Department of Biomolecular Medicine, Ghent University, Ghent, Belgium; Department of Applied Mathematics, Computer Science and Statistics, Ghent University, Ghent, Belgium
| | - Marie Gébelin
- Laboratoire de Spectrométrie de Masse BioOrganique, IPHC UMR 7178, CNRS, Infrastructure Nationale de Protéomique ProFI - FR2048, Université de Strasbourg, Strasbourg, France
| | - Lucas Caldi Gomes
- Department of Neurology, Technical University Munich, Munich, Germany
| | - Paul Lingor
- Department of Neurology, Technical University Munich, Munich, Germany
| | - Christine Carapito
- Laboratoire de Spectrométrie de Masse BioOrganique, IPHC UMR 7178, CNRS, Infrastructure Nationale de Protéomique ProFI - FR2048, Université de Strasbourg, Strasbourg, France
| | - Lennart Martens
- VIB-UGent Center for Medical Biotechnology, VIB, Ghent, Belgium; Department of Biomolecular Medicine, Ghent University, Ghent, Belgium
| | - Lieven Clement
- Department of Applied Mathematics, Computer Science and Statistics, Ghent University, Ghent, Belgium.
| |
Collapse
|
10
|
Declercq A, Demeulemeester N, Gabriels R, Bouwmeester R, Degroeve S, Martens L. Bioinformatics Pipeline for Processing Single-Cell Data. Methods Mol Biol 2024; 2817:221-239. [PMID: 38907156 DOI: 10.1007/978-1-0716-3934-4_15] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/23/2024]
Abstract
Single-cell proteomics can offer valuable insights into dynamic cellular interactions, but identifying proteins at this level is challenging due to their low abundance. In this chapter, we present a state-of-the-art bioinformatics pipeline for single-cell proteomics that combines the search engine Sage (via SearchGUI), identification rescoring with MS2Rescore, quantification through FlashLFQ, and differential expression analysis using MSqRob2. MS2Rescore leverages LC-MS/MS behavior predictors, such as MS2PIP and DeepLC, to recalibrate scores with Percolator or mokapot. Combining these tools into a unified pipeline, this approach improves the detection of low-abundance peptides, resulting in increased identifications while maintaining stringent FDR thresholds.
Collapse
Affiliation(s)
- Arthur Declercq
- VIB-UGent Center for Medical Biotechnology, VIB, Ghent, Belgium
- Department of Biomolecular Medicine, Ghent University, Ghent, Belgium
| | - Nina Demeulemeester
- VIB-UGent Center for Medical Biotechnology, VIB, Ghent, Belgium
- Department of Biomolecular Medicine, Ghent University, Ghent, Belgium
- StatOmics, Department of Applied Mathematics, Computer Science and Statistics, Ghent University, Ghent, Belgium
| | - Ralf Gabriels
- VIB-UGent Center for Medical Biotechnology, VIB, Ghent, Belgium
- Department of Biomolecular Medicine, Ghent University, Ghent, Belgium
| | - Robbin Bouwmeester
- VIB-UGent Center for Medical Biotechnology, VIB, Ghent, Belgium
- Department of Biomolecular Medicine, Ghent University, Ghent, Belgium
| | - Sven Degroeve
- VIB-UGent Center for Medical Biotechnology, VIB, Ghent, Belgium
- Department of Biomolecular Medicine, Ghent University, Ghent, Belgium
| | - Lennart Martens
- VIB-UGent Center for Medical Biotechnology, VIB, Ghent, Belgium.
- Department of Biomolecular Medicine, Ghent University, Ghent, Belgium.
| |
Collapse
|
11
|
Grégoire S, Vanderaa C, Dit Ruys SP, Kune C, Mazzucchelli G, Vertommen D, Gatto L. Standardized Workflow for Mass-Spectrometry-Based Single-Cell Proteomics Data Processing and Analysis Using the scp Package. Methods Mol Biol 2024; 2817:177-220. [PMID: 38907155 DOI: 10.1007/978-1-0716-3934-4_14] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/23/2024]
Abstract
Mass-spectrometry (MS)-based single-cell proteomics (SCP) explores cellular heterogeneity by focusing on the functional effectors of the cells-proteins. However, extracting meaningful biological information from MS data is far from trivial, especially with single cells. Currently, data analysis workflows are substantially different from one research team to another. Moreover, it is difficult to evaluate pipelines as ground truths are missing. Our team has developed the R/Bioconductor package called scp to provide a standardized framework for SCP data analysis. It relies on the widely used QFeatures and SingleCellExperiment data structures. In addition, we used a design containing cell lines mixed in known proportions to generate controlled variability for data analysis benchmarking. In this chapter, we provide a flexible data analysis protocol for SCP data using the scp package together with comprehensive explanations at each step of the processing. Our main steps are quality control on the feature and cell level, aggregation of the raw data into peptides and proteins, normalization, and batch correction. We validate our workflow using our ground truth data set. We illustrate how to use this modular, standardized framework and highlight some crucial steps.
Collapse
Affiliation(s)
- Samuel Grégoire
- Computational Biology and Bioinformatics Unit, de Duve Institute, UCLouvain, Brussels, Belgium
| | - Christophe Vanderaa
- Computational Biology and Bioinformatics Unit, de Duve Institute, UCLouvain, Brussels, Belgium
| | | | - Christopher Kune
- Laboratory of Mass Spectrometry, MolSys Research Unit, University of Liège, Liège, Belgium
| | - Gabriel Mazzucchelli
- Laboratory of Mass Spectrometry, MolSys Research Unit, University of Liège, Liège, Belgium
- GIGA Proteomics Facility, University of Liège, Liège, Belgium
| | - Didier Vertommen
- Protein Phosphorylation Unit, de Duve Institute, UCLouvain, Brussels, Belgium
| | - Laurent Gatto
- Computational Biology and Bioinformatics Unit, de Duve Institute, UCLouvain, Brussels, Belgium.
| |
Collapse
|
12
|
Claeys T, Van Den Bossche T, Perez-Riverol Y, Gevaert K, Vizcaíno JA, Martens L. lesSDRF is more: maximizing the value of proteomics data through streamlined metadata annotation. Nat Commun 2023; 14:6743. [PMID: 37875519 PMCID: PMC10598006 DOI: 10.1038/s41467-023-42543-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2023] [Accepted: 10/13/2023] [Indexed: 10/26/2023] Open
Abstract
Public proteomics data often lack essential metadata, limiting its potential. To address this, we present lesSDRF, a tool to simplify the process of metadata annotation, thereby ensuring that data leave a lasting, impactful legacy well beyond its initial publication.
Collapse
Affiliation(s)
- Tine Claeys
- VIB-UGent Center for Medical Biotechnology, VIB, 9000, Ghent, Belgium
- Department of Biomolecular Medicine, Ghent University, 9000, Ghent, Belgium
| | - Tim Van Den Bossche
- VIB-UGent Center for Medical Biotechnology, VIB, 9000, Ghent, Belgium
- Department of Biomolecular Medicine, Ghent University, 9000, Ghent, Belgium
| | - Yasset Perez-Riverol
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Cambridge, CB10 1SD, UK
| | - Kris Gevaert
- VIB-UGent Center for Medical Biotechnology, VIB, 9000, Ghent, Belgium
- Department of Biomolecular Medicine, Ghent University, 9000, Ghent, Belgium
| | - Juan Antonio Vizcaíno
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Cambridge, CB10 1SD, UK.
| | - Lennart Martens
- VIB-UGent Center for Medical Biotechnology, VIB, 9000, Ghent, Belgium.
- Department of Biomolecular Medicine, Ghent University, 9000, Ghent, Belgium.
| |
Collapse
|
13
|
Hutchings C, Dawson CS, Krueger T, Lilley KS, Breckels LM. A Bioconductor workflow for processing, evaluating, and interpreting expression proteomics data. F1000Res 2023; 12:1402. [PMID: 38021401 PMCID: PMC10683783 DOI: 10.12688/f1000research.139116.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 09/15/2023] [Indexed: 12/01/2023] Open
Abstract
Background: Expression proteomics involves the global evaluation of protein abundances within a system. In turn, differential expression analysis can be used to investigate changes in protein abundance upon perturbation to such a system. Methods: Here, we provide a workflow for the processing, analysis and interpretation of quantitative mass spectrometry-based expression proteomics data. This workflow utilizes open-source R software packages from the Bioconductor project and guides users end-to-end and step-by-step through every stage of the analyses. As a use-case we generated expression proteomics data from HEK293 cells with and without a treatment. Of note, the experiment included cellular proteins labelled using tandem mass tag (TMT) technology and secreted proteins quantified using label-free quantitation (LFQ). Results: The workflow explains the software infrastructure before focusing on data import, pre-processing and quality control. This is done individually for TMT and LFQ datasets. The application of statistical differential expression analysis is demonstrated, followed by interpretation via gene ontology enrichment analysis. Conclusions: A comprehensive workflow for the processing, analysis and interpretation of expression proteomics is presented. The workflow is a valuable resource for the proteomics community and specifically beginners who are at least familiar with R who wish to understand and make data-driven decisions with regards to their analyses.
Collapse
Affiliation(s)
- Charlotte Hutchings
- Cambridge Centre for Proteomics, University of Cambridge, Cambridge, CB2 1QR, UK
| | - Charlotte S. Dawson
- Cambridge Centre for Proteomics, University of Cambridge, Cambridge, CB2 1QR, UK
| | - Thomas Krueger
- Department of Biochemistry, University of Cambridge, Cambridge, CB2 1QR, UK
| | - Kathryn S. Lilley
- Cambridge Centre for Proteomics, University of Cambridge, Cambridge, CB2 1QR, UK
| | - Lisa M. Breckels
- Cambridge Centre for Proteomics, University of Cambridge, Cambridge, CB2 1QR, UK
| |
Collapse
|
14
|
Abstract
Missing values are a notable challenge when analyzing mass spectrometry-based proteomics data. While the field is still actively debating the best practices, the challenge increased with the emergence of mass spectrometry-based single-cell proteomics and the dramatic increase in missing values. A popular approach to deal with missing values is to perform imputation. Imputation has several drawbacks for which alternatives exist, but currently, imputation is still a practical solution widely adopted in single-cell proteomics data analysis. This perspective discusses the advantages and drawbacks of imputation. We also highlight 5 main challenges linked to missing value management in single-cell proteomics. Future developments should aim to solve these challenges, whether it is through imputation or data modeling. The perspective concludes with recommendations for reporting missing values, for reporting methods that deal with missing values, and for proper encoding of missing values.
Collapse
Affiliation(s)
- Christophe Vanderaa
- Computational Biology and Bioinformatics Unit (CBIO), de Duve Institute, UCLouvain, 1200 Brussels, Belgium
| | - Laurent Gatto
- Computational Biology and Bioinformatics Unit (CBIO), de Duve Institute, UCLouvain, 1200 Brussels, Belgium
| |
Collapse
|
15
|
Correia CM, Præstholm SM, Havelund JF, Pedersen FB, Siersbæk MS, Ebbesen MF, Gerhart-Hines Z, Heeren J, Brewer J, Larsen S, Blagoev B, Færgeman NJ, Grøntved L. Acute Deletion of the Glucocorticoid Receptor in Hepatocytes Disrupts Postprandial Lipid Metabolism in Male Mice. Endocrinology 2023; 164:bqad128. [PMID: 37610219 DOI: 10.1210/endocr/bqad128] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/17/2023] [Revised: 07/09/2023] [Accepted: 08/21/2023] [Indexed: 08/24/2023]
Abstract
Hepatic lipid metabolism is highly dynamic, and disruption of several circadian transcriptional regulators results in hepatic steatosis. This includes genetic disruption of the glucocorticoid receptor (GR) as the liver develops. To address the functional role of GR in the adult liver, we used an acute hepatocyte-specific GR knockout model to study temporal hepatic lipid metabolism governed by GR at several preprandial and postprandial circadian timepoints. Lipidomics analysis revealed significant temporal lipid metabolism, where GR disruption results in impaired regulation of specific triglycerides, nonesterified fatty acids, and sphingolipids. This correlates with increased number and size of lipid droplets and mildly reduced mitochondrial respiration, most noticeably in the postprandial phase. Proteomics and transcriptomics analyses suggest that dysregulated lipid metabolism originates from pronounced induced expression of enzymes involved in fatty acid synthesis, β-oxidation, and sphingolipid metabolism. Integration of GR cistromic data suggests that induced gene expression is a result of regulatory actions secondary to direct GR effects on gene transcription.
Collapse
Affiliation(s)
- Catarina Mendes Correia
- Department of Biochemistry and Molecular Biology, University of Southern Denmark, 5230 Odense, Denmark
| | - Stine Marie Præstholm
- Department of Biochemistry and Molecular Biology, University of Southern Denmark, 5230 Odense, Denmark
| | - Jesper Foged Havelund
- Department of Biochemistry and Molecular Biology, University of Southern Denmark, 5230 Odense, Denmark
| | - Felix Boel Pedersen
- Department of Biochemistry and Molecular Biology, University of Southern Denmark, 5230 Odense, Denmark
| | - Majken Storm Siersbæk
- Department of Biochemistry and Molecular Biology, University of Southern Denmark, 5230 Odense, Denmark
| | - Morten Frendø Ebbesen
- DaMBIC, Department of Biochemistry and Molecular Biology, University of Southern Denmark, 5230 Odense, Denmark
| | - Zach Gerhart-Hines
- Novo Nordisk Foundation Center for Basic Metabolic Research (CBMR), Department of Biomedical Sciences, University of Copenhagen, 2200 Copenhagen, Denmark
| | - Joerg Heeren
- Department of Biochemistry and Molecular Cell Biology, University Medical Center Hamburg-Eppendorf, Hamburg, Germany
| | - Jonathan Brewer
- DaMBIC, Department of Biochemistry and Molecular Biology, University of Southern Denmark, 5230 Odense, Denmark
| | - Steen Larsen
- Xlab, Department of Biomedical Sciences, University of Copenhagen, 2200 Copenhagen, Denmark
| | - Blagoy Blagoev
- Department of Biochemistry and Molecular Biology, University of Southern Denmark, 5230 Odense, Denmark
| | - Nils Joakim Færgeman
- Department of Biochemistry and Molecular Biology, University of Southern Denmark, 5230 Odense, Denmark
| | - Lars Grøntved
- Department of Biochemistry and Molecular Biology, University of Southern Denmark, 5230 Odense, Denmark
| |
Collapse
|
16
|
Muhandiram S, Dissanayake K, Orro T, Godakumara K, Kodithuwakku S, Fazeli A. Secretory Proteomic Responses of Endometrial Epithelial Cells to Trophoblast-Derived Extracellular Vesicles. Int J Mol Sci 2023; 24:11924. [PMID: 37569298 PMCID: PMC10418763 DOI: 10.3390/ijms241511924] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2023] [Revised: 07/14/2023] [Accepted: 07/18/2023] [Indexed: 08/13/2023] Open
Abstract
Synchronized crosstalk between the embryo and endometrium during the periconception period is integral to pregnancy establishment. Increasing evidence suggests that the exchange of extracellular vesicles (EVs) of both embryonic and endometrial origin is a critical component of embryo-maternal communication during peri-implantation. Here, we investigated whether embryonic signals in the form of EVs can modulate the endometrial epithelial cell secretome. Receptive endometrial analog RL95-2 cells were supplemented with trophoblast analog JAr cell-derived EVs, and the secretory protein changes occurring in the RL95-2 cells were analyzed using mass spectrometry. EVs of non-trophoblastic origin (HEK 293 cells) were used as the control EV source to supplement endometrial cells. Trophoblast cell-derived EVs enriched endometrial epithelial cell secretions with proteins that support embryo development, attachment, or implantation, whereas control EVs were unable to induce the same effect. The present study suggests that embryonic signals in the form of EVs may prime receptive endometrial epithelial cells to enrich their secretory proteome with critical proteomic molecules with functional importance for periconception milieu formation.
Collapse
Affiliation(s)
- Subhashini Muhandiram
- Institute of Veterinary Medicine and Animal Sciences, Estonian University of Life Sciences, Kreutzwaldi 62, 51006 Tartu, Estonia; (S.M.); (K.D.); (T.O.); (K.G.); (S.K.)
| | - Keerthie Dissanayake
- Institute of Veterinary Medicine and Animal Sciences, Estonian University of Life Sciences, Kreutzwaldi 62, 51006 Tartu, Estonia; (S.M.); (K.D.); (T.O.); (K.G.); (S.K.)
- Department of Pathophysiology, Institute of Biomedicine and Translational Medicine, University of Tartu, Ravila St. 14B, 50411 Tartu, Estonia
- Department of Anatomy, Faculty of Medicine, University of Peradeniya, Kandy 20400, Sri Lanka
| | - Toomos Orro
- Institute of Veterinary Medicine and Animal Sciences, Estonian University of Life Sciences, Kreutzwaldi 62, 51006 Tartu, Estonia; (S.M.); (K.D.); (T.O.); (K.G.); (S.K.)
| | - Kasun Godakumara
- Institute of Veterinary Medicine and Animal Sciences, Estonian University of Life Sciences, Kreutzwaldi 62, 51006 Tartu, Estonia; (S.M.); (K.D.); (T.O.); (K.G.); (S.K.)
| | - Suranga Kodithuwakku
- Institute of Veterinary Medicine and Animal Sciences, Estonian University of Life Sciences, Kreutzwaldi 62, 51006 Tartu, Estonia; (S.M.); (K.D.); (T.O.); (K.G.); (S.K.)
- Department of Animal Science, Faculty of Agriculture, University of Peradeniya, Kandy 20400, Sri Lanka
| | - Alireza Fazeli
- Institute of Veterinary Medicine and Animal Sciences, Estonian University of Life Sciences, Kreutzwaldi 62, 51006 Tartu, Estonia; (S.M.); (K.D.); (T.O.); (K.G.); (S.K.)
- Department of Pathophysiology, Institute of Biomedicine and Translational Medicine, University of Tartu, Ravila St. 14B, 50411 Tartu, Estonia
- Academic Unit of Reproductive and Developmental Medicine, Department of Oncology and Metabolism, Medical School, University of Sheffield, Sheffield S10 2TN, UK
| |
Collapse
|
17
|
Jones J, MacKrell EJ, Wang TY, Lomenick B, Roukes ML, Chou TF. Tidyproteomics: an open-source R package and data object for quantitative proteomics post analysis and visualization. BMC Bioinformatics 2023; 24:239. [PMID: 37280522 DOI: 10.1186/s12859-023-05360-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2023] [Accepted: 05/25/2023] [Indexed: 06/08/2023] Open
Abstract
BACKGROUND The analysis of mass spectrometry-based quantitative proteomics data can be challenging given the variety of established analysis platforms, the differences in reporting formats, and a general lack of approachable standardized post-processing analyses such as sample group statistics, quantitative variation and even data filtering. We developed tidyproteomics to facilitate basic analysis, improve data interoperability and potentially ease the integration of new processing algorithms, mainly through the use of a simplified data-object. RESULTS The R package tidyproteomics was developed as both a framework for standardizing quantitative proteomics data and a platform for analysis workflows, containing discrete functions that can be connected end-to-end, thus making it easier to define complex analyses by breaking them into small stepwise units. Additionally, as with any analysis workflow, choices made during analysis can have large impacts on the results and as such, tidyproteomics allows researchers to string each function together in any order, select from a variety of options and in some cases develop and incorporate custom algorithms. CONCLUSIONS Tidyproteomics aims to simplify data exploration from multiple platforms, provide control over individual functions and analysis order, and serve as a tool to assemble complex repeatable processing workflows in a logical flow. Datasets in tidyproteomics are easy to work with, have a structure that allows for biological annotations to be added, and come with a framework for developing additional analysis tools. The consistent data structure and accessible analysis and plotting tools also offers a way for researchers to save time on mundane data manipulation tasks.
Collapse
Affiliation(s)
- Jeff Jones
- Proteome Exploration Laboratory, Beckman Institute, California Institute of Technology, Pasadena, CA, 91125, USA.
- Division of Physics, Mathematics and Astronomy, California Institute of Technology, 1200 East California Boulevard, Pasadena, CA, 91125, USA.
| | - Elliot J MacKrell
- Division of Chemistry and Chemical Engineering, California Institute of Technology, 1200 East California Boulevard, Pasadena, CA, 91125, USA
| | - Ting-Yu Wang
- Proteome Exploration Laboratory, Beckman Institute, California Institute of Technology, Pasadena, CA, 91125, USA
| | - Brett Lomenick
- Proteome Exploration Laboratory, Beckman Institute, California Institute of Technology, Pasadena, CA, 91125, USA
| | - Michael L Roukes
- Division of Physics, Mathematics and Astronomy, California Institute of Technology, 1200 East California Boulevard, Pasadena, CA, 91125, USA
| | - Tsui-Fen Chou
- Proteome Exploration Laboratory, Beckman Institute, California Institute of Technology, Pasadena, CA, 91125, USA
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA, 91125, USA
| |
Collapse
|
18
|
Bai M, Deng J, Dai C, Pfeuffer J, Sachsenberg T, Perez-Riverol Y. LFQ-Based Peptide and Protein Intensity Differential Expression Analysis. J Proteome Res 2023. [PMID: 37220883 DOI: 10.1021/acs.jproteome.2c00812] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/25/2023]
Abstract
Testing for significant differences in quantities at the protein level is a common goal of many LFQ-based mass spectrometry proteomics experiments. Starting from a table of protein and/or peptide quantities from a given proteomics quantification software, many tools and R packages exist to perform the final tasks of imputation, summarization, normalization, and statistical testing. To evaluate the effects of packages and settings in their substeps on the final list of significant proteins, we studied several packages on three public data sets with known expected protein fold changes. We found that the results between packages and even across different parameters of the same package can vary significantly. In addition to usability aspects and feature/compatibility lists of different packages, this paper highlights sensitivity and specificity trade-offs that come with specific packages and settings.
Collapse
Affiliation(s)
- Mingze Bai
- Chongqing Key Laboratory of Big Data for Bio Intelligence, Chongqing University of Posts and Telecommunications, Chongqing 400065, China
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Beijing Institute of Life Omics, Beijing 102206, China
| | - Jingwen Deng
- Chongqing Key Laboratory of Big Data for Bio Intelligence, Chongqing University of Posts and Telecommunications, Chongqing 400065, China
| | - Chengxin Dai
- Chongqing Key Laboratory of Big Data for Bio Intelligence, Chongqing University of Posts and Telecommunications, Chongqing 400065, China
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Beijing Institute of Life Omics, Beijing 102206, China
| | - Julianus Pfeuffer
- Algorithmic Bioinformatics, Freie Universität Berlin, Berlin 14195, Germany
- Visualization and Data Analysis, Zuse Institute Berlin, Berlin 14195, Germany
| | - Timo Sachsenberg
- Institute for Bioinformatics and Medical Informatics, University of Tübingen, Tübingen 72076, Germany
| | - Yasset Perez-Riverol
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hixton, Cambridge CB10 1SD, United Kingdom
| |
Collapse
|
19
|
Kohler D, Staniak M, Tsai TH, Huang T, Shulman N, Bernhardt OM, MacLean BX, Nesvizhskii AI, Reiter L, Sabido E, Choi M, Vitek O. MSstats Version 4.0: Statistical Analyses of Quantitative Mass Spectrometry-Based Proteomic Experiments with Chromatography-Based Quantification at Scale. J Proteome Res 2023; 22:1466-1482. [PMID: 37018319 PMCID: PMC10629259 DOI: 10.1021/acs.jproteome.2c00834] [Citation(s) in RCA: 32] [Impact Index Per Article: 16.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2022] [Indexed: 04/06/2023]
Abstract
The MSstats R-Bioconductor family of packages is widely used for statistical analyses of quantitative bottom-up mass spectrometry-based proteomic experiments to detect differentially abundant proteins. It is applicable to a variety of experimental designs and data acquisition strategies and is compatible with many data processing tools used to identify and quantify spectral features. In the face of ever-increasing complexities of experiments and data processing strategies, the core package of the family, with the same name MSstats, has undergone a series of substantial updates. Its new version MSstats v4.0 improves the usability, versatility, and accuracy of statistical methodology, and the usage of computational resources. New converters integrate the output of upstream processing tools directly with MSstats, requiring less manual work by the user. The package's statistical models have been updated to a more robust workflow. Finally, MSstats' code has been substantially refactored to improve memory use and computation speed. Here we detail these updates, highlighting methodological differences between the new and old versions. An empirical comparison of MSstats v4.0 to its previous implementations, as well as to the packages MSqRob and DEqMS, on controlled mixtures and biological experiments demonstrated a stronger performance and better usability of MSstats v4.0 as compared to existing methods.
Collapse
Affiliation(s)
- Devon Kohler
- Khoury College
of Computer Science, Northeastern University, Boston, Massachusetts 02115, United States
| | | | - Tsung-Heng Tsai
- Khoury College
of Computer Science, Northeastern University, Boston, Massachusetts 02115, United States
| | - Ting Huang
- Khoury College
of Computer Science, Northeastern University, Boston, Massachusetts 02115, United States
| | - Nicholas Shulman
- Department
of Genome Sciences, University of Washington, Seattle, Washington 98195, United States
| | | | - Brendan X. MacLean
- Department
of Genome Sciences, University of Washington, Seattle, Washington 98195, United States
| | - Alexey I. Nesvizhskii
- Department
of Pathology and Computational Medicine & Bioinformatics, University of Michigan, Ann Arbor, Michigan 48109, United States
| | | | - Eduard Sabido
- Center for
Genomic Regulation, Barcelona Institute
of Science and Technology, Barcelona 08003, Spain
- Universitat
Pompeu Fabra, Barcelona 08002, Spain
| | - Meena Choi
- Khoury College
of Computer Science, Northeastern University, Boston, Massachusetts 02115, United States
| | - Olga Vitek
- Khoury College
of Computer Science, Northeastern University, Boston, Massachusetts 02115, United States
| |
Collapse
|
20
|
Prokai L, Zaman K, Prokai-Tatrai K. Mass spectrometry-based retina proteomics. MASS SPECTROMETRY REVIEWS 2023; 42:1032-1062. [PMID: 35670041 PMCID: PMC9730434 DOI: 10.1002/mas.21786] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/28/2022] [Revised: 04/28/2022] [Accepted: 04/29/2022] [Indexed: 06/15/2023]
Abstract
A subfield of neuroproteomics, retina proteomics has experienced a transformative growth since its inception due to methodological advances in enabling chemical, biochemical, and molecular biology techniques. This review focuses on mass spectrometry's contributions to facilitate mammalian and avian retina proteomics to catalog and quantify retinal protein expressions, determine their posttranslational modifications, as well as its applications to study the proteome of the retina in the context of biology, health and diseases, and therapy developments.
Collapse
Affiliation(s)
- Laszlo Prokai
- Department of Pharmacology and Neuroscience, University of North Texas Health Science Center, Fort Worth, Texas, USA
| | - Khadiza Zaman
- Department of Pharmacology and Neuroscience, University of North Texas Health Science Center, Fort Worth, Texas, USA
| | - Katalin Prokai-Tatrai
- Department of Pharmacology and Neuroscience, University of North Texas Health Science Center, Fort Worth, Texas, USA
| |
Collapse
|
21
|
Bakker R, Ellers J, Roelofs D, Vooijs R, Dijkstra T, van Gestel CAM, Hoedjes KM. Combining time-resolved transcriptomics and proteomics data for Adverse Outcome Pathway refinement in ecotoxicology. THE SCIENCE OF THE TOTAL ENVIRONMENT 2023; 869:161740. [PMID: 36708843 DOI: 10.1016/j.scitotenv.2023.161740] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/06/2022] [Revised: 01/14/2023] [Accepted: 01/17/2023] [Indexed: 06/18/2023]
Abstract
Conventional Environmental Risk Assessment (ERA) of pesticide pollution is based on soil concentrations and apical endpoints, such as the reproduction of test organisms, but has traditionally disregarded information along the organismal response cascade leading to an adverse outcome. The Adverse Outcome Pathway (AOP) framework includes response information at any level of biological organization, providing opportunities to use intermediate responses as a predictive read-out for adverse outcomes instead. Transcriptomic and proteomic data can provide thousands of data points on the response to toxic exposure. Combining multiple omics data types is necessary for a comprehensive overview of the response cascade and, therefore, AOP development. However, it is unclear if transcript and protein responses are synchronized in time or time lagged. To understand if analysis of multi-omics data obtained at the same timepoint reveal one synchronized response cascade, we studied time-resolved shifts in gene transcript and protein abundance in the springtail Folsomia candida, a soil ecotoxicological model, after exposure to the neonicotinoid insecticide imidacloprid. We analyzed transcriptome and proteome data every 12 h up to 72 h after onset of exposure. The most pronounced shift in both transcript and protein abundances was observed after 48 h exposure. Moreover, cross-correlation analyses indicate that most genes displayed the highest correlation between transcript and protein abundances without a time-lag. This demonstrates that a combined analysis of transcriptomic and proteomic data from the same time-point can be used for AOP improvement. This data will promote the development of biomarkers for the presence of neonicotinoid insecticides or chemicals with a similar mechanism of action in soils.
Collapse
Affiliation(s)
- Ruben Bakker
- Amsterdam Institute for Life and Environment (A-LIFE), Faculty of Science, Vrije Universiteit Amsterdam, De Boelelaan 1085, 1081 HV Amsterdam, the Netherlands
| | - Jacintha Ellers
- Amsterdam Institute for Life and Environment (A-LIFE), Faculty of Science, Vrije Universiteit Amsterdam, De Boelelaan 1085, 1081 HV Amsterdam, the Netherlands
| | - Dick Roelofs
- Keygene N.V., Agro Business Park 90, 6708 PW Wageningen, the Netherlands
| | - Riet Vooijs
- Amsterdam Institute for Life and Environment (A-LIFE), Faculty of Science, Vrije Universiteit Amsterdam, De Boelelaan 1085, 1081 HV Amsterdam, the Netherlands
| | - Tjeerd Dijkstra
- Max Planck Institute for Developmental Biology, Max-Planck-Ring 25, D-72076 Tübingen, Germany
| | - Cornelis A M van Gestel
- Amsterdam Institute for Life and Environment (A-LIFE), Faculty of Science, Vrije Universiteit Amsterdam, De Boelelaan 1085, 1081 HV Amsterdam, the Netherlands
| | - Katja M Hoedjes
- Amsterdam Institute for Life and Environment (A-LIFE), Faculty of Science, Vrije Universiteit Amsterdam, De Boelelaan 1085, 1081 HV Amsterdam, the Netherlands.
| |
Collapse
|
22
|
Wolski WE, Nanni P, Grossmann J, d'Errico M, Schlapbach R, Panse C. prolfqua: A Comprehensive R-Package for Proteomics Differential Expression Analysis. J Proteome Res 2023; 22:1092-1104. [PMID: 36939687 PMCID: PMC10088014 DOI: 10.1021/acs.jproteome.2c00441] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/21/2023]
Abstract
Mass spectrometry is widely used for quantitative proteomics studies, relative protein quantification, and differential expression analysis of proteins. There is a large variety of quantification software and analysis tools. Nevertheless, there is a need for a modular, easy-to-use application programming interface in R that transparently supports a variety of well principled statistical procedures to make applying them to proteomics data, comparing and understanding their differences easy. The prolfqua package integrates essential steps of the mass spectrometry-based differential expression analysis workflow: quality control, data normalization, protein aggregation, statistical modeling, hypothesis testing, and sample size estimation. The package makes integrating new data formats easy. It can be used to model simple experimental designs with a single explanatory variable and complex experiments with multiple factors and hypothesis testing. The implemented methods allow sensitive and specific differential expression analysis. Furthermore, the package implements benchmark functionality that can help to compare data acquisition, data preprocessing, or data modeling methods using a gold standard data set. The application programmer interface of prolfqua strives to be clear, predictable, discoverable, and consistent to make proteomics data analysis application development easy and exciting. Finally, the prolfqua R-package is available on GitHub https://github.com/fgcz/prolfqua, distributed under the MIT license. It runs on all platforms supported by the R free software environment for statistical computing and graphics.
Collapse
Affiliation(s)
- Witold E Wolski
- Functional Genomics Center Zurich (FGCZ)-University of Zurich/ETH Zurich, Winterthurerstrasse 190, CH-8057 Zurich, Switzerland.,Swiss Institute of Bioinformatics (SIB) Quartier Sorge-Batiment Amphipole, 1015 Lausanne, Switzerland
| | - Paolo Nanni
- Functional Genomics Center Zurich (FGCZ)-University of Zurich/ETH Zurich, Winterthurerstrasse 190, CH-8057 Zurich, Switzerland
| | - Jonas Grossmann
- Functional Genomics Center Zurich (FGCZ)-University of Zurich/ETH Zurich, Winterthurerstrasse 190, CH-8057 Zurich, Switzerland.,Swiss Institute of Bioinformatics (SIB) Quartier Sorge-Batiment Amphipole, 1015 Lausanne, Switzerland
| | - Maria d'Errico
- Functional Genomics Center Zurich (FGCZ)-University of Zurich/ETH Zurich, Winterthurerstrasse 190, CH-8057 Zurich, Switzerland.,Swiss Institute of Bioinformatics (SIB) Quartier Sorge-Batiment Amphipole, 1015 Lausanne, Switzerland
| | - Ralph Schlapbach
- Functional Genomics Center Zurich (FGCZ)-University of Zurich/ETH Zurich, Winterthurerstrasse 190, CH-8057 Zurich, Switzerland
| | - Christian Panse
- Functional Genomics Center Zurich (FGCZ)-University of Zurich/ETH Zurich, Winterthurerstrasse 190, CH-8057 Zurich, Switzerland.,Swiss Institute of Bioinformatics (SIB) Quartier Sorge-Batiment Amphipole, 1015 Lausanne, Switzerland
| |
Collapse
|
23
|
Vanderaa C, Gatto L. The Current State of Single-Cell Proteomics Data Analysis. Curr Protoc 2023; 3:e658. [PMID: 36633424 DOI: 10.1002/cpz1.658] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023]
Abstract
Sound data analysis is essential to retrieve meaningful biological information from single-cell proteomics experiments. This analysis is carried out by computational methods that are assembled into workflows, and their implementations influence the conclusions that can be drawn from the data. In this work, we explore and compare the computational workflows that have been used over the last four years and identify a profound lack of consensus on how to analyze single-cell proteomics data. We highlight the need for benchmarking of computational workflows and standardization of computational tools and data, as well as carefully designed experiments. Finally, we cover the current standardization efforts that aim to fill the gap, list the remaining missing pieces, and conclude with lessons learned from the replication of published single-cell proteomics analyses. © 2023 Wiley Periodicals LLC.
Collapse
Affiliation(s)
- Christophe Vanderaa
- Computational Biology and Bioinformatics Unit (CBIO), de Duve Institute, Université catholique de Louvain, Belgium
| | - Laurent Gatto
- Computational Biology and Bioinformatics Unit (CBIO), de Duve Institute, Université catholique de Louvain, Belgium
| |
Collapse
|
24
|
Proteomic Comparison of Three Wild-Type Pseudorabies Virus Strains and the Attenuated Bartha Strain Reveals Reduced Incorporation of Several Tegument Proteins in Bartha Virions. J Virol 2022; 96:e0115822. [PMID: 36453884 PMCID: PMC9769387 DOI: 10.1128/jvi.01158-22] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/03/2022] Open
Abstract
Pseudorabies virus (PRV) is a member of the alphaherpesvirus subfamily and the causative agent of Aujeszky's disease in pigs. Driven by the large economic losses associated with PRV infection, several vaccines and vaccine programs have been developed. To this day, the attenuated Bartha strain, generated by serial passaging, represents the golden standard for PRV vaccination. However, a proteomic comparison of the Bartha virion to wild-type (WT) PRV virions is lacking. Here, we present a comprehensive mass spectrometry-based proteome comparison of the attenuated Bartha strain and three commonly used WT PRV strains: Becker, Kaplan, and NIA3. We report the detection of 40 structural and 14 presumed nonstructural proteins through a combination of data-dependent and data-independent acquisition. Interstrain comparisons revealed that packaging of the capsid and most envelope proteins is largely comparable in-between all four strains, except for the envelope protein pUL56, which is less abundant in Bartha virions. However, distinct differences were noted for several tegument proteins. Most strikingly, we noted a severely reduced incorporation of the tegument proteins IE180, VP11/12, pUS3, VP22, pUL41, pUS1, and pUL40 in Bartha virions. Moreover, and likely as a consequence, we also observed that Bartha virions are on average smaller and more icosahedral compared to WT virions. Finally, we detected at least 28 host proteins that were previously described in PRV virions and noticed considerable strain-specific differences with regard to host proteins, arguing that the potential role of packaged host proteins in PRV replication and spread should be further explored. IMPORTANCE The pseudorabies virus (PRV) vaccine strain Bartha-an attenuated strain created by serial passaging-represents an exceptional success story in alphaherpesvirus vaccination. Here, we used mass spectrometry to analyze the Bartha virion composition in comparison to three established WT PRV strains. Many viral tegument proteins that are considered nonessential for viral morphogenesis were drastically less abundant in Bartha virions compared to WT virions. Interestingly, many of the proteins that are less incorporated in Bartha participate in immune evasion strategies of alphaherpesviruses. In addition, we observed a reduced size and more icosahedral morphology of the Bartha virions compared to WT PRV. Given that the Bartha vaccine strain elicits potent immune responses, our findings here suggest that differences in protein packaging may contribute to its immunogenicity. Further exploration of these observations could aid the development of efficacious vaccines against other alphaherpesvirus vaccines such as HSV-1/2 or EHV-1.
Collapse
|
25
|
Chion M, Carapito C, Bertrand F. Accounting for multiple imputation-induced variability for differential analysis in mass spectrometry-based label-free quantitative proteomics. PLoS Comput Biol 2022; 18:e1010420. [PMID: 36037245 PMCID: PMC9462777 DOI: 10.1371/journal.pcbi.1010420] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2022] [Revised: 09/09/2022] [Accepted: 07/21/2022] [Indexed: 11/20/2022] Open
Abstract
Imputing missing values is common practice in label-free quantitative proteomics. Imputation aims at replacing a missing value with a user-defined one. However, the imputation itself may not be optimally considered downstream of the imputation process, as imputed datasets are often considered as if they had always been complete. Hence, the uncertainty due to the imputation is not adequately taken into account. We provide a rigorous multiple imputation strategy, leading to a less biased estimation of the parameters' variability thanks to Rubin's rules. The imputation-based peptide's intensities' variance estimator is then moderated using Bayesian hierarchical models. This estimator is finally included in moderated t-test statistics to provide differential analyses results. This workflow can be used both at peptide and protein-level in quantification datasets. Indeed, an aggregation step is included for protein-level results based on peptide-level quantification data. Our methodology, named mi4p, was compared to the state-of-the-art limma workflow implemented in the DAPAR R package, both on simulated and real datasets. We observed a trade-off between sensitivity and specificity, while the overall performance of mi4p outperforms DAPAR in terms of F-Score.
Collapse
Affiliation(s)
- Marie Chion
- Institut de Recherche Mathématique Avancée, UMR 7501, CNRS-Université de Strasbourg, Strasbourg, France
- Laboratoire de Spectrométrie de Masse Bio-Organique, Institut Pluridisciplinaire Hubert Curien, UMR 7178, CNRS-Université de Strasbourg, Strasbourg, France
- Laboratoire Mathématiques appliquées à Paris 5, UMR 8145, CNRS-Université Paris Cité, Paris, France
| | - Christine Carapito
- Laboratoire de Spectrométrie de Masse Bio-Organique, Institut Pluridisciplinaire Hubert Curien, UMR 7178, CNRS-Université de Strasbourg, Strasbourg, France
- Infrastructure Nationale de Protéomique ProFi - FR2048, 67087 Strasbourg, France
| | - Frédéric Bertrand
- Institut de Recherche Mathématique Avancée, UMR 7501, CNRS-Université de Strasbourg, Strasbourg, France
- Laboratoire Informatique et Société Numérique, Université de Technologie de Troyes, Troyes, France
| |
Collapse
|
26
|
Proteomic Assessment of C57BL/6 Hippocampi after Non-Selective Pharmacological Inhibition of Nitric Oxide Synthase Activity: Implications of Seizure-like Neuronal Hyperexcitability Followed by Tauopathy. Biomedicines 2022; 10:biomedicines10081772. [PMID: 35892672 PMCID: PMC9331517 DOI: 10.3390/biomedicines10081772] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2022] [Revised: 07/18/2022] [Accepted: 07/19/2022] [Indexed: 11/17/2022] Open
Abstract
Nitric oxide (NO) is a small gaseous signaling molecule responsible for maintaining homeostasis in a myriad of tissues and molecular pathways in neurology and the cardiovasculature. In recent years, there has been increasing interest in the potential interaction between arterial stiffness (AS), an independent cardiovascular risk factor, and neurodegenerative syndromes given increasingly epidemiological study reports. For this reason, we previously investigated the mechanistic convergence between AS and neurodegeneration via the progressive non-selective inhibition of all nitric oxide synthase (NOS) isoforms with N(G)-nitro-L-arginine methyl ester (L-NAME) in C57BL/6 mice. Our previous results showed progressively increased AS in vivo and impaired visuospatial learning and memory in L-NAME-treated C57BL/6 mice. In the current study, we sought to further investigate the progressive molecular signatures in hippocampal tissue via LC–MS/MS proteomic analysis. Our data implicate mitochondrial dysfunction due to progressive L-NAME treatment. Two weeks of L-NAME treatment implicates altered G-protein-coupled-receptor signaling in the nerve synapse and associated presence of seizures and altered emotional behavior. Furthermore, molecular signatures implicate the cerebral presence of seizure-related hyperexcitability after short-term (8 weeks) treatment followed by ribosomal dysfunction and tauopathy after long-term (16 weeks) treatment.
Collapse
|
27
|
Plancade S, Berland M, Blein-Nicolas M, Langella O, Bassignani A, Juste C. A combined test for feature selection on sparse metaproteomics data-an alternative to missing value imputation. PeerJ 2022; 10:e13525. [PMID: 35769140 PMCID: PMC9235818 DOI: 10.7717/peerj.13525] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2021] [Accepted: 05/11/2022] [Indexed: 01/18/2023] Open
Abstract
One of the difficulties encountered in the statistical analysis of metaproteomics data is the high proportion of missing values, which are usually treated by imputation. Nevertheless, imputation methods are based on restrictive assumptions regarding missingness mechanisms, namely "at random" or "not at random". To circumvent these limitations in the context of feature selection in a multi-class comparison, we propose a univariate selection method that combines a test of association between missingness and classes, and a test for difference of observed intensities between classes. This approach implicitly handles both missingness mechanisms. We performed a quantitative and qualitative comparison of our procedure with imputation-based feature selection methods on two experimental data sets, as well as simulated data with various scenarios regarding the missingness mechanisms and the nature of the difference of expression (differential intensity or differential presence). Whereas we observed similar performances in terms of prediction on the experimental data set, the feature ranking and selection from various imputation-based methods were strongly divergent. We showed that the combined test reaches a compromise by correlating reasonably with other methods, and remains efficient in all simulated scenarios unlike imputation-based feature selection methods.
Collapse
Affiliation(s)
- Sandra Plancade
- UR875 MIAT, Université fédérale de Toulouse, INRAE, Castanet-Tolosan, France
| | - Magali Berland
- Université Paris-Saclay, INRAE, MGP, Jouy en Josas, France
| | - Mélisande Blein-Nicolas
- Université Paris-Saclay, CNRS, INRAE, AgroParisTech, GQE-Le Moulon, Gif-sur-Yvette, France,Université Paris-Saclay, CNRS, INRAE, AgroParisTech, PAPPSO, Gif-sur-Yvette, France
| | - Olivier Langella
- Université Paris-Saclay, CNRS, INRAE, AgroParisTech, GQE-Le Moulon, Gif-sur-Yvette, France,Université Paris-Saclay, CNRS, INRAE, AgroParisTech, PAPPSO, Gif-sur-Yvette, France
| | - Ariane Bassignani
- Université Paris-Saclay, INRAE, MGP, Jouy en Josas, France,Université Paris-Saclay, CNRS, INRAE, AgroParisTech, PAPPSO, Gif-sur-Yvette, France
| | - Catherine Juste
- Micalis Institute, Université Paris-Saclay, INRAE, AgroParis Tech, Jouy-en-Josas, France
| |
Collapse
|
28
|
Crook OM, Chung CW, Deane CM. Challenges and Opportunities for Bayesian Statistics in Proteomics. J Proteome Res 2022; 21:849-864. [PMID: 35258980 PMCID: PMC8982455 DOI: 10.1021/acs.jproteome.1c00859] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2021] [Indexed: 12/27/2022]
Abstract
Proteomics is a data-rich science with complex experimental designs and an intricate measurement process. To obtain insights from the large data sets produced, statistical methods, including machine learning, are routinely applied. For a quantity of interest, many of these approaches only produce a point estimate, such as a mean, leaving little room for more nuanced interpretations. By contrast, Bayesian statistics allows quantification of uncertainty through the use of probability distributions. These probability distributions enable scientists to ask complex questions of their proteomics data. Bayesian statistics also offers a modular framework for data analysis by making dependencies between data and parameters explicit. Hence, specifying complex hierarchies of parameter dependencies is straightforward in the Bayesian framework. This allows us to use a statistical methodology which equals, rather than neglects, the sophistication of experimental design and instrumentation present in proteomics. Here, we review Bayesian methods applied to proteomics, demonstrating their potential power, alongside the challenges posed by adopting this new statistical framework. To illustrate our review, we give a walk-through of the development of a Bayesian model for dynamic organic orthogonal phase-separation (OOPS) data.
Collapse
Affiliation(s)
- Oliver M. Crook
- Department
of Statistics, University of Oxford, Oxford OX1 3LB, United Kingdom
| | - Chun-wa Chung
- Structural
and Biophysical Sciences, GlaxoSmithKline
R&D, Stevenage SG1 2NY, United Kingdom
| | - Charlotte M. Deane
- Department
of Statistics, University of Oxford, Oxford OX1 3LB, United Kingdom
| |
Collapse
|
29
|
Klann K, Münch C. PBLMM: Peptide-based linear mixed models for differential expression analysis of shotgun proteomics data. J Cell Biochem 2022; 123:691-696. [PMID: 35132673 DOI: 10.1002/jcb.30225] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2021] [Revised: 01/23/2022] [Accepted: 01/27/2022] [Indexed: 11/07/2022]
Abstract
Here, we present a peptide-based linear mixed models tool-PBLMM, a standalone desktop application for differential expression analysis of proteomics data. We also provide a Python package that allows streamlined data analysis workflows implementing the PBLMM algorithm. PBLMM is easy to use without scripting experience and calculates differential expression by peptide-based linear mixed regression models. We show that peptide-based models outperform classical methods of statistical inference of differentially expressed proteins. In addition, PBLMM exhibits superior statistical power in situations of low effect size and/or low sample size. Taken together our tool provides an easy-to-use, high-statistical-power method to infer differentially expressed proteins from proteomics data.
Collapse
Affiliation(s)
- Kevin Klann
- Institute of Biochemistry II, Faculty of Medicine, Goethe University Frankfurt, Frankfurt am Main, Germany
| | - Christian Münch
- Institute of Biochemistry II, Faculty of Medicine, Goethe University Frankfurt, Frankfurt am Main, Germany
| |
Collapse
|
30
|
Comparative assessment and novel strategy on methods for imputing proteomics data. Sci Rep 2022; 12:1067. [PMID: 35058491 PMCID: PMC8776850 DOI: 10.1038/s41598-022-04938-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2021] [Accepted: 01/04/2022] [Indexed: 11/09/2022] Open
Abstract
Missing values are a major issue in quantitative proteomics analysis. While many methods have been developed for imputing missing values in high-throughput proteomics data, a comparative assessment of imputation accuracy remains inconclusive, mainly because mechanisms contributing to true missing values are complex and existing evaluation methodologies are imperfect. Moreover, few studies have provided an outlook of future methodological development. We first re-evaluate the performance of eight representative methods targeting three typical missing mechanisms. These methods are compared on both simulated and masked missing values embedded within real proteomics datasets, and performance is evaluated using three quantitative measures. We then introduce fused regularization matrix factorization, a low-rank global matrix factorization framework, capable of integrating local similarity derived from additional data types. We also explore a biologically-inspired latent variable modeling strategy—convex analysis of mixtures—for missing value imputation and present preliminary experimental results. While some winners emerged from our comparative assessment, the evaluation is intrinsically imperfect because performance is evaluated indirectly on artificial missing or masked values not authentic missing values. Nevertheless, we show that our fused regularization matrix factorization provides a novel incorporation of external and local information, and the exploratory implementation of convex analysis of mixtures presents a biologically plausible new approach.
Collapse
|
31
|
Vanderaa C, Gatto L. Replication of single-cell proteomics data reveals important computational challenges. Expert Rev Proteomics 2021; 18:835-843. [PMID: 34602016 DOI: 10.1080/14789450.2021.1988571] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023]
Abstract
INTRODUCTION Mass spectrometry-based proteomics is actively embracing quantitative, single-cell level analyses. Indeed, recent advances in sample preparation and mass spectrometry (MS) have enabled the emergence of quantitative MS-based single-cell proteomics (SCP). While exciting and promising, SCP still has many rough edges. The current analysis workflows are custom and built from scratch. The field is therefore craving for standardized software that promotes principled and reproducible SCP data analyses. AREAS COVERED This special report is the first step toward the formalization and standardization of SCP data analysis. scp, the software that accompanies this work, successfully replicates one of the landmark SCP studies and is applicable to other experiments and designs. We created a repository containing the replicated workflow with comprehensive documentation in order to favor further dissemination and improvements of SCP data analyses. EXPERT OPINION Replicating SCP data analyses uncovers important challenges in SCP data analysis. We describe two such challenges in detail: batch correction and data missingness. We provide the current state-of-the-art and illustrate the associated limitations. We also highlight the intimate dependence that exists between batch effects and data missingness and offer avenues for dealing with these exciting challenges.
Collapse
Affiliation(s)
- Christophe Vanderaa
- Computational Biology and Bioinformatics Unit (CBIO), de Duve Institute, UCLouvain, Belgium
| | - Laurent Gatto
- Computational Biology and Bioinformatics Unit (CBIO), de Duve Institute, UCLouvain, Belgium
| |
Collapse
|
32
|
Verhelst S, De Clerck L, Willems S, Van Puyvelde B, Daled S, Deforce D, Dhaenens M. Comprehensive histone epigenetics: A mass spectrometry based screening assay to measure epigenetic toxicity. MethodsX 2020; 7:101055. [PMID: 32995308 PMCID: PMC7508989 DOI: 10.1016/j.mex.2020.101055] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2020] [Accepted: 09/02/2020] [Indexed: 01/23/2023] Open
Abstract
Evidence of the involvement of epigenetics in pathologies such as cancer, diabetes, and neurodegeneration has increased global interest in epigenetic modifications. For nearly thirty years, it has been known that cancer cells exhibit abnormal DNA methylation patterns. In contrast, the large-scale analysis of histone post-translational modifications (hPTMs) has lagged behind because classically, histone modification analysis has relied on site specific antibody-based techniques. Mass spectrometry (MS) is a technique that holds the promise to picture the histone code comprehensively in a single experiment. Therefore, we developed an MS-based method that is capable of tracking all possible hPTMs in an untargeted approach. In this way, trends in single and combinatorial hPTMs can be reported and enable prediction of the epigenetic toxicity of compounds. Moreover, this method is based on the use of human cells to provide preliminary data, thereby omitting the need to sacrifice laboratory animals. Improving the workflow and the user-friendliness in order to become a high throughput, easily applicable, toxicological screening assay is an ongoing effort. Still, this novel toxicoepigenetic assay and the data it generates holds great potential for, among others, pharmaceutical industry, food science, clinical diagnostics and, environmental toxicity screening. •There is a growing interest in epigenetic modifications, and more specifically in histone post-translational modifications (hPTMs).•We describe an MS-based workflow that is capable of tracking all possible hPTMs in an untargeted approach that makes use of human cells.•Improving the workflow and the user-friendliness in order to become a high throughput, easily applicable, toxicological screening assay is an ongoing effort.
Collapse
Key Words
- AUC, area under the curve
- DDA, data-dependent acquisition
- DIA, data-independent acquisition
- DTT, dithiothreitol
- Drug safety
- FA, formic acid
- FDR, false discovery rate
- GABA, gamma-aminobutyric acid
- GRX, gingisrex
- HAT, histone acetyltransferase
- HDACi, histone deacetylase inhibitor
- HLB, hypotonic lysis buffer
- HPLC, high-performance liquid chromatography
- Histone post-translational modifications
- K, Lysine
- LC-MS/MS
- M, Methionine
- MS, Mass spectrometry
- MS/MS, tandem mass spectrometry
- N, asparagine
- PBS, phosphate buffered saline
- Pharmacoepigenetics
- Proteomics
- Q, glutamine
- R, arginine
- RA, relative abundance
- RP, reversed phase
- RT, room temperature
- S, serine
- SWATH, sequential window acquisition of all theoretical fragment ion spectra
- T, threonine
- TEAB, triethylammonium bicarbonate
- Toxicoepigenetics
- VPA, valproic acid
- Y, tyrosine
- hESC, human embryonic stem cell
- hPTM, histone post-translational modification
Collapse
Affiliation(s)
- Sigrid Verhelst
- ProGenTomics, Laboratory of Pharmaceutical Biotechnology, Ghent University, Ottergemsesteenweg 460, 9000, Ghent, Belgium
| | - Laura De Clerck
- ProGenTomics, Laboratory of Pharmaceutical Biotechnology, Ghent University, Ottergemsesteenweg 460, 9000, Ghent, Belgium
| | - Sander Willems
- ProGenTomics, Laboratory of Pharmaceutical Biotechnology, Ghent University, Ottergemsesteenweg 460, 9000, Ghent, Belgium
| | - Bart Van Puyvelde
- ProGenTomics, Laboratory of Pharmaceutical Biotechnology, Ghent University, Ottergemsesteenweg 460, 9000, Ghent, Belgium
| | - Simon Daled
- ProGenTomics, Laboratory of Pharmaceutical Biotechnology, Ghent University, Ottergemsesteenweg 460, 9000, Ghent, Belgium
| | - Dieter Deforce
- ProGenTomics, Laboratory of Pharmaceutical Biotechnology, Ghent University, Ottergemsesteenweg 460, 9000, Ghent, Belgium
| | - Maarten Dhaenens
- ProGenTomics, Laboratory of Pharmaceutical Biotechnology, Ghent University, Ottergemsesteenweg 460, 9000, Ghent, Belgium
| |
Collapse
|