1
|
Chua EW, Ooi DJ, Nor Muhammad NA. A concise guide to essential R packages for analyses of DNA, RNA, and proteins. Mol Cells 2024; 47:100120. [PMID: 39374792 PMCID: PMC11541695 DOI: 10.1016/j.mocell.2024.100120] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2024] [Revised: 09/30/2024] [Accepted: 09/30/2024] [Indexed: 10/09/2024] Open
Abstract
R is widely regarded as unrivaled by other high-level programming languages for its statistical functions. The popularity of R as a statistical language has led many to overlook its applications outside the statistical realm. In this brief review, we present a list of R packages for supporting projects that entail analyses of DNA, RNA, and proteins. These R packages span the gamut of important molecular techniques, from routine quantitative polymerase chain reaction (qPCR) and Western blotting to high-throughput sequencing and proteomics generating very large datasets. The text-mining power of R can also be harnessed to facilitate literature reviews and predict future research trends and avenues. We encourage researchers to make full use of R in their work, given the versatility of the language, as well as its straightforward syntax which eases the initial learning curve.
Collapse
Affiliation(s)
- Eng Wee Chua
- Centre for Drug and Herbal Development, Faculty of Pharmacy, Universiti Kebangsaan Malaysia, 50300 Kuala Lumpur, Malaysia.
| | - Der Jiun Ooi
- Department of Preclinical Sciences, Faculty of Dentistry, MAHSA University, 42610 Jenjarom, Selangor, Malaysia
| | - Nor Azlan Nor Muhammad
- Institute of Systems Biology (INBIOSIS), Universiti Kebangsaan Malaysia, 43600 Bangi, Selangor, Malaysia
| |
Collapse
|
2
|
Byrdwell WC, Kalscheur KF. An interactive R-based custom quantification program for semi-quantitative analysis of triacylglycerols in bovine milk. Anal Bioanal Chem 2024; 416:5527-5555. [PMID: 39289202 DOI: 10.1007/s00216-024-05528-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2024] [Revised: 08/06/2024] [Accepted: 09/03/2024] [Indexed: 09/19/2024]
Abstract
The R programming language, RStudio, and open-source software solutions for analysis of liquid chromatography-mass spectrometry (LC-MS) data have been used with user-written R-based custom quantification programs (CQP) for semi-quantification of triacylglycerols (TAGs) in bovine milk lipid extracts. Using the peak-finding capabilities of the package "xcms" in RStudio, peaks were integrated, and retention times aligned, normalized, and then used for semi-quantitative analysis of a custom set of four extraction internal standards (EISs) and 29 TAG regioisomers using the choice of four analytical internal standards (AISs). Alternating stereospecific numbering (sn) 1,3 TAG regioisomers (standards 1, 3, and 5 of six calibration standards) and sn-1,2 TAG regioisomers (standards 2, 4, and 6 of six standards) were used to make a set of six calibration standards, which were used for quantification using a linear fit model, polynomial fit model, power fit model, level-bracketed linear fit, replicate-bracketed polynomial fit, replicate-bracketed power fit, and replicate- and level-bracketed linear fit and response factors. For example, the linear fit for EIS1 gave an unacceptable coefficient of determination (CoD), r2 = 0.9616, whereas the polynomial fit gave r2 = 0.9908 and the power fit gave r2 = 0.9928, while the double-bracketed linear fit gave CoDs of r2 = 0.9960, 0.9848, and 0.9781 for the three brackets, yet gave the least % difference to known calibration concentrations. For unparalleled transparency, the CQP produced webpages that allowed every step in the data processing and quantification sequence to be verified and reproduced, and contained interactive figures. The data are publicly available using a digital object identifier (DOI). The R code can be downloaded and used with the downloadable data to reproduce the results, to modify the code and further customize the results, or to copy and paste and adapt the code to other quantification applications.
Collapse
Affiliation(s)
- Wm Craig Byrdwell
- Methods and Application of Food Composition Lab, Beltsville Human Nutrition Research Center, Agricultural Research Service, U.S. Department of Agriculture, Beltsville, MD, 20705, USA.
| | - Kenneth F Kalscheur
- U.S. Dairy Forage Research Center, Agricultural Research Service, U.S. Department of Agriculture, Madison, WI, USA
| |
Collapse
|
3
|
Weickert P, Dürauer S, Götz MJ, Li HY, Stingele J. Electro-elution-based purification of covalent DNA-protein cross-links. Nat Protoc 2024; 19:2891-2914. [PMID: 38890499 DOI: 10.1038/s41596-024-01004-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2023] [Accepted: 02/19/2024] [Indexed: 06/20/2024]
Abstract
Covalent DNA-protein cross-links (DPCs) are pervasive DNA lesions that challenge genome stability and can be induced by metabolic or chemotherapeutic cross-linking agents including reactive aldehydes, topoisomerase poisons and DNMT1 inhibitors. The purification of x-linked proteins (PxP), where DNA-cross-linked proteins are separated from soluble proteins via electro-elution, can be used to identify DPCs. Here we describe a versatile and sensitive strategy for PxP. Mammalian cells are collected following exposure to a DPC-inducing agent, embedded in low-melt agarose plugs and lysed under denaturing conditions. Following lysis, the soluble proteins are extracted from the agarose plug by electro-elution, while genomic DNA and cross-linked proteins are retained in the plug. The cross-linked proteins can then be analyzed by standard analytical techniques such as sodium dodecyl-sulfate-polyacrylamide gel electrophoresis followed by western blotting or fluorescent staining. Alternatively, quantitative mass spectrometry-based proteomics can be used for the unbiased identification of DPCs. The isolation and analysis of DPCs by PxP overcomes the limitations of alternative methods to analyze DPCs that rely on precipitation as the separating principle and can be performed by users trained in molecular or cell biology within 2-3 d. The protocol has been optimized to study DPC induction and repair in mammalian cells but may also be adapted to other sample types including bacteria, yeast and tissue samples.
Collapse
Affiliation(s)
- Pedro Weickert
- Department of Biochemistry, Ludwig-Maximilians-Universität München, Munich, Germany
- Gene Center, Ludwig-Maximilians-Universität München, Munich, Germany
| | - Sophie Dürauer
- Department of Biochemistry, Ludwig-Maximilians-Universität München, Munich, Germany
- Gene Center, Ludwig-Maximilians-Universität München, Munich, Germany
| | - Maximilian J Götz
- Department of Biochemistry, Ludwig-Maximilians-Universität München, Munich, Germany
- Gene Center, Ludwig-Maximilians-Universität München, Munich, Germany
| | - Hao-Yi Li
- Department of Biochemistry, Ludwig-Maximilians-Universität München, Munich, Germany
- Gene Center, Ludwig-Maximilians-Universität München, Munich, Germany
| | - Julian Stingele
- Department of Biochemistry, Ludwig-Maximilians-Universität München, Munich, Germany.
- Gene Center, Ludwig-Maximilians-Universität München, Munich, Germany.
| |
Collapse
|
4
|
Bielow C, Hoffmann N, Jimenez-Morales D, Van Den Bossche T, Vizcaíno JA, Tabb DL, Bittremieux W, Walzer M. Communicating Mass Spectrometry Quality Information in mzQC with Python, R, and Java. JOURNAL OF THE AMERICAN SOCIETY FOR MASS SPECTROMETRY 2024; 35:1875-1882. [PMID: 38918936 PMCID: PMC11311537 DOI: 10.1021/jasms.4c00174] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/29/2024] [Revised: 06/07/2024] [Accepted: 06/11/2024] [Indexed: 06/27/2024]
Abstract
Mass spectrometry is a powerful technique for analyzing molecules in complex biological samples. However, inter- and intralaboratory variability and bias can affect the data due to various factors, including sample handling and preparation, instrument calibration and performance, and data acquisition and processing. To address this issue, the Quality Control (QC) working group of the Human Proteome Organization's Proteomics Standards Initiative has established the standard mzQC file format for reporting and exchanging information relating to data quality. mzQC is based on the JavaScript Object Notation (JSON) format and provides a lightweight yet versatile file format that can be easily implemented in software. Here, we present open-source software libraries to process mzQC data in three programming languages: Python, using pymzqc; R, using rmzqc; and Java, using jmzqc. The libraries follow a common data model and provide shared functionalities, including the (de)serialization and validation of mzQC files. We demonstrate use of the software libraries in a workflow for extracting, analyzing, and visualizing QC metrics from different sources. Additionally, we show how these libraries can be integrated with each other, with existing software tools, and in automated workflows for the QC of mass spectrometry data. All software libraries are available as open source under the MS-Quality-Hub organization on GitHub (https://github.com/MS-Quality-Hub).
Collapse
Affiliation(s)
- Chris Bielow
- Bioinformatics
Solution Center, Institut für Mathematik und Informatik, Freie Universität Berlin, Takustrasse 9, 14195 Berlin, Germany
| | - Nils Hoffmann
- Institute
for Bio- and Geosciences (IBG-5), Forschungszentrum Jülich
GmbH, 52428 Jülich, Germany
| | - David Jimenez-Morales
- Department
of Medicine, Stanford University School
of Medicine, Stanford, California 94305, United States
| | - Tim Van Den Bossche
- Department
of Biomolecular Medicine, Faculty of Medicine and Health Sciences, Ghent University, Technologiepark-Zwijnaarde 75, 9052 Ghent, Belgium
- VIB-UGent
Center for Medical Biotechnology, VIB, Technologiepark-Zwijnaarde 75, 9052 Ghent, Belgium
| | - Juan Antonio Vizcaíno
- European
Molecular Biology Laboratory, EMBL-European
Bioinformatics Institute (EMBL-EBI),
Hinxton, Cambridge CB10 1SD, United Kingdom
| | - David L. Tabb
- European
Research Institute for the Biology of Ageing, University Medical Center Groningen, Groningen 9713 AV, The Netherlands
| | - Wout Bittremieux
- Department
of Computer Science, University of Antwerp, Antwerpen 2020, Belgium
| | - Mathias Walzer
- European
Molecular Biology Laboratory, EMBL-European
Bioinformatics Institute (EMBL-EBI),
Hinxton, Cambridge CB10 1SD, United Kingdom
| |
Collapse
|
5
|
Degnan DJ, Lewis LA, Bramer LM, McCue LA, Pesavento JJ, Zhou M, Bilbao A. IsoForma: An R Package for Quantifying and Visualizing Positional Isomers in Top-Down LC-MS/MS Data. J Proteome Res 2024; 23:3318-3321. [PMID: 38421884 DOI: 10.1021/acs.jproteome.3c00681] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/02/2024]
Abstract
Proteoforms, the different forms of a protein with sequence variations including post-translational modifications (PTMs), execute vital functions in biological systems, such as cell signaling and epigenetic regulation. Advances in top-down mass spectrometry (MS) technology have permitted the direct characterization of intact proteoforms and their exact number of modification sites, allowing for the relative quantification of positional isomers (PI). Protein positional isomers refer to a set of proteoforms with identical total mass and set of modifications, but varying PTM site combinations. The relative abundance of PI can be estimated by matching proteoform-specific fragment ions to top-down tandem MS (MS2) data to localize and quantify modifications. However, the current approaches heavily rely on manual annotation. Here, we present IsoForma, an open-source R package for the relative quantification of PI within a single tool. Benchmarking IsoForma's performance against two existing workflows produced comparable results and improvements in speed. Overall, IsoForma provides a streamlined process for quantifying PI, reduces the analysis time, and offers an essential framework for developing customized proteoform analysis workflows. The software is open source and available at https://github.com/EMSL-Computing/isoforma-lib.
Collapse
Affiliation(s)
- David J Degnan
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington 99354, United States
| | - Logan A Lewis
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington 99354, United States
| | - Lisa M Bramer
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington 99354, United States
| | - Lee Ann McCue
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington 99354, United States
| | - James J Pesavento
- Saint Mary's College of California, Moraga, California 94575, United States
| | - Mowei Zhou
- Environmental Molecular Sciences Laboratory, Pacific Northwest National Laboratory, Richland, Washington 99354, United States
| | - Aivett Bilbao
- Environmental Molecular Sciences Laboratory, Pacific Northwest National Laboratory, Richland, Washington 99354, United States
| |
Collapse
|
6
|
Hansen J, Kunert C, Münstermann H, Raezke KP, Seifert S. Application of untargeted liquid chromatography-mass spectrometry to routine analysis of food using three-dimensional bucketing and machine learning. Sci Rep 2024; 14:16594. [PMID: 39026016 PMCID: PMC11258308 DOI: 10.1038/s41598-024-67459-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2023] [Accepted: 07/11/2024] [Indexed: 07/20/2024] Open
Abstract
For the detection of food adulteration, sensitive and reproducible analytical methods are required. Liquid chromatography coupled to high-resolution mass spectrometry (LC-HRMS) is a highly sensitive method that can be used to obtain analytical fingerprints consisting of a variety of different components. Since the comparability of measurements carried out with different devices and at different times is not given, specific adulterants are usually detected in targeted analyses instead of analyzing the entire fingerprint. However, this comprehensive analysis is desirable in order to stay ahead in the race against food fraudsters, who are constantly adapting their adulterations to the latest state of the art in analytics. We have developed and optimized an approach that enables the separate processing of untargeted LC‑HRMS data obtained from different devices and at different times. We demonstrate this by the successful determination of the geographical origin of honey samples using a random forest model. We then show that this approach can be applied to develop a continuously learning classification model and our final model, based on data from 835 samples, achieves a classification accuracy of 94% for 126 test samples from 6 different countries.
Collapse
Affiliation(s)
- Jule Hansen
- Institute of Food Chemistry, Hamburg School of Food Science, University of Hamburg, Grindelallee 117, 20146, Hamburg, Germany
| | - Christof Kunert
- Eurofins Food Integrity Control Services GmbH, Berliner Str. 2, 27721, Ritterhude, Germany
| | - Hella Münstermann
- Institute of Food Chemistry, Hamburg School of Food Science, University of Hamburg, Grindelallee 117, 20146, Hamburg, Germany
| | - Kurt-Peter Raezke
- Eurofins Food Integrity Control Services GmbH, Berliner Str. 2, 27721, Ritterhude, Germany
| | - Stephan Seifert
- Institute of Food Chemistry, Hamburg School of Food Science, University of Hamburg, Grindelallee 117, 20146, Hamburg, Germany.
| |
Collapse
|
7
|
Peng H, Wang H, Kong W, Li J, Goh WWB. Optimizing differential expression analysis for proteomics data via high-performing rules and ensemble inference. Nat Commun 2024; 15:3922. [PMID: 38724498 PMCID: PMC11082229 DOI: 10.1038/s41467-024-47899-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2023] [Accepted: 04/16/2024] [Indexed: 05/12/2024] Open
Abstract
Identification of differentially expressed proteins in a proteomics workflow typically encompasses five key steps: raw data quantification, expression matrix construction, matrix normalization, missing value imputation (MVI), and differential expression analysis. The plethora of options in each step makes it challenging to identify optimal workflows that maximize the identification of differentially expressed proteins. To identify optimal workflows and their common properties, we conduct an extensive study involving 34,576 combinatoric experiments on 24 gold standard spike-in datasets. Applying frequent pattern mining techniques to top-ranked workflows, we uncover high-performing rules that demonstrate optimality has conserved properties. Via machine learning, we confirm optimal workflows are indeed predictable, with average cross-validation F1 scores and Matthew's correlation coefficients surpassing 0.84. We introduce an ensemble inference to integrate results from individual top-performing workflows for expanding differential proteome coverage and resolve inconsistencies. Ensemble inference provides gains in pAUC (up to 4.61%) and G-mean (up to 11.14%) and facilitates effective aggregation of information across varied quantification approaches such as topN, directLFQ, MaxLFQ intensities, and spectral counts. However, further development and evaluation are needed to establish acceptable frameworks for conducting ensemble inference on multiple proteomics workflows.
Collapse
Affiliation(s)
- Hui Peng
- Lee Kong Chian School of Medicine, Nanyang Technological University, Singapore, Singapore
- School of Biological Sciences, Nanyang Technological University, Singapore, Singapore
| | - He Wang
- Lee Kong Chian School of Medicine, Nanyang Technological University, Singapore, Singapore
- School of Biological Sciences, Nanyang Technological University, Singapore, Singapore
| | - Weijia Kong
- Lee Kong Chian School of Medicine, Nanyang Technological University, Singapore, Singapore
- School of Biological Sciences, Nanyang Technological University, Singapore, Singapore
| | - Jinyan Li
- Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China.
| | - Wilson Wen Bin Goh
- Lee Kong Chian School of Medicine, Nanyang Technological University, Singapore, Singapore.
- School of Biological Sciences, Nanyang Technological University, Singapore, Singapore.
- Center for Biomedical Informatics, Nanyang Technological University, Singapore, Singapore.
- Center of AI in Medicine, Nanyang Technological University, Singapore, Singapore.
- Division of Neurology, Department of Brain Sciences, Faculty of Medicine, Imperial College London, London, UK.
| |
Collapse
|
8
|
Carnie CJ, Acampora AC, Bader AS, Erdenebat C, Zhao S, Bitensky E, van den Heuvel D, Parnas A, Gupta V, D'Alessandro G, Sczaniecka-Clift M, Weickert P, Aygenli F, Götz MJ, Cordes J, Esain-Garcia I, Melidis L, Wondergem AP, Lam S, Robles MS, Balasubramanian S, Adar S, Luijsterburg MS, Jackson SP, Stingele J. Transcription-coupled repair of DNA-protein cross-links depends on CSA and CSB. Nat Cell Biol 2024; 26:797-810. [PMID: 38600235 PMCID: PMC11098753 DOI: 10.1038/s41556-024-01391-1] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2023] [Accepted: 02/29/2024] [Indexed: 04/12/2024]
Abstract
Covalent DNA-protein cross-links (DPCs) are toxic DNA lesions that block replication and require repair by multiple pathways. Whether transcription blockage contributes to the toxicity of DPCs and how cells respond when RNA polymerases stall at DPCs is unknown. Here we find that DPC formation arrests transcription and induces ubiquitylation and degradation of RNA polymerase II. Using genetic screens and a method for the genome-wide mapping of DNA-protein adducts, DPC sequencing, we discover that Cockayne syndrome (CS) proteins CSB and CSA provide resistance to DPC-inducing agents by promoting DPC repair in actively transcribed genes. Consequently, CSB- or CSA-deficient cells fail to efficiently restart transcription after induction of DPCs. In contrast, nucleotide excision repair factors that act downstream of CSB and CSA at ultraviolet light-induced DNA lesions are dispensable. Our study describes a transcription-coupled DPC repair pathway and suggests that defects in this pathway may contribute to the unique neurological features of CS.
Collapse
Affiliation(s)
- Christopher J Carnie
- Cancer Research UK Cambridge Institute, University of Cambridge, Cambridge, UK.
- The Gurdon Institute and Department of Biochemistry, University of Cambridge, Cambridge, UK.
| | - Aleida C Acampora
- Gene Center and Department of Biochemistry, Ludwig-Maximilians-Universität München, Munich, Germany
| | - Aldo S Bader
- Cancer Research UK Cambridge Institute, University of Cambridge, Cambridge, UK
- The Gurdon Institute and Department of Biochemistry, University of Cambridge, Cambridge, UK
| | - Chimeg Erdenebat
- Gene Center and Department of Biochemistry, Ludwig-Maximilians-Universität München, Munich, Germany
| | - Shubo Zhao
- Gene Center and Department of Biochemistry, Ludwig-Maximilians-Universität München, Munich, Germany
| | - Elnatan Bitensky
- Department of Microbiology and Molecular Genetics, The Institute for Medical Research Israel-Canada, The Faculty of Medicine, The Hebrew University of Jerusalem, Jerusalem, Israel
| | - Diana van den Heuvel
- Department of Human Genetics, Leiden University Medical Center, Leiden, the Netherlands
| | - Avital Parnas
- Department of Microbiology and Molecular Genetics, The Institute for Medical Research Israel-Canada, The Faculty of Medicine, The Hebrew University of Jerusalem, Jerusalem, Israel
| | - Vipul Gupta
- The Gurdon Institute and Department of Biochemistry, University of Cambridge, Cambridge, UK
| | - Giuseppina D'Alessandro
- Cancer Research UK Cambridge Institute, University of Cambridge, Cambridge, UK
- The Gurdon Institute and Department of Biochemistry, University of Cambridge, Cambridge, UK
| | | | - Pedro Weickert
- Gene Center and Department of Biochemistry, Ludwig-Maximilians-Universität München, Munich, Germany
| | - Fatih Aygenli
- Institute of Medical Psychology and Biomedical Center, Faculty of Medicine, Ludwig-Maximilians-Universität München, Munich, Germany
| | - Maximilian J Götz
- Gene Center and Department of Biochemistry, Ludwig-Maximilians-Universität München, Munich, Germany
| | - Jacqueline Cordes
- Gene Center and Department of Biochemistry, Ludwig-Maximilians-Universität München, Munich, Germany
| | - Isabel Esain-Garcia
- Cancer Research UK Cambridge Institute, University of Cambridge, Cambridge, UK
- Yusuf Hamied Department of Chemistry, University of Cambridge, Cambridge, UK
| | - Larry Melidis
- Cancer Research UK Cambridge Institute, University of Cambridge, Cambridge, UK
- Yusuf Hamied Department of Chemistry, University of Cambridge, Cambridge, UK
| | - Annelotte P Wondergem
- Department of Human Genetics, Leiden University Medical Center, Leiden, the Netherlands
| | - Simon Lam
- Cancer Research UK Cambridge Institute, University of Cambridge, Cambridge, UK
- The Gurdon Institute and Department of Biochemistry, University of Cambridge, Cambridge, UK
| | - Maria S Robles
- Institute of Medical Psychology and Biomedical Center, Faculty of Medicine, Ludwig-Maximilians-Universität München, Munich, Germany
| | - Shankar Balasubramanian
- Cancer Research UK Cambridge Institute, University of Cambridge, Cambridge, UK
- Yusuf Hamied Department of Chemistry, University of Cambridge, Cambridge, UK
| | - Sheera Adar
- Department of Microbiology and Molecular Genetics, The Institute for Medical Research Israel-Canada, The Faculty of Medicine, The Hebrew University of Jerusalem, Jerusalem, Israel
| | | | - Stephen P Jackson
- Cancer Research UK Cambridge Institute, University of Cambridge, Cambridge, UK.
- The Gurdon Institute and Department of Biochemistry, University of Cambridge, Cambridge, UK.
| | - Julian Stingele
- Gene Center and Department of Biochemistry, Ludwig-Maximilians-Universität München, Munich, Germany.
| |
Collapse
|
9
|
Bouyssié D, Altıner P, Capella-Gutierrez S, Fernández JM, Hagemeijer YP, Horvatovich P, Hubálek M, Levander F, Mauri P, Palmblad M, Raffelsberger W, Rodríguez-Navas L, Di Silvestre D, Kunkli BT, Uszkoreit J, Vandenbrouck Y, Vizcaíno JA, Winkelhardt D, Schwämmle V. WOMBAT-P: Benchmarking Label-Free Proteomics Data Analysis Workflows. J Proteome Res 2024; 23:418-429. [PMID: 38038272 DOI: 10.1021/acs.jproteome.3c00636] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/02/2023]
Abstract
The inherent diversity of approaches in proteomics research has led to a wide range of software solutions for data analysis. These software solutions encompass multiple tools, each employing different algorithms for various tasks such as peptide-spectrum matching, protein inference, quantification, statistical analysis, and visualization. To enable an unbiased comparison of commonly used bottom-up label-free proteomics workflows, we introduce WOMBAT-P, a versatile platform designed for automated benchmarking and comparison. WOMBAT-P simplifies the processing of public data by utilizing the sample and data relationship format for proteomics (SDRF-Proteomics) as input. This feature streamlines the analysis of annotated local or public ProteomeXchange data sets, promoting efficient comparisons among diverse outputs. Through an evaluation using experimental ground truth data and a realistic biological data set, we uncover significant disparities and a limited overlap in the quantified proteins. WOMBAT-P not only enables rapid execution and seamless comparison of workflows but also provides valuable insights into the capabilities of different software solutions. These benchmarking metrics are a valuable resource for researchers in selecting the most suitable workflow for their specific data sets. The modular architecture of WOMBAT-P promotes extensibility and customization. The software is available at https://github.com/wombat-p/WOMBAT-Pipelines.
Collapse
Affiliation(s)
- David Bouyssié
- Institut de Pharmacologie et de Biologie Structurale (IPBS), Université de Toulouse, CNRS, Université Toulouse III─Paul Sabatier (UT3), 31062 Toulouse, France
- Proteomics French Infrastructure, ProFI, FR 2048 Toulouse, France
| | - Pınar Altıner
- Institut de Pharmacologie et de Biologie Structurale (IPBS), Université de Toulouse, CNRS, Université Toulouse III─Paul Sabatier (UT3), 31062 Toulouse, France
| | | | - José M Fernández
- Life Sciences Department, Barcelona Supercomputing Center (BSC), 08034 Barcelona, Spain
| | - Yanick Paco Hagemeijer
- Department of Analytical Biochemistry, University of Groningen, Groningen Research Institute of Pharmacy, 9712 CP Groningen, The Netherlands
- European Research Institute for the Biology of Ageing, University Medical Center Groningen, 9713 GZ Groningen, The Netherlands
| | - Peter Horvatovich
- Department of Analytical Biochemistry, University of Groningen, Groningen Research Institute of Pharmacy, 9712 CP Groningen, The Netherlands
| | - Martin Hubálek
- Institute of Organic Chemistry and Biochemistry, CAS, 160 00 Prague, Czech Republic
| | - Fredrik Levander
- National Bioinformatics Infrastructure Sweden (NBIS), Science for Life Laboratory, Department of Immunotechnology, Lund University, 22100 Lund, Sweden
| | - Pierluigi Mauri
- Institute for Biomedical Technologies (ITB), Department of Biomedical Sciences, National Research Council (CNR), Segrate, 20054 Milan, Italy
| | - Magnus Palmblad
- Leiden University Medical Center, Postbus 9600, 2300 RC Leiden, The Netherlands
| | - Wolfgang Raffelsberger
- Wolfgang Raffelsberger: Institut de Génétique et de Biologie Moléculaire et Cellulaire, Université de Strasbourg, CNRS UMR7104, INSERM U1258, Illkirch, 1 Rue Laurent Fries, 67404 Illkirch, France
| | - Laura Rodríguez-Navas
- Life Sciences Department, Barcelona Supercomputing Center (BSC), 08034 Barcelona, Spain
| | - Dario Di Silvestre
- Institute for Biomedical Technologies (ITB), Department of Biomedical Sciences, National Research Council (CNR), Segrate, 20054 Milan, Italy
| | - Balázs Tibor Kunkli
- Balázs Tibor Kunkli: Department of Biochemistry and Molecular Biology, University of Debrecen, 4032 Debrecen, Hungary
| | - Julian Uszkoreit
- Medical Faculty, Medical Bioinformatics, Ruhr University Bochum, 44801 Bochum, Germany
- Center for Protein Diagnostics (ProDi), Medical Proteome Analysis, Ruhr University Bochum, 44801 Bochum, Germany
- Medical Faculty, Medizinisches Proteom-Center, Ruhr University Bochum, 44801 Bochum, Germany
| | - Yves Vandenbrouck
- Proteomics French Infrastructure, ProFI, FR 2048 Toulouse, France
- CEA, Fundamental Research Division, Proteomics French Infrastructure, 91191 Gif-sur-Yvette, France
| | - Juan Antonio Vizcaíno
- European Molecular Biology Laboratory European Bioinformatics Institute (EMBL-EBI), Wellcome Trust, Genome Campus, Hinxton, Cambridge CB10 1SD, U.K
| | - Dirk Winkelhardt
- Medical Faculty, Medizinisches Proteom-Center, Ruhr University Bochum, 44801 Bochum, Germany
| | - Veit Schwämmle
- Department of Biochemistry and Molecular Biology, University of Southern Denmark, Campusvej 55, 5230 Odense M, Denmark
| |
Collapse
|
10
|
Villanueva E, Smith T, Pizzinga M, Elzek M, Queiroz RML, Harvey RF, Breckels LM, Crook OM, Monti M, Dezi V, Willis AE, Lilley KS. System-wide analysis of RNA and protein subcellular localization dynamics. Nat Methods 2024; 21:60-71. [PMID: 38036857 PMCID: PMC10776395 DOI: 10.1038/s41592-023-02101-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2023] [Accepted: 10/24/2023] [Indexed: 12/02/2023]
Abstract
Although the subcellular dynamics of RNA and proteins are key determinants of cell homeostasis, their characterization is still challenging. Here we present an integrative framework to simultaneously interrogate the dynamics of the transcriptome and proteome at subcellular resolution by combining two methods: localization of RNA (LoRNA) and a streamlined density-based localization of proteins by isotope tagging (dLOPIT) to map RNA and protein to organelles (nucleus, endoplasmic reticulum and mitochondria) and membraneless compartments (cytosol, nucleolus and cytosolic granules). Interrogating all RNA subcellular locations at once enables system-wide quantification of the proportional distribution of RNA. We obtain a cell-wide overview of localization dynamics for 31,839 transcripts and 5,314 proteins during the unfolded protein response, revealing that endoplasmic reticulum-localized transcripts are more efficiently recruited to cytosolic granules than cytosolic RNAs, and that the translation initiation factor eIF3d is key to sustaining cytoskeletal function. Overall, we provide the most comprehensive overview so far of RNA and protein subcellular localization dynamics.
Collapse
Affiliation(s)
- Eneko Villanueva
- Cambridge Centre for Proteomics, Department of Biochemistry, University of Cambridge, Cambridge, UK
| | - Tom Smith
- Cambridge Centre for Proteomics, Department of Biochemistry, University of Cambridge, Cambridge, UK
- MRC Toxicology Unit, University of Cambridge, Cambridge, UK
| | - Mariavittoria Pizzinga
- MRC Toxicology Unit, University of Cambridge, Cambridge, UK
- Structural Biology Research Centre, Human Technopole, Milan, Italy
| | - Mohamed Elzek
- MRC Toxicology Unit, University of Cambridge, Cambridge, UK
| | - Rayner M L Queiroz
- Cambridge Centre for Proteomics, Department of Biochemistry, University of Cambridge, Cambridge, UK
| | | | - Lisa M Breckels
- Cambridge Centre for Proteomics, Department of Biochemistry, University of Cambridge, Cambridge, UK
| | - Oliver M Crook
- Department of Statistics, University of Oxford, Oxford, UK
| | - Mie Monti
- MRC Toxicology Unit, University of Cambridge, Cambridge, UK
| | - Veronica Dezi
- MRC Toxicology Unit, University of Cambridge, Cambridge, UK
| | - Anne E Willis
- MRC Toxicology Unit, University of Cambridge, Cambridge, UK.
| | - Kathryn S Lilley
- Cambridge Centre for Proteomics, Department of Biochemistry, University of Cambridge, Cambridge, UK.
| |
Collapse
|
11
|
Rutz A, Wolfender JL. Automated Composition Assessment of Natural Extracts: Untargeted Mass Spectrometry-Based Metabolite Profiling Integrating Semiquantitative Detection. JOURNAL OF AGRICULTURAL AND FOOD CHEMISTRY 2023; 71:18010-18023. [PMID: 37949451 PMCID: PMC10683005 DOI: 10.1021/acs.jafc.3c03099] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/12/2023] [Revised: 09/19/2023] [Accepted: 09/22/2023] [Indexed: 11/12/2023]
Abstract
Recent developments in mass spectrometry-based metabolite profiling allow unprecedented qualitative coverage of complex biological extract composition. However, the electrospray ionization used in metabolite profiling generates multiple artifactual signals for a single analyte. This leads to thousands of signals per analysis without satisfactory means of filtering those corresponding to abundant constituents. Generic approaches are therefore needed for the qualitative and quantitative annotation of a broad range of relevant constituents. For this, we used an analytical platform combining liquid chromatography-mass spectrometry (LC-MS) with Charged Aerosol Detection (CAD). We established a generic metabolite profiling for the concomitant recording of qualitative MS data and semiquantitative CAD profiles. The MS features (recorded in high-resolution tandem MS) are grouped and annotated using state-of-the-art tools. To efficiently attribute features to their corresponding extracted and integrated CAD peaks, a custom signal pretreatment and peak-shape comparison workflow is built. This strategy allows us to automatically contextualize features at both major and minor metabolome levels, together with a detailed reporting of their annotation including relevant orthogonal information (taxonomy, retention time). Signals not attributed to CAD peaks are considered minor metabolites. Results are illustrated on an ethanolic extract of Swertia chirayita (Roxb.) H. Karst., a bitter plant of industrial interest, exhibiting the typical complexity of plant extracts as a proof of concept. This generic qualitative and quantitative approach paves the way to automatically assess the composition of single natural extracts of interest or broader collections, thus facilitating new ingredient registrations or natural-extracts-based drug discovery campaigns.
Collapse
Affiliation(s)
- Adriano Rutz
- School
of Pharmaceutical Sciences, University of
Geneva, 1211 Geneva, Switzerland
- Institute
of Pharmaceutical Sciences of Western Switzerland, University of Geneva, 1211 Geneva, Switzerland
- Institute
of Molecular Systems Biology, ETH Zürich, 8093 Zürich, Switzerland
| | - Jean-Luc Wolfender
- School
of Pharmaceutical Sciences, University of
Geneva, 1211 Geneva, Switzerland
- Institute
of Pharmaceutical Sciences of Western Switzerland, University of Geneva, 1211 Geneva, Switzerland
| |
Collapse
|
12
|
Elapavalore A, Kondić T, Singh RR, Shoemaker BA, Thiessen PA, Zhang J, Bolton EE, Schymanski EL. Adding open spectral data to MassBank and PubChem using open source tools to support non-targeted exposomics of mixtures. ENVIRONMENTAL SCIENCE. PROCESSES & IMPACTS 2023; 25:1788-1801. [PMID: 37431591 PMCID: PMC10648001 DOI: 10.1039/d3em00181d] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/28/2023] [Accepted: 06/25/2023] [Indexed: 07/12/2023]
Abstract
The term "exposome" is defined as a comprehensive study of life-course environmental exposures and the associated biological responses. Humans are exposed to many different chemicals, which can pose a major threat to the well-being of humanity. Targeted or non-targeted mass spectrometry techniques are widely used to identify and characterize various environmental stressors when linking exposures to human health. However, identification remains challenging due to the huge chemical space applicable to exposomics, combined with the lack of sufficient relevant entries in spectral libraries. Addressing these challenges requires cheminformatics tools and database resources to share curated open spectral data on chemicals to improve the identification of chemicals in exposomics studies. This article describes efforts to contribute spectra relevant for exposomics to the open mass spectral library MassBank (https://www.massbank.eu) using various open source software efforts, including the R packages RMassBank and Shinyscreen. The experimental spectra were obtained from ten mixtures containing toxicologically relevant chemicals from the US Environmental Protection Agency (EPA) Non-Targeted Analysis Collaborative Trial (ENTACT). Following processing and curation, 5582 spectra from 783 of the 1268 ENTACT compounds were added to MassBank, and through this to other open spectral libraries (e.g., MoNA, GNPS) for community benefit. Additionally, an automated deposition and annotation workflow was developed with PubChem to enable the display of all MassBank mass spectra in PubChem, which is rerun with each MassBank release. The new spectral records have already been used in several studies to increase the confidence in identification in non-target small molecule identification workflows applied to environmental and exposomics research.
Collapse
Affiliation(s)
- Anjana Elapavalore
- Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, 6 Avenue du Swing, 4367, Belvaux, Luxembourg.
| | - Todor Kondić
- Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, 6 Avenue du Swing, 4367, Belvaux, Luxembourg.
| | - Randolph R Singh
- Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, 6 Avenue du Swing, 4367, Belvaux, Luxembourg.
- IFREMER (Institut Français de Recherche pour l'Exploitation de la Mer), Laboratoire Biogéochimie des Contaminants Organiques, Rue de l'Ile d'Yeu, BP 21105, Nantes Cedex 3, 44311, France
| | - Benjamin A Shoemaker
- National Center for Biotechnology Information (NCBI), National Library of Medicine (NLM), National Institutes of Health (NIH), Bethesda, MD, 20894, USA
| | - Paul A Thiessen
- National Center for Biotechnology Information (NCBI), National Library of Medicine (NLM), National Institutes of Health (NIH), Bethesda, MD, 20894, USA
| | - Jian Zhang
- National Center for Biotechnology Information (NCBI), National Library of Medicine (NLM), National Institutes of Health (NIH), Bethesda, MD, 20894, USA
| | - Evan E Bolton
- National Center for Biotechnology Information (NCBI), National Library of Medicine (NLM), National Institutes of Health (NIH), Bethesda, MD, 20894, USA
| | - Emma L Schymanski
- Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, 6 Avenue du Swing, 4367, Belvaux, Luxembourg.
| |
Collapse
|
13
|
Rempfert KR, Kraus EA, Nothaft DB, Dildar N, Spear JR, Sepúlveda J, Templeton AS. Intact polar lipidome and membrane adaptations of microbial communities inhabiting serpentinite-hosted fluids. Front Microbiol 2023; 14:1198786. [PMID: 38029177 PMCID: PMC10667739 DOI: 10.3389/fmicb.2023.1198786] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2023] [Accepted: 09/25/2023] [Indexed: 12/01/2023] Open
Abstract
The generation of hydrogen and reduced carbon compounds during serpentinization provides sustained energy for microorganisms on Earth, and possibly on other extraterrestrial bodies (e.g., Mars, icy satellites). However, the geochemical conditions that arise from water-rock reaction also challenge the known limits of microbial physiology, such as hyperalkaline pH, limited electron acceptors and inorganic carbon. Because cell membranes act as a primary barrier between a cell and its environment, lipids are a vital component in microbial acclimation to challenging physicochemical conditions. To probe the diversity of cell membrane lipids produced in serpentinizing settings and identify membrane adaptations to this environment, we conducted the first comprehensive intact polar lipid (IPL) biomarker survey of microbial communities inhabiting the subsurface at a terrestrial site of serpentinization. We used an expansive, custom environmental lipid database that expands the application of targeted and untargeted lipodomics in the study of microbial and biogeochemical processes. IPLs extracted from serpentinite-hosted fluid communities were comprised of >90% isoprenoidal and non-isoprenoidal diether glycolipids likely produced by archaeal methanogens and sulfate-reducing bacteria. Phospholipids only constituted ~1% of the intact polar lipidome. In addition to abundant diether glycolipids, betaine and trimethylated-ornithine aminolipids and glycosphingolipids were also detected, indicating pervasive membrane modifications in response to phosphate limitation. The carbon oxidation state of IPL backbones was positively correlated with the reduction potential of fluids, which may signify an energy conservation strategy for lipid synthesis. Together, these data suggest microorganisms inhabiting serpentinites possess a unique combination of membrane adaptations that allow for their survival in polyextreme environments. The persistence of IPLs in fluids beyond the presence of their source organisms, as indicated by 16S rRNA genes and transcripts, is promising for the detection of extinct life in serpentinizing settings through lipid biomarker signatures. These data contribute new insights into the complexity of lipid structures generated in actively serpentinizing environments and provide valuable context to aid in the reconstruction of past microbial activity from fossil lipid records of terrestrial serpentinites and the search for biosignatures elsewhere in our solar system.
Collapse
Affiliation(s)
- Kaitlin R. Rempfert
- Department of Geological Sciences, University of Colorado, Boulder, CO, United States
| | - Emily A. Kraus
- Department of Civil and Environmental Engineering, Colorado School of Mines, Golden, CO, United States
| | - Daniel B. Nothaft
- Department of Geological Sciences, University of Colorado, Boulder, CO, United States
| | - Nadia Dildar
- Department of Geological Sciences, University of Colorado, Boulder, CO, United States
| | - John R. Spear
- Department of Civil and Environmental Engineering, Colorado School of Mines, Golden, CO, United States
- Department of Quantitative Biosciences and Engineering, Colorado School of Mines, Golden, CO, United States
| | - Julio Sepúlveda
- Department of Geological Sciences, University of Colorado, Boulder, CO, United States
| | - Alexis S. Templeton
- Department of Geological Sciences, University of Colorado, Boulder, CO, United States
| |
Collapse
|
14
|
Zhu R, Chen H, Liu M, Xu Y, Jiang W, Si X, Yi L, Gu Y, Ren D, Wang J. Nontargeted screening of aldehydes and ketones by chemical isotope labeling combined with ultra-high performance liquid chromatography-high resolution mass spectrometry followed by hybrid filtering of features. J Chromatogr A 2023; 1708:464332. [PMID: 37703764 DOI: 10.1016/j.chroma.2023.464332] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2023] [Revised: 08/19/2023] [Accepted: 08/24/2023] [Indexed: 09/15/2023]
Abstract
Aldehydes and ketones are important carbonyl compounds that are widely present in foodstuffs, biological organisms and human living environment. However, it is still challenging to comprehensively detect and capture them using liquid chromatography - mass spectrometry. In this work, a chemical isotope labeling (CIL) coupled with ultra-high performance liquid chromatography - high resolution mass spectrometry (UHPLC-HRMS) strategy was developed for the capture and detection of this class of compounds. 2,4-Dinitrophenylhydrazine (DNPH) and isotope-labeled DNPH (DNPH-d3) were utilized to selectively label the target analytes. To address the difficulties in processing UHPLC-HRMS data, a post-acquisition data processing method called MSFilter was proposed to facilitate the screening and identification aldehydes and ketones in complex matrices. The MSFilter consists of four independent filters, namely statistical characteristic-based filtering, mass defect filtering, CIL paired peaks filtering, and diagnostic fragmentation ion filtering. These filters can be used individually or in combination to eliminate unrelated interfering MS features and efficiently detect DNPH-labeled aldehydes and ketones. The results of a mixture containing 48 model compounds showed that although all individual filtering methods could significantly reduce more than 95% of the raw MS features with acceptable recall rates above 85%, but they had relatively high false positive ratios of over 90%. In comparison, the hybrid filtering method combining four filters is able to eliminate massive interfering features (> 99.5%) with a high recall rate of 81.25% and a much lower false positive ratio of 15.22%. By implementing the hybrid filtering method in MSFilter, a total of 154 features were identified as potential signals of CCs from the original 45,961 features of real tobacco samples, of which 70 were annotated. We believe that the proposed strategy is promising to analyze the potential CCs in complex samples by UHPLC-HRMS.
Collapse
Affiliation(s)
- Ruizhi Zhu
- Yunnan Key Laboratory of Tobacco Chemistry, R&D Center of China Tobacco Yunnan Industrial Co., Ltd., Kunming 650231, China
| | - Han Chen
- Yunnan Key Laboratory of Tobacco Chemistry, R&D Center of China Tobacco Yunnan Industrial Co., Ltd., Kunming 650231, China; Faculty of Agriculture and Food, Kunming University of Science and Technology, Kunming, 650500, China
| | - Meiyan Liu
- Yunnan Key Laboratory of Tobacco Chemistry, R&D Center of China Tobacco Yunnan Industrial Co., Ltd., Kunming 650231, China; Faculty of Agriculture and Food, Kunming University of Science and Technology, Kunming, 650500, China
| | - Yanqun Xu
- Yunnan Key Laboratory of Tobacco Chemistry, R&D Center of China Tobacco Yunnan Industrial Co., Ltd., Kunming 650231, China
| | - Wei Jiang
- Yunnan Key Laboratory of Tobacco Chemistry, R&D Center of China Tobacco Yunnan Industrial Co., Ltd., Kunming 650231, China
| | - Xiaoxi Si
- Yunnan Key Laboratory of Tobacco Chemistry, R&D Center of China Tobacco Yunnan Industrial Co., Ltd., Kunming 650231, China
| | - Lunzhao Yi
- Faculty of Agriculture and Food, Kunming University of Science and Technology, Kunming, 650500, China
| | - Ying Gu
- Faculty of Agriculture and Food, Kunming University of Science and Technology, Kunming, 650500, China
| | - Dabing Ren
- Faculty of Agriculture and Food, Kunming University of Science and Technology, Kunming, 650500, China.
| | - Juan Wang
- College of Arts and Sciences·Kunming, Kunming, 650221, China.
| |
Collapse
|
15
|
Zheng J, Yang J, Zhao F, Peng B, Wang Y, Fang M. CIL-ExPMRM: An Ultrasensitive Chemical Isotope Labeling Assisted Pseudo-MRM Platform to Accelerate Exposomic Suspect Screening. ENVIRONMENTAL SCIENCE & TECHNOLOGY 2023; 57:10962-10973. [PMID: 37469223 DOI: 10.1021/acs.est.3c01830] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/21/2023]
Abstract
Exposome is the future of next-generation environmental health to establish the association between environmental exposure and diseases. However, due to low concentrations of exposure chemicals, exposome has been hampered by lacking an effective analytical platform to characterize its composition. In this study, by combining the benefit of chemical isotope labeling and pseudo-multiple reaction monitoring (CIL-pseudo-MRM), we have developed one highly sensitive and high-throughput platform (CIL-ExPMRM) by isotope labeling urinary exposure biomarkers. Dansyl chloride (DnsCl), N-methylphenylethylamine (MPEA), and their isotope-labeled forms were used to derivatize polar hydroxyl and carboxyl compounds, respectively. We have programmed a series of scripts to optimize MRM transition parameters, curate the MRM database (>70,000 compounds), predict accurate retention time (RT), and automize dynamic MRMs. This was followed by an automated MRM peak assignment, peak alignment, and statistical analysis. A computational pipeline was eventually incorporated into a user-friendly website interface, named CIL-ExPMRM (http://www.exposomemrm.com/). The performance of this platform has been validated with a relatively low false positive rate (10.7%) across instrumental platforms. CIL-ExPMRM has systematically overcome key bottlenecks of exposome studies to some extent and outperforms previous methods due to its independence of MS/MS availability, accurate RT prediction, and collision energy optimization, as well as the ultrasensitivity and automated robust intensity-based quantification. Overall, CIL-ExPMRM has great potential to advance the exposomic studies based on urinary biomarkers.
Collapse
Affiliation(s)
- Jie Zheng
- Singapore Phenome Center, Lee Kong Chian School of Medicine, Nanyang Technological University, 639798 Singapore
| | - Junjie Yang
- School of Civil and Environmental Engineering, Nanyang Technological University, 639798 Singapore
| | - Fanrong Zhao
- School of Civil and Environmental Engineering, Nanyang Technological University, 639798 Singapore
| | - Bo Peng
- School of Civil and Environmental Engineering, Nanyang Technological University, 639798 Singapore
| | - Yulan Wang
- Singapore Phenome Center, Lee Kong Chian School of Medicine, Nanyang Technological University, 639798 Singapore
| | - Mingliang Fang
- School of Civil and Environmental Engineering, Nanyang Technological University, 639798 Singapore
- Department of Environmental Science and Engineering, Fudan University, Shanghai 200433, China
| |
Collapse
|
16
|
Moloney NM, Barylyuk K, Tromer E, Crook OM, Breckels LM, Lilley KS, Waller RF, MacGregor P. Mapping diversity in African trypanosomes using high resolution spatial proteomics. Nat Commun 2023; 14:4401. [PMID: 37479728 PMCID: PMC10361982 DOI: 10.1038/s41467-023-40125-z] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2022] [Accepted: 07/06/2023] [Indexed: 07/23/2023] Open
Abstract
African trypanosomes are dixenous eukaryotic parasites that impose a significant human and veterinary disease burden on sub-Saharan Africa. Diversity between species and life-cycle stages is concomitant with distinct host and tissue tropisms within this group. Here, the spatial proteomes of two African trypanosome species, Trypanosoma brucei and Trypanosoma congolense, are mapped across two life-stages. The four resulting datasets provide evidence of expression of approximately 5500 proteins per cell-type. Over 2500 proteins per cell-type are classified to specific subcellular compartments, providing four comprehensive spatial proteomes. Comparative analysis reveals key routes of parasitic adaptation to different biological niches and provides insight into the molecular basis for diversity within and between these pathogen species.
Collapse
Affiliation(s)
- Nicola M Moloney
- Department of Biochemistry, University of Cambridge, Cambridge, CB2 1QW, UK
| | | | - Eelco Tromer
- Cell Biochemistry, Groningen Biomolecular Sciences and Biotechnology Institute, University of Groningen, 9747 AG, Groningen, Netherlands
| | - Oliver M Crook
- Department of Biochemistry, University of Cambridge, Cambridge, CB2 1QW, UK
- Department of Statistics, University of Oxford, Oxford, OX1 3LB, UK
| | - Lisa M Breckels
- Department of Biochemistry, University of Cambridge, Cambridge, CB2 1QW, UK
| | - Kathryn S Lilley
- Department of Biochemistry, University of Cambridge, Cambridge, CB2 1QW, UK
| | - Ross F Waller
- Department of Biochemistry, University of Cambridge, Cambridge, CB2 1QW, UK
| | - Paula MacGregor
- Department of Biochemistry, University of Cambridge, Cambridge, CB2 1QW, UK.
- School of Biological Sciences, University of Bristol, Bristol, BS8 1TQ, UK.
| |
Collapse
|
17
|
Schuhmacher JS, Tom Dieck S, Christoforidis S, Landerer C, Davila Gallesio J, Hersemann L, Seifert S, Schäfer R, Giner A, Toth-Petroczy A, Kalaidzidis Y, Bohnsack KE, Bohnsack MT, Schuman EM, Zerial M. The Rab5 effector FERRY links early endosomes with mRNA localization. Mol Cell 2023; 83:1839-1855.e13. [PMID: 37267905 DOI: 10.1016/j.molcel.2023.05.012] [Citation(s) in RCA: 9] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2022] [Revised: 12/06/2022] [Accepted: 05/08/2023] [Indexed: 06/04/2023]
Abstract
Localized translation is vital to polarized cells and requires precise and robust distribution of different mRNAs and ribosomes across the cell. However, the underlying molecular mechanisms are poorly understood and important players are lacking. Here, we discovered a Rab5 effector, the five-subunit endosomal Rab5 and RNA/ribosome intermediary (FERRY) complex, that recruits mRNAs and ribosomes to early endosomes through direct mRNA-interaction. FERRY displays preferential binding to certain groups of transcripts, including mRNAs encoding mitochondrial proteins. Deletion of FERRY subunits reduces the endosomal localization of transcripts in cells and has a significant impact on mRNA levels. Clinical studies show that genetic disruption of FERRY causes severe brain damage. We found that, in neurons, FERRY co-localizes with mRNA on early endosomes, and mRNA loaded FERRY-positive endosomes are in close proximity of mitochondria. FERRY thus transforms endosomes into mRNA carriers and plays a key role in regulating mRNA distribution and transport.
Collapse
Affiliation(s)
- Jan S Schuhmacher
- Max Planck Institute of Molecular Cell Biology and Genetics, Pfotenhauerstrasse 108, 01307 Dresden, Germany
| | - Susanne Tom Dieck
- Max Planck Institute for Brain Research, Max-von-Laue-Str. 4, 60438 Frankfurt am Main, Germany
| | - Savvas Christoforidis
- Biomedical Research Institute, Foundation for Research and Technology, 45110 Ioannina, Greece; Laboratory of Biological Chemistry, Department of Medicine, School of Health Sciences, University of Ioannina, 45110 Ioannina, Greece
| | - Cedric Landerer
- Max Planck Institute of Molecular Cell Biology and Genetics, Pfotenhauerstrasse 108, 01307 Dresden, Germany; Center for Systems Biology Dresden, Pfotenhauerstrasse 108, 01307 Dresden, Germany
| | - Jimena Davila Gallesio
- Department of Molecular Biology, University Medical Center Göttingen, Humboldtallee 23, 37073 Göttingen, Germany
| | - Lena Hersemann
- Max Planck Institute of Molecular Cell Biology and Genetics, Pfotenhauerstrasse 108, 01307 Dresden, Germany
| | - Sarah Seifert
- Max Planck Institute of Molecular Cell Biology and Genetics, Pfotenhauerstrasse 108, 01307 Dresden, Germany
| | - Ramona Schäfer
- Max Planck Institute of Molecular Cell Biology and Genetics, Pfotenhauerstrasse 108, 01307 Dresden, Germany
| | - Angelika Giner
- Max Planck Institute of Molecular Cell Biology and Genetics, Pfotenhauerstrasse 108, 01307 Dresden, Germany
| | - Agnes Toth-Petroczy
- Max Planck Institute of Molecular Cell Biology and Genetics, Pfotenhauerstrasse 108, 01307 Dresden, Germany; Center for Systems Biology Dresden, Pfotenhauerstrasse 108, 01307 Dresden, Germany
| | - Yannis Kalaidzidis
- Max Planck Institute of Molecular Cell Biology and Genetics, Pfotenhauerstrasse 108, 01307 Dresden, Germany
| | - Katherine E Bohnsack
- Department of Molecular Biology, University Medical Center Göttingen, Humboldtallee 23, 37073 Göttingen, Germany
| | - Markus T Bohnsack
- Department of Molecular Biology, University Medical Center Göttingen, Humboldtallee 23, 37073 Göttingen, Germany; Göttingen Centre for Molecular Biosciences, University of Göttingen, Justus-von-Liebig-Weg 11, 37077 Göttingen, Germany; Max Planck Institute for Multidisciplinary Sciences, Am Fassberg 11, 37077 Göttingen, Germany
| | - Erin M Schuman
- Max Planck Institute for Brain Research, Max-von-Laue-Str. 4, 60438 Frankfurt am Main, Germany
| | - Marino Zerial
- Max Planck Institute of Molecular Cell Biology and Genetics, Pfotenhauerstrasse 108, 01307 Dresden, Germany; Center for Systems Biology Dresden, Pfotenhauerstrasse 108, 01307 Dresden, Germany.
| |
Collapse
|
18
|
Vangeenderhuysen P, Van Arnhem J, Pomian B, De Graeve M, De Commer L, Falony G, Raes J, Zhernakova A, Fu J, Hemeryck LY, Vanhaecke L. Dual UHPLC-HRMS Metabolomics and Lipidomics and Automated Data Processing Workflow for Comprehensive High-Throughput Gut Phenotyping. Anal Chem 2023. [PMID: 37220321 DOI: 10.1021/acs.analchem.2c05371] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/25/2023]
Abstract
In recent years, feces has surfaced as the matrix of choice for investigating the gut microbiome-health axis because of its non-invasive sampling and the unique reflection it offers of an individual's lifestyle. In cohort studies where the number of samples required is large, but availability is scarce, a clear need exists for high-throughput analyses. Such analyses should combine a wide physicochemical range of molecules with a minimal amount of sample and resources and downstream data processing workflows that are as automated and time efficient as possible. We present a dual fecal extraction and ultra high performance liquid chromatography-high resolution-quadrupole-orbitrap-mass spectrometry (UHPLC-HR-Q-Orbitrap-MS)-based workflow that enables widely targeted and untargeted metabolome and lipidome analysis. A total of 836 in-house standards were analyzed, of which 360 metabolites and 132 lipids were consequently detected in feces. Their targeted profiling was validated successfully with respect to repeatability (78% CV < 20%), reproducibility (82% CV < 20%), and linearity (81% R2 > 0.9), while also enabling holistic untargeted fingerprinting (15,319 features, CV < 30%). To automate targeted processing, we optimized an R-based targeted peak extraction (TaPEx) algorithm relying on a database comprising retention time and mass-to-charge ratio (360 metabolites and 132 lipids), with batch-specific quality control curation. The latter was benchmarked toward vendor-specific targeted and untargeted software and our isotopologue parameter optimization/XCMS-based untargeted pipeline in LifeLines Deep cohort samples (n = 97). TaPEx clearly outperformed the untargeted approaches (81.3 vs 56.7-66.0% compounds detected). Finally, our novel dual fecal metabolomics-lipidomics-TaPEx method was successfully applied to Flemish Gut Flora Project cohort (n = 292) samples, leading to a sample-to-result time reduction of 60%.
Collapse
Affiliation(s)
- P Vangeenderhuysen
- Laboratory of Integrative Metabolomics (LIMET), Department of Translational Physiology, Infectiology and Public Health, Faculty of Veterinary Medicine, Ghent University, Salisburylaan 133, 9820 Merelbeke, Belgium
| | - J Van Arnhem
- Laboratory of Integrative Metabolomics (LIMET), Department of Translational Physiology, Infectiology and Public Health, Faculty of Veterinary Medicine, Ghent University, Salisburylaan 133, 9820 Merelbeke, Belgium
| | - B Pomian
- Laboratory of Integrative Metabolomics (LIMET), Department of Translational Physiology, Infectiology and Public Health, Faculty of Veterinary Medicine, Ghent University, Salisburylaan 133, 9820 Merelbeke, Belgium
| | - M De Graeve
- Laboratory of Integrative Metabolomics (LIMET), Department of Translational Physiology, Infectiology and Public Health, Faculty of Veterinary Medicine, Ghent University, Salisburylaan 133, 9820 Merelbeke, Belgium
| | - L De Commer
- Department of Microbiology and Immunology, Rega Institute, KU Leuven, Herestraat 49, 3000 Leuven, Belgium
- VIB, Center for Microbiology, Gaston Geenslaan 1, 3001 Leuven, Belgium
| | - G Falony
- Department of Microbiology and Immunology, Rega Institute, KU Leuven, Herestraat 49, 3000 Leuven, Belgium
- VIB, Center for Microbiology, Gaston Geenslaan 1, 3001 Leuven, Belgium
| | - J Raes
- Department of Microbiology and Immunology, Rega Institute, KU Leuven, Herestraat 49, 3000 Leuven, Belgium
- VIB, Center for Microbiology, Gaston Geenslaan 1, 3001 Leuven, Belgium
| | - A Zhernakova
- Department of Genetics, University of Groningen, Antonius Deusinglaan 1, 9700 AB Groningen, The Netherlands
| | - J Fu
- Department of Genetics, University of Groningen, Antonius Deusinglaan 1, 9700 AB Groningen, The Netherlands
- Department of Pediatrics, University of Groningen, Hanzeplein 1, 9713 GZ Groningen, The Netherlands
| | - L Y Hemeryck
- Laboratory of Integrative Metabolomics (LIMET), Department of Translational Physiology, Infectiology and Public Health, Faculty of Veterinary Medicine, Ghent University, Salisburylaan 133, 9820 Merelbeke, Belgium
| | - L Vanhaecke
- Laboratory of Integrative Metabolomics (LIMET), Department of Translational Physiology, Infectiology and Public Health, Faculty of Veterinary Medicine, Ghent University, Salisburylaan 133, 9820 Merelbeke, Belgium
- Institute for Global Food Security, School of Biological Sciences, Queen's University, University Road, BT7 1NN Belfast, Northern Ireland, U.K
| |
Collapse
|
19
|
Bai M, Deng J, Dai C, Pfeuffer J, Sachsenberg T, Perez-Riverol Y. LFQ-Based Peptide and Protein Intensity Differential Expression Analysis. J Proteome Res 2023. [PMID: 37220883 DOI: 10.1021/acs.jproteome.2c00812] [Citation(s) in RCA: 10] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/25/2023]
Abstract
Testing for significant differences in quantities at the protein level is a common goal of many LFQ-based mass spectrometry proteomics experiments. Starting from a table of protein and/or peptide quantities from a given proteomics quantification software, many tools and R packages exist to perform the final tasks of imputation, summarization, normalization, and statistical testing. To evaluate the effects of packages and settings in their substeps on the final list of significant proteins, we studied several packages on three public data sets with known expected protein fold changes. We found that the results between packages and even across different parameters of the same package can vary significantly. In addition to usability aspects and feature/compatibility lists of different packages, this paper highlights sensitivity and specificity trade-offs that come with specific packages and settings.
Collapse
Affiliation(s)
- Mingze Bai
- Chongqing Key Laboratory of Big Data for Bio Intelligence, Chongqing University of Posts and Telecommunications, Chongqing 400065, China
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Beijing Institute of Life Omics, Beijing 102206, China
| | - Jingwen Deng
- Chongqing Key Laboratory of Big Data for Bio Intelligence, Chongqing University of Posts and Telecommunications, Chongqing 400065, China
| | - Chengxin Dai
- Chongqing Key Laboratory of Big Data for Bio Intelligence, Chongqing University of Posts and Telecommunications, Chongqing 400065, China
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Beijing Institute of Life Omics, Beijing 102206, China
| | - Julianus Pfeuffer
- Algorithmic Bioinformatics, Freie Universität Berlin, Berlin 14195, Germany
- Visualization and Data Analysis, Zuse Institute Berlin, Berlin 14195, Germany
| | - Timo Sachsenberg
- Institute for Bioinformatics and Medical Informatics, University of Tübingen, Tübingen 72076, Germany
| | - Yasset Perez-Riverol
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hixton, Cambridge CB10 1SD, United Kingdom
| |
Collapse
|
20
|
Ternet C, Junk P, Sevrin T, Catozzi S, Wåhlén E, Heldin J, Oliviero G, Wynne K, Kiel C. Analysis of context-specific KRAS-effector (sub)complexes in Caco-2 cells. Life Sci Alliance 2023; 6:e202201670. [PMID: 36894174 PMCID: PMC9998658 DOI: 10.26508/lsa.202201670] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2022] [Revised: 02/24/2023] [Accepted: 02/27/2023] [Indexed: 03/11/2023] Open
Abstract
Ras is a key switch controlling cell behavior. In the GTP-bound form, Ras interacts with numerous effectors in a mutually exclusive manner, where individual Ras-effectors are likely part of larger cellular (sub)complexes. The molecular details of these (sub)complexes and their alteration in specific contexts are not understood. Focusing on KRAS, we performed affinity purification (AP)-mass spectrometry (MS) experiments of exogenously expressed FLAG-KRAS WT and three oncogenic mutants ("genetic contexts") in the human Caco-2 cell line, each exposed to 11 different culture media ("culture contexts") that mimic conditions relevant in the colon and colorectal cancer. We identified four effectors present in complex with KRAS in all genetic and growth contexts ("context-general effectors"). Seven effectors are found in KRAS complexes in only some contexts ("context-specific effectors"). Analyzing all interactors in complex with KRAS per condition, we find that the culture contexts had a larger impact on interaction rewiring than genetic contexts. We investigated how changes in the interactome impact functional outcomes and created a Shiny app for interactive visualization. We validated some of the functional differences in metabolism and proliferation. Finally, we used networks to evaluate how KRAS-effectors are involved in the modulation of functions by random walk analyses of effector-mediated (sub)complexes. Altogether, our work shows the impact of environmental contexts on network rewiring, which provides insights into tissue-specific signaling mechanisms. This may also explain why KRAS oncogenic mutants may be causing cancer only in specific tissues despite KRAS being expressed in most cells and tissues.
Collapse
Affiliation(s)
- Camille Ternet
- Systems Biology Ireland, School of Medicine, University College Dublin, Dublin 4, Ireland
- UCD Charles Institute of Dermatology, School of Medicine, University College Dublin, Dublin 4, Ireland
| | - Philipp Junk
- Systems Biology Ireland, School of Medicine, University College Dublin, Dublin 4, Ireland
- UCD Charles Institute of Dermatology, School of Medicine, University College Dublin, Dublin 4, Ireland
| | - Thomas Sevrin
- Systems Biology Ireland, School of Medicine, University College Dublin, Dublin 4, Ireland
- UCD Charles Institute of Dermatology, School of Medicine, University College Dublin, Dublin 4, Ireland
| | - Simona Catozzi
- Systems Biology Ireland, School of Medicine, University College Dublin, Dublin 4, Ireland
- UCD Charles Institute of Dermatology, School of Medicine, University College Dublin, Dublin 4, Ireland
| | - Erik Wåhlén
- Department of Pharmaceutical Biosciences, Uppsala University, Uppsala, Sweden
| | - Johan Heldin
- Department of Pharmaceutical Biosciences, Uppsala University, Uppsala, Sweden
| | - Giorgio Oliviero
- Systems Biology Ireland, School of Medicine, University College Dublin, Dublin 4, Ireland
| | - Kieran Wynne
- Systems Biology Ireland, School of Medicine, University College Dublin, Dublin 4, Ireland
- Conway Institute of Biomolecular & Biomedical Research, University College Dublin, Dublin 4, Ireland
| | - Christina Kiel
- Department of Molecular Medicine, University of Pavia, Pavia, Italy
- Systems Biology Ireland, School of Medicine, University College Dublin, Dublin 4, Ireland
- UCD Charles Institute of Dermatology, School of Medicine, University College Dublin, Dublin 4, Ireland
| |
Collapse
|
21
|
Rao A, de Kok NAW, Driessen AJM. Membrane Adaptations and Cellular Responses of Sulfolobus acidocaldarius to the Allylamine Terbinafine. Int J Mol Sci 2023; 24:ijms24087328. [PMID: 37108491 PMCID: PMC10138448 DOI: 10.3390/ijms24087328] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2023] [Revised: 04/11/2023] [Accepted: 04/13/2023] [Indexed: 04/29/2023] Open
Abstract
Cellular membranes are essential for compartmentalization, maintenance of permeability, and fluidity in all three domains of life. Archaea belong to the third domain of life and have a distinct phospholipid composition. Membrane lipids of archaea are ether-linked molecules, specifically bilayer-forming dialkyl glycerol diethers (DGDs) and monolayer-forming glycerol dialkyl glycerol tetraethers (GDGTs). The antifungal allylamine terbinafine has been proposed as an inhibitor of GDGT biosynthesis in archaea based on radiolabel incorporation studies. The exact target(s) and mechanism of action of terbinafine in archaea remain elusive. Sulfolobus acidocaldarius is a strictly aerobic crenarchaeon thriving in a thermoacidophilic environment, and its membrane is dominated by GDGTs. Here, we comprehensively analyzed the lipidome and transcriptome of S. acidocaldarius in the presence of terbinafine. Depletion of GDGTs and the accompanying accumulation of DGDs upon treatment with terbinafine were growth phase-dependent. Additionally, a major shift in the saturation of caldariellaquinones was observed, which resulted in the accumulation of unsaturated molecules. Transcriptomic data indicated that terbinafine has a multitude of effects, including significant differential expression of genes in the respiratory complex, motility, cell envelope, fatty acid metabolism, and GDGT cyclization. Combined, these findings suggest that the response of S. acidocaldarius to terbinafine inhibition involves respiratory stress and the differential expression of genes involved in isoprenoid biosynthesis and saturation.
Collapse
Affiliation(s)
- Alka Rao
- Department of Molecular Microbiology, Groningen Biomolecular Science and Biotechnology Institute, University of Groningen, 9747 AG Groningen, The Netherlands
| | - Niels A W de Kok
- Department of Chemical and Pharmaceutical Biology, Groningen Research Institute of Pharmacy, University of Groningen, Antonius Deusinglaan 1, 9713 AV Groningen, The Netherlands
| | - Arnold J M Driessen
- Department of Molecular Microbiology, Groningen Biomolecular Science and Biotechnology Institute, University of Groningen, 9747 AG Groningen, The Netherlands
| |
Collapse
|
22
|
Burr SP, Klimm F, Glynos A, Prater M, Sendon P, Nash P, Powell CA, Simard ML, Bonekamp NA, Charl J, Diaz H, Bozhilova LV, Nie Y, Zhang H, Frison M, Falkenberg M, Jones N, Minczuk M, Stewart JB, Chinnery PF. Cell lineage-specific mitochondrial resilience during mammalian organogenesis. Cell 2023; 186:1212-1229.e21. [PMID: 36827974 DOI: 10.1016/j.cell.2023.01.034] [Citation(s) in RCA: 11] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2022] [Revised: 10/28/2022] [Accepted: 01/26/2023] [Indexed: 02/25/2023]
Abstract
Mitochondrial activity differs markedly between organs, but it is not known how and when this arises. Here we show that cell lineage-specific expression profiles involving essential mitochondrial genes emerge at an early stage in mouse development, including tissue-specific isoforms present before organ formation. However, the nuclear transcriptional signatures were not independent of organelle function. Genetically disrupting intra-mitochondrial protein synthesis with two different mtDNA mutations induced cell lineage-specific compensatory responses, including molecular pathways not previously implicated in organellar maintenance. We saw downregulation of genes whose expression is known to exacerbate the effects of exogenous mitochondrial toxins, indicating a transcriptional adaptation to mitochondrial dysfunction during embryonic development. The compensatory pathways were both tissue and mutation specific and under the control of transcription factors which promote organelle resilience. These are likely to contribute to the tissue specificity which characterizes human mitochondrial diseases and are potential targets for organ-directed treatments.
Collapse
Affiliation(s)
- Stephen P Burr
- Department of Clinical Neurosciences, School of Clinical Medicine, University of Cambridge, Cambridge Biomedical Campus, Cambridge, UK; Medical Research Council Mitochondrial Biology Unit, University of Cambridge, Cambridge Biomedical Campus, Cambridge, UK
| | - Florian Klimm
- Department of Clinical Neurosciences, School of Clinical Medicine, University of Cambridge, Cambridge Biomedical Campus, Cambridge, UK; Medical Research Council Mitochondrial Biology Unit, University of Cambridge, Cambridge Biomedical Campus, Cambridge, UK; Department of Mathematics, Imperial College London, London, UK; EPSRC Centre for Mathematics of Precision Healthcare, Imperial College, London, UK; Department of Computational Molecular Biology, Max Planck Institute for Molecular Genetics, Ihnestraße 63-73, D-14195 Berlin, Germany; Department of Computer Science, Freie Universität Berlin, Arnimallee 3, D-14195 Berlin, Germany
| | - Angelos Glynos
- Department of Clinical Neurosciences, School of Clinical Medicine, University of Cambridge, Cambridge Biomedical Campus, Cambridge, UK; Medical Research Council Mitochondrial Biology Unit, University of Cambridge, Cambridge Biomedical Campus, Cambridge, UK
| | - Malwina Prater
- Department of Clinical Neurosciences, School of Clinical Medicine, University of Cambridge, Cambridge Biomedical Campus, Cambridge, UK; Medical Research Council Mitochondrial Biology Unit, University of Cambridge, Cambridge Biomedical Campus, Cambridge, UK
| | - Pamella Sendon
- Max Planck Institute for Biology of Ageing, Cologne, Germany
| | - Pavel Nash
- Medical Research Council Mitochondrial Biology Unit, University of Cambridge, Cambridge Biomedical Campus, Cambridge, UK
| | - Christopher A Powell
- Medical Research Council Mitochondrial Biology Unit, University of Cambridge, Cambridge Biomedical Campus, Cambridge, UK
| | | | - Nina A Bonekamp
- Max Planck Institute for Biology of Ageing, Cologne, Germany; Department of Neuroanatomy, Mannheim Centre for Translational Neuroscience (MCTN), Medical Faculty Mannheim/Heidelberg University, Heidelberg, Germany
| | - Julia Charl
- Institute of Biochemistry, University of Cologne, Otto-Fischer-Strasse 12-14, Cologne, Germany
| | - Hector Diaz
- Department of Medical Biochemistry and Cell Biology, University of Gothenburg, PO Box 440, Gothenburg 405 30, Sweden
| | - Lyuba V Bozhilova
- Department of Clinical Neurosciences, School of Clinical Medicine, University of Cambridge, Cambridge Biomedical Campus, Cambridge, UK; Medical Research Council Mitochondrial Biology Unit, University of Cambridge, Cambridge Biomedical Campus, Cambridge, UK
| | - Yu Nie
- Department of Clinical Neurosciences, School of Clinical Medicine, University of Cambridge, Cambridge Biomedical Campus, Cambridge, UK; Medical Research Council Mitochondrial Biology Unit, University of Cambridge, Cambridge Biomedical Campus, Cambridge, UK
| | - Haixin Zhang
- Department of Clinical Neurosciences, School of Clinical Medicine, University of Cambridge, Cambridge Biomedical Campus, Cambridge, UK; Medical Research Council Mitochondrial Biology Unit, University of Cambridge, Cambridge Biomedical Campus, Cambridge, UK
| | - Michele Frison
- Department of Clinical Neurosciences, School of Clinical Medicine, University of Cambridge, Cambridge Biomedical Campus, Cambridge, UK; Medical Research Council Mitochondrial Biology Unit, University of Cambridge, Cambridge Biomedical Campus, Cambridge, UK
| | - Maria Falkenberg
- Department of Medical Biochemistry and Cell Biology, University of Gothenburg, PO Box 440, Gothenburg 405 30, Sweden
| | - Nick Jones
- Department of Mathematics, Imperial College London, London, UK
| | - Michal Minczuk
- Medical Research Council Mitochondrial Biology Unit, University of Cambridge, Cambridge Biomedical Campus, Cambridge, UK
| | - James B Stewart
- Max Planck Institute for Biology of Ageing, Cologne, Germany; Biosciences Institute, Faculty of Medical Sciences, Wellcome Centre for Mitochondrial Research, Newcastle University, Newcastle upon Tyne, UK
| | - Patrick F Chinnery
- Department of Clinical Neurosciences, School of Clinical Medicine, University of Cambridge, Cambridge Biomedical Campus, Cambridge, UK; Medical Research Council Mitochondrial Biology Unit, University of Cambridge, Cambridge Biomedical Campus, Cambridge, UK.
| |
Collapse
|
23
|
Du X, Dastmalchi F, Ye H, Garrett TJ, Diller MA, Liu M, Hogan WR, Brochhausen M, Lemas DJ. Evaluating LC-HRMS metabolomics data processing software using FAIR principles for research software. Metabolomics 2023; 19:11. [PMID: 36745241 DOI: 10.1007/s11306-023-01974-3] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/08/2022] [Accepted: 01/20/2023] [Indexed: 02/07/2023]
Abstract
BACKGROUND Liquid chromatography-high resolution mass spectrometry (LC-HRMS) is a popular approach for metabolomics data acquisition and requires many data processing software tools. The FAIR Principles - Findability, Accessibility, Interoperability, and Reusability - were proposed to promote open science and reusable data management, and to maximize the benefit obtained from contemporary and formal scholarly digital publishing. More recently, the FAIR principles were extended to include Research Software (FAIR4RS). AIM OF REVIEW This study facilitates open science in metabolomics by providing an implementation solution for adopting FAIR4RS in the LC-HRMS metabolomics data processing software. We believe our evaluation guidelines and results can help improve the FAIRness of research software. KEY SCIENTIFIC CONCEPTS OF REVIEW We evaluated 124 LC-HRMS metabolomics data processing software obtained from a systematic review and selected 61 software for detailed evaluation using FAIR4RS-related criteria, which were extracted from the literature along with internal discussions. We assigned each criterion one or more FAIR4RS categories through discussion. The minimum, median, and maximum percentages of criteria fulfillment of software were 21.6%, 47.7%, and 71.8%. Statistical analysis revealed no significant improvement in FAIRness over time. We identified four criteria covering multiple FAIR4RS categories but had a low %fulfillment: (1) No software had semantic annotation of key information; (2) only 6.3% of evaluated software were registered to Zenodo and received DOIs; (3) only 14.5% of selected software had official software containerization or virtual machine; (4) only 16.7% of evaluated software had a fully documented functions in code. According to the results, we discussed improvement strategies and future directions.
Collapse
Affiliation(s)
- Xinsong Du
- Department of Health Outcomes and Biomedical Informatics, University of Florida College of Medicine, Gainesville, FL, USA
| | - Farhad Dastmalchi
- Department of Health Outcomes and Biomedical Informatics, University of Florida College of Medicine, Gainesville, FL, USA
| | - Hao Ye
- Health Science Center Libraries, University of Florida, Florida, USA
| | - Timothy J Garrett
- Department of Pathology, Immunology and Laboratory Medicine, College of Medicine, University of Florida, Florida, USA
| | - Matthew A Diller
- Department of Health Outcomes and Biomedical Informatics, University of Florida College of Medicine, Gainesville, FL, USA
| | - Mei Liu
- Department of Health Outcomes and Biomedical Informatics, University of Florida College of Medicine, Gainesville, FL, USA
| | - William R Hogan
- Department of Health Outcomes and Biomedical Informatics, University of Florida College of Medicine, Gainesville, FL, USA
| | - Mathias Brochhausen
- Department of Biomedical Informatics, College of Medicine, University of Arkansas for Medical Sciences, Little Rock, USA
| | - Dominick J Lemas
- Department of Health Outcomes and Biomedical Informatics, University of Florida College of Medicine, Gainesville, FL, USA.
- Department of Obstetrics and Gynecology, University of Florida College of Medicine, Florida, Gainesville, United States.
- Center for Perinatal Outcomes Research, University of Florida College of Medicine, Gainesville, United States.
| |
Collapse
|
24
|
Martin EA, Fulcher JM, Zhou M, Monroe ME, Petyuk VA. TopPICR: A Companion R Package for Top-Down Proteomics Data Analysis. J Proteome Res 2023; 22:399-409. [PMID: 36631391 DOI: 10.1021/acs.jproteome.2c00570] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023]
Abstract
Top-down proteomics is the analysis of proteins in their intact form without proteolysis, thus preserving valuable information about post-translational modifications, isoforms, and proteolytic processing. However, it is still a developing field due to limitations in the instrumentation, difficulties with the interpretation of complex mass spectra, and a lack of well-established quantification approaches. TopPIC is one of the popular tools for proteoform identification. We extended its capabilities into label-free proteoform quantification by developing a companion R package (TopPICR). Key steps in the TopPICR pipeline include filtering identifications, inferring a minimal set of protein accessions explaining the observed sequences, aligning retention times, recalibrating measured masses, clustering features across data sets, and finally compiling feature intensities using the match-between-runs approach. The output of the pipeline is an MSnSet object which makes downstream data analysis seamlessly compatible with packages from the Bioconductor project. It also provides the capability for visualizing proteoforms within the context of the parent protein sequence. The functionality of TopPICR is demonstrated on top-down LC-MS/MS data sets of 10 human-in-mouse xenografts of luminal and basal breast tumor samples.
Collapse
Affiliation(s)
- Evan A Martin
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington99352, United States
| | - James M Fulcher
- Environmental and Molecular Sciences Laboratory, Pacific Northwest National Laboratory, Richland, Washington99352, United States
| | - Mowei Zhou
- Environmental and Molecular Sciences Laboratory, Pacific Northwest National Laboratory, Richland, Washington99352, United States
| | - Matthew E Monroe
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington99352, United States
| | - Vladislav A Petyuk
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington99352, United States
| |
Collapse
|
25
|
Bittremieux W, Levitsky L, Pilz M, Sachsenberg T, Huber F, Wang M, Dorrestein PC. Unified and Standardized Mass Spectrometry Data Processing in Python Using spectrum_utils. J Proteome Res 2023; 22:625-631. [PMID: 36688502 DOI: 10.1021/acs.jproteome.2c00632] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/24/2023]
Abstract
spectrum_utils is a Python package for mass spectrometry data processing and visualization. Since its introduction, spectrum_utils has grown into a fundamental software solution that powers various applications in proteomics and metabolomics, ranging from spectrum preprocessing prior to spectrum identification and machine learning applications to spectrum plotting from online data repositories and assisting data analysis tasks for dozens of other projects. Here, we present updates to spectrum_utils, which include new functionality to integrate mass spectrometry community data standards, enhanced mass spectral data processing, and unified mass spectral data visualization in Python. spectrum_utils is freely available as open source at https://github.com/bittremieux/spectrum_utils.
Collapse
Affiliation(s)
- Wout Bittremieux
- Department of Computer Science, University of Antwerp, 2020 Antwerp, Belgium.,Biomedical Informatics Network Antwerpen (biomina), 2020 Antwerp, Belgium
| | - Lev Levitsky
- V. L. Talrose Institute for Energy Problems of Chemical Physics, N. N. Semenov Federal Research Center for Chemical Physics, Russian Academy of Sciences, 119334 Moscow, Russia
| | - Matteo Pilz
- Institute for Bioinformatics and Medical Informatics, University of Tübingen, 72076 Tübingen, Germany
| | - Timo Sachsenberg
- Institute for Bioinformatics and Medical Informatics, University of Tübingen, 72076 Tübingen, Germany
| | - Florian Huber
- Centre for Digitalisation and Digitality, University of Applied Sciences Düsseldorf, 40476 Düsseldorf, Germany
| | - Mingxun Wang
- Department of Computer Science, University of California─Riverside, Riverside, California 92507, United States
| | - Pieter C Dorrestein
- Collaborative Mass Spectrometry Innovation Center, University of California─San Diego, La Jolla, California 92093, United States.,Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California─San Diego, La Jolla California 92093, United States
| |
Collapse
|
26
|
Volani C, Malfertheiner C, Caprioli G, Fjelstrup S, Pramstaller PP, Rainer J, Paglia G. VAMS-Based Blood Capillary Sampling for Mass Spectrometry-Based Human Metabolomics Studies. Metabolites 2023; 13:metabo13020146. [PMID: 36837765 PMCID: PMC9958641 DOI: 10.3390/metabo13020146] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2022] [Revised: 01/14/2023] [Accepted: 01/15/2023] [Indexed: 01/20/2023] Open
Abstract
Volumetric absorptive microsampling (VAMS) is a recently developed sample collection method that enables single-drop blood collection in a minimally invasive manner. Blood biomolecules can then be extracted and processed for analysis using several analytical platforms. The integration of VAMS with conventional mass spectrometry (MS)-based metabolomics approaches is an attractive solution for human studies representing a less-invasive procedure compared to phlebotomy with the additional potential for remote sample collection. However, as we recently demonstrated, VAMS samples require long-term storage at -80 °C. This study investigated the stability of VAMS samples during short-term storage and compared the metabolome obtained from capillary blood collected from the fingertip to those of plasma and venous blood from 22 healthy volunteers. Our results suggest that the blood metabolome collected by VAMS samples is stable at room temperature only for up to 6 h requiring subsequent storage at -80 °C to avoid significant changes in the metabolome. We also demonstrated that capillary blood provides better coverage of the metabolome compared to plasma enabling the analysis of several intracellular metabolites presented in red blood cells. Finally, this work demonstrates that with the appropriate pre-analytical protocol capillary blood can be successfully used for untargeted metabolomics studies.
Collapse
Affiliation(s)
- Chiara Volani
- Institute for Biomedicine, Affiliated to the University of Lübeck, Eurac Research, 39100 Bolzano, Italy
| | - Christa Malfertheiner
- Institute for Biomedicine, Affiliated to the University of Lübeck, Eurac Research, 39100 Bolzano, Italy
| | - Giulia Caprioli
- Institute for Biomedicine, Affiliated to the University of Lübeck, Eurac Research, 39100 Bolzano, Italy
| | - Søren Fjelstrup
- Interdisciplinary Nanoscience Center, Aarhus University, 8000 Aarhus, Denmark
| | - Peter P. Pramstaller
- Institute for Biomedicine, Affiliated to the University of Lübeck, Eurac Research, 39100 Bolzano, Italy
| | - Johannes Rainer
- Institute for Biomedicine, Affiliated to the University of Lübeck, Eurac Research, 39100 Bolzano, Italy
| | - Giuseppe Paglia
- School of Medicine and Surgery, University of Milano-Bicocca, 20854 Vedano al Lambro, Italy
- Correspondence:
| |
Collapse
|
27
|
Vatanen T, Jabbar KS, Ruohtula T, Honkanen J, Avila-Pacheco J, Siljander H, Stražar M, Oikarinen S, Hyöty H, Ilonen J, Mitchell CM, Yassour M, Virtanen SM, Clish CB, Plichta DR, Vlamakis H, Knip M, Xavier RJ. Mobile genetic elements from the maternal microbiome shape infant gut microbial assembly and metabolism. Cell 2022; 185:4921-4936.e15. [PMID: 36563663 PMCID: PMC9869402 DOI: 10.1016/j.cell.2022.11.023] [Citation(s) in RCA: 49] [Impact Index Per Article: 24.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2022] [Revised: 09/30/2022] [Accepted: 11/11/2022] [Indexed: 12/24/2022]
Abstract
The perinatal period represents a critical window for cognitive and immune system development, promoted by maternal and infant gut microbiomes and their metabolites. Here, we tracked the co-development of microbiomes and metabolomes from late pregnancy to 1 year of age using longitudinal multi-omics data from a cohort of 70 mother-infant dyads. We discovered large-scale mother-to-infant interspecies transfer of mobile genetic elements, frequently involving genes associated with diet-related adaptations. Infant gut metabolomes were less diverse than maternal but featured hundreds of unique metabolites and microbe-metabolite associations not detected in mothers. Metabolomes and serum cytokine signatures of infants who received regular-but not extensively hydrolyzed-formula were distinct from those of exclusively breastfed infants. Taken together, our integrative analysis expands the concept of vertical transmission of the gut microbiome and provides original insights into the development of maternal and infant microbiomes and metabolomes during late pregnancy and early life.
Collapse
Affiliation(s)
- Tommi Vatanen
- Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA; Research Program for Clinical and Molecular Metabolism, Faculty of Medicine, University of Helsinki, Helsinki, Finland; Liggins Institute, University of Auckland, Auckland, New Zealand
| | | | - Terhi Ruohtula
- New Children's Hospital, Helsinki University Hospital, Helsinki, Finland
| | - Jarno Honkanen
- New Children's Hospital, Helsinki University Hospital, Helsinki, Finland
| | | | - Heli Siljander
- New Children's Hospital, Helsinki University Hospital, Helsinki, Finland; Centre for Military Medicine, Finnish Defence Forces, Riihimäki, Finland
| | - Martin Stražar
- Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Sami Oikarinen
- Department of Virology, Faculty of Medicine and Health Technology, Tampere University, Tampere, Finland
| | - Heikki Hyöty
- Department of Virology, Faculty of Medicine and Health Technology, Tampere University, Tampere, Finland; Fimlab Laboratories, Tampere, Finland
| | - Jorma Ilonen
- Immunogenetics Laboratory, Institute of Biomedicine, University of Turku, Turku, Finland
| | - Caroline M Mitchell
- Vincent Obstetrics & Gynecology Department, Massachusetts General Hospital, Boston, MA 02114, USA
| | - Moran Yassour
- Microbiology & Molecular Genetics Department, Faculty of Medicine, The Hebrew University of Jerusalem, Jerusalem, Israel; The Rachel and Selim Benin School of Computer Science and Engineering, The Hebrew University of Jerusalem, Jerusalem, Israel
| | - Suvi M Virtanen
- Department of Public Health and Welfare, Finnish Institute for Health and Welfare, Helsinki, Finland; Unit of Health Sciences, Faculty of Social Sciences, Tampere University, Tampere, Finland; Center for Child Health Research and Development and Innovation Center, Tampere University Hospital, Tampere, Finland
| | - Clary B Clish
- Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Damian R Plichta
- Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA; Center for Microbiome Informatics and Therapeutics, MIT, Cambridge, MA 02139, USA
| | - Hera Vlamakis
- Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA; Center for Microbiome Informatics and Therapeutics, MIT, Cambridge, MA 02139, USA
| | - Mikael Knip
- Research Program for Clinical and Molecular Metabolism, Faculty of Medicine, University of Helsinki, Helsinki, Finland; New Children's Hospital, Helsinki University Hospital, Helsinki, Finland; Tampere Center for Child Health Research, Tampere University Hospital, Tampere, Finland
| | - Ramnik J Xavier
- Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA; Center for Microbiome Informatics and Therapeutics, MIT, Cambridge, MA 02139, USA; Center for Computational and Integrative Biology, Department of Molecular Biology, Massachusetts General Hospital, Harvard Medical School, Boston, MA 02114, USA.
| |
Collapse
|
28
|
Morabito A, De Simone G, Ferrario M, Falcetta F, Pastorelli R, Brunelli L. EASY-FIA: A Readably Usable Standalone Tool for High-Resolution Mass Spectrometry Metabolomics Data Pre-Processing. Metabolites 2022; 13:metabo13010013. [PMID: 36676938 PMCID: PMC9861133 DOI: 10.3390/metabo13010013] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2022] [Revised: 12/15/2022] [Accepted: 12/16/2022] [Indexed: 12/24/2022] Open
Abstract
Flow injection analysis coupled with high-resolution mass spectrometry (FIA-HRMS) is a fair trade-off between resolution and speed. However, free software available for data pre-processing is few, web-based, and often requires advanced user specialization. These tools rarely embedded blank and noise evaluation strategies, and direct feature annotation. We developed EASY-FIA, a free standalone application that can be employed for FIA-HRMS metabolomic data pre-processing by users with no bioinformatics/programming skills. We validated the tool's performance and applicability in two clinical metabolomics case studies. The main functions of our application are blank subtraction, alignment of the metabolites, and direct feature annotation by means of the Human Metabolome Database (HMDB) using a minimum number of mass spectrometry parameters. In a scenario where FIA-HRMS is increasingly recognized as a reliable strategy for fast metabolomics analysis, EASY-FIA could become a standardized and feasible tool easily usable by all scientists dealing with MS-based metabolomics. EASY-FIA was implemented in MATLAB with the App Designer tool and it is freely available for download.
Collapse
Affiliation(s)
- Aurelia Morabito
- Laboratory of Mass Spectrometry, Istituto di Ricerche Farmacologiche Mario Negri IRCCS, 20156 Milan, Italy
- Department of Electronics, Information and Bioengineering, Politecnico di Milano, 20133 Milan, Italy
| | - Giulia De Simone
- Laboratory of Mass Spectrometry, Istituto di Ricerche Farmacologiche Mario Negri IRCCS, 20156 Milan, Italy
- Department of Biotechnologies and Biosciences, Università degli Studi Milano Bicocca, 20126 Milan, Italy
| | - Manuela Ferrario
- Department of Electronics, Information and Bioengineering, Politecnico di Milano, 20133 Milan, Italy
| | - Francesca Falcetta
- Unit of Biophysics, Istituto di Ricerche Farmacologiche Mario Negri IRCCS, 20156 Milan, Italy
| | - Roberta Pastorelli
- Laboratory of Mass Spectrometry, Istituto di Ricerche Farmacologiche Mario Negri IRCCS, 20156 Milan, Italy
| | - Laura Brunelli
- Laboratory of Mass Spectrometry, Istituto di Ricerche Farmacologiche Mario Negri IRCCS, 20156 Milan, Italy
- Correspondence: ; Tel.: +39-0239014742
| |
Collapse
|
29
|
Cui M, Cheng C, Zhang L. High-throughput proteomics: a methodological mini-review. J Transl Med 2022; 102:1170-1181. [PMID: 36775443 PMCID: PMC9362039 DOI: 10.1038/s41374-022-00830-7] [Citation(s) in RCA: 102] [Impact Index Per Article: 51.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2022] [Revised: 07/06/2022] [Accepted: 07/10/2022] [Indexed: 11/15/2022] Open
Abstract
Proteomics plays a vital role in biomedical research in the post-genomic era. With the technological revolution and emerging computational and statistic models, proteomic methodology has evolved rapidly in the past decade and shed light on solving complicated biomedical problems. Here, we summarize scientific research and clinical practice of existing and emerging high-throughput proteomics approaches, including mass spectrometry, protein pathway array, next-generation tissue microarrays, single-cell proteomics, single-molecule proteomics, Luminex, Simoa and Olink Proteomics. We also discuss important computational methods and statistical algorithms that can maximize the mining of proteomic data with clinical and/or other 'omics data. Various principles and precautions are provided for better utilization of these tools. In summary, the advances in high-throughput proteomics will not only help better understand the molecular mechanisms of pathogenesis, but also to identify the signature signaling networks of specific diseases. Thus, modern proteomics have a range of potential applications in basic research, prognostic oncology, precision medicine, and drug discovery.
Collapse
Affiliation(s)
- Miao Cui
- Department of Pathology, Icahn School of Medicine at Mount Sinai, New York, NY, USA.,Department of Pathology, Mount Sinai West, New York, NY, USA
| | - Chao Cheng
- Department of Medicine, Section of Epidemiology and Population Sciences, Baylor College of Medicine, Houston, TX, USA. .,Department of Medicine, Baylor College of Medicine, Houston, TX, USA. .,Dan L Duncan Comprehensive Cancer Center, Baylor College of Medicine, Houston, TX, USA.
| | - Lanjing Zhang
- Department of Biological Sciences, Rutgers University, Newark, NJ, USA. .,Department of Pathology, Princeton Medical Center, Plainsboro, NJ, USA. .,Rutgers Cancer Institute of New Jersey, New Brunswick, NJ, USA. .,Department of Chemical Biology, Ernest Mario School of Pharmacy, Rutgers University, Piscataway, NJ, USA.
| |
Collapse
|
30
|
Metabology: Analysis of metabolomics data using community ecology tools. Anal Chim Acta 2022; 1232:340469. [DOI: 10.1016/j.aca.2022.340469] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2022] [Revised: 09/15/2022] [Accepted: 09/28/2022] [Indexed: 11/17/2022]
|
31
|
Matalkah F, Jeong B, Sheridan M, Horstick E, Ramamurthy V, Stoilov P. The Musashi proteins direct post-transcriptional control of protein expression and alternate exon splicing in vertebrate photoreceptors. Commun Biol 2022; 5:1011. [PMID: 36153373 PMCID: PMC9509328 DOI: 10.1038/s42003-022-03990-w] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2022] [Accepted: 09/13/2022] [Indexed: 11/23/2022] Open
Abstract
The Musashi proteins, MSI1 and MSI2, are conserved RNA binding proteins with a role in the maintenance and renewal of stem cells. Contrasting with this role, terminally differentiated photoreceptor cells express high levels of MSI1 and MSI2, pointing to a role for the two proteins in vision. Combined knockout of Msi1 and Msi2 in mature photoreceptor cells abrogated the retinal response to light and caused photoreceptor cell death. In photoreceptor cells the Musashi proteins perform distinct nuclear and cytoplasmic functions. In the nucleus, the Musashi proteins promote splicing of photoreceptor-specific alternative exons. Surprisingly, conserved photoreceptor-specific alternative exons in genes critical for vision proved to be dispensable, raising questions about the selective pressures that lead to their conservation. In the cytoplasm MSI1 and MSI2 activate protein expression. Loss of Msi1 and Msi2 lead to reduction in the levels of multiple proteins including proteins required for vision and photoreceptor survival. The requirement for MSI1 and MSI2 in terminally differentiated photoreceptors alongside their role in stem cells shows that, depending on cellular context, these two proteins can control processes ranging from cell proliferation to sensory perception.
Collapse
Affiliation(s)
- Fatimah Matalkah
- Department of Biochemistry and Molecular Medicine, West Virginia University, Morgantown, WV, USA
| | - Bohye Jeong
- Department of Biochemistry and Molecular Medicine, West Virginia University, Morgantown, WV, USA
| | - Macie Sheridan
- Undergraduate Program in Biochemistry, West Virginia University, Morgantown, WV, USA
| | - Eric Horstick
- Department of Biology, West Virginia University, Morgantown, WV, USA
- Department of Neuroscience, West Virginia University, Morgantown, WV, USA
| | - Visvanathan Ramamurthy
- Department of Biochemistry and Molecular Medicine, West Virginia University, Morgantown, WV, USA
- Department of Ophthalmology and Visual Sciences, West Virginia University, Morgantown, WV, USA
| | - Peter Stoilov
- Department of Biochemistry and Molecular Medicine, West Virginia University, Morgantown, WV, USA.
| |
Collapse
|
32
|
Wirth NT, Gurdo N, Krink N, Vidal-Verdú À, Donati S, Férnandez-Cabezón L, Wulff T, Nikel PI. A synthetic C2 auxotroph of Pseudomonas putida for evolutionary engineering of alternative sugar catabolic routes. Metab Eng 2022; 74:83-97. [PMID: 36155822 DOI: 10.1016/j.ymben.2022.09.004] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2022] [Revised: 09/17/2022] [Accepted: 09/17/2022] [Indexed: 11/30/2022]
Abstract
Acetyl-coenzyme A (AcCoA) is a metabolic hub in virtually all living cells, serving as both a key precursor of essential biomass components and a metabolic sink for catabolic pathways for a large variety of substrates. Owing to this dual role, tight growth-production coupling schemes can be implemented around the AcCoA node. Building on this concept, a synthetic C2 auxotrophy was implemented in the platform bacterium Pseudomonas putida through an in silico-informed engineering approach. A growth-coupling strategy, driven by AcCoA demand, allowed for direct selection of an alternative sugar assimilation route-the phosphoketolase (PKT) shunt from bifidobacteria. Adaptive laboratory evolution forced the synthetic P. putida auxotroph to rewire its metabolic network to restore C2 prototrophy via the PKT shunt. Large-scale structural chromosome rearrangements were identified as possible mechanisms for adjusting the network-wide proteome profile, resulting in improved PKT-dependent growth phenotypes. 13C-based metabolic flux analysis revealed an even split between the native Entner-Doudoroff pathway and the synthetic PKT bypass for glucose processing, leading to enhanced carbon conservation. These results demonstrate that the P. putida metabolism can be radically rewired to incorporate a synthetic C2 metabolism, creating novel network connectivities and highlighting the importance of unconventional engineering strategies to support efficient microbial production.
Collapse
Affiliation(s)
- Nicolas T Wirth
- The Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Kemitorvet, 220 2800, Kongens Lyngby, Denmark
| | - Nicolás Gurdo
- The Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Kemitorvet, 220 2800, Kongens Lyngby, Denmark
| | - Nicolas Krink
- The Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Kemitorvet, 220 2800, Kongens Lyngby, Denmark
| | - Àngela Vidal-Verdú
- Institute for Integrative Systems Biology I2SysBio (Universitat de València-CSIC), Calle del Catedràtic Agustin Escardino Benlloch 9, 46980, Paterna, Spain
| | - Stefano Donati
- The Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Kemitorvet, 220 2800, Kongens Lyngby, Denmark
| | - Lorena Férnandez-Cabezón
- The Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Kemitorvet, 220 2800, Kongens Lyngby, Denmark
| | - Tune Wulff
- The Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Kemitorvet, 220 2800, Kongens Lyngby, Denmark
| | - Pablo I Nikel
- The Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Kemitorvet, 220 2800, Kongens Lyngby, Denmark.
| |
Collapse
|
33
|
Kafkia E, Andres-Pons A, Ganter K, Seiler M, Smith TS, Andrejeva A, Jouhten P, Pereira F, Franco C, Kuroshchenkova A, Leone S, Sawarkar R, Boston R, Thaventhiran J, Zaugg JB, Lilley KS, Lancrin C, Beck M, Patil KR. Operation of a TCA cycle subnetwork in the mammalian nucleus. SCIENCE ADVANCES 2022; 8:eabq5206. [PMID: 36044572 PMCID: PMC9432838 DOI: 10.1126/sciadv.abq5206] [Citation(s) in RCA: 20] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/13/2022] [Accepted: 07/14/2022] [Indexed: 05/23/2023]
Abstract
Nucleic acid and histone modifications critically depend on the tricarboxylic acid (TCA) cycle for substrates and cofactors. Although a few TCA cycle enzymes have been reported in the nucleus, the corresponding pathways are considered to operate in mitochondria. Here, we show that a part of the TCA cycle is operational also in the nucleus. Using 13C-tracer analysis, we identified activity of glutamine-to-fumarate, citrate-to-succinate, and glutamine-to-aspartate routes in the nuclei of HeLa cells. Proximity labeling mass spectrometry revealed a spatial vicinity of the involved enzymes with core nuclear proteins. We further show nuclear localization of aconitase 2 and 2-oxoglutarate dehydrogenase in mouse embryonic stem cells. Nuclear localization of the latter enzyme, which produces succinyl-CoA, changed from pluripotency to a differentiated state with accompanying changes in the nuclear protein succinylation. Together, our results demonstrate operation of an extended metabolic pathway in the nucleus, warranting a revision of the canonical view on metabolic compartmentalization.
Collapse
Affiliation(s)
- Eleni Kafkia
- European Molecular Biology Laboratory (EMBL), Heidelberg, Germany
- The Medical Research Council Toxicology Unit, University of Cambridge, Cambridge, UK
| | - Amparo Andres-Pons
- European Molecular Biology Laboratory (EMBL), Heidelberg, Germany
- Friedrich Miescher Institute for Biomedical Research, Basel, Switzerland
| | - Kerstin Ganter
- European Molecular Biology Laboratory (EMBL), Rome, Italy
| | - Markus Seiler
- Buchmann Institute for Molecular Life Sciences, Goethe University Frankfurt, Frankfurt, Germany
| | - Tom S. Smith
- Department of Biochemistry, University of Cambridge, Cambridge, UK
| | - Anna Andrejeva
- Department of Biochemistry, University of Cambridge, Cambridge, UK
| | - Paula Jouhten
- European Molecular Biology Laboratory (EMBL), Heidelberg, Germany
- VTT Technical Research Center of Finland, Helsinki, Finland
| | - Filipa Pereira
- European Molecular Biology Laboratory (EMBL), Heidelberg, Germany
| | - Catarina Franco
- The Medical Research Council Toxicology Unit, University of Cambridge, Cambridge, UK
| | - Anna Kuroshchenkova
- The Medical Research Council Toxicology Unit, University of Cambridge, Cambridge, UK
| | - Sergio Leone
- The Medical Research Council Toxicology Unit, University of Cambridge, Cambridge, UK
| | - Ritwick Sawarkar
- The Medical Research Council Toxicology Unit, University of Cambridge, Cambridge, UK
| | - Rebecca Boston
- The Medical Research Council Toxicology Unit, University of Cambridge, Cambridge, UK
| | - James Thaventhiran
- The Medical Research Council Toxicology Unit, University of Cambridge, Cambridge, UK
| | - Judith B. Zaugg
- European Molecular Biology Laboratory (EMBL), Heidelberg, Germany
| | | | | | - Martin Beck
- European Molecular Biology Laboratory (EMBL), Heidelberg, Germany
- Max Planck Institute of Biophysics, Frankfurt, Germany
| | - Kiran Raosaheb Patil
- European Molecular Biology Laboratory (EMBL), Heidelberg, Germany
- The Medical Research Council Toxicology Unit, University of Cambridge, Cambridge, UK
| |
Collapse
|
34
|
Kim KQ, Burgute BD, Tzeng SC, Jing C, Jungers C, Zhang J, Yan LL, Vierstra RD, Djuranovic S, Evans BS, Zaher HS. N1-methylpseudouridine found within COVID-19 mRNA vaccines produces faithful protein products. Cell Rep 2022; 40:111300. [PMID: 35988540 PMCID: PMC9376333 DOI: 10.1016/j.celrep.2022.111300] [Citation(s) in RCA: 18] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2021] [Revised: 06/07/2022] [Accepted: 08/10/2022] [Indexed: 11/24/2022] Open
Abstract
Synthetic mRNA technology is a promising avenue for treating and preventing disease. Key to the technology is the incorporation of modified nucleotides such as N1-methylpseudouridine (m1Ψ) to decrease immunogenicity of the RNA. However, relatively few studies have addressed the effects of modified nucleotides on the decoding process. Here, we investigate the effect of m1Ψ and the related modification pseudouridine (Ψ) on translation. In a reconstituted system, we find that m1Ψ does not significantly alter decoding accuracy. More importantly, we do not detect an increase in miscoded peptides when mRNA containing m1Ψ is translated in cell culture, compared with unmodified mRNA. We also find that m1Ψ does not stabilize mismatched RNA-duplex formation and only marginally promotes errors during reverse transcription. Overall, our results suggest that m1Ψ does not significantly impact translational fidelity, a welcome sign for future RNA therapeutics.
Collapse
Affiliation(s)
- Kyusik Q Kim
- Department of Biology, Washington University in St. Louis, St. Louis, MO 63130, USA
| | - Bhagyashri D Burgute
- Department of Biology, Washington University in St. Louis, St. Louis, MO 63130, USA
| | | | - Crystal Jing
- Department of Biology, Washington University in St. Louis, St. Louis, MO 63130, USA
| | - Courtney Jungers
- Department of Cell Biology and Physiology, Washington University in St. Louis School of Medicine, St. Louis, MO 63110, USA
| | - Junya Zhang
- Department of Biology, Washington University in St. Louis, St. Louis, MO 63130, USA
| | - Liewei L Yan
- Department of Biology, Washington University in St. Louis, St. Louis, MO 63130, USA
| | - Richard D Vierstra
- Department of Biology, Washington University in St. Louis, St. Louis, MO 63130, USA
| | - Sergej Djuranovic
- Department of Cell Biology and Physiology, Washington University in St. Louis School of Medicine, St. Louis, MO 63110, USA
| | - Bradley S Evans
- Donald Danforth Plant Science Center, St. Louis, MO 63132, USA
| | - Hani S Zaher
- Department of Biology, Washington University in St. Louis, St. Louis, MO 63130, USA.
| |
Collapse
|
35
|
Salzer L, Witting M, Schmitt-Kopplin P. MobilityTransformR: an R package for effective mobility transformation of CE-MS data. Bioinformatics 2022; 38:4044-4045. [PMID: 35781328 DOI: 10.1093/bioinformatics/btac441] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2022] [Revised: 05/28/2022] [Accepted: 06/30/2022] [Indexed: 12/24/2022] Open
Abstract
SUMMARY We present MobilityTransformR, an R/Bioconductor package for the effective mobility scaling of capillary zone electrophoresis-mass spectrometry (CE-MS) data. It uses functionality from different R packages that are frequently used for data processing and analysis in MS-based metabolomics workflows, allowing the subsequent use of reproducible transformed CE-MS data in existing workflows. AVAILABILITY AND IMPLEMENTATION MobilityTransformR is implemented in R (Version >= 4.2) and can be downloaded directly from the Bioconductor database (https://bioconductor.org/packages/MobilityTransformR) or GitHub (https://github.com/LiesaSalzer/MobilityTransformR). SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Liesa Salzer
- Research Unit Analytical BioGeoChemistry, Helmholtz Zentrum München, D-85764 Neuherberg, Germany
| | - Michael Witting
- Chair of Analytical Food Chemistry, TUM School of Life Sciences, Technical University of Munich, D-85354 Freising, Germany
- Metabolomics and Proteomics Core, Helmholtz Zentrum München, D-85764 Neuherberg, Germany
| | - Philippe Schmitt-Kopplin
- Research Unit Analytical BioGeoChemistry, Helmholtz Zentrum München, D-85764 Neuherberg, Germany
- Chair of Analytical Food Chemistry, TUM School of Life Sciences, Technical University of Munich, D-85354 Freising, Germany
| |
Collapse
|
36
|
Spatial Proteomics Reveals Differences in the Cellular Architecture of Antibody-Producing CHO and Plasma Cell-Derived Cells. Mol Cell Proteomics 2022; 21:100278. [PMID: 35934186 PMCID: PMC9562429 DOI: 10.1016/j.mcpro.2022.100278] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2021] [Revised: 07/28/2022] [Accepted: 07/30/2022] [Indexed: 01/18/2023] Open
Abstract
Most of the recombinant biotherapeutics employed today to combat severe illnesses, for example, various types of cancer or autoimmune diseases, are produced by Chinese hamster ovary (CHO) cells. To meet the growing demand of these pharmaceuticals, CHO cells are under constant development in order to enhance their stability and productivity. The last decades saw a shift from empirical cell line optimization toward rational cell engineering using a growing number of large omics datasets to alter cell physiology on various levels. Especially proteomics workflows reached new levels in proteome coverage and data quality because of advances in high-resolution mass spectrometry instrumentation. One type of workflow concentrates on spatial proteomics by usage of subcellular fractionation of organelles with subsequent shotgun mass spectrometry proteomics and machine learning algorithms to determine the subcellular localization of large portions of the cellular proteome at a certain time point. Here, we present the first subcellular spatial proteome of a CHO-K1 cell line producing high titers of recombinant antibody in comparison to the spatial proteome of an antibody-producing plasma cell-derived myeloma cell line. Both cell lines show colocalization of immunoglobulin G chains with chaperones and proteins associated in protein glycosylation within the endoplasmic reticulum compartment. However, we report differences in the localization of proteins associated to vesicle-mediated transport, transcription, and translation, which may affect antibody production in both cell lines. Furthermore, pairing subcellular localization data with protein expression data revealed elevated protein masses for organelles in the secretory pathway in plasma cell-derived MPC-11 (Merwin plasma cell tumor-11) cells. Our study highlights the potential of subcellular spatial proteomics combined with protein expression as potent workflow to identify characteristics of highly efficient recombinant protein-expressing cell lines. Data are available via ProteomeXchange with identifier PXD029115.
Collapse
|
37
|
Pang Z, Zhou G, Ewald J, Chang L, Hacariz O, Basu N, Xia J. Using MetaboAnalyst 5.0 for LC-HRMS spectra processing, multi-omics integration and covariate adjustment of global metabolomics data. Nat Protoc 2022; 17:1735-1761. [PMID: 35715522 DOI: 10.1038/s41596-022-00710-w] [Citation(s) in RCA: 637] [Impact Index Per Article: 318.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2021] [Accepted: 05/18/2022] [Indexed: 01/01/2023]
Abstract
Liquid chromatography coupled with high-resolution mass spectrometry (LC-HRMS) has become a workhorse in global metabolomics studies with growing applications across biomedical and environmental sciences. However, outstanding bioinformatics challenges in terms of data processing, statistical analysis and functional interpretation remain critical barriers to the wider adoption of this technology. To help the user community overcome these barriers, we have made major updates to the well-established MetaboAnalyst platform ( www.metaboanalyst.ca ). This protocol extends the previous 2011 Nature Protocol by providing stepwise instructions on how to use MetaboAnalyst 5.0 to: optimize parameters for LC-HRMS spectra processing; obtain functional insights from peak list data; integrate metabolomics data with transcriptomics data or combine multiple metabolomics datasets; conduct exploratory statistical analysis with complex metadata. Parameter optimization may take ~2 h to complete depending on the server load, and the remaining three stages may be executed in ~60 min.
Collapse
Affiliation(s)
- Zhiqiang Pang
- Institute of Parasitology, McGill University, Montreal, Quebec, Canada
| | - Guangyan Zhou
- Institute of Parasitology, McGill University, Montreal, Quebec, Canada
| | - Jessica Ewald
- Department of Natural Resources Sciences, McGill University, Montreal, Quebec, Canada
| | - Le Chang
- Department of Human Genetics, McGill University, Montreal, Quebec, Canada
| | - Orcun Hacariz
- Institute of Parasitology, McGill University, Montreal, Quebec, Canada
| | - Niladri Basu
- Department of Natural Resources Sciences, McGill University, Montreal, Quebec, Canada
| | - Jianguo Xia
- Institute of Parasitology, McGill University, Montreal, Quebec, Canada.
- Department of Human Genetics, McGill University, Montreal, Quebec, Canada.
| |
Collapse
|
38
|
Gloss AD, Vergnol A, Morton TC, Laurin PJ, Roux F, Bergelson J. Genome-wide association mapping within a local Arabidopsis thaliana population more fully reveals the genetic architecture for defensive metabolite diversity. Philos Trans R Soc Lond B Biol Sci 2022; 377:20200512. [PMID: 35634919 PMCID: PMC9149790 DOI: 10.1098/rstb.2020.0512] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2021] [Accepted: 03/08/2022] [Indexed: 12/16/2022] Open
Abstract
A paradoxical finding from genome-wide association studies (GWAS) in plants is that variation in metabolite profiles typically maps to a small number of loci, despite the complexity of underlying biosynthetic pathways. This discrepancy may partially arise from limitations presented by geographically diverse mapping panels. Properties of metabolic pathways that impede GWAS by diluting the additive effect of a causal variant, such as allelic and genetic heterogeneity and epistasis, would be expected to increase in severity with the geographical range of the mapping panel. We hypothesized that a population from a single locality would reveal an expanded set of associated loci. We tested this in a French Arabidopsis thaliana population (less than 1 km transect) by profiling and conducting GWAS for glucosinolates, a suite of defensive metabolites that have been studied in depth through functional and genetic mapping approaches. For two distinct classes of glucosinolates, we discovered more associations at biosynthetic loci than the previous GWAS with continental-scale mapping panels. Candidate genes underlying novel associations were supported by concordance between their observed effects in the TOU-A population and previous functional genetic and biochemical characterization. Local populations complement geographically diverse mapping panels to reveal a more complete genetic architecture for metabolic traits. This article is part of the theme issue 'Genetic basis of adaptation and speciation: from loci to causative mutations'.
Collapse
Affiliation(s)
- Andrew D. Gloss
- Department of Biology and Center for Genomics and Systems Biology, New York University, New York, NY, USA
- Department of Ecology and Evolution, University of Chicago, Chicago, IL, USA
| | - Amélie Vergnol
- Department of Ecology and Evolution, University of Chicago, Chicago, IL, USA
| | - Timothy C. Morton
- Department of Ecology and Evolution, University of Chicago, Chicago, IL, USA
| | - Peter J. Laurin
- Department of Biology and Center for Genomics and Systems Biology, New York University, New York, NY, USA
- Department of Ecology and Evolution, University of Chicago, Chicago, IL, USA
| | - Fabrice Roux
- LIPME, Université de Toulouse, INRAE, CNRS, Castanet-Tolosan, France
| | - Joy Bergelson
- Department of Biology and Center for Genomics and Systems Biology, New York University, New York, NY, USA
- Department of Ecology and Evolution, University of Chicago, Chicago, IL, USA
| |
Collapse
|
39
|
Jaeger AM, Stopfer LE, Ahn R, Sanders EA, Sandel DA, Freed-Pastor WA, Rideout WM, Naranjo S, Fessenden T, Nguyen KB, Winter PS, Kohn RE, Westcott PMK, Schenkel JM, Shanahan SL, Shalek AK, Spranger S, White FM, Jacks T. Deciphering the immunopeptidome in vivo reveals new tumour antigens. Nature 2022; 607:149-155. [PMID: 35705813 PMCID: PMC9945857 DOI: 10.1038/s41586-022-04839-2] [Citation(s) in RCA: 33] [Impact Index Per Article: 16.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2021] [Accepted: 05/06/2022] [Indexed: 11/09/2022]
Abstract
Immunosurveillance of cancer requires the presentation of peptide antigens on major histocompatibility complex class I (MHC-I) molecules1-5. Current approaches to profiling of MHC-I-associated peptides, collectively known as the immunopeptidome, are limited to in vitro investigation or bulk tumour lysates, which limits our understanding of cancer-specific patterns of antigen presentation in vivo6. To overcome these limitations, we engineered an inducible affinity tag into the mouse MHC-I gene (H2-K1) and targeted this allele to the KrasLSL-G12D/+Trp53fl/fl mouse model (KP/KbStrep)7. This approach enabled us to precisely isolate MHC-I peptides from autochthonous pancreatic ductal adenocarcinoma and from lung adenocarcinoma (LUAD) in vivo. In addition, we profiled the LUAD immunopeptidome from the alveolar type 2 cell of origin up to late-stage disease. Differential peptide presentation in LUAD was not predictable by mRNA expression or translation efficiency and is probably driven by post-translational mechanisms. Vaccination with peptides presented by LUAD in vivo induced CD8+ T cell responses in naive mice and tumour-bearing mice. Many peptides specific to LUAD, including immunogenic peptides, exhibited minimal expression of the cognate mRNA, which prompts the reconsideration of antigen prediction pipelines that triage peptides according to transcript abundance8. Beyond cancer, the KbStrep allele is compatible with other Cre-driver lines to explore antigen presentation in vivo in the pursuit of understanding basic immunology, infectious disease and autoimmunity.
Collapse
Affiliation(s)
- Alex M Jaeger
- David H. Koch Institute for Integrative Cancer Research, Cambridge, MA, USA
| | - Lauren E Stopfer
- David H. Koch Institute for Integrative Cancer Research, Cambridge, MA, USA
- Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Ryuhjin Ahn
- David H. Koch Institute for Integrative Cancer Research, Cambridge, MA, USA
- Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Emma A Sanders
- David H. Koch Institute for Integrative Cancer Research, Cambridge, MA, USA
| | - Demi A Sandel
- David H. Koch Institute for Integrative Cancer Research, Cambridge, MA, USA
| | - William A Freed-Pastor
- David H. Koch Institute for Integrative Cancer Research, Cambridge, MA, USA
- Department of Medical Oncology, Dana Farber Cancer Institute, Harvard Medical School, Boston, MA, USA
| | - William M Rideout
- David H. Koch Institute for Integrative Cancer Research, Cambridge, MA, USA
| | - Santiago Naranjo
- David H. Koch Institute for Integrative Cancer Research, Cambridge, MA, USA
| | - Tim Fessenden
- David H. Koch Institute for Integrative Cancer Research, Cambridge, MA, USA
| | - Kim B Nguyen
- David H. Koch Institute for Integrative Cancer Research, Cambridge, MA, USA
- Department of Biology, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Peter S Winter
- David H. Koch Institute for Integrative Cancer Research, Cambridge, MA, USA
- Broad Institute of Massachusetts Institute of Technology and Harvard, Cambridge, MA, USA
- Institute for Medical Engineering and Science, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Ryan E Kohn
- David H. Koch Institute for Integrative Cancer Research, Cambridge, MA, USA
| | - Peter M K Westcott
- David H. Koch Institute for Integrative Cancer Research, Cambridge, MA, USA
| | - Jason M Schenkel
- David H. Koch Institute for Integrative Cancer Research, Cambridge, MA, USA
- Department of Pathology, Brigham and Women's Hospital, Boston, MA, USA
| | - Sean-Luc Shanahan
- David H. Koch Institute for Integrative Cancer Research, Cambridge, MA, USA
| | - Alex K Shalek
- David H. Koch Institute for Integrative Cancer Research, Cambridge, MA, USA
- Broad Institute of Massachusetts Institute of Technology and Harvard, Cambridge, MA, USA
- Institute for Medical Engineering and Science, Massachusetts Institute of Technology, Cambridge, MA, USA
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, MA, USA
- Ragon Institute of MGH, Massachusetts Institute of Technology and Harvard, Cambridge, MA, USA
| | - Stefani Spranger
- David H. Koch Institute for Integrative Cancer Research, Cambridge, MA, USA
- Department of Biology, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Forest M White
- David H. Koch Institute for Integrative Cancer Research, Cambridge, MA, USA
- Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Tyler Jacks
- David H. Koch Institute for Integrative Cancer Research, Cambridge, MA, USA.
- Department of Biology, Massachusetts Institute of Technology, Cambridge, MA, USA.
| |
Collapse
|
40
|
Smith TS, Andrejeva A, Christopher J, Crook OM, Elzek M, Lilley KS. Prior Signal Acquisition Software Versions for Orbitrap Underestimate Low Isobaric Mass Tag Intensities, Without Detriment to Differential Abundance Experiments. ACS MEASUREMENT SCIENCE AU 2022; 2:233-240. [PMID: 35726249 PMCID: PMC9204819 DOI: 10.1021/acsmeasuresciau.1c00053] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/02/2021] [Revised: 01/24/2022] [Accepted: 01/25/2022] [Indexed: 06/15/2023]
Abstract
Tandem mass tags (TMTs) enable simple and accurate quantitative proteomics for multiplexed samples by relative quantification of tag reporter ions. Orbitrap quantification of reporter ions has been associated with a characteristic notch region in intensity distribution, within which few reporter intensities are recorded. This has been resolved in version 3 of the instrument acquisition software Tune. However, 47% of Orbitrap Fusion, Lumos, or Eclipse submissions to PRIDE were generated using prior software versions. To quantify the impact of the notch on existing quantitative proteomics data, we generated a mixed species benchmark and acquired quantitative data using Tune versions 2 and 3. Intensities below the notch are predominantly underestimated with Tune version 2, leading to overestimation of the true differences in intensities between samples. However, when summarizing reporter ion intensities to higher-level features, such as peptides and proteins, few features are significantly affected. Targeted removal of spectra with reporter ion intensities below the notch is not beneficial for differential peptide or protein testing. Overall, we find that the systematic quantification bias associated with the notch is not detrimental for a typical proteomics experiment.
Collapse
Affiliation(s)
- Tom S. Smith
- MRC
Toxicology Unit, University of Cambridge, Cambridge CB2 1QR, U.K.
| | - Anna Andrejeva
- Department
of Biochemistry, University of Cambridge, Cambridge CB2 1QW, U.K.
| | - Josie Christopher
- Department
of Biochemistry, University of Cambridge, Cambridge CB2 1QW, U.K.
| | - Oliver M. Crook
- Department
of Statistics, University of Oxford, Oxford OX1 3LB, U.K.
| | - Mohamed Elzek
- MRC
Toxicology Unit, University of Cambridge, Cambridge CB2 1QR, U.K.
| | - Kathryn S. Lilley
- Department
of Biochemistry, University of Cambridge, Cambridge CB2 1QW, U.K.
| |
Collapse
|
41
|
Flight RM, Mitchell JM, Moseley HNB. Scan-Centric, Frequency-Based Method for Characterizing Peaks from Direct Injection Fourier Transform Mass Spectrometry Experiments. Metabolites 2022; 12:metabo12060515. [PMID: 35736448 PMCID: PMC9228344 DOI: 10.3390/metabo12060515] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2022] [Revised: 05/20/2022] [Accepted: 05/30/2022] [Indexed: 11/16/2022] Open
Abstract
We present a novel, scan-centric method for characterizing peaks from direct injection multi-scan Fourier transform mass spectra of complex samples that utilizes frequency values derived directly from the spacing of raw m/z points in spectral scans. Our peak characterization method utilizes intensity-independent noise removal and normalization of scan-level data to provide a much better fit of relative intensity to natural abundance probabilities for low abundance isotopologues that are not present in all of the acquired scans. Moreover, our method calculates both peak- and scan-specific statistics incorporated within a series of quality control steps that are designed to robustly derive peak centers, intensities, and intensity ratios with their scan-level variances. These cross-scan characterized peaks are suitable for use in our previously published peak assignment methodology, Small Molecule Isotope Resolved Formula Enumeration (SMIRFE).
Collapse
Affiliation(s)
- Robert M. Flight
- Markey Cancer Center, University of Kentucky, Lexington, KY 40536, USA; (R.M.F.); (J.M.M.)
- Department of Molecular & Cellular Biochemistry, University of Kentucky, Lexington, KY 40536, USA
- Resource Center for Stable Isotope Resolved Metabolomics, University of Kentucky, Lexington, KY 40536, USA
| | - Joshua M. Mitchell
- Markey Cancer Center, University of Kentucky, Lexington, KY 40536, USA; (R.M.F.); (J.M.M.)
- Department of Molecular & Cellular Biochemistry, University of Kentucky, Lexington, KY 40536, USA
- Resource Center for Stable Isotope Resolved Metabolomics, University of Kentucky, Lexington, KY 40536, USA
| | - Hunter N. B. Moseley
- Markey Cancer Center, University of Kentucky, Lexington, KY 40536, USA; (R.M.F.); (J.M.M.)
- Department of Molecular & Cellular Biochemistry, University of Kentucky, Lexington, KY 40536, USA
- Resource Center for Stable Isotope Resolved Metabolomics, University of Kentucky, Lexington, KY 40536, USA
- Institute for Biomedical Informatics, University of Kentucky, Lexington, KY 40536, USA
- Department of Toxicology and Cancer Biology, University of Kentucky, Lexington, KY 40536, USA
- Correspondence:
| |
Collapse
|
42
|
Byron A, Griffith BGC, Herrero A, Loftus AEP, Koeleman ES, Kogerman L, Dawson JC, McGivern N, Culley J, Grimes GR, Serrels B, von Kriegsheim A, Brunton VG, Frame MC. Characterisation of a nucleo-adhesome. Nat Commun 2022; 13:3053. [PMID: 35650196 PMCID: PMC9160004 DOI: 10.1038/s41467-022-30556-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2020] [Accepted: 05/02/2022] [Indexed: 11/09/2022] Open
Abstract
In addition to central functions in cell adhesion signalling, integrin-associated proteins have wider roles at sites distal to adhesion receptors. In experimentally defined adhesomes, we noticed that there is clear enrichment of proteins that localise to the nucleus, and conversely, we now report that nuclear proteomes contain a class of adhesome components that localise to the nucleus. We here define a nucleo-adhesome, providing experimental evidence for a remarkable scale of nuclear localisation of adhesion proteins, establishing a framework for interrogating nuclear adhesion protein functions. Adding to nuclear FAK's known roles in regulating transcription, we now show that nuclear FAK regulates expression of many adhesion-related proteins that localise to the nucleus and that nuclear FAK binds to the adhesome component and nuclear protein Hic-5. FAK and Hic-5 work together in the nucleus, co-regulating a subset of genes transcriptionally. We demonstrate the principle that there are subcomplexes of nuclear adhesion proteins that cooperate to control transcription.
Collapse
Affiliation(s)
- Adam Byron
- Cancer Research UK Edinburgh Centre, Institute of Genetics and Cancer, University of Edinburgh, Edinburgh, EH4 2XR, UK.
- Division of Molecular and Cellular Function, School of Biological Sciences, Faculty of Biology, Medicine and Health, University of Manchester, Manchester, M13 9PT, UK.
| | - Billie G C Griffith
- Cancer Research UK Edinburgh Centre, Institute of Genetics and Cancer, University of Edinburgh, Edinburgh, EH4 2XR, UK
| | - Ana Herrero
- Cancer Research UK Edinburgh Centre, Institute of Genetics and Cancer, University of Edinburgh, Edinburgh, EH4 2XR, UK
- Instituto de Biomedicina y Biotecnología de Cantabria (IBBTEC), Consejo Superior de Investigaciones Científicas (CSIC)-Universidad de Cantabria, 39011, Santander, Spain
| | - Alexander E P Loftus
- Cancer Research UK Edinburgh Centre, Institute of Genetics and Cancer, University of Edinburgh, Edinburgh, EH4 2XR, UK
| | - Emma S Koeleman
- Cancer Research UK Edinburgh Centre, Institute of Genetics and Cancer, University of Edinburgh, Edinburgh, EH4 2XR, UK
- Leiden University Medical Center, 2333 ZC, Leiden, The Netherlands
- Division of Chromatin Networks, German Cancer Research Center (DKFZ) and Bioquant, 69120, Heidelberg, Germany
| | - Linda Kogerman
- Cancer Research UK Edinburgh Centre, Institute of Genetics and Cancer, University of Edinburgh, Edinburgh, EH4 2XR, UK
| | - John C Dawson
- Cancer Research UK Edinburgh Centre, Institute of Genetics and Cancer, University of Edinburgh, Edinburgh, EH4 2XR, UK
| | - Niamh McGivern
- Cancer Research UK Edinburgh Centre, Institute of Genetics and Cancer, University of Edinburgh, Edinburgh, EH4 2XR, UK
- Almac Diagnostic Services, 19 Seagoe Industrial Estate, Craigavon, BT63 5QD, UK
| | - Jayne Culley
- Cancer Research UK Edinburgh Centre, Institute of Genetics and Cancer, University of Edinburgh, Edinburgh, EH4 2XR, UK
| | - Graeme R Grimes
- MRC Human Genetics Unit, Institute of Genetics and Cancer, University of Edinburgh, Edinburgh, EH4 2XU, UK
| | - Bryan Serrels
- Cancer Research UK Edinburgh Centre, Institute of Genetics and Cancer, University of Edinburgh, Edinburgh, EH4 2XR, UK
- NanoString Technologies, Inc., Seattle, WA, 98109, USA
| | - Alex von Kriegsheim
- Cancer Research UK Edinburgh Centre, Institute of Genetics and Cancer, University of Edinburgh, Edinburgh, EH4 2XR, UK
| | - Valerie G Brunton
- Cancer Research UK Edinburgh Centre, Institute of Genetics and Cancer, University of Edinburgh, Edinburgh, EH4 2XR, UK
| | - Margaret C Frame
- Cancer Research UK Edinburgh Centre, Institute of Genetics and Cancer, University of Edinburgh, Edinburgh, EH4 2XR, UK
| |
Collapse
|
43
|
Tsiamis V, Schwämmle V. VIQoR: a web service for visually supervised protein inference and protein quantification. Bioinformatics 2022; 38:2757-2764. [PMID: 35561162 DOI: 10.1093/bioinformatics/btac182] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2021] [Revised: 03/07/2022] [Accepted: 03/22/2022] [Indexed: 11/12/2022] Open
Abstract
MOTIVATION In quantitative bottom-up mass spectrometry (MS)-based proteomics, the reliable estimation of protein concentration changes from peptide quantifications between different biological samples is essential. This estimation is not a single task but comprises the two processes of protein inference and protein abundance summarization. Furthermore, due to the high complexity of proteomics data and associated uncertainty about the performance of these processes, there is a demand for comprehensive visualization methods able to integrate protein with peptide quantitative data including their post-translational modifications. Hence, there is a lack of a suitable tool that provides post-identification quantitative analysis of proteins with simultaneous interactive visualization. RESULTS In this article, we present VIQoR, a user-friendly web service that accepts peptide quantitative data of both labeled and label-free experiments and accomplishes the crucial components protein inference and summarization and interactive visualization modules, including the novel VIQoR plot. We implemented two different parsimonious algorithms to solve the protein inference problem, while protein summarization is facilitated by a well-established factor analysis algorithm called fast-FARMS followed by a weighted average summarization function that minimizes the effect of missing values. In addition, summarization is optimized by the so-called Global Correlation Indicator (GCI). We test the tool on three publicly available ground truth datasets and demonstrate the ability of the protein inference algorithms to handle shared peptides. We furthermore show that GCI increases the accuracy of the quantitative analysis in datasets with replicated design. AVAILABILITY AND IMPLEMENTATION VIQoR is accessible at: http://computproteomics.bmb.sdu.dk/Apps/VIQoR/. The source code is available at: https://bitbucket.org/veitveit/viqor/. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Vasileios Tsiamis
- Department of Biochemistry and Molecular Biology, University of Southern Denmark, 5230 Odense, Denmark
| | - Veit Schwämmle
- Department of Biochemistry and Molecular Biology, University of Southern Denmark, 5230 Odense, Denmark
| |
Collapse
|
44
|
In silico deconjugation of glucuronide conjugates enhances tandem mass spectra library annotation of human samples. Anal Bioanal Chem 2022; 414:2629-2640. [PMID: 35080654 PMCID: PMC8888480 DOI: 10.1007/s00216-022-03899-7] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2021] [Revised: 11/26/2021] [Accepted: 01/12/2022] [Indexed: 11/01/2022]
Abstract
Mass spectral library annotation of liquid chromatography-high resolution tandem mass spectrometry (LC-HRMS/MS) data is a reliable approach for fast identification of organic contaminants and toxicants in complex environmental and biological matrices. While determining the exposure of humans or mammals, it is indispensable to include phase I and phase II metabolites (conjugates) along with the parent compounds, but often, tandem mass spectra for these are unavailable. In this study, we present and evaluate a strategy for annotating glucuronide conjugates in LC-HRMS/MS scans by applying a neutral loss search for detection, then truncating the spectra which we refer to as in silico deconjugation, and finally searching these against mass spectral libraries of the aglycones. The workflow was tested on a dataset of in vitro-generated glucuronides of reference standard mixtures and a dataset of 51 authentic urine samples collected from patients with known medication status, acquired on different instrumentations. A total number of 75 different glucuronidated molecular structures were identified by in silico deconjugation and spectral library annotation. We also identified specific molecular structures (sulfonamides, ether bonds, di-glucuronides), which resulted in slightly different fragmentation patterns between the glucuronide and the unconjugated compound. This led to a decreased spectral matching score and in some cases to a false-negative identification. Still, by applying this method, we revealed a reliable annotation of most common glucuronides, leading to a new strategy reducing the need for deconjugation steps or for recording many reference glucuronide spectra for screening approaches.
Collapse
|
45
|
Isaksson M, Karlsson C, Laurell T, Kirkeby A, Heusel M. MSLibrarian: Optimized Predicted Spectral Libraries for Data-Independent Acquisition Proteomics. J Proteome Res 2022; 21:535-546. [PMID: 35042333 PMCID: PMC8822486 DOI: 10.1021/acs.jproteome.1c00796] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
![]()
Data-independent
acquisition-mass spectrometry (DIA-MS) is the
method of choice for deep, consistent, and accurate single-shot profiling
in bottom-up proteomics. While classic workflows for targeted quantification
from DIA-MS data require auxiliary data-dependent acquisition (DDA)
MS analysis of subject samples to derive prior-knowledge spectral
libraries, library-free approaches based on in silico prediction promise deep DIA-MS profiling with reduced experimental
effort and cost. Coverage and sensitivity in such analyses are however
limited, in part, by the large library size and persistent deviations
from the experimental data. We present MSLibrarian, a new workflow
and tool to obtain optimized predicted spectral libraries by the integrated
usage of spectrum-centric DIA data interpretation via the DIA-Umpire
approach to inform and calibrate the in silico predicted
library and analysis approach. Predicted-vs-observed comparisons enabled
optimization of intensity prediction parameters, calibration of retention
time prediction for deviating chromatographic setups, and optimization
of the library scope and sample representativeness. Benchmarking via
a dedicated ground-truth-embedded experiment of species-mixed proteins
and quantitative ratio-validation confirmed gains of up to 13% on
peptide and 8% on protein level at equivalent FDR control and validation
criteria. MSLibrarian is made available as an open-source R software
package, including step-by-step user instructions, at https://github.com/MarcIsak/MSLibrarian.
Collapse
Affiliation(s)
- Marc Isaksson
- Department of Biomedical Engineering, Lund University, 22100 Lund, Sweden.,Department of Experimental Medical Science and Wallenberg Center for Molecular Medicine, Lund University, 22100 Lund, Sweden
| | - Christofer Karlsson
- Infection Medicine Proteomics Lab, Division of Infection Medicine (BMC), Faculty of Medicine, Lund University, 22100 Lund, Sweden
| | - Thomas Laurell
- Department of Biomedical Engineering, Lund University, 22100 Lund, Sweden
| | - Agnete Kirkeby
- Department of Experimental Medical Science and Wallenberg Center for Molecular Medicine, Lund University, 22100 Lund, Sweden.,Department of Neuroscience, University of Copenhagen, DK-2200 Copenhagen, Denmark.,The Novo Nordisk Foundation Center for Stem Cell Biology (DanStem), Faculty of Health and Medical Sciences, University of Copenhagen, DK-2200 Copenhagen, Denmark
| | - Moritz Heusel
- Infection Medicine Proteomics Lab, Division of Infection Medicine (BMC), Faculty of Medicine, Lund University, 22100 Lund, Sweden
| |
Collapse
|
46
|
Heming S, Hansen P, Vlasov A, Schwörer F, Schaumann S, Frolovaitė P, Lehmann WD, Timmer J, Schilling M, Helm B, Klingmüller U. MSPypeline: a python package for streamlined data analysis of mass spectrometry-based proteomics. BIOINFORMATICS ADVANCES 2022; 2:vbac004. [PMID: 36699356 PMCID: PMC9710650 DOI: 10.1093/bioadv/vbac004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/17/2021] [Revised: 12/01/2021] [Accepted: 01/13/2022] [Indexed: 01/28/2023]
Abstract
Summary Mass spectrometry-based proteomics is increasingly employed in biology and medicine. To generate reliable information from large datasets and ensure comparability of results, it is crucial to implement and standardize the quality control of the raw data, the data processing steps and the statistical analyses. MSPypeline provides a platform for importing MaxQuant output tables, generating quality control reports, data preprocessing including normalization and performing exploratory analyses by statistical inference plots. These standardized steps assess data quality, provide customizable figures and enable the identification of differentially expressed proteins to reach biologically relevant conclusions. Availability and implementation The source code is available under the MIT license at https://github.com/siheming/mspypeline with documentation at https://mspypeline.readthedocs.io. Benchmark mass spectrometry data are available on ProteomeXchange (PXD025792). Supplementary information Supplementary data are available at Bioinformatics Advances online.
Collapse
Affiliation(s)
- Simon Heming
- Division Systems Biology of Signal Transduction, German Cancer Research Center (DKFZ), Heidelberg 69120, Germany
| | - Pauline Hansen
- Division Systems Biology of Signal Transduction, German Cancer Research Center (DKFZ), Heidelberg 69120, Germany,Institute for Physics and CIBSS Centre for Integrative Biological Signalling Studies, University of Freiburg, Freiburg 79104, Germany
| | - Artyom Vlasov
- Division Systems Biology of Signal Transduction, German Cancer Research Center (DKFZ), Heidelberg 69120, Germany
| | - Florian Schwörer
- Division Systems Biology of Signal Transduction, German Cancer Research Center (DKFZ), Heidelberg 69120, Germany
| | - Stephen Schaumann
- Institute for Physics and CIBSS Centre for Integrative Biological Signalling Studies, University of Freiburg, Freiburg 79104, Germany
| | - Paulina Frolovaitė
- Division Systems Biology of Signal Transduction, German Cancer Research Center (DKFZ), Heidelberg 69120, Germany
| | - Wolf-Dieter Lehmann
- Division Systems Biology of Signal Transduction, German Cancer Research Center (DKFZ), Heidelberg 69120, Germany
| | - Jens Timmer
- Institute for Physics and CIBSS Centre for Integrative Biological Signalling Studies, University of Freiburg, Freiburg 79104, Germany
| | - Marcel Schilling
- Division Systems Biology of Signal Transduction, German Cancer Research Center (DKFZ), Heidelberg 69120, Germany
| | - Barbara Helm
- Division Systems Biology of Signal Transduction, German Cancer Research Center (DKFZ), Heidelberg 69120, Germany
| | - Ursula Klingmüller
- Division Systems Biology of Signal Transduction, German Cancer Research Center (DKFZ), Heidelberg 69120, Germany,To whom correspondence should be addressed.
| |
Collapse
|
47
|
Rajwani R, Ohlemacher SI, Zhao G, Liu HB, Bewley CA. Genome-Guided Discovery of Natural Products through Multiplexed Low-Coverage Whole-Genome Sequencing of Soil Actinomycetes on Oxford Nanopore Flongle. mSystems 2021; 6:e0102021. [PMID: 34812649 PMCID: PMC8609971 DOI: 10.1128/msystems.01020-21] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2021] [Accepted: 10/31/2021] [Indexed: 12/02/2022] Open
Abstract
Genome mining is an important tool for discovery of new natural products; however, the number of publicly available genomes for natural product-rich microbes such as actinomycetes, relative to human pathogens with smaller genomes, is small. To obtain contiguous DNA assemblies and identify large (ca. 10 to greater than 100 kb) biosynthetic gene clusters (BGCs) with high GC (>70%) and high-repeat content, it is necessary to use long-read sequencing methods when sequencing actinomycete genomes. One of the hurdles to long-read sequencing is the higher cost. In the current study, we assessed Flongle, a recently launched platform by Oxford Nanopore Technologies, as a low-cost DNA sequencing option to obtain contiguous DNA assemblies and analyze BGCs. To make the workflow more cost-effective, we multiplexed up to four samples in a single Flongle sequencing experiment while expecting low-sequencing coverage per sample. We hypothesized that contiguous DNA assemblies might enable analysis of BGCs even at low sequencing depth. To assess the value of these assemblies, we collected high-resolution mass spectrometry data and conducted a multi-omics analysis to connect BGCs to secondary metabolites. In total, we assembled genomes for 20 distinct strains across seven sequencing experiments. In each experiment, 50% of the bases were in reads longer than 10 kb, which facilitated the assembly of reads into contigs with an average N50 value of 3.5 Mb. The programs antiSMASH and PRISM predicted 629 and 295 BGCs, respectively. We connected BGCs to metabolites for N,N-dimethyl cyclic-di-tryptophan, two novel lasso peptides, and three known actinomycete-associated siderophores, namely, mirubactin, heterobactin, and salinichelin. IMPORTANCE Short-read sequencing of GC-rich genomes such as those from actinomycetes results in a fragmented genome assembly and truncated biosynthetic gene clusters (often 10 to >100 kb long), which hinders our ability to understand the biosynthetic potential of a given strain and predict the molecules that can be produced. The current study demonstrates that contiguous DNA assemblies, suitable for analysis of BGCs, can be obtained through low-coverage, multiplexed sequencing on Flongle, which provides a new low-cost workflow ($30 to 40 per strain) for sequencing actinomycete strain libraries.
Collapse
Affiliation(s)
- Rahim Rajwani
- Laboratory of Bioorganic Chemistry, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, Maryland, USA
| | - Shannon I. Ohlemacher
- Laboratory of Bioorganic Chemistry, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, Maryland, USA
| | - Gengxiang Zhao
- Laboratory of Bioorganic Chemistry, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, Maryland, USA
| | - Hong-Bing Liu
- Laboratory of Bioorganic Chemistry, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, Maryland, USA
| | - Carole A. Bewley
- Laboratory of Bioorganic Chemistry, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, Maryland, USA
| |
Collapse
|
48
|
Quast JP, Schuster D, Picotti P. protti: an R package for comprehensive data analysis of peptide- and protein-centric bottom-up proteomics data. BIOINFORMATICS ADVANCES 2021; 2:vbab041. [PMID: 36699412 PMCID: PMC9710675 DOI: 10.1093/bioadv/vbab041] [Citation(s) in RCA: 30] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/18/2021] [Revised: 10/28/2021] [Accepted: 12/06/2021] [Indexed: 01/28/2023]
Abstract
Summary We present a flexible, user-friendly R package called protti for comprehensive quality control, analysis and interpretation of quantitative bottom-up proteomics data. protti supports the analysis of protein-centric data such as those associated with protein expression analyses, as well as peptide-centric data such as those resulting from limited proteolysis-coupled mass spectrometry analysis. Due to its flexible design, it supports analysis of label-free, data-dependent, data-independent and targeted proteomics datasets. protti can be run on the output of any search engine and software package commonly used for bottom-up proteomics experiments such as Spectronaut, Skyline, MaxQuant or Proteome Discoverer, adequately exported to table format. Availability and implementation protti is implemented as an open-source R package. Release versions are available via CRAN (https://CRAN.R-project.org/package=protti) and work on all major operating systems. The development version is maintained on GitHub (https://github.com/jpquast/protti). Full documentation including examples is provided in the form of vignettes on our package website (jpquast.github.io/protti/).
Collapse
Affiliation(s)
- Jan-Philipp Quast
- Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, Zurich 8093, Switzerland
| | - Dina Schuster
- Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, Zurich 8093, Switzerland,Department of Biology, Institute of Molecular Biology and Biophysics, ETH Zurich, Zurich 8093, Switzerland,Laboratory of Biomolecular Research, Division of Biology and Chemistry, Paul Scherrer Institute, Villigen 5232, Switzerland
| | - Paola Picotti
- Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, Zurich 8093, Switzerland,To whom correspondence should be addressed.
| |
Collapse
|
49
|
Imbert A, Rompais M, Selloum M, Castelli F, Mouton-Barbosa E, Brandolini-Bunlon M, Chu-Van E, Joly C, Hirschler A, Roger P, Burger T, Leblanc S, Sorg T, Ouzia S, Vandenbrouck Y, Médigue C, Junot C, Ferro M, Pujos-Guillot E, de Peredo AG, Fenaille F, Carapito C, Herault Y, Thévenot EA. ProMetIS, deep phenotyping of mouse models by combined proteomics and metabolomics analysis. Sci Data 2021; 8:311. [PMID: 34862403 PMCID: PMC8642540 DOI: 10.1038/s41597-021-01095-3] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2021] [Accepted: 11/02/2021] [Indexed: 01/20/2023] Open
Abstract
Genes are pleiotropic and getting a better knowledge of their function requires a comprehensive characterization of their mutants. Here, we generated multi-level data combining phenomic, proteomic and metabolomic acquisitions from plasma and liver tissues of two C57BL/6 N mouse models lacking the Lat (linker for activation of T cells) and the Mx2 (MX dynamin-like GTPase 2) genes, respectively. Our dataset consists of 9 assays (1 preclinical, 2 proteomics and 6 metabolomics) generated with a fully non-targeted and standardized approach. The data and processing code are publicly available in the ProMetIS R package to ensure accessibility, interoperability, and reusability. The dataset thus provides unique molecular information about the physiological role of the Lat and Mx2 genes. Furthermore, the protocols described herein can be easily extended to a larger number of individuals and tissues. Finally, this resource will be of great interest to develop new bioinformatic and biostatistic methods for multi-omics data integration.
Collapse
Affiliation(s)
- Alyssa Imbert
- CEA, LIST, Laboratoire Sciences des Données et de la Décision, IFB, MetaboHUB, Gif-sur-Yvette, France.
- IFB-core, UMS3601, Genoscope, Evry, France.
| | - Magali Rompais
- Laboratoire de Spectrométrie de Masse BioOrganique, Université de Strasbourg, CNRS, IPHC UMR 7178, ProFI, Strasbourg, France
| | - Mohammed Selloum
- Université de Strasbourg, CNRS, INSERM, Institut Clinique de la Souris, Phenomin-ICS, Illkirch, France
| | - Florence Castelli
- Université Paris Saclay, CEA, INRAE, Département Médicaments et Technologies pour la Santé (MTS), MetaboHUB, Gif-sur-Yvette, France
| | - Emmanuelle Mouton-Barbosa
- Institut de Pharmacologie et Biologie Structurale (IPBS), Université de Toulouse, CNRS, UPS, ProFI, Toulouse, France
| | - Marion Brandolini-Bunlon
- Université Clermont Auvergne, INRAE, UNH, Plateforme d'Exploration du Métabolisme, MetaboHUB, Clermont-Ferrand, France
| | - Emeline Chu-Van
- Université Paris Saclay, CEA, INRAE, Département Médicaments et Technologies pour la Santé (MTS), MetaboHUB, Gif-sur-Yvette, France
| | - Charlotte Joly
- Université Clermont Auvergne, INRAE, UNH, Plateforme d'Exploration du Métabolisme, MetaboHUB, Clermont-Ferrand, France
| | - Aurélie Hirschler
- Laboratoire de Spectrométrie de Masse BioOrganique, Université de Strasbourg, CNRS, IPHC UMR 7178, ProFI, Strasbourg, France
| | - Pierrick Roger
- CEA, LIST, Laboratoire Intelligence Artificielle et Apprentissage Automatique, MetaboHUB, Gif-sur-Yvette, France
| | - Thomas Burger
- Université Grenoble Alpes, INSERM, CEA, UMR BioSanté U1292, FR2048, ProFI, Grenoble, France
| | - Sophie Leblanc
- Université de Strasbourg, CNRS, INSERM, Institut Clinique de la Souris, Phenomin-ICS, Illkirch, France
| | - Tania Sorg
- Université de Strasbourg, CNRS, INSERM, Institut Clinique de la Souris, Phenomin-ICS, Illkirch, France
| | - Sadia Ouzia
- Université Paris Saclay, CEA, INRAE, Département Médicaments et Technologies pour la Santé (MTS), MetaboHUB, Gif-sur-Yvette, France
| | - Yves Vandenbrouck
- Université Grenoble Alpes, INSERM, CEA, UMR BioSanté U1292, FR2048, ProFI, Grenoble, France
| | - Claudine Médigue
- IFB-core, UMS3601, Genoscope, Evry, France
- Laboratoire d'Analyses Bioinformatique en Génomique et Métabolisme (LABGeM), CNRS & CEA/DRF/IFJ, UMR8030, Evry, France
| | - Christophe Junot
- Université Paris Saclay, CEA, INRAE, Département Médicaments et Technologies pour la Santé (MTS), MetaboHUB, Gif-sur-Yvette, France
| | - Myriam Ferro
- Université Grenoble Alpes, INSERM, CEA, UMR BioSanté U1292, FR2048, ProFI, Grenoble, France
| | - Estelle Pujos-Guillot
- Université Clermont Auvergne, INRAE, UNH, Plateforme d'Exploration du Métabolisme, MetaboHUB, Clermont-Ferrand, France
| | - Anne Gonzalez de Peredo
- Institut de Pharmacologie et Biologie Structurale (IPBS), Université de Toulouse, CNRS, UPS, ProFI, Toulouse, France
| | - François Fenaille
- Université Paris Saclay, CEA, INRAE, Département Médicaments et Technologies pour la Santé (MTS), MetaboHUB, Gif-sur-Yvette, France
| | - Christine Carapito
- Laboratoire de Spectrométrie de Masse BioOrganique, Université de Strasbourg, CNRS, IPHC UMR 7178, ProFI, Strasbourg, France
| | - Yann Herault
- Université de Strasbourg, CNRS, INSERM, Institut Clinique de la Souris, Phenomin-ICS, Illkirch, France
- Université de Strasbourg, CNRS, INSERM, Institut de Génétique Biologie Moléculaire et Cellulaire, IGBMC, Illkirch, France
| | - Etienne A Thévenot
- Université Paris Saclay, CEA, INRAE, Département Médicaments et Technologies pour la Santé (MTS), MetaboHUB, Gif-sur-Yvette, France.
| |
Collapse
|
50
|
Gade IL, Schultz JG, Brøndum RF, Kjærgaard B, Nielsen-Kudsk JE, Andersen A, Kristensen SR, Honoré B. Putative Biomarkers for Acute Pulmonary Embolism in Exhaled Breath Condensate. J Clin Med 2021; 10:5165. [PMID: 34768685 PMCID: PMC8584843 DOI: 10.3390/jcm10215165] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2021] [Revised: 10/27/2021] [Accepted: 11/03/2021] [Indexed: 12/12/2022] Open
Abstract
Current diagnostic markers for pulmonary embolism (PE) are unspecific. We investigated the proteome of the exhaled breath condensate (EBC) in a porcine model of acute PE in order to identify putative diagnostic markers for PE. EBC was collected at baseline and after the induction of autologous intermediate-risk PE in 14 pigs, plus four negative control pigs. The protein profiles of the EBC were analyzed using label-free quantitative nano liquid chromatography-tandem mass spectrometry. A total of 897 proteins were identified in the EBCs from the pigs. Alterations were found in the levels of 145 different proteins after PE compared with the baseline and negative controls: albumin was among the most upregulated proteins, with 14-fold higher levels 2.5 h after PE (p-value: 0.02). The levels of 49 other proteins were between 1.3- and 17.1-fold higher after PE. The levels of 95 proteins were lower after PE. Neutrophil gelatinase-associated lipocalin (fold change 0.3, p-value < 0.01) was among the most reduced proteins 2.5 h after PE. A prediction model based on penalized regression identified five proteins including albumin and neutrophil gelatinase-associated lipocalin. The model was capable of discriminating baseline samples from EBC samples collected 2.5 h after PE correctly in 22 out of 27 samples. In conclusion, the EBC from pigs with acute PE contained several putative diagnostic markers of PE.
Collapse
Affiliation(s)
- Inger Lise Gade
- Department of Hematology and Clinical Cancer Research Center, Aalborg University Hospital, 9000 Aalborg, Denmark;
- Department of Clinical Medicine, Aalborg University, 9000 Aalborg, Denmark; (S.R.K.); (B.H.)
- Department of Clinical Biochemistry, Aalborg University Hospital, 9000 Aalborg, Denmark
| | - Jacob Gammelgaard Schultz
- Department of Cardiology, Aarhus University Hospital, 8200 Aarhus, Denmark; (J.G.S.); (J.E.N.-K.); (A.A.)
- Department of Clinical Medicine, Faculty of Health, Aarhus University, 8200 Aarhus, Denmark
| | - Rasmus Froberg Brøndum
- Department of Hematology and Clinical Cancer Research Center, Aalborg University Hospital, 9000 Aalborg, Denmark;
- Department of Clinical Medicine, Aalborg University, 9000 Aalborg, Denmark; (S.R.K.); (B.H.)
| | - Benedict Kjærgaard
- Department of Cardiothoracic Surgery, Aalborg University Hospital, 9000 Aalborg, Denmark;
| | - Jens Erik Nielsen-Kudsk
- Department of Cardiology, Aarhus University Hospital, 8200 Aarhus, Denmark; (J.G.S.); (J.E.N.-K.); (A.A.)
- Department of Clinical Medicine, Faculty of Health, Aarhus University, 8200 Aarhus, Denmark
| | - Asger Andersen
- Department of Cardiology, Aarhus University Hospital, 8200 Aarhus, Denmark; (J.G.S.); (J.E.N.-K.); (A.A.)
- Department of Clinical Medicine, Faculty of Health, Aarhus University, 8200 Aarhus, Denmark
| | - Søren Risom Kristensen
- Department of Clinical Medicine, Aalborg University, 9000 Aalborg, Denmark; (S.R.K.); (B.H.)
- Department of Clinical Biochemistry, Aalborg University Hospital, 9000 Aalborg, Denmark
| | - Bent Honoré
- Department of Clinical Medicine, Aalborg University, 9000 Aalborg, Denmark; (S.R.K.); (B.H.)
- Department of Biomedicine, Aarhus University, 8000 Aarhus, Denmark
| |
Collapse
|