1
|
Rempfer C, Hoernstein SN, van Gessel N, Graf AW, Spiegelhalder RP, Bertolini A, Bohlender LL, Parsons J, Decker EL, Reski R. Differential prolyl hydroxylation by six Physcomitrella prolyl-4 hydroxylases. Comput Struct Biotechnol J 2024; 23:2580-2594. [PMID: 39021582 PMCID: PMC11252719 DOI: 10.1016/j.csbj.2024.06.014] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2024] [Revised: 06/11/2024] [Accepted: 06/12/2024] [Indexed: 07/20/2024] Open
Abstract
Hydroxylation of prolines to 4-trans-hydroxyproline (Hyp) is mediated by prolyl-4 hydroxylases (P4Hs). In plants, Hyps occur in Hydroxyproline-rich glycoproteins (HRGPs), and are frequently O-glycosylated. While both modifications are important, e.g. for cell wall stability, they are undesired in plant-made pharmaceuticals. Sequence motifs for prolyl-hydroxylation were proposed but did not include data from mosses, such as Physcomitrella. We identified six moss P4Hs by phylogenetic reconstruction. Our analysis of 73 Hyps in 24 secretory proteins from multiple mass spectrometry datasets revealed that prolines near other prolines, alanine, serine, threonine and valine were preferentially hydroxylated. About 95 % of Hyps were predictable with combined established methods. In our data, AOV was the most frequent pattern. A combination of 443 AlphaFold models and MS data with 3000 prolines found Hyps mainly on protein surfaces in disordered regions. Moss-produced human erythropoietin (EPO) exhibited O-glycosylation with arabinose chains on two Hyps. This modification was significantly reduced in a p4h1 knock-out (KO) Physcomitrella mutant. Quantitative proteomics with different p4h mutants revealed specific changes in protein amounts, and a modified prolyl-hydroxylation pattern, suggesting a differential function of the Physcomitrella P4Hs. Quantitative RT-PCR revealed a differential effect of single p4h KOs on the expression of the other five p4h genes, suggesting a partial compensation of the mutation. AlphaFold-Multimer models for Physcomitrella P4H1 and its target EPO peptide superposed with the crystal structure of Chlamydomonas P4H1 suggested significant amino acids in the active centre of the enzyme and revealed differences between P4H1 and the other Physcomitrella P4Hs.
Collapse
Affiliation(s)
- Christine Rempfer
- Plant Biotechnology, Faculty of Biology, University of Freiburg, Schaenzlestr. 1, 79104 Freiburg, Germany
- Spemann Graduate School of Biology and Medicine SGBM, University of Freiburg, Albertstraße 19A, 79104 Freiburg, Germany
| | - Sebastian N.W. Hoernstein
- Plant Biotechnology, Faculty of Biology, University of Freiburg, Schaenzlestr. 1, 79104 Freiburg, Germany
| | - Nico van Gessel
- Plant Biotechnology, Faculty of Biology, University of Freiburg, Schaenzlestr. 1, 79104 Freiburg, Germany
| | - Andreas W. Graf
- Plant Biotechnology, Faculty of Biology, University of Freiburg, Schaenzlestr. 1, 79104 Freiburg, Germany
| | - Roxane P. Spiegelhalder
- Plant Biotechnology, Faculty of Biology, University of Freiburg, Schaenzlestr. 1, 79104 Freiburg, Germany
| | - Anne Bertolini
- Plant Biotechnology, Faculty of Biology, University of Freiburg, Schaenzlestr. 1, 79104 Freiburg, Germany
| | - Lennard L. Bohlender
- Plant Biotechnology, Faculty of Biology, University of Freiburg, Schaenzlestr. 1, 79104 Freiburg, Germany
| | - Juliana Parsons
- Plant Biotechnology, Faculty of Biology, University of Freiburg, Schaenzlestr. 1, 79104 Freiburg, Germany
| | - Eva L. Decker
- Plant Biotechnology, Faculty of Biology, University of Freiburg, Schaenzlestr. 1, 79104 Freiburg, Germany
| | - Ralf Reski
- Plant Biotechnology, Faculty of Biology, University of Freiburg, Schaenzlestr. 1, 79104 Freiburg, Germany
- Spemann Graduate School of Biology and Medicine SGBM, University of Freiburg, Albertstraße 19A, 79104 Freiburg, Germany
- Signalling Research Centres BIOSS and CIBSS, University of Freiburg, Schaenzlestr. 18, 79104, Germany
| |
Collapse
|
2
|
Bochenek M, Ciach MA, Smeets S, Beckers O, Vanderspikken J, Miasojedow B, Domżał B, Valkenborg D, Maes W, Gambin A. An Automated Analysis of Homocoupling Defects Using MALDI-MS and Open-Source Computer Software. JOURNAL OF THE AMERICAN SOCIETY FOR MASS SPECTROMETRY 2024; 35:2366-2375. [PMID: 39291650 DOI: 10.1021/jasms.4c00225] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/19/2024]
Abstract
Conjugated organic polymers have substantial potential for multiple applications but their properties are strongly influenced by structural defects such as homocoupling of monomer units and unexpected end-groups. Detecting and/or quantifying these defects requires complex experimental techniques, which hinder the optimization of synthesis protocols and fundamental studies on the influence of structural defects. Mass spectrometry offers a simple way to detect these defects but a manual analysis of many complex spectra is tedious and provides only approximate results. In this work, we develop a computational methodology for analyzing complex mass spectra of organic copolymers. Our method annotates spectra similarly to a human expert and provides quantitative information about the proportions of signal assigned to each ion. Our method is based on the open-source Masserstein algorithm, which we modify to handle large libraries of reference spectra required for annotating complex mass spectra of polymers. We develop a statistical methodology to analyze the quantitative annotations and compare the statistical distributions of structural defects in polymer chains between samples. We apply this methodology to analyze commercial and lab-made samples of a benchmark polymer and show that the samples differ both in the amount and in the types of structural defects.
Collapse
Affiliation(s)
- Maria Bochenek
- Faculty of Mathematics, Informatics and Mechanics, University of Warsaw, Banacha 2, Warsaw 02-097, Poland
| | - Michał Aleksander Ciach
- Faculty of Mathematics, Informatics and Mechanics, University of Warsaw, Banacha 2, Warsaw 02-097, Poland
- Data Science Institute, Hasselt University, Hasselt 3500, Belgium
- Department of Applied Biomedical Science, Faculty of Health Sciences, University of Malta, Msida, MSD 2080, Malta
| | - Sander Smeets
- Institute for Materials Research (IMO), Hasselt University, Agoralaan, Diepenbeek 3590, Belgium
- IMEC, Associated lab IMOMEC, Wetenschapspark 1, Diepenbeek,3590, Belgium
- Energyville, Thorpark, Genk 3600, Belgium
| | - Omar Beckers
- Institute for Materials Research (IMO), Hasselt University, Agoralaan, Diepenbeek 3590, Belgium
- IMEC, Associated lab IMOMEC, Wetenschapspark 1, Diepenbeek,3590, Belgium
- Energyville, Thorpark, Genk 3600, Belgium
| | - Jochen Vanderspikken
- Institute for Materials Research (IMO), Hasselt University, Agoralaan, Diepenbeek 3590, Belgium
- IMEC, Associated lab IMOMEC, Wetenschapspark 1, Diepenbeek,3590, Belgium
- Energyville, Thorpark, Genk 3600, Belgium
| | - Błażej Miasojedow
- Faculty of Mathematics, Informatics and Mechanics, University of Warsaw, Banacha 2, Warsaw 02-097, Poland
| | - Barbara Domżał
- Faculty of Mathematics, Informatics and Mechanics, University of Warsaw, Banacha 2, Warsaw 02-097, Poland
| | - Dirk Valkenborg
- Data Science Institute, Hasselt University, Hasselt 3500, Belgium
| | - Wouter Maes
- Institute for Materials Research (IMO), Hasselt University, Agoralaan, Diepenbeek 3590, Belgium
- IMEC, Associated lab IMOMEC, Wetenschapspark 1, Diepenbeek,3590, Belgium
- Energyville, Thorpark, Genk 3600, Belgium
| | - Anna Gambin
- Faculty of Mathematics, Informatics and Mechanics, University of Warsaw, Banacha 2, Warsaw 02-097, Poland
| |
Collapse
|
3
|
Sweatt AJ, Griffiths CD, Groves SM, Paudel BB, Wang L, Kashatus DF, Janes KA. Proteome-wide copy-number estimation from transcriptomics. Mol Syst Biol 2024:10.1038/s44320-024-00064-3. [PMID: 39333715 DOI: 10.1038/s44320-024-00064-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2023] [Revised: 08/22/2024] [Accepted: 09/02/2024] [Indexed: 09/29/2024] Open
Abstract
Protein copy numbers constrain systems-level properties of regulatory networks, but proportional proteomic data remain scarce compared to RNA-seq. We related mRNA to protein statistically using best-available data from quantitative proteomics and transcriptomics for 4366 genes in 369 cell lines. The approach starts with a protein's median copy number and hierarchically appends mRNA-protein and mRNA-mRNA dependencies to define an optimal gene-specific model linking mRNAs to protein. For dozens of cell lines and primary samples, these protein inferences from mRNA outmatch stringent null models, a count-based protein-abundance repository, empirical mRNA-to-protein ratios, and a proteogenomic DREAM challenge winner. The optimal mRNA-to-protein relationships capture biological processes along with hundreds of known protein-protein complexes, suggesting mechanistic relationships. We use the method to identify a viral-receptor abundance threshold for coxsackievirus B3 susceptibility from 1489 systems-biology infection models parameterized by protein inference. When applied to 796 RNA-seq profiles of breast cancer, inferred copy-number estimates collectively re-classify 26-29% of luminal tumors. By adopting a gene-centered perspective of mRNA-protein covariation across different biological contexts, we achieve accuracies comparable to the technical reproducibility of contemporary proteomics.
Collapse
Affiliation(s)
- Andrew J Sweatt
- Department of Biomedical Engineering, University of Virginia, Charlottesville, VA, 22908, USA
| | - Cameron D Griffiths
- Department of Biomedical Engineering, University of Virginia, Charlottesville, VA, 22908, USA
| | - Sarah M Groves
- Department of Biomedical Engineering, University of Virginia, Charlottesville, VA, 22908, USA
| | - B Bishal Paudel
- Department of Biomedical Engineering, University of Virginia, Charlottesville, VA, 22908, USA
| | - Lixin Wang
- Department of Biomedical Engineering, University of Virginia, Charlottesville, VA, 22908, USA
| | - David F Kashatus
- Department of Microbiology, Immunology & Cancer Biology, University of Virginia, Charlottesville, VA, 22908, USA
| | - Kevin A Janes
- Department of Biomedical Engineering, University of Virginia, Charlottesville, VA, 22908, USA.
- Department of Biochemistry & Molecular Genetics, University of Virginia, Charlottesville, VA, 22908, USA.
| |
Collapse
|
4
|
de Baat ML, Narain-Ford DM, de Weert J, Giesen D, Beeltje H, Hamers T, Helmus R, de Voogt P, Kraak MHS. Passive sampler housing and sorbent type determine aquatic micropollutant adsorption and subsequent bioassay responses. ENVIRONMENTAL POLLUTION (BARKING, ESSEX : 1987) 2024; 357:124488. [PMID: 38960122 DOI: 10.1016/j.envpol.2024.124488] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/26/2024] [Revised: 06/24/2024] [Accepted: 06/30/2024] [Indexed: 07/05/2024]
Abstract
The combination of integrative passive sampling and bioassays is a promising approach for monitoring the toxicity of polar organic contaminants in aquatic environments. However, the design of integrative passive samplers can affect the accumulation of compounds and therewith the bioassay responses. The present study aimed to determine the effects of sampler housing and sorbent type on the number of chemical features accumulated in polar passive samplers and the subsequent bioassay responses to extracts of these samplers. To this end, four integrative passive sampler configurations, resulting from the combination of polar organic chemical integrative sampler (POCIS) and Speedisk housings with hydrophilic-lipophilic balance and hydrophilic divinylbenzene sorbents, were simultaneously exposed at reference and contaminated surface water locations. The passive sampler extracts were subjected to chemical non-target screening and a battery of five bioassays. Extracts from POCIS contained a higher number of chemical features and caused higher bioassay responses in 91% of cases, while the two sorbents accumulated similar numbers of features and caused equally frequent but different bioassay responses. Hence, the passive sampler design critically affected the number of accumulated polar organic contaminants as well as their toxicity, highlighting the importance of passive sampler design for effect-based water quality assessment.
Collapse
Affiliation(s)
- M L de Baat
- Department of Freshwater and Marine Ecology (FAME), Institute for Biodiversity and Ecosystem Dynamics (IBED), University of Amsterdam, Science Park 904, 1098 XH, Amsterdam, the Netherlands.
| | - D M Narain-Ford
- Department of Freshwater and Marine Ecology (FAME), Institute for Biodiversity and Ecosystem Dynamics (IBED), University of Amsterdam, Science Park 904, 1098 XH, Amsterdam, the Netherlands; National Institute for Public Health and the Environment (RIVM), Antonie van Leeuwenhoeklaan 9, 3721 MA, Bilthoven, the Netherlands
| | - J de Weert
- Deltares, Daltonlaan 600, 3584 BK, Utrecht, the Netherlands; Water Authority of Rijnland, Archimedesweg 1, 2333 CM, Leiden, the Netherlands
| | - D Giesen
- Deltares, Daltonlaan 600, 3584 BK, Utrecht, the Netherlands; World Business Council for Sustainable Development, Avenue du Bouchet 2bis, 1209, Geneva, Switzerland
| | - H Beeltje
- Environmental Modelling, Sensing and Analysis, TNO, Utrecht, the Netherlands; AQUON, De Blomboogerd 12, 4003 BX, Tiel, the Netherlands
| | - T Hamers
- Amsterdam Institute for Life and Environment (A-LIFE), Vrije Universiteit Amsterdam, De Boelelaan 1085, 1081 HV, Amsterdam, the Netherlands
| | - R Helmus
- Department of Freshwater and Marine Ecology (FAME), Institute for Biodiversity and Ecosystem Dynamics (IBED), University of Amsterdam, Science Park 904, 1098 XH, Amsterdam, the Netherlands
| | - P de Voogt
- Department of Freshwater and Marine Ecology (FAME), Institute for Biodiversity and Ecosystem Dynamics (IBED), University of Amsterdam, Science Park 904, 1098 XH, Amsterdam, the Netherlands
| | - M H S Kraak
- Department of Freshwater and Marine Ecology (FAME), Institute for Biodiversity and Ecosystem Dynamics (IBED), University of Amsterdam, Science Park 904, 1098 XH, Amsterdam, the Netherlands
| |
Collapse
|
5
|
Zhang X, Zhang X, Yang H, Cheng X, Zhu YG, Ma J, Cui D, Zhang Z. Spatial and temporal changes of air quality in Shandong Province from 2016 to 2022 and model prediction. JOURNAL OF HAZARDOUS MATERIALS 2024; 477:135408. [PMID: 39096641 DOI: 10.1016/j.jhazmat.2024.135408] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/07/2024] [Revised: 07/30/2024] [Accepted: 07/31/2024] [Indexed: 08/05/2024]
Abstract
This study investigates the spatial and temporal dynamics of air quality in Shandong Province from 2016 to 2022. The Air Quality Index (AQI) showed a seasonal pattern, with higher values in winter due to temperature inversions and heating emissions, and lower values in summer aided by favorable dispersion conditions. The AQI improved significantly, decreasing by approximately 39.4 % from 6.44 to 3.90. Coastal cities exhibited better air quality than inland areas, influenced by industrial activities and geographical features. For instance, Zibo's geography restricts pollutant dispersion, resulting in poor air quality. CO levels remained stable, while O3 increased seasonally due to photochemical reactions in summer, with correlation coefficients indicating a strong positive correlation with temperature (r = 0.65). Winter saw elevated NO2 levels linked to heating and vehicular emissions, with an observed increase in correlation with AQI (r = 0.78). PM2.5 and PM10 concentrations were higher in colder months due to heating and atmospheric dust, showing a significant decrease of 45 % and 40 %, respectively, over the study period. Predictive modeling forecasts continued air quality improvements, contingent on sustained policy enforcement and technological advancements. This approach provides a comprehensive framework for future air quality management and improvement.
Collapse
Affiliation(s)
- Xu Zhang
- School of Municipal and Environmental Engineering, Shandong Jianzhu University, Jinan 250101, China; Research Center for Eco-Environmental Sciences, Chinese Academy of Sciences, Beijing 100085, China
| | - Xinrui Zhang
- School of Municipal and Environmental Engineering, Shandong Jianzhu University, Jinan 250101, China
| | - Huanhuan Yang
- School of Life Sciences, Qilu Normal University, Jinan 250200, China.
| | - Xu Cheng
- Institute for Advanced Technology, Shandong University, Jinan 250061, China
| | - Yong Guan Zhu
- Research Center for Eco-Environmental Sciences, Chinese Academy of Sciences, Beijing 100085, China
| | - Jun Ma
- School of Environment, Harbin Institute of Technology, Harbin 150090, China
| | - Dayong Cui
- School of Life Sciences, Qilu Normal University, Jinan 250200, China
| | - Zhibin Zhang
- School of Municipal and Environmental Engineering, Shandong Jianzhu University, Jinan 250101, China; School of Environment, Harbin Institute of Technology, Harbin 150090, China.
| |
Collapse
|
6
|
Zhu C, Liu LY, Ha A, Yamaguchi TN, Zhu H, Hugh-White R, Livingstone J, Patel Y, Kislinger T, Boutros PC. moPepGen: Rapid and Comprehensive Identification of Non-canonical Peptides. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.03.28.587261. [PMID: 38585946 PMCID: PMC10996593 DOI: 10.1101/2024.03.28.587261] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/09/2024]
Abstract
Gene expression is a multi-step transformation of biological information from its storage form (DNA) into functional forms (protein and some RNAs). Regulatory activities at each step of this transformation multiply a single gene into a myriad of proteoforms. Proteogenomics is the study of how genomic and transcriptomic variation creates this proteomic diversity, and is limited by the challenges of modeling the complexities of gene-expression. We therefore created moPepGen, a graph-based algorithm that comprehensively generates non-canonical peptides in linear time. moPepGen works with multiple technologies, in multiple species and on all types of genetic and transcriptomic data. In human cancer proteomes, it enumerates previously unobservable noncanonical peptides arising from germline and somatic genomic variants, noncoding open reading frames, RNA fusions and RNA circularization. By enabling efficient detection and quantitation of previously hidden proteins in both existing and new proteomic data, moPepGen facilitates all proteogenomics applications. It is available at: https://github.com/uclahs-cds/package-moPepGen .
Collapse
|
7
|
Heuckeroth S, Damiani T, Smirnov A, Mokshyna O, Brungs C, Korf A, Smith JD, Stincone P, Dreolin N, Nothias LF, Hyötyläinen T, Orešič M, Karst U, Dorrestein PC, Petras D, Du X, van der Hooft JJJ, Schmid R, Pluskal T. Reproducible mass spectrometry data processing and compound annotation in MZmine 3. Nat Protoc 2024; 19:2597-2641. [PMID: 38769143 DOI: 10.1038/s41596-024-00996-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2023] [Accepted: 02/26/2024] [Indexed: 05/22/2024]
Abstract
Untargeted mass spectrometry (MS) experiments produce complex, multidimensional data that are practically impossible to investigate manually. For this reason, computational pipelines are needed to extract relevant information from raw spectral data and convert it into a more comprehensible format. Depending on the sample type and/or goal of the study, a variety of MS platforms can be used for such analysis. MZmine is an open-source software for the processing of raw spectral data generated by different MS platforms. Examples include liquid chromatography-MS, gas chromatography-MS and MS-imaging. These data might typically be associated with various applications including metabolomics and lipidomics. Moreover, the third version of the software, described herein, supports the processing of ion mobility spectrometry (IMS) data. The present protocol provides three distinct procedures to perform feature detection and annotation of untargeted MS data produced by different instrumental setups: liquid chromatography-(IMS-)MS, gas chromatography-MS and (IMS-)MS imaging. For training purposes, example datasets are provided together with configuration batch files (i.e., list of processing steps and parameters) to allow new users to easily replicate the described workflows. Depending on the number of data files and available computing resources, we anticipate this to take between 2 and 24 h for new MZmine users and nonexperts. Within each procedure, we provide a detailed description for all processing parameters together with instructions/recommendations for their optimization. The main generated outputs are represented by aligned feature tables and fragmentation spectra lists that can be used by other third-party tools for further downstream analysis.
Collapse
Affiliation(s)
| | - Tito Damiani
- Institute of Organic Chemistry and Biochemistry of the Czech Academy of Sciences, Prague, Czech Republic
| | | | - Olena Mokshyna
- Institute of Organic Chemistry and Biochemistry of the Czech Academy of Sciences, Prague, Czech Republic
| | - Corinna Brungs
- Institute of Organic Chemistry and Biochemistry of the Czech Academy of Sciences, Prague, Czech Republic
| | - Ansgar Korf
- Institute of Organic Chemistry and Biochemistry of the Czech Academy of Sciences, Prague, Czech Republic
| | - Joshua David Smith
- Institute of Organic Chemistry and Biochemistry of the Czech Academy of Sciences, Prague, Czech Republic
- First Faculty of Medicine, Charles University, Prague, Czech Republic
| | | | | | - Louis-Félix Nothias
- University of Geneva, Geneva, Switzerland
- Université Côte d'Azur, CNRS, ICN, Nice, France
| | | | - Matej Orešič
- Örebro University, Örebro, Sweden
- University of Turku and Åbo Akademi University, Turku, Finland
| | - Uwe Karst
- University of Münster, Münster, Germany
| | - Pieter C Dorrestein
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA, USA
| | - Daniel Petras
- University of Tuebingen, Tuebingen, Germany
- University of California Riverside, Riverside, CA, USA
| | - Xiuxia Du
- University of North Carolina at Charlotte, Charlotte, NC, USA
| | - Justin J J van der Hooft
- Wageningen University & Research, Wageningen, the Netherlands
- University of Johannesburg, Johannesburg, South Africa
| | - Robin Schmid
- University of Münster, Münster, Germany.
- Institute of Organic Chemistry and Biochemistry of the Czech Academy of Sciences, Prague, Czech Republic.
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA, USA.
| | - Tomáš Pluskal
- Institute of Organic Chemistry and Biochemistry of the Czech Academy of Sciences, Prague, Czech Republic.
| |
Collapse
|
8
|
Welp LM, Sachsenberg T, Wulf A, Chernev A, Horokhovskyi Y, Neumann P, Pašen M, Siraj A, Raabe M, Johannsson S, Schmitzova J, Netz E, Pfeuffer J, He Y, Fritzemeier K, Delanghe B, Viner R, Vos SM, Cramer P, Ficner R, Liepe J, Kohlbacher O, Urlaub H. Chemical crosslinking extends and complements UV crosslinking in analysis of RNA/DNA nucleic acid-protein interaction sites by mass spectrometry. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.08.29.610268. [PMID: 39257782 PMCID: PMC11383681 DOI: 10.1101/2024.08.29.610268] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/12/2024]
Abstract
UV (ultra-violet) crosslinking with mass spectrometry (XL-MS) has been established for identifying RNA-and DNA-binding proteins along with their domains and amino acids involved. Here, we explore chemical XL-MS for RNA-protein, DNA-protein, and nucleotide-protein complexes in vitro and in vivo . We introduce a specialized nucleotide-protein-crosslink search engine, NuXL, for robust and fast identification of such crosslinks at amino acid resolution. Chemical XL-MS complements UV XL-MS by generating different crosslink species, increasing crosslinked protein yields in vivo almost four-fold and thus it expands the structural information accessible via XL-MS. Our workflow facilitates integrative structural modelling of nucleic acid-protein complexes and adds spatial information to the described RNA-binding properties of enzymes, for which crosslinking sites are often observed close to their cofactor-binding domains. In vivo UV and chemical XL-MS data from E. coli cells analysed by NuXL establish a comprehensive nucleic acid-protein crosslink inventory with crosslink sites at amino acid level for more than 1500 proteins. Our new workflow combined with the dedicated NuXL search engine identified RNA crosslinks that cover most RNA-binding proteins, with DNA and RNA crosslinks detected in transcriptional repressors and activators.
Collapse
|
9
|
Jiang Y, Rex DA, Schuster D, Neely BA, Rosano GL, Volkmar N, Momenzadeh A, Peters-Clarke TM, Egbert SB, Kreimer S, Doud EH, Crook OM, Yadav AK, Vanuopadath M, Hegeman AD, Mayta M, Duboff AG, Riley NM, Moritz RL, Meyer JG. Comprehensive Overview of Bottom-Up Proteomics Using Mass Spectrometry. ACS MEASUREMENT SCIENCE AU 2024; 4:338-417. [PMID: 39193565 PMCID: PMC11348894 DOI: 10.1021/acsmeasuresciau.3c00068] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/15/2023] [Revised: 05/03/2024] [Accepted: 05/03/2024] [Indexed: 08/29/2024]
Abstract
Proteomics is the large scale study of protein structure and function from biological systems through protein identification and quantification. "Shotgun proteomics" or "bottom-up proteomics" is the prevailing strategy, in which proteins are hydrolyzed into peptides that are analyzed by mass spectrometry. Proteomics studies can be applied to diverse studies ranging from simple protein identification to studies of proteoforms, protein-protein interactions, protein structural alterations, absolute and relative protein quantification, post-translational modifications, and protein stability. To enable this range of different experiments, there are diverse strategies for proteome analysis. The nuances of how proteomic workflows differ may be challenging to understand for new practitioners. Here, we provide a comprehensive overview of different proteomics methods. We cover from biochemistry basics and protein extraction to biological interpretation and orthogonal validation. We expect this Review will serve as a handbook for researchers who are new to the field of bottom-up proteomics.
Collapse
Affiliation(s)
- Yuming Jiang
- Department
of Computational Biomedicine, Cedars Sinai
Medical Center, Los Angeles, California 90048, United States
- Smidt Heart
Institute, Cedars Sinai Medical Center, Los Angeles, California 90048, United States
- Advanced
Clinical Biosystems Research Institute, Cedars Sinai Medical Center, Los
Angeles, California 90048, United States
| | - Devasahayam Arokia
Balaya Rex
- Center for
Systems Biology and Molecular Medicine, Yenepoya Research Centre, Yenepoya (Deemed to be University), Mangalore 575018, India
| | - Dina Schuster
- Department
of Biology, Institute of Molecular Systems
Biology, ETH Zurich, Zurich 8093, Switzerland
- Department
of Biology, Institute of Molecular Biology
and Biophysics, ETH Zurich, Zurich 8093, Switzerland
- Laboratory
of Biomolecular Research, Division of Biology and Chemistry, Paul Scherrer Institute, Villigen 5232, Switzerland
| | - Benjamin A. Neely
- Chemical
Sciences Division, National Institute of
Standards and Technology, NIST, Charleston, South Carolina 29412, United States
| | - Germán L. Rosano
- Mass
Spectrometry
Unit, Institute of Molecular and Cellular
Biology of Rosario, Rosario, 2000 Argentina
| | - Norbert Volkmar
- Department
of Biology, Institute of Molecular Systems
Biology, ETH Zurich, Zurich 8093, Switzerland
| | - Amanda Momenzadeh
- Department
of Computational Biomedicine, Cedars Sinai
Medical Center, Los Angeles, California 90048, United States
- Smidt Heart
Institute, Cedars Sinai Medical Center, Los Angeles, California 90048, United States
- Advanced
Clinical Biosystems Research Institute, Cedars Sinai Medical Center, Los
Angeles, California 90048, United States
| | - Trenton M. Peters-Clarke
- Department
of Pharmaceutical Chemistry, University
of California—San Francisco, San Francisco, California, 94158, United States
| | - Susan B. Egbert
- Department
of Chemistry, University of Manitoba, Winnipeg, Manitoba, R3T 2N2 Canada
| | - Simion Kreimer
- Smidt Heart
Institute, Cedars Sinai Medical Center, Los Angeles, California 90048, United States
- Advanced
Clinical Biosystems Research Institute, Cedars Sinai Medical Center, Los
Angeles, California 90048, United States
| | - Emma H. Doud
- Center
for Proteome Analysis, Indiana University
School of Medicine, Indianapolis, Indiana, 46202-3082, United States
| | - Oliver M. Crook
- Oxford
Protein Informatics Group, Department of Statistics, University of Oxford, Oxford OX1 3LB, United
Kingdom
| | - Amit Kumar Yadav
- Translational
Health Science and Technology Institute, NCR Biotech Science Cluster 3rd Milestone Faridabad-Gurgaon
Expressway, Faridabad, Haryana 121001, India
| | | | - Adrian D. Hegeman
- Departments
of Horticultural Science and Plant and Microbial Biology, University of Minnesota, Twin Cities, Minnesota 55108, United States
| | - Martín
L. Mayta
- School
of Medicine and Health Sciences, Center for Health Sciences Research, Universidad Adventista del Plata, Libertador San Martin 3103, Argentina
- Molecular
Biology Department, School of Pharmacy and Biochemistry, Universidad Nacional de Rosario, Rosario 2000, Argentina
| | - Anna G. Duboff
- Department
of Chemistry, University of Washington, Seattle, Washington 98195, United States
| | - Nicholas M. Riley
- Department
of Chemistry, University of Washington, Seattle, Washington 98195, United States
| | - Robert L. Moritz
- Institute
for Systems biology, Seattle, Washington 98109, United States
| | - Jesse G. Meyer
- Department
of Computational Biomedicine, Cedars Sinai
Medical Center, Los Angeles, California 90048, United States
- Smidt Heart
Institute, Cedars Sinai Medical Center, Los Angeles, California 90048, United States
- Advanced
Clinical Biosystems Research Institute, Cedars Sinai Medical Center, Los
Angeles, California 90048, United States
| |
Collapse
|
10
|
Lynn KS, Tang HY, Lo CJ, Yang CH, Tseng YT, Cheng ML. MRMQuant: Automated MRM Data Quantitation for Large-Scale Targeted Metabolomics Analysis. Anal Chem 2024; 96:13625-13635. [PMID: 39127919 PMCID: PMC11339730 DOI: 10.1021/acs.analchem.4c02462] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2024] [Revised: 07/27/2024] [Accepted: 08/02/2024] [Indexed: 08/12/2024]
Abstract
Multiple reaction monitoring (MRM) is a powerful and popular technique used for metabolite quantification in targeted metabolomics. Accurate and consistent quantitation of metabolites from the MRM data is essential for subsequent analyses. Here, we developed an automated tool, MRMQuant, for targeted metabolomic quantitation using high-throughput liquid chromatography-tandem mass spectrometry MRM data to provide users with an easy-to-use tool for accurate MRM data quantitation with minimal human intervention. This tool has many user-friendly functions and features to inspect and correct the quantitation results as required. MRMQuant possesses the following features to ensure accurate quantitation: (1) dynamic signal smoothing, (2) automatic deconvolution of coeluted peaks, (3) absolute quantitation via standard curves and/or internal standards, (4) visualized inspection and correction, (5) corrections applicable to multiple samples, and (6) batch-effect correction.
Collapse
Affiliation(s)
- Ke-Shiuan Lynn
- Department
of Mathematics, Fu Jen Catholic University, New Taipei City 24205, Taiwan
| | - Hsiang-Yu Tang
- Metabolomics
Core Laboratory, Healthy Aging Research Center, Chang Gung University, Taoyuan 333, Taiwan
| | - Chi-Jen Lo
- Metabolomics
Core Laboratory, Healthy Aging Research Center, Chang Gung University, Taoyuan 333, Taiwan
| | - Cheng-Hung Yang
- Metabolomics
Core Laboratory, Healthy Aging Research Center, Chang Gung University, Taoyuan 333, Taiwan
- Department
of Biomedical Sciences, College of Medicine, Chang Gung University, Taoyuan 333, Taiwan
| | - Yi-Ting Tseng
- Metabolomics
Core Laboratory, Healthy Aging Research Center, Chang Gung University, Taoyuan 333, Taiwan
- Department
of Biomedical Sciences, College of Medicine, Chang Gung University, Taoyuan 333, Taiwan
| | - Mei-Ling Cheng
- Metabolomics
Core Laboratory, Healthy Aging Research Center, Chang Gung University, Taoyuan 333, Taiwan
- Department
of Biomedical Sciences, College of Medicine, Chang Gung University, Taoyuan 333, Taiwan
| |
Collapse
|
11
|
Xiong Y, Mueller RS, Feng S, Guo X, Pan C. Proteomic stable isotope probing with an upgraded Sipros algorithm for improved identification and quantification of isotopically labeled proteins. MICROBIOME 2024; 12:148. [PMID: 39118147 PMCID: PMC11313024 DOI: 10.1186/s40168-024-01866-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/28/2024] [Accepted: 07/02/2024] [Indexed: 08/10/2024]
Abstract
BACKGROUND Proteomic stable isotope probing (SIP) is used in microbial ecology to trace a non-radioactive isotope from a labeled substrate into de novo synthesized proteins in specific populations that are actively assimilating and metabolizing the substrate in a complex microbial community. The Sipros algorithm is used in proteomic SIP to identify variably labeled proteins and quantify their isotopic enrichment levels (atom%) by performing enrichment-resolved database searching. RESULTS In this study, Sipros was upgraded to improve the labeled protein identification, isotopic enrichment quantification, and database searching speed. The new Sipros 4 was compared with the existing Sipros 3, Calisp, and MetaProSIP in terms of the number of identifications and the accuracy and precision of atom% quantification on both the peptide and protein levels using standard E. coli cultures with 1.07 atom%, 2 atom%, 5 atom%, 25 atom%, 50 atom%, and 99 atom% 13C enrichment. Sipros 4 outperformed Calisp and MetaProSIP across all samples, especially in samples with ≥ 5 atom% 13C labeling. The computational speed on Sipros 4 was > 20 times higher than Sipros 3 and was on par with the overall speed of Calisp- and MetaProSIP-based pipelines. Sipros 4 also demonstrated higher sensitivity for the detection of labeled proteins in two 13C-SIP experiments on a real-world soil community. The labeled proteins were used to trace 13C from 13C-methanol and 13C-labeled plant exudates to the consuming soil microorganisms and their newly synthesized proteins. CONCLUSION Overall, Sipros 4 improved the quality of the proteomic SIP results and reduced the computational cost of SIP database searching, which will make proteomic SIP more useful and accessible to the border community. Video Abstract.
Collapse
Affiliation(s)
- Yi Xiong
- School of Biological Sciences, University of Oklahoma, Norman, OK, USA
| | - Ryan S Mueller
- Department of Microbiology, Oregon State University, Corvallis, OR, USA
| | - Shichao Feng
- Department of Computer Science and Engineering, University of North Texas, Denton, TX, USA
| | - Xuan Guo
- Department of Computer Science and Engineering, University of North Texas, Denton, TX, USA
| | - Chongle Pan
- School of Biological Sciences, University of Oklahoma, Norman, OK, USA.
- School of Computer Science, University of Oklahoma, Norman, OK, USA.
| |
Collapse
|
12
|
Dang TC, Fields L, Li L. MotifQuest: An Automated Pipeline for Motif Database Creation to Improve Peptidomics Database Searching Programs. JOURNAL OF THE AMERICAN SOCIETY FOR MASS SPECTROMETRY 2024; 35:1902-1912. [PMID: 39058243 DOI: 10.1021/jasms.4c00192] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/28/2024]
Abstract
Endogenous peptides are an abundant and versatile class of biomolecules with vital roles pertinent to the functionality of the nervous, endocrine, and immune systems and others. Mass spectrometry stands as a premier technique for identifying endogenous peptides, yet the field still faces challenges due to the lack of optimized computational resources for reliable raw mass spectra analysis and interpretation. Current database searching programs can exhibit discrepancies due to the unique properties of endogenous peptides, which typically require specialized search considerations. Herein, we present a high throughput, novel scoring algorithm for the extraction and ranking of conserved amino acid sequence motifs within any endogenous peptide database. Motifs are conserved patterns across organisms, representing sequence moieties crucial for biological functions, including maintenance of homeostasis. MotifQuest, our novel motif database generation algorithm, is designed to work in partnership with EndoGenius, a program optimized for database searching of endogenous peptides and that is powered by a motif database to capitalize on biological context to produce identifications. MotifQuest aims to quickly develop motif databases without any prior knowledge, a laborious task not possible with traditional sequence alignment resources. In this work we illustrate the utility of MotifQuest to expand EndoGenius' identification utility to other endogenous peptides by showcasing its ability to identify antimicrobial peptides. Additionally, we discuss the potential utility of MotifQuest to parse out motifs from a FASTA database file that can be further validated as new peptide drug candidates.
Collapse
Affiliation(s)
- Tina C Dang
- School of Pharmacy, University of Wisconsin-Madison, 777 Highland Avenue, Madison, Wisconsin 53705, United States
| | - Lauren Fields
- Department of Chemistry, University of Wisconsin-Madison, 1101 University Avenue, Madison, Wisconsin 53706, United States
| | - Lingjun Li
- School of Pharmacy, University of Wisconsin-Madison, 777 Highland Avenue, Madison, Wisconsin 53705, United States
- Department of Chemistry, University of Wisconsin-Madison, 1101 University Avenue, Madison, Wisconsin 53706, United States
| |
Collapse
|
13
|
Bueschl C, Riquelme G, Zabalegui N, Rey MA, Monge ME. Tidy-Direct-to-MS: An Open-Source Data-Processing Pipeline for Direct Mass Spectrometry-Based Metabolomics Experiments. J Proteome Res 2024; 23:3208-3216. [PMID: 38833568 DOI: 10.1021/acs.jproteome.3c00784] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/06/2024]
Abstract
Direct-to-Mass Spectrometry and ambient ionization techniques can be used for biochemical fingerprinting in a fast way. Data processing is typically accomplished with vendor-provided software tools. Here, a novel, open-source functionality, entitled Tidy-Direct-to-MS, was developed for data processing of direct-to-MS data sets. It allows for fast and user-friendly processing using different modules for optional sample position detection and separation, mass-to-charge ratio drift detection and correction, consensus spectra calculation, and bracketing across sample positions as well as feature abundance calculation. The tool also provides functionality for the automated comparison of different sets of parameters, thereby assisting the user in the complex task of finding an optimal combination to maximize the total number of detected features while also checking for the detection of user-provided reference features. In addition, Tidy-Direct-to-MS has the capability for data quality review and subsequent data analysis, thereby simplifying the workflow of untargeted ambient MS-based metabolomics studies. Tidy-Direct-to-MS is implemented in the Python programming language as part of the TidyMS library and can thus be easily extended. Capabilities of Tidy-Direct-to-MS are showcased in a data set acquired in a marine metabolomics study reported in MetaboLights (MTBLS1198) using a transmission mode Direct Analysis in Real Time-Mass Spectrometry (TM-DART-MS)-based method.
Collapse
Affiliation(s)
- Christoph Bueschl
- Centro de Investigaciones en Bionanociencias (CIBION), Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Godoy Cruz 2390, C1425FQD Ciudad de Buenos Aires, Argentina
| | - Gabriel Riquelme
- Centro de Investigaciones en Bionanociencias (CIBION), Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Godoy Cruz 2390, C1425FQD Ciudad de Buenos Aires, Argentina
| | - Nicolás Zabalegui
- Centro de Investigaciones en Bionanociencias (CIBION), Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Godoy Cruz 2390, C1425FQD Ciudad de Buenos Aires, Argentina
| | - Maximilian A Rey
- Centro de Investigaciones en Bionanociencias (CIBION), Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Godoy Cruz 2390, C1425FQD Ciudad de Buenos Aires, Argentina
- Departamento de Química Inorgánica Analítica y Química Física, Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires, Ciudad Universitaria, C1428EGA Buenos Aires, Argentina
| | - María Eugenia Monge
- Centro de Investigaciones en Bionanociencias (CIBION), Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Godoy Cruz 2390, C1425FQD Ciudad de Buenos Aires, Argentina
| |
Collapse
|
14
|
Fields L, Vu NQ, Dang TC, Yen HC, Ma M, Wu W, Gray M, Li L. EndoGenius: Optimized Neuropeptide Identification from Mass Spectrometry Datasets. J Proteome Res 2024; 23:3041-3051. [PMID: 38426863 PMCID: PMC11296898 DOI: 10.1021/acs.jproteome.3c00758] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/02/2024]
Abstract
Neuropeptides represent a unique class of signaling molecules that have garnered much attention but require special consideration when identifications are gleaned from mass spectra. With highly variable sequence lengths, neuropeptides must be analyzed in their endogenous state. Further, neuropeptides share great homology within families, differing by as little as a single amino acid residue, complicating even routine analyses and necessitating optimized computational strategies for confident and accurate identifications. We present EndoGenius, a database searching strategy designed specifically for elucidating neuropeptide identifications from mass spectra by leveraging optimized peptide-spectrum matching approaches, an expansive motif database, and a novel scoring algorithm to achieve broader representation of the neuropeptidome and minimize reidentification. This work describes an algorithm capable of reporting more neuropeptide identifications at 1% false-discovery rate than alternative software in five Callinectes sapidus neuronal tissue types.
Collapse
Affiliation(s)
- Lauren Fields
- Department of Chemistry, University of Wisconsin-Madison, 1101 University Avenue, Madison, WI 53706, USA
| | - Nhu Q. Vu
- Department of Chemistry, University of Wisconsin-Madison, 1101 University Avenue, Madison, WI 53706, USA
| | - Tina C. Dang
- School of Pharmacy, University of Wisconsin-Madison, 777 Highland Avenue, Madison, WI 53705, USA
| | - Hsu-Ching Yen
- Department of Biochemistry, University of Wisconsin-Madison, 433 Babcock Drive, Madison, WI 53706, USA
| | - Min Ma
- School of Pharmacy, University of Wisconsin-Madison, 777 Highland Avenue, Madison, WI 53705, USA
| | - Wenxin Wu
- Department of Chemistry, University of Wisconsin-Madison, 1101 University Avenue, Madison, WI 53706, USA
| | - Mitchell Gray
- Department of Chemistry, University of Wisconsin-Madison, 1101 University Avenue, Madison, WI 53706, USA
| | - Lingjun Li
- Department of Chemistry, University of Wisconsin-Madison, 1101 University Avenue, Madison, WI 53706, USA
- School of Pharmacy, University of Wisconsin-Madison, 777 Highland Avenue, Madison, WI 53705, USA
- Lachman Institute for Pharmaceutical Development, School of Pharmacy, University of Wisconsin-Madison, Madison, WI 53705, USA
- Wisconsin Center for NanoBioSystems, School of Pharmacy, University of Wisconsin-Madison, Madison, WI 53705, USA
| |
Collapse
|
15
|
Duan H, Ning Z, Zhang A, Figeys D. Spectral entropy as a measure of the metaproteome complexity. Proteomics 2024; 24:e2300570. [PMID: 38794877 DOI: 10.1002/pmic.202300570] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2024] [Revised: 05/13/2024] [Accepted: 05/14/2024] [Indexed: 05/26/2024]
Abstract
The diversity and complexity of the microbiome's genomic landscape are not always mirrored in its proteomic profile. Despite the anticipated proteomic diversity, observed complexities of microbiome samples are often lower than expected. Two main factors contribute to this discrepancy: limitations in mass spectrometry's detection sensitivity and bioinformatics challenges in metaproteomics identification. This study introduces a novel approach to evaluating sample complexity directly at the full mass spectrum (MS1) level rather than relying on peptide identifications. When analyzing under identical mass spectrometry conditions, microbiome samples displayed significantly higher complexity, as evidenced by the spectral entropy and peptide candidate entropy, compared to single-species samples. The research provides solid evidence for the complexity of microbiome in proteomics indicating the optimization potential of the bioinformatics workflow.
Collapse
Affiliation(s)
- Haonan Duan
- School of Pharmaceutical Sciences, Faculty of Medicine, University of Ottawa, Ottawa, Ontario, Canada
- Ottawa Institute of Systems Biology, University of Ottawa, Ottawa, Ontario, Canada
| | - Zhibin Ning
- School of Pharmaceutical Sciences, Faculty of Medicine, University of Ottawa, Ottawa, Ontario, Canada
- Ottawa Institute of Systems Biology, University of Ottawa, Ottawa, Ontario, Canada
| | - Ailing Zhang
- School of Pharmaceutical Sciences, Faculty of Medicine, University of Ottawa, Ottawa, Ontario, Canada
- Ottawa Institute of Systems Biology, University of Ottawa, Ottawa, Ontario, Canada
| | - Daniel Figeys
- School of Pharmaceutical Sciences, Faculty of Medicine, University of Ottawa, Ottawa, Ontario, Canada
- Ottawa Institute of Systems Biology, University of Ottawa, Ottawa, Ontario, Canada
| |
Collapse
|
16
|
Dablanc A, Hennechart S, Perez A, Cabanac G, Guitton Y, Paulhe N, Lyan B, Jamin EL, Giacomoni F, Marti G. FragHub: A Mass Spectral Library Data Integration Workflow. Anal Chem 2024; 96. [PMID: 39028894 PMCID: PMC11295123 DOI: 10.1021/acs.analchem.4c02219] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2024] [Revised: 06/16/2024] [Accepted: 06/24/2024] [Indexed: 07/21/2024]
Abstract
Open mass spectral libraries (OMSLs) are critical for metabolite annotation and machine learning, especially given the rising volume of untargeted metabolomic studies and the development of annotation pipelines. Despite their importance, the practical application of OMSLs is hampered by the lack of standardized file formats, metadata fields, and supporting ontology. Current libraries, often restricted to specific topics or matrices, such as natural products, lipids, or the human metabolome, may limit the discovery potential of untargeted studies. The goal of FragHub is to provide users with the capability to integrate various OMSLs into a single unified format, thereby enhancing the annotation accuracy and reliability. FragHub addresses these challenges by integrating multiple OMSLs into a single comprehensive database, supporting various data formats, and harmonizing metadata. It also proposes some generic filters for the mass spectrum using a graphical user interface. Additionally, a workflow to generate in-house libraries compatible with FragHub is proposed. FragHub dynamically segregates libraries based on ionization modes and chromatography techniques, thereby enhancing data utility in metabolomic research. The FragHub Python code is publicly available under a MIT license, at the following repository: https://github.com/eMetaboHUB/FragHub. Generated data can be accessed at 10.5281/zenodo.11057687.
Collapse
Affiliation(s)
- Axel Dablanc
- Laboratoire
de Recherche en Sciences Végétales, Metatoul-AgromiX
Platform, Université de Toulouse,
CNRS, INP, 24 Chemin de Borde Rouge, Auzeville, Auzeville-Tolosane 31320, France
- MetaboHUB-MetaToul,
National Infrastructure of Metabolomics and Fluxomics, Toulouse 31000, France
| | - Solweig Hennechart
- Laboratoire
de Recherche en Sciences Végétales, Metatoul-AgromiX
Platform, Université de Toulouse,
CNRS, INP, 24 Chemin de Borde Rouge, Auzeville, Auzeville-Tolosane 31320, France
- MetaboHUB-MetaToul,
National Infrastructure of Metabolomics and Fluxomics, Toulouse 31000, France
- Université
Toulouse 3—Paul Sabatier, IRIT UMR 5505 CNRS, Toulouse 31062, France
| | - Amélie Perez
- Laboratoire
de Recherche en Sciences Végétales, Metatoul-AgromiX
Platform, Université de Toulouse,
CNRS, INP, 24 Chemin de Borde Rouge, Auzeville, Auzeville-Tolosane 31320, France
- MetaboHUB-MetaToul,
National Infrastructure of Metabolomics and Fluxomics, Toulouse 31000, France
| | - Guillaume Cabanac
- Université
Toulouse 3—Paul Sabatier, IRIT UMR 5505 CNRS, Toulouse 31062, France
- Institut
Universitaire de France, Paris 75005, France
| | | | - Nils Paulhe
- Université
Clermont Auvergne, INRAE, UNH, Plateforme d’Exploration du
Métabolisme, MetaboHUB Clermont, Clermont-Ferrand F-63000, France
| | - Bernard Lyan
- Université
Clermont Auvergne, INRAE, UNH, Plateforme d’Exploration du
Métabolisme, MetaboHUB Clermont, Clermont-Ferrand F-63000, France
| | - Emilien L. Jamin
- MetaboHUB-MetaToul,
National Infrastructure of Metabolomics and Fluxomics, Toulouse 31000, France
- Toxalim
(Research Centre in Food Toxicology), Université
de Toulouse, INRAE, ENVT, INP-Purpan, UPS, Toulouse 31076, France
| | - Franck Giacomoni
- Université
Clermont Auvergne, INRAE, UNH, Plateforme d’Exploration du
Métabolisme, MetaboHUB Clermont, Clermont-Ferrand F-63000, France
| | - Guillaume Marti
- Laboratoire
de Recherche en Sciences Végétales, Metatoul-AgromiX
Platform, Université de Toulouse,
CNRS, INP, 24 Chemin de Borde Rouge, Auzeville, Auzeville-Tolosane 31320, France
- MetaboHUB-MetaToul,
National Infrastructure of Metabolomics and Fluxomics, Toulouse 31000, France
| |
Collapse
|
17
|
Shi J, Liu Y, Xu YJ. MS based foodomics: An edge tool integrated metabolomics and proteomics for food science. Food Chem 2024; 446:138852. [PMID: 38428078 DOI: 10.1016/j.foodchem.2024.138852] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2023] [Revised: 02/05/2024] [Accepted: 02/24/2024] [Indexed: 03/03/2024]
Abstract
Foodomics has become a popular methodology in food science studies. Mass spectrometry (MS) based metabolomics and proteomics analysis played indispensable roles in foodomics research. So far, several methodologies have been developed to detect the metabolites and proteins in diets and consumers, including sample preparation, MS data acquisition, annotation and interpretation. Moreover, multiomics analysis integrated metabolomics and proteomics have received considerable attentions in the field of food safety and nutrition, because of more comprehensive and deeply. In this context, we intended to review the emerging strategies and their applications in MS-based foodomics, as well as future challenges and trends. The principle and application of multiomics were also discussed, such as the optimization of data acquisition, development of analysis algorithm and exploration of systems biology.
Collapse
Affiliation(s)
- Jiachen Shi
- State Key Laboratory of Food Science and Technology, School of Food Science and Technology, National Engineering Research Center for Functional Food, National Engineering Laboratory for Cereal Fermentation Technology, Collaborative Innovation Center of Food Safety and Quality Control in Jiangsu Province, Jiangnan University, 1800 Lihu Road, Wuxi 214122, Jiangsu, People's Republic of China.
| | - Yuanfa Liu
- State Key Laboratory of Food Science and Technology, School of Food Science and Technology, National Engineering Research Center for Functional Food, National Engineering Laboratory for Cereal Fermentation Technology, Collaborative Innovation Center of Food Safety and Quality Control in Jiangsu Province, Jiangnan University, 1800 Lihu Road, Wuxi 214122, Jiangsu, People's Republic of China.
| | - Yong-Jiang Xu
- State Key Laboratory of Food Science and Technology, School of Food Science and Technology, National Engineering Research Center for Functional Food, National Engineering Laboratory for Cereal Fermentation Technology, Collaborative Innovation Center of Food Safety and Quality Control in Jiangsu Province, Jiangnan University, 1800 Lihu Road, Wuxi 214122, Jiangsu, People's Republic of China.
| |
Collapse
|
18
|
Sadia M, Boudguiyer Y, Helmus R, Seijo M, Praetorius A, Samanipour S. A stochastic approach for parameter optimization of feature detection algorithms for non-target screening in mass spectrometry. Anal Bioanal Chem 2024:10.1007/s00216-024-05425-3. [PMID: 38995405 DOI: 10.1007/s00216-024-05425-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2024] [Revised: 06/05/2024] [Accepted: 06/18/2024] [Indexed: 07/13/2024]
Abstract
Feature detection plays a crucial role in non-target screening (NTS), requiring careful selection of algorithm parameters to minimize false positive (FP) features. In this study, a stochastic approach was employed to optimize the parameter settings of feature detection algorithms used in processing high-resolution mass spectrometry data. This approach was demonstrated using four open-source algorithms (OpenMS, SAFD, XCMS, and KPIC2) within the patRoon software platform for processing extracts from drinking water samples spiked with 46 per- and polyfluoroalkyl substances (PFAS). The designed method is based on a stochastic strategy involving random sampling from variable space and the use of Pearson correlation to assess the impact of each parameter on the number of detected suspect analytes. Using our approach, the optimized parameters led to improvement in the algorithm performance by increasing suspect hits in case of SAFD and XCMS, and reducing the total number of detected features (i.e., minimizing FP) for OpenMS. These improvements were further validated on three different drinking water samples as test dataset. The optimized parameters resulted in a lower false discovery rate (FDR%) compared to the default parameters, effectively increasing the detection of true positive features. This work also highlights the necessity of algorithm parameter optimization prior to starting the NTS to reduce the complexity of such datasets.
Collapse
Affiliation(s)
- Mohammad Sadia
- Institute for Biodiversity and Ecosystem Dynamics, University of Amsterdam, Amsterdam, The Netherlands.
| | - Youssef Boudguiyer
- Institute for Biodiversity and Ecosystem Dynamics, University of Amsterdam, Amsterdam, The Netherlands
| | - Rick Helmus
- Institute for Biodiversity and Ecosystem Dynamics, University of Amsterdam, Amsterdam, The Netherlands
| | - Marianne Seijo
- Institute for Biodiversity and Ecosystem Dynamics, University of Amsterdam, Amsterdam, The Netherlands
| | - Antonia Praetorius
- Institute for Biodiversity and Ecosystem Dynamics, University of Amsterdam, Amsterdam, The Netherlands
| | - Saer Samanipour
- Van'T Hoff Institute for Molecular Sciences (HIMS), University of Amsterdam, Amsterdam, The Netherlands
| |
Collapse
|
19
|
Mohanty I, Allaband C, Mannochio-Russo H, El Abiead Y, Hagey LR, Knight R, Dorrestein PC. The changing metabolic landscape of bile acids - keys to metabolism and immune regulation. Nat Rev Gastroenterol Hepatol 2024; 21:493-516. [PMID: 38575682 DOI: 10.1038/s41575-024-00914-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 02/14/2024] [Indexed: 04/06/2024]
Abstract
Bile acids regulate nutrient absorption and mitochondrial function, they establish and maintain gut microbial community composition and mediate inflammation, and they serve as signalling molecules that regulate appetite and energy homeostasis. The observation that there are hundreds of bile acids, especially many amidated bile acids, necessitates a revision of many of the classical descriptions of bile acids and bile acid enzyme functions. For example, bile salt hydrolases also have transferase activity. There are now hundreds of known modifications to bile acids and thousands of bile acid-associated genes, especially when including the microbiome, distributed throughout the human body (for example, there are >2,400 bile salt hydrolases alone). The fact that so much of our genetic and small-molecule repertoire, in both amount and diversity, is dedicated to bile acid function highlights the centrality of bile acids as key regulators of metabolism and immune homeostasis, which is, in large part, communicated via the gut microbiome.
Collapse
Affiliation(s)
- Ipsita Mohanty
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA, USA
| | - Celeste Allaband
- Department of Pediatrics, University of California San Diego School of Medicine, La Jolla, CA, USA
| | - Helena Mannochio-Russo
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA, USA
| | - Yasin El Abiead
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA, USA
| | - Lee R Hagey
- Department of Medicine, University of California San Diego, San Diego, CA, USA
| | - Rob Knight
- Department of Pediatrics, University of California San Diego School of Medicine, La Jolla, CA, USA
- Center for Microbiome Innovation, University of California San Diego, La Jolla, CA, USA
- Department of Computer Science and Engineering, University of California San Diego, La Jolla, CA, USA
- Department of Bioengineering, University of California San Diego, La Jolla, CA, USA
| | - Pieter C Dorrestein
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA, USA.
- Center for Microbiome Innovation, University of California San Diego, La Jolla, CA, USA.
- Department of Pharmacology, University of California San Diego, La Jolla, CA, USA.
- Collaborative Mass Spectrometry Innovation Center, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA, USA.
| |
Collapse
|
20
|
Hu Y, Schnaubelt M, Chen L, Zhang B, Hoang T, Lih TM, Zhang Z, Zhang H. MS-PyCloud: A Cloud Computing-Based Pipeline for Proteomic and Glycoproteomic Data Analyses. Anal Chem 2024; 96:10145-10151. [PMID: 38869158 DOI: 10.1021/acs.analchem.3c01497] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/14/2024]
Abstract
Rapid development and wide adoption of mass spectrometry-based glycoproteomic technologies have empowered scientists to study proteins and protein glycosylation in complex samples on a large scale. This progress has also created unprecedented challenges for individual laboratories to store, manage, and analyze proteomic and glycoproteomic data, both in the cost for proprietary software and high-performance computing and in the long processing time that discourages on-the-fly changes of data processing settings required in explorative and discovery analysis. We developed an open-source, cloud computing-based pipeline, MS-PyCloud, with graphical user interface (GUI), for proteomic and glycoproteomic data analysis. The major components of this pipeline include data file integrity validation, MS/MS database search for spectral assignments to peptide sequences, false discovery rate estimation, protein inference, quantitation of global protein levels, and specific glycan-modified glycopeptides as well as other modification-specific peptides such as phosphorylation, acetylation, and ubiquitination. To ensure the transparency and reproducibility of data analysis, MS-PyCloud includes open-source software tools with comprehensive testing and versioning for spectrum assignments. Leveraging public cloud computing infrastructure via Amazon Web Services (AWS), MS-PyCloud scales seamlessly based on analysis demand to achieve fast and efficient performance. Application of the pipeline to the analysis of large-scale LC-MS/MS data sets demonstrated the effectiveness and high performance of MS-PyCloud. The software can be downloaded at https://github.com/huizhanglab-jhu/ms-pycloud.
Collapse
Affiliation(s)
- Yingwei Hu
- Department of Pathology, School of Medicine, Johns Hopkins University, Baltimore, Maryland 21231, United States
| | - Michael Schnaubelt
- Department of Pathology, School of Medicine, Johns Hopkins University, Baltimore, Maryland 21231, United States
| | - Li Chen
- Department of Pathology, School of Medicine, Johns Hopkins University, Baltimore, Maryland 21231, United States
| | - Bai Zhang
- Department of Pathology, School of Medicine, Johns Hopkins University, Baltimore, Maryland 21231, United States
| | - Trung Hoang
- Department of Pathology, School of Medicine, Johns Hopkins University, Baltimore, Maryland 21231, United States
| | - T Mamie Lih
- Department of Pathology, School of Medicine, Johns Hopkins University, Baltimore, Maryland 21231, United States
| | - Zhen Zhang
- Department of Pathology, School of Medicine, Johns Hopkins University, Baltimore, Maryland 21231, United States
| | - Hui Zhang
- Department of Pathology, School of Medicine, Johns Hopkins University, Baltimore, Maryland 21231, United States
| |
Collapse
|
21
|
Vlnieska V, Khanda A, Gilshtein E, Beltrán JL, Heier J, Kunka D. Polypy: A Framework to Interpret Polymer Properties from Mass Spectroscopy Data. Polymers (Basel) 2024; 16:1771. [PMID: 39000627 PMCID: PMC11244493 DOI: 10.3390/polym16131771] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2024] [Revised: 06/13/2024] [Accepted: 06/20/2024] [Indexed: 07/17/2024] Open
Abstract
Mass spectroscopy (MS) is a robust technique for polymer characterization, and it can provide the chemical fingerprint of a complete sample regarding polymer distribution chains. Nevertheless, polymer chemical properties such as polydispersity (Pd), average molecular mass (Mn), weight average molecular mass (Mw) and others are not determined by MS, as they are commonly characterized by gel permeation chromatography (GPC). In order to calculate polymer properties from MS, a Python script was developed to interpret polymer properties from spectroscopic raw data. Polypy script can be considered a peak detection and area distribution method, and represents the result of combining the MS raw data filtered using Root Mean Square (RMS) calculation with molecular classification based on theoretical molar masses. Polypy filters out areas corresponding to repetitive units. This approach facilitates the identification of the polymer chains and calculates their properties. The script also integrates visualization graphic tools for data analysis. In this work, aryl resin (poly(2,2-bis(4-oxy-(2-(methyloxirane)phenyl)propan) was the study case polymer molecule, and is composed of oligomer chains distributed mainly in the range of dimers to tetramers, in some cases presenting traces of pentamers and hexamers in the distribution profile of the oligomeric chains. Epoxy resin has Mn = 607 Da, Mw = 631 Da, and polydispersity (Pd) of 1.015 (data given by GPC). With Polypy script, calculations resulted in Mn = 584.42 Da, Mw = 649.29 Da, and Pd = 1.11, which are consistent results if compared with GPC characterization. Additional information, such as the percentage of oligomer distribution, was also calculated and for this polymer matrix it was not possible to retrieve it from the GPC method. Polypy is an approach to characterizing major polymer chemical properties using only MS raw spectra, and it can be utilized with any MS raw data for any polymer matrix.
Collapse
Affiliation(s)
- Vitor Vlnieska
- Laboratory for Functional Polymers, Swiss Federal Laboratories for Materials Science and Technology (EMPA), Überlandstrasse 129, 8600 Dübendorf, Switzerland
- Laboratory for Thin Films and Photovoltaics, Swiss Federal Laboratories for Materials Science and Technology (EMPA), Überlandstrasse 129, 8600 Dübendorf, Switzerland
| | - Ankita Khanda
- Integrated Quantum Optics, Institute for Photonic Quantum Systems (PhoQS), Paderborn University, Warburger Str. 100, 33098 Paderborn, Germany
| | - Evgeniia Gilshtein
- Department of Electrical and Photonics Engineering, Technical University of Denmark, Anker Engelunds Vej 101, 2800 Kongens Lyngby, Denmark
| | - Jorge Luis Beltrán
- Institute of Microstructure Technology, Karlsruhe Institute of Technology (KIT), Hermann-von-Helmholtz-Platz 1, 76344 Eggenstein-Leopoldshafen, Germany
| | - Jakob Heier
- Laboratory for Thin Films and Photovoltaics, Swiss Federal Laboratories for Materials Science and Technology (EMPA), Überlandstrasse 129, 8600 Dübendorf, Switzerland
| | - Danays Kunka
- Institute of Microstructure Technology, Karlsruhe Institute of Technology (KIT), Hermann-von-Helmholtz-Platz 1, 76344 Eggenstein-Leopoldshafen, Germany
| |
Collapse
|
22
|
Roberts DS, Loo JA, Tsybin YO, Liu X, Wu S, Chamot-Rooke J, Agar JN, Paša-Tolić L, Smith LM, Ge Y. Top-down proteomics. NATURE REVIEWS. METHODS PRIMERS 2024; 4:38. [PMID: 39006170 PMCID: PMC11242913 DOI: 10.1038/s43586-024-00318-2] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Accepted: 04/24/2024] [Indexed: 07/16/2024]
Abstract
Proteoforms, which arise from post-translational modifications, genetic polymorphisms and RNA splice variants, play a pivotal role as drivers in biology. Understanding proteoforms is essential to unravel the intricacies of biological systems and bridge the gap between genotypes and phenotypes. By analysing whole proteins without digestion, top-down proteomics (TDP) provides a holistic view of the proteome and can decipher protein function, uncover disease mechanisms and advance precision medicine. This Primer explores TDP, including the underlying principles, recent advances and an outlook on the future. The experimental section discusses instrumentation, sample preparation, intact protein separation, tandem mass spectrometry techniques and data collection. The results section looks at how to decipher raw data, visualize intact protein spectra and unravel data analysis. Additionally, proteoform identification, characterization and quantification are summarized, alongside approaches for statistical analysis. Various applications are described, including the human proteoform project and biomedical, biopharmaceutical and clinical sciences. These are complemented by discussions on measurement reproducibility, limitations and a forward-looking perspective that outlines areas where the field can advance, including potential future applications.
Collapse
Affiliation(s)
- David S Roberts
- Department of Chemistry, Stanford University, Stanford, CA, USA
- Sarafan ChEM-H, Stanford University, Stanford, CA, USA
| | - Joseph A Loo
- Department of Chemistry and Biochemistry, Department of Biological Chemistry, University of California - Los Angeles, Los Angeles, CA, USA
| | | | - Xiaowen Liu
- Deming Department of Medicine, School of Medicine, Tulane University, New Orleans, LA, USA
| | - Si Wu
- Department of Chemistry and Biochemistry, The University of Alabama, Tuscaloosa, AL, USA
| | | | - Jeffrey N Agar
- Departments of Chemistry and Chemical Biology and Pharmaceutical Sciences, Northeastern University, Boston, MA, USA
| | - Ljiljana Paša-Tolić
- Environmental and Molecular Sciences Division, Pacific Northwest National Laboratory, Richland, WA, USA
| | - Lloyd M Smith
- Department of Chemistry, University of Wisconsin, Madison, WI, USA
| | - Ying Ge
- Department of Chemistry, University of Wisconsin, Madison, WI, USA
- Department of Cell and Regenerative Biology, Human Proteomics Program, University of Wisconsin - Madison, Madison, WI, USA
| |
Collapse
|
23
|
BOLLON JORDY, SHORTREED MICHAELR, JORDAN BENT, MILLER RACHEL, JEFFERY ERIN, CAVALLI ANDREA, SMITH LLOYDM, DEWEY COLIN, SHEYNKMAN GLORIAM, TIBERI SIMONE. IsoBayes: a Bayesian approach for single-isoform proteomics inference. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.06.10.598223. [PMID: 38915658 PMCID: PMC11195044 DOI: 10.1101/2024.06.10.598223] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/26/2024]
Abstract
Studying protein isoforms is an essential step in biomedical research; at present, the main approach for analyzing proteins is via bottom-up mass spectrometry proteomics, which return peptide identifications, that are indirectly used to infer the presence of protein isoforms. However, the detection and quantification processes are noisy; in particular, peptides may be erroneously detected, and most peptides, known as shared peptides, are associated to multiple protein isoforms. As a consequence, studying individual protein isoforms is challenging, and inferred protein results are often abstracted to the gene-level or to groups of protein isoforms. Here, we introduce IsoBayes, a novel statistical method to perform inference at the isoform level. Our method enhances the information available, by integrating mass spectrometry proteomics and transcriptomics data in a Bayesian probabilistic framework. To account for the uncertainty in the measurement process, we propose a two-layer latent variable approach: first, we sample if a peptide has been correctly detected (or, alternatively filter peptides); second, we allocate the abundance of such selected peptides across the protein(s) they are compatible with. This enables us, starting from peptide-level data, to recover protein-level data; in particular, we: i) infer the presence/absence of each protein isoform (via a posterior probability), ii) estimate its abundance (and credible interval), and iii) target isoforms where transcript and protein relative abundances significantly differ. We benchmarked our approach in simulations, and in two multi-protease real datasets: our method displays good sensitivity and specificity when detecting protein isoforms, its estimated abundances highly correlate with the ground truth, and can detect changes between protein and transcript relative abundances. IsoBayes is freely distributed as a Bioconductor R package, and is accompanied by an example usage vignette.
Collapse
Affiliation(s)
- JORDY BOLLON
- Computational and Chemical Biology, Italian Institute of Technology, CMPVdA, Aosta, Italy
- Astronomical Observatory of the Autonomous Region of the Aosta Valley (OAVdA), Nus, Italy
| | | | - BEN T JORDAN
- Frederick National Laboratory for Cancer Research, Frederick, MD, USA
| | - RACHEL MILLER
- Department of Chemistry, University of Wisconsin-Madison, Madison, WI, USA
| | - ERIN JEFFERY
- Department of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, VA, USA
| | - ANDREA CAVALLI
- Computational and Chemical Biology, Italian Institute of Technology, CMPVdA, Aosta, Italy
- Centre Européen de Calcul Atomique et Moléculaire, École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland
| | - LLOYD M SMITH
- Department of Chemistry, University of Wisconsin-Madison, Madison, WI, USA
| | - COLIN DEWEY
- Department of Biostatistics and Medical Informatics, University of Wisconsin, Madison, WI, USA
| | - GLORIA M SHEYNKMAN
- Department of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, VA, USA
| | - SIMONE TIBERI
- Department of Statistical Sciences, University of Bologna, Bologna, Italy
| |
Collapse
|
24
|
Fendler NL, Ly J, Welp L, Urlaub H, Vos SM. Identification and characterization of a human MORC2 DNA binding region that is required for gene silencing. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.06.05.597643. [PMID: 38895295 PMCID: PMC11185635 DOI: 10.1101/2024.06.05.597643] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/21/2024]
Abstract
The eukaryotic microrchidia (MORC) protein family are DNA gyrase, Hsp90, histidine kinase, MutL (GHKL)-type ATPases involved in gene expression regulation and chromatin compaction. The molecular mechanisms underlying these activities are incompletely understood. Here we studied the full-length human MORC2 protein biochemically. We identified a DNA binding site in the C-terminus of the protein, and we observe that this region is heavily phosphorylated in cells. Phosphorylation of MORC2 reduces its affinity for DNA and appears to exclude the protein from the nucleus. We observe that DNA binding by MORC2 reduces its ATPase activity and that MORC2 can topologically entrap multiple DNA substrates between its N-terminal GHKL and C-terminal coiled coil 3 dimerization domains. Finally, we observe that the MORC2 C-terminal DNA binding region is required for gene silencing in cells. Together, our data provide a model to understand how MORC2 engages with DNA substrates to mediate gene silencing.
Collapse
Affiliation(s)
- Nikole L. Fendler
- Department of Biology, Massachusetts Institute of Technology, Building 68, 31 Ames St., Cambridge, MA 02139
| | - Jimmy Ly
- Department of Biology, Massachusetts Institute of Technology, Building 68, 31 Ames St., Cambridge, MA 02139
- Whitehead Institute for Biomedical Research, Cambridge, MA 02139
| | - Luisa Welp
- Bioanalytical Mass Spectrometry, Max Planck Institute for Multidisciplinary Sciences, Göttingen, Germany
- Bioanalytics Group, University Medical Center Göttingen, Department of Clinical Chemistry, Göttingen, Germany
| | - Henning Urlaub
- Bioanalytical Mass Spectrometry, Max Planck Institute for Multidisciplinary Sciences, Göttingen, Germany
- Bioanalytics Group, University Medical Center Göttingen, Department of Clinical Chemistry, Göttingen, Germany
| | - Seychelle M. Vos
- Department of Biology, Massachusetts Institute of Technology, Building 68, 31 Ames St., Cambridge, MA 02139
- Howard Hughes Medical Institute
| |
Collapse
|
25
|
Mitchell JM, Chi Y, Thapa M, Pang Z, Xia J, Li S. Common data models to streamline metabolomics processing and annotation, and implementation in a Python pipeline. PLoS Comput Biol 2024; 20:e1011912. [PMID: 38843301 PMCID: PMC11185459 DOI: 10.1371/journal.pcbi.1011912] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2024] [Revised: 06/18/2024] [Accepted: 05/20/2024] [Indexed: 06/18/2024] Open
Abstract
To standardize metabolomics data analysis and facilitate future computational developments, it is essential to have a set of well-defined templates for common data structures. Here we describe a collection of data structures involved in metabolomics data processing and illustrate how they are utilized in a full-featured Python-centric pipeline. We demonstrate the performance of the pipeline, and the details in annotation and quality control using large-scale LC-MS metabolomics and lipidomics data and LC-MS/MS data. Multiple previously published datasets are also reanalyzed to showcase its utility in biological data analysis. This pipeline allows users to streamline data processing, quality control, annotation, and standardization in an efficient and transparent manner. This work fills a major gap in the Python ecosystem for computational metabolomics.
Collapse
Affiliation(s)
- Joshua M. Mitchell
- The Jackson Laboratory for Genomic Medicine, Farmington, Connecticut, United States of America
| | - Yuanye Chi
- The Jackson Laboratory for Genomic Medicine, Farmington, Connecticut, United States of America
| | - Maheshwor Thapa
- The Jackson Laboratory for Genomic Medicine, Farmington, Connecticut, United States of America
| | - Zhiqiang Pang
- Institute of Parasitology, McGill University, Montreal, Quebec, Canada
| | - Jianguo Xia
- Institute of Parasitology, McGill University, Montreal, Quebec, Canada
| | - Shuzhao Li
- The Jackson Laboratory for Genomic Medicine, Farmington, Connecticut, United States of America
- University of Connecticut School of Medicine, Farmington, Connecticut, United States of America
| |
Collapse
|
26
|
Newman NK, Macovsky MS, Rodrigues RR, Bruce AM, Pederson JW, Padiadpu J, Shan J, Williams J, Patil SS, Dzutsev AK, Shulzhenko N, Trinchieri G, Brown K, Morgun A. Transkingdom Network Analysis (TkNA): a systems framework for inferring causal factors underlying host-microbiota and other multi-omic interactions. Nat Protoc 2024; 19:1750-1778. [PMID: 38472495 DOI: 10.1038/s41596-024-00960-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2023] [Accepted: 11/29/2023] [Indexed: 03/14/2024]
Abstract
We present Transkingdom Network Analysis (TkNA), a unique causal-inference analytical framework that offers a holistic view of biological systems by integrating data from multiple cohorts and diverse omics types. TkNA helps to decipher key players and mechanisms governing host-microbiota (or any multi-omic data) interactions in specific conditions or diseases. TkNA reconstructs a network that represents a statistical model capturing the complex relationships between different omics in the biological system. It identifies robust and reproducible patterns of fold change direction and correlation sign across several cohorts to select differential features and their per-group correlations. The framework then uses causality-sensitive metrics, statistical thresholds and topological criteria to determine the final edges forming the transkingdom network. With the subsequent network's topological features, TkNA identifies nodes controlling a given subnetwork or governing communication between kingdoms and/or subnetworks. The computational time for the millions of correlations necessary for network reconstruction in TkNA typically takes only a few minutes, varying with the study design. Unlike most other multi-omics approaches that find only associations, TkNA focuses on establishing causality while accounting for the complex structure of multi-omic data. It achieves this without requiring huge sample sizes. Moreover, the TkNA protocol is user friendly, requiring minimal installation and basic familiarity with Unix. Researchers can access the TkNA software at https://github.com/CAnBioNet/TkNA/ .
Collapse
Affiliation(s)
- Nolan K Newman
- College of Pharmacy, Oregon State University, Corvallis, OR, USA
| | | | - Richard R Rodrigues
- Basic Science Program, Frederick National Laboratory for Cancer Research, Frederick, MD, USA
- Microbiome and Genetics Core, Laboratory of Integrative Cancer Immunology, Center for Cancer Research, National Cancer Institute, Bethesda, MD, USA
| | - Amanda M Bruce
- College of Pharmacy, Oregon State University, Corvallis, OR, USA
| | - Jacob W Pederson
- Carlson College of Veterinary Medicine, Oregon State University, Corvallis, OR, USA
| | - Jyothi Padiadpu
- College of Pharmacy, Oregon State University, Corvallis, OR, USA
| | - Jigui Shan
- Advanced Biomedical Computational Science, Frederick National Laboratory for Cancer Research, Frederick, MD, USA
| | - Joshua Williams
- Advanced Biomedical Computational Science, Frederick National Laboratory for Cancer Research, Frederick, MD, USA
| | - Sankalp S Patil
- College of Pharmacy, Oregon State University, Corvallis, OR, USA
| | - Amiran K Dzutsev
- Cancer Immunobiology Section, Laboratory of Integrative Cancer Immunology, Center for Cancer Research, National Cancer Institute, Bethesda, MD, USA
| | - Natalia Shulzhenko
- Carlson College of Veterinary Medicine, Oregon State University, Corvallis, OR, USA
| | - Giorgio Trinchieri
- Cancer Immunobiology Section, Laboratory of Integrative Cancer Immunology, Center for Cancer Research, National Cancer Institute, Bethesda, MD, USA.
| | - Kevin Brown
- College of Pharmacy, Oregon State University, Corvallis, OR, USA.
| | - Andrey Morgun
- College of Pharmacy, Oregon State University, Corvallis, OR, USA.
| |
Collapse
|
27
|
Yan S, Santoro A, Niphakis MJ, Pinto AM, Jacobs CL, Ahmad R, Suciu RM, Fonslow BR, Herbst-Graham RA, Ngo N, Henry CL, Herbst DM, Saghatelian A, Kahn BB, Rosen ED. Inflammation causes insulin resistance in mice via interferon regulatory factor 3 (IRF3)-mediated reduction in FAHFA levels. Nat Commun 2024; 15:4605. [PMID: 38816388 PMCID: PMC11139994 DOI: 10.1038/s41467-024-48220-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2022] [Accepted: 04/24/2024] [Indexed: 06/01/2024] Open
Abstract
Obesity-induced inflammation causes metabolic dysfunction, but the mechanisms remain elusive. Here we show that the innate immune transcription factor interferon regulatory factor (IRF3) adversely affects glucose homeostasis through induction of the endogenous FAHFA hydrolase androgen induced gene 1 (AIG1) in adipocytes. Adipocyte-specific knockout of IRF3 protects male mice against high-fat diet-induced insulin resistance, whereas overexpression of IRF3 or AIG1 in adipocytes promotes insulin resistance on a high-fat diet. Furthermore, pharmacological inhibition of AIG1 reversed obesity-induced insulin resistance and restored glucose homeostasis in the setting of adipocyte IRF3 overexpression. We, therefore, identify the adipocyte IRF3/AIG1 axis as a crucial link between obesity-induced inflammation and insulin resistance and suggest an approach for limiting the metabolic dysfunction accompanying obesity.
Collapse
Affiliation(s)
- Shuai Yan
- Division of Endocrinology, Diabetes, and Metabolism, Beth Israel Deaconess Medical Center, 330 Brookline Ave, Boston, MA, 02215, USA
- Harvard Medical School, 25 Shattuck St, Boston, MA, 02130, USA
| | - Anna Santoro
- Division of Endocrinology, Diabetes, and Metabolism, Beth Israel Deaconess Medical Center, 330 Brookline Ave, Boston, MA, 02215, USA
- Harvard Medical School, 25 Shattuck St, Boston, MA, 02130, USA
| | - Micah J Niphakis
- Lundbeck La Jolla Research Center Inc., 10835 Road To The Cure Dr. #250, San Diego, CA, 92121, USA
| | - Antonio M Pinto
- The Salk Institute for Biological Studies, 10010 N. Torey Pines Rd, La Jolla, CA, 92037-1002, USA
| | - Christopher L Jacobs
- Division of Endocrinology, Diabetes, and Metabolism, Beth Israel Deaconess Medical Center, 330 Brookline Ave, Boston, MA, 02215, USA
- Harvard Medical School, 25 Shattuck St, Boston, MA, 02130, USA
| | - Rasheed Ahmad
- Immunology and Microbiology Department, Dasman Diabetes Institute, Jasim Mohamad Al Bahar St., Kuwait City, Kuwait
| | - Radu M Suciu
- Lundbeck La Jolla Research Center Inc., 10835 Road To The Cure Dr. #250, San Diego, CA, 92121, USA
| | - Bryan R Fonslow
- Lundbeck La Jolla Research Center Inc., 10835 Road To The Cure Dr. #250, San Diego, CA, 92121, USA
| | - Rachel A Herbst-Graham
- Lundbeck La Jolla Research Center Inc., 10835 Road To The Cure Dr. #250, San Diego, CA, 92121, USA
| | - Nhi Ngo
- Lundbeck La Jolla Research Center Inc., 10835 Road To The Cure Dr. #250, San Diego, CA, 92121, USA
| | - Cassandra L Henry
- Lundbeck La Jolla Research Center Inc., 10835 Road To The Cure Dr. #250, San Diego, CA, 92121, USA
| | - Dylan M Herbst
- Lundbeck La Jolla Research Center Inc., 10835 Road To The Cure Dr. #250, San Diego, CA, 92121, USA
| | - Alan Saghatelian
- The Salk Institute for Biological Studies, 10010 N. Torey Pines Rd, La Jolla, CA, 92037-1002, USA
| | - Barbara B Kahn
- Division of Endocrinology, Diabetes, and Metabolism, Beth Israel Deaconess Medical Center, 330 Brookline Ave, Boston, MA, 02215, USA
- Harvard Medical School, 25 Shattuck St, Boston, MA, 02130, USA
- Broad Institute of Harvard and MIT, 320 Charles St., Cambridge, MA, 02141, USA
| | - Evan D Rosen
- Division of Endocrinology, Diabetes, and Metabolism, Beth Israel Deaconess Medical Center, 330 Brookline Ave, Boston, MA, 02215, USA.
- Harvard Medical School, 25 Shattuck St, Boston, MA, 02130, USA.
- Broad Institute of Harvard and MIT, 320 Charles St., Cambridge, MA, 02141, USA.
| |
Collapse
|
28
|
Rosenberger G, Li W, Turunen M, He J, Subramaniam PS, Pampou S, Griffin AT, Karan C, Kerwin P, Murray D, Honig B, Liu Y, Califano A. Network-based elucidation of colon cancer drug resistance mechanisms by phosphoproteomic time-series analysis. Nat Commun 2024; 15:3909. [PMID: 38724493 PMCID: PMC11082183 DOI: 10.1038/s41467-024-47957-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2023] [Accepted: 04/16/2024] [Indexed: 05/12/2024] Open
Abstract
Aberrant signaling pathway activity is a hallmark of tumorigenesis and progression, which has guided targeted inhibitor design for over 30 years. Yet, adaptive resistance mechanisms, induced by rapid, context-specific signaling network rewiring, continue to challenge therapeutic efficacy. Leveraging progress in proteomic technologies and network-based methodologies, we introduce Virtual Enrichment-based Signaling Protein-activity Analysis (VESPA)-an algorithm designed to elucidate mechanisms of cell response and adaptation to drug perturbations-and use it to analyze 7-point phosphoproteomic time series from colorectal cancer cells treated with clinically-relevant inhibitors and control media. Interrogating tumor-specific enzyme/substrate interactions accurately infers kinase and phosphatase activity, based on their substrate phosphorylation state, effectively accounting for signal crosstalk and sparse phosphoproteome coverage. The analysis elucidates time-dependent signaling pathway response to each drug perturbation and, more importantly, cell adaptive response and rewiring, experimentally confirmed by CRISPR knock-out assays, suggesting broad applicability to cancer and other diseases.
Collapse
Affiliation(s)
- George Rosenberger
- Department of Systems Biology, Columbia University Irving Medical Center, New York, NY, USA
| | - Wenxue Li
- Yale Cancer Biology Institute, Yale University, West Haven, CT, USA
| | - Mikko Turunen
- Department of Systems Biology, Columbia University Irving Medical Center, New York, NY, USA
| | - Jing He
- Department of Systems Biology, Columbia University Irving Medical Center, New York, NY, USA
- Regeneron Genetics Center, Tarrytown, NY, USA
| | - Prem S Subramaniam
- Department of Systems Biology, Columbia University Irving Medical Center, New York, NY, USA
| | - Sergey Pampou
- Department of Systems Biology, Columbia University Irving Medical Center, New York, NY, USA
- J.P. Sulzberger Columbia Genome Center, Columbia University Irving Medical Center, New York, NY, USA
| | - Aaron T Griffin
- Department of Systems Biology, Columbia University Irving Medical Center, New York, NY, USA
- Medical Scientist Training Program, Columbia University Irving Medical Center, New York, NY, USA
| | - Charles Karan
- Department of Systems Biology, Columbia University Irving Medical Center, New York, NY, USA
- J.P. Sulzberger Columbia Genome Center, Columbia University Irving Medical Center, New York, NY, USA
| | - Patrick Kerwin
- Department of Systems Biology, Columbia University Irving Medical Center, New York, NY, USA
| | - Diana Murray
- Department of Systems Biology, Columbia University Irving Medical Center, New York, NY, USA
| | - Barry Honig
- Department of Systems Biology, Columbia University Irving Medical Center, New York, NY, USA
- Department of Medicine, Columbia University Irving Medical Center, New York, NY, USA
- Department of Biochemistry & Molecular Biophysics, Columbia University Irving Medical Center, New York, NY, USA
- Zuckerman Mind Brain and Behavior Institute, Columbia University, New York, NY, USA
- Herbert Irving Comprehensive Cancer Center, Columbia University Irving Medical Center, New York, NY, USA
| | - Yansheng Liu
- Yale Cancer Biology Institute, Yale University, West Haven, CT, USA.
- Department of Pharmacology, Yale University School of Medicine, New Haven, CT, USA.
| | - Andrea Califano
- Department of Systems Biology, Columbia University Irving Medical Center, New York, NY, USA.
- Department of Medicine, Columbia University Irving Medical Center, New York, NY, USA.
- Department of Biochemistry & Molecular Biophysics, Columbia University Irving Medical Center, New York, NY, USA.
- Herbert Irving Comprehensive Cancer Center, Columbia University Irving Medical Center, New York, NY, USA.
- Department of Biomedical Informatics, Columbia University Irving Medical Center, New York, NY, USA.
- Chan Zuckerberg Biohub New York, New York, NY, USA.
| |
Collapse
|
29
|
Ewald JD, Zhou G, Lu Y, Kolic J, Ellis C, Johnson JD, Macdonald PE, Xia J. Web-based multi-omics integration using the Analyst software suite. Nat Protoc 2024; 19:1467-1497. [PMID: 38355833 DOI: 10.1038/s41596-023-00950-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2023] [Accepted: 11/21/2023] [Indexed: 02/16/2024]
Abstract
The growing number of multi-omics studies demands clear conceptual workflows coupled with easy-to-use software tools to facilitate data analysis and interpretation. This protocol covers three key components involved in multi-omics analysis, including single-omics data analysis, knowledge-driven integration using biological networks and data-driven integration through joint dimensionality reduction. Using the dataset from a recent multi-omics study of human pancreatic islet tissue and plasma samples, the first section introduces how to perform transcriptomics/proteomics data analysis using ExpressAnalyst and lipidomics data analysis using MetaboAnalyst. On the basis of significant features detected in these workflows, the second section demonstrates how to perform knowledge-driven integration using OmicsNet. The last section illustrates how to perform data-driven integration from the normalized omics data and metadata using OmicsAnalyst. The complete protocol can be executed in ~2 h. Compared with other available options for multi-omics integration, the Analyst software suite described in this protocol enables researchers to perform a wide range of omics data analysis tasks via a user-friendly web interface.
Collapse
Affiliation(s)
- Jessica D Ewald
- Institute of Parasitology, McGill University, Montreal, Quebec, Canada
| | - Guangyan Zhou
- Institute of Parasitology, McGill University, Montreal, Quebec, Canada
| | - Yao Lu
- Department of Microbiology and Immunology, McGill University, Montreal, Quebec, Canada
| | - Jelena Kolic
- Life Sciences Institute, Department of Cellular and Physiological Sciences, University of British Columbia, Vancouver, British Columbia, Canada
| | - Cara Ellis
- Department of Pharmacology, University of Alberta, Edmonton, Alberta, Canada
| | - James D Johnson
- Life Sciences Institute, Department of Cellular and Physiological Sciences, University of British Columbia, Vancouver, British Columbia, Canada
| | - Patrick E Macdonald
- Department of Pharmacology, University of Alberta, Edmonton, Alberta, Canada
| | - Jianguo Xia
- Institute of Parasitology, McGill University, Montreal, Quebec, Canada.
- Department of Microbiology and Immunology, McGill University, Montreal, Quebec, Canada.
| |
Collapse
|
30
|
Perez de Souza L, Fernie AR. Computational methods for processing and interpreting mass spectrometry-based metabolomics. Essays Biochem 2024; 68:5-13. [PMID: 37999335 PMCID: PMC11065554 DOI: 10.1042/ebc20230019] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2023] [Revised: 11/10/2023] [Accepted: 11/15/2023] [Indexed: 11/25/2023]
Abstract
Metabolomics has emerged as an indispensable tool for exploring complex biological questions, providing the ability to investigate a substantial portion of the metabolome. However, the vast complexity and structural diversity intrinsic to metabolites imposes a great challenge for data analysis and interpretation. Liquid chromatography mass spectrometry (LC-MS) stands out as a versatile technique offering extensive metabolite coverage. In this mini-review, we address some of the hurdles posed by the complex nature of LC-MS data, providing a brief overview of computational tools designed to help tackling these challenges. Our focus centers on two major steps that are essential to most metabolomics investigations: the translation of raw data into quantifiable features, and the extraction of structural insights from mass spectra to facilitate metabolite identification. By exploring current computational solutions, we aim at providing a critical overview of the capabilities and constraints of mass spectrometry-based metabolomics, while introduce some of the most recent trends in data processing and analysis within the field.
Collapse
Affiliation(s)
- Leonardo Perez de Souza
- Max Planck Institute of Molecular Plant Physiology, Am Mühlenberg 1, 14476 Potsdam-Golm, Germany
| | - Alisdair R Fernie
- Max Planck Institute of Molecular Plant Physiology, Am Mühlenberg 1, 14476 Potsdam-Golm, Germany
- Center for Plant Systems Biology and Biotechnology, 4000 Plovdiv, Bulgaria
| |
Collapse
|
31
|
Williams A. Multiomics data integration, limitations, and prospects to reveal the metabolic activity of the coral holobiont. FEMS Microbiol Ecol 2024; 100:fiae058. [PMID: 38653719 PMCID: PMC11067971 DOI: 10.1093/femsec/fiae058] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2023] [Revised: 03/25/2024] [Accepted: 04/22/2024] [Indexed: 04/25/2024] Open
Abstract
Since their radiation in the Middle Triassic period ∼240 million years ago, stony corals have survived past climate fluctuations and five mass extinctions. Their long-term survival underscores the inherent resilience of corals, particularly when considering the nutrient-poor marine environments in which they have thrived. However, coral bleaching has emerged as a global threat to coral survival, requiring rapid advancements in coral research to understand holobiont stress responses and allow for interventions before extensive bleaching occurs. This review encompasses the potential, as well as the limits, of multiomics data applications when applied to the coral holobiont. Synopses for how different omics tools have been applied to date and their current restrictions are discussed, in addition to ways these restrictions may be overcome, such as recruiting new technology to studies, utilizing novel bioinformatics approaches, and generally integrating omics data. Lastly, this review presents considerations for the design of holobiont multiomics studies to support lab-to-field advancements of coral stress marker monitoring systems. Although much of the bleaching mechanism has eluded investigation to date, multiomic studies have already produced key findings regarding the holobiont's stress response, and have the potential to advance the field further.
Collapse
Affiliation(s)
- Amanda Williams
- Microbial Biology Graduate Program, Rutgers University, 76 Lipman Drive, New Brunswick, NJ 08901, United States
- Department of Biochemistry and Microbiology, Rutgers University, 76 Lipman Drive, New Brunswick, NJ 08901, United States
| |
Collapse
|
32
|
Bianco V, Svecla M, Vingiani GB, Kolb D, Schwarz B, Buerger M, Beretta G, Norata GD, Kratky D. Regional Differences in the Small Intestinal Proteome of Control Mice and of Mice Lacking Lysosomal Acid Lipase. J Proteome Res 2024; 23:1506-1518. [PMID: 38422518 PMCID: PMC7615810 DOI: 10.1021/acs.jproteome.4c00082] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2024] [Accepted: 02/16/2024] [Indexed: 03/02/2024]
Abstract
The metabolic contribution of the small intestine (SI) is still unclear despite recent studies investigating the involvement of single cells in regional differences. Using untargeted proteomics, we identified regional characteristics of the three intestinal tracts of C57BL/6J mice and found that proteins abundant in the mouse ileum correlated with the high ileal expression of the corresponding genes in humans. In the SI of C57BL/6J mice, we also detected an increasing abundance of lysosomal acid lipase (LAL), which is responsible for degrading triacylglycerols and cholesteryl esters within the lysosome. LAL deficiency in patients and mice leads to lipid accumulation, gastrointestinal disturbances, and malabsorption. We previously demonstrated that macrophages massively infiltrated the SI of Lal-deficient (KO) mice, especially in the duodenum. Using untargeted proteomics (ProteomeXchange repository, data identifier PXD048378), we revealed a general inflammatory response and a common lipid-associated macrophage phenotype in all three intestinal segments of Lal KO mice, accompanied by a higher expression of GPNMB and concentrations of circulating sTREM2. However, only duodenal macrophages activated a metabolic switch from lipids to other pathways, which were downregulated in the jejunum and ileum of Lal KO mice. Our results provide new insights into the process of absorption in control mice and possible novel markers of LAL-D and/or systemic inflammation in LAL-D.
Collapse
Affiliation(s)
- Valentina Bianco
- Gottfried
Schatz Research Center, Molecular Biology and Biochemistry, Medical University of Graz, Neue Stiftingtalstrasse 6/4, 8010 Graz, Austria
| | - Monika Svecla
- Department
of Pharmacological and Biomolecular Sciences, Università degli Studi di Milano, Via Balzaretti 9, 20133 Milan, Italy
| | - Giovanni Battista Vingiani
- Department
of Pharmacological and Biomolecular Sciences, Università degli Studi di Milano, Via Balzaretti 9, 20133 Milan, Italy
| | - Dagmar Kolb
- Core
Facility Ultrastructural Analysis, Medical
University of Graz, 8010 Graz, Austria
- BioTechMed-Graz, 8010 Graz, Austria
- Gottfried
Schatz Research Center, Cell Biology, Histology and Embryology, Medical University of Graz, 8010 Graz, Austria
| | - Birgit Schwarz
- Gottfried
Schatz Research Center, Molecular Biology and Biochemistry, Medical University of Graz, Neue Stiftingtalstrasse 6/4, 8010 Graz, Austria
| | - Martin Buerger
- Gottfried
Schatz Research Center, Molecular Biology and Biochemistry, Medical University of Graz, Neue Stiftingtalstrasse 6/4, 8010 Graz, Austria
| | - Giangiacomo Beretta
- Department
of Environmental Science and Policy, Università
degli Studi di Milano, 20133 Milan, Italy
| | - Giuseppe Danilo Norata
- Department
of Pharmacological and Biomolecular Sciences, Università degli Studi di Milano, Via Balzaretti 9, 20133 Milan, Italy
- Centro
SISA per lo studio dell’Aterosclerosi, Ospedale Bassini, 20092 Cinisello Balsamo, Italy
| | - Dagmar Kratky
- Gottfried
Schatz Research Center, Molecular Biology and Biochemistry, Medical University of Graz, Neue Stiftingtalstrasse 6/4, 8010 Graz, Austria
- BioTechMed-Graz, 8010 Graz, Austria
| |
Collapse
|
33
|
Lin A, See D, Fondrie WE, Keich U, Noble WS. Target-decoy false discovery rate estimation using Crema. Proteomics 2024; 24:e2300084. [PMID: 38380501 DOI: 10.1002/pmic.202300084] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2023] [Revised: 01/06/2024] [Accepted: 01/16/2024] [Indexed: 02/22/2024]
Abstract
Assigning statistical confidence estimates to discoveries produced by a tandem mass spectrometry proteomics experiment is critical to enabling principled interpretation of the results and assessing the cost/benefit ratio of experimental follow-up. The most common technique for computing such estimates is to use target-decoy competition (TDC), in which observed spectra are searched against a database of real (target) peptides and a database of shuffled or reversed (decoy) peptides. TDC procedures for estimating the false discovery rate (FDR) at a given score threshold have been developed for application at the level of spectra, peptides, or proteins. Although these techniques are relatively straightforward to implement, it is common in the literature to skip over the implementation details or even to make mistakes in how the TDC procedures are applied in practice. Here we present Crema, an open-source Python tool that implements several TDC methods of spectrum-, peptide- and protein-level FDR estimation. Crema is compatible with a variety of existing database search tools and provides a straightforward way to obtain robust FDR estimates.
Collapse
Affiliation(s)
- Andy Lin
- Chemical and Biological Signatures, Pacific Northwest National Laboratory, Seattle, Washington, USA
| | - Donavan See
- Paul G. Allen School of Computer Science and Engineering, University of Washington, Seattle, Washington, USA
| | | | - Uri Keich
- School of Mathematics and Statistics, University of Sydney, Sydney, Australia
| | - William Stafford Noble
- Paul G. Allen School of Computer Science and Engineering, University of Washington, Seattle, Washington, USA
- Department of Genome Sciences, University of Washington, Seattle, Washington, USA
| |
Collapse
|
34
|
Zhang K, Li H, Shi J, Liu W, Wang Y, Tu P, Li J, Song Y. Strategy strengthens structural identification through hyphenating full collision energy ramp-MS 2 and full exciting energy ramp-MS 3 spectra: An application for metabolites identification of rosmarinic acid. Anal Chim Acta 2024; 1296:342346. [PMID: 38401935 DOI: 10.1016/j.aca.2024.342346] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2023] [Revised: 01/11/2024] [Accepted: 02/04/2024] [Indexed: 02/26/2024]
Abstract
"MS/MS spectrum to structure" analysis is the most challenging task for MS/MS-relied qualitative characterization. The conventional database- and computation-assisted strategies cannot reach confirmative identification, notably for isomers. Hence, an advanced strategy was proposed here through tackling the two determinant obstacles such as the transformation from elemental compositions to fragment ion structures and the linkage style amongst substructures. As typical conjugated structures, esters were measured for strategy illustration, and metabolite identification of a famous natural antioxidant namely rosmarinic acid (RosA) in rat was undertaken for applicability justification. Through programming online energy-resolved (ER)-MS for the first collision cell of Qtrap-MS device, full collision energy ramp (FCER)-MS2 spectrum was configured for [M-H]- ion of each ester to provide optimal collision energies (OCEs) for all concerned diagnostic fragment ions (DFIs), i.e. a-, b-, c-, y-, and z-type ions. The linear correlations between masses and OCEs were built for each ion type to facilitate DFIs recognition from chaotic MS2 spectrum. To identify 1st-generation fragment ions, full exciting energy ramp (FEER)-MS3 spectra were configured for key DFIs via programming the second ER-MS in the latter collision chamber. FEER-MS3 spectrum of 1st-generation fragment ion for ester was demonstrated to be identical with FEER-MS2 spectrum of certain hydrolysis product when sharing the same structure. After applying the advanced strategy to recognize DFIs and identify 1st-generation fragment ions, a total of forty metabolites (M1-M40), resulted from hydrolysis, methylation, sulfation, and glucuronidation, were unambiguously identified for RosA after oral administration. Together, the advanced bottom-up strategy hyphenating FCER-MS2 and FEER-MS3 spectra, is meaningful to strengthen "MS/MS spectrum to structure" analysis through recognizing and identifying fragment ions.
Collapse
Affiliation(s)
- Ke Zhang
- Modern Research Center for Traditional Chinese Medicine, Beijing Research Institute of Chinese Medicine, Beijing University of Chinese Medicine, East Road of North 3rd Ring, Chaoyang District, Beijing, 100029, China
| | - Han Li
- Modern Research Center for Traditional Chinese Medicine, Beijing Research Institute of Chinese Medicine, Beijing University of Chinese Medicine, East Road of North 3rd Ring, Chaoyang District, Beijing, 100029, China
| | - Jingjing Shi
- Modern Research Center for Traditional Chinese Medicine, Beijing Research Institute of Chinese Medicine, Beijing University of Chinese Medicine, East Road of North 3rd Ring, Chaoyang District, Beijing, 100029, China
| | - Wenjing Liu
- School of Pharmacy, Henan University of Chinese Medicine, Zhengzhou, 450046, China
| | - Yitao Wang
- State Key Laboratory of Quality Research in Chinese Medicine, Institute of Chinese Medical Sciences, University of Macau, Taipa, 999078, Macao
| | - Pengfei Tu
- Modern Research Center for Traditional Chinese Medicine, Beijing Research Institute of Chinese Medicine, Beijing University of Chinese Medicine, East Road of North 3rd Ring, Chaoyang District, Beijing, 100029, China
| | - Jun Li
- Modern Research Center for Traditional Chinese Medicine, Beijing Research Institute of Chinese Medicine, Beijing University of Chinese Medicine, East Road of North 3rd Ring, Chaoyang District, Beijing, 100029, China
| | - Yuelin Song
- Modern Research Center for Traditional Chinese Medicine, Beijing Research Institute of Chinese Medicine, Beijing University of Chinese Medicine, East Road of North 3rd Ring, Chaoyang District, Beijing, 100029, China.
| |
Collapse
|
35
|
Oostrom MT, Colby SM, Metz TO. DEIMoS GUI: An Open-Source User Interface for a High-Dimensional Mass Spectrometry Data Processing Tool. J Chem Inf Model 2024; 64:1419-1424. [PMID: 38412257 DOI: 10.1021/acs.jcim.3c01222] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/29/2024]
Abstract
We report here the creation of a graphical user interface (GUI) for the Data Extraction for Integrated Multidimensional Spectrometry (DEIMoS) tool. DEIMoS is a Python package that processes data from high-dimensional mass spectrometry measurements. It is divided into several modules, each representing a data processing step such as peak detection, alignment, and tandem mass spectra extraction and deconvolution. The inputs for and outputs from DEIMoS can include millions of N-dimensional data points, which can be challenging to visualize in a way that is interactive, informative, and responsive. Here, we used the HoloViz Python data visualization stack, including DataShader and Param, to create an interactive visualization of the mass spectrometry data. We believe the GUI will increase the accessibility of DEIMoS and that the visualization methods could be useful for other open-source mass spectrometry tools.
Collapse
Affiliation(s)
- Marjolein T Oostrom
- National Security Directorate, Pacific Northwest National Laboratory, Richland, Washington 99352, United States
| | - Sean M Colby
- Earth and Biological Sciences Directorate, Pacific Northwest National Laboratory, Richland, Washington 99352, United States
| | - Thomas O Metz
- Earth and Biological Sciences Directorate, Pacific Northwest National Laboratory, Richland, Washington 99352, United States
| |
Collapse
|
36
|
Pfeuffer J, Bielow C, Wein S, Jeong K, Netz E, Walter A, Alka O, Nilse L, Colaianni PD, McCloskey D, Kim J, Rosenberger G, Bichmann L, Walzer M, Veit J, Boudaud B, Bernt M, Patikas N, Pilz M, Startek MP, Kutuzova S, Heumos L, Charkow J, Sing JC, Feroz A, Siraj A, Weisser H, Dijkstra TMH, Perez-Riverol Y, Röst H, Kohlbacher O, Sachsenberg T. OpenMS 3 enables reproducible analysis of large-scale mass spectrometry data. Nat Methods 2024; 21:365-367. [PMID: 38366242 DOI: 10.1038/s41592-024-02197-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/18/2024]
Affiliation(s)
- Julianus Pfeuffer
- Algorithmic Bioinformatics, Freie Universität Berlin, Berlin, Germany
- Visual and Data-Centric Computing, Zuse Institute Berlin, Berlin, Germany
| | - Chris Bielow
- Bioinformatics Solution Center, Institut für Mathematik und Informatik, Freie Universität Berlin, Berlin, Germany
| | - Samuel Wein
- Applied Bioinformatics, Department of Computer Science, University of Tuebingen, Tübingen, Germany
- Institute for Bioinformatics and Medical Informatics, University of Tuebingen, Tübingen, Germany
| | - Kyowon Jeong
- Applied Bioinformatics, Department of Computer Science, University of Tuebingen, Tübingen, Germany
- Institute for Bioinformatics and Medical Informatics, University of Tuebingen, Tübingen, Germany
| | - Eugen Netz
- Applied Bioinformatics, Department of Computer Science, University of Tuebingen, Tübingen, Germany
- Institute for Bioinformatics and Medical Informatics, University of Tuebingen, Tübingen, Germany
| | - Axel Walter
- Applied Bioinformatics, Department of Computer Science, University of Tuebingen, Tübingen, Germany
- Institute for Bioinformatics and Medical Informatics, University of Tuebingen, Tübingen, Germany
| | - Oliver Alka
- Applied Bioinformatics, Department of Computer Science, University of Tuebingen, Tübingen, Germany
- Institute for Bioinformatics and Medical Informatics, University of Tuebingen, Tübingen, Germany
| | - Lars Nilse
- Institute of Molecular Medicine and Cell Research, University of Freiburg, Freiburg, Germany
| | | | - Douglas McCloskey
- Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Lyngby, Denmark
- BioMed X Institute, Heidelberg, Germany
| | - Jihyung Kim
- Applied Bioinformatics, Department of Computer Science, University of Tuebingen, Tübingen, Germany
- Institute for Bioinformatics and Medical Informatics, University of Tuebingen, Tübingen, Germany
| | | | - Leon Bichmann
- Yale Center for Systems and Engineering Immunology and Department of Immunobiology, Yale University School of Medicine, New Haven, CT, USA
| | - Mathias Walzer
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), , Wellcome Trust Genome Campus, Hinxton, UK
| | - Johannes Veit
- Applied Bioinformatics, Department of Computer Science, University of Tuebingen, Tübingen, Germany
- Institute for Bioinformatics and Medical Informatics, University of Tuebingen, Tübingen, Germany
| | - Bertrand Boudaud
- Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Lyngby, Denmark
| | - Matthias Bernt
- Department of Computational Biology, Helmholtz Centre for Environmental Research GmbH-UFZ, Leipzig, Germany
| | - Nikolaos Patikas
- Evergrande Center for Immunologic Diseases Harvard Medical School and Brigham and Women's Hospital, Boston, MA, USA
| | - Matteo Pilz
- Applied Bioinformatics, Department of Computer Science, University of Tuebingen, Tübingen, Germany
- Institute for Bioinformatics and Medical Informatics, University of Tuebingen, Tübingen, Germany
| | - Michał Piotr Startek
- Faculty of Mathematics, Informatics and Mechanics, University of Warsaw, Warsaw, Poland
- Institute for Immunology, University Medical Center of the Johannes-Gutenberg University, Mainz, Germany
| | - Svetlana Kutuzova
- Department of Computer Science/Novo Nordisk Foundation Center for Protein Research, University of Copenhagen, Copenhagen, Denmark
| | - Lukas Heumos
- Institute of Computational Biology, Department of Computational Health, Helmholtz Munich, Oberschleißheim, Germany
- Institute of Lung Health and Immunity and Comprehensive Pneumology Center with the CPC-M bioArchive, Helmholtz Zentrum Munich, Member of the German Center for Lung Research (DZL), Munich, Germany
- TUM School of Life Sciences Weihenstephan, Technical University of Munich, Freising, Germany
| | - Joshua Charkow
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario, Canada
- The Donnelly Centre, University of Toronto, Toronto, Ontario, Canada
| | - Justin Cyril Sing
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario, Canada
- The Donnelly Centre, University of Toronto, Toronto, Ontario, Canada
| | - Ayesha Feroz
- Applied Bioinformatics, Department of Computer Science, University of Tuebingen, Tübingen, Germany
- Institute for Bioinformatics and Medical Informatics, University of Tuebingen, Tübingen, Germany
| | - Arslan Siraj
- Applied Bioinformatics, Department of Computer Science, University of Tuebingen, Tübingen, Germany
- Institute for Bioinformatics and Medical Informatics, University of Tuebingen, Tübingen, Germany
| | | | - Tjeerd M H Dijkstra
- Department for Women's Health, University Clinic Tübingen, Tübingen, Germany
- Institute for Translational Bioinformatics, University Hospital Tübingen, Tübingen, Germany
| | - Yasset Perez-Riverol
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), , Wellcome Trust Genome Campus, Hinxton, UK
| | - Hannes Röst
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario, Canada
- The Donnelly Centre, University of Toronto, Toronto, Ontario, Canada
| | - Oliver Kohlbacher
- Applied Bioinformatics, Department of Computer Science, University of Tuebingen, Tübingen, Germany
- Institute for Bioinformatics and Medical Informatics, University of Tuebingen, Tübingen, Germany
- Institute for Translational Bioinformatics, University Hospital Tübingen, Tübingen, Germany
| | - Timo Sachsenberg
- Applied Bioinformatics, Department of Computer Science, University of Tuebingen, Tübingen, Germany.
- Institute for Bioinformatics and Medical Informatics, University of Tuebingen, Tübingen, Germany.
| |
Collapse
|
37
|
Jayaprakash R, Pook C, Ramzan F, Miles-Chan JL, Mithen RF, Foster M. Human Metabolism and Excretion of Kawakawa (Piper excelsum) Leaf Chemicals. Mol Nutr Food Res 2024; 68:e2300583. [PMID: 38389156 DOI: 10.1002/mnfr.202300583] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2023] [Indexed: 02/24/2024]
Abstract
SCOPE Piper excelsum (kawakawa) has a history of therapeutic use by Māori in Aotearoa New Zealand. It is currently widely consumed as a beverage and included as an ingredient in "functional" food product. Leaves contain compounds that are also found in a wide range of other spices, foods, and medicinal plants. This study investigates the human metabolism and excretion of kawakawa leaf chemicals. METHODS AND RESULTS Six healthy male volunteers in one study (Bioavailability of Kawakawa Tea metabolites in human volunteers [BOKA-T]) and 30 volunteers (15 male and 15 female) in a second study (Impact of acute Kawakawa Tea ingestion on postprandial glucose metabolism in healthy human volunteers [TOAST]) consume a hot water infusion of dried kawakawa leaves (kawakawa tea [KT]). Untargeted Liquid Chromatography-Tandem Mass spectrometry (LC-MS/MS) analyses of urine samples from BOKA-T identified 26 urinary metabolites that are significantly associated with KT consumption, confirmed by the analysis of samples from the independent TOAST study. Seven of the 26 metabolites are also detected in plasma. Thirteen of the 26 urinary compounds are provisionally identified as metabolites of specific compounds in KT, eight metabolites are identified as being derived from specific compounds in KT but without resolution of chemical structure, and five are of unknown origin. CONCLUSIONS Several kawakawa compounds that are also widely found in other plants are bioavailable and are modified by phase 1 and 2 metabolism.
Collapse
Affiliation(s)
- Ramya Jayaprakash
- Liggins Institute, Waipapa Taumata Rau - The University of Auckland, 85 Park Road, Private Bag 92019, Auckland, 1142, New Zealand
| | - Chris Pook
- Liggins Institute, Waipapa Taumata Rau - The University of Auckland, 85 Park Road, Private Bag 92019, Auckland, 1142, New Zealand
| | - Farha Ramzan
- Liggins Institute, Waipapa Taumata Rau - The University of Auckland, 85 Park Road, Private Bag 92019, Auckland, 1142, New Zealand
| | - Jennifer L Miles-Chan
- Human Nutrition Unit, School of Biological Sciences, Waipapa Taumata Rau - The University of Auckland, Auckland, New Zealand
| | - Richard F Mithen
- Liggins Institute, Waipapa Taumata Rau - The University of Auckland, 85 Park Road, Private Bag 92019, Auckland, 1142, New Zealand
| | - Meika Foster
- Liggins Institute, Waipapa Taumata Rau - The University of Auckland, 85 Park Road, Private Bag 92019, Auckland, 1142, New Zealand
- AuOra Ltd, Wakatū Incorporation, Nelson, 7010, New Zealand
| |
Collapse
|
38
|
Mitchell JM, Chi Y, Thapa M, Pang Z, Xia J, Li S. Common data models to streamline metabolomics processing and annotation, and implementation in a Python pipeline. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.02.13.580048. [PMID: 38405981 PMCID: PMC10888883 DOI: 10.1101/2024.02.13.580048] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/27/2024]
Abstract
To standardize metabolomics data analysis and facilitate future computational developments, it is essential is have a set of well-defined templates for common data structures. Here we describe a collection of data structures involved in metabolomics data processing and illustrate how they are utilized in a full-featured Python-centric pipeline. We demonstrate the performance of the pipeline, and the details in annotation and quality control using large-scale LC-MS metabolomics and lipidomics data and LC-MS/MS data. Multiple previously published datasets are also reanalyzed to showcase its utility in biological data analysis. This pipeline allows users to streamline data processing, quality control, annotation, and standardization in an efficient and transparent manner. This work fills a major gap in the Python ecosystem for computational metabolomics.
Collapse
Affiliation(s)
- Joshua M. Mitchell
- The Jackson Laboratory for Genomic Medicine, 10 Discovery Drive, Farmington, CT 06032, USA
| | - Yuanye Chi
- The Jackson Laboratory for Genomic Medicine, 10 Discovery Drive, Farmington, CT 06032, USA
| | - Maheshwor Thapa
- The Jackson Laboratory for Genomic Medicine, 10 Discovery Drive, Farmington, CT 06032, USA
| | - Zhiqiang Pang
- Institute of Parasitology, McGill University, Montreal, Quebec, Canada
| | - Jianguo Xia
- Institute of Parasitology, McGill University, Montreal, Quebec, Canada
| | - Shuzhao Li
- The Jackson Laboratory for Genomic Medicine, 10 Discovery Drive, Farmington, CT 06032, USA
- University of Connecticut School of Medicine, Farmington, CT 06032, USA
| |
Collapse
|
39
|
Steens JA, Bravo JP, Salazar CRP, Yildiz C, Amieiro AM, Köstlbacher S, Prinsen SH, Andres AS, Patinios C, Bardis A, Barendregt A, Scheltema RA, Ettema TJ, van der Oost J, Taylor DW, Staals RH. Type III-B CRISPR-Cas cascade of proteolytic cleavages. Science 2024; 383:512-519. [PMID: 38301007 PMCID: PMC11220425 DOI: 10.1126/science.adk0378] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2023] [Accepted: 12/20/2023] [Indexed: 02/03/2024]
Abstract
The generation of cyclic oligoadenylates and subsequent allosteric activation of proteins that carry sensory domains is a distinctive feature of type III CRISPR-Cas systems. In this work, we characterize a set of associated genes of a type III-B system from Haliangium ochraceum that contains two caspase-like proteases, SAVED-CHAT and PCaspase (prokaryotic caspase), co-opted from a cyclic oligonucleotide-based antiphage signaling system (CBASS). Cyclic tri-adenosine monophosphate (AMP)-induced oligomerization of SAVED-CHAT activates proteolytic activity of the CHAT domains, which specifically cleave and activate PCaspase. Subsequently, activated PCaspase cleaves a multitude of proteins, which results in a strong interference phenotype in vivo in Escherichia coli. Taken together, our findings reveal how a CRISPR-Cas-based detection of a target RNA triggers a cascade of caspase-associated proteolytic activities.
Collapse
Affiliation(s)
- Jurre A. Steens
- Laboratory of Microbiology, Wageningen University and Research; Wageningen, The Netherlands
- Scope Biosciences B.V.; Wageningen, The Netherlands
| | - Jack P.K. Bravo
- Department of Molecular Biosciences, University of Texas at Austin; Austin, USA
| | | | - Caglar Yildiz
- Laboratory of Microbiology, Wageningen University and Research; Wageningen, The Netherlands
| | - Afonso M. Amieiro
- Laboratory of Microbiology, Wageningen University and Research; Wageningen, The Netherlands
| | - Stephan Köstlbacher
- Laboratory of Microbiology, Wageningen University and Research; Wageningen, The Netherlands
| | | | - Ane S. Andres
- Laboratory of Microbiology, Wageningen University and Research; Wageningen, The Netherlands
| | - Constantinos Patinios
- Laboratory of Microbiology, Wageningen University and Research; Wageningen, The Netherlands
| | - Andreas Bardis
- Laboratory of Microbiology, Wageningen University and Research; Wageningen, The Netherlands
| | - Arjan Barendregt
- Biomolecular Mass Spectrometry and Proteomics, University of Utrecht; Utrecht, The Netherlands
| | - Richard A. Scheltema
- Biomolecular Mass Spectrometry and Proteomics, University of Utrecht; Utrecht, The Netherlands
| | - Thijs J.G. Ettema
- Laboratory of Microbiology, Wageningen University and Research; Wageningen, The Netherlands
| | - John van der Oost
- Laboratory of Microbiology, Wageningen University and Research; Wageningen, The Netherlands
| | - David W. Taylor
- Department of Molecular Biosciences, University of Texas at Austin; Austin, USA
| | - Raymond H.J. Staals
- Laboratory of Microbiology, Wageningen University and Research; Wageningen, The Netherlands
| |
Collapse
|
40
|
Lou R, Shui W. Acquisition and Analysis of DIA-Based Proteomic Data: A Comprehensive Survey in 2023. Mol Cell Proteomics 2024; 23:100712. [PMID: 38182042 PMCID: PMC10847697 DOI: 10.1016/j.mcpro.2024.100712] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2023] [Revised: 12/27/2023] [Accepted: 01/02/2024] [Indexed: 01/07/2024] Open
Abstract
Data-independent acquisition (DIA) mass spectrometry (MS) has emerged as a powerful technology for high-throughput, accurate, and reproducible quantitative proteomics. This review provides a comprehensive overview of recent advances in both the experimental and computational methods for DIA proteomics, from data acquisition schemes to analysis strategies and software tools. DIA acquisition schemes are categorized based on the design of precursor isolation windows, highlighting wide-window, overlapping-window, narrow-window, scanning quadrupole-based, and parallel accumulation-serial fragmentation-enhanced DIA methods. For DIA data analysis, major strategies are classified into spectrum reconstruction, sequence-based search, library-based search, de novo sequencing, and sequencing-independent approaches. A wide array of software tools implementing these strategies are reviewed, with details on their overall workflows and scoring approaches at different steps. The generation and optimization of spectral libraries, which are critical resources for DIA analysis, are also discussed. Publicly available benchmark datasets covering global proteomics and phosphoproteomics are summarized to facilitate performance evaluation of various software tools and analysis workflows. Continued advances and synergistic developments of versatile components in DIA workflows are expected to further enhance the power of DIA-based proteomics.
Collapse
Affiliation(s)
- Ronghui Lou
- iHuman Institute, ShanghaiTech University, Shanghai, China; School of Life Science and Technology, ShanghaiTech University, Shanghai, China.
| | - Wenqing Shui
- iHuman Institute, ShanghaiTech University, Shanghai, China; School of Life Science and Technology, ShanghaiTech University, Shanghai, China.
| |
Collapse
|
41
|
Roach J, Mital R, Haffner JJ, Colwell N, Coats R, Palacios HM, Liu Z, Godinho JLP, Ness M, Peramuna T, McCall LI. Microbiome metabolite quantification methods enabling insights into human health and disease. Methods 2024; 222:81-99. [PMID: 38185226 DOI: 10.1016/j.ymeth.2023.12.007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2023] [Revised: 10/27/2023] [Accepted: 12/13/2023] [Indexed: 01/09/2024] Open
Abstract
Many of the health-associated impacts of the microbiome are mediated by its chemical activity, producing and modifying small molecules (metabolites). Thus, microbiome metabolite quantification has a central role in efforts to elucidate and measure microbiome function. In this review, we cover general considerations when designing experiments to quantify microbiome metabolites, including sample preparation, data acquisition and data processing, since these are critical to downstream data quality. We then discuss data analysis and experimental steps to demonstrate that a given metabolite feature is of microbial origin. We further discuss techniques used to quantify common microbial metabolites, including short-chain fatty acids (SCFA), secondary bile acids (BAs), tryptophan derivatives, N-acyl amides and trimethylamine N-oxide (TMAO). Lastly, we conclude with challenges and future directions for the field.
Collapse
Affiliation(s)
- Jarrod Roach
- Department of Chemistry and Biochemistry, University of Oklahoma
| | - Rohit Mital
- Department of Biology, University of Oklahoma
| | - Jacob J Haffner
- Department of Anthropology, University of Oklahoma; Laboratories of Molecular Anthropology and Microbiome Research, University of Oklahoma
| | - Nathan Colwell
- Department of Chemistry and Biochemistry, University of Oklahoma
| | - Randy Coats
- Department of Chemistry and Biochemistry, University of Oklahoma
| | - Horvey M Palacios
- Department of Anthropology, University of Oklahoma; Laboratories of Molecular Anthropology and Microbiome Research, University of Oklahoma
| | - Zongyuan Liu
- Department of Chemistry and Biochemistry, University of Oklahoma
| | | | - Monica Ness
- Department of Chemistry and Biochemistry, University of Oklahoma
| | - Thilini Peramuna
- Department of Chemistry and Biochemistry, University of Oklahoma
| | - Laura-Isobel McCall
- Department of Chemistry and Biochemistry, University of Oklahoma; Laboratories of Molecular Anthropology and Microbiome Research, University of Oklahoma; Department of Chemistry and Biochemistry, San Diego State University.
| |
Collapse
|
42
|
Sandström H, Rissanen M, Rousu J, Rinke P. Data-Driven Compound Identification in Atmospheric Mass Spectrometry. ADVANCED SCIENCE (WEINHEIM, BADEN-WURTTEMBERG, GERMANY) 2024; 11:e2306235. [PMID: 38095508 PMCID: PMC10885664 DOI: 10.1002/advs.202306235] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/31/2023] [Revised: 11/04/2023] [Indexed: 02/24/2024]
Abstract
Aerosol particles found in the atmosphere affect the climate and worsen air quality. To mitigate these adverse impacts, aerosol particle formation and aerosol chemistry in the atmosphere need to be better mapped out and understood. Currently, mass spectrometry is the single most important analytical technique in atmospheric chemistry and is used to track and identify compounds and processes. Large amounts of data are collected in each measurement of current time-of-flight and orbitrap mass spectrometers using modern rapid data acquisition practices. However, compound identification remains a major bottleneck during data analysis due to lacking reference libraries and analysis tools. Data-driven compound identification approaches could alleviate the problem, yet remain rare to non-existent in atmospheric science. In this perspective, the authors review the current state of data-driven compound identification with mass spectrometry in atmospheric science and discuss current challenges and possible future steps toward a digital era for atmospheric mass spectrometry.
Collapse
Affiliation(s)
- Hilda Sandström
- Department of Applied Physics, Aalto University, P.O. Box 11000, FI-00076, Aalto, Espoo, Finland
| | - Matti Rissanen
- Aerosol Physics Laboratory, Tampere University, FI-33720, Tampere, Finland
- Department of Chemistry, University of Helsinki, P.O. Box 55, A.I. Virtasen aukio 1, FI-00560, Helsinki, Finland
| | - Juho Rousu
- Department of Computer Science, Aalto University, P.O. Box 11000, FI-00076, Aalto, Espoo, Finland
| | - Patrick Rinke
- Department of Applied Physics, Aalto University, P.O. Box 11000, FI-00076, Aalto, Espoo, Finland
| |
Collapse
|
43
|
Svecla M, Da Dalt L, Moregola A, Nour J, Baragetti A, Uboldi P, Donetti E, Arnaboldi L, Beretta G, Bonacina F, Norata GD. ASGR1 deficiency diverts lipids toward adipose tissue but results in liver damage during obesity. Cardiovasc Diabetol 2024; 23:42. [PMID: 38281933 PMCID: PMC10823681 DOI: 10.1186/s12933-023-02099-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/15/2023] [Accepted: 12/20/2023] [Indexed: 01/30/2024] Open
Abstract
BACKGROUND Asialoglycoprotein receptor 1 (ASGR1), primarily expressed on hepatocytes, promotes the clearance and the degradation of glycoproteins, including lipoproteins, from the circulation. In humans, loss-of-function variants of ASGR1 are associated with a favorable metabolic profile and reduced incidence of cardiovascular diseases. The molecular mechanisms by which ASGR1 could affect the onset of metabolic syndrome and obesity are unclear. Therefore, here we investigated the contribution of ASGR1 in the development of metabolic syndrome and obesity. METHODS ASGR1 deficient mice (ASGR1-/-) were subjected to a high-fat diet (45% Kcal from fat) for 20 weeks. The systemic metabolic profile, hepatic and visceral adipose tissue were characterized for metabolic and structural alterations, as well as for immune cells infiltration. RESULTS ASGR1-/- mice present a hypertrophic adipose tissue with 41% increase in fat accumulation in visceral adipose tissue (VAT), alongside with alteration in lipid metabolic pathways. Intriguingly, ASGR1-/- mice exhibit a comparable response to an acute glucose and insulin challenge in circulation, coupled with notably decreased in circulating cholesterol levels. Although the liver of ASGR1-/- have similar lipid accumulation to the WT mice, they present elevated levels of liver inflammation and a decrease in mitochondrial function. CONCLUSION ASGR1 deficiency impacts energetic homeostasis during obesity leading to improved plasma lipid levels but increased VAT lipid accumulation and liver damage.
Collapse
Affiliation(s)
- Monika Svecla
- Department of Pharmacological and Biomolecular Science "Rodolfo Paoletti", Università degli Studi di Milano, Milan, Italy
| | - Lorenzo Da Dalt
- Department of Pharmacological and Biomolecular Science "Rodolfo Paoletti", Università degli Studi di Milano, Milan, Italy
| | - Annalisa Moregola
- Department of Pharmacological and Biomolecular Science "Rodolfo Paoletti", Università degli Studi di Milano, Milan, Italy
| | - Jasmine Nour
- Department of Pharmacological and Biomolecular Science "Rodolfo Paoletti", Università degli Studi di Milano, Milan, Italy
| | - Andrea Baragetti
- Department of Pharmacological and Biomolecular Science "Rodolfo Paoletti", Università degli Studi di Milano, Milan, Italy
| | - Patrizia Uboldi
- Department of Pharmacological and Biomolecular Science "Rodolfo Paoletti", Università degli Studi di Milano, Milan, Italy
| | - Elena Donetti
- Department of Biomedical Science for Health, Università degli Studi di Milano, Milan, Italy
| | - Lorenzo Arnaboldi
- Department of Pharmacological and Biomolecular Science "Rodolfo Paoletti", Università degli Studi di Milano, Milan, Italy
| | - Giangiacomo Beretta
- Department of Environmental Science and Policy, Università degli Studi di Milano, Milan, Italy
| | - Fabrizia Bonacina
- Department of Pharmacological and Biomolecular Science "Rodolfo Paoletti", Università degli Studi di Milano, Milan, Italy
| | - Giuseppe Danilo Norata
- Department of Pharmacological and Biomolecular Science "Rodolfo Paoletti", Università degli Studi di Milano, Milan, Italy.
| |
Collapse
|
44
|
Shahbazy M, Ramarathinam SH, Li C, Illing PT, Faridi P, Croft NP, Purcell AW. MHCpLogics: an interactive machine learning-based tool for unsupervised data visualization and cluster analysis of immunopeptidomes. Brief Bioinform 2024; 25:bbae087. [PMID: 38487848 PMCID: PMC10940831 DOI: 10.1093/bib/bbae087] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2023] [Revised: 12/12/2023] [Accepted: 02/15/2024] [Indexed: 03/18/2024] Open
Abstract
The major histocompatibility complex (MHC) encodes a range of immune response genes, including the human leukocyte antigens (HLAs) in humans. These molecules bind peptide antigens and present them on the cell surface for T cell recognition. The repertoires of peptides presented by HLA molecules are termed immunopeptidomes. The highly polymorphic nature of the genres that encode the HLA molecules confers allotype-specific differences in the sequences of bound ligands. Allotype-specific ligand preferences are often defined by peptide-binding motifs. Individuals express up to six classical class I HLA allotypes, which likely present peptides displaying different binding motifs. Such complex datasets make the deconvolution of immunopeptidomic data into allotype-specific contributions and further dissection of binding-specificities challenging. Herein, we developed MHCpLogics as an interactive machine learning-based tool for mining peptide-binding sequence motifs and visualization of immunopeptidome data across complex datasets. We showcase the functionalities of MHCpLogics by analyzing both in-house and published mono- and multi-allelic immunopeptidomics data. The visualization modalities of MHCpLogics allow users to inspect clustered sequences down to individual peptide components and to examine broader sequence patterns within multiple immunopeptidome datasets. MHCpLogics can deconvolute large immunopeptidome datasets enabling the interrogation of clusters for the segregation of allotype-specific peptide sequence motifs, identification of sub-peptidome motifs, and the exportation of clustered peptide sequence lists. The tool facilitates rapid inspection of immunopeptidomes as a resource for the immunology and vaccine communities. MHCpLogics is a standalone application available via an executable installation at: https://github.com/PurcellLab/MHCpLogics.
Collapse
Affiliation(s)
- Mohammad Shahbazy
- Department of Biochemistry and Molecular Biology and Infection and Immunity Program, Biomedicine Discovery Institute, Monash University, Melbourne, VIC 3800, Australia
| | - Sri H Ramarathinam
- Department of Biochemistry and Molecular Biology and Infection and Immunity Program, Biomedicine Discovery Institute, Monash University, Melbourne, VIC 3800, Australia
| | - Chen Li
- Department of Biochemistry and Molecular Biology and Infection and Immunity Program, Biomedicine Discovery Institute, Monash University, Melbourne, VIC 3800, Australia
| | - Patricia T Illing
- Department of Biochemistry and Molecular Biology and Infection and Immunity Program, Biomedicine Discovery Institute, Monash University, Melbourne, VIC 3800, Australia
| | - Pouya Faridi
- Centre for Cancer Research, Hudson Institute of Medical Research, Clayton, VIC 3168, Australia
- Monash Proteomics and Metabolomics Platform, Department of Medicine, School of Clinical Sciences, Monash University, Clayton, VIC 3800, Australia
| | - Nathan P Croft
- Department of Biochemistry and Molecular Biology and Infection and Immunity Program, Biomedicine Discovery Institute, Monash University, Melbourne, VIC 3800, Australia
| | - Anthony W Purcell
- Department of Biochemistry and Molecular Biology and Infection and Immunity Program, Biomedicine Discovery Institute, Monash University, Melbourne, VIC 3800, Australia
| |
Collapse
|
45
|
Kochale K, Cunha R, Teutenberg T, Schmidt TC. Development of a column switching for direct online enrichment and separation of polar and nonpolar analytes from aqueous matrices. J Chromatogr A 2024; 1714:464554. [PMID: 38065029 DOI: 10.1016/j.chroma.2023.464554] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2023] [Revised: 11/28/2023] [Accepted: 11/30/2023] [Indexed: 01/05/2024]
Abstract
Trace substances in surface waters may threaten health and pose a risk for the aquatic environment. Moreover, separation and detection by instrumental analysis is challenging due to the low concentration and the wide range of polarities. Separation of polar and nonpolar analytes can be achieved by using stationary phases with different selectivity. Lower limits of detection of trace substances can be obtained by offline enrichment on solid phase materials. However, these practices require substantial effort and are time consuming and costly. Therefore, in this study, a column switching was developed to enrich and separate both polar and nonpolar analytes by an on-column large volume injection of aqueous samples. The column switching can significantly reduce the effort and time for analyzing trace substances without compromising on separation and detection. A reversed phase (RP) column is used to trap the nonpolar analytes. The polar analytes are enriched on a porous graphitized carbon column (PGC) coupled serially behind the RP column. A novel valve switching system is implemented to enable elution of the nonpolar analytes from the RP column and, subsequently, elution of polar analytes from the PGC column and separation on a hydrophilic interaction liquid chromatography (HILIC) column. To enable separation of polar analytes dissolved in an aqueous matrix by HILIC, the water plug that is flushed from the PGC column is diluted by dosing organic solvent directly upstream of the HILIC column. The developed method was tested by applying target analysis and non-target screening, highlighting the advantage to effectively separate and detect both polar and nonpolar compounds in a single chromatographic run. In the target analysis, the analytes, with a logD at pH 3 ranging from -2.8 to + 4.5, could be enriched and separated. Besides the 965 features in the RP phase, 572 features from real wastewater were observed in the HILIC phase which would otherwise elute in the void time in conventional one-dimensional RP methods.
Collapse
Affiliation(s)
- Kjell Kochale
- Institut für Umwelt & Energie, Technik & Analytik e. V. (IUTA), Bliersheimer Str. 58-60, 47229 Duisburg, Germany; Instrumental Analytical Chemistry, University of Duisburg-Essen, Universitätsstr. 5, 45141 Essen, Germany
| | - Ricardo Cunha
- Institut für Umwelt & Energie, Technik & Analytik e. V. (IUTA), Bliersheimer Str. 58-60, 47229 Duisburg, Germany
| | - Thorsten Teutenberg
- Institut für Umwelt & Energie, Technik & Analytik e. V. (IUTA), Bliersheimer Str. 58-60, 47229 Duisburg, Germany.
| | - Torsten C Schmidt
- Instrumental Analytical Chemistry, University of Duisburg-Essen, Universitätsstr. 5, 45141 Essen, Germany
| |
Collapse
|
46
|
Zancolli G, von Reumont BM, Anderluh G, Caliskan F, Chiusano ML, Fröhlich J, Hapeshi E, Hempel BF, Ikonomopoulou MP, Jungo F, Marchot P, de Farias TM, Modica MV, Moran Y, Nalbantsoy A, Procházka J, Tarallo A, Tonello F, Vitorino R, Zammit ML, Antunes A. Web of venom: exploration of big data resources in animal toxin research. Gigascience 2024; 13:giae054. [PMID: 39250076 PMCID: PMC11382406 DOI: 10.1093/gigascience/giae054] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2024] [Revised: 07/01/2024] [Accepted: 07/13/2024] [Indexed: 09/10/2024] Open
Abstract
Research on animal venoms and their components spans multiple disciplines, including biology, biochemistry, bioinformatics, pharmacology, medicine, and more. Manipulating and analyzing the diverse array of data required for venom research can be challenging, and relevant tools and resources are often dispersed across different online platforms, making them less accessible to nonexperts. In this article, we address the multifaceted needs of the scientific community involved in venom and toxin-related research by identifying and discussing web resources, databases, and tools commonly used in this field. We have compiled these resources into a comprehensive table available on the VenomZone website (https://venomzone.expasy.org/10897). Furthermore, we highlight the challenges currently faced by researchers in accessing and using these resources and emphasize the importance of community-driven interdisciplinary approaches. We conclude by underscoring the significance of enhancing standards, promoting interoperability, and encouraging data and method sharing within the venom research community.
Collapse
Affiliation(s)
- Giulia Zancolli
- Department of Ecology and Evolution, University of Lausanne, 1015 Lausanne, Switzerland
- SIB Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| | - Björn Marcus von Reumont
- Goethe University Frankfurt, Faculty of Biological Sciences, 60438 Frankfurt, Germany
- LOEWE Centre for Translational Biodiversity Genomics, 60325 Frankfurt, Germany
| | - Gregor Anderluh
- Department of Molecular Biology and Nanobiotechnology, National Institute of Chemistry, 1000 Ljubljana, Slovenia
| | - Figen Caliskan
- Department of Biology, Faculty of Science, Eskisehir Osmangazi University, 26040 Eskişehir, Turkey
| | - Maria Luisa Chiusano
- Department of Agricultural Sciences, University Federico II of Naples, 80055 Portici, Naples, Italy
- Department of Research Infrastructures for Marine Biological Resources, Stazione Zoologica Anton Dohrn, Villa Comunale, 80121 Naples, Italy
| | - Jacob Fröhlich
- Veterinary Center for Resistance Research (TZR), Freie Universität Berlin, 14163 Berlin, Germany
| | - Evroula Hapeshi
- Department of Health Sciences, School of Life and Health Sciences, University of Nicosia, 1700 Nicosia, Cyprus
| | - Benjamin-Florian Hempel
- Veterinary Center for Resistance Research (TZR), Freie Universität Berlin, 14163 Berlin, Germany
| | - Maria P Ikonomopoulou
- Madrid Institute of Advanced Studies in Food, Precision Nutrition & Aging Program, 28049 Madrid, Spain
| | - Florence Jungo
- SIB Swiss Institute of Bioinformatics, Swiss-Prot Group, 1211 Geneva, Switzerland
| | - Pascale Marchot
- Laboratory Architecture et Fonction des Macromolécules Biologiques, Aix-Marseille University, Centre National de la Recherche Scientifique, Faculté des Sciences, Campus Luminy, 13288 Marseille, France
| | - Tarcisio Mendes de Farias
- Department of Ecology and Evolution, University of Lausanne, 1015 Lausanne, Switzerland
- SIB Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| | - Maria Vittoria Modica
- Department of Biology and Evolution of Marine Organisms, Stazione Zoologica Anton Dohrn, 00198 Rome, Italy
| | - Yehu Moran
- Department of Ecology, Evolution and Behavior, Alexander Silberman Institute of Life Sciences, Faculty of Science, The Hebrew University of Jerusalem, 9190401 Jerusalem, Israel
| | - Ayse Nalbantsoy
- Engineering Faculty, Bioengineering Department, Ege University, 35100 Bornova-Izmir, Turkey
| | - Jan Procházka
- Laboratory of Transgenic Models of Diseases, Institute of Molecular Genetics of the Czech Academy of Sciences, 252 50 Vestec, Czech Republic
| | - Andrea Tarallo
- Institute of Research on Terrestrial Ecosystems (IRET), National Research Council (CNR), 73100 Lecce, Italy
| | - Fiorella Tonello
- Neuroscience Institute, National Research Council (CNR), 35131 Padua, Italy
| | - Rui Vitorino
- Department of Medical Sciences, iBiMED, University of Aveiro, 3810-193 Aveiro, Portugal
| | - Mark Lawrence Zammit
- Department of Clinical Pharmacology & Therapeutics, Faculty of Medicine & Surgery, University of Malta, 2090 Msida, Malta
- Malta National Poisons Centre, Malta Life Sciences Park, 3000 San Ġwann, Malta
| | - Agostinho Antunes
- CIIMAR/CIMAR, Interdisciplinary Centre of Marine and Environmental Research, University of Porto, 4450-208 Porto, Portugal
- Department of Biology, Faculty of Sciences, University of Porto, 4169-007 Porto, Portugal
| |
Collapse
|
47
|
Declercq A, Demeulemeester N, Gabriels R, Bouwmeester R, Degroeve S, Martens L. Bioinformatics Pipeline for Processing Single-Cell Data. Methods Mol Biol 2024; 2817:221-239. [PMID: 38907156 DOI: 10.1007/978-1-0716-3934-4_15] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/23/2024]
Abstract
Single-cell proteomics can offer valuable insights into dynamic cellular interactions, but identifying proteins at this level is challenging due to their low abundance. In this chapter, we present a state-of-the-art bioinformatics pipeline for single-cell proteomics that combines the search engine Sage (via SearchGUI), identification rescoring with MS2Rescore, quantification through FlashLFQ, and differential expression analysis using MSqRob2. MS2Rescore leverages LC-MS/MS behavior predictors, such as MS2PIP and DeepLC, to recalibrate scores with Percolator or mokapot. Combining these tools into a unified pipeline, this approach improves the detection of low-abundance peptides, resulting in increased identifications while maintaining stringent FDR thresholds.
Collapse
Affiliation(s)
- Arthur Declercq
- VIB-UGent Center for Medical Biotechnology, VIB, Ghent, Belgium
- Department of Biomolecular Medicine, Ghent University, Ghent, Belgium
| | - Nina Demeulemeester
- VIB-UGent Center for Medical Biotechnology, VIB, Ghent, Belgium
- Department of Biomolecular Medicine, Ghent University, Ghent, Belgium
- StatOmics, Department of Applied Mathematics, Computer Science and Statistics, Ghent University, Ghent, Belgium
| | - Ralf Gabriels
- VIB-UGent Center for Medical Biotechnology, VIB, Ghent, Belgium
- Department of Biomolecular Medicine, Ghent University, Ghent, Belgium
| | - Robbin Bouwmeester
- VIB-UGent Center for Medical Biotechnology, VIB, Ghent, Belgium
- Department of Biomolecular Medicine, Ghent University, Ghent, Belgium
| | - Sven Degroeve
- VIB-UGent Center for Medical Biotechnology, VIB, Ghent, Belgium
- Department of Biomolecular Medicine, Ghent University, Ghent, Belgium
| | - Lennart Martens
- VIB-UGent Center for Medical Biotechnology, VIB, Ghent, Belgium.
- Department of Biomolecular Medicine, Ghent University, Ghent, Belgium.
| |
Collapse
|
48
|
Wu W, Huang Z, Kong W, Peng H, Goh WWB. Optimizing the PROTREC network-based missing protein prediction algorithm. Proteomics 2024; 24:e2200332. [PMID: 37876146 DOI: 10.1002/pmic.202200332] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2022] [Revised: 09/30/2023] [Accepted: 10/06/2023] [Indexed: 10/26/2023]
Abstract
This article summarizes the PROTREC method and investigates the impact that the different hyper-parameters have on the task of missing protein prediction using PROTREC. We evaluate missing protein recovery rates using different PROTREC score selection approaches (MAX, MIN, MEDIAN, and MEAN), different PROTREC score thresholds, as well as different complex size thresholds. In addition, we included two additional cancer datasets in our analysis and introduced a new validation method to check both the robustness of the PROTREC method as well as the correctness of our analysis. Our analysis showed that the missing protein recovery rate can be improved by adopting PROTREC score selection operations of MIN, MEDIAN, and MEAN instead of the default MAX. However, this may come at a cost of reduced numbers of proteins predicted and validated. The users should therefore choose their hyper-parameters carefully to find a balance in the accuracy-quantity trade-off. We also explored the possibility of combining PROTREC with a p-value-based method (FCS) and demonstrated that PROTREC is able to perform well independently without any help from a p-value-based method. Furthermore, we conducted a downstream enrichment analysis to understand the biological pathways and protein networks within the cancerous tissues using the recovered proteins. Missing protein recovery rate using PROTREC can be improved by selecting a different PROTREC score selection method. Different PROTREC score selection methods and other hyper-parameters such as PROTREC score threshold and complex size threshold introduce accuracy-quantity trade-off. PROTREC is able to perform well independently of any filtering using a p-value-based method. Verification of the PROTREC method on additional cancer datasets. Downstream Enrichment Analysis to understand the biological pathways and protein networks in cancerous tissues.
Collapse
Affiliation(s)
- Wenshan Wu
- School of Computer Science and Engineering, Nanyang Technological University, Singapore, Singapore
| | - Zelu Huang
- School of Chemistry, Chemical Engineering and Biotechnology, Nanyang Technological University, Singapore, Singapore
| | - Weijia Kong
- Department of Computer Science, National University of Singapore, Singapore, Singapore
- School of Biological Science, Nanyang Technological University, Singapore, Singapore
| | - Hui Peng
- School of Biological Science, Nanyang Technological University, Singapore, Singapore
| | - Wilson Wen Bin Goh
- School of Biological Science, Nanyang Technological University, Singapore, Singapore
- Lee Kong Chian School of Medicine, Nanyang Technological University, Singapore, Singapore
- Center for Biomedical Informatics, Nanyang Technological University, Singapore, Singapore
| |
Collapse
|
49
|
Bichmann L, Gupta S, Röst H. Data-Independent Acquisition Peptidomics. Methods Mol Biol 2024; 2758:77-88. [PMID: 38549009 DOI: 10.1007/978-1-0716-3646-6_4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/02/2024]
Abstract
In recent years, data-independent acquisition (DIA) has emerged as a powerful analysis method in biological mass spectrometry (MS). Compared to the previously predominant data-dependent acquisition (DDA), it offers a way to achieve greater reproducibility, sensitivity, and dynamic range in MS measurements. To make DIA accessible to non-expert users, a multifunctional, automated high-throughput pipeline DIAproteomics was implemented in the computational workflow framework "Nextflow" ( https://nextflow.io ). This allows high-throughput processing of proteomics and peptidomics DIA datasets on diverse computing infrastructures. This chapter provides a short summary and usage protocol guide for the most important modes of operation of this pipeline regarding the analysis of peptidomics datasets using the command line. In brief, DIAproteomics is a wrapper around the OpenSwathWorkflow and relies on either existing or ad-hoc generated spectral libraries from matching DDA runs. The OpenSwathWorkflow extracts chromatograms from the DIA runs and performs chromatographic peak-picking. Further downstream of the pipeline, these peaks are scored, aligned, and statistically evaluated for qualitative and quantitative differences across conditions depending on the user's interest. DIAproteomics is open-source and available under a permissive license. We encourage the scientific community to use or modify the pipeline to meet their specific requirements.
Collapse
Affiliation(s)
- Leon Bichmann
- Department of Computer Science, Applied Bioinformatics, University of Tübingen, Tübingen, Germany
| | - Shubham Gupta
- Donnelly Center for Cellular and Biomolecular Research, University of Toronto, Toronto, ON, Canada
| | - Hannes Röst
- Donnelly Center for Cellular and Biomolecular Research, University of Toronto, Toronto, ON, Canada
| |
Collapse
|
50
|
Zweigle J, Bugsel B, Fabregat-Palau J, Zwiener C. PFΔScreen - an open-source tool for automated PFAS feature prioritization in non-target HRMS data. Anal Bioanal Chem 2024; 416:349-362. [PMID: 38030884 PMCID: PMC10761406 DOI: 10.1007/s00216-023-05070-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2023] [Revised: 11/17/2023] [Accepted: 11/21/2023] [Indexed: 12/01/2023]
Abstract
Per- and polyfluoroalkyl substances (PFAS) are a huge group of anthropogenic chemicals with unique properties that are used in countless products and applications. Due to the high stability of their C-F bonds, PFAS or their transformation products (TPs) are persistent in the environment, leading to ubiquitous detection in various samples worldwide. Since PFAS are industrial chemicals, the availability of authentic PFAS reference standards is limited, making non-target screening (NTS) approaches based on high-resolution mass spectrometry (HRMS) necessary for a more comprehensive characterization. NTS usually is a time-consuming process, since only a small fraction of the detected chemicals can be identified. Therefore, efficient prioritization of relevant HRMS signals is one of the most crucial steps. We developed PFΔScreen, a Python-based open-source tool with a simple graphical user interface (GUI) to perform efficient feature prioritization using several PFAS-specific techniques such as the highly promising MD/C-m/C approach, Kendrick mass defect analysis, diagnostic fragments (MS2), fragment mass differences (MS2), and suspect screening. Feature detection from vendor-independent MS raw data (mzML, data-dependent acquisition) is performed via pyOpenMS (or custom feature lists) with subsequent calculations for prioritization and identification of PFAS in both HPLC- and GC-HRMS data. The PFΔScreen workflow is presented on four PFAS-contaminated agricultural soil samples from south-western Germany. Over 15 classes of PFAS (more than 80 single compounds with several isomers) could be identified, including four novel classes, potentially TPs of the precursors fluorotelomer mercapto alkyl phosphates (FTMAPs). PFΔScreen can be used within the Python environment and is easily automatically installable and executable on Windows. Its source code is freely available on GitHub ( https://github.com/JonZwe/PFAScreen ).
Collapse
Affiliation(s)
- Jonathan Zweigle
- Environmental Analytical Chemistry, Department of Geosciences, University of Tübingen, Schnarrenbergstraße 94-96, 72076, Tübingen, Germany.
| | - Boris Bugsel
- Environmental Analytical Chemistry, Department of Geosciences, University of Tübingen, Schnarrenbergstraße 94-96, 72076, Tübingen, Germany
| | - Joel Fabregat-Palau
- Hydrogeochemistry, Department of Geosciences, University of Tübingen, Schnarrenbergstraße 94-96, 72076, Tübingen, Germany
| | - Christian Zwiener
- Environmental Analytical Chemistry, Department of Geosciences, University of Tübingen, Schnarrenbergstraße 94-96, 72076, Tübingen, Germany.
| |
Collapse
|