101
|
Shah NM, Jang HJ, Liang Y, Maeng JH, Tzeng SC, Wu A, Basri NL, Qu X, Fan C, Li A, Katz B, Li D, Xing X, Evans BS, Wang T. Pan-cancer analysis identifies tumor-specific antigens derived from transposable elements. Nat Genet 2023; 55:631-639. [PMID: 36973455 DOI: 10.1038/s41588-023-01349-3] [Citation(s) in RCA: 35] [Impact Index Per Article: 35.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2020] [Accepted: 02/23/2023] [Indexed: 03/29/2023]
Abstract
Cryptic promoters within transposable elements (TEs) can be transcriptionally reactivated in tumors to create new TE-chimeric transcripts, which can produce immunogenic antigens. We performed a comprehensive screen for these TE exaptation events in 33 TCGA tumor types, 30 GTEx adult tissues and 675 cancer cell lines, and identified 1,068 TE-exapted candidates with the potential to generate shared tumor-specific TE-chimeric antigens (TS-TEAs). Whole-lysate and HLA-pulldown mass spectrometry data confirmed that TS-TEAs are presented on the surface of cancer cells. In addition, we highlight tumor-specific membrane proteins transcribed from TE promoters that constitute aberrant epitopes on the extracellular surface of cancer cells. Altogether, we showcase the high pan-cancer prevalence of TS-TEAs and atypical membrane proteins that could potentially be therapeutically exploited and targeted.
Collapse
Affiliation(s)
- Nakul M Shah
- Department of Genetics, Washington University School of Medicine, St. Louis, MO, USA
- The Edison Family Center for Genome Sciences and Systems Biology, Washington University School of Medicine, St. Louis, MO, USA
| | - H Josh Jang
- Department of Genetics, Washington University School of Medicine, St. Louis, MO, USA
- The Edison Family Center for Genome Sciences and Systems Biology, Washington University School of Medicine, St. Louis, MO, USA
- Department of Epigenetics, Van Andel Institute, Grand Rapids, MI, USA
| | - Yonghao Liang
- Department of Genetics, Washington University School of Medicine, St. Louis, MO, USA
- The Edison Family Center for Genome Sciences and Systems Biology, Washington University School of Medicine, St. Louis, MO, USA
| | - Ju Heon Maeng
- Department of Genetics, Washington University School of Medicine, St. Louis, MO, USA
- The Edison Family Center for Genome Sciences and Systems Biology, Washington University School of Medicine, St. Louis, MO, USA
| | | | - Angela Wu
- Department of Genetics, Washington University School of Medicine, St. Louis, MO, USA
- The Edison Family Center for Genome Sciences and Systems Biology, Washington University School of Medicine, St. Louis, MO, USA
| | - Noah L Basri
- Department of Genetics, Washington University School of Medicine, St. Louis, MO, USA
- The Edison Family Center for Genome Sciences and Systems Biology, Washington University School of Medicine, St. Louis, MO, USA
| | - Xuan Qu
- Department of Genetics, Washington University School of Medicine, St. Louis, MO, USA
- The Edison Family Center for Genome Sciences and Systems Biology, Washington University School of Medicine, St. Louis, MO, USA
| | - Changxu Fan
- Department of Genetics, Washington University School of Medicine, St. Louis, MO, USA
- The Edison Family Center for Genome Sciences and Systems Biology, Washington University School of Medicine, St. Louis, MO, USA
| | - Amy Li
- Department of Genetics, Washington University School of Medicine, St. Louis, MO, USA
- The Edison Family Center for Genome Sciences and Systems Biology, Washington University School of Medicine, St. Louis, MO, USA
| | - Benjamin Katz
- Department of Genetics, Washington University School of Medicine, St. Louis, MO, USA
- The Edison Family Center for Genome Sciences and Systems Biology, Washington University School of Medicine, St. Louis, MO, USA
| | - Daofeng Li
- Department of Genetics, Washington University School of Medicine, St. Louis, MO, USA
- The Edison Family Center for Genome Sciences and Systems Biology, Washington University School of Medicine, St. Louis, MO, USA
| | - Xiaoyun Xing
- Department of Genetics, Washington University School of Medicine, St. Louis, MO, USA
- The Edison Family Center for Genome Sciences and Systems Biology, Washington University School of Medicine, St. Louis, MO, USA
| | | | - Ting Wang
- Department of Genetics, Washington University School of Medicine, St. Louis, MO, USA.
- The Edison Family Center for Genome Sciences and Systems Biology, Washington University School of Medicine, St. Louis, MO, USA.
- McDonnell Genome Institute, Washington University School of Medicine, St. Louis, MO, USA.
| |
Collapse
|
102
|
Albergamo V, Wohlleben W, Plata DL. Photochemical weathering of polyurethane microplastics produced complex and dynamic mixtures of dissolved organic chemicals. ENVIRONMENTAL SCIENCE. PROCESSES & IMPACTS 2023; 25:432-444. [PMID: 36691826 DOI: 10.1039/d2em00415a] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/17/2023]
Abstract
Sunlight exposure can naturally mitigate microplastics pollution in the surface ocean, however it results in emissions of dissolved organic carbon (DOC) whose characteristics and fate remain largely unknown. In this work, we investigated the effects of solar radiation on polyether (TPU_Ether) and polyester (TPU_Ester) thermoplastic polyurethane, and on a thermoset polyurethane (PU_Hardened). The microplastics were irradiated with simulated solar light with a UV dose of 350 MJ m-2, which corresponds to roughly 15 months outdoor exposure at 31° N latitude. The particles were characterized using ATR-FTIR and elemental analysis. The DOC released to the aqueous phase was quantified by total organic carbon analysis and characterized by nontarget liquid chromatography coupled to high-resolution mass spectrometry. Polyurethane microplastics were degraded following mechanisms reconcilable with UV photo-oxidation. The carbon mass fraction released to the aqueous phase was 8.5 ± 0.5%, 3.7 ± 0.2%, and 2.8 ± 0.2% for TPU_Ether, TPU_Ester, and PU_Hardened, respectively. The corresponding DOC release rates, expressed as mg carbon per UV dose were 0.023, 0.013, and 0.010 mg MJ-1 for TPU_Ether, TPU_Ester and PU_Hardened, respectively. Roughly three thousand unique by-products were released from photo-weathered TPUs, whereas 540 were detected in the DOC of PU_Hardened. This carbon pool was highly complex and dynamic in terms of physicochemical properties and susceptibility to further photodegradation after dissolution from the particles. Our results show that plastics photodegradation in the ocean requires chemical assessment of the DOC emissions in addition to the analysis of aged microplastics and that polymer chemistry influences the chain scission products.
Collapse
Affiliation(s)
- Vittorio Albergamo
- Department of Civil and Environmental Engineering, Massachusetts Institute of Technology, 15 Vassar Street, Cambridge, Massachusetts 02139, USA.
| | - Wendel Wohlleben
- Department of Material Physics and Analytics, Advanced Materials Research, BASF SE, Carl-Bosch-Str. 38, 67056 Ludwigshafen, Germany
- Department of Experimental Toxicology and Ecology, Advanced Materials Research, BASF SE, Carl-Bosch-Str. 38, 67056 Ludwigshafen, Germany
| | - Desirée L Plata
- Department of Civil and Environmental Engineering, Massachusetts Institute of Technology, 15 Vassar Street, Cambridge, Massachusetts 02139, USA.
| |
Collapse
|
103
|
ProInfer: An interpretable protein inference tool leveraging on biological networks. PLoS Comput Biol 2023; 19:e1010961. [PMID: 36930671 PMCID: PMC10057851 DOI: 10.1371/journal.pcbi.1010961] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2022] [Revised: 03/29/2023] [Accepted: 02/20/2023] [Indexed: 03/18/2023] Open
Abstract
In mass spectrometry (MS)-based proteomics, protein inference from identified peptides (protein fragments) is a critical step. We present ProInfer (Protein Inference), a novel protein assembly method that takes advantage of information in biological networks. ProInfer assists recovery of proteins supported only by ambiguous peptides (a peptide which maps to more than one candidate protein) and enhances the statistical confidence for proteins supported by both unique and ambiguous peptides. Consequently, ProInfer rescues weakly supported proteins thereby improving proteome coverage. Evaluated across THP1 cell line, lung cancer and RAW267.4 datasets, ProInfer always infers the most numbers of true positives, in comparison to mainstream protein inference tools Fido, EPIFANY and PIA. ProInfer is also adept at retrieving differentially expressed proteins, signifying its usefulness for functional analysis and phenotype profiling. Source codes of ProInfer are available at https://github.com/PennHui2016/ProInfer.
Collapse
|
104
|
Fu J, Zhu F, Xu CJ, Li Y. Metabolomics meets systems immunology. EMBO Rep 2023; 24:e55747. [PMID: 36916532 PMCID: PMC10074123 DOI: 10.15252/embr.202255747] [Citation(s) in RCA: 9] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2022] [Revised: 12/24/2022] [Accepted: 02/24/2023] [Indexed: 03/16/2023] Open
Abstract
Metabolic processes play a critical role in immune regulation. Metabolomics is the systematic analysis of small molecules (metabolites) in organisms or biological samples, providing an opportunity to comprehensively study interactions between metabolism and immunity in physiology and disease. Integrating metabolomics into systems immunology allows the exploration of the interactions of multilayered features in the biological system and the molecular regulatory mechanism of these features. Here, we provide an overview on recent technological developments of metabolomic applications in immunological research. To begin, two widely used metabolomics approaches are compared: targeted and untargeted metabolomics. Then, we provide a comprehensive overview of the analysis workflow and the computational tools available, including sample preparation, raw spectra data preprocessing, data processing, statistical analysis, and interpretation. Third, we describe how to integrate metabolomics with other omics approaches in immunological studies using available tools. Finally, we discuss new developments in metabolomics and its prospects for immunology research. This review provides guidance to researchers using metabolomics and multiomics in immunity research, thus facilitating the application of systems immunology to disease research.
Collapse
Affiliation(s)
- Jianbo Fu
- Centre for Individualised Infection Medicine (CiiM), a joint venture between the Helmholtz Centre for Infection Research (HZI) and Hannover Medical School (MHH), Hannover, Germany.,TWINCORE Centre for Experimental and Clinical Infection Research, a joint venture between the Helmholtz Centre for Infection Research (HZI) and the Hannover Medical School (MHH), Hannover, Germany.,College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, China
| | - Feng Zhu
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, China
| | - Cheng-Jian Xu
- Centre for Individualised Infection Medicine (CiiM), a joint venture between the Helmholtz Centre for Infection Research (HZI) and Hannover Medical School (MHH), Hannover, Germany.,TWINCORE Centre for Experimental and Clinical Infection Research, a joint venture between the Helmholtz Centre for Infection Research (HZI) and the Hannover Medical School (MHH), Hannover, Germany.,Department of Internal Medicine and Radboud Center for Infectious Diseases, Radboud University Medical Center, Nijmegen, The Netherlands
| | - Yang Li
- Centre for Individualised Infection Medicine (CiiM), a joint venture between the Helmholtz Centre for Infection Research (HZI) and Hannover Medical School (MHH), Hannover, Germany.,TWINCORE Centre for Experimental and Clinical Infection Research, a joint venture between the Helmholtz Centre for Infection Research (HZI) and the Hannover Medical School (MHH), Hannover, Germany.,Department of Internal Medicine and Radboud Center for Infectious Diseases, Radboud University Medical Center, Nijmegen, The Netherlands
| |
Collapse
|
105
|
Serag A, Salem MA, Gong S, Wu JL, Farag MA. Decoding Metabolic Reprogramming in Plants under Pathogen Attacks, a Comprehensive Review of Emerging Metabolomics Technologies to Maximize Their Applications. Metabolites 2023; 13:424. [PMID: 36984864 PMCID: PMC10055942 DOI: 10.3390/metabo13030424] [Citation(s) in RCA: 8] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2023] [Revised: 03/01/2023] [Accepted: 03/09/2023] [Indexed: 03/15/2023] Open
Abstract
In their environment, plants interact with a multitude of living organisms and have to cope with a large variety of aggressions of biotic or abiotic origin. What has been known for several decades is that the extraordinary variety of chemical compounds the plants are capable of synthesizing may be estimated in the range of hundreds of thousands, but only a fraction has been fully characterized to be implicated in defense responses. Despite the vast importance of these metabolites for plants and also for human health, our knowledge about their biosynthetic pathways and functions is still fragmentary. Recent progress has been made particularly for the phenylpropanoids and oxylipids metabolism, which is more emphasized in this review. With an increasing interest in monitoring plant metabolic reprogramming, the development of advanced analysis methods should now follow. This review capitalizes on the advanced technologies used in metabolome mapping in planta, including different metabolomics approaches, imaging, flux analysis, and interpretation using bioinformatics tools. Advantages and limitations with regards to the application of each technique towards monitoring which metabolite class or type are highlighted, with special emphasis on the necessary future developments to better mirror such intricate metabolic interactions in planta.
Collapse
Affiliation(s)
- Ahmed Serag
- Pharmaceutical Analytical Chemistry Department, Faculty of Pharmacy, Al-Azhar University, Cairo 11751, Egypt
| | - Mohamed A. Salem
- Department of Pharmacognosy and Natural Products, Faculty of Pharmacy, Menoufia University, Gamal Abd El Nasr st., Shibin Elkom 32511, Menoufia, Egypt
| | - Shilin Gong
- State Key Laboratory of Quality Research in Chinese Medicine, Macau Institute for Applied Research in Medicine and Health, Macau University of Science and Technology, Macau 999078, China
| | - Jian-Lin Wu
- State Key Laboratory of Quality Research in Chinese Medicine, Macau Institute for Applied Research in Medicine and Health, Macau University of Science and Technology, Macau 999078, China
| | - Mohamed A. Farag
- Pharmacognosy Department, College of Pharmacy, Cairo University, Kasr el Aini St., Cairo 11562, Egypt
| |
Collapse
|
106
|
Tian X, Permentier HP, Bischoff R. Chemical isotope labeling for quantitative proteomics. MASS SPECTROMETRY REVIEWS 2023; 42:546-576. [PMID: 34091937 PMCID: PMC10078755 DOI: 10.1002/mas.21709] [Citation(s) in RCA: 15] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/29/2021] [Revised: 03/22/2021] [Accepted: 05/17/2021] [Indexed: 05/05/2023]
Abstract
Advancements in liquid chromatography and mass spectrometry over the last decades have led to a significant development in mass spectrometry-based proteome quantification approaches. An increasingly attractive strategy is multiplex isotope labeling, which significantly improves the accuracy, precision and throughput of quantitative proteomics in the data-dependent acquisition mode. Isotope labeling-based approaches can be classified into MS1-based and MS2-based quantification. In this review, we give an overview of approaches based on chemical isotope labeling and discuss their principles, benefits, and limitations with the goal to give insights into fundamental questions and provide a useful reference for choosing a method for quantitative proteomics. As a perspective, we discuss the current possibilities and limitations of multiplex, isotope labeling approaches for the data-independent acquisition mode, which is increasing in popularity.
Collapse
Affiliation(s)
- Xiaobo Tian
- Department of Analytical Biochemistry and Interfaculty Mass Spectrometry Center, Groningen Research Institute of PharmacyUniversity of GroningenGroningenThe Netherlands
| | - Hjalmar P. Permentier
- Department of Analytical Biochemistry and Interfaculty Mass Spectrometry Center, Groningen Research Institute of PharmacyUniversity of GroningenGroningenThe Netherlands
| | - Rainer Bischoff
- Department of Analytical Biochemistry and Interfaculty Mass Spectrometry Center, Groningen Research Institute of PharmacyUniversity of GroningenGroningenThe Netherlands
| |
Collapse
|
107
|
Nakayama K, Li X, Shimizu K, Akamatsu S, Inoue T, Kobayashi T, Ogawa O, Goto T. qShot MALDI analysis: A rapid, simple, convenient, and reliable quantitative phospholipidomics approach using MALDI-TOF/MS. Talanta 2023; 254:124099. [PMID: 36502612 DOI: 10.1016/j.talanta.2022.124099] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2022] [Revised: 11/09/2022] [Accepted: 11/12/2022] [Indexed: 11/29/2022]
Abstract
Matrix-assisted laser desorption/ionization mass spectrometry (MALDI/MS) has potential applications in the qualitative analysis of phospholipids (PLs). However, its capability for quantitative analysis is limited by the unavailability and/or high cost of isotope-labeled internal standards (interSTDs, e.g., 1-oleoyl (d7)-2-hydroxy-sn-glycero-3-phosphocholine, 1-pentadecanoyl-2-oleoyl (d7)-sn-glycero-3-phosphocholine). This study investigated and validated whether only two PL interSTDs could be used to normalize the entire PL species in a complex bio-lipid background (i.e., urinary lipid extracts). The normalized intensities of PL ionization standards (ionSTDs) were found to have better linear regressions (R2 > 0.984 for all PL subcategories) than those of traditional methods, such as total ion current and matrix-peak normalization methods. Furthermore, the intra-day precision of all the analyte concentrations after normalizing using our ionSTD method was superior to those of traditional methods. The inter-day precision of all the negatively charged analytes also differed statistically between our ionSTD and the two traditional methods. Meanwhile, a comparison of the three normalization methods revealed that the precision of all the positive analytes using the ionSTD method was comparable. Consequently, a cost-effective, fast, simple, convenient, and reliable quantitative method, defined as "qShot MALDI analysis," was developed to analyze PLs that could potentially be applied in clinical biomarker screening, especially in a negative mode.
Collapse
Affiliation(s)
- Kenji Nakayama
- Department of Urology, Graduate School of Medicine, Kyoto University, Kyoto, Japan
| | - Xin Li
- Department of Urology, Graduate School of Medicine, Kyoto University, Kyoto, Japan
| | - Koji Shimizu
- Clinical Research Center for Medical Equipment Development, Kyoto University Hospital, Kyoto, Japan
| | - Shusuke Akamatsu
- Department of Urology, Graduate School of Medicine, Kyoto University, Kyoto, Japan
| | - Takahiro Inoue
- Department of Nephro-Urologic Surgery and Andrology, Mie University Graduate School of Medicine, Tsu, Japan
| | - Takashi Kobayashi
- Department of Urology, Graduate School of Medicine, Kyoto University, Kyoto, Japan.
| | - Osamu Ogawa
- Department of Urology, Graduate School of Medicine, Kyoto University, Kyoto, Japan
| | - Takayuki Goto
- Department of Urology, Graduate School of Medicine, Kyoto University, Kyoto, Japan
| |
Collapse
|
108
|
Rosenberger G, Li W, Turunen M, He J, Subramaniam PS, Pampou S, Griffin AT, Karan C, Kerwin P, Murray D, Honig B, Liu Y, Califano A. Network-based elucidation of colon cancer drug resistance by phosphoproteomic time-series analysis. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.02.15.528736. [PMID: 36824919 PMCID: PMC9949144 DOI: 10.1101/2023.02.15.528736] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 02/18/2023]
Abstract
Aberrant signaling pathway activity is a hallmark of tumorigenesis and progression, which has guided targeted inhibitor design for over 30 years. Yet, adaptive resistance mechanisms, induced by rapid, context-specific signaling network rewiring, continue to challenge therapeutic efficacy. By leveraging progress in proteomic technologies and network-based methodologies, over the past decade, we developed VESPA-an algorithm designed to elucidate mechanisms of cell response and adaptation to drug perturbations-and used it to analyze 7-point phosphoproteomic time series from colorectal cancer cells treated with clinically-relevant inhibitors and control media. Interrogation of tumor-specific enzyme/substrate interactions accurately inferred kinase and phosphatase activity, based on their inferred substrate phosphorylation state, effectively accounting for signal cross-talk and sparse phosphoproteome coverage. The analysis elucidated time-dependent signaling pathway response to each drug perturbation and, more importantly, cell adaptive response and rewiring that was experimentally confirmed by CRISPRko assays, suggesting broad applicability to cancer and other diseases.
Collapse
Affiliation(s)
- George Rosenberger
- Department of Systems Biology, Columbia University Irving Medical Center, New York, NY, USA
| | - Wenxue Li
- Yale Cancer Biology Institute, Yale University, West Haven, CT, USA
| | - Mikko Turunen
- Department of Systems Biology, Columbia University Irving Medical Center, New York, NY, USA
| | - Jing He
- Department of Systems Biology, Columbia University Irving Medical Center, New York, NY, USA
- Present address: Regeneron Genetics Center, Tarrytown, NY, USA
| | - Prem S Subramaniam
- Department of Systems Biology, Columbia University Irving Medical Center, New York, NY, USA
| | - Sergey Pampou
- Department of Systems Biology, Columbia University Irving Medical Center, New York, NY, USA
- J.P. Sulzberger Columbia Genome Center, Columbia University Irving Medical Center, New York, NY, USA
| | - Aaron T Griffin
- Department of Systems Biology, Columbia University Irving Medical Center, New York, NY, USA
- Medical Scientist Training Program, Columbia University Irving Medical Center, New York, NY, USA
| | - Charles Karan
- Department of Systems Biology, Columbia University Irving Medical Center, New York, NY, USA
- J.P. Sulzberger Columbia Genome Center, Columbia University Irving Medical Center, New York, NY, USA
| | - Patrick Kerwin
- Department of Systems Biology, Columbia University Irving Medical Center, New York, NY, USA
| | - Diana Murray
- Department of Systems Biology, Columbia University Irving Medical Center, New York, NY, USA
| | - Barry Honig
- Department of Systems Biology, Columbia University Irving Medical Center, New York, NY, USA
| | - Yansheng Liu
- Yale Cancer Biology Institute, Yale University, West Haven, CT, USA
- Department of Pharmacology, Yale University School of Medicine, New Haven, CT, USA
| | - Andrea Califano
- Department of Systems Biology, Columbia University Irving Medical Center, New York, NY, USA
- J.P. Sulzberger Columbia Genome Center, Columbia University Irving Medical Center, New York, NY, USA
- Department of Medicine, Columbia University Irving Medical Center, New York, NY, USA
- Herbert Irving Comprehensive Cancer Center, Columbia University Irving Medical Center, New York, NY, USA
- Department of Biochemistry & Molecular Biophysics, Columbia University Irving Medical Center, New York, NY, USA
- Department of Biomedical Informatics, Columbia University Irving Medical Center, New York, NY, USA
| |
Collapse
|
109
|
Yang Y, Qiao L. Profiling Serum Intact N-Glycopeptides Using Data-Independent Acquisition Mass Spectrometry. Methods Mol Biol 2023; 2628:365-391. [PMID: 36781798 DOI: 10.1007/978-1-0716-2978-9_24] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/15/2023]
Abstract
Large-scale profiling of intact glycopeptides is critical but challenging in glycoproteomics. Data-independent acquisition (DIA) mass spectrometry is an emerging technology with deep proteome coverage as well as accurate quantitative capability for large-scale proteomics studies and has also been applied to the field of glycoproteomics. In this protocol, we describe how to analyze data from a DIA experiment for profiling serum intact N-glycopeptides. We present a comprehensive data analysis workflow using GproDIA, including glycopeptide spectral library building, chromatographic feature extraction from the DIA data, and feature scoring with appropriate statistical control of error rates. We anticipate that this method could provide a powerful tool to explore the serum glycoproteome.
Collapse
Affiliation(s)
- Yi Yang
- Department of Chemistry and Shanghai Stomatological Hospital, Fudan University, Shanghai, China.,ZJU-Hangzhou Global Scientific and Technological Innovation Center, Zhejiang University, Hangzhou, China
| | - Liang Qiao
- Department of Chemistry and Shanghai Stomatological Hospital, Fudan University, Shanghai, China.
| |
Collapse
|
110
|
Gabrielli N, Maga-Nteve C, Kafkia E, Rettel M, Loeffler J, Kamrad S, Typas A, Patil KR. Unravelling metabolic cross-feeding in a yeast-bacteria community using 13 C-based proteomics. Mol Syst Biol 2023; 19:e11501. [PMID: 36779294 PMCID: PMC10090948 DOI: 10.15252/msb.202211501] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2022] [Revised: 01/24/2023] [Accepted: 01/25/2023] [Indexed: 02/14/2023] Open
Abstract
Cross-feeding is fundamental to the diversity and function of microbial communities. However, identification of cross-fed metabolites is often challenging due to the universality of metabolic and biosynthetic intermediates. Here, we use 13 C isotope tracing in peptides to elucidate cross-fed metabolites in co-cultures of Saccharomyces cerevisiae and Lactococcus lactis. The community was grown on lactose as the main carbon source with either glucose or galactose fraction of the molecule labelled with 13 C. Data analysis allowing for the possible mass-shifts yielded hundreds of peptides for which we could assign both species identity and labelling degree. The labelling pattern showed that the yeast utilized galactose and, to a lesser extent, lactic acid shared by L. lactis as carbon sources. While the yeast provided essential amino acids to the bacterium as expected, the data also uncovered a complex pattern of amino acid exchange. The identity of the cross-fed metabolites was further supported by metabolite labelling in the co-culture supernatant, and by diminished fitness of a galactose-negative yeast mutant in the community. Together, our results demonstrate the utility of 13 C-based proteomics for uncovering microbial interactions.
Collapse
Affiliation(s)
| | | | - Eleni Kafkia
- European Molecular Biology Laboratory, Heidelberg, Germany.,Medical Research Council Toxicology Unit, University of Cambridge, Cambridge, UK
| | - Mandy Rettel
- European Molecular Biology Laboratory, Heidelberg, Germany
| | - Jakob Loeffler
- European Molecular Biology Laboratory, Heidelberg, Germany
| | - Stephan Kamrad
- Medical Research Council Toxicology Unit, University of Cambridge, Cambridge, UK
| | | | - Kiran Raosaheb Patil
- European Molecular Biology Laboratory, Heidelberg, Germany.,Medical Research Council Toxicology Unit, University of Cambridge, Cambridge, UK
| |
Collapse
|
111
|
Kleiner M, Kouris A, Violette M, D'Angelo G, Liu Y, Korenek A, Tolić N, Sachsenberg T, McCalder J, Lipton MS, Strous M. Ultra-sensitive isotope probing to quantify activity and substrate assimilation in microbiomes. MICROBIOME 2023; 11:24. [PMID: 36755313 PMCID: PMC9909930 DOI: 10.1186/s40168-022-01454-1] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/01/2022] [Accepted: 12/20/2022] [Indexed: 06/18/2023]
Abstract
BACKGROUND Stable isotope probing (SIP) approaches are a critical tool in microbiome research to determine associations between species and substrates, as well as the activity of species. The application of these approaches ranges from studying microbial communities important for global biogeochemical cycling to host-microbiota interactions in the intestinal tract. Current SIP approaches, such as DNA-SIP or nanoSIMS allow to analyze incorporation of stable isotopes with high coverage of taxa in a community and at the single cell level, respectively, however they are limited in terms of sensitivity, resolution or throughput. RESULTS Here, we present an ultra-sensitive, high-throughput protein-based stable isotope probing approach (Protein-SIP), which cuts cost for labeled substrates by 50-99% as compared to other SIP and Protein-SIP approaches and thus enables isotope labeling experiments on much larger scales and with higher replication. The approach allows for the determination of isotope incorporation into microbiome members with species level resolution using standard metaproteomics liquid chromatography-tandem mass spectrometry (LC-MS/MS) measurements. At the core of the approach are new algorithms to analyze the data, which have been implemented in an open-source software ( https://sourceforge.net/projects/calis-p/ ). We demonstrate sensitivity, precision and accuracy using bacterial cultures and mock communities with different labeling schemes. Furthermore, we benchmark our approach against two existing Protein-SIP approaches and show that in the low labeling range used our approach is the most sensitive and accurate. Finally, we measure translational activity using 18O heavy water labeling in a 63-species community derived from human fecal samples grown on media simulating two different diets. Activity could be quantified on average for 27 species per sample, with 9 species showing significantly higher activity on a high protein diet, as compared to a high fiber diet. Surprisingly, among the species with increased activity on high protein were several Bacteroides species known as fiber consumers. Apparently, protein supply is a critical consideration when assessing growth of intestinal microbes on fiber, including fiber-based prebiotics. CONCLUSIONS We demonstrate that our Protein-SIP approach allows for the ultra-sensitive (0.01 to 10% label) detection of stable isotopes of elements found in proteins, using standard metaproteomics data.
Collapse
Affiliation(s)
- Manuel Kleiner
- Department of Plant and Microbial Biology, North Carolina State University, Raleigh, NC, USA.
| | - Angela Kouris
- Department of Geoscience, University of Calgary, Calgary, AB, Canada
| | - Marlene Violette
- Department of Plant and Microbial Biology, North Carolina State University, Raleigh, NC, USA
| | - Grace D'Angelo
- Department of Plant and Microbial Biology, North Carolina State University, Raleigh, NC, USA
- Max Planck Institute for Marine Microbiology, Bremen, Germany
| | - Yihua Liu
- Department of Geoscience, University of Calgary, Calgary, AB, Canada
- Max Planck Institute for Biology, Tübingen, Germany
| | - Abigail Korenek
- Department of Plant and Microbial Biology, North Carolina State University, Raleigh, NC, USA
| | - Nikola Tolić
- Environmental Molecular Science Laboratory, Pacific Northwest National Laboratory, Richland, WA, USA
| | - Timo Sachsenberg
- Institute for Bioinformatics and Medical Informatics, University of Tübingen, Tübingen, Germany
| | - Janine McCalder
- Department of Geoscience, University of Calgary, Calgary, AB, Canada
| | - Mary S Lipton
- Environmental Molecular Science Laboratory, Pacific Northwest National Laboratory, Richland, WA, USA
| | - Marc Strous
- Department of Geoscience, University of Calgary, Calgary, AB, Canada.
| |
Collapse
|
112
|
Du X, Dastmalchi F, Ye H, Garrett TJ, Diller MA, Liu M, Hogan WR, Brochhausen M, Lemas DJ. Evaluating LC-HRMS metabolomics data processing software using FAIR principles for research software. Metabolomics 2023; 19:11. [PMID: 36745241 DOI: 10.1007/s11306-023-01974-3] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/08/2022] [Accepted: 01/20/2023] [Indexed: 02/07/2023]
Abstract
BACKGROUND Liquid chromatography-high resolution mass spectrometry (LC-HRMS) is a popular approach for metabolomics data acquisition and requires many data processing software tools. The FAIR Principles - Findability, Accessibility, Interoperability, and Reusability - were proposed to promote open science and reusable data management, and to maximize the benefit obtained from contemporary and formal scholarly digital publishing. More recently, the FAIR principles were extended to include Research Software (FAIR4RS). AIM OF REVIEW This study facilitates open science in metabolomics by providing an implementation solution for adopting FAIR4RS in the LC-HRMS metabolomics data processing software. We believe our evaluation guidelines and results can help improve the FAIRness of research software. KEY SCIENTIFIC CONCEPTS OF REVIEW We evaluated 124 LC-HRMS metabolomics data processing software obtained from a systematic review and selected 61 software for detailed evaluation using FAIR4RS-related criteria, which were extracted from the literature along with internal discussions. We assigned each criterion one or more FAIR4RS categories through discussion. The minimum, median, and maximum percentages of criteria fulfillment of software were 21.6%, 47.7%, and 71.8%. Statistical analysis revealed no significant improvement in FAIRness over time. We identified four criteria covering multiple FAIR4RS categories but had a low %fulfillment: (1) No software had semantic annotation of key information; (2) only 6.3% of evaluated software were registered to Zenodo and received DOIs; (3) only 14.5% of selected software had official software containerization or virtual machine; (4) only 16.7% of evaluated software had a fully documented functions in code. According to the results, we discussed improvement strategies and future directions.
Collapse
Affiliation(s)
- Xinsong Du
- Department of Health Outcomes and Biomedical Informatics, University of Florida College of Medicine, Gainesville, FL, USA
| | - Farhad Dastmalchi
- Department of Health Outcomes and Biomedical Informatics, University of Florida College of Medicine, Gainesville, FL, USA
| | - Hao Ye
- Health Science Center Libraries, University of Florida, Florida, USA
| | - Timothy J Garrett
- Department of Pathology, Immunology and Laboratory Medicine, College of Medicine, University of Florida, Florida, USA
| | - Matthew A Diller
- Department of Health Outcomes and Biomedical Informatics, University of Florida College of Medicine, Gainesville, FL, USA
| | - Mei Liu
- Department of Health Outcomes and Biomedical Informatics, University of Florida College of Medicine, Gainesville, FL, USA
| | - William R Hogan
- Department of Health Outcomes and Biomedical Informatics, University of Florida College of Medicine, Gainesville, FL, USA
| | - Mathias Brochhausen
- Department of Biomedical Informatics, College of Medicine, University of Arkansas for Medical Sciences, Little Rock, USA
| | - Dominick J Lemas
- Department of Health Outcomes and Biomedical Informatics, University of Florida College of Medicine, Gainesville, FL, USA.
- Department of Obstetrics and Gynecology, University of Florida College of Medicine, Florida, Gainesville, United States.
- Center for Perinatal Outcomes Research, University of Florida College of Medicine, Gainesville, United States.
| |
Collapse
|
113
|
Gabriels R, Declercq A, Bouwmeester R, Degroeve S, Martens L. psm_utils: A High-Level Python API for Parsing and Handling Peptide-Spectrum Matches and Proteomics Search Results. J Proteome Res 2023; 22:557-560. [PMID: 36508242 DOI: 10.1021/acs.jproteome.2c00609] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
A plethora of proteomics search engine output file formats are in circulation. This lack of standardized output files greatly complicates generic downstream processing of peptide-spectrum matches (PSMs) and PSM files. While standards exist to solve this problem, these are far from universally supported by search engines. Moreover, software libraries are available to read a selection of PSM file formats, but a package to parse PSM files into a unified data structure has been missing. Here, we present psm_utils, a Python package to read and write various PSM file formats and to handle peptidoforms, PSMs, and PSM lists in a unified and user-friendly Python-, command line-, and web-interface. psm_utils was developed with pragmatism and maintainability in mind, adhering to community standards and relying on existing packages where possible. The Python API and command line interface greatly facilitate handling various PSM file formats. Moreover, a user-friendly web application was built using psm_utils that allows anyone to interconvert PSM files and retrieve basic PSM statistics. psm_utils is freely available under the permissive Apache2 license at https://github.com/compomics/psm_utils.
Collapse
Affiliation(s)
- Ralf Gabriels
- VIB-UGent Center for Medical Biotechnology, VIB, 9052 Ghent, Belgium.,Department of Biomolecular Medicine, Ghent University, 9000 Ghent, Belgium
| | - Arthur Declercq
- VIB-UGent Center for Medical Biotechnology, VIB, 9052 Ghent, Belgium.,Department of Biomolecular Medicine, Ghent University, 9000 Ghent, Belgium
| | - Robbin Bouwmeester
- VIB-UGent Center for Medical Biotechnology, VIB, 9052 Ghent, Belgium.,Department of Biomolecular Medicine, Ghent University, 9000 Ghent, Belgium
| | - Sven Degroeve
- VIB-UGent Center for Medical Biotechnology, VIB, 9052 Ghent, Belgium.,Department of Biomolecular Medicine, Ghent University, 9000 Ghent, Belgium
| | - Lennart Martens
- VIB-UGent Center for Medical Biotechnology, VIB, 9052 Ghent, Belgium.,Department of Biomolecular Medicine, Ghent University, 9000 Ghent, Belgium
| |
Collapse
|
114
|
Lin A, Deatherage Kaiser BL, Hutchison JR, Bilmes JA, Noble WS. MS1Connect: a mass spectrometry run similarity measure. Bioinformatics 2023; 39:7005198. [PMID: 36702456 PMCID: PMC9913042 DOI: 10.1093/bioinformatics/btad058] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2022] [Revised: 01/05/2023] [Accepted: 01/24/2023] [Indexed: 01/28/2023] Open
Abstract
MOTIVATION Interpretation of newly acquired mass spectrometry data can be improved by identifying, from an online repository, previous mass spectrometry runs that resemble the new data. However, this retrieval task requires computing the similarity between an arbitrary pair of mass spectrometry runs. This is particularly challenging for runs acquired using different experimental protocols. RESULTS We propose a method, MS1Connect, that calculates the similarity between a pair of runs by examining only the intact peptide (MS1) scans, and we show evidence that the MS1Connect score is accurate. Specifically, we show that MS1Connect outperforms several baseline methods on the task of predicting the species from which a given proteomics sample originated. In addition, we show that MS1Connect scores are highly correlated with similarities computed from fragment (MS2) scans, even though these data are not used by MS1Connect. AVAILABILITY AND IMPLEMENTATION The MS1Connect software is available at https://github.com/bmx8177/MS1Connect. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Andy Lin
- Chemical and Biological Signatures, Pacific Northwest National Laboratory, Richland, WA 99354, USA
| | | | - Janine R Hutchison
- Chemical and Biological Signatures, Pacific Northwest National Laboratory, Richland, WA 99354, USA
| | - Jeffrey A Bilmes
- Department of Electrical and Computer Engineering, University of Washington, Seattle, WA 98195, USA
| | - William Stafford Noble
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA.,Paul G. Allen School of Computer Science and Engineering, University of Washington, Seattle, WA 98195, USA
| |
Collapse
|
115
|
Deutsch EW, Vizcaíno JA, Jones AR, Binz PA, Lam H, Klein J, Bittremieux W, Perez-Riverol Y, Tabb DL, Walzer M, Ricard-Blum S, Hermjakob H, Neumann S, Mak TD, Kawano S, Mendoza L, Van Den Bossche T, Gabriels R, Bandeira N, Carver J, Pullman B, Sun Z, Hoffmann N, Shofstahl J, Zhu Y, Licata L, Quaglia F, Tosatto SCE, Orchard SE. Proteomics Standards Initiative at Twenty Years: Current Activities and Future Work. J Proteome Res 2023; 22:287-301. [PMID: 36626722 PMCID: PMC9903322 DOI: 10.1021/acs.jproteome.2c00637] [Citation(s) in RCA: 19] [Impact Index Per Article: 19.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2022] [Indexed: 01/11/2023]
Abstract
The Human Proteome Organization (HUPO) Proteomics Standards Initiative (PSI) has been successfully developing guidelines, data formats, and controlled vocabularies (CVs) for the proteomics community and other fields supported by mass spectrometry since its inception 20 years ago. Here we describe the general operation of the PSI, including its leadership, working groups, yearly workshops, and the document process by which proposals are thoroughly and publicly reviewed in order to be ratified as PSI standards. We briefly describe the current state of the many existing PSI standards, some of which remain the same as when originally developed, some of which have undergone subsequent revisions, and some of which have become obsolete. Then the set of proposals currently being developed are described, with an open call to the community for participation in the forging of the next generation of standards. Finally, we describe some synergies and collaborations with other organizations and look to the future in how the PSI will continue to promote the open sharing of data and thus accelerate the progress of the field of proteomics.
Collapse
Affiliation(s)
- Eric W. Deutsch
- Institute
for Systems Biology, Seattle, Washington 98109, United States
| | - Juan Antonio Vizcaíno
- European
Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom
| | - Andrew R. Jones
- Institute
of Systems, Molecular and Integrative Biology, University of Liverpool, Liverpool L69 7ZB, United Kingdom
| | - Pierre-Alain Binz
- Clinical
Chemistry Service, Lausanne University Hospital, 1011 976 Lausanne, Switzerland
| | - Henry Lam
- Department
of Chemical and Biological Engineering, The Hong Kong University of Science and Technology, Clear Water Bay, Hong Kong 999077, P. R. China.
| | - Joshua Klein
- Program for
Bioinformatics, Boston University, Boston, Massachusetts 02215, United States
| | - Wout Bittremieux
- Skaggs
School
of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, California 92093, United States
- Department
of Computer Science, University of Antwerp, 2020 Antwerpen, Belgium
| | - Yasset Perez-Riverol
- European
Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom
| | - David L. Tabb
- SA MRC
Centre for TB Research, DST/NRF Centre of Excellence for Biomedical
TB Research, Division of Molecular Biology and Human Genetics, Faculty
of Medicine and Health Sciences, Stellenbosch
University, Cape Town 7602, South Africa
| | - Mathias Walzer
- European
Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom
| | - Sylvie Ricard-Blum
- Univ.
Lyon, Université Lyon 1, ICBMS, UMR 5246, 69622 Villeurbanne, France
| | - Henning Hermjakob
- European
Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom
| | - Steffen Neumann
- Bioinformatics
and Scientific Data, Leibniz Institute of
Plant Biochemistry, 06120 Halle, Germany
- German
Centre for Integrative Biodiversity Research (iDiv), 04103 Halle-Jena-Leipzig, Germany
| | - Tytus D. Mak
- Mass Spectrometry
Data Center, National Institute of Standards
and Technology, 100 Bureau Drive, Gaithersburg, Maryland 20899, United
States
| | - Shin Kawano
- Database
Center for Life Science, Joint Support Center for Data Science Research, Research Organization of Information and Systems, Chiba 277-0871, Japan
- Faculty
of Contemporary Society, Toyama University
of International Studies, Toyama 930-1292, Japan
- School
of Frontier Engineering, Kitasato University, Sagamihara 252-0373, Japan
| | - Luis Mendoza
- Institute
for Systems Biology, Seattle, Washington 98109, United States
| | - Tim Van Den Bossche
- VIB-UGent
Center for Medical Biotechnology, VIB, 9052 Ghent, Belgium
- Department
of Biomolecular Medicine, Faculty of Medicine and Health Sciences, Ghent University, 9052 Ghent, Belgium
| | - Ralf Gabriels
- VIB-UGent
Center for Medical Biotechnology, VIB, 9052 Ghent, Belgium
- Department
of Biomolecular Medicine, Faculty of Medicine and Health Sciences, Ghent University, 9052 Ghent, Belgium
| | - Nuno Bandeira
- Skaggs
School
of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, California 92093, United States
- Center
for Computational Mass Spectrometry, Department of Computer Science
and Engineering, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, San Diego 92093-0404, United States
| | - Jeremy Carver
- Center
for Computational Mass Spectrometry, Department of Computer Science
and Engineering, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, San Diego 92093-0404, United States
| | - Benjamin Pullman
- Center
for Computational Mass Spectrometry, Department of Computer Science
and Engineering, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, San Diego 92093-0404, United States
| | - Zhi Sun
- Institute
for Systems Biology, Seattle, Washington 98109, United States
| | - Nils Hoffmann
- Institute
for Bio- and Geosciences (IBG-5), Forschungszentrum
Jülich GmbH, 52428 Jülich, Germany
| | - Jim Shofstahl
- Thermo
Fisher Scientific, 355 River Oaks Parkway, San Jose, California 95134, United States
| | - Yunping Zhu
- National
Center for Protein Sciences (Beijing), Beijing
Institute of Lifeomics, #38, Life Science Park, Changping District, Beijing 102206, China
| | - Luana Licata
- Fondazione
Human Technopole, 20157 Milan, Italy
- Department
of Biology, University of Rome Tor Vergata, 00133 Rome, Italy
| | - Federica Quaglia
- Institute
of Biomembranes, Bioenergetics and Molecular Biotechnologies, National Research Council (CNR-IBIOM), 70126 Bari, Italy
- Department
of Biomedical Sciences, University of Padova, 35131 Padova, Italy
| | | | - Sandra E. Orchard
- European
Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom
| |
Collapse
|
116
|
Kohler D, Kaza M, Pasi C, Huang T, Staniak M, Mohandas D, Sabido E, Choi M, Vitek O. MSstatsShiny: A GUI for Versatile, Scalable, and Reproducible Statistical Analyses of Quantitative Proteomic Experiments. J Proteome Res 2023; 22:551-556. [PMID: 36622173 DOI: 10.1021/acs.jproteome.2c00603] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023]
Abstract
Liquid chromatography coupled with bottom-up mass spectrometry (LC-MS/MS)-based proteomics is a versatile technology for identifying and quantifying proteins in complex biological mixtures. Postidentification, analysis of changes in protein abundances between conditions requires increasingly complex and specialized statistical methods. Many of these methods, in particular the family of open-source Bioconductor packages MSstats, are implemented in a coding language such as R. To make the methods in MSstats accessible to users with limited programming and statistical background, we have created MSstatsShiny, an R-Shiny graphical user interface (GUI) integrated with MSstats, MSstatsTMT, and MSstatsPTM. The GUI provides a point and click analysis pipeline applicable to a wide variety of proteomics experimental types, including label-free data-dependent acquisitions (DDAs) or data-independent acquisitions (DIAs), or tandem mass tag (TMT)-based TMT-DDAs, answering questions such as relative changes in the abundance of peptides, proteins, or post-translational modifications (PTMs). To support reproducible research, the application saves user's selections and builds an R script that programmatically recreates the analysis. MSstatsShiny can be installed locally via Github and Bioconductor, or utilized on the cloud at www.msstatsshiny.com. We illustrate the utility of the platform using two experimental data sets (MassIVE IDs MSV000086623 and MSV000085565).
Collapse
Affiliation(s)
- Devon Kohler
- Khoury College of Computer Science, Northeastern University, Boston, Massachusetts 02115, United States
| | - Maanasa Kaza
- Khoury College of Computer Science, Northeastern University, Boston, Massachusetts 02115, United States
| | - Cristina Pasi
- Universitat Oberta de Catalunya, Barcelona 08018, Spain
| | - Ting Huang
- Khoury College of Computer Science, Northeastern University, Boston, Massachusetts 02115, United States
| | | | | | - Eduard Sabido
- Center for Genomic Regulation, Barcelona Institute of Science and Technology, Barcelona 08003, Spain.,Universitat Pompeu Fabra, Barcelona 08002, Spain
| | - Meena Choi
- MPL, Genentech, South San Francisco, California 94080, United States
| | - Olga Vitek
- Khoury College of Computer Science, Northeastern University, Boston, Massachusetts 02115, United States
| |
Collapse
|
117
|
Rusilowicz M, Newman DW, Creamer DR, Johnson J, Adair K, Harman VM, Grant CM, Beynon RJ, Hubbard SJ. AlacatDesigner─Computational Design of Peptide Concatamers for Protein Quantitation. J Proteome Res 2023; 22:594-604. [PMID: 36688735 PMCID: PMC9903321 DOI: 10.1021/acs.jproteome.2c00608] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/24/2023]
Abstract
Protein quantitation via mass spectrometry relies on peptide proxies for the parent protein from which abundances are estimated. Owing to the variability in signal from individual peptides, accurate absolute quantitation usually relies on the addition of an external standard. Typically, this involves stable isotope-labeled peptides, delivered singly or as a concatenated recombinant protein. Consequently, the selection of the most appropriate surrogate peptides and the attendant design in recombinant proteins termed QconCATs are challenges for proteome science. QconCATs can now be built in a "a-la-carte" assembly method using synthetic biology: ALACATs. To assist their design, we present "AlacatDesigner", a tool that supports the peptide selection for recombinant protein standards based on the user's target protein. The user-customizable tool considers existing databases, occurrence in the literature, potential post-translational modifications, predicted miscleavage, predicted divergence of the peptide and protein quantifications, and ionization potential within the mass spectrometer. We show that peptide selections are enriched for good proteotypic and quantotypic candidates compared to empirical data. The software is freely available to use either via a web interface AlacatDesigner, downloaded as a Desktop application or imported as a Python package for the command line interface or in scripts.
Collapse
Affiliation(s)
- Martin Rusilowicz
- Division
of Evolution, Infection and Genomics, School of Biological Sciences,
Faculty of Biology, Medicine and Health, Manchester Academic Health
Science Centre, University of Manchester, Manchester M13 9PT, United Kingdom
| | - David W. Newman
- Division
of Evolution, Infection and Genomics, School of Biological Sciences,
Faculty of Biology, Medicine and Health, Manchester Academic Health
Science Centre, University of Manchester, Manchester M13 9PT, United Kingdom
| | - Declan R. Creamer
- Division
of Molecular and Cellular Function, School of Biological Sciences,
Faculty of Biology, Medicine and Health, Manchester Academic Health
Science Centre, University of Manchester, Manchester M13 9PT, United Kingdom
| | - James Johnson
- GeneMill,
Institute of Systems Molecular and Integrative Biology, University of Liverpool, Crown Street, Liverpool L69 7ZB, United
Kingdom
| | - Kareena Adair
- Centre
for Proteome Research, Institute of Systems and Integrative Biology, University of Liverpool, Crown Street, Liverpool L69 7ZB, United
Kingdom
| | - Victoria M. Harman
- Centre
for Proteome Research, Institute of Systems and Integrative Biology, University of Liverpool, Crown Street, Liverpool L69 7ZB, United
Kingdom
| | - Chris M. Grant
- Division
of Molecular and Cellular Function, School of Biological Sciences,
Faculty of Biology, Medicine and Health, Manchester Academic Health
Science Centre, University of Manchester, Manchester M13 9PT, United Kingdom
| | - Robert J. Beynon
- Centre
for Proteome Research, Institute of Systems and Integrative Biology, University of Liverpool, Crown Street, Liverpool L69 7ZB, United
Kingdom
| | - Simon J. Hubbard
- Division
of Evolution, Infection and Genomics, School of Biological Sciences,
Faculty of Biology, Medicine and Health, Manchester Academic Health
Science Centre, University of Manchester, Manchester M13 9PT, United Kingdom,
| |
Collapse
|
118
|
Letourneau DR, August DD, Volmer DA. New algorithms demonstrate untargeted detection of chemically meaningful changing units and formula assignment for HRMS data of polymeric mixtures in the open-source constellation web application. J Cheminform 2023; 15:7. [PMID: 36653829 PMCID: PMC9850690 DOI: 10.1186/s13321-023-00680-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2022] [Accepted: 01/06/2023] [Indexed: 01/19/2023] Open
Abstract
The field of high-resolution mass spectrometry (HRMS) and ancillary hyphenated techniques comprise a rapidly expanding and evolving area. As popularity of HRMS instruments grows, there is a concurrent need for tools and solutions to simplify and automate the processing of the large and complex datasets that result from these analyses. Constellation is one such of these tools, developed by our group over the last two years to perform unsupervised trend detection for repeating, polymeric units in HRMS data of complex mixtures such as natural organic matter, oil, or lignin. In this work, we develop two new unsupervised algorithms for finding chemically-meaningful changing units in HRMS data, and incorporate a molecular-formula-finding algorithm from the open-source CoreMS software package, both demonstrated here in the Constellation software environment. These algorithms are evaluated on a collection of open-source HRMS datasets containing polymeric analytes (PEG 400 and NIST standard reference material 1950, both metabolites in human plasma, as well as a swab extract containing polymers), and are able to successfully identify all known changing units in the data, including assigning the correct formulas. Through these new developments, we are excited to add to a growing body of open-source software specialized in extracting useful information from complex datasets without the high costs, technical knowledge, and processor-demand typically associated with such tools.
Collapse
Affiliation(s)
- Dane R. Letourneau
- grid.7468.d0000 0001 2248 7639Department of Chemistry, Humboldt University Berlin, 12489 Berlin, Germany
| | - Dennis D. August
- grid.7468.d0000 0001 2248 7639Department of Chemistry, Humboldt University Berlin, 12489 Berlin, Germany
| | - Dietrich A. Volmer
- grid.7468.d0000 0001 2248 7639Department of Chemistry, Humboldt University Berlin, 12489 Berlin, Germany
| |
Collapse
|
119
|
Carrillo-Rodriguez P, Selheim F, Hernandez-Valladares M. Mass Spectrometry-Based Proteomics Workflows in Cancer Research: The Relevance of Choosing the Right Steps. Cancers (Basel) 2023; 15:555. [PMID: 36672506 PMCID: PMC9856946 DOI: 10.3390/cancers15020555] [Citation(s) in RCA: 9] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2023] [Accepted: 01/12/2023] [Indexed: 01/19/2023] Open
Abstract
The qualitative and quantitative evaluation of proteome changes that condition cancer development can be achieved with liquid chromatography-mass spectrometry (LC-MS). LC-MS-based proteomics strategies are carried out according to predesigned workflows that comprise several steps such as sample selection, sample processing including labeling, MS acquisition methods, statistical treatment, and bioinformatics to understand the biological meaning of the findings and set predictive classifiers. As the choice of best options might not be straightforward, we herein review and assess past and current proteomics approaches for the discovery of new cancer biomarkers. Moreover, we review major bioinformatics tools for interpreting and visualizing proteomics results and suggest the most popular machine learning techniques for the selection of predictive biomarkers. Finally, we consider the approximation of proteomics strategies for clinical diagnosis and prognosis by discussing current barriers and proposals to circumvent them.
Collapse
Affiliation(s)
- Paula Carrillo-Rodriguez
- Proteomics Unit of University of Bergen (PROBE), University of Bergen, Jonas Lies vei 91, 5009 Bergen, Norway
- Vall d’Hebron Institute of Oncology (VHIO), 08035 Barcelona, Spain
| | - Frode Selheim
- Proteomics Unit of University of Bergen (PROBE), University of Bergen, Jonas Lies vei 91, 5009 Bergen, Norway
| | - Maria Hernandez-Valladares
- Proteomics Unit of University of Bergen (PROBE), University of Bergen, Jonas Lies vei 91, 5009 Bergen, Norway
- Department of Physical Chemistry, University of Granada, Avenida de la Fuente Nueva S/N, 18071 Granada, Spain
- Instituto de Investigación Biosanitaria ibs.GRANADA, 18012 Granada, Spain
| |
Collapse
|
120
|
Babačić H, Galardi S, Umer HM, Hellström M, Uhrbom L, Maturi N, Cardinali D, Pellegatta S, Michienzi A, Trevisi G, Mangiola A, Lehtiö J, Ciafrè SA, Pernemalm M. Glioblastoma stem cells express non-canonical proteins and exclusive mesenchymal-like or non-mesenchymal-like protein signatures. Mol Oncol 2023; 17:238-260. [PMID: 36495079 PMCID: PMC9892829 DOI: 10.1002/1878-0261.13355] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2022] [Accepted: 12/08/2022] [Indexed: 12/14/2022] Open
Abstract
Glioblastoma (GBM) cancer stem cells (GSCs) contribute to GBM's origin, recurrence, and resistance to treatment. However, the understanding of how mRNA expression patterns of GBM subtypes are reflected at global proteome level in GSCs is limited. To characterize protein expression in GSCs, we performed in-depth proteogenomic analysis of patient-derived GSCs by RNA-sequencing and mass-spectrometry. We quantified > 10 000 proteins in two independent GSC panels and propose a GSC-associated proteomic signature characterizing two distinct phenotypic conditions; one defined by proteins upregulated in proneural and classical GSCs (GPC-like), and another by proteins upregulated in mesenchymal GSCs (GM-like). The GM-like protein set in GBM tissue was associated with necrosis, recurrence, and worse overall survival. Through proteogenomics, we discovered 252 non-canonical peptides in the GSCs, i.e., protein sequences that are variant or derive from genome regions previously considered non-protein-coding, including variants of the heterogeneous ribonucleoproteins implicated in RNA splicing. In summary, GSCs express two protein sets that have an inverse association with clinical outcomes in GBM. The discovery of non-canonical protein sequences questions existing gene models and pinpoints new protein targets for research in GBM.
Collapse
Affiliation(s)
- Haris Babačić
- Department of Oncology and PathologyKarolinska Institute, Science for Life LaboratoryStockholmSweden
| | - Silvia Galardi
- Department of Biomedicine and PreventionUniversity of Rome Tor VergataItaly
| | - Husen M. Umer
- Department of Oncology and PathologyKarolinska Institute, Science for Life LaboratoryStockholmSweden
| | - Mats Hellström
- Department of Immunology, Genetics and PathologyUppsala UniversitySweden
| | - Lene Uhrbom
- Department of Immunology, Genetics and PathologyUppsala UniversitySweden
| | | | - Deborah Cardinali
- Department of Biomedicine and PreventionUniversity of Rome Tor VergataItaly
| | - Serena Pellegatta
- Unit of Immunotherapy of Brain Tumors, Department of Molecular Neuro‐Oncology, Foundation IRCCSInstitute for Neurology Carlo BestaMilanItaly
| | | | - Gianluca Trevisi
- Neurosurgical UnitHospital Spirito Santo, Pescara, “G. D'Annunzio” UniversityChietiItaly
| | - Annunziato Mangiola
- Neurosurgical UnitHospital Spirito Santo, Pescara, “G. D'Annunzio” UniversityChietiItaly
| | - Janne Lehtiö
- Department of Oncology and PathologyKarolinska Institute, Science for Life LaboratoryStockholmSweden
| | - Silvia Anna Ciafrè
- Department of Biomedicine and PreventionUniversity of Rome Tor VergataItaly
| | - Maria Pernemalm
- Department of Oncology and PathologyKarolinska Institute, Science for Life LaboratoryStockholmSweden
| |
Collapse
|
121
|
Shen S, Zhan C, Yang C, Fernie AR, Luo J. Metabolomics-centered mining of plant metabolic diversity and function: Past decade and future perspectives. MOLECULAR PLANT 2023; 16:43-63. [PMID: 36114669 DOI: 10.1016/j.molp.2022.09.007] [Citation(s) in RCA: 49] [Impact Index Per Article: 49.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/23/2022] [Revised: 09/06/2022] [Accepted: 09/11/2022] [Indexed: 06/15/2023]
Abstract
Plants are natural experts in organic synthesis, being able to generate large numbers of specific metabolites with widely varying structures that help them adapt to variable survival challenges. Metabolomics is a research discipline that integrates the capabilities of several types of research including analytical chemistry, statistics, and biochemistry. Its ongoing development provides strategies for gaining a systematic understanding of quantitative changes in the levels of metabolites. Metabolomics is usually performed by targeting either a specific cell, a specific tissue, or the entire organism. Considerable advances in science and technology over the last three decades have propelled us into the era of multi-omics, in which metabolomics, despite at an earlier developmental stage than genomics, transcriptomics, and proteomics, offers the distinct advantage of studying the cellular entities that have the greatest influence on end phenotype. Here, we summarize the state of the art of metabolite detection and identification, and illustrate these techniques with four case study applications: (i) comparing metabolite composition within and between species, (ii) assessing spatio-temporal metabolic changes during plant development, (iii) mining characteristic metabolites of plants in different ecological environments and upon exposure to various stresses, and (iv) assessing the performance of metabolomics as a means of functional gene identification , metabolic pathway elucidation, and metabolomics-assisted breeding through analyzing plant populations with diverse genetic variations. In addition, we highlight the prominent contributions of joint analyses of plant metabolomics and other omics datasets, including those from genomics, transcriptomics, proteomics, epigenomics, phenomics, microbiomes, and ion-omics studies. Finally, we discuss future directions and challenges exploiting metabolomics-centered approaches in understanding plant metabolic diversity.
Collapse
Affiliation(s)
- Shuangqian Shen
- Sanya Nanfan Research Institute of Hainan University, Hainan Yazhou Bay Seed Laboratory, Sanya 572025, China; College of Tropical Crops, Hainan University, Haikou 570228, China
| | - Chuansong Zhan
- Sanya Nanfan Research Institute of Hainan University, Hainan Yazhou Bay Seed Laboratory, Sanya 572025, China; College of Tropical Crops, Hainan University, Haikou 570228, China
| | - Chenkun Yang
- Sanya Nanfan Research Institute of Hainan University, Hainan Yazhou Bay Seed Laboratory, Sanya 572025, China; College of Tropical Crops, Hainan University, Haikou 570228, China
| | - Alisdair R Fernie
- Max Planck Institute of Molecular Plant Physiology, Potsdam-Golm 14476, Germany
| | - Jie Luo
- Sanya Nanfan Research Institute of Hainan University, Hainan Yazhou Bay Seed Laboratory, Sanya 572025, China; College of Tropical Crops, Hainan University, Haikou 570228, China.
| |
Collapse
|
122
|
Anziani P, Becker J, Mignon C, Arnaud-Barbe N, Courtois V, Izac M, Pizzato R, Abi-Ghanem J, Tran VD, Sarafian M, Bunescu A, Garnier D, Abachin E, Renauld-Mongénie G, Guyard C. Deep longitudinal multi-omics analysis of Bordetella pertussis cultivated in bioreactors highlights medium starvations and transitory metabolisms, associated to vaccine antigen biosynthesis variations and global virulence regulation. Front Microbiol 2023; 14:1036386. [PMID: 36876086 PMCID: PMC9976334 DOI: 10.3389/fmicb.2023.1036386] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2022] [Accepted: 01/05/2023] [Indexed: 02/16/2023] Open
Abstract
Bordetella pertussis is the bacterial causative agent of whooping cough, a serious respiratory illness. An extensive knowledge on its virulence regulation and metabolism is a key factor to ensure pertussis vaccine manufacturing process robustness. The aim of this study was to refine our comprehension of B. pertussis physiology during in vitro cultures in bioreactors. A longitudinal multi-omics analysis was carried out over 26 h small-scale cultures of B. pertussis. Cultures were performed in batch mode and under culture conditions intending to mimic industrial processes. Putative cysteine and proline starvations were, respectively, observed at the beginning of the exponential phase (from 4 to 8 h) and during the exponential phase (18 h 45 min). As revealed by multi-omics analyses, the proline starvation induced major molecular changes, including a transient metabolism with internal stock consumption. In the meantime, growth and specific total PT, PRN, and Fim2 antigen productions were negatively affected. Interestingly, the master virulence-regulating two-component system of B. pertussis (BvgASR) was not evidenced as the sole virulence regulator in this in vitro growth condition. Indeed, novel intermediate regulators were identified as putatively involved in the expression of some virulence-activated genes (vags). Such longitudinal multi-omics analysis applied to B. pertussis culture process emerges as a powerful tool for characterization and incremental optimization of vaccine antigen production.
Collapse
Affiliation(s)
- Paul Anziani
- Sanofi, Marcy-l'Étoile, France.,BIOASTER, Lyon, France
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
123
|
Lignieres L, Legros V, Khelil M, Senecaut N, Lauber MA, Camadro JM, Chevreux G. Capillary liquid chromatography coupled with mass spectrometry for analysis of nanogram protein quantities on a wide-pore superficially porous particle column in top-down proteomics. J Chromatogr B Analyt Technol Biomed Life Sci 2023; 1214:123566. [PMID: 36516651 DOI: 10.1016/j.jchromb.2022.123566] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2022] [Revised: 12/01/2022] [Accepted: 12/02/2022] [Indexed: 12/13/2022]
Abstract
In top-down proteomics experiments, intact protein ions are subjected to gas-phase fragmentation for MS analysis without prior digestion. This approach is used to characterize post-translational modifications and clipped forms of proteins, avoids several "inference" problems associated with bottom-up proteomics, and is well suited to the study of proteoforms. In the past decade, top-down proteomics has progressed rapidly, taking advantage of MS instrumentation improvements and the efforts of pioneering groups working to improve sample handling and data processing. The potential of this technology has been established through its successful use in a number of important biological studies. However, many challenges remain to be addressed like improving protein separation capabilities such that it might become possible to expand the dynamic range of whole proteome analysis, address co-elution and convoluted mass spectral data, and aid final data processing from peak identification to quantification. In this study, we investigated the use of a wide-pore silica-based superficially porous media with a high coverage phenyl bonding, commercially packed into customized capillary columns for the purpose of top-down proteomics. Protein samples of increasing complexity were tested, namely subunit digests of a monoclonal antibody, components of purified histones and proteins extracted from eukaryotic ribosomes. High quality mass spectra were obtained from only 100 ng of protein sample while using difluoroacetic acid as an ion pairing agent to improve peak shape and chromatographic resolution. A peak width at half height of about 15 s for a 45 min gradient time was observed on a complex mixture giving an estimated peak capacity close to 100. Most importantly, efficient separations were obtained for highly diverse proteins and there was no need to make method specific adjustments, suggesting this is a highly versatile and easy-to-use setup for top-down proteomics.
Collapse
Affiliation(s)
- Laurent Lignieres
- Université Paris Cité, CNRS, Institut Jacques Monod, F-75013 Paris, France
| | - Véronique Legros
- Université Paris Cité, CNRS, Institut Jacques Monod, F-75013 Paris, France
| | - Manel Khelil
- Université Paris Cité, CNRS, Institut Jacques Monod, F-75013 Paris, France
| | - Nicolas Senecaut
- Université Paris Cité, CNRS, Institut Jacques Monod, F-75013 Paris, France
| | - Matthew A Lauber
- Waters Corporation, 34, Maple Street, Milford, MA 01757-3696, United States
| | | | - Guillaume Chevreux
- Université Paris Cité, CNRS, Institut Jacques Monod, F-75013 Paris, France.
| |
Collapse
|
124
|
Ahmad F, Nadeem H. Mass Spectroscopy as an Analytical Tool to Harness the Production of Secondary Plant Metabolites: The Way Forward for Drug Discovery. Methods Mol Biol 2023; 2575:77-103. [PMID: 36301472 DOI: 10.1007/978-1-0716-2716-7_5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/16/2023]
Abstract
The molecular map of diverse biological molecules linked with structure, function, signaling, and regulation within a cell can be elucidated using an analytically demanding omic approach. The latest trend of using "metabolomics" technologies has explained the natural phenomenon of opening a new avenue to understand and enhance bioactive compounds' production. Examination of sequenced plant genomes has revealed that a considerable portion of these encodes genes of secondary metabolism. In addition to genetic and molecular tools developed in the current era, the ever-increasing knowledge about plant metabolism's biochemistry has initiated an approach for wisely designed, more productive genetic engineering of plant secondary metabolism for improved defense systems and enhanced biosynthesis of beneficial metabolites. Secondary plant metabolites are natural products synthesized by plants that are not directly involved with their average growth and development but play a vital role in plant defense mechanisms. Plant secondary metabolites are classified into four major classes: terpenoids, phenolic compounds, alkaloids, and sulfur-containing compounds. More than 200,000 secondary metabolites are synthesized by plants having a unique and complex structure. Secondary plant metabolites are well characterized and quantified by omics approaches and therefore used by humans in different sectors such as agriculture, pharmaceuticals, chemical industries, and biofuel. The aim is to establish metabolomics as a comprehensive and dynamic model of diverse biological molecules for biomarkers and drug discovery. In this chapter, we aim to illustrate the role of metabolomic technology, precisely liquid chromatography-mass spectrometry, capillary electrophoresis mass spectrometry, gas chromatography-mass spectrometry, and nuclear magnetic resonance spectroscopy, specifically as a research tool in the production and identification of novel bioactive compounds for drug discovery and to obtain a unified insight of secondary metabolism in plants.
Collapse
Affiliation(s)
- Faheem Ahmad
- Department of Botany, Aligarh Muslim University, Aligarh, Uttar Pradesh, India.
| | - Hera Nadeem
- Department of Botany, Aligarh Muslim University, Aligarh, Uttar Pradesh, India
| |
Collapse
|
125
|
Anand KP, Suthindhiran K. Microbial signature and biosynthetic gene cluster profiling of poly extremophilic marine actinobacteria isolated from Vhan Island, Tamil Nadu, India. GENE REPORTS 2023. [DOI: 10.1016/j.genrep.2023.101742] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023]
|
126
|
Gao B, Lu W, Jin M, Chen Y. Non-targeted metabolomics of moldy wheat by ultra-performance liquid chromatography - quadrupole time-of-flight mass spectrometry. Front Microbiol 2023; 14:1136516. [PMID: 37089557 PMCID: PMC10119584 DOI: 10.3389/fmicb.2023.1136516] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2023] [Accepted: 02/28/2023] [Indexed: 04/25/2023] Open
Abstract
Introduction As one of the staple foods for the world's major populations, the safety of wheat is critical in ensuring people's wellbeing. However, mildew is one of the prevalent safety issues that threatens the quality of wheat during growth, production, and storage. Due to the complex nature of the microbial metabolites, the rapid identification of moldy wheat is challenging. Methods In this research, identification of moldy wheat samples was studied using ultra-performance liquid chromatography - quadrupole time-of-flight mass spectrometry (UPLC-QTOF-MS) coupled with chemometrics. The non-targeted PCA model for identifying moldy wheat from normal wheat was established by using previously established compounds database of authentic wheat samples. The partial least squares-discriminant analysis (PLS-DA) was performed. Results and discussion By optimizing the model parameters, correct discrimination of the moldy wheat as low as 5% (w/w) adulteration level could be achieved. Differential biomarkers unique to moldy wheat were also extracted to identify between the moldy and authentic wheat samples. The results demonstrated that the chemical information of wheat combined with the existing PCA model could efficiently discriminate between the constructed moldy wheat samples. The study offered an effective method toward screening wheat safety.
Collapse
|
127
|
Iravani S, Conrad TOF. An Interpretable Deep Learning Approach for Biomarker Detection in LC-MS Proteomics Data. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2023; 20:151-161. [PMID: 35007196 DOI: 10.1109/tcbb.2022.3141656] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]
Abstract
Analyzing mass spectrometry-based proteomics data with deep learning (DL) approaches poses several challenges due to the high dimensionality, low sample size, and high level of noise. Additionally, DL-based workflows are often hindered to be integrated into medical settings due to the lack of interpretable explanation. We present DLearnMS, a DL biomarker detection framework, to address these challenges on proteomics instances of liquid chromatography-mass spectrometry (LC-MS) - a well-established tool for quantifying complex protein mixtures. Our DLearnMS framework learns the clinical state of LC-MS data instances using convolutional neural networks. Based on the trained neural networks, we show how biomarkers can be identified using layer-wise relevance propagation. This enables detecting discriminating regions of the data and the design of more robust networks. One of the main advantages over other established methods is that no explicit preprocessing step is needed in our DLearnMS framework. Our evaluation shows that DLearnMS outperforms conventional LC-MS biomarker detection approaches in identifying fewer false positive peaks while maintaining a comparable amount of true positives peaks. Code availability: The code is available from the following GIT repository: https://github.com/SaharIravani/DlearnMS.
Collapse
|
128
|
de Medeiros LS, de Araújo Júnior MB, Peres EG, da Silva JCI, Bassicheto MC, Di Gioia G, Veiga TAM, Koolen HHF. Discovering New Natural Products Using Metabolomics-Based Approaches. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2023; 1439:185-224. [PMID: 37843810 DOI: 10.1007/978-3-031-41741-2_8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/17/2023]
Abstract
The incessant search for new natural molecules with biological activities has forced researchers in the field of chemistry of natural products to seek different approaches for their prospection studies. In particular, researchers around the world are turning to approaches in metabolomics to avoid high rates of re-isolation of certain compounds, something recurrent in this branch of science. Thanks to the development of new technologies in the analytical instrumentation of spectroscopic and spectrometric techniques, as well as the advance in the computational processing modes of the results, metabolomics has been gaining more and more space in studies that involve the prospection of natural products. Thus, this chapter summarizes the precepts and good practices in the metabolomics of microbial natural products using mass spectrometry and nuclear magnetic resonance spectroscopy, and also summarizes several examples where this approach has been applied in the discovery of bioactive molecules.
Collapse
Affiliation(s)
- Lívia Soman de Medeiros
- Grupo de Pesquisas LaBiORG - Laboratório de Química Bio-orgânica Otto Richard Gottlieb, Universidade Federal de São Paulo, Diadema, Brazil.
| | - Moysés B de Araújo Júnior
- Grupo de Pesquisa em Metabolômica e Espectrometria de Massas, Universidade do Estado do Amazonas, Manaus, Brazil
| | - Eldrinei G Peres
- Grupo de Pesquisa em Metabolômica e Espectrometria de Massas, Universidade do Estado do Amazonas, Manaus, Brazil
| | | | - Milena Costa Bassicheto
- Grupo de Pesquisas LaBiORG - Laboratório de Química Bio-orgânica Otto Richard Gottlieb, Universidade Federal de São Paulo, Diadema, Brazil
| | - Giordanno Di Gioia
- Grupo de Pesquisas LaBiORG - Laboratório de Química Bio-orgânica Otto Richard Gottlieb, Universidade Federal de São Paulo, Diadema, Brazil
| | - Thiago André Moura Veiga
- Grupo de Pesquisas LaBiORG - Laboratório de Química Bio-orgânica Otto Richard Gottlieb, Universidade Federal de São Paulo, Diadema, Brazil
| | | |
Collapse
|
129
|
Qin GF, Zhang X, Zhu F, Huo ZQ, Yao QQ, Feng Q, Liu Z, Zhang GM, Yao JC, Liang HB. MS/MS-Based Molecular Networking: An Efficient Approach for Natural Products Dereplication. MOLECULES (BASEL, SWITZERLAND) 2022; 28:molecules28010157. [PMID: 36615351 PMCID: PMC9822519 DOI: 10.3390/molecules28010157] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/08/2022] [Revised: 12/13/2022] [Accepted: 12/20/2022] [Indexed: 12/29/2022]
Abstract
Natural products (NPs) have historically played a primary role in the discovery of small-molecule drugs. However, due to the advent of other methodologies and the drawbacks of NPs, the pharmaceutical industry has largely declined in interest regarding the screening of new drugs from NPs since 2000. There are many technical bottlenecks to quickly obtaining new bioactive NPs on a large scale, which has made NP-based drug discovery very time-consuming, and the first thorny problem faced by researchers is how to dereplicate NPs from crude extracts. Remarkably, with the rapid development of omics, analytical instrumentation, and artificial intelligence technology, in 2012, an efficient approach, known as tandem mass spectrometry (MS/MS)-based molecular networking (MN) analysis, was developed to avoid the rediscovery of known compounds from the complex natural mixtures. Then, in the past decade, based on the classical MN (CLMN), feature-based MN (FBMN), ion identity MN (IIMN), building blocks-based molecular network (BBMN), substructure-based MN (MS2LDA), and bioactivity-based MN (BMN) methods have been presented. In this paper, we review the basic principles, general workflow, and application examples of the methods mentioned above, to further the research and applications of these methods.
Collapse
Affiliation(s)
- Guo-Fei Qin
- State Key Laboratory of Generic Manufacture Technology of Chinese Traditional Medicine, Lunan Pharmaceutical Group Co., Ltd., Linyi 273400, China
- Correspondence: (G.-F.Q.); (J.-C.Y.); (H.-B.L.); Tel.: +86-539-503-0319 (G.-F.Q.)
| | - Xiao Zhang
- College of Pharmacy, Shandong University of Traditional Chinese Medicine, Jinan 250355, China
| | - Feng Zhu
- State Key Laboratory of Generic Manufacture Technology of Chinese Traditional Medicine, Lunan Pharmaceutical Group Co., Ltd., Linyi 273400, China
| | - Zong-Qing Huo
- State Key Laboratory of Generic Manufacture Technology of Chinese Traditional Medicine, Lunan Pharmaceutical Group Co., Ltd., Linyi 273400, China
| | | | - Qun Feng
- State Key Laboratory of Generic Manufacture Technology of Chinese Traditional Medicine, Lunan Pharmaceutical Group Co., Ltd., Linyi 273400, China
| | - Zhong Liu
- State Key Laboratory of Generic Manufacture Technology of Chinese Traditional Medicine, Lunan Pharmaceutical Group Co., Ltd., Linyi 273400, China
| | - Gui-Min Zhang
- State Key Laboratory of Generic Manufacture Technology of Chinese Traditional Medicine, Lunan Pharmaceutical Group Co., Ltd., Linyi 273400, China
- College of Pharmacy, Shandong University of Traditional Chinese Medicine, Jinan 250355, China
| | - Jing-Chun Yao
- State Key Laboratory of Generic Manufacture Technology of Chinese Traditional Medicine, Lunan Pharmaceutical Group Co., Ltd., Linyi 273400, China
- Correspondence: (G.-F.Q.); (J.-C.Y.); (H.-B.L.); Tel.: +86-539-503-0319 (G.-F.Q.)
| | - Hong-Bao Liang
- State Key Laboratory of Generic Manufacture Technology of Chinese Traditional Medicine, Lunan Pharmaceutical Group Co., Ltd., Linyi 273400, China
- College of Pharmacy, Shandong University of Traditional Chinese Medicine, Jinan 250355, China
- Correspondence: (G.-F.Q.); (J.-C.Y.); (H.-B.L.); Tel.: +86-539-503-0319 (G.-F.Q.)
| |
Collapse
|
130
|
He J, Liu O, Guo X. Deep Learning Based MS2 Feature Detection for Data-Independent Shotgun Proteomics. PROCEEDINGS. IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE 2022; 2022:2342-2348. [PMID: 37635836 PMCID: PMC10457098 DOI: 10.1109/bibm55620.2022.9995258] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/29/2023]
Abstract
Accuracy of peptide identification in LC-MS analysis is crucial for information regarding the aspects of proteins that aid in biomarker discovery and the profiling of complex proteomes. The detection of peptide fragment ions in tandem mass spectrometry is still challenging given that current tools were not created or tested for the low-abundance, low-peak fragments of peptides found in MS2 data. Feature detection, a crucial pre-processing step in the LC-MS analysis pipeline that quantifies peptides by their mass-to-charge ratio, retention time, and intensity, is particularly challenging due to the overlapping nature of peptides and weak signals that are often indistinguishable from noises, thus creating a reliance on rigid mathematical structures and heuristics. In this study, we developed a deep-learning-based model with an innovative sliding window process that enables high-resolution processing of quantitative MS/MS data to conduct MS2 feature detection. Experimental results show that our model can produce more accurate values and identifications than existing feature detection tools, as well as a high rate of true positive features quantified. Therefore, we believe that our model illustrates the advantages of deep learning techniques applied towards computational proteomics.
Collapse
Affiliation(s)
- Jonathan He
- Department of Computer Science and Engineering, Univeristy of North Texas, Denton, USA
| | - Olivia Liu
- Department of Computer Science and Engineering, Univeristy of North Texas, Denton, USA
| | - Xuan Guo
- Department of Computer Science and Engineering, Univeristy of North Texas, Denton, USA
| |
Collapse
|
131
|
González-Plaza JJ, Furlan C, Rijavec T, Lapanje A, Barros R, Tamayo-Ramos JA, Suarez-Diez M. Advances in experimental and computational methodologies for the study of microbial-surface interactions at different omics levels. Front Microbiol 2022; 13:1006946. [PMID: 36519168 PMCID: PMC9744117 DOI: 10.3389/fmicb.2022.1006946] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2022] [Accepted: 11/02/2022] [Indexed: 08/31/2023] Open
Abstract
The study of the biological response of microbial cells interacting with natural and synthetic interfaces has acquired a new dimension with the development and constant progress of advanced omics technologies. New methods allow the isolation and analysis of nucleic acids, proteins and metabolites from complex samples, of interest in diverse research areas, such as materials sciences, biomedical sciences, forensic sciences, biotechnology and archeology, among others. The study of the bacterial recognition and response to surface contact or the diagnosis and evolution of ancient pathogens contained in archeological tissues require, in many cases, the availability of specialized methods and tools. The current review describes advances in in vitro and in silico approaches to tackle existing challenges (e.g., low-quality sample, low amount, presence of inhibitors, chelators, etc.) in the isolation of high-quality samples and in the analysis of microbial cells at genomic, transcriptomic, proteomic and metabolomic levels, when present in complex interfaces. From the experimental point of view, tailored manual and automatized methodologies, commercial and in-house developed protocols, are described. The computational level focuses on the discussion of novel tools and approaches designed to solve associated issues, such as sample contamination, low quality reads, low coverage, etc. Finally, approaches to obtain a systems level understanding of these complex interactions by integrating multi omics datasets are presented.
Collapse
Affiliation(s)
- Juan José González-Plaza
- International Research Centre in Critical Raw Materials-ICCRAM, University of Burgos, Burgos, Spain
| | - Cristina Furlan
- Laboratory of Systems and Synthetic Biology, Wageningen University and Research, Wageningen, Netherlands
| | - Tomaž Rijavec
- Department of Environmental Sciences, Jožef Stefan Institute, Ljubljana, Slovenia
| | - Aleš Lapanje
- Department of Environmental Sciences, Jožef Stefan Institute, Ljubljana, Slovenia
| | - Rocío Barros
- International Research Centre in Critical Raw Materials-ICCRAM, University of Burgos, Burgos, Spain
| | | | - Maria Suarez-Diez
- Laboratory of Systems and Synthetic Biology, Wageningen University and Research, Wageningen, Netherlands
| |
Collapse
|
132
|
Shaver AO, Garcia BM, Gouveia GJ, Morse AM, Liu Z, Asef CK, Borges RM, Leach FE, Andersen EC, Amster IJ, Fernández FM, Edison AS, McIntyre LM. An anchored experimental design and meta-analysis approach to address batch effects in large-scale metabolomics. Front Mol Biosci 2022; 9:930204. [PMID: 36438654 PMCID: PMC9682135 DOI: 10.3389/fmolb.2022.930204] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2022] [Accepted: 10/10/2022] [Indexed: 11/27/2022] Open
Abstract
Untargeted metabolomics studies are unbiased but identifying the same feature across studies is complicated by environmental variation, batch effects, and instrument variability. Ideally, several studies that assay the same set of metabolic features would be used to select recurring features to pursue for identification. Here, we developed an anchored experimental design. This generalizable approach enabled us to integrate three genetic studies consisting of 14 test strains of Caenorhabditis elegans prior to the compound identification process. An anchor strain, PD1074, was included in every sample collection, resulting in a large set of biological replicates of a genetically identical strain that anchored each study. This enables us to estimate treatment effects within each batch and apply straightforward meta-analytic approaches to combine treatment effects across batches without the need for estimation of batch effects and complex normalization strategies. We collected 104 test samples for three genetic studies across six batches to produce five analytical datasets from two complementary technologies commonly used in untargeted metabolomics. Here, we use the model system C. elegans to demonstrate that an augmented design combined with experimental blocks and other metabolomic QC approaches can be used to anchor studies and enable comparisons of stable spectral features across time without the need for compound identification. This approach is generalizable to systems where the same genotype can be assayed in multiple environments and provides biologically relevant features for downstream compound identification efforts. All methods are included in the newest release of the publicly available SECIMTools based on the open-source Galaxy platform.
Collapse
Affiliation(s)
- Amanda O. Shaver
- Department of Genetics, University of Georgia, Athens, GA, United States,Complex Carbohydrate Research Center, University of Georgia, Athens, GA, United States
| | - Brianna M. Garcia
- Complex Carbohydrate Research Center, University of Georgia, Athens, GA, United States,Department of Chemistry, University of Georgia, Athens, GA, United States
| | - Goncalo J. Gouveia
- Complex Carbohydrate Research Center, University of Georgia, Athens, GA, United States,Department of Biochemistry, University of Georgia, Athens, GA, United States
| | - Alison M. Morse
- Department of Molecular Genetics and Microbiology, University of Florida, Gainesville, FL, United States
| | - Zihao Liu
- Department of Molecular Genetics and Microbiology, University of Florida, Gainesville, FL, United States
| | - Carter K. Asef
- School of Chemistry and Biochemistry, Georgia Institute of Technology, Atlanta, GA, United States
| | - Ricardo M. Borges
- Walter Mors Institute of Research on Natural Products, Federal University of Rio de Janeiro, Rio de Janeiro, Brazil
| | - Franklin E. Leach
- Complex Carbohydrate Research Center, University of Georgia, Athens, GA, United States,Department of Environmental Health Science, University of Georgia, Athens, GA, United States
| | - Erik C. Andersen
- Department of Molecular Biosciences, Northwestern University, Evanston, IL, United States
| | - I. Jonathan Amster
- Department of Chemistry, University of Georgia, Athens, GA, United States
| | - Facundo M. Fernández
- School of Chemistry and Biochemistry, Georgia Institute of Technology, Atlanta, GA, United States
| | - Arthur S. Edison
- Department of Genetics, University of Georgia, Athens, GA, United States,Complex Carbohydrate Research Center, University of Georgia, Athens, GA, United States,Department of Biochemistry, University of Georgia, Athens, GA, United States
| | - Lauren M. McIntyre
- Department of Molecular Genetics and Microbiology, University of Florida, Gainesville, FL, United States,University of Florida Genetics Institute, University of Florida, Gainesville, FL, United States,*Correspondence: Lauren M. McIntyre,
| |
Collapse
|
133
|
Titaley IA, Trine LSD, Wang T, Duberg D, Davis EL, Engwall M, Massey Simonich SL, Larsson M. Extensive chemical and bioassay analysis of polycyclic aromatic compounds in a creosote-contaminated superfund soil following steam enhanced extraction. ENVIRONMENTAL POLLUTION (BARKING, ESSEX : 1987) 2022; 312:120014. [PMID: 36007793 PMCID: PMC9869926 DOI: 10.1016/j.envpol.2022.120014] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/16/2022] [Revised: 08/08/2022] [Accepted: 08/17/2022] [Indexed: 06/15/2023]
Abstract
Polycyclic aromatic compounds (PACs) are organic compounds commonly found in contaminated soil. Previous studies have shown the removal of polycyclic aromatic hydrocarbons (PAHs) in creosote-contaminated soils during steam enhanced extraction (SEE). However, less is known about the removal of alkyl-PAHs and heterocyclic compounds, such as azaarenes, and oxygen- and sulfur-heterocyclic PACs (OPACs and PASHs, respectively). Further, the impact of SEE on the freely dissolved concentration of PACs in soil as well as the soil bioactivity pre- and post-SEE have yet to be addressed. To fulfil these research gaps, chemical and bioanalytical analysis of a creosote-contaminated soil, collected from a U.S. Superfund site, pre- and post-SEE were performed. The decrease of 64 PACs (5-100%) and increase in the concentrations of nine oxygenated-PAHs (OPAHs) (150%) during SEE, some of which are known to be toxic and can potentially contaminate ground water, were observed. The freely dissolved concentrations of PACs in soil were assessed using polyoxymethylene (POM) strips and the concentrations of 66 PACs decreased post-SEE (1-100%). Three in vitro reporter gene bioassays (DR-CALUX®, ERα-CALUX® and anti-AR CALUX®) were used to measure soil bioactivities pre- and post-SEE and all reporter gene bioassays measured soil bioactivity decreases post-SEE. Mass defect suspect screening tentatively identified 27 unique isomers of azaarenes and OPAC in the soil. As a remediation technique, SEE was found to remove alkyl-PAHs and heterocyclic PACs, reduce the concentrations of freely dissolved PACs, and decrease soil bioactivities.
Collapse
Affiliation(s)
- Ivan A Titaley
- Man-Technology-Environment (MTM) Research Centre, School of Science and Technology, Örebro University, Örebro SE-701 82, Sweden.
| | | | - Thanh Wang
- Man-Technology-Environment (MTM) Research Centre, School of Science and Technology, Örebro University, Örebro SE-701 82, Sweden
| | - Daniel Duberg
- Man-Technology-Environment (MTM) Research Centre, School of Science and Technology, Örebro University, Örebro SE-701 82, Sweden
| | - Eva L Davis
- Center for Environmental Solutions & Emergency Response, Groundwater, Watershed and Ecosystems Restoration Division, United States Environmental Protection Agency, Ada, OK, 74820, USA
| | - Magnus Engwall
- Man-Technology-Environment (MTM) Research Centre, School of Science and Technology, Örebro University, Örebro SE-701 82, Sweden
| | - Staci L Massey Simonich
- Department of Environmental and Molecular Toxicology, Oregon State University, Corvallis, OR, 97331, USA; Department of Chemistry, Oregon State University, Corvallis, OR, 97331, USA
| | - Maria Larsson
- Man-Technology-Environment (MTM) Research Centre, School of Science and Technology, Örebro University, Örebro SE-701 82, Sweden
| |
Collapse
|
134
|
Identification of Universally Applicable and Species-Specific Marker Peptides for Bacillus anthracis. LIFE (BASEL, SWITZERLAND) 2022; 12:life12101549. [PMID: 36294983 PMCID: PMC9605612 DOI: 10.3390/life12101549] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/16/2022] [Revised: 09/09/2022] [Accepted: 09/28/2022] [Indexed: 11/16/2022]
Abstract
Anthrax is a zoonotic infection caused by the bacterium Bacillus anthracis (BA). Specific identification of this pathogen often relies on targeting genes located on two extrachromosomal plasmids, which represent the major pathogenicity factors of BA. However, more recent findings show that these plasmids have also been found in other closely related Bacillus species. In this study, we investigated the possibility of identifying species-specific and universally applicable marker peptides for BA. For this purpose, we applied a high-resolution mass spectrometry-based approach for 42 BA isolates. Along with the genomic sequencing data and by developing a bioinformatics data evaluation pipeline, which uses a database containing most of the publicly available protein sequences worldwide (UniParc), we were able to identify eleven universal marker peptides unique to BA. These markers are located on the chromosome and therefore, might overcome known problems, such as observable loss of plasmids in environmental species, plasmid loss during cultivation in the lab, and the fact that the virulence plasmids are not necessarily a unique feature of BA. The identified chromosomally encoded markers in this study could extend the small panel of already existing chromosomal targets and along with targets for the virulence plasmids, may pave the way to an even more reliable identification of BA using genomics- as well as proteomics-based techniques.
Collapse
|
135
|
Bioactive Compounds from Marine Sponges and Algae: Effects on Cancer Cell Metabolome and Chemical Structures. Int J Mol Sci 2022; 23:ijms231810680. [PMID: 36142592 PMCID: PMC9502410 DOI: 10.3390/ijms231810680] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2022] [Revised: 09/04/2022] [Accepted: 09/08/2022] [Indexed: 11/17/2022] Open
Abstract
Metabolomics represent the set of small organic molecules generally called metabolites, which are located within cells, tissues or organisms. This new “omic” technology, together with other similar technologies (genomics, transcriptomics and proteomics) is becoming a widely used tool in cancer research, aiming at the understanding of global biology systems in their physiologic or altered conditions. Cancer is among the most alarming human diseases and it causes a considerable number of deaths each year. Cancer research is one of the most important fields in life sciences. In fact, several scientific advances have been made in recent years, aiming to illuminate the metabolism of cancer cells, which is different from that of healthy cells, as suggested by Otto Warburg in the 1950s. Studies on sponges and algae revealed that these organisms are the main sources of the marine bioactive compounds involved in drug discovery for cancer treatment and prevention. In this review, we analyzed these two promising groups of marine organisms to focus on new metabolomics approaches for the study of metabolic changes in cancer cell lines treated with chemical extracts from sponges and algae, and for the classification of the chemical structures of bioactive compounds that may potentially prove useful for specific biotechnological applications.
Collapse
|
136
|
Guo J, Huan T. Turning Metabolomics Data Processing from a “Black Box” to a “White Box”. LCGC NORTH AMERICA 2022. [DOI: 10.56530/lcgc.na.tn9486s6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Extracting thousands of metabolic features from liquid chromatography–mass spectrometry (LC–MS)–based metabolomics data is not easy. Although many feature extraction algorithms have been developed over the past few decades, automated feature extraction is still not a “white box” process. For instance, it is challenging to quickly determine the optimal parameters for the best feature extraction outcome. It is also impossible to extract every true metabolic feature. Moreover, there is contamination from false metabolic features of different sources, such as signal noise and in-source fragmentation. Our laboratory has recently developed a suite of bioinformatics tools to address these metabolic peak-picking challenges. The goal is to improve the peak-picking outcome quality, so we can effectively obtain biological information from the metabolomics data.
Collapse
|
137
|
MetaProClust-MS1: an MS1 Profiling Approach for Large-Scale Microbiome Screening. mSystems 2022; 7:e0038122. [PMID: 35950762 PMCID: PMC9426440 DOI: 10.1128/msystems.00381-22] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022] Open
Abstract
Metaproteomics is used to explore the functional dynamics of microbial communities. However, acquiring metaproteomic data by tandem mass spectrometry (MS/MS) is time-consuming and resource-intensive, and there is a demand for computational methods that can be used to reduce these resource requirements. We present MetaProClust-MS1, a computational framework for microbiome feature screening developed to prioritize samples for follow-up MS/MS. In this proof-of-concept study, we tested and compared MetaProClust-MS1 results on gut microbiome data, from fecal samples, acquired using short 15-min MS1-only chromatographic gradients and MS1 spectra from longer 60-min gradients to MS/MS-acquired data. We found that MetaProClust-MS1 identified robust gut microbiome responses caused by xenobiotics with significantly correlated cluster topologies of comparable data sets. We also used MetaProClust-MS1 to reanalyze data from both a clinical MS/MS diagnostic study of pediatric patients with inflammatory bowel disease and an experiment evaluating the therapeutic effects of a small molecule on the brain tissue of Alzheimer's disease mouse models. MetaProClust-MS1 clusters could distinguish between inflammatory bowel disease diagnoses (ulcerative colitis and Crohn's disease) using samples from mucosal luminal interface samples and identified hippocampal proteome shifts of Alzheimer's disease mouse models after small-molecule treatment. Therefore, we demonstrate that MetaProClust-MS1 can screen both microbiomes and single-species proteomes using only MS1 profiles, and our results suggest that this approach may be generalizable to any proteomics experiment. MetaProClust-MS1 may be especially useful for large-scale metaproteomic screening for the prioritization of samples for further metaproteomic characterization, using MS/MS, for instance, in addition to being a promising novel approach for clinical diagnostic screening. IMPORTANCE Growing evidence suggests that human gut microbiome composition and function are highly associated with health and disease. As such, high-throughput metaproteomic studies are becoming more common in gut microbiome research. However, using a conventional long liquid chromatography (LC)-MS/MS gradient metaproteomics approach as an initial screen in large-scale microbiome experiments can be slow and expensive. To combat this challenge, we introduce MetaProClust-MS1, a computational framework for microbiome screening using MS1-only profiles. In this proof-of-concept study, we show that MetaProClust-MS1 identifies clusters of gut microbiome treatments using MS1-only profiles similar to those identified using MS/MS. Our approach allows researchers to prioritize samples and treatments of interest for further metaproteomic analyses and may be generally applicable to any proteomic analysis. In particular, this approach may be especially useful for large-scale metaproteomic screening or in clinical settings where rapid diagnostic evidence is required.
Collapse
|
138
|
Pang Z, Zhou G, Ewald J, Chang L, Hacariz O, Basu N, Xia J. Using MetaboAnalyst 5.0 for LC-HRMS spectra processing, multi-omics integration and covariate adjustment of global metabolomics data. Nat Protoc 2022; 17:1735-1761. [PMID: 35715522 DOI: 10.1038/s41596-022-00710-w] [Citation(s) in RCA: 632] [Impact Index Per Article: 316.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2021] [Accepted: 05/18/2022] [Indexed: 01/01/2023]
Abstract
Liquid chromatography coupled with high-resolution mass spectrometry (LC-HRMS) has become a workhorse in global metabolomics studies with growing applications across biomedical and environmental sciences. However, outstanding bioinformatics challenges in terms of data processing, statistical analysis and functional interpretation remain critical barriers to the wider adoption of this technology. To help the user community overcome these barriers, we have made major updates to the well-established MetaboAnalyst platform ( www.metaboanalyst.ca ). This protocol extends the previous 2011 Nature Protocol by providing stepwise instructions on how to use MetaboAnalyst 5.0 to: optimize parameters for LC-HRMS spectra processing; obtain functional insights from peak list data; integrate metabolomics data with transcriptomics data or combine multiple metabolomics datasets; conduct exploratory statistical analysis with complex metadata. Parameter optimization may take ~2 h to complete depending on the server load, and the remaining three stages may be executed in ~60 min.
Collapse
Affiliation(s)
- Zhiqiang Pang
- Institute of Parasitology, McGill University, Montreal, Quebec, Canada
| | - Guangyan Zhou
- Institute of Parasitology, McGill University, Montreal, Quebec, Canada
| | - Jessica Ewald
- Department of Natural Resources Sciences, McGill University, Montreal, Quebec, Canada
| | - Le Chang
- Department of Human Genetics, McGill University, Montreal, Quebec, Canada
| | - Orcun Hacariz
- Institute of Parasitology, McGill University, Montreal, Quebec, Canada
| | - Niladri Basu
- Department of Natural Resources Sciences, McGill University, Montreal, Quebec, Canada
| | - Jianguo Xia
- Institute of Parasitology, McGill University, Montreal, Quebec, Canada.
- Department of Human Genetics, McGill University, Montreal, Quebec, Canada.
| |
Collapse
|
139
|
Jeong K, Babović M, Gorshkov V, Kim J, Jensen ON, Kohlbacher O. FLASHIda enables intelligent data acquisition for top-down proteomics to boost proteoform identification counts. Nat Commun 2022; 13:4407. [PMID: 35906205 PMCID: PMC9338294 DOI: 10.1038/s41467-022-31922-z] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2021] [Accepted: 07/07/2022] [Indexed: 11/17/2022] Open
Abstract
The detailed analysis and structural characterization of proteoforms by top-down proteomics (TDP) has gained a lot of interest in biomedical research. Data-dependent acquisition (DDA) of intact proteins is non-trivial due to the diversity and complexity of proteoforms. Dedicated acquisition methods thus have the potential to greatly improve TDP. Here, we present FLASHIda, an intelligent online data acquisition algorithm for TDP that ensures the real-time selection of high-quality precursors of diverse proteoforms. FLASHIda combines fast charge deconvolution algorithms and machine learning-based quality assessment for optimal precursor selection. In an analysis of E. coli lysate, FLASHIda increases the number of unique proteoform level identifications from 800 to 1500 or generates a near-identical number of identifications in one third of the instrument time when compared to standard DDA mode. Furthermore, FLASHIda enables sensitive mapping of post-translational modifications and detection of chemical adducts. As a software extension module to the instrument, FLASHIda can be readily adopted for TDP studies of complex samples to enhance proteoform identification rates.
Collapse
Affiliation(s)
- Kyowon Jeong
- Applied Bioinformatics, Computer Science Department, University of Tübingen, Sand 14, 72076, Tübingen, Germany.
- Institute for Bioinformatics and Medical Informatics, University of Tübingen, Sand 14, 72076, Tübingen, Germany.
| | - Maša Babović
- Department of Biochemistry & Molecular Biology and VILLUM Center for Bioanalytical Sciences, University of Southern Denmark, Campusvej 55, DK-5230 Odense M, Odense, Denmark
| | - Vladimir Gorshkov
- Department of Biochemistry & Molecular Biology and VILLUM Center for Bioanalytical Sciences, University of Southern Denmark, Campusvej 55, DK-5230 Odense M, Odense, Denmark
| | - Jihyung Kim
- Applied Bioinformatics, Computer Science Department, University of Tübingen, Sand 14, 72076, Tübingen, Germany
- Institute for Bioinformatics and Medical Informatics, University of Tübingen, Sand 14, 72076, Tübingen, Germany
| | - Ole N Jensen
- Department of Biochemistry & Molecular Biology and VILLUM Center for Bioanalytical Sciences, University of Southern Denmark, Campusvej 55, DK-5230 Odense M, Odense, Denmark
| | - Oliver Kohlbacher
- Applied Bioinformatics, Computer Science Department, University of Tübingen, Sand 14, 72076, Tübingen, Germany.
- Institute for Bioinformatics and Medical Informatics, University of Tübingen, Sand 14, 72076, Tübingen, Germany.
- Translational Bioinformatics, University Hospital Tübingen, Hoppe-Seyler-Str. 9, 72076, Tübingen, Germany.
| |
Collapse
|
140
|
Jiang J, Zhan L, Dai L, Yao X, Qin Y, Zhu Z, Zhang M, Tong W, Wang G. Evaluation of the reliability of MS1-based approach to profile naturally occurring peptides with clinical relevance in urine samples. RAPID COMMUNICATIONS IN MASS SPECTROMETRY : RCM 2022:e9369. [PMID: 35906701 DOI: 10.1002/rcm.9369] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/06/2022] [Revised: 07/02/2022] [Accepted: 07/27/2022] [Indexed: 06/15/2023]
Abstract
RATIONALE The profiling of natural urinary peptides is a valuable indicator of kidney condition. While front-end separation limits the speed of peptidomic profiling, MS1-based results suffer from limited peptide coverage and specificity. Clinical studies on chronic kidney disease require an effective strategy to balance the trade-off between identification depth and throughput. METHODS CKD273, a urinary proteome classifier associated with chronic kidney disease, in samples from diabetic nephropathy patients was profiled in parallel using capillary electrophoresis-mass spectrometry (CE-MS), liquid chromatography with mass spectrometry (LC-MS), and matrix-assisted laser desorption/ionization-mass spectrometry (MALDI-MS). Through cross-comparison of results from MS1 of unfractionated peptides and elution-time-resolved MS1 as well as MS/MS in LC- and CE-MS approaches, we evaluated the contribution of false-positive identification to MS1-based identification and quantitation, and analyzed the benefit of front-end separation in terms of accuracy and efficiency. RESULTS In LC- and CE-MS, although MS1 data resulted in higher number of identifications than MS/MS, elution-time-dependent analysis revealed extensive interference by non-CKD273 peptides, which would contribute up to 50% to quantitation if they are not separated from genuine CKD273 peptides. In the absence of separation, MS1 data resulted in lower numbers of identifications and abundance pattern that significantly deviated from those by liquid chromatography with tandem mass spectrometry (LC-MS/MS) or capillary electrophoresis with tandem mass spectrometry (CE-MS/MS). CE showed higher identification efficiency even when less sample was used or achieved faster separation. CONCLUSIONS To ensure the reliability of MS1-based urinary peptide profiling, front-end separation should not be omitted, and elution time should be used in addition to intact mass for identification. Including MS/MS in data acquisition does not compromise the speed or identification number, while benefiting data reliability by providing real-time sequence verification.
Collapse
Affiliation(s)
- Jialu Jiang
- School of Chemistry and Materials Science, Nanjing Normal University, Nanjing, China
- Shenzhen Bay Laboratory, Institute for Cell Analysis, Shenzhen, China
| | - Lingpeng Zhan
- Shenzhen Bay Laboratory, Institute for Cell Analysis, Shenzhen, China
| | - Liuyan Dai
- Department of Endocrinology, The First Affiliated Hospital with Nanjing Medical University, Nanjing Medical University, Nanjing, China
| | - Xiaopeng Yao
- School of Chemistry and Materials Science, Nanjing Normal University, Nanjing, China
- Shenzhen Bay Laboratory, Institute for Cell Analysis, Shenzhen, China
| | - Yao Qin
- Department of Endocrinology, The First Affiliated Hospital with Nanjing Medical University, Nanjing Medical University, Nanjing, China
| | - Zhongqin Zhu
- School of Chemistry and Materials Science, Nanjing Normal University, Nanjing, China
- Shenzhen Bay Laboratory, Institute for Cell Analysis, Shenzhen, China
| | - Mei Zhang
- Department of Endocrinology, The First Affiliated Hospital with Nanjing Medical University, Nanjing Medical University, Nanjing, China
| | - Wenjun Tong
- School of Chemistry and Materials Science, Nanjing Normal University, Nanjing, China
| | - Guanbo Wang
- Shenzhen Bay Laboratory, Institute for Cell Analysis, Shenzhen, China
- Biomedical Pioneering Innovation Centre, Peking University, Beijing, China
| |
Collapse
|
141
|
Wolf EJ, Grünberg S, Dai N, Chen TH, Roy B, Yigit E, Corrêa I. Human RNase 4 improves mRNA sequence characterization by LC–MS/MS. Nucleic Acids Res 2022; 50:e106. [PMID: 35871301 PMCID: PMC9561288 DOI: 10.1093/nar/gkac632] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2022] [Revised: 06/22/2022] [Accepted: 07/20/2022] [Indexed: 11/17/2022] Open
Abstract
With the rapid growth of synthetic messenger RNA (mRNA)-based therapeutics and vaccines, the development of analytical tools for characterization of long, complex RNAs has become essential. Tandem liquid chromatography–mass spectrometry (LC–MS/MS) permits direct assessment of the mRNA primary sequence and modifications thereof without conversion to cDNA or amplification. It relies upon digestion of mRNA with site-specific endoribonucleases to generate pools of short oligonucleotides that are then amenable to MS-based sequence analysis. Here, we showed that the uridine-specific human endoribonuclease hRNase 4 improves mRNA sequence coverage, in comparison with the benchmark enzyme RNase T1, by producing a larger population of uniquely mappable cleavage products. We deployed hRNase 4 to characterize mRNAs fully substituted with 1-methylpseudouridine (m1Ψ) or 5-methoxyuridine (mo5U), as well as mRNAs selectively depleted of uridine–two key strategies to reduce synthetic mRNA immunogenicity. Lastly, we demonstrated that hRNase 4 enables direct assessment of the 5′ cap incorporation into in vitro transcribed mRNA. Collectively, this study highlights the power of hRNase 4 to interrogate mRNA sequence, identity, and modifications by LC–MS/MS.
Collapse
Affiliation(s)
- Eric J Wolf
- New England Biolabs, Inc, 43/44 Dunham Ridge, Beverly, MA 01915, USA
| | | | - Nan Dai
- New England Biolabs, Inc, 43/44 Dunham Ridge, Beverly, MA 01915, USA
| | - Tien-Hao Chen
- New England Biolabs, Inc, 43/44 Dunham Ridge, Beverly, MA 01915, USA
| | - Bijoyita Roy
- New England Biolabs, Inc, 43/44 Dunham Ridge, Beverly, MA 01915, USA
| | - Erbay Yigit
- New England Biolabs, Inc, 43/44 Dunham Ridge, Beverly, MA 01915, USA
| | - Ivan R Corrêa
- To whom correspondence should be addressed. Tel: +1 978 380 7504;
| |
Collapse
|
142
|
Yan S, Bhawal R, Yin Z, Thannhauser TW, Zhang S. Recent advances in proteomics and metabolomics in plants. MOLECULAR HORTICULTURE 2022; 2:17. [PMID: 37789425 PMCID: PMC10514990 DOI: 10.1186/s43897-022-00038-9] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/24/2022] [Accepted: 06/20/2022] [Indexed: 10/05/2023]
Abstract
Over the past decade, systems biology and plant-omics have increasingly become the main stream in plant biology research. New developments in mass spectrometry and bioinformatics tools, and methodological schema to integrate multi-omics data have leveraged recent advances in proteomics and metabolomics. These progresses are driving a rapid evolution in the field of plant research, greatly facilitating our understanding of the mechanistic aspects of plant metabolisms and the interactions of plants with their external environment. Here, we review the recent progresses in MS-based proteomics and metabolomics tools and workflows with a special focus on their applications to plant biology research using several case studies related to mechanistic understanding of stress response, gene/protein function characterization, metabolic and signaling pathways exploration, and natural product discovery. We also present a projection concerning future perspectives in MS-based proteomics and metabolomics development including their applications to and challenges for system biology. This review is intended to provide readers with an overview of how advanced MS technology, and integrated application of proteomics and metabolomics can be used to advance plant system biology research.
Collapse
Affiliation(s)
- Shijuan Yan
- Guangdong Key Laboratory for Crop Germplasm Resources Preservation and Utilization, Agro-biological Gene Research Center, Guangdong Academy of Agricultural Sciences, Guangzhou, China
| | - Ruchika Bhawal
- Proteomics and Metabolomics Facility, Institute of Biotechnology, Cornell University, 139 Biotechnology Building, 526 Campus Road, Ithaca, NY, 14853, USA
| | - Zhibin Yin
- Guangdong Key Laboratory for Crop Germplasm Resources Preservation and Utilization, Agro-biological Gene Research Center, Guangdong Academy of Agricultural Sciences, Guangzhou, China
| | | | - Sheng Zhang
- Proteomics and Metabolomics Facility, Institute of Biotechnology, Cornell University, 139 Biotechnology Building, 526 Campus Road, Ithaca, NY, 14853, USA.
| |
Collapse
|
143
|
Albores-Mendez EM, Aguilera Hernández AD, Melo-González A, Vargas-Hernández MA, Gutierrez de la Cruz N, Vazquez-Guzman MA, Castro-Marín M, Romero-Morelos P, Winkler R. A diagnostic model for overweight and obesity from untargeted urine metabolomics of soldiers. PeerJ 2022; 10:e13754. [PMID: 35898940 PMCID: PMC9310780 DOI: 10.7717/peerj.13754] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2022] [Accepted: 06/28/2022] [Indexed: 01/17/2023] Open
Abstract
Soldiers in active military service need optimal physical fitness for successfully carrying out their operations. Therefore, their health status is regularly checked by army doctors. These inspections include physical parameters such as the body-mass index (BMI), functional tests, and biochemical studies. If a medical exam reveals an individual's excess weight, further examinations are made, and corrective actions for weight lowering are initiated. The collection of urine is non-invasive and therefore attractive for frequent metabolic screening. We compared the chemical profiles of urinary samples of 146 normal weight, excess weight, and obese soldiers of the Mexican Army, using untargeted metabolomics with liquid chromatography coupled to high-resolution mass spectrometry (LC-MS). In combination with data mining, statistical and metabolic pathway analyses suggest increased S-adenosyl-L-methionine (SAM) levels and changes of amino acid metabolites as important variables for overfeeding. We will use these potential biomarkers for the ongoing metabolic monitoring of soldiers in active service. In addition, after validation of our results, we will develop biochemical screening tests that are also suitable for civil applications.
Collapse
Affiliation(s)
- Exsal M. Albores-Mendez
- Escuela Militar de Graduados de Sanidad, Universidad del Ejército y Fuerza Aérea Mexicanos, Secretaría de la Defensa Nacional, Mexico City, Mexico
| | - Alexis D. Aguilera Hernández
- Escuela Militar de Graduados de Sanidad, Universidad del Ejército y Fuerza Aérea Mexicanos, Secretaría de la Defensa Nacional, Mexico City, Mexico
| | - Alejandra Melo-González
- Escuela Militar de Graduados de Sanidad, Universidad del Ejército y Fuerza Aérea Mexicanos, Secretaría de la Defensa Nacional, Mexico City, Mexico
| | - Marco A. Vargas-Hernández
- Escuela Militar de Graduados de Sanidad, Universidad del Ejército y Fuerza Aérea Mexicanos, Secretaría de la Defensa Nacional, Mexico City, Mexico
| | - Neptalí Gutierrez de la Cruz
- Escuela Militar de Graduados de Sanidad, Universidad del Ejército y Fuerza Aérea Mexicanos, Secretaría de la Defensa Nacional, Mexico City, Mexico
| | - Miguel A. Vazquez-Guzman
- Escuela Militar de Graduados de Sanidad, Universidad del Ejército y Fuerza Aérea Mexicanos, Secretaría de la Defensa Nacional, Mexico City, Mexico,Centro de Investigación en Ciencias de la Salud (CICSA), FCS, Universidad Anahuac Mexico, Campus Norte, Mexico City, Mexico
| | - Melchor Castro-Marín
- Escuela Militar de Graduados de Sanidad, Universidad del Ejército y Fuerza Aérea Mexicanos, Secretaría de la Defensa Nacional, Mexico City, Mexico
| | - Pablo Romero-Morelos
- Escuela Militar de Graduados de Sanidad, Universidad del Ejército y Fuerza Aérea Mexicanos, Secretaría de la Defensa Nacional, Mexico City, Mexico,Universidad Estatal del Valle de Ecatepec, Ecatepec, Mexico
| | - Robert Winkler
- UGA-Langebio, CINVESTAV, Irapuato, Gto., Mexico,Biotechnology and Biochemistry, CINVESTAV Unidad Irapuato, Irapuato, Gto., Mexico
| |
Collapse
|
144
|
Tzanakis K, Nattkemper TW, Niehaus K, Albaum SP. MetHoS: a platform for large-scale processing, storage and analysis of metabolomics data. BMC Bioinformatics 2022; 23:267. [PMID: 35804309 PMCID: PMC9270834 DOI: 10.1186/s12859-022-04793-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2021] [Accepted: 06/14/2022] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Modern mass spectrometry has revolutionized the detection and analysis of metabolites but likewise, let the data skyrocket with repositories for metabolomics data filling up with thousands of datasets. While there are many software tools for the analysis of individual experiments with a few to dozens of chromatograms, we see a demand for a contemporary software solution capable of processing and analyzing hundreds or even thousands of experiments in an integrative manner with standardized workflows. RESULTS Here, we introduce MetHoS as an automated web-based software platform for the processing, storage and analysis of great amounts of mass spectrometry-based metabolomics data sets originating from different metabolomics studies. MetHoS is based on Big Data frameworks to enable parallel processing, distributed storage and distributed analysis of even larger data sets across clusters of computers in a highly scalable manner. It has been designed to allow the processing and analysis of any amount of experiments and samples in an integrative manner. In order to demonstrate the capabilities of MetHoS, thousands of experiments were downloaded from the MetaboLights database and used to perform a large-scale processing, storage and statistical analysis in a proof-of-concept study. CONCLUSIONS MetHoS is suitable for large-scale processing, storage and analysis of metabolomics data aiming at untargeted metabolomic analyses. It is freely available at: https://methos.cebitec.uni-bielefeld.de/ . Users interested in analyzing their own data are encouraged to apply for an account.
Collapse
Affiliation(s)
- Konstantinos Tzanakis
- International Research Training Group "Computational Methods for the Analysis of the Diversity and Dynamics of Genomes", Faculty of Technology, Bielefeld University, Bielefeld, Germany.
| | - Tim W Nattkemper
- Biodata Mining Group, Center for Biotechnology (CeBiTec), Faculty of Technology, Bielefeld University, Bielefeld, Germany
| | - Karsten Niehaus
- Proteome and Metabolome Research, Center for Biotechnology (CeBiTec), Faculty of Biology, Bielefeld University, Bielefeld, Germany
| | - Stefan P Albaum
- Bioinformatics Resource Facility, Center for Biotechnology (CeBiTec), Bielefeld University, Bielefeld, Germany
| |
Collapse
|
145
|
Hoffmann N, Mayer G, Has C, Kopczynski D, Al Machot F, Schwudke D, Ahrends R, Marcus K, Eisenacher M, Turewicz M. A Current Encyclopedia of Bioinformatics Tools, Data Formats and Resources for Mass Spectrometry Lipidomics. Metabolites 2022; 12:584. [PMID: 35888710 PMCID: PMC9319858 DOI: 10.3390/metabo12070584] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2022] [Revised: 06/17/2022] [Accepted: 06/19/2022] [Indexed: 12/13/2022] Open
Abstract
Mass spectrometry is a widely used technology to identify and quantify biomolecules such as lipids, metabolites and proteins necessary for biomedical research. In this study, we catalogued freely available software tools, libraries, databases, repositories and resources that support lipidomics data analysis and determined the scope of currently used analytical technologies. Because of the tremendous importance of data interoperability, we assessed the support of standardized data formats in mass spectrometric (MS)-based lipidomics workflows. We included tools in our comparison that support targeted as well as untargeted analysis using direct infusion/shotgun (DI-MS), liquid chromatography-mass spectrometry, ion mobility or MS imaging approaches on MS1 and potentially higher MS levels. As a result, we determined that the Human Proteome Organization-Proteomics Standards Initiative standard data formats, mzML and mzTab-M, are already supported by a substantial number of recent software tools. We further discuss how mzTab-M can serve as a bridge between data acquisition and lipid bioinformatics tools for interpretation, capturing their output and transmitting rich annotated data for downstream processing. However, we identified several challenges of currently available tools and standards. Potential areas for improvement were: adaptation of common nomenclature and standardized reporting to enable high throughput lipidomics and improve its data handling. Finally, we suggest specific areas where tools and repositories need to improve to become FAIRer.
Collapse
Affiliation(s)
- Nils Hoffmann
- Forschungszentrum Jülich GmbH, Institute for Bio- and Geosciences (IBG-5), 52425 Jülich, Germany
| | - Gerhard Mayer
- Institute of Medical Systems Biology, Ulm University, 89081 Ulm, Germany;
| | - Canan Has
- Biological Mass Spectrometry, Max Planck Institute of Molecular Cell Biology and Genetics, 01307 Dresden, Germany;
- University Hospital Carl Gustav Carus, 01307 Dresden, Germany
- CENTOGENE GmbH, 18055 Rostock, Germany
| | - Dominik Kopczynski
- Department of Analytical Chemistry, University of Vienna, 1090 Vienna, Austria; (D.K.); (R.A.)
| | - Fadi Al Machot
- Faculty of Science and Technology, Norwegian University for Life Science (NMBU), 1433 Ås, Norway;
| | - Dominik Schwudke
- Bioanalytical Chemistry, Forschungszentrum Borstel, Leibniz Lung Center, 23845 Borstel, Germany;
- Airway Research Center North, German Center for Lung Research (DZL), 23845 Borstel, Germany
- German Center for Infection Research (DZIF), TTU Tuberculosis, 23845 Borstel, Germany
| | - Robert Ahrends
- Department of Analytical Chemistry, University of Vienna, 1090 Vienna, Austria; (D.K.); (R.A.)
| | - Katrin Marcus
- Center for Protein Diagnostics (ProDi), Medical Proteome Analysis, Ruhr University Bochum, 44801 Bochum, Germany; (K.M.); (M.E.)
| | - Martin Eisenacher
- Center for Protein Diagnostics (ProDi), Medical Proteome Analysis, Ruhr University Bochum, 44801 Bochum, Germany; (K.M.); (M.E.)
- Faculty of Medicine, Medizinisches Proteom-Center, Ruhr University Bochum, 44801 Bochum, Germany
| | - Michael Turewicz
- Institute for Clinical Biochemistry and Pathobiochemistry, German Diabetes Center (DDZ), Leibniz Center for Diabetes Research at Heinrich-Heine-University Düsseldorf, 40225 Düsseldorf, Germany
- German Center for Diabetes Research (DZD), Partner Düsseldorf, 85764 Neuherberg, Germany
| |
Collapse
|
146
|
Geddes-McAlister J, Prudhomme N, Gutierrez Gongora D, Cossar D, McLean MD. The emerging role of mass spectrometry-based proteomics in molecular pharming practices. Curr Opin Chem Biol 2022; 68:102133. [DOI: 10.1016/j.cbpa.2022.102133] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2021] [Revised: 02/02/2022] [Accepted: 02/23/2022] [Indexed: 12/11/2022]
|
147
|
Bueschl C, Doppler M, Varga E, Seidl B, Flasch M, Warth B, Zanghellini J. PeakBot: Machine learning based chromatographic peak picking. Bioinformatics 2022; 38:3422-3428. [PMID: 35604083 PMCID: PMC9237678 DOI: 10.1093/bioinformatics/btac344] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2021] [Revised: 04/12/2022] [Accepted: 05/17/2022] [Indexed: 11/13/2022] Open
Abstract
Motivation Chromatographic peak picking is among the first steps in data processing workflows of raw LC-HRMS datasets in untargeted metabolomics applications. Its performance is crucial for the holistic detection of all metabolic features as well as their relative quantification for statistical analysis and metabolite identification. Random noise, non-baseline separated compounds and unspecific background signals complicate this task. Results A machine-learning-based approach entitled PeakBot was developed for detecting chromatographic peaks in LC-HRMS profile-mode data. It first detects all local signal maxima in a chromatogram, which are then extracted as super-sampled standardized areas (retention-time versus m/z). These are subsequently inspected by a custom-trained convolutional neural network that forms the basis of PeakBot’s architecture. The model reports if the respective local maximum is the apex of a chromatographic peak or not as well as its peak center and bounding box. In training and independent validation datasets used for development, PeakBot achieved a high performance with respect to discriminating between chromatographic peaks and background signals (accuracy of 0.99). For training the machine-learning model a minimum of 100 reference features are needed to learn their characteristics to achieve high-quality peak-picking results for detecting such chromatographic peaks in an untargeted fashion. PeakBot is implemented in python (3.8) and uses the TensorFlow (2.5.0) package for machine-learning related tasks. It has been tested on Linux and Windows OSs. Availability and implementation The package is available free of charge for non-commercial use (CC BY-NC-SA). It is available at https://github.com/christophuv/PeakBot. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Christoph Bueschl
- Department of Analytical Chemistry, University of Vienna, Waehringer Str. 40, A-1090 Vienna.,University of Natural Resources and Life Sciences, Vienna, Department of Agrobiotechnology IFA-Tulln, Institute of Bioanalytics and Agro-Metabolomics, Konrad-Lorenz-Str. 20, A-3430 Tulln
| | - Maria Doppler
- University of Natural Resources and Life Sciences, Vienna, Department of Agrobiotechnology IFA-Tulln, Institute of Bioanalytics and Agro-Metabolomics, Konrad-Lorenz-Str. 20, A-3430 Tulln.,University of Natural Resources and Life Sciences, Vienna, Core Facility Bioactive Molecules: Screening and Analysis, Konrad-Lorenz-Str. 20, A-3430 Tulln
| | - Elisabeth Varga
- Department of Food Chemistry and Toxicology, University of Vienna, Waehringer Str. 38, A-1090 Vienna
| | - Bernhard Seidl
- University of Natural Resources and Life Sciences, Vienna, Department of Agrobiotechnology IFA-Tulln, Institute of Bioanalytics and Agro-Metabolomics, Konrad-Lorenz-Str. 20, A-3430 Tulln
| | - Mira Flasch
- Department of Food Chemistry and Toxicology, University of Vienna, Waehringer Str. 38, A-1090 Vienna
| | - Benedikt Warth
- Department of Food Chemistry and Toxicology, University of Vienna, Waehringer Str. 38, A-1090 Vienna
| | - Juergen Zanghellini
- Department of Analytical Chemistry, University of Vienna, Waehringer Str. 40, A-1090 Vienna
| |
Collapse
|
148
|
Popov RS, Ivanchina NV, Dmitrenok PS. Application of MS-Based Metabolomic Approaches in Analysis of Starfish and Sea Cucumber Bioactive Compounds. Mar Drugs 2022; 20:320. [PMID: 35621972 PMCID: PMC9147407 DOI: 10.3390/md20050320] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2022] [Revised: 05/11/2022] [Accepted: 05/11/2022] [Indexed: 12/12/2022] Open
Abstract
Today, marine natural products are considered one of the main sources of compounds for drug development. Starfish and sea cucumbers are potential sources of natural products of pharmaceutical interest. Among their metabolites, polar steroids, triterpene glycosides, and polar lipids have attracted a great deal of attention; however, studying these compounds by conventional methods is challenging. The application of modern MS-based approaches can help to obtain valuable information about such compounds. This review provides an up-to-date overview of MS-based applications for starfish and sea cucumber bioactive compounds analysis. While describing most characteristic features of MS-based approaches in the context of starfish and sea cucumber metabolites, including sample preparation and MS analysis steps, the present paper mainly focuses on the application of MS-based metabolic profiling of polar steroid compounds, triterpene glycosides, and lipids. The application of MS in metabolomics studies is also outlined.
Collapse
Affiliation(s)
- Roman S. Popov
- G.B. Elyakov Pacific Institute of Bioorganic Chemistry, Far Eastern Branch of Russian Academy of Sciences, 159 Prospect 100-let Vladivostoku, Vladivostok 690022, Russia;
| | | | - Pavel S. Dmitrenok
- G.B. Elyakov Pacific Institute of Bioorganic Chemistry, Far Eastern Branch of Russian Academy of Sciences, 159 Prospect 100-let Vladivostoku, Vladivostok 690022, Russia;
| |
Collapse
|
149
|
Pinter N, Glätzer D, Fahrner M, Fröhlich K, Johnson J, Grüning BA, Warscheid B, Drepper F, Schilling O, Föll MC. MaxQuant and MSstats in Galaxy Enable Reproducible Cloud-Based Analysis of Quantitative Proteomics Experiments for Everyone. J Proteome Res 2022; 21:1558-1565. [PMID: 35503992 DOI: 10.1021/acs.jproteome.2c00051] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Quantitative mass spectrometry-based proteomics has become a high-throughput technology for the identification and quantification of thousands of proteins in complex biological samples. Two frequently used tools, MaxQuant and MSstats, allow for the analysis of raw data and finding proteins with differential abundance between conditions of interest. To enable accessible and reproducible quantitative proteomics analyses in a cloud environment, we have integrated MaxQuant (including TMTpro 16/18plex), Proteomics Quality Control (PTXQC), MSstats, and MSstatsTMT into the open-source Galaxy framework. This enables the web-based analysis of label-free and isobaric labeling proteomics experiments via Galaxy's graphical user interface on public clouds. MaxQuant and MSstats in Galaxy can be applied in conjunction with thousands of existing Galaxy tools and integrated into standardized, sharable workflows. Galaxy tracks all metadata and intermediate results in analysis histories, which can be shared privately for collaborations or publicly, allowing full reproducibility and transparency of published analysis. To further increase accessibility, we provide detailed hands-on training materials. The integration of MaxQuant and MSstats into the Galaxy framework enables their usage in a reproducible way on accessible large computational infrastructures, hence realizing the foundation for high-throughput proteomics data science for everyone.
Collapse
Affiliation(s)
- Niko Pinter
- Institute for Surgical Pathology, Medical Center, University of Freiburg, 79106 Freiburg, Germany.,Faculty of Medicine, University of Freiburg, 79110 Freiburg, Germany
| | - Damian Glätzer
- Biochemistry and Functional Proteomics, Institute of Biology II, Faculty of Biology, University of Freiburg, 79104 Freiburg, Germany
| | - Matthias Fahrner
- Institute for Surgical Pathology, Medical Center, University of Freiburg, 79106 Freiburg, Germany.,Faculty of Medicine, University of Freiburg, 79110 Freiburg, Germany.,Faculty of Biology, University of Freiburg, 79104 Freiburg, Germany
| | - Klemens Fröhlich
- Institute for Surgical Pathology, Medical Center, University of Freiburg, 79106 Freiburg, Germany.,Faculty of Medicine, University of Freiburg, 79110 Freiburg, Germany.,Faculty of Biology, University of Freiburg, 79104 Freiburg, Germany.,Spemann Graduate School of Biology and Medicine (SGBM), Albert-Ludwigs-University Freiburg, 79104 Freiburg, Germany
| | - James Johnson
- Minnesota Supercomputing Institute, University of Minnesota, Minneapolis, Minnesota 55455, United States
| | | | - Bettina Warscheid
- Biochemistry and Functional Proteomics, Institute of Biology II, Faculty of Biology, University of Freiburg, 79104 Freiburg, Germany.,Faculty of Chemistry and Pharmacy, Department of Biochemistry, Julius Maximilian University of Würzburg, 97074 Würzburg, Germany
| | - Friedel Drepper
- Biochemistry and Functional Proteomics, Institute of Biology II, Faculty of Biology, University of Freiburg, 79104 Freiburg, Germany
| | - Oliver Schilling
- Institute for Surgical Pathology, Medical Center, University of Freiburg, 79106 Freiburg, Germany.,Faculty of Medicine, University of Freiburg, 79110 Freiburg, Germany.,German Cancer Consortium (DKTK) and Cancer Research Center (DKFZ), 79106 Freiburg, Germany
| | - Melanie Christine Föll
- Institute for Surgical Pathology, Medical Center, University of Freiburg, 79106 Freiburg, Germany.,Faculty of Medicine, University of Freiburg, 79110 Freiburg, Germany.,Khoury College of Computer Sciences, Northeastern University, Boston, Massachusetts 02115, United States
| |
Collapse
|
150
|
Priyadarshini M, Navarro G, Reiman DJ, Sharma A, Xu K, Lednovich K, Manzella CR, Khan MW, Garcia MS, Allard S, Wicksteed B, Chlipala GE, Szynal B, Bernabe BP, Maki PM, Gill RK, Perdew GH, Gilbert J, Dai Y, Layden BT. Gestational Insulin Resistance Is Mediated by the Gut Microbiome-Indoleamine 2,3-Dioxygenase Axis. Gastroenterology 2022; 162:1675-1689.e11. [PMID: 35032499 PMCID: PMC9040389 DOI: 10.1053/j.gastro.2022.01.008] [Citation(s) in RCA: 15] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/12/2021] [Revised: 12/23/2021] [Accepted: 01/03/2022] [Indexed: 01/07/2023]
Abstract
BACKGROUND & AIMS Normal gestation involves a reprogramming of the maternal gut microbiome (GM) that contributes to maternal metabolic changes by unclear mechanisms. This study aimed to understand the mechanistic underpinnings of the GM-maternal metabolism interaction. METHODS The GM and plasma metabolome of CD1, NIH-Swiss, and C57 mice were analyzed with the use of 16S rRNA sequencing and untargeted liquid chromatography-mass spectrometry throughout gestation. Pharmacologic and genetic knockout mouse models were used to identify the role of indoleamine 2,3-dioxygenase (IDO1) in pregnancy-associated insulin resistance (IR). Involvement of gestational GM was studied with the use of fecal microbial transplants (FMTs). RESULTS Significant variation in GM alpha diversity occurred throughout pregnancy. Enrichment in gut bacterial taxa was mouse strain and pregnancy time point specific, with the species enriched at gestation day 15/19 (G15/19), a point of heightened IR, being distinct from those enriched before or after pregnancy. Metabolomics revealed elevated plasma kynurenine at G15/19 in all 3 mouse strains. IDO1, the rate-limiting enzyme for kynurenine production, had increased intestinal expression at G15, which was associated with mild systemic and gut inflammation. Pharmacologic and genetic inhibition of IDO1 inhibited kynurenine levels and reversed pregnancy-associated IR. FMT revealed that IDO1 induction and local kynurenine level effects on IR derive from the GM in both mouse and human pregnancy. CONCLUSIONS GM changes accompanying pregnancy shift IDO1-dependent tryptophan metabolism toward kynurenine production, intestinal inflammation, and gestational IR, a phenotype reversed by genetic deletion or inhibition of IDO1. (Gestational Gut Microbiome-IDO1 Axis Mediates Pregnancy Insulin Resistance; EMBL-ENA ID: PRJEB45047. MetaboLights ID: MTBLS3598).
Collapse
Affiliation(s)
- Medha Priyadarshini
- Division of Endocrinology, Diabetes, and Metabolism and UIC, Chicago-IL, U.S.A
| | - Guadalupe Navarro
- Division of Endocrinology, Diabetes, and Metabolism and UIC, Chicago-IL, U.S.A
| | - Derek J Reiman
- Department of Biomedical Engineering, UIC, Chicago-IL, U.S.A
| | - Anukriti Sharma
- Department of Quantitative Health Sciences, Lerner Research Institute, Cleveland Clinic Main Campus, Cleveland-OH, U.S.A
| | - Kai Xu
- Division of Endocrinology, Diabetes, and Metabolism and UIC, Chicago-IL, U.S.A
| | - Kristen Lednovich
- Division of Endocrinology, Diabetes, and Metabolism and UIC, Chicago-IL, U.S.A
| | | | - Md Wasim Khan
- Division of Endocrinology, Diabetes, and Metabolism and UIC, Chicago-IL, U.S.A
| | - Mariana Salas Garcia
- Department of Pediatrics, University of California San Diego (UCSD) School of Medicine, La Jolla-CA, U.S.A
| | - Sarah Allard
- Department of Pediatrics, University of California San Diego (UCSD) School of Medicine, La Jolla-CA, U.S.A
| | - Barton Wicksteed
- Division of Endocrinology, Diabetes, and Metabolism and UIC, Chicago-IL, U.S.A
| | - George E Chlipala
- Research Informatics Core, Research Resources Center, UIC, Chicago-IL, U.S.A
| | - Barbara Szynal
- Division of Endocrinology, Diabetes, and Metabolism and UIC, Chicago-IL, U.S.A
| | | | - Pauline M Maki
- Department of Psychiatry, UIC, Chicago-IL, U.S.A.; Department of Psychology, and UIC, Chicago-IL, U.S.A.; Department of Obstetrics and Gynecology, UIC, Chicago-IL, U.S.A
| | - Ravinder K Gill
- Division of Gastroenterology and Hepatology, UIC, Chicago-IL, U.S.A
| | - Gary H Perdew
- Department of Veterinary and Biomedical Sciences, Center for Molecular Toxicology and Carcinogenesis, The Pennsylvania State University, Pennsylvania, U.S.A
| | - Jack Gilbert
- Department of Pediatrics, University of California San Diego (UCSD) School of Medicine, La Jolla-CA, U.S.A.; Scripps Institution of Oceanography, UCSD, La Jolla-CA, U.S.A
| | - Yang Dai
- Department of Biomedical Engineering, UIC, Chicago-IL, U.S.A
| | - Brian T Layden
- Division of Endocrinology, Diabetes, and Metabolism, University of Illinois, Chicago, Illinois; Jesse Brown Veterans Affair Medical Center, Chicago, Illinois.
| |
Collapse
|