1
|
Fröhlich K, Fahrner M, Brombacher E, Seredynska A, Maldacker M, Kreutz C, Schmidt A, Schilling O. Data-Independent Acquisition: A Milestone and Prospect in Clinical Mass Spectrometry-Based Proteomics. Mol Cell Proteomics 2024; 23:100800. [PMID: 38880244 PMCID: PMC11380018 DOI: 10.1016/j.mcpro.2024.100800] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2024] [Revised: 06/08/2024] [Accepted: 06/13/2024] [Indexed: 06/18/2024] Open
Abstract
Data-independent acquisition (DIA) has revolutionized the field of mass spectrometry (MS)-based proteomics over the past few years. DIA stands out for its ability to systematically sample all peptides in a given m/z range, allowing an unbiased acquisition of proteomics data. This greatly mitigates the issue of missing values and significantly enhances quantitative accuracy, precision, and reproducibility compared to many traditional methods. This review focuses on the critical role of DIA analysis software tools, primarily focusing on their capabilities and the challenges they address in proteomic research. Advances in MS technology, such as trapped ion mobility spectrometry, or high field asymmetric waveform ion mobility spectrometry require sophisticated analysis software capable of handling the increased data complexity and exploiting the full potential of DIA. We identify and critically evaluate leading software tools in the DIA landscape, discussing their unique features, and the reliability of their quantitative and qualitative outputs. We present the biological and clinical relevance of DIA-MS and discuss crucial publications that paved the way for in-depth proteomic characterization in patient-derived specimens. Furthermore, we provide a perspective on emerging trends in clinical applications and present upcoming challenges including standardization and certification of MS-based acquisition strategies in molecular diagnostics. While we emphasize the need for continuous development of software tools to keep pace with evolving technologies, we advise researchers against uncritically accepting the results from DIA software tools. Each tool may have its own biases, and some may not be as sensitive or reliable as others. Our overarching recommendation for both researchers and clinicians is to employ multiple DIA analysis tools, utilizing orthogonal analysis approaches to enhance the robustness and reliability of their findings.
Collapse
Affiliation(s)
- Klemens Fröhlich
- Proteomics Core Facility, Biozentrum Basel, University of Basel, Basel, Switzerland
| | - Matthias Fahrner
- Institute for Surgical Pathology, Medical Center - University of Freiburg, Faculty of Medicine, University of Freiburg, Freiburg, Germany; German Cancer Consortium (DKTK) and Cancer Research Center (DKFZ), Freiburg, Germany
| | - Eva Brombacher
- Institute of Medical Biometry and Statistics, Faculty of Medicine and Medical Center-University of Freiburg, Freiburg, Germany; Centre for Integrative Biological Signaling Studies (CIBSS), University of Freiburg, Freiburg, Germany; Spemann Graduate School of Biology and Medicine (SGBM), University of Freiburg, Freiburg, Germany; Faculty of Biology, University of Freiburg, Freiburg, Germany
| | - Adrianna Seredynska
- Institute for Surgical Pathology, Medical Center - University of Freiburg, Faculty of Medicine, University of Freiburg, Freiburg, Germany; German Cancer Consortium (DKTK) and Cancer Research Center (DKFZ), Freiburg, Germany; Faculty of Biology, University of Freiburg, Freiburg, Germany
| | - Maximilian Maldacker
- Institute for Surgical Pathology, Medical Center - University of Freiburg, Faculty of Medicine, University of Freiburg, Freiburg, Germany; Faculty of Biology, University of Freiburg, Freiburg, Germany
| | - Clemens Kreutz
- Institute of Medical Biometry and Statistics, Faculty of Medicine and Medical Center-University of Freiburg, Freiburg, Germany; Centre for Integrative Biological Signaling Studies (CIBSS), University of Freiburg, Freiburg, Germany
| | - Alexander Schmidt
- Proteomics Core Facility, Biozentrum Basel, University of Basel, Basel, Switzerland
| | - Oliver Schilling
- Institute for Surgical Pathology, Medical Center - University of Freiburg, Faculty of Medicine, University of Freiburg, Freiburg, Germany; German Cancer Consortium (DKTK) and Cancer Research Center (DKFZ), Freiburg, Germany.
| |
Collapse
|
2
|
Allen C, Meinl R, Paez JS, Searle BC, Just S, Pino LK, Fondrie WE. nf-encyclopedia: A Cloud-Ready Pipeline for Chromatogram Library Data-Independent Acquisition Proteomics Workflows. J Proteome Res 2023; 22:2743-2749. [PMID: 37417926 DOI: 10.1021/acs.jproteome.2c00613] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/08/2023]
Abstract
Data-independent acquisition (DIA) mass spectrometry methods provide systematic and comprehensive quantification of the proteome; yet, relatively few open-source tools are available to analyze DIA proteomics experiments. Fewer still are tools that can leverage gas phase fractionated (GPF) chromatogram libraries to enhance the detection and quantification of peptides in these experiments. Here, we present nf-encyclopedia, an open-source NextFlow pipeline that connects three open-source tools, MSConvert, EncyclopeDIA, and MSstats, to analyze DIA proteomics experiments with or without chromatogram libraries. We demonstrate that nf-encyclopedia is reproducible when run on either a cloud platform or a local workstation and provides robust peptide and protein quantification. Additionally, we found that MSstats enhances protein-level quantitative performance over EncyclopeDIA alone. Finally, we benchmarked the ability of nf-encyclopedia to scale to large experiments in the cloud by leveraging the parallelization of compute resources. The nf-encyclopedia pipeline is available under a permissive Apache 2.0 license; run it on your desktop, cluster, or in the cloud: https://github.com/TalusBio/nf-encyclopedia.
Collapse
Affiliation(s)
- Carolyn Allen
- Talus Bioscience, Seattle, Washington 98122, United States
| | - Rico Meinl
- Talus Bioscience, Seattle, Washington 98122, United States
| | | | - Brian C Searle
- Department of Biomedical Informatics, The Ohio State University, Columbus, Ohio 43210, United States
- Pelotonia Institute for Immuno-Oncology, The Ohio State University Comprehensive Cancer Center, Columbus, Ohio 43210, United States
- Proteome Software, Inc., Portland, Oregon 97219, United States
| | - Seth Just
- Proteome Software, Inc., Portland, Oregon 97219, United States
| | - Lindsay K Pino
- Talus Bioscience, Seattle, Washington 98122, United States
| | | |
Collapse
|
3
|
Mehta S, Bernt M, Chambers M, Fahrner M, Föll MC, Gruening B, Horro C, Johnson JE, Loux V, Rajczewski AT, Schilling O, Vandenbrouck Y, Gustafsson OJR, Thang WCM, Hyde C, Price G, Jagtap PD, Griffin TJ. A Galaxy of informatics resources for MS-based proteomics. Expert Rev Proteomics 2023; 20:251-266. [PMID: 37787106 DOI: 10.1080/14789450.2023.2265062] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2023] [Accepted: 09/06/2023] [Indexed: 10/04/2023]
Abstract
INTRODUCTION Continuous advances in mass spectrometry (MS) technologies have enabled deeper and more reproducible proteome characterization and a better understanding of biological systems when integrated with other 'omics data. Bioinformatic resources meeting the analysis requirements of increasingly complex MS-based proteomic data and associated multi-omic data are critically needed. These requirements included availability of software that would span diverse types of analyses, scalability for large-scale, compute-intensive applications, and mechanisms to ease adoption of the software. AREAS COVERED The Galaxy ecosystem meets these requirements by offering a multitude of open-source tools for MS-based proteomics analyses and applications, all in an adaptable, scalable, and accessible computing environment. A thriving global community maintains these software and associated training resources to empower researcher-driven analyses. EXPERT OPINION The community-supported Galaxy ecosystem remains a crucial contributor to basic biological and clinical studies using MS-based proteomics. In addition to the current status of Galaxy-based resources, we describe ongoing developments for meeting emerging challenges in MS-based proteomic informatics. We hope this review will catalyze increased use of Galaxy by researchers employing MS-based proteomics and inspire software developers to join the community and implement new tools, workflows, and associated training content that will add further value to this already rich ecosystem.
Collapse
Affiliation(s)
- Subina Mehta
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, Minneapolis, MN, USA
| | - Matthias Bernt
- Helmholtz Centre for Environmental Research - UFZ, Department Computational Biology, Leipzig, Germany
| | | | - Matthias Fahrner
- Institute for Surgical Pathology, Medical Center - University of Freiburg, Freiburg, Germany
- German Cancer Consortium (DKTK) and German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Melanie Christine Föll
- Institute for Surgical Pathology, Medical Center - University of Freiburg, Freiburg, Germany
- German Cancer Consortium (DKTK) and German Cancer Research Center (DKFZ), Heidelberg, Germany
- Khoury College of Computer Sciences, Northeastern University, Boston, MA, USA
| | - Bjoern Gruening
- Bioinformatics Group, Department of Computer Science, Albert-Ludwigs-University Freiburg, Freiburg, Germany
| | - Carlos Horro
- Proteomics Unit, Department of Biomedicine, University of Bergen, Bergen, Norway
- Computational Biology Unit, Department of Informatics, University of Bergen, Bergen, Norway
| | - James E Johnson
- Minnesota Supercomputing Institute, University of Minnesota, Minneapolis, MN, USA
| | - Valentin Loux
- Université Paris-Saclay, INRAE, MaIAGE, Jouy-en-Josas, France
- Université Paris-Saclay, INRAE, BioinfOmics, MIGALE bioinformatics facility, Jouy-en-Josas, France
| | - Andrew T Rajczewski
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, Minneapolis, MN, USA
| | - Oliver Schilling
- Institute for Surgical Pathology, Medical Center - University of Freiburg, Freiburg, Germany
- German Cancer Consortium (DKTK) and German Cancer Research Center (DKFZ), Heidelberg, Germany
| | | | | | - W C Mike Thang
- Queensland Cyber Infrastructure Foundation (QCIF), Australia
- Institute of Molecular Bioscience, University of Queensland, St Lucia, Australia
| | - Cameron Hyde
- Queensland Cyber Infrastructure Foundation (QCIF), Australia
- Sippy Downs, University of the Sunshine Coast, Australia
| | - Gareth Price
- Queensland Cyber Infrastructure Foundation (QCIF), Australia
- Institute of Molecular Bioscience, University of Queensland, St Lucia, Australia
| | - Pratik D Jagtap
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, Minneapolis, MN, USA
| | - Timothy J Griffin
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, Minneapolis, MN, USA
| |
Collapse
|
4
|
Pan Z, Fan L, Zhong Y, Guo J, Dong X, Xu X, Wang C, Su Y. Quantitative proteomics reveals reduction in central carbon and energy metabolisms contributes to gentamicin resistance in Staphylococcus aureus. J Proteomics 2023; 277:104849. [PMID: 36809838 DOI: 10.1016/j.jprot.2023.104849] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2023] [Revised: 02/11/2023] [Accepted: 02/12/2023] [Indexed: 02/22/2023]
Abstract
The emergence of antibiotic resistance greatly increases the difficulty of treating bacterial infections. In order to develop effective treatments, the underlying mechanisms of antibiotic resistance must be understood. In this study, Staphylococcus aureus ATCC6538 strain was passaged in medium with and without gentamicin and obtained lab-evolved gentamicin-resistant S. aureus (RGEN) and gentamicin-sensitive S. aureus (SGEN) strains, respectively. Data-Independent Acquisition (DIA)-based proteomics approach was applied to compare the two strains. A total of 1426 proteins were identified, of which 462 were significantly different: 126 were upregulated and 336 were downregulated in RGEN compared to SGEN. Further analysis found that reduced protein biosynthesis was a characteristic feature in RGEN, related to metabolic suppression. The most differentially expressed proteins were involved in metabolic pathways. In RGEN, central carbon metabolism was dysregulated and energy metabolism decreased. After verification, it was found that the levels of NADH, ATP, and reactive oxygen species (ROS) decreased, and superoxide dismutase and catalase activities increased. These findings suggest that inhibition of central carbon and energy metabolic pathways may play an important role in the resistance of S. aureus to gentamicin, and that gentamicin resistance is associated with oxidative stress. Significance: The overuse and misuse of antibiotics have led to bacterial antibiotic resistance, which is a serious threat to human health. Understanding the mechanisms of antibiotic resistance will help better control these antibiotic-resistant pathogens in the future. The present study characterized the differential proteome of gentamicin-resistant Staphylococcus aureus using the most advanced DIA-based proteomics technology. Many of the differential expressed proteins were related to metabolism, specifically, reduced central carbon and energy metabolism. Lower levels of NADH, ROS, and ATP were detected as a consequence of the reduced metabolism. These results reveal that downregulation of protein expression affecting central carbon and energy metabolisms may play an important role in the resistance of S. aureus to gentamicin.
Collapse
Affiliation(s)
- Zhiyu Pan
- Department of Cell Biology & Institute of Biomedicine, National Engineering Research Center of Genetic Medicine, MOE Key Laboratory of Tumor Molecular Biology, Guangdong Provincial Key Laboratory of Bioengineering Medicine, College of Life Science and Technology, Jinan University, Guangzhou 510632, China
| | - Lvyuan Fan
- Department of Cell Biology & Institute of Biomedicine, National Engineering Research Center of Genetic Medicine, MOE Key Laboratory of Tumor Molecular Biology, Guangdong Provincial Key Laboratory of Bioengineering Medicine, College of Life Science and Technology, Jinan University, Guangzhou 510632, China
| | - Yilin Zhong
- Department of Cell Biology & Institute of Biomedicine, National Engineering Research Center of Genetic Medicine, MOE Key Laboratory of Tumor Molecular Biology, Guangdong Provincial Key Laboratory of Bioengineering Medicine, College of Life Science and Technology, Jinan University, Guangzhou 510632, China
| | - Juan Guo
- Department of Cell Biology & Institute of Biomedicine, National Engineering Research Center of Genetic Medicine, MOE Key Laboratory of Tumor Molecular Biology, Guangdong Provincial Key Laboratory of Bioengineering Medicine, College of Life Science and Technology, Jinan University, Guangzhou 510632, China
| | - Xuesa Dong
- Shandong Freshwater Fisheries Research Institute, Jinan 250013, China
| | - Xiao Xu
- Shandong Freshwater Fisheries Research Institute, Jinan 250013, China
| | - Chao Wang
- Shandong Freshwater Fisheries Research Institute, Jinan 250013, China.
| | - Yubin Su
- Department of Cell Biology & Institute of Biomedicine, National Engineering Research Center of Genetic Medicine, MOE Key Laboratory of Tumor Molecular Biology, Guangdong Provincial Key Laboratory of Bioengineering Medicine, College of Life Science and Technology, Jinan University, Guangzhou 510632, China.
| |
Collapse
|
5
|
Wu J, Cao L, Wang J, Wang Y, Hao H, Huang L. Characterization of serum protein expression profiles in the early sarcopenia older adults with low grip strength: a cross-sectional study. BMC Musculoskelet Disord 2022; 23:894. [PMID: 36192674 PMCID: PMC9528053 DOI: 10.1186/s12891-022-05844-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/16/2022] [Accepted: 09/20/2022] [Indexed: 11/25/2022] Open
Abstract
Background Sarcopenia refers to the progressive loss of skeletal muscle mass and muscle function, which seriously threatens the quality of life of the older adults. Therefore, early diagnosis is urgently needed. This study aimed to explore the changes of serum protein profiles in sarcopenia patients through a cross-sectional study, and to provide the reference for clinical diagnosis. Methods This study was a cross-sectional study carried out in the Tianjin institute of physical education teaching experiment training center from December 2019 to December 2020. Ten older adults were recruited, including 5 sarcopenia and 5 healthy older adults. After a detailed diagnostic evaluation, blood samples were collected to prepare serum for proteomic analysis using the HPLC System Easy nLC method. The differentially expressed proteins (DEPs) were screened by the limma package of R software (version 4.1.0). Results A total of 114 DEPs were identified between the patients and healthy older adults, including 48 up-regulated proteins and 66 down-regulated proteins. The functional enrichment analysis showed that the 114 DEPs were significantly enriched in 153 GO terms, which mainly involved in low-density lipoprotein particle remodeling, and negative regulation of immune response,etc. The PPI network further suggested that the cholesteryl ester transfer protein and Apolipoprotein A2 could serve as biomarkers to facilitate diagnosis of sarcopenia. Conclusions This study provided a serum proteomic profile of sarcopenia patients, and identified two proteins with diagnostic value, which might help to improve the diagnostic accuracy of sarcopenia. Supplementary Information The online version contains supplementary material available at 10.1186/s12891-022-05844-2.
Collapse
Affiliation(s)
- Jingqiong Wu
- TianJin University of Sport, No.16 Donghai Road, West Tuanbo New Town, Jinghai District, Tianjin, 301617, PR China.,Guangxi Medical University, Nanning, 530021, Guangxi, PR China
| | - Longjun Cao
- TianJin University of Sport, No.16 Donghai Road, West Tuanbo New Town, Jinghai District, Tianjin, 301617, PR China
| | - Jiazhi Wang
- TianJin University of Sport, No.16 Donghai Road, West Tuanbo New Town, Jinghai District, Tianjin, 301617, PR China
| | - Yizhao Wang
- Tianjin Huanhu Hospital, Tianjin, 300350, PR China
| | - Huimin Hao
- TianJin University of Sport, No.16 Donghai Road, West Tuanbo New Town, Jinghai District, Tianjin, 301617, PR China
| | - Liping Huang
- TianJin University of Sport, No.16 Donghai Road, West Tuanbo New Town, Jinghai District, Tianjin, 301617, PR China.
| |
Collapse
|
6
|
Perez-Riverol Y. Proteomic repository data submission, dissemination, and reuse: key messages. Expert Rev Proteomics 2022; 19:297-310. [PMID: 36529941 PMCID: PMC7614296 DOI: 10.1080/14789450.2022.2160324] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2022] [Accepted: 12/07/2022] [Indexed: 12/23/2022]
Abstract
INTRODUCTION The creation of ProteomeXchange data workflows in 2012 transformed the field of proteomics, consisting of the standardization of data submission and dissemination and enabling the widespread reanalysis of public MS proteomics data worldwide. ProteomeXchange has triggered a growing trend toward public dissemination of proteomics data, facilitating the assessment, reuse, comparative analyses, and extraction of new findings from public datasets. By 2022, the consortium is integrated by PRIDE, PeptideAtlas, MassIVE, jPOST, iProX, and Panorama Public. AREAS COVERED Here, we review and discuss the current ecosystem of resources, guidelines, and file formats for proteomics data dissemination and reanalysis. Special attention is drawn to new exciting quantitative and post-translational modification-oriented resources. The challenges and future directions on data depositions including the lack of metadata and cloud-based and high-performance software solutions for fast and reproducible reanalysis of the available data are discussed. EXPERT OPINION The success of ProteomeXchange and the amount of proteomics data available in the public domain have triggered the creation and/or growth of other protein knowledgebase resources. Data reuse is a leading, active, and evolving field; supporting the creation of new formats, tools, and workflows to rediscover and reshape the public proteomics data.
Collapse
Affiliation(s)
- Yasset Perez-Riverol
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, UK
| |
Collapse
|
7
|
Walzer M, García-Seisdedos D, Prakash A, Brack P, Crowther P, Graham RL, George N, Mohammed S, Moreno P, Papatheodorou I, Hubbard SJ, Vizcaíno JA. Implementing the reuse of public DIA proteomics datasets: from the PRIDE database to Expression Atlas. Sci Data 2022; 9:335. [PMID: 35701420 PMCID: PMC9197839 DOI: 10.1038/s41597-022-01380-9] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2021] [Accepted: 05/12/2022] [Indexed: 11/14/2022] Open
Abstract
The number of mass spectrometry (MS)-based proteomics datasets in the public domain keeps increasing, particularly those generated by Data Independent Acquisition (DIA) approaches such as SWATH-MS. Unlike Data Dependent Acquisition datasets, the re-use of DIA datasets has been rather limited to date, despite its high potential, due to the technical challenges involved. We introduce a (re-)analysis pipeline for public SWATH-MS datasets which includes a combination of metadata annotation protocols, automated workflows for MS data analysis, statistical analysis, and the integration of the results into the Expression Atlas resource. Automation is orchestrated with Nextflow, using containerised open analysis software tools, rendering the pipeline readily available and reproducible. To demonstrate its utility, we reanalysed 10 public DIA datasets from the PRIDE database, comprising 1,278 SWATH-MS runs. The robustness of the analysis was evaluated, and the results compared to those obtained in the original publications. The final expression values were integrated into Expression Atlas, making SWATH-MS experiments more widely available and combining them with expression data originating from other proteomics and transcriptomics datasets.
Collapse
Affiliation(s)
- Mathias Walzer
- European Molecular Biology Laboratory, EMBL-European Bioinformatics Institute (EMBL-EBI), Hinxton, Cambridge, CB10 1SD, United Kingdom.
| | - David García-Seisdedos
- European Molecular Biology Laboratory, EMBL-European Bioinformatics Institute (EMBL-EBI), Hinxton, Cambridge, CB10 1SD, United Kingdom
| | - Ananth Prakash
- European Molecular Biology Laboratory, EMBL-European Bioinformatics Institute (EMBL-EBI), Hinxton, Cambridge, CB10 1SD, United Kingdom
| | - Paul Brack
- Division of Evolution, Infection and Genomics, School of Biological Sciences, Faculty of Biology, Medicine and Health, University of Manchester, Manchester Academic Health Science Centre, Oxford Road, Manchester, M13 9PT, United Kingdom
| | - Peter Crowther
- Melandra Limited, 16 Brook Road, Urmston, Manchester, M41 5RY, United Kingdom
| | - Robert L Graham
- School of Biological Sciences, Chlorine Gardens, Queen's University Belfast, Belfast, BT9 5DL, United Kingdom
| | - Nancy George
- European Molecular Biology Laboratory, EMBL-European Bioinformatics Institute (EMBL-EBI), Hinxton, Cambridge, CB10 1SD, United Kingdom
| | - Suhaib Mohammed
- European Molecular Biology Laboratory, EMBL-European Bioinformatics Institute (EMBL-EBI), Hinxton, Cambridge, CB10 1SD, United Kingdom
| | - Pablo Moreno
- European Molecular Biology Laboratory, EMBL-European Bioinformatics Institute (EMBL-EBI), Hinxton, Cambridge, CB10 1SD, United Kingdom
| | - Irene Papatheodorou
- European Molecular Biology Laboratory, EMBL-European Bioinformatics Institute (EMBL-EBI), Hinxton, Cambridge, CB10 1SD, United Kingdom
| | - Simon J Hubbard
- Division of Evolution, Infection and Genomics, School of Biological Sciences, Faculty of Biology, Medicine and Health, University of Manchester, Manchester Academic Health Science Centre, Oxford Road, Manchester, M13 9PT, United Kingdom
| | - Juan Antonio Vizcaíno
- European Molecular Biology Laboratory, EMBL-European Bioinformatics Institute (EMBL-EBI), Hinxton, Cambridge, CB10 1SD, United Kingdom.
| |
Collapse
|
8
|
Data-Independent Acquisition-Based Mass Spectrometry (DIA-MS) for Quantitative Analysis of Human Intestinal Ischemia/Reperfusion. Appl Biochem Biotechnol 2022; 194:4156-4168. [PMID: 35666382 DOI: 10.1007/s12010-022-04005-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/27/2022] [Indexed: 11/02/2022]
Abstract
Intestinal ischemia-reperfusion (II/R) injury is a complex pathologic process, which is of great significance to unravel the underlying mechanisms and pathophysiology. Our study represented a comprehensive proteomic analysis in the human intestine with ischemia-reperfusion injury. The proteomics analysis measured a total of 5,230 proteins, and 417 differently expressed proteins (DEPs) were identified between II/R and control samples. GO and KEGG analysis demonstrated that the 290 upregulated DEPs in II/R were significantly involved in immune-related biological process and tight junction, focal adhesion, and cAMP signaling pathway, whereas the 127 downregulated DEPs in II/R were enriched in lipid metabolic process and metabolic pathway. Furthermore, we screened out 20 hub proteins from the protein-protein interaction (PPI) network according to the degree of connectivity, and six clusters were identified. Combined with the result of KEGG analysis, 6 from the 20 hub proteins, ACTB, CAV1, FLNA, MYLK, ACTN1, and MYL9, were identified as the key proteins in the progress of II/R injury. According to the previous studies, FLNA and MYL9 were selected as the novel disease-related proteins for the first time. In conclusion, this study extended our understanding of the alteration in the human intestine during ischemia and reperfusion and highlighted the potential role of FLNA and MYL9 in the progress of II/R injury, which need to be further studied.
Collapse
|
9
|
Fröhlich K, Brombacher E, Fahrner M, Vogele D, Kook L, Pinter N, Bronsert P, Timme-Bronsert S, Schmidt A, Bärenfaller K, Kreutz C, Schilling O. Benchmarking of analysis strategies for data-independent acquisition proteomics using a large-scale dataset comprising inter-patient heterogeneity. Nat Commun 2022; 13:2622. [PMID: 35551187 PMCID: PMC9098472 DOI: 10.1038/s41467-022-30094-0] [Citation(s) in RCA: 28] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2021] [Accepted: 04/14/2022] [Indexed: 12/25/2022] Open
Abstract
Numerous software tools exist for data-independent acquisition (DIA) analysis of clinical samples, necessitating their comprehensive benchmarking. We present a benchmark dataset comprising real-world inter-patient heterogeneity, which we use for in-depth benchmarking of DIA data analysis workflows for clinical settings. Combining spectral libraries, DIA software, sparsity reduction, normalization, and statistical tests results in 1428 distinct data analysis workflows, which we evaluate based on their ability to correctly identify differentially abundant proteins. From our dataset, we derive bootstrap datasets of varying sample sizes and use the whole range of bootstrap datasets to robustly evaluate each workflow. We find that all DIA software suites benefit from using a gas-phase fractionated spectral library, irrespective of the library refinement used. Gas-phase fractionation-based libraries perform best against two out of three reference protein lists. Among all investigated statistical tests non-parametric permutation-based statistical tests consistently perform best. Data independent acquisition (DIA) has been gaining momentum in clinical proteomics. Here, the authors create a benchmark dataset comprising inter-patient heterogeneity to compare popular DIA data analysis workflows for identifying differentially abundant proteins.
Collapse
Affiliation(s)
- Klemens Fröhlich
- Institute for Surgical Pathology, Medical Center-University of Freiburg, Faculty of Medicine, University of Freiburg, Freiburg im Breisgau, Germany.,Faculty of Biology, University of Freiburg, Freiburg im Breisgau, Germany.,Spemann Graduate School of Biology and Medicine (SGBM), University of Freiburg, Freiburg im Breisgau, Germany
| | - Eva Brombacher
- Faculty of Biology, University of Freiburg, Freiburg im Breisgau, Germany.,Spemann Graduate School of Biology and Medicine (SGBM), University of Freiburg, Freiburg im Breisgau, Germany.,Institute of Medical Biometry and Statistics, Faculty of Medicine and Medical Center - University of Freiburg, Freiburg im Breisgau, Germany.,Centre for Integrative Biological Signaling Studies (CIBSS), University of Freiburg, Freiburg im Breisgau, Germany
| | - Matthias Fahrner
- Institute for Surgical Pathology, Medical Center-University of Freiburg, Faculty of Medicine, University of Freiburg, Freiburg im Breisgau, Germany.,Faculty of Biology, University of Freiburg, Freiburg im Breisgau, Germany.,Spemann Graduate School of Biology and Medicine (SGBM), University of Freiburg, Freiburg im Breisgau, Germany
| | - Daniel Vogele
- Institute for Surgical Pathology, Medical Center-University of Freiburg, Faculty of Medicine, University of Freiburg, Freiburg im Breisgau, Germany.,Faculty of Biology, University of Freiburg, Freiburg im Breisgau, Germany
| | - Lucas Kook
- Epidemiology, Biostatistics & Prevention Institute, University of Zurich, Zurich, Switzerland.,Institute for Data Analysis and Process Design, Zurich University of Applied Sciences, Winterthur, Switzerland
| | - Niko Pinter
- Institute for Surgical Pathology, Medical Center-University of Freiburg, Faculty of Medicine, University of Freiburg, Freiburg im Breisgau, Germany
| | - Peter Bronsert
- Institute for Surgical Pathology, Medical Center-University of Freiburg, Faculty of Medicine, University of Freiburg, Freiburg im Breisgau, Germany.,German Cancer Consortium (DKTK) and German Cancer Research Center (DKFZ), Heidelberg, Germany.,Tumorbank Comprehensive Cancer Center Freiburg, Medical Center University of Freiburg, Freiburg im Breisgau, Germany
| | - Sylvia Timme-Bronsert
- Institute for Surgical Pathology, Medical Center-University of Freiburg, Faculty of Medicine, University of Freiburg, Freiburg im Breisgau, Germany.,Tumorbank Comprehensive Cancer Center Freiburg, Medical Center University of Freiburg, Freiburg im Breisgau, Germany
| | - Alexander Schmidt
- Proteomics Core Facility, Biozentrum, University of Basel, Basel, Switzerland
| | - Katja Bärenfaller
- Swiss Institute of Allergy and Asthma Research (SIAF), University of Zurich, and Swiss Institute of Bioinformatics (SIB), Wolfgang, Switzerland
| | - Clemens Kreutz
- Institute of Medical Biometry and Statistics, Faculty of Medicine and Medical Center - University of Freiburg, Freiburg im Breisgau, Germany.,Centre for Integrative Biological Signaling Studies (CIBSS), University of Freiburg, Freiburg im Breisgau, Germany
| | - Oliver Schilling
- Institute for Surgical Pathology, Medical Center-University of Freiburg, Faculty of Medicine, University of Freiburg, Freiburg im Breisgau, Germany. .,German Cancer Consortium (DKTK) and German Cancer Research Center (DKFZ), Heidelberg, Germany. .,BIOSS Centre for Biological Signaling Studies, University of Freiburg, Freiburg im Breisgau, Germany.
| |
Collapse
|
10
|
Pinter N, Glätzer D, Fahrner M, Fröhlich K, Johnson J, Grüning BA, Warscheid B, Drepper F, Schilling O, Föll MC. MaxQuant and MSstats in Galaxy Enable Reproducible Cloud-Based Analysis of Quantitative Proteomics Experiments for Everyone. J Proteome Res 2022; 21:1558-1565. [PMID: 35503992 DOI: 10.1021/acs.jproteome.2c00051] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Quantitative mass spectrometry-based proteomics has become a high-throughput technology for the identification and quantification of thousands of proteins in complex biological samples. Two frequently used tools, MaxQuant and MSstats, allow for the analysis of raw data and finding proteins with differential abundance between conditions of interest. To enable accessible and reproducible quantitative proteomics analyses in a cloud environment, we have integrated MaxQuant (including TMTpro 16/18plex), Proteomics Quality Control (PTXQC), MSstats, and MSstatsTMT into the open-source Galaxy framework. This enables the web-based analysis of label-free and isobaric labeling proteomics experiments via Galaxy's graphical user interface on public clouds. MaxQuant and MSstats in Galaxy can be applied in conjunction with thousands of existing Galaxy tools and integrated into standardized, sharable workflows. Galaxy tracks all metadata and intermediate results in analysis histories, which can be shared privately for collaborations or publicly, allowing full reproducibility and transparency of published analysis. To further increase accessibility, we provide detailed hands-on training materials. The integration of MaxQuant and MSstats into the Galaxy framework enables their usage in a reproducible way on accessible large computational infrastructures, hence realizing the foundation for high-throughput proteomics data science for everyone.
Collapse
Affiliation(s)
- Niko Pinter
- Institute for Surgical Pathology, Medical Center, University of Freiburg, 79106 Freiburg, Germany.,Faculty of Medicine, University of Freiburg, 79110 Freiburg, Germany
| | - Damian Glätzer
- Biochemistry and Functional Proteomics, Institute of Biology II, Faculty of Biology, University of Freiburg, 79104 Freiburg, Germany
| | - Matthias Fahrner
- Institute for Surgical Pathology, Medical Center, University of Freiburg, 79106 Freiburg, Germany.,Faculty of Medicine, University of Freiburg, 79110 Freiburg, Germany.,Faculty of Biology, University of Freiburg, 79104 Freiburg, Germany
| | - Klemens Fröhlich
- Institute for Surgical Pathology, Medical Center, University of Freiburg, 79106 Freiburg, Germany.,Faculty of Medicine, University of Freiburg, 79110 Freiburg, Germany.,Faculty of Biology, University of Freiburg, 79104 Freiburg, Germany.,Spemann Graduate School of Biology and Medicine (SGBM), Albert-Ludwigs-University Freiburg, 79104 Freiburg, Germany
| | - James Johnson
- Minnesota Supercomputing Institute, University of Minnesota, Minneapolis, Minnesota 55455, United States
| | | | - Bettina Warscheid
- Biochemistry and Functional Proteomics, Institute of Biology II, Faculty of Biology, University of Freiburg, 79104 Freiburg, Germany.,Faculty of Chemistry and Pharmacy, Department of Biochemistry, Julius Maximilian University of Würzburg, 97074 Würzburg, Germany
| | - Friedel Drepper
- Biochemistry and Functional Proteomics, Institute of Biology II, Faculty of Biology, University of Freiburg, 79104 Freiburg, Germany
| | - Oliver Schilling
- Institute for Surgical Pathology, Medical Center, University of Freiburg, 79106 Freiburg, Germany.,Faculty of Medicine, University of Freiburg, 79110 Freiburg, Germany.,German Cancer Consortium (DKTK) and Cancer Research Center (DKFZ), 79106 Freiburg, Germany
| | - Melanie Christine Föll
- Institute for Surgical Pathology, Medical Center, University of Freiburg, 79106 Freiburg, Germany.,Faculty of Medicine, University of Freiburg, 79110 Freiburg, Germany.,Khoury College of Computer Sciences, Northeastern University, Boston, Massachusetts 02115, United States
| |
Collapse
|
11
|
Fahrner M, Föll MC, Grüning BA, Bernt M, Röst H, Schilling O. Democratizing data-independent acquisition proteomics analysis on public cloud infrastructures via the Galaxy framework. Gigascience 2022; 11:giac005. [PMID: 35166338 PMCID: PMC8848309 DOI: 10.1093/gigascience/giac005] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2021] [Revised: 11/26/2021] [Accepted: 01/12/2022] [Indexed: 12/30/2022] Open
Abstract
BACKGROUND Data-independent acquisition (DIA) has become an important approach in global, mass spectrometric proteomic studies because it provides in-depth insights into the molecular variety of biological systems. However, DIA data analysis remains challenging owing to the high complexity and large data and sample size, which require specialized software and vast computing infrastructures. Most available open-source DIA software necessitates basic programming skills and covers only a fraction of a complete DIA data analysis. In consequence, DIA data analysis often requires usage of multiple software tools and compatibility thereof, severely limiting the usability and reproducibility. FINDINGS To overcome this hurdle, we have integrated a suite of open-source DIA tools in the Galaxy framework for reproducible and version-controlled data processing. The DIA suite includes OpenSwath, PyProphet, diapysef, and swath2stats. We have compiled functional Galaxy pipelines for DIA processing, which provide a web-based graphical user interface to these pre-installed and pre-configured tools for their use on freely accessible, powerful computational resources of the Galaxy framework. This approach also enables seamless sharing workflows with full configuration in addition to sharing raw data and results. We demonstrate the usability of an all-in-one DIA pipeline in Galaxy by the analysis of a spike-in case study dataset. Additionally, extensive training material is provided to further increase access for the proteomics community. CONCLUSION The integration of an open-source DIA analysis suite in the web-based and user-friendly Galaxy framework in combination with extensive training material empowers a broad community of researches to perform reproducible and transparent DIA data analysis.
Collapse
Affiliation(s)
- Matthias Fahrner
- Institute for Surgical Pathology, Medical Center–University of Freiburg, Faculty of Medicine, University of Freiburg, Breisacher Straße 115a, D-79106 Freiburg, Germany
- Faculty of Biology, Albert-Ludwigs-University Freiburg, Schänzlestraße 1, D-79104 Freiburg, Germany
- Spemann Graduate School of Biology and Medicine (SGBM), University of Freiburg, Albertstraße 19A, D-79104, Germany
| | - Melanie Christine Föll
- Institute for Surgical Pathology, Medical Center–University of Freiburg, Faculty of Medicine, University of Freiburg, Breisacher Straße 115a, D-79106 Freiburg, Germany
- Khoury College of Computer Sciences, Northeastern University, 440 Huntington Ave, Boston, MA 02115, USA
| | - Björn Andreas Grüning
- Department of Computer Science, University of Freiburg, Georges-Köhler-Allee 106, D-79110 Freiburg, Germany
| | - Matthias Bernt
- Young Investigators Group Bioinformatics and Transcriptomics, Helmholtz Centre for Environmental Research–UFZ, Permoserstraße 15, D-04318 Leipzig, Germany
| | - Hannes Röst
- Donnelly Centre, University of Toronto, 160 College St, Toronto, ON M5S 3E1, Canada
| | - Oliver Schilling
- Institute for Surgical Pathology, Medical Center–University of Freiburg, Faculty of Medicine, University of Freiburg, Breisacher Straße 115a, D-79106 Freiburg, Germany
- German Cancer Consortium (DKTK) and German Cancer Research Center (DKFZ), Hugstetter Straße 55, D-79106 Freiburg, Germany
- BIOSS Centre for Biological Signaling Studies, University of Freiburg, Schänzlestraße 18, D-79104 Freiburg, Germany
| |
Collapse
|