1
|
Fallon TR, Shende VV, Wierzbicki IH, Pendleton AL, Watervoort NF, Auber RP, Gonzalez DJ, Wisecaver JH, Moore BS. Giant polyketide synthase enzymes in the biosynthesis of giant marine polyether toxins. Science 2024; 385:671-678. [PMID: 39116217 DOI: 10.1126/science.ado3290] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2024] [Accepted: 07/10/2024] [Indexed: 08/10/2024]
Abstract
Prymnesium parvum are harmful haptophyte algae that cause massive environmental fish kills. Their polyketide polyether toxins, the prymnesins, are among the largest nonpolymeric compounds in nature and have biosynthetic origins that have remained enigmatic for more than 40 years. In this work, we report the "PKZILLAs," massive P. parvum polyketide synthase (PKS) genes that have evaded previous detection. PKZILLA-1 and -2 encode giant protein products of 4.7 and 3.2 megadaltons that have 140 and 99 enzyme domains. Their predicted polyene product matches the proposed pre-prymnesin precursor of the 90-carbon-backbone A-type prymnesins. We further characterize the variant PKZILLA-B1, which is responsible for the shorter B-type analog prymnesin-B1, from P. parvum RCC3426 and thus establish a general model of haptophyte polyether biosynthetic logic. This work expands expectations of genetic and enzymatic size limits in biology.
Collapse
Affiliation(s)
- Timothy R Fallon
- Center for Marine Biotechnology and Biomedicine, Scripps Institution of Oceanography and University of California, San Diego, La Jolla, CA 92093, USA
| | - Vikram V Shende
- Center for Marine Biotechnology and Biomedicine, Scripps Institution of Oceanography and University of California, San Diego, La Jolla, CA 92093, USA
| | - Igor H Wierzbicki
- Department of Pharmacology, University of California, San Diego, La Jolla, CA 92093, USA
| | - Amanda L Pendleton
- Department of Biochemistry, Purdue University, West Lafayette, IN 47907, USA
- Purdue Center for Plant Biology, Purdue University, West Lafayette, IN 47907, USA
| | - Nathan F Watervoort
- Department of Biochemistry, Purdue University, West Lafayette, IN 47907, USA
- Purdue Center for Plant Biology, Purdue University, West Lafayette, IN 47907, USA
| | - Robert P Auber
- Department of Biochemistry, Purdue University, West Lafayette, IN 47907, USA
- Purdue Center for Plant Biology, Purdue University, West Lafayette, IN 47907, USA
| | - David J Gonzalez
- Department of Pharmacology, University of California, San Diego, La Jolla, CA 92093, USA
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, San Diego, La Jolla, CA 92093, USA
| | - Jennifer H Wisecaver
- Department of Biochemistry, Purdue University, West Lafayette, IN 47907, USA
- Purdue Center for Plant Biology, Purdue University, West Lafayette, IN 47907, USA
| | - Bradley S Moore
- Center for Marine Biotechnology and Biomedicine, Scripps Institution of Oceanography and University of California, San Diego, La Jolla, CA 92093, USA
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, San Diego, La Jolla, CA 92093, USA
| |
Collapse
|
2
|
Sequeira JC, Pereira V, Alves MM, Pereira MA, Rocha M, Salvador AF. MOSCA 2.0: A bioinformatics framework for metagenomics, metatranscriptomics and metaproteomics data analysis and visualization. Mol Ecol Resour 2024:e13996. [PMID: 39099161 DOI: 10.1111/1755-0998.13996] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2024] [Revised: 06/14/2024] [Accepted: 07/15/2024] [Indexed: 08/06/2024]
Abstract
The analysis of meta-omics data requires the utilization of several bioinformatics tools and proficiency in informatics. The integration of multiple meta-omics data is even more challenging, and the outputs of existing bioinformatics solutions are not always easy to interpret. Here, we present a meta-omics bioinformatics pipeline, Meta-Omics Software for Community Analysis (MOSCA), which aims to overcome these limitations. MOSCA was initially developed for analysing metagenomics (MG) and metatranscriptomics (MT) data. Now, it also performs MG and metaproteomics (MP) integrated analysis, and MG/MT analysis was upgraded with an additional iterative binning step, metabolic pathways mapping, and several improvements regarding functional annotation and data visualization. MOSCA handles raw sequencing data and mass spectra and performs pre-processing, assembly, annotation, binning and differential gene/protein expression analysis. MOSCA shows taxonomic and functional analysis in large tables, performs metabolic pathways mapping, generates Krona plots and shows gene/protein expression results in heatmaps, improving omics data visualization. MOSCA is easily run from a single command while also providing a web interface (MOSGUITO). Relevant features include an extensive set of customization options, allowing tailored analyses to suit specific research objectives, and the ability to restart the pipeline from intermediary checkpoints using alternative configurations. Two case studies showcased MOSCA results, giving a complete view of the anaerobic microbial communities from anaerobic digesters and insights on the role of specific microorganisms. MOSCA represents a pivotal advancement in meta-omics research, offering an intuitive, comprehensive, and versatile solution for researchers seeking to unravel the intricate tapestry of microbial communities.
Collapse
Affiliation(s)
- João C Sequeira
- Centre of Biological Engineering, University of Minho, Braga, Portugal
| | - Vítor Pereira
- Centre of Biological Engineering, University of Minho, Braga, Portugal
| | - M Madalena Alves
- Centre of Biological Engineering, University of Minho, Braga, Portugal
| | - M Alcina Pereira
- Centre of Biological Engineering, University of Minho, Braga, Portugal
- LABBELS - Associate Laboratory, Braga/Guimarães, Portugal
| | - Miguel Rocha
- Centre of Biological Engineering, University of Minho, Braga, Portugal
- LABBELS - Associate Laboratory, Braga/Guimarães, Portugal
| | - Andreia F Salvador
- Centre of Biological Engineering, University of Minho, Braga, Portugal
- LABBELS - Associate Laboratory, Braga/Guimarães, Portugal
| |
Collapse
|
3
|
Kruk ME, Mehta S, Murray K, Higgins L, Do K, Johnson JE, Wagner R, Wendt CH, O’Connor JB, Harris JK, Laguna TA, Jagtap PD, Griffin TJ. An integrated metaproteomics workflow for studying host-microbe dynamics in bronchoalveolar lavage samples applied to cystic fibrosis disease. mSystems 2024; 9:e0092923. [PMID: 38934598 PMCID: PMC11264604 DOI: 10.1128/msystems.00929-23] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2023] [Accepted: 05/13/2024] [Indexed: 06/28/2024] Open
Abstract
Airway microbiota are known to contribute to lung diseases, such as cystic fibrosis (CF), but their contributions to pathogenesis are still unclear. To improve our understanding of host-microbe interactions, we have developed an integrated analytical and bioinformatic mass spectrometry (MS)-based metaproteomics workflow to analyze clinical bronchoalveolar lavage (BAL) samples from people with airway disease. Proteins from BAL cellular pellets were processed and pooled together in groups categorized by disease status (CF vs. non-CF) and bacterial diversity, based on previously performed small subunit rRNA sequencing data. Proteins from each pooled sample group were digested and subjected to liquid chromatography tandem mass spectrometry (MS/MS). MS/MS spectra were matched to human and bacterial peptide sequences leveraging a bioinformatic workflow using a metagenomics-guided protein sequence database and rigorous evaluation. Label-free quantification revealed differentially abundant human peptides from proteins with known roles in CF, like neutrophil elastase and collagenase, and proteins with lesser-known roles in CF, including apolipoproteins. Differentially abundant bacterial peptides were identified from known CF pathogens (e.g., Pseudomonas), as well as other taxa with potentially novel roles in CF. We used this host-microbe peptide panel for targeted parallel-reaction monitoring validation, demonstrating for the first time an MS-based assay effective for quantifying host-microbe protein dynamics within BAL cells from individual CF patients. Our integrated bioinformatic and analytical workflow combining discovery, verification, and validation should prove useful for diverse studies to characterize microbial contributors in airway diseases. Furthermore, we describe a promising preliminary panel of differentially abundant microbe and host peptide sequences for further study as potential markers of host-microbe relationships in CF disease pathogenesis.IMPORTANCEIdentifying microbial pathogenic contributors and dysregulated human responses in airway disease, such as CF, is critical to understanding disease progression and developing more effective treatments. To this end, characterizing the proteins expressed from bacterial microbes and human host cells during disease progression can provide valuable new insights. We describe here a new method to confidently detect and monitor abundance changes of both microbe and host proteins from challenging BAL samples commonly collected from CF patients. Our method uses both state-of-the art mass spectrometry-based instrumentation to detect proteins present in these samples and customized bioinformatic software tools to analyze the data and characterize detected proteins and their association with CF. We demonstrate the use of this method to characterize microbe and host proteins from individual BAL samples, paving the way for a new approach to understand molecular contributors to CF and other diseases of the airway.
Collapse
Affiliation(s)
- Monica E. Kruk
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, Minneapolis, Minneapolis, Minnesota, USA
| | - Subina Mehta
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, Minneapolis, Minneapolis, Minnesota, USA
| | - Kevin Murray
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, Minneapolis, Minneapolis, Minnesota, USA
- Center for Metabolomics and Proteomics, University of Minnesota, Minneapolis, Minnesota, USA
| | - LeeAnn Higgins
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, Minneapolis, Minneapolis, Minnesota, USA
- Center for Metabolomics and Proteomics, University of Minnesota, Minneapolis, Minnesota, USA
| | - Katherine Do
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, Minneapolis, Minneapolis, Minnesota, USA
| | - James E. Johnson
- Minnesota Supercomputing Institute, University of Minnesota, Minneapolis, Minnesota, USA
| | - Reid Wagner
- Minnesota Supercomputing Institute, University of Minnesota, Minneapolis, Minnesota, USA
| | - Chris H. Wendt
- Division of Pulmonary, Allergy, Critical Care and Sleep Medicine, Medical School, University of Minnesota, Minneapolis, Minnesota, USA
- Minneapolis VA Health Care System, Minneapolis, Minnesota, USA
| | - John B. O’Connor
- Department of Pediatrics, Division of Pulmonary and Sleep Medicine, Seattle Children’s Hospital, Seattle, Washington, USA
| | - J. Kirk Harris
- Department of Pediatrics, University of Colorado School of Medicine, Aurora, Colorado, USA
| | - Theresa A. Laguna
- Department of Pediatrics, Division of Pulmonary and Sleep Medicine, Seattle Children’s Hospital, Seattle, Washington, USA
- Department of Pediatrics, University of Washington School of Medicine, Seattle, Washington, USA
| | - Pratik D. Jagtap
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, Minneapolis, Minneapolis, Minnesota, USA
| | - Timothy J. Griffin
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, Minneapolis, Minneapolis, Minnesota, USA
| |
Collapse
|
4
|
Teles Filho ACDA, Sanchez DJD, Viana AGA, Sheheryar S, Guerreiro DD, Bustamante-Filho IC, Martins AMA, Sousa MV, Ricart CAO, Fontes W, Moura AA. A prospective study of the proteome of equine pre-implantation embryo. Reprod Domest Anim 2024; 59:e14663. [PMID: 38990011 DOI: 10.1111/rda.14663] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2024] [Revised: 06/11/2024] [Accepted: 06/24/2024] [Indexed: 07/12/2024]
Abstract
The present study was conducted to investigate the global proteome of 8-day-old equine blastocysts. Follicular dynamics of eight adult mares were monitored by ultrasonography and inseminated 24 h after the detection of a preovulatory follicle. Four expanded blastocysts were recovered, pooled, and subjected to protein extraction and mass spectrometry. Protein identification was conducted based on four database searches (PEAKS, Proteome Discoverer software, SearchGUI software, and PepExplorer). Enrichment analysis was performed using g:Profiler, Panther, and String platforms. After the elimination of identification redundancies among search tools (at three levels, based on identifiers, peptides, and cross-database mapping), 1977 proteins were reliably identified in the samples of equine embryos. Proteomic analysis unveiled robust metabolic activity in the 8-day equine embryo, highlighted by an abundance of proteins engaged in key metabolic pathways like the TCA cycle, ATP biosynthesis, and glycolysis. The prevalence of chaperones among highly abundant proteins suggests that regulation of protein folding, and degradation is a key process during embryo development. These findings pave the way for developing new strategies to improve equine embryo media and optimize in vitro fertilization techniques.
Collapse
Affiliation(s)
| | - Deisy J D Sanchez
- Laboratório de Biotecnologia, Universidad Cooperativa de Colombia, Bucaramanga, Colombia
| | | | - Sheheryar Sheheryar
- Department of Animal Science, Federal University of Ceará, Fortaleza, Brazil
| | - Denise D Guerreiro
- Department of Animal Science, Federal University of Ceará, Fortaleza, Brazil
| | | | - Aline M A Martins
- Laboratory of Protein Chemistry and Biochemistry, University of Brasília, Brasilia, Brazil
| | - Marcelo V Sousa
- Laboratory of Protein Chemistry and Biochemistry, University of Brasília, Brasilia, Brazil
| | - Carlos A O Ricart
- Laboratory of Protein Chemistry and Biochemistry, University of Brasília, Brasilia, Brazil
| | - Wagner Fontes
- Laboratory of Protein Chemistry and Biochemistry, University of Brasília, Brasilia, Brazil
| | - Arlindo A Moura
- Department of Animal Science, Federal University of Ceará, Fortaleza, Brazil
| |
Collapse
|
5
|
Anderson DC, Peterson MS, Lapp SA, Galinski MR. Proteomes of plasmodium knowlesi early and late ring-stage parasites and infected host erythrocytes. J Proteomics 2024; 302:105197. [PMID: 38759952 PMCID: PMC11357705 DOI: 10.1016/j.jprot.2024.105197] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2022] [Revised: 05/05/2024] [Accepted: 05/14/2024] [Indexed: 05/19/2024]
Abstract
The emerging malaria parasite Plasmodium knowlesi threatens the goal of worldwide malaria elimination due to its zoonotic spread in Southeast Asia. After brief ex-vivo culture we used 2D LC/MS/MS to examine the early and late ring stages of infected Macaca mulatta red blood cells harboring P. knowlesi. The M. mulatta clathrin heavy chain and T-cell and macrophage inhibitor ERMAP were overexpressed in the early ring stage; glutaredoxin 3 was overexpressed in the late ring stage; GO term differential enrichments included response to oxidative stress and the cortical cytoskeleton in the early ring stage. P. knowlesi clathrin heavy chain and 60S acidic ribosomal protein P2 were overexpressed in the late ring stage; GO term differential enrichments included vacuoles in the early ring stage, ribosomes and translation in the late ring stage, and Golgi- and COPI-coated vesicles, proteasomes, nucleosomes, vacuoles, ion-, peptide-, protein-, nucleocytoplasmic- and RNA-transport, antioxidant activity and glycolysis in both stages. SIGNIFICANCE: Due to its zoonotic spread, cases of the emerging human pathogen Plasmodium knowlesi in southeast Asia, and particularly in Malaysia, threaten regional and worldwide goals for malaria elimination. Infection by this parasite can be fatal to humans, and can be associated with significant morbidity. Due to zoonotic transmission from large macaque reservoirs that are untreatable by drugs, and outdoor biting mosquito vectors that negate use of preventive measures such as bed nets, its containment remains a challenge. Its biology remains incompletely understood. Thus we examine the expressed proteome of the early and late ex-vivo cultured ring stages, the first intraerythrocyte developmental stages after infection of host rhesus macaque erythrocytes. We used GO term enrichment strategies and differential protein expression to compare early and late ring stages. The early ring stage is characterized by the enrichment of P. knowlesi vacuoles, and overexpression of the M. mulatta clathrin heavy chain, important for clathrin-coated pits and vesicles, and clathrin-mediated endocytosis. The M. mulatta protein ERMAP was also overexpressed in the early ring stage, suggesting a potential role in early ring stage inhibition of T-cells and macrophages responding to P. knowlesi infection of reticulocytes. This could allow expansion of the host P. knowlesi cellular niche, allowing parasite adaptation to invasion of a wider age range of RBCs than the preferred young RBCs or reticulocytes, resulting in proliferation and increased pathogenesis in infected humans. Other GO terms differentially enriched in the early ring stage include the M. mulatta cortical cytoskeleton and response to oxidative stress. The late ring stage is characterized by overexpression of the P. knowlesi clathrin heavy chain. Combined with late ring stage GO term enrichment of Golgi-associated and coated vesicles, and enrichment of COPI-coated vesicles in both stages, this suggests the importance to P. knowlesi biology of clathrin-mediated endocytosis. P. knowlesi ribosomes and translation were also differentially enriched in the late ring stage. With expression of a variety of heat shock proteins, these results suggest production of folded parasite proteins is increasing by the late ring stage. M. mulatta endocytosis was differentially enriched in the late ring stage, as were clathrin-coated vesicles and endocytic vesicles. This suggests that M. mulatta clathrin-based endocytosis, perhaps in infected reticulocytes rather than mature RBC, may be an important process in the late ring stage. Additional ring stage biology from enriched GO terms includes M. mulatta proteasomes, protein folding and the chaperonin-containing T complex, actin and cortical actin cytoskeletons. P knowlesi biology also includes proteasomes, as well as nucleosomes, antioxidant activity, a variety of transport processes, glycolysis, vacuoles and protein folding. Mature RBCs have lost internal organelles, suggesting infection here may involve immature reticulocytes still retaining organelles. P. knowlesi parasite proteasomes and translational machinery may be ring stage drug targets for known selective inhibitors of these processes in other Plasmodium species. To our knowledge this is the first examination of more than one timepoint within the ring stage. Our results expand knowledge of both host and parasite proteins, pathways and organelles underlying P. knowlesi ring stage biology.
Collapse
Affiliation(s)
- D C Anderson
- Biosciences Division, SRI International, Harrisonburg, VA 22802, USA.
| | - Mariko S Peterson
- Emory Vaccine Center and Yerkes National Primate Research Center, Emory University, Atlanta, GA 30322, USA
| | - Stacey A Lapp
- Emory Vaccine Center and Yerkes National Primate Research Center, Emory University, Atlanta, GA 30322, USA
| | - Mary R Galinski
- Emory Vaccine Center and Yerkes National Primate Research Center, Emory University, Atlanta, GA 30322, USA; Division of Infectious Diseases, Department of Medicine, Emory University School of Medicine, Emory University, Atlanta, GA 30322, USA
| |
Collapse
|
6
|
Kiseleva OI, Pyatnitskiy MA, Arzumanian VA, Kurbatov IY, Ilinsky VV, Ilgisonis EV, Plotnikova OA, Sharafetdinov KK, Tutelyan VA, Nikityuk DB, Ponomarenko EA, Poverennaya EV. Multiomics Picture of Obesity in Young Adults. BIOLOGY 2024; 13:272. [PMID: 38666884 PMCID: PMC11048234 DOI: 10.3390/biology13040272] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/29/2024] [Revised: 04/02/2024] [Accepted: 04/12/2024] [Indexed: 04/28/2024]
Abstract
Obesity is a socially significant disease that is characterized by a disproportionate accumulation of fat. It is also associated with chronic inflammation, cancer, diabetes, and other comorbidities. Investigating biomarkers and pathological processes linked to obesity is especially vital for young individuals, given their increased potential for lifestyle modifications. By comparing the genetic, proteomic, and metabolomic profiles of individuals categorized as underweight, normal, overweight, and obese, we aimed to determine which omics layer most accurately reflects the phenotypic changes in an organism that result from obesity. We profiled blood plasma samples by employing three omics methodologies. The untargeted GC×GC-MS metabolomics approach identified 313 metabolites. To augment the metabolomic dataset, we integrated a label-free HPLC-MS/MS proteomics method, leading to the identification of 708 proteins. The genomic layer encompassed the genotyping of 647,250 SNPs. Utilizing omics data, we trained sparse Partial Least Squares models to predict body mass index. Molecular features exhibiting frequently non-zero coefficients were selected as potential biomarkers, and we further explored enriched biological pathways. Proteomics was the most effective in single-omics analyses, with a median absolute error (MAE) of 5.44 ± 0.31 kg/m2, incorporating an average of 24 proteins per model. Metabolomics showed slightly lower performance (MAE = 6.06 ± 0.33 kg/m2), followed by genomics (MAE = 6.20 ± 0.34 kg/m2). As expected, multiomic models demonstrated better accuracy, particularly the combination of proteomics and metabolomics (MAE = 4.77 ± 0.33 kg/m2), while including genomics data did not enhance the results. This manuscript is the first multiomics study of obesity in a gender-balanced cohort of young adults profiled by genomic, proteomic, and metabolomic methods. The comprehensive approach provides novel insights into the molecular mechanisms of obesity, opening avenues for more targeted interventions.
Collapse
Affiliation(s)
- Olga I. Kiseleva
- Institute of Biomedical Chemistry, Moscow 119121, Russia; (O.I.K.)
| | - Mikhail A. Pyatnitskiy
- Institute of Biomedical Chemistry, Moscow 119121, Russia; (O.I.K.)
- Faculty of Computer Science, National Research University Higher School of Economics, Moscow 101000, Russia
| | | | - Ilya Y. Kurbatov
- Institute of Biomedical Chemistry, Moscow 119121, Russia; (O.I.K.)
| | | | | | - Oksana A. Plotnikova
- Federal Research Centre of Nutrition, Biotechnology and Food Safety, Russian Academy of Sciences, Moscow 109240, Russia
| | - Khaider K. Sharafetdinov
- Federal Research Centre of Nutrition, Biotechnology and Food Safety, Russian Academy of Sciences, Moscow 109240, Russia
- Russian Medical Academy of Continuing Professional Education, Ministry of Health of the Russian Federation, Moscow 125993, Russia
- I.M. Sechenov First Moscow State Medical University (Sechenov University), Ministry of Health of the Russian Federation, Moscow 119991, Russia
| | - Victor A. Tutelyan
- Federal Research Centre of Nutrition, Biotechnology and Food Safety, Russian Academy of Sciences, Moscow 109240, Russia
- I.M. Sechenov First Moscow State Medical University (Sechenov University), Ministry of Health of the Russian Federation, Moscow 119991, Russia
| | - Dmitry B. Nikityuk
- Federal Research Centre of Nutrition, Biotechnology and Food Safety, Russian Academy of Sciences, Moscow 109240, Russia
- I.M. Sechenov First Moscow State Medical University (Sechenov University), Ministry of Health of the Russian Federation, Moscow 119991, Russia
| | | | | |
Collapse
|
7
|
Keyport Kik S, Christopher D, Glauninger H, Hickernell CW, Bard JAM, Lin KM, Squires AH, Ford M, Sosnick TR, Drummond DA. An adaptive biomolecular condensation response is conserved across environmentally divergent species. Nat Commun 2024; 15:3127. [PMID: 38605014 PMCID: PMC11009240 DOI: 10.1038/s41467-024-47355-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2023] [Accepted: 03/27/2024] [Indexed: 04/13/2024] Open
Abstract
Cells must sense and respond to sudden maladaptive environmental changes-stresses-to survive and thrive. Across eukaryotes, stresses such as heat shock trigger conserved responses: growth arrest, a specific transcriptional response, and biomolecular condensation of protein and mRNA into structures known as stress granules under severe stress. The composition, formation mechanism, adaptive significance, and even evolutionary conservation of these condensed structures remain enigmatic. Here we provide a remarkable view into stress-triggered condensation, its evolutionary conservation and tuning, and its integration into other well-studied aspects of the stress response. Using three morphologically near-identical budding yeast species adapted to different thermal environments and diverged by up to 100 million years, we show that proteome-scale biomolecular condensation is tuned to species-specific thermal niches, closely tracking corresponding growth and transcriptional responses. In each species, poly(A)-binding protein-a core marker of stress granules-condenses in isolation at species-specific temperatures, with conserved molecular features and conformational changes modulating condensation. From the ecological to the molecular scale, our results reveal previously unappreciated levels of evolutionary selection in the eukaryotic stress response, while establishing a rich, tractable system for further inquiry.
Collapse
Affiliation(s)
- Samantha Keyport Kik
- Committee on Genetics, Genomics, and Systems Biology, The University of Chicago, Chicago, IL, USA
| | - Dana Christopher
- Department of Biochemistry and Molecular Biology, The University of Chicago, Chicago, IL, USA
| | - Hendrik Glauninger
- Graduate Program in Biophysical Sciences, The University of Chicago, Chicago, IL, USA
- Interdisciplinary Scientist Training Program, The University of Chicago, Chicago, IL, USA
| | - Caitlin Wong Hickernell
- Department of Biochemistry and Molecular Biology, The University of Chicago, Chicago, IL, USA
| | - Jared A M Bard
- Department of Biochemistry and Molecular Biology, The University of Chicago, Chicago, IL, USA
| | - Kyle M Lin
- Graduate Program in Biophysical Sciences, The University of Chicago, Chicago, IL, USA
- Interdisciplinary Scientist Training Program, The University of Chicago, Chicago, IL, USA
| | - Allison H Squires
- Pritzker School of Molecular Engineering, University of Chicago, Chicago, IL, USA
- Institute for Biophysical Dynamics, University of Chicago, Chicago, IL, USA
| | | | - Tobin R Sosnick
- Department of Biochemistry and Molecular Biology, The University of Chicago, Chicago, IL, USA
- Institute for Biophysical Dynamics, University of Chicago, Chicago, IL, USA
| | - D Allan Drummond
- Department of Biochemistry and Molecular Biology, The University of Chicago, Chicago, IL, USA.
- Institute for Biophysical Dynamics, University of Chicago, Chicago, IL, USA.
- Department of Medicine, Section of Genetic Medicine, The University of Chicago, Chicago, IL, USA.
| |
Collapse
|
8
|
Garge RK, Geck RC, Armstrong JO, Dunn B, Boutz DR, Battenhouse A, Leutert M, Dang V, Jiang P, Kwiatkowski D, Peiser T, McElroy H, Marcotte EM, Dunham MJ. Systematic profiling of ale yeast protein dynamics across fermentation and repitching. G3 (BETHESDA, MD.) 2024; 14:jkad293. [PMID: 38135291 PMCID: PMC10917522 DOI: 10.1093/g3journal/jkad293] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/01/2023] [Revised: 11/28/2023] [Accepted: 12/07/2023] [Indexed: 12/24/2023]
Abstract
Studying the genetic and molecular characteristics of brewing yeast strains is crucial for understanding their domestication history and adaptations accumulated over time in fermentation environments, and for guiding optimizations to the brewing process itself. Saccharomyces cerevisiae (brewing yeast) is among the most profiled organisms on the planet, yet the temporal molecular changes that underlie industrial fermentation and beer brewing remain understudied. Here, we characterized the genomic makeup of a Saccharomyces cerevisiae ale yeast widely used in the production of Hefeweizen beers, and applied shotgun mass spectrometry to systematically measure the proteomic changes throughout 2 fermentation cycles which were separated by 14 rounds of serial repitching. The resulting brewing yeast proteomics resource includes 64,740 protein abundance measurements. We found that this strain possesses typical genetic characteristics of Saccharomyces cerevisiae ale strains and displayed progressive shifts in molecular processes during fermentation based on protein abundance changes. We observed protein abundance differences between early fermentation batches compared to those separated by 14 rounds of serial repitching. The observed abundance differences occurred mainly in proteins involved in the metabolism of ergosterol and isobutyraldehyde. Our systematic profiling serves as a starting point for deeper characterization of how the yeast proteome changes during commercial fermentations and additionally serves as a resource to guide fermentation protocols, strain handling, and engineering practices in commercial brewing and fermentation environments. Finally, we created a web interface (https://brewing-yeast-proteomics.ccbb.utexas.edu/) to serve as a valuable resource for yeast geneticists, brewers, and biochemists to provide insights into the global trends underlying commercial beer production.
Collapse
Affiliation(s)
- Riddhiman K Garge
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
- Department of Molecular Biosciences, The University of Texas at Austin, Austin, TX 78712, USA
| | - Renee C Geck
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Joseph O Armstrong
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Barbara Dunn
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Daniel R Boutz
- Department of Molecular Biosciences, The University of Texas at Austin, Austin, TX 78712, USA
- Antibody Discovery and Accelerated Protein Therapeutics, Department of Pathology and Genomic Medicine, Houston Methodist Research Institute, Houston, TX 77030, USA
| | - Anna Battenhouse
- Department of Molecular Biosciences, The University of Texas at Austin, Austin, TX 78712, USA
| | - Mario Leutert
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
- Institute of Molecular Systems Biology, ETH Zürich, Zürich 8049, Switzerland
| | - Vy Dang
- Department of Molecular Biosciences, The University of Texas at Austin, Austin, TX 78712, USA
| | - Pengyao Jiang
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | | | | | | | - Edward M Marcotte
- Department of Molecular Biosciences, The University of Texas at Austin, Austin, TX 78712, USA
| | - Maitreya J Dunham
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| |
Collapse
|
9
|
Bouyssié D, Altıner P, Capella-Gutierrez S, Fernández JM, Hagemeijer YP, Horvatovich P, Hubálek M, Levander F, Mauri P, Palmblad M, Raffelsberger W, Rodríguez-Navas L, Di Silvestre D, Kunkli BT, Uszkoreit J, Vandenbrouck Y, Vizcaíno JA, Winkelhardt D, Schwämmle V. WOMBAT-P: Benchmarking Label-Free Proteomics Data Analysis Workflows. J Proteome Res 2024; 23:418-429. [PMID: 38038272 DOI: 10.1021/acs.jproteome.3c00636] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/02/2023]
Abstract
The inherent diversity of approaches in proteomics research has led to a wide range of software solutions for data analysis. These software solutions encompass multiple tools, each employing different algorithms for various tasks such as peptide-spectrum matching, protein inference, quantification, statistical analysis, and visualization. To enable an unbiased comparison of commonly used bottom-up label-free proteomics workflows, we introduce WOMBAT-P, a versatile platform designed for automated benchmarking and comparison. WOMBAT-P simplifies the processing of public data by utilizing the sample and data relationship format for proteomics (SDRF-Proteomics) as input. This feature streamlines the analysis of annotated local or public ProteomeXchange data sets, promoting efficient comparisons among diverse outputs. Through an evaluation using experimental ground truth data and a realistic biological data set, we uncover significant disparities and a limited overlap in the quantified proteins. WOMBAT-P not only enables rapid execution and seamless comparison of workflows but also provides valuable insights into the capabilities of different software solutions. These benchmarking metrics are a valuable resource for researchers in selecting the most suitable workflow for their specific data sets. The modular architecture of WOMBAT-P promotes extensibility and customization. The software is available at https://github.com/wombat-p/WOMBAT-Pipelines.
Collapse
Affiliation(s)
- David Bouyssié
- Institut de Pharmacologie et de Biologie Structurale (IPBS), Université de Toulouse, CNRS, Université Toulouse III─Paul Sabatier (UT3), 31062 Toulouse, France
- Proteomics French Infrastructure, ProFI, FR 2048 Toulouse, France
| | - Pınar Altıner
- Institut de Pharmacologie et de Biologie Structurale (IPBS), Université de Toulouse, CNRS, Université Toulouse III─Paul Sabatier (UT3), 31062 Toulouse, France
| | | | - José M Fernández
- Life Sciences Department, Barcelona Supercomputing Center (BSC), 08034 Barcelona, Spain
| | - Yanick Paco Hagemeijer
- Department of Analytical Biochemistry, University of Groningen, Groningen Research Institute of Pharmacy, 9712 CP Groningen, The Netherlands
- European Research Institute for the Biology of Ageing, University Medical Center Groningen, 9713 GZ Groningen, The Netherlands
| | - Peter Horvatovich
- Department of Analytical Biochemistry, University of Groningen, Groningen Research Institute of Pharmacy, 9712 CP Groningen, The Netherlands
| | - Martin Hubálek
- Institute of Organic Chemistry and Biochemistry, CAS, 160 00 Prague, Czech Republic
| | - Fredrik Levander
- National Bioinformatics Infrastructure Sweden (NBIS), Science for Life Laboratory, Department of Immunotechnology, Lund University, 22100 Lund, Sweden
| | - Pierluigi Mauri
- Institute for Biomedical Technologies (ITB), Department of Biomedical Sciences, National Research Council (CNR), Segrate, 20054 Milan, Italy
| | - Magnus Palmblad
- Leiden University Medical Center, Postbus 9600, 2300 RC Leiden, The Netherlands
| | - Wolfgang Raffelsberger
- Wolfgang Raffelsberger: Institut de Génétique et de Biologie Moléculaire et Cellulaire, Université de Strasbourg, CNRS UMR7104, INSERM U1258, Illkirch, 1 Rue Laurent Fries, 67404 Illkirch, France
| | - Laura Rodríguez-Navas
- Life Sciences Department, Barcelona Supercomputing Center (BSC), 08034 Barcelona, Spain
| | - Dario Di Silvestre
- Institute for Biomedical Technologies (ITB), Department of Biomedical Sciences, National Research Council (CNR), Segrate, 20054 Milan, Italy
| | - Balázs Tibor Kunkli
- Balázs Tibor Kunkli: Department of Biochemistry and Molecular Biology, University of Debrecen, 4032 Debrecen, Hungary
| | - Julian Uszkoreit
- Medical Faculty, Medical Bioinformatics, Ruhr University Bochum, 44801 Bochum, Germany
- Center for Protein Diagnostics (ProDi), Medical Proteome Analysis, Ruhr University Bochum, 44801 Bochum, Germany
- Medical Faculty, Medizinisches Proteom-Center, Ruhr University Bochum, 44801 Bochum, Germany
| | - Yves Vandenbrouck
- Proteomics French Infrastructure, ProFI, FR 2048 Toulouse, France
- CEA, Fundamental Research Division, Proteomics French Infrastructure, 91191 Gif-sur-Yvette, France
| | - Juan Antonio Vizcaíno
- European Molecular Biology Laboratory European Bioinformatics Institute (EMBL-EBI), Wellcome Trust, Genome Campus, Hinxton, Cambridge CB10 1SD, U.K
| | - Dirk Winkelhardt
- Medical Faculty, Medizinisches Proteom-Center, Ruhr University Bochum, 44801 Bochum, Germany
| | - Veit Schwämmle
- Department of Biochemistry and Molecular Biology, University of Southern Denmark, Campusvej 55, 5230 Odense M, Denmark
| |
Collapse
|
10
|
Declercq A, Demeulemeester N, Gabriels R, Bouwmeester R, Degroeve S, Martens L. Bioinformatics Pipeline for Processing Single-Cell Data. Methods Mol Biol 2024; 2817:221-239. [PMID: 38907156 DOI: 10.1007/978-1-0716-3934-4_15] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/23/2024]
Abstract
Single-cell proteomics can offer valuable insights into dynamic cellular interactions, but identifying proteins at this level is challenging due to their low abundance. In this chapter, we present a state-of-the-art bioinformatics pipeline for single-cell proteomics that combines the search engine Sage (via SearchGUI), identification rescoring with MS2Rescore, quantification through FlashLFQ, and differential expression analysis using MSqRob2. MS2Rescore leverages LC-MS/MS behavior predictors, such as MS2PIP and DeepLC, to recalibrate scores with Percolator or mokapot. Combining these tools into a unified pipeline, this approach improves the detection of low-abundance peptides, resulting in increased identifications while maintaining stringent FDR thresholds.
Collapse
Affiliation(s)
- Arthur Declercq
- VIB-UGent Center for Medical Biotechnology, VIB, Ghent, Belgium
- Department of Biomolecular Medicine, Ghent University, Ghent, Belgium
| | - Nina Demeulemeester
- VIB-UGent Center for Medical Biotechnology, VIB, Ghent, Belgium
- Department of Biomolecular Medicine, Ghent University, Ghent, Belgium
- StatOmics, Department of Applied Mathematics, Computer Science and Statistics, Ghent University, Ghent, Belgium
| | - Ralf Gabriels
- VIB-UGent Center for Medical Biotechnology, VIB, Ghent, Belgium
- Department of Biomolecular Medicine, Ghent University, Ghent, Belgium
| | - Robbin Bouwmeester
- VIB-UGent Center for Medical Biotechnology, VIB, Ghent, Belgium
- Department of Biomolecular Medicine, Ghent University, Ghent, Belgium
| | - Sven Degroeve
- VIB-UGent Center for Medical Biotechnology, VIB, Ghent, Belgium
- Department of Biomolecular Medicine, Ghent University, Ghent, Belgium
| | - Lennart Martens
- VIB-UGent Center for Medical Biotechnology, VIB, Ghent, Belgium.
- Department of Biomolecular Medicine, Ghent University, Ghent, Belgium.
| |
Collapse
|
11
|
Delogu F, Kunath BJ, Queirós PM, Halder R, Lebrun LA, Pope PB, May P, Widder S, Muller EEL, Wilmes P. Forecasting the dynamics of a complex microbial community using integrated meta-omics. Nat Ecol Evol 2024; 8:32-44. [PMID: 37957315 PMCID: PMC10781640 DOI: 10.1038/s41559-023-02241-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2022] [Accepted: 10/02/2023] [Indexed: 11/15/2023]
Abstract
Predicting the behaviour of complex microbial communities is challenging. However, this is essential for complex biotechnological processes such as those in biological wastewater treatment plants (BWWTPs), which require sustainable operation. Here we summarize 14 months of longitudinal meta-omics data from a BWWTP anaerobic tank into 17 temporal signals, explaining 91.1% of the temporal variance, and link those signals to ecological events within the community. We forecast the signals over the subsequent five years and use 21 extra samples collected at defined time intervals for testing and validation. Our forecasts are correct for six signals and hint on phenomena such as predation cycles. Using all the 17 forecasts and the environmental variables, we predict gene abundance and expression, with a coefficient of determination ≥0.87 for the subsequent three years. Our study demonstrates the ability to forecast the dynamics of open microbial ecosystems using interactions between community cycles and environmental parameters.
Collapse
Affiliation(s)
- Francesco Delogu
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg.
| | - Benoit J Kunath
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg
| | - Pedro M Queirós
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg
| | - Rashi Halder
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg
| | - Laura A Lebrun
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg
| | - Phillip B Pope
- Faculty of Biosciences, Norwegian University of Life Sciences, Ås, Norway
- Faculty of Chemistry, Biotechnology and Food Science, Norwegian University of Life Sciences, Ås, Norway
| | - Patrick May
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg
| | - Stefanie Widder
- Department of Medicine 1, Research Division Infection Biology, Medical University of Vienna, Vienna, Austria
| | - Emilie E L Muller
- Génétique Moléculaire, Génomique, Microbiologie, UMR 7156 CNRS, Université de Strasbourg, Strasbourg, France
| | - Paul Wilmes
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg.
- Department of Life Sciences and Medicine, Faculty of Science, Technology and Medicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg.
| |
Collapse
|
12
|
Provencher N, Leblanc S, Jacques JF, Roucou X. Exploring the Alternative Proteome with OpenProt and Mass Spectrometry. Methods Mol Biol 2024; 2836:3-17. [PMID: 38995532 DOI: 10.1007/978-1-0716-4007-4_1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/13/2024]
Abstract
Proteogenomics has revealed the translation of unannotated open reading frames (ORFs) present in mRNAs and in noncoding RNAs (ncRNAs). OpenProt annotates all ORFs with a minimum of 30 codons in the transcriptome of several species and displays many functional features associated with the corresponding proteins. Two types of proteins are annotated: reference or canonical proteins which are proteins already annotated in UniProt, RefSeq, or Ensembl and noncanonical proteins. Noncanonical proteins form two groups: predicted novel isoforms that display a significant level of homology with a reference protein and alternative proteins that are new proteins with no significant homology to known proteins. This chapter describes how to check whether a gene and/or transcript contains multiple open reading frames and how to use OpenProt databases for the detection of alternative proteins and novel isoforms by mass spectrometry-based proteomics.
Collapse
Affiliation(s)
- Nicolas Provencher
- Department of Biochemistry and Functional Genomics, Université de Sherbrooke, Sherbrooke, QC, Canada
| | - Sébastien Leblanc
- Department of Biochemistry and Functional Genomics, Université de Sherbrooke, Sherbrooke, QC, Canada
| | - Jean-François Jacques
- Department of Biochemistry and Functional Genomics, Université de Sherbrooke, Sherbrooke, QC, Canada
| | - Xavier Roucou
- Department of Biochemistry and Functional Genomics, Université de Sherbrooke, Sherbrooke, QC, Canada.
- Centre de Recherche du Centre Hospitalier Universitaire de Sherbrooke (CRCHUS), Sherbrooke, QC, Canada.
| |
Collapse
|
13
|
Arzumanian VA, Kurbatov IY, Ptitsyn KG, Khmeleva SA, Kurbatov LK, Radko SP, Poverennaya EV. Identifying N6-Methyladenosine Sites in HepG2 Cell Lines Using Oxford Nanopore Technology. Int J Mol Sci 2023; 24:16477. [PMID: 38003667 PMCID: PMC10671286 DOI: 10.3390/ijms242216477] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2023] [Revised: 11/03/2023] [Accepted: 11/12/2023] [Indexed: 11/26/2023] Open
Abstract
RNA modifications, particularly N6-methyladenosine (m6A), are pivotal regulators of RNA functionality and cellular processes. We analyzed m6A modifications by employing Oxford Nanopore technology and the m6Anet algorithm, focusing on the HepG2 cell line. We identified 3968 potential m6A modification sites in 2851 transcripts, corresponding to 1396 genes. A gene functional analysis revealed the active involvement of m6A-modified genes in ubiquitination, transcription regulation, and protein folding processes, aligning with the known role of m6A modifications in histone ubiquitination in cancer. To ensure data robustness, we assessed reproducibility across technical replicates. This study underscores the importance of evaluating algorithmic reproducibility, especially in supervised learning. Furthermore, we examined correlations between transcriptomic, translatomic, and proteomic levels. A strong transcriptomic-translatomic correlation was observed. In conclusion, our study deepens our understanding of m6A modifications' multifaceted impacts on cellular processes and underscores the importance of addressing reproducibility concerns in analytical approaches.
Collapse
Affiliation(s)
| | | | | | | | | | | | - Ekaterina V. Poverennaya
- Institute of Biomedical Chemistry, Russian Academy of Medical Sciences, 119121 Moscow, Russia; (V.A.A.); (I.Y.K.); (K.G.P.); (S.A.K.); (L.K.K.); (S.P.R.)
| |
Collapse
|
14
|
Lazear MR. Sage: An Open-Source Tool for Fast Proteomics Searching and Quantification at Scale. J Proteome Res 2023; 22:3652-3659. [PMID: 37819886 DOI: 10.1021/acs.jproteome.3c00486] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/13/2023]
Abstract
The growing complexity and volume of proteomics data necessitate the development of efficient software tools for peptide identification and quantification from mass spectra. Given their central role in proteomics, it is imperative that these tools are auditable and extensible─requirements that are best fulfilled by open-source and permissively licensed software. This work presents Sage, a high-performance, open-source, and freely available proteomics pipeline. Scalable and cloud-ready, Sage matches the performance of state-of-the-art software tools while running an order of magnitude faster.
Collapse
Affiliation(s)
- Michael R Lazear
- Belharra Therapeutics, 3985 Sorrento Valley Boulevard Suite C, San Diego, California 92121, United States
| |
Collapse
|
15
|
Seregin AA, Smirnova LP, Dmitrieva EM, Zavialova MG, Simutkin GG, Ivanova SA. Differential Expression of Proteins Associated with Bipolar Disorder as Identified Using the PeptideShaker Software. Int J Mol Sci 2023; 24:15250. [PMID: 37894929 PMCID: PMC10607299 DOI: 10.3390/ijms242015250] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2023] [Revised: 09/29/2023] [Accepted: 10/06/2023] [Indexed: 10/29/2023] Open
Abstract
The prevalence of bipolar disorder (BD) in modern society is growing rapidly, but due to the lack of paraclinical criteria, its differential diagnosis with other mental disorders is somewhat challenging. In this regard, the relevance of proteomic studies is increasing due to the development of methods for processing large data arrays; this contributes to the discovery of protein patterns of pathological processes and the creation of new methods of diagnosis and treatment. It seems promising to search for proteins involved in the pathogenesis of BD in an easily accessible material-blood serum. Sera from BD patients and healthy individuals were purified via affinity chromatography to isolate 14 major proteins and separated using 1D SDS-PAGE. After trypsinolysis, the proteins in the samples were identified via HPLC/mass spectrometry. Mass spectrometric data were processed using the OMSSA and X!Tandem search algorithms using the UniProtKB database, and the results were analyzed using PeptideShaker. Differences in proteomes were assessed via an unlabeled NSAF-based analysis using a two-tailed Bonferroni-adjusted t-test. When comparing the blood serum proteomes of BD patients and healthy individuals, 10 proteins showed significant differences in NSAF values. Of these, four proteins were predominantly present in BD patients with the maximum NSAF value: 14-3-3 protein zeta/delta; ectonucleoside triphosphate diphosphohydrolase 7; transforming growth factor-beta-induced protein ig-h3; and B-cell CLL/lymphoma 9 protein. Further exploration of the role of these proteins in BD is warranted; conducting such studies will help develop new paraclinical criteria and discover new targets for BD drug therapy.
Collapse
Affiliation(s)
- Alexander A. Seregin
- Mental Health Research Institute, Tomsk National Research Medical Center, Russian Academy of Sciences, Tomsk 634014, Russia; (A.A.S.)
| | - Liudmila P. Smirnova
- Mental Health Research Institute, Tomsk National Research Medical Center, Russian Academy of Sciences, Tomsk 634014, Russia; (A.A.S.)
| | - Elena M. Dmitrieva
- Mental Health Research Institute, Tomsk National Research Medical Center, Russian Academy of Sciences, Tomsk 634014, Russia; (A.A.S.)
| | | | - German G. Simutkin
- Mental Health Research Institute, Tomsk National Research Medical Center, Russian Academy of Sciences, Tomsk 634014, Russia; (A.A.S.)
| | - Svetlana A. Ivanova
- Mental Health Research Institute, Tomsk National Research Medical Center, Russian Academy of Sciences, Tomsk 634014, Russia; (A.A.S.)
| |
Collapse
|
16
|
Gómez-Varela D, Xian F, Grundtner S, Sondermann JR, Carta G, Schmidt M. Increasing taxonomic and functional characterization of host-microbiome interactions by DIA-PASEF metaproteomics. Front Microbiol 2023; 14:1258703. [PMID: 37908546 PMCID: PMC10613666 DOI: 10.3389/fmicb.2023.1258703] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2023] [Accepted: 09/20/2023] [Indexed: 11/02/2023] Open
Abstract
Introduction Metaproteomics is a rapidly advancing field that offers unique insights into the taxonomic composition and the functional activity of microbial communities, and their effects on host physiology. Classically, data-dependent acquisition (DDA) mass spectrometry (MS) has been applied for peptide identification and quantification in metaproteomics. However, DDA-MS exhibits well-known limitations in terms of depth, sensitivity, and reproducibility. Consequently, methodological improvements are required to better characterize the protein landscape of microbiomes and their interactions with the host. Methods We present an optimized proteomic workflow that utilizes the information captured by Parallel Accumulation-Serial Fragmentation (PASEF) MS for comprehensive metaproteomic studies in complex fecal samples of mice. Results and discussion We show that implementing PASEF using a DDA acquisition scheme (DDA-PASEF) increased peptide quantification up to 5 times and reached higher accuracy and reproducibility compared to previously published classical DDA and data-independent acquisition (DIA) methods. Furthermore, we demonstrate that the combination of DIA, PASEF, and neuronal-network-based data analysis, was superior to DDA-PASEF in all mentioned parameters. Importantly, DIA-PASEF expanded the dynamic range towards low-abundant proteins and it doubled the quantification of proteins with unknown or uncharacterized functions. Compared to previous classical DDA metaproteomic studies, DIA-PASEF resulted in the quantification of up to 4 times more taxonomic units using 16 times less injected peptides and 4 times shorter chromatography gradients. Moreover, 131 additional functional pathways distributed across more and even uniquely identified taxa were profiled as revealed by a peptide-centric taxonomic-functional analysis. We tested our workflow on a validated preclinical mouse model of neuropathic pain to assess longitudinal changes in host-gut microbiome interactions associated with pain - an unexplored topic for metaproteomics. We uncovered the significant enrichment of two bacterial classes upon pain, and, in addition, the upregulation of metabolic activities previously linked to chronic pain as well as various hitherto unknown ones. Furthermore, our data revealed pain-associated dynamics of proteome complexes implicated in the crosstalk between the host immune system and the gut microbiome. In conclusion, the DIA-PASEF metaproteomic workflow presented here provides a stepping stone towards a deeper understanding of microbial ecosystems across the breadth of biomedical and biotechnological fields.
Collapse
|
17
|
Larréché S, Bousquet A, Chevillard L, Gahoual R, Jourdi G, Dupart AL, Bachelot-Loza C, Gaussem P, Siguret V, Chippaux JP, Mégarbane B. Bothrops atrox and Bothrops lanceolatus Venoms In Vitro Investigation: Composition, Procoagulant Effects, Co-Factor Dependency, and Correction Using Antivenoms. Toxins (Basel) 2023; 15:614. [PMID: 37888645 PMCID: PMC10611193 DOI: 10.3390/toxins15100614] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2023] [Revised: 10/05/2023] [Accepted: 10/12/2023] [Indexed: 10/28/2023] Open
Abstract
Bothrops venoms are rich in enzymes acting on platelets and coagulation. This action is dependent on two major co-factors, i.e., calcium and phospholipids, while antivenoms variably neutralize venom-related coagulopathy effects. Our aims were (i) to describe the composition of B. atrox and B. lanceolatus venoms; (ii) to study their activity on the whole blood using rotational thromboelastometry (ROTEM); (iii) to evaluate the contribution of calcium and phospholipids in their activity; and (iv) to compare the effectiveness of four antivenoms (Bothrofav™, Inoserp™ South America, Antivipmyn™ TRI, and PoliVal-ICP™) on the procoagulant activity of these two venoms. Venom composition was comparable. Both venoms exhibited hypercoagulant effects. B. lanceolatus venom was completely dependent on calcium but less dependent on phospholipids than B. atrox venom to induce in vitro coagulation. The four antivenoms neutralized the procoagulant activity of the two venoms; however, with quantitative differences. Bothrofav™ was more effective against both venoms than the three other antivenoms. The relatively similar venom-induced effects in vitro were unexpected considering the opposite clinical manifestations resulting from envenomation (i.e., systemic bleeding with B. atrox and thrombosis with B. lanceolatus). In vivo studies are warranted to better understand the pathophysiology of systemic bleeding and thrombosis associated with Bothrops bites.
Collapse
Affiliation(s)
- Sébastien Larréché
- Inserm, UMRS-1144, Université Paris Cité, F-75006 Paris, France;
- Department of Medical Biology, Bégin Military Teaching Hospital, F-94160 Saint-Mandé, France; (A.B.); (A.-L.D.)
| | - Aurore Bousquet
- Department of Medical Biology, Bégin Military Teaching Hospital, F-94160 Saint-Mandé, France; (A.B.); (A.-L.D.)
| | - Lucie Chevillard
- Inserm, UMRS-1144, Université Paris Cité, F-75006 Paris, France;
| | - Rabah Gahoual
- Chemical and Biological Technologies for Health Unit, CNRS UMR 8258, Inserm, Université Paris Cité, F-75006 Paris, France;
| | - Georges Jourdi
- Innovative Therapies in Hemostasis, Inserm, Université Paris Cité, F-75006 Paris, France; (G.J.); (C.B.-L.); (P.G.); (V.S.)
- Department of Biological Hematology, Lariboisière Hospital, Assistance Publique–Hôpitaux de Paris, F-75010 Paris, France
| | - Anne-Laure Dupart
- Department of Medical Biology, Bégin Military Teaching Hospital, F-94160 Saint-Mandé, France; (A.B.); (A.-L.D.)
| | - Christilla Bachelot-Loza
- Innovative Therapies in Hemostasis, Inserm, Université Paris Cité, F-75006 Paris, France; (G.J.); (C.B.-L.); (P.G.); (V.S.)
| | - Pascale Gaussem
- Innovative Therapies in Hemostasis, Inserm, Université Paris Cité, F-75006 Paris, France; (G.J.); (C.B.-L.); (P.G.); (V.S.)
- Department of Hematology, Georges Pompidou European Hospital, Assistance Publique–Hôpitaux de Paris, F-75015 Paris, France
| | - Virginie Siguret
- Innovative Therapies in Hemostasis, Inserm, Université Paris Cité, F-75006 Paris, France; (G.J.); (C.B.-L.); (P.G.); (V.S.)
- Department of Biological Hematology, Lariboisière Hospital, Assistance Publique–Hôpitaux de Paris, F-75010 Paris, France
| | - Jean-Philippe Chippaux
- French National Research Institute for Sustainable Development, Université Paris Cité, F-75006 Paris, France;
| | - Bruno Mégarbane
- Inserm, UMRS-1144, Université Paris Cité, F-75006 Paris, France;
- Department of Medical and Toxicological Critical Care, Federation of Toxicology, Lariboisière Hospital, Assistance Publique–Hôpitaux de Paris, F-75010 Paris, France
| |
Collapse
|
18
|
Skiadopoulou D, Vašíček J, Kuznetsova K, Bouyssié D, Käll L, Vaudel M. Retention Time and Fragmentation Predictors Increase Confidence in Identification of Common Variant Peptides. J Proteome Res 2023; 22:3190-3199. [PMID: 37656829 PMCID: PMC10563157 DOI: 10.1021/acs.jproteome.3c00243] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2023] [Indexed: 09/03/2023]
Abstract
Precision medicine focuses on adapting care to the individual profile of patients, for example, accounting for their unique genetic makeup. Being able to account for the effect of genetic variation on the proteome holds great promise toward this goal. However, identifying the protein products of genetic variation using mass spectrometry has proven very challenging. Here we show that the identification of variant peptides can be improved by the integration of retention time and fragmentation predictors into a unified proteogenomic pipeline. By combining these intrinsic peptide characteristics using the search-engine post-processor Percolator, we demonstrate improved discrimination power between correct and incorrect peptide-spectrum matches. Our results demonstrate that the drop in performance that is induced when expanding a protein sequence database can be compensated, hence enabling efficient identification of genetic variation products in proteomics data. We anticipate that this enhancement of proteogenomic pipelines can provide a more refined picture of the unique proteome of patients and thereby contribute to improving patient care.
Collapse
Affiliation(s)
- Dafni Skiadopoulou
- Mohn
Center for Diabetes Precision Medicine, Department of Clinical Science, University of Bergen, NO-5020 Bergen, Norway
- Computational
Biology Unit, Department of Informatics, University of Bergen, NO-5020 Bergen, Norway
| | - Jakub Vašíček
- Mohn
Center for Diabetes Precision Medicine, Department of Clinical Science, University of Bergen, NO-5020 Bergen, Norway
- Computational
Biology Unit, Department of Informatics, University of Bergen, NO-5020 Bergen, Norway
| | - Ksenia Kuznetsova
- Mohn
Center for Diabetes Precision Medicine, Department of Clinical Science, University of Bergen, NO-5020 Bergen, Norway
- Computational
Biology Unit, Department of Informatics, University of Bergen, NO-5020 Bergen, Norway
| | - David Bouyssié
- Institut
de Pharmacologie et de Biologie Structurale (IPBS), Université
de Toulouse, CNRS, Université Toulouse III—Paul Sabatier
(UT3), 31000 Toulouse, France
| | - Lukas Käll
- Science
for Life Laboratory, School of Engineering Sciences in Chemistry,
Biotechnology and Health, KTH Royal Institute
of Technology, SE-100 44 Stockholm, Sweden
| | - Marc Vaudel
- Mohn
Center for Diabetes Precision Medicine, Department of Clinical Science, University of Bergen, NO-5020 Bergen, Norway
- Computational
Biology Unit, Department of Informatics, University of Bergen, NO-5020 Bergen, Norway
- Department
of Genetics and Bioinformatics, Health Data and Digitalization, Norwegian Institute of Public Health, N-0213 Oslo, Norway
| |
Collapse
|
19
|
De La Toba EA, Anapindi KDB, Sweedler JV. Assessment and Comparison of Database Search Engines for Peptidomic Applications. J Proteome Res 2023; 22:3123-3134. [PMID: 36809008 PMCID: PMC10440370 DOI: 10.1021/acs.jproteome.2c00307] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/23/2023]
Abstract
Protein database search engines are an integral component of mass spectrometry-based peptidomic analyses. Given the unique computational challenges of peptidomics, many factors must be taken into consideration when optimizing search engine selection, as each platform has different algorithms by which tandem mass spectra are scored for subsequent peptide identifications. In this study, four different database search engines, PEAKS, MS-GF+, OMSSA, and X! Tandem, were compared with Aplysia californica and Rattus norvegicus peptidomics data sets, and various metrics were assessed such as the number of unique peptide and neuropeptide identifications, and peptide length distributions. Given the tested conditions, PEAKS was found to have the highest number of peptide and neuropeptide identifications out of the four search engines in both data sets. Furthermore, principal component analysis and multivariate logistic regression were employed to determine whether specific spectral features contribute to false C-terminal amidation assignments by each search engine. From this analysis, it was found that the primary features influencing incorrect peptide assignments were the precursor and fragment ion m/z errors. Finally, an assessment employing a mixed species protein database was performed to evaluate search engine precision and sensitivity when searched against an enlarged search space containing human proteins.
Collapse
Affiliation(s)
- Eduardo A. De La Toba
- Beckman Institute of Advanced Science and Technology, University of Illinois at Urbana-Champaign, 61801
- Department of Chemistry, University of Illinois at Urbana-Champaign, 61801
| | - Krishna D. B. Anapindi
- Beckman Institute of Advanced Science and Technology, University of Illinois at Urbana-Champaign, 61801
- Department of Chemistry, University of Illinois at Urbana-Champaign, 61801
| | - Jonathan V. Sweedler
- Beckman Institute of Advanced Science and Technology, University of Illinois at Urbana-Champaign, 61801
- Department of Chemistry, University of Illinois at Urbana-Champaign, 61801
| |
Collapse
|
20
|
Ducker C, French S, Pathak M, Taylor H, Sainter A, Askem W, Dreveny I, Santana AEG, Pickett JA, Oldham NJ. Characterisation of geranylgeranyl diphosphate synthase from the sandfly Lutzomyia longipalpis. INSECT BIOCHEMISTRY AND MOLECULAR BIOLOGY 2023; 161:104001. [PMID: 37619821 DOI: 10.1016/j.ibmb.2023.104001] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/23/2023] [Revised: 08/02/2023] [Accepted: 08/20/2023] [Indexed: 08/26/2023]
Abstract
Leishmaniasis is a debilitating and often fatal neglected tropical disease. Males from sub-populations of the Leishmania-harbouring sandfly, Lutzomyia longipalpis, produce the diterpene sex and aggregation pheromone, sobralene, for which geranylgeranyl diphosphate (GGPP) is the likely isoprenoid precursor. We have identified a GGPP synthase (lzGGPPS) from L. longipalpis, which was recombinantly expressed in bacteria and purified for functional and kinetic analysis. In vitro enzymatic assays using LC-MS showed that lzGGPPS is an active enzyme, capable of converting substrates dimethylallyl diphosphate (DMAPP), (E)-geranyl diphosphate (GPP), (E,E)-farnesyl diphosphate (FPP) with co-substrate isopentenyl diphosphate (IPP) into (E,E,E)-GGPP, while (Z,E)-FPP was also accepted with low efficacy. Comparison of metal cofactors for lzGGPPS highlighted Mg2+ as most efficient, giving increased GGPP output when compared against other divalent metal ions tested. In line with previously characterised GGPPS enzymes, GGPP acted as an inhibitor of lzGGPPS activity. The molecular weight in solution of lzGGPPS was determined to be ∼221 kDa by analytical SEC, suggesting a hexameric assembly, as seen in the human enzyme, and representing the first assessment of GGPPS quaternary structure in insects.
Collapse
Affiliation(s)
- Charles Ducker
- School of Chemistry, University of Nottingham, University Park, Nottingham, NG7 2RD, UK
| | - Stanley French
- School of Chemistry, University of Nottingham, University Park, Nottingham, NG7 2RD, UK
| | - Monika Pathak
- School of Pharmacy, University of Nottingham, University Park, Nottingham, NG7 2RD, UK
| | - Harry Taylor
- School of Chemistry, University of Nottingham, University Park, Nottingham, NG7 2RD, UK
| | - Adam Sainter
- School of Chemistry, University of Nottingham, University Park, Nottingham, NG7 2RD, UK
| | - William Askem
- School of Chemistry, University of Nottingham, University Park, Nottingham, NG7 2RD, UK
| | - Ingrid Dreveny
- School of Pharmacy, University of Nottingham, University Park, Nottingham, NG7 2RD, UK
| | | | - John A Pickett
- School of Chemistry, Cardiff University, Main Building, Park Pl, Cardiff, CF10 3AT, UK
| | - Neil J Oldham
- School of Chemistry, University of Nottingham, University Park, Nottingham, NG7 2RD, UK.
| |
Collapse
|
21
|
Garge RK, Geck RC, Armstrong JO, Dunn B, Boutz DR, Battenhouse A, Leutert M, Dang V, Jiang P, Kwiatkowski D, Peiser T, McElroy H, Marcotte EM, Dunham MJ. Systematic Profiling of Ale Yeast Protein Dynamics across Fermentation and Repitching. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.09.21.558736. [PMID: 37790497 PMCID: PMC10543003 DOI: 10.1101/2023.09.21.558736] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/05/2023]
Abstract
Studying the genetic and molecular characteristics of brewing yeast strains is crucial for understanding their domestication history and adaptations accumulated over time in fermentation environments, and for guiding optimizations to the brewing process itself. Saccharomyces cerevisiae (brewing yeast) is amongst the most profiled organisms on the planet, yet the temporal molecular changes that underlie industrial fermentation and beer brewing remain understudied. Here, we characterized the genomic makeup of a Saccharomyces cerevisiae ale yeast widely used in the production of Hefeweizen beers, and applied shotgun mass spectrometry to systematically measure the proteomic changes throughout two fermentation cycles which were separated by 14 rounds of serial repitching. The resulting brewing yeast proteomics resource includes 64,740 protein abundance measurements. We found that this strain possesses typical genetic characteristics of Saccharomyces cerevisiae ale strains and displayed progressive shifts in molecular processes during fermentation based on protein abundance changes. We observed protein abundance differences between early fermentation batches compared to those separated by 14 rounds of serial repitching. The observed abundance differences occurred mainly in proteins involved in the metabolism of ergosterol and isobutyraldehyde. Our systematic profiling serves as a starting point for deeper characterization of how the yeast proteome changes during commercial fermentations and additionally serves as a resource to guide fermentation protocols, strain handling, and engineering practices in commercial brewing and fermentation environments. Finally, we created a web interface (https://brewing-yeast-proteomics.ccbb.utexas.edu/) to serve as a valuable resource for yeast geneticists, brewers, and biochemists to provide insights into the global trends underlying commercial beer production.
Collapse
Affiliation(s)
- Riddhiman K. Garge
- Department of Genome Sciences, University of Washington, Seattle, Washington, USA
- Department of Molecular Biosciences, The University of Texas at Austin, Austin, Texas, USA
| | - Renee C. Geck
- Department of Genome Sciences, University of Washington, Seattle, Washington, USA
| | - Joseph O. Armstrong
- Department of Genome Sciences, University of Washington, Seattle, Washington, USA
| | - Barbara Dunn
- Department of Genome Sciences, University of Washington, Seattle, Washington, USA
| | - Daniel R. Boutz
- Department of Molecular Biosciences, The University of Texas at Austin, Austin, Texas, USA
- Houston Methodist Research Institute, Houston, Texas, USA
| | - Anna Battenhouse
- Department of Molecular Biosciences, The University of Texas at Austin, Austin, Texas, USA
| | - Mario Leutert
- Department of Genome Sciences, University of Washington, Seattle, Washington, USA
- Institute of Molecular Systems Biology, ETH Zürich, Zürich, Switzerland
| | - Vy Dang
- Department of Molecular Biosciences, The University of Texas at Austin, Austin, Texas, USA
| | - Pengyao Jiang
- Department of Genome Sciences, University of Washington, Seattle, Washington, USA
| | | | | | | | - Edward M. Marcotte
- Department of Molecular Biosciences, The University of Texas at Austin, Austin, Texas, USA
| | - Maitreya J. Dunham
- Department of Genome Sciences, University of Washington, Seattle, Washington, USA
| |
Collapse
|
22
|
Bihani S, Gupta A, Mehta S, Rajczewski AT, Johnson J, Borishetty D, Griffin TJ, Srivastava S, Jagtap PD. Metaproteomic Analysis of Nasopharyngeal Swab Samples to Identify Microbial Peptides in COVID-19 Patients. J Proteome Res 2023; 22:2608-2619. [PMID: 37450889 DOI: 10.1021/acs.jproteome.3c00040] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/18/2023]
Abstract
During the COVID-19 pandemic, impaired immunity and medical interventions resulted in cases of secondary infections. The clinical difficulties and dangers associated with secondary infections in patients necessitate the exploration of their microbiome. Metaproteomics is a powerful approach to study the taxonomic composition and functional status of the microbiome under study. In this study, the mass spectrometry (MS)-based data of nasopharyngeal swab samples from COVID-19 patients was used to investigate the metaproteome. We have established a robust bioinformatics workflow within the Galaxy platform, which includes (a) generation of a tailored database of the common respiratory tract pathogens, (b) database search using multiple search algorithms, and (c) verification of the detected microbial peptides. The microbial peptides detected in this study, belong to several opportunistic pathogens such as Streptococcus pneumoniae, Klebsiella pneumoniae, Rhizopus microsporus, and Syncephalastrum racemosum. Microbial proteins with a role in stress response, gene expression, and DNA repair were found to be upregulated in severe patients compared to negative patients. Using parallel reaction monitoring (PRM), we confirmed some of the microbial peptides in fresh clinical samples. MS-based clinical metaproteomics can serve as a powerful tool for detection and characterization of potential pathogens, which can significantly impact the diagnosis and treatment of patients.
Collapse
Affiliation(s)
- Surbhi Bihani
- Department of Bioscience and Bioengineering, Indian Institute of Technology Bombay, Mumbai, Maharashtra 400076, India
| | - Aryan Gupta
- Department of Chemistry, Indian Institute of Technology Bombay, Mumbai, Maharashtra 400076, India
| | - Subina Mehta
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, 7-129 MCB, 420 Washington Ave SE, Minneapolis, Minnesota 55455, United States
| | - Andrew T Rajczewski
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, 7-129 MCB, 420 Washington Ave SE, Minneapolis, Minnesota 55455, United States
| | - James Johnson
- Minnesota Supercomputing Institute, University of Minnesota, Minneapolis, Minnesota 55455, United States
| | - Dhanush Borishetty
- Department of Bioscience and Bioengineering, Indian Institute of Technology Bombay, Mumbai, Maharashtra 400076, India
| | - Timothy J Griffin
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, 7-129 MCB, 420 Washington Ave SE, Minneapolis, Minnesota 55455, United States
| | - Sanjeeva Srivastava
- Department of Bioscience and Bioengineering, Indian Institute of Technology Bombay, Mumbai, Maharashtra 400076, India
| | - Pratik D Jagtap
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, 7-129 MCB, 420 Washington Ave SE, Minneapolis, Minnesota 55455, United States
| |
Collapse
|
23
|
Lloyd J, Biasutto A, Dürr KL, Jazayeri A, Hopper JT, Oldham NJ. Mapping the Binding Interactions between Human Gasdermin D and Human Caspase-1 Using Carbene Footprinting. JACS AU 2023; 3:2025-2035. [PMID: 37502151 PMCID: PMC10369405 DOI: 10.1021/jacsau.3c00236] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/10/2023] [Revised: 06/09/2023] [Accepted: 06/09/2023] [Indexed: 07/29/2023]
Abstract
Carbene footprinting is a recently developed mass spectrometry-based chemical labeling technique that probes protein interactions and conformation. Here, we use the methodology to investigate binding interactions between the protease human Caspase-1 (C285A) and full-length human Gasdermin D (hGSDMD), which are important in inflammatory cell death. GSDMD is cleaved by Caspase-1, releasing its N-terminal domain which oligomerizes in the membrane to form large pores, resulting in lytic cell death. Regions of reduced carbene labeling (masking), caused by protein binding, were observed for each partner in the presence of the other and were consistent with hCaspase-1 exosite and active-site interactions. Most notably, the results showed direct occupancy of hCaspase-1 (C285A) active-site by hGSDMD for the first time. Differential carbene labeling of full-length hGSDMD and the pore-forming N-terminal domain assembled in liposomes showed masking of the latter, consistent with oligomeric assembly and insertion into the lipid bilayer. Interactions between Caspase-1 and the specific inhibitor VRT-043198 were also studied by this approach. In wild-type hCaspase-1, VRT-043198 modifies the active-site Cys285 through the formation of a S,O-hemiacetal. Here, we showed by carbene labeling that this inhibitor can noncovalently occupy the active site of a C285A mutant. These findings add considerably to our knowledge of the hCaspase-1-hGSDMD system.
Collapse
Affiliation(s)
- James
R. Lloyd
- School
of Chemistry, University of Nottingham, University Park, Nottingham NG7 2RD, U.K.
| | - Antonio Biasutto
- OMass
Therapeutics, Schrodinger Building, Oxford Science Park, Oxford OX4 4GE, U.K.
| | - Katharina L. Dürr
- OMass
Therapeutics, Schrodinger Building, Oxford Science Park, Oxford OX4 4GE, U.K.
| | - Ali Jazayeri
- OMass
Therapeutics, Schrodinger Building, Oxford Science Park, Oxford OX4 4GE, U.K.
| | - Jonathan T.S. Hopper
- OMass
Therapeutics, Schrodinger Building, Oxford Science Park, Oxford OX4 4GE, U.K.
| | - Neil J. Oldham
- School
of Chemistry, University of Nottingham, University Park, Nottingham NG7 2RD, U.K.
| |
Collapse
|
24
|
van Leeuwen SJM, Proctor GB, Staes A, Laheij AMGA, Potting CMJ, Brennan MT, von Bültzingslöwen I, Rozema FR, Hazenberg MD, Blijlevens NMA, Raber-Durlacher JE, Huysmans MCDNJM. The salivary proteome in relation to oral mucositis in autologous hematopoietic stem cell transplantation recipients: a labelled and label-free proteomics approach. BMC Oral Health 2023; 23:460. [PMID: 37420206 PMCID: PMC10329372 DOI: 10.1186/s12903-023-03190-w] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2022] [Accepted: 06/30/2023] [Indexed: 07/09/2023] Open
Abstract
BACKGROUND Oral mucositis is a frequently seen complication in the first weeks after hematopoietic stem cell transplantation recipients which can severely affects patients quality of life. In this study, a labelled and label-free proteomics approach were used to identify differences between the salivary proteomes of autologous hematopoietic stem cell transplantation (ASCT) recipients developing ulcerative oral mucositis (ULC-OM; WHO score ≥ 2) or not (NON-OM). METHODS In the TMT-labelled analysis we pooled saliva samples from 5 ULC-OM patients at each of 5 timepoints: baseline, 1, 2, 3 weeks and 3 months after ASCT and compared these with pooled samples from 5 NON-OM patients. For the label-free analysis we analyzed saliva samples from 9 ULC-OM and 10 NON-OM patients at 6 different timepoints (including 12 months after ASCT) with Data-Independent Acquisition (DIA). As spectral library, all samples were grouped (ULC-OM vs NON-OM) and analyzed with Data Dependent Analysis (DDA). PCA plots and a volcano plot were generated in RStudio and differently regulated proteins were analyzed using GO analysis with g:Profiler. RESULTS A different clustering of ULC-OM pools was found at baseline, weeks 2 and 3 after ASCT with TMT-labelled analysis. Using label-free analysis, week 1-3 samples clustered distinctly from the other timepoints. Unique and up-regulated proteins in the NON-OM group (DDA analysis) were involved in immune system-related processes, while those proteins in the ULC-OM group were intracellular proteins indicating cell lysis. CONCLUSIONS The salivary proteome in ASCT recipients has a tissue protective or tissue-damage signature, that corresponded with the absence or presence of ulcerative oral mucositis, respectively. TRIAL REGISTRATION The study is registered in the national trial register (NTR5760; automatically added to the International Clinical Trial Registry Platform).
Collapse
Affiliation(s)
- S J M van Leeuwen
- Department of Dentistry, Radboud University Medical Center, Nijmegen, The Netherlands.
| | - G B Proctor
- Centre for Host Microbiome Interactions, King's College London Dental Institute, London, UK
| | - A Staes
- VIB Proteomics Core, VIB Center for Medical Biotechnology, Ghent, Belgium
- Department of Biomolecular Medicine, Ghent University, Ghent, Belgium
| | - A M G A Laheij
- Department of Oral Medicine, Academic Centre for Dentistry Amsterdam, University of Amsterdam and VU University, Amsterdam, The Netherlands
- Department of Preventive Dentistry, Academic Centre for Dentistry Amsterdam, University of Amsterdam and VU University, Amsterdam, The Netherlands
- Department of Oral and Maxillofacial Surgery, Amsterdam UMC, University of Amsterdam, Amsterdam, The Netherlands
| | - C M J Potting
- Department of Hematology, Radboud University Medical Center, Nijmegen, The Netherlands
| | - M T Brennan
- Department of Oral Medicine/Oral and Maxillofacial Surgery, Atrium Health Carolinas Medical Centre, NC, Charlotte, USA
- Department of Otolaryngology/Head and Neck Surgery, Wake Forest University School of Medicine, NC, Winston-Salem, USA
| | - I von Bültzingslöwen
- Department of Oral Microbiology and Immunology, Institute of Odontology, The Sahlgrenska Academy, University of Gothenburg, Gothenburg, Sweden
| | - F R Rozema
- Department of Oral Medicine, Academic Centre for Dentistry Amsterdam, University of Amsterdam and VU University, Amsterdam, The Netherlands
- Department of Oral and Maxillofacial Surgery, Amsterdam UMC, University of Amsterdam, Amsterdam, The Netherlands
| | - M D Hazenberg
- Department of Hematology, Amsterdam UMC, University of Amsterdam, Amsterdam, The Netherlands
- Department of Hematopoiesis, Sanquin Research, Amsterdam, The Netherlands
| | - N M A Blijlevens
- Department of Hematology, Radboud University Medical Center, Nijmegen, The Netherlands
| | - J E Raber-Durlacher
- Department of Oral Medicine, Academic Centre for Dentistry Amsterdam, University of Amsterdam and VU University, Amsterdam, The Netherlands
- Department of Oral and Maxillofacial Surgery, Amsterdam UMC, University of Amsterdam, Amsterdam, The Netherlands
| | - M C D N J M Huysmans
- Department of Dentistry, Radboud University Medical Center, Nijmegen, The Netherlands
| |
Collapse
|
25
|
Mehta S, Bernt M, Chambers M, Fahrner M, Föll MC, Gruening B, Horro C, Johnson JE, Loux V, Rajczewski AT, Schilling O, Vandenbrouck Y, Gustafsson OJR, Thang WCM, Hyde C, Price G, Jagtap PD, Griffin TJ. A Galaxy of informatics resources for MS-based proteomics. Expert Rev Proteomics 2023; 20:251-266. [PMID: 37787106 DOI: 10.1080/14789450.2023.2265062] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2023] [Accepted: 09/06/2023] [Indexed: 10/04/2023]
Abstract
INTRODUCTION Continuous advances in mass spectrometry (MS) technologies have enabled deeper and more reproducible proteome characterization and a better understanding of biological systems when integrated with other 'omics data. Bioinformatic resources meeting the analysis requirements of increasingly complex MS-based proteomic data and associated multi-omic data are critically needed. These requirements included availability of software that would span diverse types of analyses, scalability for large-scale, compute-intensive applications, and mechanisms to ease adoption of the software. AREAS COVERED The Galaxy ecosystem meets these requirements by offering a multitude of open-source tools for MS-based proteomics analyses and applications, all in an adaptable, scalable, and accessible computing environment. A thriving global community maintains these software and associated training resources to empower researcher-driven analyses. EXPERT OPINION The community-supported Galaxy ecosystem remains a crucial contributor to basic biological and clinical studies using MS-based proteomics. In addition to the current status of Galaxy-based resources, we describe ongoing developments for meeting emerging challenges in MS-based proteomic informatics. We hope this review will catalyze increased use of Galaxy by researchers employing MS-based proteomics and inspire software developers to join the community and implement new tools, workflows, and associated training content that will add further value to this already rich ecosystem.
Collapse
Affiliation(s)
- Subina Mehta
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, Minneapolis, MN, USA
| | - Matthias Bernt
- Helmholtz Centre for Environmental Research - UFZ, Department Computational Biology, Leipzig, Germany
| | | | - Matthias Fahrner
- Institute for Surgical Pathology, Medical Center - University of Freiburg, Freiburg, Germany
- German Cancer Consortium (DKTK) and German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Melanie Christine Föll
- Institute for Surgical Pathology, Medical Center - University of Freiburg, Freiburg, Germany
- German Cancer Consortium (DKTK) and German Cancer Research Center (DKFZ), Heidelberg, Germany
- Khoury College of Computer Sciences, Northeastern University, Boston, MA, USA
| | - Bjoern Gruening
- Bioinformatics Group, Department of Computer Science, Albert-Ludwigs-University Freiburg, Freiburg, Germany
| | - Carlos Horro
- Proteomics Unit, Department of Biomedicine, University of Bergen, Bergen, Norway
- Computational Biology Unit, Department of Informatics, University of Bergen, Bergen, Norway
| | - James E Johnson
- Minnesota Supercomputing Institute, University of Minnesota, Minneapolis, MN, USA
| | - Valentin Loux
- Université Paris-Saclay, INRAE, MaIAGE, Jouy-en-Josas, France
- Université Paris-Saclay, INRAE, BioinfOmics, MIGALE bioinformatics facility, Jouy-en-Josas, France
| | - Andrew T Rajczewski
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, Minneapolis, MN, USA
| | - Oliver Schilling
- Institute for Surgical Pathology, Medical Center - University of Freiburg, Freiburg, Germany
- German Cancer Consortium (DKTK) and German Cancer Research Center (DKFZ), Heidelberg, Germany
| | | | | | - W C Mike Thang
- Queensland Cyber Infrastructure Foundation (QCIF), Australia
- Institute of Molecular Bioscience, University of Queensland, St Lucia, Australia
| | - Cameron Hyde
- Queensland Cyber Infrastructure Foundation (QCIF), Australia
- Sippy Downs, University of the Sunshine Coast, Australia
| | - Gareth Price
- Queensland Cyber Infrastructure Foundation (QCIF), Australia
- Institute of Molecular Bioscience, University of Queensland, St Lucia, Australia
| | - Pratik D Jagtap
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, Minneapolis, MN, USA
| | - Timothy J Griffin
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, Minneapolis, MN, USA
| |
Collapse
|
26
|
Bakker R, Ellers J, Roelofs D, Vooijs R, Dijkstra T, van Gestel CAM, Hoedjes KM. Combining time-resolved transcriptomics and proteomics data for Adverse Outcome Pathway refinement in ecotoxicology. THE SCIENCE OF THE TOTAL ENVIRONMENT 2023; 869:161740. [PMID: 36708843 DOI: 10.1016/j.scitotenv.2023.161740] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/06/2022] [Revised: 01/14/2023] [Accepted: 01/17/2023] [Indexed: 06/18/2023]
Abstract
Conventional Environmental Risk Assessment (ERA) of pesticide pollution is based on soil concentrations and apical endpoints, such as the reproduction of test organisms, but has traditionally disregarded information along the organismal response cascade leading to an adverse outcome. The Adverse Outcome Pathway (AOP) framework includes response information at any level of biological organization, providing opportunities to use intermediate responses as a predictive read-out for adverse outcomes instead. Transcriptomic and proteomic data can provide thousands of data points on the response to toxic exposure. Combining multiple omics data types is necessary for a comprehensive overview of the response cascade and, therefore, AOP development. However, it is unclear if transcript and protein responses are synchronized in time or time lagged. To understand if analysis of multi-omics data obtained at the same timepoint reveal one synchronized response cascade, we studied time-resolved shifts in gene transcript and protein abundance in the springtail Folsomia candida, a soil ecotoxicological model, after exposure to the neonicotinoid insecticide imidacloprid. We analyzed transcriptome and proteome data every 12 h up to 72 h after onset of exposure. The most pronounced shift in both transcript and protein abundances was observed after 48 h exposure. Moreover, cross-correlation analyses indicate that most genes displayed the highest correlation between transcript and protein abundances without a time-lag. This demonstrates that a combined analysis of transcriptomic and proteomic data from the same time-point can be used for AOP improvement. This data will promote the development of biomarkers for the presence of neonicotinoid insecticides or chemicals with a similar mechanism of action in soils.
Collapse
Affiliation(s)
- Ruben Bakker
- Amsterdam Institute for Life and Environment (A-LIFE), Faculty of Science, Vrije Universiteit Amsterdam, De Boelelaan 1085, 1081 HV Amsterdam, the Netherlands
| | - Jacintha Ellers
- Amsterdam Institute for Life and Environment (A-LIFE), Faculty of Science, Vrije Universiteit Amsterdam, De Boelelaan 1085, 1081 HV Amsterdam, the Netherlands
| | - Dick Roelofs
- Keygene N.V., Agro Business Park 90, 6708 PW Wageningen, the Netherlands
| | - Riet Vooijs
- Amsterdam Institute for Life and Environment (A-LIFE), Faculty of Science, Vrije Universiteit Amsterdam, De Boelelaan 1085, 1081 HV Amsterdam, the Netherlands
| | - Tjeerd Dijkstra
- Max Planck Institute for Developmental Biology, Max-Planck-Ring 25, D-72076 Tübingen, Germany
| | - Cornelis A M van Gestel
- Amsterdam Institute for Life and Environment (A-LIFE), Faculty of Science, Vrije Universiteit Amsterdam, De Boelelaan 1085, 1081 HV Amsterdam, the Netherlands
| | - Katja M Hoedjes
- Amsterdam Institute for Life and Environment (A-LIFE), Faculty of Science, Vrije Universiteit Amsterdam, De Boelelaan 1085, 1081 HV Amsterdam, the Netherlands.
| |
Collapse
|
27
|
Elliff J, Biswas A, Roshan P, Kuppa S, Patterson A, Mattice J, Chinnaraj M, Burd R, Walker SE, Pozzi N, Antony E, Bothner B, Origanti S. Dynamic states of eIF6 and SDS variants modulate interactions with uL14 of the 60S ribosomal subunit. Nucleic Acids Res 2023; 51:1803-1822. [PMID: 36651285 PMCID: PMC9976893 DOI: 10.1093/nar/gkac1266] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2022] [Revised: 12/20/2022] [Accepted: 12/22/2022] [Indexed: 01/19/2023] Open
Abstract
Assembly of ribosomal subunits into active ribosomal complexes is integral to protein synthesis. Release of eIF6 from the 60S ribosomal subunit primes 60S to associate with the 40S subunit and engage in translation. The dynamics of eIF6 interaction with the uL14 (RPL23) interface of 60S and its perturbation by somatic mutations acquired in Shwachman-Diamond Syndrome (SDS) is yet to be clearly understood. Here, by using a modified strategy to obtain high yields of recombinant human eIF6 we have uncovered the critical interface entailing eight key residues in the C-tail of uL14 that is essential for physical interactions between 60S and eIF6. Disruption of the complementary binding interface by conformational changes in eIF6 disease variants provide a mechanism for weakened interactions of variants with the 60S. Hydrogen-deuterium exchange mass spectrometry (HDX-MS) analyses uncovered dynamic configurational rearrangements in eIF6 induced by binding to uL14 and exposed an allosteric interface regulated by the C-tail of eIF6. Disrupting key residues in the eIF6-60S binding interface markedly limits proliferation of cancer cells, which highlights the significance of therapeutically targeting this interface. Establishing these key interfaces thus provide a therapeutic framework for targeting eIF6 in cancers and SDS.
Collapse
Affiliation(s)
- Jonah Elliff
- Department of Biological Sciences, Marquette University, Milwaukee, WI 53233, USA
- Department of Immunology, The University of Iowa, Iowa City, IA 52242, USA
| | - Aparna Biswas
- Department of Biology, Saint Louis University, St. Louis, MO 63103, USA
| | - Poonam Roshan
- Department of Biology, Saint Louis University, St. Louis, MO 63103, USA
| | - Sahiti Kuppa
- Department of Biochemistry and Molecular Biology, Saint Louis University School of Medicine, MO 63104, USA
| | - Angela Patterson
- Department of Chemistry and Biochemistry, Montana State University, Bozeman, MT 59717, USA
| | - Jenna Mattice
- Department of Chemistry and Biochemistry, Montana State University, Bozeman, MT 59717, USA
| | - Mathivanan Chinnaraj
- Department of Biochemistry and Molecular Biology, Saint Louis University School of Medicine, MO 63104, USA
| | - Ryan Burd
- Department of Biological Sciences, Marquette University, Milwaukee, WI 53233, USA
| | - Sarah E Walker
- Department of Biological Sciences, State University of New York, Buffalo, NY 14260, USA
| | - Nicola Pozzi
- Department of Biochemistry and Molecular Biology, Saint Louis University School of Medicine, MO 63104, USA
| | - Edwin Antony
- Department of Biochemistry and Molecular Biology, Saint Louis University School of Medicine, MO 63104, USA
| | - Brian Bothner
- Department of Chemistry and Biochemistry, Montana State University, Bozeman, MT 59717, USA
| | - Sofia Origanti
- Department of Biology, Saint Louis University, St. Louis, MO 63103, USA
| |
Collapse
|
28
|
Probing the E1o-E2o and E1a-E2o Interactions in Binary Subcomplexes of the Human 2-Oxoglutarate Dehydrogenase and 2-Oxoadipate Dehydrogenase Complexes by Chemical Cross-Linking Mass Spectrometry and Molecular Dynamics Simulation. Int J Mol Sci 2023; 24:ijms24054555. [PMID: 36901986 PMCID: PMC10003691 DOI: 10.3390/ijms24054555] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2022] [Revised: 02/22/2023] [Accepted: 02/23/2023] [Indexed: 03/02/2023] Open
Abstract
The human 2-oxoglutarate dehydrogenase complex (hOGDHc) is a key enzyme in the tricarboxylic acid cycle and is one of the main regulators of mitochondrial metabolism through NADH and reactive oxygen species levels. Evidence was obtained for formation of a hybrid complex between the hOGDHc and its homologue the 2-oxoadipate dehydrogenase complex (hOADHc) in the L-lysine metabolic pathway, suggesting a crosstalk between the two distinct pathways. Findings raised fundamental questions about the assembly of hE1a (2-oxoadipate-dependent E1 component) and hE1o (2-oxoglutarate-dependent E1) to the common hE2o core component. Here we report chemical cross-linking mass spectrometry (CL-MS) and molecular dynamics (MD) simulation analyses to understand assembly in binary subcomplexes. The CL-MS studies revealed the most prominent loci for hE1o-hE2o and hE1a-hE2o interactions and suggested different binding modes. The MD simulation studies led to the following conclusions: (i) The N-terminal regions in E1s are shielded by, but do not interact directly with hE2o. (ii) The hE2o linker region exhibits the highest number of H-bonds with the N-terminus and α/β1 helix of hE1o, yet with the interdomain linker and α/β1 helix of hE1a. (iii) The C-termini are involved in dynamic interactions in complexes, suggesting the presence of at least two conformations in solution.
Collapse
|
29
|
Canderan J, Stamboulian M, Ye Y. MetaProD: A Highly-Configurable Mass Spectrometry Analyzer for Multiplexed Proteomic and Metaproteomic Data. J Proteome Res 2023; 22:442-453. [PMID: 36688801 PMCID: PMC9903327 DOI: 10.1021/acs.jproteome.2c00614] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2022] [Indexed: 01/24/2023]
Abstract
The microbiome has been shown to be important for human health because of its influence on disease and the immune response. Mass spectrometry is an important tool for evaluating protein expression and species composition in the microbiome but is technically challenging and time-consuming. Multiplexing has emerged as a way to make spectrometry workflows faster while improving results. Here, we present MetaProD (MetaProteomics in Django) as a highly configurable metaproteomic data analysis pipeline supporting label-free and multiplexed mass spectrometry. The pipeline is open-source, uses fully open-source tools, and is integrated with Django to offer a web-based interface for configuration and data access. Benchmarking of MetaProD using multiple metaproteomics data sets showed that MetaProD achieved fast and efficient identification of peptides and proteins. Application of MetaProD to a multiplexed cancer data set resulted in identification of more differentially expressed human proteins in cancer tissues versus healthy tissues as compared to previous studies; in addition, MetaProD identified bacterial proteins in those samples, some of which are differentially abundant.
Collapse
Affiliation(s)
- Jamie Canderan
- Informatics
Department, Luddy School of Informatics, Computing and Engineering, Indiana University, Bloomington, Indiana 47408, United States
| | - Moses Stamboulian
- Informatics
Department, Luddy School of Informatics, Computing and Engineering, Indiana University, Bloomington, Indiana 47408, United States
| | - Yuzhen Ye
- Computer
Science Department, Luddy School of Informatics, Computing and Engineering, Indiana University, Bloomington, Indiana 47408, United States
| |
Collapse
|
30
|
Debrie E, Malfait M, Gabriels R, Declerq A, Sticker A, Martens L, Clement L. Quality Control for the Target Decoy Approach for Peptide Identification. J Proteome Res 2023; 22:350-358. [PMID: 36648107 DOI: 10.1021/acs.jproteome.2c00423] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023]
Abstract
Reliable peptide identification is key in mass spectrometry (MS) based proteomics. To this end, the target decoy approach (TDA) has become the cornerstone for extracting a set of reliable peptide-to-spectrum matches (PSMs) that will be used in downstream analysis. Indeed, TDA is now the default method to estimate the false discovery rate (FDR) for a given set of PSMs, and users typically view it as a universal solution for assessing the FDR in the peptide identification step. However, the TDA also relies on a minimal set of assumptions, which are typically never verified in practice. We argue that a violation of these assumptions can lead to poor FDR control, which can be detrimental to any downstream data analysis. We here therefore first clearly spell out these TDA assumptions, and introduce TargetDecoy, a Bioconductor package with all the necessary functionality to control the TDA quality and its underlying assumptions for a given set of PSMs.
Collapse
Affiliation(s)
- Elke Debrie
- Department of Applied Mathematics, Computer Science and Statistics, Ghent University, 9000Ghent, Belgium
| | - Milan Malfait
- Department of Applied Mathematics, Computer Science and Statistics, Ghent University, 9000Ghent, Belgium.,Statistics and Decision Sciences, Janssen Pharmaceutical Companies of Johnson and Johnson, 2340Beerse, Belgium
| | - Ralf Gabriels
- VIB-UGent Center for Medical Biotechnology, VIB, 9052Ghent, Belgium.,Department of Biomolecular Medicine, Ghent University, 9000Ghent, Belgium
| | - Arthur Declerq
- VIB-UGent Center for Medical Biotechnology, VIB, 9052Ghent, Belgium.,Department of Biomolecular Medicine, Ghent University, 9000Ghent, Belgium
| | - Adriaan Sticker
- Department of Applied Mathematics, Computer Science and Statistics, Ghent University, 9000Ghent, Belgium.,VIB-UGent Center for Medical Biotechnology, VIB, 9052Ghent, Belgium.,Department of Biomolecular Medicine, Ghent University, 9000Ghent, Belgium
| | - Lennart Martens
- VIB-UGent Center for Medical Biotechnology, VIB, 9052Ghent, Belgium.,Department of Biomolecular Medicine, Ghent University, 9000Ghent, Belgium
| | - Lieven Clement
- Department of Applied Mathematics, Computer Science and Statistics, Ghent University, 9000Ghent, Belgium
| |
Collapse
|
31
|
Kazieva LS, Farafonova TE, Zgoda VG. [Antibody proteomics]. BIOMEDITSINSKAIA KHIMIIA 2023; 69:5-18. [PMID: 36857423 DOI: 10.18097/pbmc20236901005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/02/2023]
Abstract
Antibodies represent an essential component of humoral immunity; therefore their study is important for molecular biology and medicine. The unique property of antibodies to specifically recognize and bind a certain molecular target (an antigen) determines their widespread application in treatment and diagnostics of diseases, as well as in laboratory and biotechnological practices. High specificity and affinity of antibodies is determined by the presence of primary structure variable regions, which are not encoded in the human genome and are unique for each antibody-producing B cell clone. Hence, there is little or no information about amino acid sequences of the variable regions in the databases. This differs identification of antibody primary structure from most of the proteomic studies because it requires either B cell genome sequencing or de novo amino acid sequencing of the antibody. The present review demonstrates some examples of proteomic and proteogenomic approaches and the methodological arsenal that proteomics can offer for studying antibodies, in particular, for identification of primary structure, evaluation of posttranslational modifications and application of bioinformatics tools for their decoding.
Collapse
Affiliation(s)
- L Sh Kazieva
- Institute of Biomedical Chemistry, Moscow, Russia
| | | | - V G Zgoda
- Institute of Biomedical Chemistry, Moscow, Russia
| |
Collapse
|
32
|
Poverennaya EV, Pyatnitskiy MA, Dolgalev GV, Arzumanian VA, Kiseleva OI, Kurbatov IY, Kurbatov LK, Vakhrushev IV, Romashin DD, Kim YS, Ponomarenko EA. Exploiting Multi-Omics Profiling and Systems Biology to Investigate Functions of TOMM34. BIOLOGY 2023; 12:198. [PMID: 36829477 PMCID: PMC9952762 DOI: 10.3390/biology12020198] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/06/2022] [Revised: 01/17/2023] [Accepted: 01/25/2023] [Indexed: 01/31/2023]
Abstract
Although modern biology is now in the post-genomic era with vastly increased access to high-quality data, the set of human genes with a known function remains far from complete. This is especially true for hundreds of mitochondria-associated genes, which are under-characterized and lack clear functional annotation. However, with the advent of multi-omics profiling methods coupled with systems biology algorithms, the cellular role of many such genes can be elucidated. Here, we report genes and pathways associated with TOMM34, Translocase of Outer Mitochondrial Membrane, which plays role in the mitochondrial protein import as a part of cytosolic complex together with Hsp70/Hsp90 and is upregulated in various cancers. We identified genes, proteins, and metabolites altered in TOMM34-/- HepG2 cells. To our knowledge, this is the first attempt to study the functional capacity of TOMM34 using a multi-omics strategy. We demonstrate that TOMM34 affects various processes including oxidative phosphorylation, citric acid cycle, metabolism of purine, and several amino acids. Besides the analysis of already known pathways, we utilized de novo network enrichment algorithm to extract novel perturbed subnetworks, thus obtaining evidence that TOMM34 potentially plays role in several other cellular processes, including NOTCH-, MAPK-, and STAT3-signaling. Collectively, our findings provide new insights into TOMM34's cellular functions.
Collapse
Affiliation(s)
| | - Mikhail A. Pyatnitskiy
- Institute of Biomedical Chemistry, Moscow 119121, Russia
- Faculty Of Computer Science, National Research University Higher School of Economics, Moscow 101000, Russia
| | | | | | | | | | | | | | | | - Yan S. Kim
- Institute of Biomedical Chemistry, Moscow 119121, Russia
| | | |
Collapse
|
33
|
Beslic D, Tscheuschner G, Renard BY, Weller MG, Muth T. Comprehensive evaluation of peptide de novo sequencing tools for monoclonal antibody assembly. Brief Bioinform 2023; 24:bbac542. [PMID: 36545804 PMCID: PMC9851299 DOI: 10.1093/bib/bbac542] [Citation(s) in RCA: 8] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2022] [Revised: 10/25/2022] [Accepted: 11/10/2022] [Indexed: 12/24/2022] Open
Abstract
Monoclonal antibodies are biotechnologically produced proteins with various applications in research, therapeutics and diagnostics. Their ability to recognize and bind to specific molecule structures makes them essential research tools and therapeutic agents. Sequence information of antibodies is helpful for understanding antibody-antigen interactions and ensuring their affinity and specificity. De novo protein sequencing based on mass spectrometry is a valuable method to obtain the amino acid sequence of peptides and proteins without a priori knowledge. In this study, we evaluated six recently developed de novo peptide sequencing algorithms (Novor, pNovo 3, DeepNovo, SMSNet, PointNovo and Casanovo), which were not specifically designed for antibody data. We validated their ability to identify and assemble antibody sequences on three multi-enzymatic data sets. The deep learning-based tools Casanovo and PointNovo showed an increased peptide recall across different enzymes and data sets compared with spectrum-graph-based approaches. We evaluated different error types of de novo peptide sequencing tools and their performance for different numbers of missing cleavage sites, noisy spectra and peptides of various lengths. We achieved a sequence coverage of 97.69-99.53% on the light chains of three different antibody data sets using the de Bruijn assembler ALPS and the predictions from Casanovo. However, low sequence coverage and accuracy on the heavy chains demonstrate that complete de novo protein sequencing remains a challenging issue in proteomics that requires improved de novo error correction, alternative digestion strategies and hybrid approaches such as homology search to achieve high accuracy on long protein sequences.
Collapse
Affiliation(s)
- Denis Beslic
- Robert Koch Institute, MF1, Nordufer 20, 13353 Berlin
| | - Georg Tscheuschner
- Federal Institute for Materials Research and Testing (BAM), Richard-Willstätter-Straße 11, 12489 Berlin
| | - Bernhard Y Renard
- Hasso Plattner Institute, Digital Engineering Faculty, University of Potsdam, Prof.-Dr.-Helmert-Straße 2-3, 14482 Potsdam
| | - Michael G Weller
- Federal Institute for Materials Research and Testing (BAM), Richard-Willstätter-Straße 11, 12489 Berlin
| | - Thilo Muth
- Federal Institute for Materials Research and Testing (BAM), Richard-Willstätter-Straße 11, 12489 Berlin
| |
Collapse
|
34
|
Millikin RJ, Shortreed MR, Scalf M, Smith LM. Fast, Free, and Flexible Peptide and Protein Quantification with FlashLFQ. Methods Mol Biol 2023; 2426:303-313. [PMID: 36308694 PMCID: PMC9623451 DOI: 10.1007/978-1-0716-1967-4_13] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/16/2023]
Abstract
The rapid and accurate quantification of peptides is a critical element of modern proteomics that has become increasingly challenging as proteomic data sets grow in size and complexity. We present here FlashLFQ, a computer program for high-speed label-free quantification of peptides and proteins following a search of bottom-up mass spectrometry data. FlashLFQ is approximately an order of magnitude faster than established label-free quantification methods and can quantify data-dependent analysis (DDA) search results from any proteomics search program. It is available as a graphical user interface program, a command line tool, a Docker image, and integrated into the MetaMorpheus search software.
Collapse
Affiliation(s)
| | | | - Mark Scalf
- Department of Chemistry, University of Wisconsin, Madison, WI, USA
| | - Lloyd M Smith
- Department of Chemistry, University of Wisconsin, Madison, WI, USA.
| |
Collapse
|
35
|
Patterson A, White A, Waymire E, Fleck S, Golden S, Wilkinson RA, Wiedenheft B, Bothner B. Anti-CRISPR proteins function through thermodynamic tuning and allosteric regulation of CRISPR RNA-guided surveillance complex. Nucleic Acids Res 2022; 50:11243-11254. [PMID: 36215034 DOI: 10.1093/nar/gkac841] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2022] [Revised: 09/14/2022] [Accepted: 10/07/2022] [Indexed: 11/13/2022] Open
Abstract
CRISPR RNA-guided detection and degradation of foreign DNA is a dynamic process. Viruses can interfere with this cellular defense by expressing small proteins called anti-CRISPRs. While structural models of anti-CRISPRs bound to their target complex provide static snapshots that inform mechanism, the dynamics and thermodynamics of these interactions are often overlooked. Here, we use hydrogen deuterium exchange-mass spectrometry (HDX-MS) and differential scanning fluorimetry (DSF) experiments to determine how anti-CRISPR binding impacts the conformational landscape of the type IF CRISPR RNA guided surveillance complex (Csy) upon binding of two different anti-CRISPR proteins (AcrIF9 and AcrIF2). The results demonstrate that AcrIF2 binding relies on enthalpic stabilization, whereas AcrIF9 uses an entropy driven reaction to bind the CRISPR RNA-guided surveillance complex. Collectively, this work reveals the thermodynamic basis and mechanistic versatility of anti-CRISPR-mediated immune suppression. More broadly, this work presents a striking example of how allosteric effectors are employed to regulate nucleoprotein complexes.
Collapse
Affiliation(s)
- Angela Patterson
- Chemistry and Biochemistry Department, Montana State University, Bozeman, MT 59717, USA
| | - Aidan White
- Chemistry and Biochemistry Department, Montana State University, Bozeman, MT 59717, USA
| | - Elizabeth Waymire
- Chemistry and Biochemistry Department, Montana State University, Bozeman, MT 59717, USA
| | - Sophie Fleck
- Chemistry and Biochemistry Department, Montana State University, Bozeman, MT 59717, USA
| | - Sarah Golden
- Microbiology and Cell Biology Department, Montana State University, Bozeman, MT 59717, USA
| | - Royce A Wilkinson
- Microbiology and Cell Biology Department, Montana State University, Bozeman, MT 59717, USA
| | - Blake Wiedenheft
- Microbiology and Cell Biology Department, Montana State University, Bozeman, MT 59717, USA
| | - Brian Bothner
- Chemistry and Biochemistry Department, Montana State University, Bozeman, MT 59717, USA
| |
Collapse
|
36
|
Mehta S, Carvalho VM, Rajczewski AT, Pible O, Grüning BA, Johnson JE, Wagner R, Armengaud J, Griffin TJ, Jagtap PD. Catching the Wave: Detecting Strain-Specific SARS-CoV-2 Peptides in Clinical Samples Collected during Infection Waves from Diverse Geographical Locations. Viruses 2022; 14:2205. [PMID: 36298760 PMCID: PMC9609567 DOI: 10.3390/v14102205] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2022] [Revised: 10/04/2022] [Accepted: 10/05/2022] [Indexed: 11/05/2022] Open
Abstract
The Coronavirus disease 2019 (COVID-19) pandemic caused by the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) resulted in a major health crisis worldwide with its continuously emerging new strains, resulting in new viral variants that drive "waves" of infection. PCR or antigen detection assays have been routinely used to detect clinical infections; however, the emergence of these newer strains has presented challenges in detection. One of the alternatives has been to detect and characterize variant-specific peptide sequences from viral proteins using mass spectrometry (MS)-based methods. MS methods can potentially help in both diagnostics and vaccine development by understanding the dynamic changes in the viral proteome associated with specific strains and infection waves. In this study, we developed an accessible, flexible, and shareable bioinformatics workflow that was implemented in the Galaxy Platform to detect variant-specific peptide sequences from MS data derived from the clinical samples. We demonstrated the utility of the workflow by characterizing published clinical data from across the world during various pandemic waves. Our analysis identified six SARS-CoV-2 variant-specific peptides suitable for confident detection by MS in commonly collected clinical samples.
Collapse
Affiliation(s)
- Subina Mehta
- Department of Biochemistry, Molecular Biology, and Biophysics, University of Minnesota, Minneapolis, MN 55455, USA
| | | | - Andrew T. Rajczewski
- Department of Biochemistry, Molecular Biology, and Biophysics, University of Minnesota, Minneapolis, MN 55455, USA
| | - Olivier Pible
- Département Médicaments et Technologies pour la Santé (DMTS), Université Paris-Saclay, CEA, INRAE, 30200 Bagnols-sur-Cèze, France
| | - Björn A. Grüning
- Department of Computer Science, University of Freiburg, 79110 Freiburg, Germany
| | - James E. Johnson
- Minnesota Supercomputing Institute, University of Minnesota, Minneapolis, MN 55455, USA
| | - Reid Wagner
- Minnesota Supercomputing Institute, University of Minnesota, Minneapolis, MN 55455, USA
| | - Jean Armengaud
- Département Médicaments et Technologies pour la Santé (DMTS), Université Paris-Saclay, CEA, INRAE, 30200 Bagnols-sur-Cèze, France
| | - Timothy J. Griffin
- Department of Biochemistry, Molecular Biology, and Biophysics, University of Minnesota, Minneapolis, MN 55455, USA
| | - Pratik D. Jagtap
- Department of Biochemistry, Molecular Biology, and Biophysics, University of Minnesota, Minneapolis, MN 55455, USA
| |
Collapse
|
37
|
Kruger A, Vlok M, Turner S, Venter C, Laubscher GJ, Kell DB, Pretorius E. Proteomics of fibrin amyloid microclots in long COVID/post-acute sequelae of COVID-19 (PASC) shows many entrapped pro-inflammatory molecules that may also contribute to a failed fibrinolytic system. Cardiovasc Diabetol 2022; 21:190. [PMID: 36131342 PMCID: PMC9491257 DOI: 10.1186/s12933-022-01623-4] [Citation(s) in RCA: 57] [Impact Index Per Article: 28.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 07/28/2022] [Accepted: 09/07/2022] [Indexed: 02/07/2023] Open
Abstract
BACKGROUND Post-acute sequelae of COVID-19 (PASC), also now known as long COVID, has become a major global health and economic burden. Previously, we provided evidence that there is a significant insoluble fibrin amyloid microclot load in the circulation of individuals with long COVID, and that these microclots entrap a substantial number of inflammatory molecules, including those that might prevent clot breakdown. Scientifically, the most challenging aspect of this debilitating condition is that traditional pathology tests such as a serum CRP (C-reactive protein) may not show any significant abnormal inflammatory markers, albeit these tests measure only the soluble inflammatory molecules. Elevated, or abnormal soluble biomarkers such as IL-6, D-Dimer or fibrinogen indicate an increased risk for thrombosis or a host immune response in COVID-19. The absence of biomarkers in standard pathology tests, result in a significant amount of confusion for patients and clinicians, as patients are extremely sick or even bed-ridden but with no regular identifiable reason for their disease. Biomarkers that are currently available cannot detect the molecules present in the microclots we identified and are therefore unable to confirm their presence or the mechanisms that drive their formation. METHODS Here we analysed the protein content of double-digested microclots of 99 long COVID patients and 29 healthy controls. The patients suffering from long COVID reported their symptoms through a questionnaire completed by themselves or their attending physician. RESULTS Our long COVID cohort's symptoms were found to be in line with global findings, where the most prevalent symptoms were constant fatigue (74%,) cognitive impairment (71%) and depression and anxiety (30%). Our most noteworthy findings were a reduced level of plasma Kallikrein compared to our controls, an increased level of platelet factor 4 (PF4) von Willebrand factor (VWF), and a marginally increased level of α-2 antiplasmin (α-2-AP). We also found a significant presence of antibodies entrapped inside these microclots. CONCLUSION Our results confirm the presence of pro-inflammatory molecules that may also contribute to a failed fibrinolysis phenomenon, which could possibly explain why individuals with long COVID suffer from chronic fatigue, dyspnoea, or cognitive impairment. In addition, significant platelet hyperactivation was noted. Hyperactivation will result in the granular content of platelets being shed into the circulation, including PF4. Overall, our results provide further evidence of both a failed fibrinolytic system in long COVID/PASC and the entrapment of many proteins whose presence might otherwise go unrecorded. These findings might have significant implications for individuals with pre-existing comorbidities, including cardiovascular disease and type 2 diabetes.
Collapse
Affiliation(s)
- Arneaux Kruger
- Department of Physiological Sciences, Faculty of Science, Stellenbosch University, Private Bag X1 Matieland, Stellenbosch, 7602, South Africa
| | - Mare Vlok
- Central Analytical Facility, Mass Spectrometry Stellenbosch University, Tygerberg Campus, Room 6054, Clinical Building, Francie Van Zijl Drive, Tygerberg, Cape Town, 7505, South Africa
| | - Simone Turner
- Department of Physiological Sciences, Faculty of Science, Stellenbosch University, Private Bag X1 Matieland, Stellenbosch, 7602, South Africa
| | - Chantelle Venter
- Department of Physiological Sciences, Faculty of Science, Stellenbosch University, Private Bag X1 Matieland, Stellenbosch, 7602, South Africa
| | | | - Douglas B Kell
- Department of Physiological Sciences, Faculty of Science, Stellenbosch University, Private Bag X1 Matieland, Stellenbosch, 7602, South Africa.
- Department of Biochemistry and Systems Biology, Institute of Systems, Molecular and Integrative Biology, Faculty of Health and Life Sciences, University of Liverpool, Liverpool, L69 7ZB, UK.
- The Novo Nordisk Foundation Centre for Biosustainability, Technical University of Denmark, Kemitorvet 200, 2800, Kongens Lyngby, Denmark.
| | - Etheresia Pretorius
- Department of Physiological Sciences, Faculty of Science, Stellenbosch University, Private Bag X1 Matieland, Stellenbosch, 7602, South Africa.
- Department of Biochemistry and Systems Biology, Institute of Systems, Molecular and Integrative Biology, Faculty of Health and Life Sciences, University of Liverpool, Liverpool, L69 7ZB, UK.
| |
Collapse
|
38
|
de Nies L, Busi SB, Kunath BJ, May P, Wilmes P. Mobilome-driven segregation of the resistome in biological wastewater treatment. eLife 2022; 11:81196. [PMID: 36111782 PMCID: PMC9643006 DOI: 10.7554/elife.81196] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2022] [Accepted: 09/15/2022] [Indexed: 12/05/2022] Open
Abstract
Biological wastewater treatment plants (BWWTP) are considered to be hotspots for the evolution and subsequent spread of antimicrobial resistance (AMR). Mobile genetic elements (MGEs) promote the mobilization and dissemination of antimicrobial resistance genes (ARGs) and are thereby critical mediators of AMR within the BWWTP microbial community. At present, it is unclear whether specific AMR categories are differentially disseminated via bacteriophages (phages) or plasmids. To understand the segregation of AMR in relation to MGEs, we analyzed meta-omic (metagenomic, metatranscriptomic and metaproteomic) data systematically collected over 1.5 years from a BWWTP. Our results showed a core group of 15 AMR categories which were found across all timepoints. Some of these AMR categories were disseminated exclusively (bacitracin) or primarily (aminoglycoside, MLS and sulfonamide) via plasmids or phages (fosfomycin and peptide), whereas others were disseminated equally by both. Combined and timepoint-specific analyses of gene, transcript and protein abundances further demonstrated that aminoglycoside, bacitracin and sulfonamide resistance genes were expressed more by plasmids, in contrast to fosfomycin and peptide AMR expression by phages, thereby validating our genomic findings. In the analyzed communities, the dominant taxon Candidatus Microthrix parvicella was a major contributor to several AMR categories whereby its plasmids primarily mediated aminoglycoside resistance. Importantly, we also found AMR associated with ESKAPEE pathogens within the BWWTP, and here MGEs also contributed differentially to the dissemination of the corresponding ARGs. Collectively our findings pave the way toward understanding the segmentation of AMR within MGEs, thereby shedding new light on resistome populations and their mediators, essential elements that are of immediate relevance to human health.
Collapse
Affiliation(s)
- Laura de Nies
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg
| | | | | | - Patrick May
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg
| | - Paul Wilmes
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg
| |
Collapse
|
39
|
The TOG protein Stu2 is regulated by acetylation. PLoS Genet 2022; 18:e1010358. [PMID: 36084134 PMCID: PMC9491610 DOI: 10.1371/journal.pgen.1010358] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2022] [Revised: 09/21/2022] [Accepted: 07/27/2022] [Indexed: 11/27/2022] Open
Abstract
Stu2 in S. cerevisiae is a member of the XMAP215/Dis1/CKAP5/ch-TOG family of MAPs and has multiple functions in controlling microtubules, including microtubule polymerization, microtubule depolymerization, linking chromosomes to the kinetochore, and assembly of γ-TuSCs at the SPB. Whereas phosphorylation has been shown to be critical for Stu2 localization at the kinetochore, other regulatory mechanisms that control Stu2 function are still poorly understood. Here, we show that a novel form of Stu2 regulation occurs through the acetylation of three lysine residues at K252, K469, and K870, which are located in three distinct domains of Stu2. Alteration of acetylation through acetyl-mimetic and acetyl-blocking mutations did not impact the essential function of Stu2. Instead, these mutations lead to a decrease in chromosome stability, as well as changes in resistance to the microtubule depolymerization drug, benomyl. In agreement with our in silico modeling, several acetylation-mimetic mutants displayed increased interactions with γ-tubulin. Taken together, these data suggest that Stu2 acetylation can govern multiple Stu2 functions, including chromosome stability and interactions at the SPB. Microtubules are proteinaceous polymers that play several important roles in cell division and segregation of the genetic material to each daughter cell. The functions of microtubules are critically dependent upon their dynamic properties in which tubulin subunits are added or removed from the microtubule end, allowing microtubules to grow or shorten in length. These dynamic properties are controlled by several types of microtubule associated proteins. In this study using bakers yeast, we describe our discovery of a previously unappreciated way to regulate the microtubule associated protein Stu2 by a modification called acetylation. When we created mutations in the Stu2 protein that can’t be properly acetylated, the cell lost some of its chromosomes. Some of these mutations actually caused the microtubules to be resistant to drugs that normally disassemble the microtubule polymer. As similar versions of the Stu2 protein are found in diverse organisms that range from yeast and fungus, to plants, insects, mammals and humans, our work could provide unique insights into how microtubules malfunction in some human diseases. With further studies, this may provide a new understanding of chromosome loss in birth defects and/or cancer.
Collapse
|
40
|
Molecular and in vivo studies of a glutamate-class prolyl-endopeptidase for coeliac disease therapy. Nat Commun 2022; 13:4446. [PMID: 35915115 PMCID: PMC9343461 DOI: 10.1038/s41467-022-32215-1] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2022] [Accepted: 07/21/2022] [Indexed: 11/25/2022] Open
Abstract
The digestion of gluten generates toxic peptides, among which a highly immunogenic proline-rich 33-mer from wheat α-gliadin, that trigger coeliac disease. Neprosin from the pitcher plant is a reported prolyl endopeptidase. Here, we produce recombinant neprosin and its mutants, and find that full-length neprosin is a zymogen, which is self-activated at gastric pH by the release of an all-β pro-domain via a pH-switch mechanism featuring a lysine plug. The catalytic domain is an atypical 7+8-stranded β-sandwich with an extended active-site cleft containing an unprecedented pair of catalytic glutamates. Neprosin efficiently degrades both gliadin and the 33-mer in vitro under gastric conditions and is reversibly inactivated at pH > 5. Moreover, co-administration of gliadin and the neprosin zymogen at the ratio 500:1 reduces the abundance of the 33-mer in the small intestine of mice by up to 90%. Neprosin therefore founds a family of eukaryotic glutamate endopeptidases that fulfils requisites for a therapeutic glutenase. Celiac disease is characterized by intolerance to gluten, a cereal protein. Here, the authors show that neprosin, a glutamate peptidase from the pitcher plant, efficiently cleaves gluten components under physiological conditions in vitro and in the gut of mice.
Collapse
|
41
|
Perez-Riverol Y. Proteomic repository data submission, dissemination, and reuse: key messages. Expert Rev Proteomics 2022; 19:297-310. [PMID: 36529941 PMCID: PMC7614296 DOI: 10.1080/14789450.2022.2160324] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2022] [Accepted: 12/07/2022] [Indexed: 12/23/2022]
Abstract
INTRODUCTION The creation of ProteomeXchange data workflows in 2012 transformed the field of proteomics, consisting of the standardization of data submission and dissemination and enabling the widespread reanalysis of public MS proteomics data worldwide. ProteomeXchange has triggered a growing trend toward public dissemination of proteomics data, facilitating the assessment, reuse, comparative analyses, and extraction of new findings from public datasets. By 2022, the consortium is integrated by PRIDE, PeptideAtlas, MassIVE, jPOST, iProX, and Panorama Public. AREAS COVERED Here, we review and discuss the current ecosystem of resources, guidelines, and file formats for proteomics data dissemination and reanalysis. Special attention is drawn to new exciting quantitative and post-translational modification-oriented resources. The challenges and future directions on data depositions including the lack of metadata and cloud-based and high-performance software solutions for fast and reproducible reanalysis of the available data are discussed. EXPERT OPINION The success of ProteomeXchange and the amount of proteomics data available in the public domain have triggered the creation and/or growth of other protein knowledgebase resources. Data reuse is a leading, active, and evolving field; supporting the creation of new formats, tools, and workflows to rediscover and reshape the public proteomics data.
Collapse
Affiliation(s)
- Yasset Perez-Riverol
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, Cambridge, UK
| |
Collapse
|
42
|
Ferrari V, Stroobant V, Abi Habib J, Naulaerts S, Van den Eynde BJ, Vigneron N. New Insights into the Mechanisms of Proteasome-Mediated Peptide Splicing Learned from Comparing Splicing Efficiency by Different Proteasome Subtypes. JOURNAL OF IMMUNOLOGY (BALTIMORE, MD. : 1950) 2022; 208:2817-2828. [PMID: 35688464 DOI: 10.4049/jimmunol.2101198] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/22/2021] [Accepted: 04/03/2022] [Indexed: 06/15/2023]
Abstract
By tying peptide fragments originally distant in parental proteins, the proteasome can generate spliced peptides that are recognized by CTL. This occurs by transpeptidation involving a peptide-acyl-enzyme intermediate and another peptide fragment present in the catalytic chamber. Four main subtypes of proteasomes exist: the standard proteasome (SP), the immunoproteasome, and intermediate proteasomes β1-β2-β5i (single intermediate proteasome) and β1i-β2-β5i (double intermediate proteasome). In this study, we use a tandem mass tag-quantification approach to study the production of six spliced human antigenic peptides by the four proteasome subtypes. Peptides fibroblast growth factor-5172-176/217-220, tyrosinase368-373/336-340, and gp10040-42/47-52 are better produced by the SP than the other proteasome subtypes. The peptides SP110296-301/286-289, gp100195-202/191or192, and gp10047-52/40-42 are better produced by the immunoproteasome and double intermediate proteasome. The current model of proteasome-catalyzed peptide splicing suggests that the production of a spliced peptide depends on the abundance of the peptide splicing partners. Surprisingly, we found that despite the fact that reciprocal peptides RTK_QLYPEW (gp10040-42/47-52) and QLYPEW_RTK (gp10047-52/40-42) are composed of identical splicing partners, their production varies differently according to the proteasome subtype. These differences were maintained after in vitro digestions involving identical amounts of the splicing fragments. Our results indicate that the amount of splicing partner is not the only factor driving peptide splicing and suggest that peptide splicing efficiency also relies on other factors, such as the affinity of the C-terminal splice reactant for the primed binding site of the catalytic subunit.
Collapse
Affiliation(s)
- Violette Ferrari
- Ludwig Institute for Cancer Research, Brussels, Belgium
- de Duve Institute, Université Catholique de Louvain, Brussels, Belgium
- Walloon Excellence in Lifesciences and Biotechnology, Brussels, Belgium; and
| | - Vincent Stroobant
- Ludwig Institute for Cancer Research, Brussels, Belgium
- de Duve Institute, Université Catholique de Louvain, Brussels, Belgium
- Walloon Excellence in Lifesciences and Biotechnology, Brussels, Belgium; and
| | - Joanna Abi Habib
- Ludwig Institute for Cancer Research, Brussels, Belgium
- de Duve Institute, Université Catholique de Louvain, Brussels, Belgium
- Walloon Excellence in Lifesciences and Biotechnology, Brussels, Belgium; and
| | - Stefan Naulaerts
- Ludwig Institute for Cancer Research, Brussels, Belgium
- de Duve Institute, Université Catholique de Louvain, Brussels, Belgium
- Walloon Excellence in Lifesciences and Biotechnology, Brussels, Belgium; and
| | - Benoit J Van den Eynde
- Ludwig Institute for Cancer Research, Brussels, Belgium;
- de Duve Institute, Université Catholique de Louvain, Brussels, Belgium
- Walloon Excellence in Lifesciences and Biotechnology, Brussels, Belgium; and
- Nuffield Department of Medicine, Ludwig Institute for Cancer Research, University of Oxford, Oxford, United Kingdom
| | - Nathalie Vigneron
- Ludwig Institute for Cancer Research, Brussels, Belgium;
- de Duve Institute, Université Catholique de Louvain, Brussels, Belgium
- Walloon Excellence in Lifesciences and Biotechnology, Brussels, Belgium; and
| |
Collapse
|
43
|
Mariasina SS, Chang CF, Navalayeu TL, Chugunova AA, Efimov SV, Zgoda VG, Ivlev VA, Dontsova OA, Sergiev PV, Polshakov VI. Williams-Beuren Syndrome Related Methyltransferase WBSCR27: From Structure to Possible Function. Front Mol Biosci 2022; 9:865743. [PMID: 35782865 PMCID: PMC9240639 DOI: 10.3389/fmolb.2022.865743] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2022] [Accepted: 05/16/2022] [Indexed: 11/17/2022] Open
Abstract
Williams-Beuren syndrome (WBS) is a genetic disorder associated with the hemizygous deletion of several genes in chromosome 7, encoding 26 proteins. Malfunction of these proteins induce multisystemic failure in an organism. While biological functions of most proteins are more or less established, the one of methyltransferase WBSCR27 remains elusive. To find the substrate of methylation catalyzed by WBSCR27 we constructed mouse cell lines with a Wbscr27 gene knockout and studied the obtained cells using several molecular biology and mass spectrometry techniques. We attempted to pinpoint the methylation target among the RNAs and proteins, but in all cases neither a direct substrate has been identified nor the protein partners have been detected. To reveal the nature of the putative methylation substrate we determined the solution structure and studied the conformational dynamic properties of WBSCR27 in apo state and in complex with S-adenosyl-L-homocysteine (SAH). The protein core was found to form a canonical Rossman fold common for Class I methyltransferases. N-terminus of the protein and the β6–β7 loop were disordered in apo-form, but binding of SAH induced the transition of these fragments to a well-formed substrate binding site. Analyzing the structure of this binding site allows us to suggest potential substrates of WBSCR27 methylation to be probed in further research.
Collapse
Affiliation(s)
- Sofia S. Mariasina
- Faculty of Fundamental Medicine, M.V. Lomonosov Moscow State University, Moscow, Russia
- Institute of Functional Genomics, M.V. Lomonosov Moscow State University, Moscow, Russia
| | - Chi-Fon Chang
- Genomics Research Center, Academia Sinica, Taipei, Taiwan
| | | | | | - Sergey V. Efimov
- NMR Laboratory, Institute of Physics, Kazan Federal University, Kazan, Russia
| | | | | | - Olga A. Dontsova
- Chemical Department, M.V. Lomonosov Moscow State University, Moscow, Russia
- Skolkovo Institute of Science and Technology, Moscow, Russia
| | - Petr V. Sergiev
- Institute of Functional Genomics, M.V. Lomonosov Moscow State University, Moscow, Russia
- Chemical Department, M.V. Lomonosov Moscow State University, Moscow, Russia
- Skolkovo Institute of Science and Technology, Moscow, Russia
| | - Vladimir I. Polshakov
- Faculty of Fundamental Medicine, M.V. Lomonosov Moscow State University, Moscow, Russia
- *Correspondence: Vladimir I. Polshakov,
| |
Collapse
|
44
|
Aggarwal S, Raj A, Kumar D, Dash D, Yadav AK. False discovery rate: the Achilles' heel of proteogenomics. Brief Bioinform 2022; 23:6582880. [PMID: 35534181 DOI: 10.1093/bib/bbac163] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2021] [Revised: 03/14/2022] [Accepted: 04/12/2022] [Indexed: 12/25/2022] Open
Abstract
Proteogenomics refers to the integrated analysis of the genome and proteome that leverages mass-spectrometry (MS)-based proteomics data to improve genome annotations, understand gene expression control through proteoforms and find sequence variants to develop novel insights for disease classification and therapeutic strategies. However, proteogenomic studies often suffer from reduced sensitivity and specificity due to inflated database size. To control the error rates, proteogenomics depends on the target-decoy search strategy, the de-facto method for false discovery rate (FDR) estimation in proteomics. The proteogenomic databases constructed from three- or six-frame nucleotide database translation not only increase the search space and compute-time but also violate the equivalence of target and decoy databases. These searches result in poorer separation between target and decoy scores, leading to stringent FDR thresholds. Understanding these factors and applying modified strategies such as two-pass database search or peptide-class-specific FDR can result in a better interpretation of MS data without introducing additional statistical biases. Based on these considerations, a user can interpret the proteogenomics results appropriately and control false positives and negatives in a more informed manner. In this review, first, we briefly discuss the proteogenomic workflows and limitations in database construction, followed by various considerations that can influence potential novel discoveries in a proteogenomic study. We conclude with suggestions to counter these challenges for better proteogenomic data interpretation.
Collapse
Affiliation(s)
- Suruchi Aggarwal
- Translational Health Science and Technology Institute, NCR Biotech Science Cluster, 3rd milestone, PO Box No. 04, Faridabad-Gurgaon Expressway, Faridabad-121001, Haryana, India
| | - Anurag Raj
- GN Ramachandran Knowledge Centre for Genome Informatics, CSIR-Institute of Genomics & Integrative Biology, South Campus, Mathura Road, New Delhi 110025, India.,Academy of Scientific and Innovative Research (AcSIR), Ghaziabad-201002, India
| | - Dhirendra Kumar
- GN Ramachandran Knowledge Centre for Genome Informatics, CSIR-Institute of Genomics & Integrative Biology, South Campus, Mathura Road, New Delhi 110025, India
| | - Debasis Dash
- GN Ramachandran Knowledge Centre for Genome Informatics, CSIR-Institute of Genomics & Integrative Biology, South Campus, Mathura Road, New Delhi 110025, India.,Academy of Scientific and Innovative Research (AcSIR), Ghaziabad-201002, India
| | - Amit Kumar Yadav
- Translational Health Science and Technology Institute, NCR Biotech Science Cluster, 3rd milestone, PO Box No. 04, Faridabad-Gurgaon Expressway, Faridabad-121001, Haryana, India
| |
Collapse
|
45
|
Pinter N, Glätzer D, Fahrner M, Fröhlich K, Johnson J, Grüning BA, Warscheid B, Drepper F, Schilling O, Föll MC. MaxQuant and MSstats in Galaxy Enable Reproducible Cloud-Based Analysis of Quantitative Proteomics Experiments for Everyone. J Proteome Res 2022; 21:1558-1565. [PMID: 35503992 DOI: 10.1021/acs.jproteome.2c00051] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Quantitative mass spectrometry-based proteomics has become a high-throughput technology for the identification and quantification of thousands of proteins in complex biological samples. Two frequently used tools, MaxQuant and MSstats, allow for the analysis of raw data and finding proteins with differential abundance between conditions of interest. To enable accessible and reproducible quantitative proteomics analyses in a cloud environment, we have integrated MaxQuant (including TMTpro 16/18plex), Proteomics Quality Control (PTXQC), MSstats, and MSstatsTMT into the open-source Galaxy framework. This enables the web-based analysis of label-free and isobaric labeling proteomics experiments via Galaxy's graphical user interface on public clouds. MaxQuant and MSstats in Galaxy can be applied in conjunction with thousands of existing Galaxy tools and integrated into standardized, sharable workflows. Galaxy tracks all metadata and intermediate results in analysis histories, which can be shared privately for collaborations or publicly, allowing full reproducibility and transparency of published analysis. To further increase accessibility, we provide detailed hands-on training materials. The integration of MaxQuant and MSstats into the Galaxy framework enables their usage in a reproducible way on accessible large computational infrastructures, hence realizing the foundation for high-throughput proteomics data science for everyone.
Collapse
Affiliation(s)
- Niko Pinter
- Institute for Surgical Pathology, Medical Center, University of Freiburg, 79106 Freiburg, Germany.,Faculty of Medicine, University of Freiburg, 79110 Freiburg, Germany
| | - Damian Glätzer
- Biochemistry and Functional Proteomics, Institute of Biology II, Faculty of Biology, University of Freiburg, 79104 Freiburg, Germany
| | - Matthias Fahrner
- Institute for Surgical Pathology, Medical Center, University of Freiburg, 79106 Freiburg, Germany.,Faculty of Medicine, University of Freiburg, 79110 Freiburg, Germany.,Faculty of Biology, University of Freiburg, 79104 Freiburg, Germany
| | - Klemens Fröhlich
- Institute for Surgical Pathology, Medical Center, University of Freiburg, 79106 Freiburg, Germany.,Faculty of Medicine, University of Freiburg, 79110 Freiburg, Germany.,Faculty of Biology, University of Freiburg, 79104 Freiburg, Germany.,Spemann Graduate School of Biology and Medicine (SGBM), Albert-Ludwigs-University Freiburg, 79104 Freiburg, Germany
| | - James Johnson
- Minnesota Supercomputing Institute, University of Minnesota, Minneapolis, Minnesota 55455, United States
| | | | - Bettina Warscheid
- Biochemistry and Functional Proteomics, Institute of Biology II, Faculty of Biology, University of Freiburg, 79104 Freiburg, Germany.,Faculty of Chemistry and Pharmacy, Department of Biochemistry, Julius Maximilian University of Würzburg, 97074 Würzburg, Germany
| | - Friedel Drepper
- Biochemistry and Functional Proteomics, Institute of Biology II, Faculty of Biology, University of Freiburg, 79104 Freiburg, Germany
| | - Oliver Schilling
- Institute for Surgical Pathology, Medical Center, University of Freiburg, 79106 Freiburg, Germany.,Faculty of Medicine, University of Freiburg, 79110 Freiburg, Germany.,German Cancer Consortium (DKTK) and Cancer Research Center (DKFZ), 79106 Freiburg, Germany
| | - Melanie Christine Föll
- Institute for Surgical Pathology, Medical Center, University of Freiburg, 79106 Freiburg, Germany.,Faculty of Medicine, University of Freiburg, 79110 Freiburg, Germany.,Khoury College of Computer Sciences, Northeastern University, Boston, Massachusetts 02115, United States
| |
Collapse
|
46
|
Shiferaw GA, Gabriels R, Bouwmeester R, Van Den Bossche T, Vandermarliere E, Martens L, Volders PJ. Sensitive and Specific Spectral Library Searching with CompOmics Spectral Library Searching Tool and Percolator. J Proteome Res 2022; 21:1365-1370. [PMID: 35446579 DOI: 10.1021/acs.jproteome.2c00075] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Maintaining high sensitivity while limiting false positives is a key challenge in peptide identification from mass spectrometry data. Here, we investigate the effects of integrating the machine learning-based postprocessor Percolator into our spectral library searching tool COSS (CompOmics Spectral library Searching tool). To evaluate the effects of this postprocessing, we have used 40 data sets from 2 different projects and have searched these against the NIST and MassIVE spectral libraries. The searching is carried out using 2 spectral library search tools, COSS and MSPepSearch with and without Percolator postprocessing, and using sequence database search engine MS-GF+ as a baseline comparator. The addition of the Percolator rescoring step to COSS is effective and results in a substantial improvement in sensitivity and specificity of the identifications. COSS is freely available as open source under the permissive Apache2 license, and binaries and source code are found at https://github.com/compomics/COSS.
Collapse
Affiliation(s)
- Genet Abay Shiferaw
- VIB-UGent Center for Medical Biotechnology, VIB, 9000 Ghent, Belgium.,Department of Biomolecular Medicine, Ghent University, 9000 Ghent, Belgium
| | - Ralf Gabriels
- VIB-UGent Center for Medical Biotechnology, VIB, 9000 Ghent, Belgium.,Department of Biomolecular Medicine, Ghent University, 9000 Ghent, Belgium
| | - Robbin Bouwmeester
- VIB-UGent Center for Medical Biotechnology, VIB, 9000 Ghent, Belgium.,Department of Biomolecular Medicine, Ghent University, 9000 Ghent, Belgium
| | - Tim Van Den Bossche
- VIB-UGent Center for Medical Biotechnology, VIB, 9000 Ghent, Belgium.,Department of Biomolecular Medicine, Ghent University, 9000 Ghent, Belgium
| | - Elien Vandermarliere
- VIB-UGent Center for Medical Biotechnology, VIB, 9000 Ghent, Belgium.,Department of Biomolecular Medicine, Ghent University, 9000 Ghent, Belgium
| | - Lennart Martens
- VIB-UGent Center for Medical Biotechnology, VIB, 9000 Ghent, Belgium.,Department of Biomolecular Medicine, Ghent University, 9000 Ghent, Belgium
| | - Pieter-Jan Volders
- VIB-UGent Center for Medical Biotechnology, VIB, 9000 Ghent, Belgium.,Department of Biomolecular Medicine, Ghent University, 9000 Ghent, Belgium.,Cancer Research Institute Ghent, Ghent University, 9000 Ghent, Belgium
| |
Collapse
|
47
|
Hari PS, Balakrishnan L, Kotyada C, Everad John A, Tiwary S, Shah N, Sirdeshmukh R. Proteogenomic Analysis of Breast Cancer Transcriptomic and Proteomic Data, Using De Novo Transcript Assembly: Genome-Wide Identification of Novel Peptides and Clinical Implications. Mol Cell Proteomics 2022; 21:100220. [PMID: 35227895 PMCID: PMC9020135 DOI: 10.1016/j.mcpro.2022.100220] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2021] [Revised: 01/16/2022] [Accepted: 02/24/2022] [Indexed: 11/30/2022] Open
Abstract
We have carried out proteogenomic analysis of the breast cancer transcriptomic and proteomic data, available at The Clinical Proteomic Tumor Analysis Consortium resource, to identify novel peptides arising from alternatively spliced events as well as other noncanonical expressions. We used a pipeline that consisted of de novo transcript assembly, six frame-translated custom database, and a combination of search engines to identify novel peptides. A portfolio of 4,387 novel peptide sequences initially identified was further screened through PepQuery validation tool (Clinical Proteomic Tumor Analysis Consortium), which yielded 1,558 novel peptides. We considered the dataset of 1,558 validated through PepQuery to understand their functional and clinical significance, leaving the rest to be further verified using other validation tools and approaches. The novel peptides mapped to the known gene sequences as well as to genomic regions yet undefined for translation, 580 novel peptides mapped to known protein-coding genes, 147 to non–protein-coding genes, and 831 belonged to novel translational sequences. The novel peptides belonging to protein-coding genes represented alternatively spliced events or 5′ or 3′ extensions, whereas others represented translation from pseudogenes, long noncoding RNAs, or novel peptides originating from uncharacterized protein-coding sequences—mostly from the intronic regions of known genes. Seventy-six of the 580 protein-coding genes were associated with cancer hallmark genes, which included key oncogenes, transcription factors, kinases, and cell surface receptors. Survival association analysis of the 76 novel peptide sequences revealed 10 of them to be significant, and we present a panel of six novel peptides, whose high expression was found to be strongly associated with poor survival of patients with human epidermal growth factor receptor 2–enriched subtype. Our analysis represents a landscape of novel peptides of different types that may be expressed in breast cancer tissues, whereas their presence in full-length functional proteins needs further investigations. Novel protein variants and peptides from noncoding sequences are rapidly emerging. Mining of mass spectrometry data using proteogenomic analysis reveals such entities. Novel peptides from coding and noncoding sequences identified in breast cancer. Novel peptides mapped to cancer hallmark genes in breast cancer. Panel of novel peptides with prognostic potential found for HER2-enriched subtype.
Collapse
Affiliation(s)
- P S Hari
- Mazumdar Shaw Center for Translational Research, Narayana Health, Bangalore, India
| | - Lavanya Balakrishnan
- Mazumdar Shaw Center for Translational Research, Narayana Health, Bangalore, India
| | - Chaithanya Kotyada
- Mazumdar Shaw Center for Translational Research, Narayana Health, Bangalore, India
| | | | - Shivani Tiwary
- Simulation and Modeling Sciences, Pfizer Pharma GmBH, Berlin, Germany
| | - Nameeta Shah
- Mazumdar Shaw Center for Translational Research, Narayana Health, Bangalore, India.
| | - Ravi Sirdeshmukh
- Mazumdar Shaw Center for Translational Research, Narayana Health, Bangalore, India; Institute of Bioinformatics, International Tech Park, Bangalore, India; Health Sciences, Manipal Academy of Higher Education, Manipal, India.
| |
Collapse
|
48
|
The impact of noise and missing fragmentation cleavages on de novo peptide identification algorithms. Comput Struct Biotechnol J 2022; 20:1402-1412. [PMID: 35386104 PMCID: PMC8956878 DOI: 10.1016/j.csbj.2022.03.008] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2021] [Revised: 03/09/2022] [Accepted: 03/09/2022] [Indexed: 01/24/2023] Open
Abstract
Most correct de novo peptides have ⩽1 missing fragmentation cleavages. DeepNovo outperforms Novor for peptide accuracy for both data types. Novor excels at amino acid recall when many fragmentation cleavages are missing. Deep learning allows DeepNovo to predict amino acids without adjacent peaks.
Proteomics aims to characterise system-wide protein expression and typically relies on mass-spectrometry and peptide fragmentation, followed by a database search for protein identification. It has wide ranging applications from clinical to environmental settings and virtually impacts on every area of biology. In that context, de novo peptide sequencing is becoming increasingly popular. Historically its performance lagged behind database search methods but with the integration of machine learning, this field of research is gaining momentum. To enable de novo peptide sequencing to realise its full potential, it is critical to explore the mass spectrometry data underpinning peptide identification. In this research we investigate the characteristics of tandem mass spectra using 8 published datasets. We then evaluate two state of the art de novo peptide sequencing algorithms, Novor and DeepNovo, with a particular focus on their performance with regard to missing fragmentation cleavage sites and noise. DeepNovo was found to perform better than Novor overall. However, Novor recalled more correct amino acids when 6 or more cleavage sites were missing. Furthermore, less than 11% of each algorithms’ correct peptide predictions emanate from data with more than one missing cleavage site, highlighting the issues missing cleavages pose. We further investigate how the algorithms manage to correctly identify peptides with many of these missing fragmentation cleavages. We show how noise negatively impacts the performance of both algorithms, when high intensity peaks are considered. Finally, we provide recommendations regarding further algorithms’ improvements and offer potential avenues to overcome current inherent data limitations.
Collapse
|
49
|
Sun B, van Dissel D, Mo I, Boysen P, Haslene-Hox H, Lund H. Identification of novel biomarkers of inflammation in Atlantic salmon (Salmo salar L.) by a plasma proteomic approach. DEVELOPMENTAL AND COMPARATIVE IMMUNOLOGY 2022; 127:104268. [PMID: 34571096 DOI: 10.1016/j.dci.2021.104268] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/05/2021] [Revised: 09/22/2021] [Accepted: 09/22/2021] [Indexed: 06/13/2023]
Abstract
Monitoring fish welfare has become a central issue for the fast-growing aquaculture industry, and finding proper biomarkers of stress, inflammation and infection is necessary for surveillance and documentation of fish health. In this study, a proteomic approach using mass spectrometry was applied to identify indicators of the acute response in Atlantic salmon blood plasma by comparing Aeromonas salmonicida subsp. salmonicida infected fish and non-infected controls. The antimicrobial proteins cathelicidin (CATH), L-plastin (Plastin-2, LCP1) and soluble toll-like receptor 5 (sTLR5) were uniquely or mainly identified in the plasma of infected fish. In addition, five immune-related proteins showed significantly increased expression in plasma of infected fish: haptoglobin, high affinity immunoglobulin Fc gamma receptor I (FcγR1, CD64), leucine-rich alpha 2 glycoprotein (LRG1), complement C4 (C4) and phospholipase A2 inhibitor 31 kDa subunit-like protein. However, various fibrinogen components, CD209 and CD44 antigen-like molecules decreased in infected fish. Selected biomarkers were further verified by Western blot analysis of plasma and real time PCR of spleen and liver, including CATH1, CATH2 and L-plastin. A significant increase of L-plastin occurred as early as 24 h after infection, and a CATH2 increase was observed from 72 h in plasma of infected fish. Real time PCR of selected genes confirmed increased transcription of CATH1 and CATH2. In addition, serum amyloid A mRNA significantly increased in liver and spleen after bacterial infection. However, transcription of L-plastin was not consistently induced in liver and spleen. The results of the present study reveal novel and promising biomarkers of the acute phase response and inflammation in Atlantic salmon.
Collapse
Affiliation(s)
- Baojian Sun
- Faculty of Veterinary Medicine, Norwegian University of Life Sciences (NMBU), Ås, Norway
| | - Dino van Dissel
- SINTEF AS, Department of Biotechnology and Nanomedicine, Trondheim, Norway
| | - Ingrid Mo
- Faculty of Veterinary Medicine, Norwegian University of Life Sciences (NMBU), Ås, Norway
| | - Preben Boysen
- Faculty of Veterinary Medicine, Norwegian University of Life Sciences (NMBU), Ås, Norway
| | - Hanne Haslene-Hox
- SINTEF AS, Department of Biotechnology and Nanomedicine, Trondheim, Norway
| | - Hege Lund
- Faculty of Veterinary Medicine, Norwegian University of Life Sciences (NMBU), Ås, Norway.
| |
Collapse
|
50
|
Van Den Bossche T, Kunath BJ, Schallert K, Schäpe SS, Abraham PE, Armengaud J, Arntzen MØ, Bassignani A, Benndorf D, Fuchs S, Giannone RJ, Griffin TJ, Hagen LH, Halder R, Henry C, Hettich RL, Heyer R, Jagtap P, Jehmlich N, Jensen M, Juste C, Kleiner M, Langella O, Lehmann T, Leith E, May P, Mesuere B, Miotello G, Peters SL, Pible O, Queiros PT, Reichl U, Renard BY, Schiebenhoefer H, Sczyrba A, Tanca A, Trappe K, Trezzi JP, Uzzau S, Verschaffelt P, von Bergen M, Wilmes P, Wolf M, Martens L, Muth T. Critical Assessment of MetaProteome Investigation (CAMPI): a multi-laboratory comparison of established workflows. Nat Commun 2021; 12:7305. [PMID: 34911965 PMCID: PMC8674281 DOI: 10.1038/s41467-021-27542-8] [Citation(s) in RCA: 37] [Impact Index Per Article: 12.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2021] [Accepted: 11/24/2021] [Indexed: 12/17/2022] Open
Abstract
Metaproteomics has matured into a powerful tool to assess functional interactions in microbial communities. While many metaproteomic workflows are available, the impact of method choice on results remains unclear. Here, we carry out a community-driven, multi-laboratory comparison in metaproteomics: the critical assessment of metaproteome investigation study (CAMPI). Based on well-established workflows, we evaluate the effect of sample preparation, mass spectrometry, and bioinformatic analysis using two samples: a simplified, laboratory-assembled human intestinal model and a human fecal sample. We observe that variability at the peptide level is predominantly due to sample processing workflows, with a smaller contribution of bioinformatic pipelines. These peptide-level differences largely disappear at the protein group level. While differences are observed for predicted community composition, similar functional profiles are obtained across workflows. CAMPI demonstrates the robustness of present-day metaproteomics research, serves as a template for multi-laboratory studies in metaproteomics, and provides publicly available data sets for benchmarking future developments.
Collapse
Affiliation(s)
- Tim Van Den Bossche
- VIB - UGent Center for Medical Biotechnology, VIB, Ghent, Belgium
- Department of Biomolecular Medicine, Faculty of Medicine and Health Sciences, Ghent University, Ghent, Belgium
| | - Benoit J Kunath
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg
| | - Kay Schallert
- Bioprocess Engineering, Otto-von-Guericke University Magdeburg, Magdeburg, Germany
| | - Stephanie S Schäpe
- Department of Molecular Systems Biology, Helmholtz-Centre for Environmental Research - UFZ GmbH, Leipzig, Germany
| | - Paul E Abraham
- Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, TN, USA
| | - Jean Armengaud
- Département Médicaments et Technologies pour la Santé (DMTS), Université Paris Saclay, CEA, INRAE, SPI, 30200, Bagnols-sur-Cèze, France
| | - Magnus Ø Arntzen
- Faculty of Chemistry, Biotechnology and Food Science, Norwegian University of Life Sciences (NMBU), Ås, Norway
| | - Ariane Bassignani
- INRAE, AgroParisTech, Micalis Institute, Université Paris-Saclay, 78350, Jouy-en-Josas, France
| | - Dirk Benndorf
- Bioprocess Engineering, Otto-von-Guericke University Magdeburg, Magdeburg, Germany
- Microbiology, Department of Applied Biosciences and Process Technology, Anhalt University of Applied Sciences, Köthen, Germany
- Bioprocess Engineering, Max Planck Institute for Dynamics of Complex Technical Systems, Magdeburg, Germany
| | - Stephan Fuchs
- Bioinformatics Unit (MF1), Department for Methods Development and Research Infrastructure, Robert Koch Institute, Berlin, Germany
| | | | - Timothy J Griffin
- Department of Biochemistry Molecular Biology and Biophysics, University of Minnesota, Minneapolis, MN, USA
| | - Live H Hagen
- Faculty of Chemistry, Biotechnology and Food Science, Norwegian University of Life Sciences (NMBU), Ås, Norway
| | - Rashi Halder
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg
| | - Céline Henry
- INRAE, AgroParisTech, Micalis Institute, Université Paris-Saclay, 78350, Jouy-en-Josas, France
| | - Robert L Hettich
- Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, TN, USA
| | - Robert Heyer
- Bioprocess Engineering, Otto-von-Guericke University Magdeburg, Magdeburg, Germany
| | - Pratik Jagtap
- Department of Biochemistry Molecular Biology and Biophysics, University of Minnesota, Minneapolis, MN, USA
| | - Nico Jehmlich
- Department of Molecular Systems Biology, Helmholtz-Centre for Environmental Research - UFZ GmbH, Leipzig, Germany
| | - Marlene Jensen
- Department of Plant & Microbial Biology, North Carolina State University, Raleigh, USA
| | - Catherine Juste
- INRAE, AgroParisTech, Micalis Institute, Université Paris-Saclay, 78350, Jouy-en-Josas, France
| | - Manuel Kleiner
- Department of Plant & Microbial Biology, North Carolina State University, Raleigh, USA
| | - Olivier Langella
- Université Paris-Saclay, INRAE, CNRS, AgroParisTech, GQE - Le Moulon, 91190, Gif-sur-Yvette, France
| | - Theresa Lehmann
- Bioprocess Engineering, Otto-von-Guericke University Magdeburg, Magdeburg, Germany
| | - Emma Leith
- Department of Biochemistry Molecular Biology and Biophysics, University of Minnesota, Minneapolis, MN, USA
| | - Patrick May
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg
| | - Bart Mesuere
- VIB - UGent Center for Medical Biotechnology, VIB, Ghent, Belgium
- Department of Applied Mathematics, Computer Science and Statistics, Ghent University, Ghent, Belgium
| | - Guylaine Miotello
- Département Médicaments et Technologies pour la Santé (DMTS), Université Paris Saclay, CEA, INRAE, SPI, 30200, Bagnols-sur-Cèze, France
| | - Samantha L Peters
- Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, TN, USA
| | - Olivier Pible
- Département Médicaments et Technologies pour la Santé (DMTS), Université Paris Saclay, CEA, INRAE, SPI, 30200, Bagnols-sur-Cèze, France
| | - Pedro T Queiros
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg
| | - Udo Reichl
- Bioprocess Engineering, Otto-von-Guericke University Magdeburg, Magdeburg, Germany
- Bioprocess Engineering, Max Planck Institute for Dynamics of Complex Technical Systems, Magdeburg, Germany
| | - Bernhard Y Renard
- Bioinformatics Unit (MF1), Department for Methods Development and Research Infrastructure, Robert Koch Institute, Berlin, Germany
- Data Analytics and Computational Statistics, Hasso-Plattner-Institute, Faculty of Digital Engineering, University of Potsdam, Potsdam, Germany
| | - Henning Schiebenhoefer
- Bioinformatics Unit (MF1), Department for Methods Development and Research Infrastructure, Robert Koch Institute, Berlin, Germany
- Data Analytics and Computational Statistics, Hasso-Plattner-Institute, Faculty of Digital Engineering, University of Potsdam, Potsdam, Germany
| | | | - Alessandro Tanca
- Department of Biomedical Sciences, University of Sassari, Sassari, Italy
| | - Kathrin Trappe
- Bioinformatics Unit (MF1), Department for Methods Development and Research Infrastructure, Robert Koch Institute, Berlin, Germany
| | - Jean-Pierre Trezzi
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg
- Integrated Biobank of Luxembourg, Luxembourg Institute of Health, 1, rue Louis Rech, L-3555, Dudelange, Luxembourg
| | - Sergio Uzzau
- Department of Biomedical Sciences, University of Sassari, Sassari, Italy
| | - Pieter Verschaffelt
- VIB - UGent Center for Medical Biotechnology, VIB, Ghent, Belgium
- Department of Applied Mathematics, Computer Science and Statistics, Ghent University, Ghent, Belgium
| | - Martin von Bergen
- Department of Molecular Systems Biology, Helmholtz-Centre for Environmental Research - UFZ GmbH, Leipzig, Germany
| | - Paul Wilmes
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Esch-sur-Alzette, Luxembourg
- Department of Life Sciences and Medicine, Faculty of Science, Technology and Medicine, University of Luxembourg, 6 avenue du Swing, L-4367, Belvaux, Luxembourg
| | - Maximilian Wolf
- Bioprocess Engineering, Otto-von-Guericke University Magdeburg, Magdeburg, Germany
| | - Lennart Martens
- VIB - UGent Center for Medical Biotechnology, VIB, Ghent, Belgium.
- Department of Biomolecular Medicine, Faculty of Medicine and Health Sciences, Ghent University, Ghent, Belgium.
| | - Thilo Muth
- Section eScience (S.3), Federal Institute for Materials Research and Testing, Berlin, Germany
| |
Collapse
|