1
|
Sun Z, Ning Z, Cheng K, Duan H, Wu Q, Mayne J, Figeys D. MetaPep: A core peptide database for faster human gut metaproteomics database searches. Comput Struct Biotechnol J 2023; 21:4228-4237. [PMID: 37692080 PMCID: PMC10491838 DOI: 10.1016/j.csbj.2023.08.025] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2023] [Revised: 08/18/2023] [Accepted: 08/25/2023] [Indexed: 09/12/2023] Open
Abstract
Metaproteomics has increasingly been applied to study functional changes in the human gut microbiome. Peptide identification is an important step in metaproteomics research, with sequence database search (SDS) and spectral library search (SLS) as the two main methods to identify peptides. However, the large search space in metaproteomics studies causes significant challenges for both identification methods. Moreover, with the development of mass spectrometry, it is now feasible to perform metaproteomic projects involving 100-1000 individual microbiomes. These large-scale projects create a conundrum for searching large databases. In this study, we constructed MetaPep, a core peptide database (including both collections of peptide sequences and tandem MS spectra) greatly accelerating the peptide identifications. Raw files from fifteen metaproteomics projects were re-analyzed and the identified peptide-spectrum matches (PSMs) were used to construct the MetaPep database. The constructed MetaPep database achieved rapid and accurate identification of peptides for human gut metaproteomics. MetaPep has a large collection of peptides and spectra that have been identified in published human gut metaproteomics datasets. MetaPep database can be used as an important resource in the current stage of human gut metaproteomics research. This study showed the possibility of applying a core peptide database as a generic metaproteomics workflow. MetaPep could also be an important resource for future human gut metaproteomics research, such as DIA (data-independent acquisition) analysis.
Collapse
Affiliation(s)
- Zhongzhi Sun
- School of Pharmaceutical Sciences, Faculty of Medicine, University of Ottawa, Ottawa, ON K1H 8M5, Canada
- Department of Biochemistry, Microbiology and Immunology, Faculty of Medicine, University of Ottawa, Ottawa, ON K1H 8M5, Canada
| | - Zhibin Ning
- School of Pharmaceutical Sciences, Faculty of Medicine, University of Ottawa, Ottawa, ON K1H 8M5, Canada
| | - Kai Cheng
- School of Pharmaceutical Sciences, Faculty of Medicine, University of Ottawa, Ottawa, ON K1H 8M5, Canada
| | - Haonan Duan
- School of Pharmaceutical Sciences, Faculty of Medicine, University of Ottawa, Ottawa, ON K1H 8M5, Canada
- Department of Biochemistry, Microbiology and Immunology, Faculty of Medicine, University of Ottawa, Ottawa, ON K1H 8M5, Canada
| | - Qing Wu
- School of Pharmaceutical Sciences, Faculty of Medicine, University of Ottawa, Ottawa, ON K1H 8M5, Canada
- Department of Biochemistry, Microbiology and Immunology, Faculty of Medicine, University of Ottawa, Ottawa, ON K1H 8M5, Canada
| | - Janice Mayne
- School of Pharmaceutical Sciences, Faculty of Medicine, University of Ottawa, Ottawa, ON K1H 8M5, Canada
| | - Daniel Figeys
- School of Pharmaceutical Sciences, Faculty of Medicine, University of Ottawa, Ottawa, ON K1H 8M5, Canada
| |
Collapse
|
2
|
Desai H, Ofori S, Boatner L, Yu F, Villanueva M, Ung N, Nesvizhskii AI, Backus K. Multi-omic stratification of the missense variant cysteinome. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.08.12.553095. [PMID: 37645963 PMCID: PMC10461992 DOI: 10.1101/2023.08.12.553095] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/31/2023]
Abstract
Cancer genomes are rife with genetic variants; one key outcome of this variation is gain-ofcysteine, which is the most frequently acquired amino acid due to missense variants in COSMIC. Acquired cysteines are both driver mutations and sites targeted by precision therapies. However, despite their ubiquity, nearly all acquired cysteines remain uncharacterized. Here, we pair cysteine chemoproteomics-a technique that enables proteome-wide pinpointing of functional, redox sensitive, and potentially druggable residues-with genomics to reveal the hidden landscape of cysteine acquisition. For both cancer and healthy genomes, we find that cysteine acquisition is a ubiquitous consequence of genetic variation that is further elevated in the context of decreased DNA repair. Our chemoproteogenomics platform integrates chemoproteomic, whole exome, and RNA-seq data, with a customized 2-stage false discovery rate (FDR) error controlled proteomic search, further enhanced with a user-friendly FragPipe interface. Integration of CADD predictions of deleteriousness revealed marked enrichment for likely damaging variants that result in acquisition of cysteine. By deploying chemoproteogenomics across eleven cell lines, we identify 116 gain-of-cysteines, of which 10 were liganded by electrophilic druglike molecules. Reference cysteines proximal to missense variants were also found to be pervasive, 791 in total, supporting heretofore untapped opportunities for proteoform-specific chemical probe development campaigns. As chemoproteogenomics is further distinguished by sample-matched combinatorial variant databases and compatible with redox proteomics and small molecule screening, we expect widespread utility in guiding proteoform-specific biology and therapeutic discovery.
Collapse
Affiliation(s)
- Heta Desai
- Biological Chemistry Department, David Geffen School of Medicine, UCLA, Los Angeles, CA, 90095, USA
- Molecular Biology Institute, UCLA, Los Angeles, CA, 90095, USA
| | - Samuel Ofori
- Biological Chemistry Department, David Geffen School of Medicine, UCLA, Los Angeles, CA, 90095, USA
| | - Lisa Boatner
- Biological Chemistry Department, David Geffen School of Medicine, UCLA, Los Angeles, CA, 90095, USA
- Department of Chemistry and Biochemistry, UCLA, Los Angeles, CA, 90095, USA
| | - Fengchao Yu
- Department of Pathology, University of Michigan, Ann Arbor, MI, 48109, USA
| | - Miranda Villanueva
- Biological Chemistry Department, David Geffen School of Medicine, UCLA, Los Angeles, CA, 90095, USA
- Molecular Biology Institute, UCLA, Los Angeles, CA, 90095, USA
| | - Nicholas Ung
- Biological Chemistry Department, David Geffen School of Medicine, UCLA, Los Angeles, CA, 90095, USA
- Department of Chemistry and Biochemistry, UCLA, Los Angeles, CA, 90095, USA
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, 48109, USA
- Department of Pathology, University of Michigan, Ann Arbor, MI, 48109, USA
- Molecular Biology Institute, UCLA, Los Angeles, CA, 90095, USA
- DOE Institute for Genomics and Proteomics, UCLA, Los Angeles, CA, 90095, USA
- Jonsson Comprehensive Cancer Center, UCLA, Los Angeles, CA, 90095, USA
- Eli and Edythe Broad Center of Regenerative Medicine and Stem Cell Research, UCLA, Los Angeles, CA, 90095, USA
| | - Alexey I Nesvizhskii
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, 48109, USA
- Department of Pathology, University of Michigan, Ann Arbor, MI, 48109, USA
| | - Keriann Backus
- Biological Chemistry Department, David Geffen School of Medicine, UCLA, Los Angeles, CA, 90095, USA
- Department of Chemistry and Biochemistry, UCLA, Los Angeles, CA, 90095, USA
- Molecular Biology Institute, UCLA, Los Angeles, CA, 90095, USA
- DOE Institute for Genomics and Proteomics, UCLA, Los Angeles, CA, 90095, USA
- Jonsson Comprehensive Cancer Center, UCLA, Los Angeles, CA, 90095, USA
- Eli and Edythe Broad Center of Regenerative Medicine and Stem Cell Research, UCLA, Los Angeles, CA, 90095, USA
| |
Collapse
|
3
|
Zeng H, Huang J, Ren J, Wang CK, Tang Z, Zhou H, Zhou Y, Shi H, Aditham A, Sui X, Chen H, Lo JA, Wang X. Spatially resolved single-cell translatomics at molecular resolution. Science 2023; 380:eadd3067. [PMID: 37384709 PMCID: PMC11146668 DOI: 10.1126/science.add3067] [Citation(s) in RCA: 28] [Impact Index Per Article: 28.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2022] [Accepted: 05/07/2023] [Indexed: 07/01/2023]
Abstract
The precise control of messenger RNA (mRNA) translation is a crucial step in posttranscriptional gene regulation of cellular physiology. However, it remains a challenge to systematically study mRNA translation at the transcriptomic scale with spatial and single-cell resolution. Here, we report the development of ribosome-bound mRNA mapping (RIBOmap), a highly multiplexed three-dimensional in situ profiling method to detect cellular translatome. RIBOmap profiling of 981 genes in HeLa cells revealed cell cycle-dependent translational control and colocalized translation of functional gene modules. We mapped 5413 genes in mouse brain tissues, yielding spatially resolved single-cell translatomic profiles for 119,173 cells and revealing cell type-specific and brain region-specific translational regulation, including translation remodeling during oligodendrocyte maturation. Our method detected widespread patterns of localized translation in neuronal and glial cells in intact brain tissue networks.
Collapse
Affiliation(s)
- Hu Zeng
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
- Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Jiahao Huang
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
- Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Jingyi Ren
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
- Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | | | - Zefang Tang
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
- Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Haowen Zhou
- Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Yiming Zhou
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
- Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
- Center for Psychiatric Research, Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Hailing Shi
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
- Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Abhishek Aditham
- Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
- Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
| | - Xin Sui
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
- Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Hongyu Chen
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
- Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Jennifer A. Lo
- Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Xiao Wang
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
- Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| |
Collapse
|
4
|
Henfrey C, Murphy S, Tellier M. Regulation of mature mRNA levels by RNA processing efficiency. NAR Genom Bioinform 2023; 5:lqad059. [PMID: 37305169 PMCID: PMC10251645 DOI: 10.1093/nargab/lqad059] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2022] [Revised: 05/13/2023] [Accepted: 05/24/2023] [Indexed: 06/13/2023] Open
Abstract
Transcription and co-transcriptional processes, including pre-mRNA splicing and mRNA cleavage and polyadenylation, regulate the production of mature mRNAs. The carboxyl terminal domain (CTD) of RNA polymerase (pol) II, which comprises 52 repeats of the Tyr1Ser2Pro3Thr4Ser5Pro6Ser7 peptide, is involved in the coordination of transcription with co-transcriptional processes. The pol II CTD is dynamically modified by protein phosphorylation, which regulates recruitment of transcription and co-transcriptional factors. We have investigated whether mature mRNA levels from intron-containing protein-coding genes are related to pol II CTD phosphorylation, RNA stability, and pre-mRNA splicing and mRNA cleavage and polyadenylation efficiency. We find that genes that produce a low level of mature mRNAs are associated with relatively high phosphorylation of the pol II CTD Thr4 residue, poor RNA processing, increased chromatin association of transcripts, and shorter RNA half-life. While these poorly-processed transcripts are degraded by the nuclear RNA exosome, our results indicate that in addition to RNA half-life, chromatin association due to a low RNA processing efficiency also plays an important role in the regulation of mature mRNA levels.
Collapse
Affiliation(s)
- Callum Henfrey
- Sir William Dunn School of Pathology, University of Oxford, Oxford OX1 3RE, UK
| | - Shona Murphy
- Sir William Dunn School of Pathology, University of Oxford, Oxford OX1 3RE, UK
| | - Michael Tellier
- Sir William Dunn School of Pathology, University of Oxford, Oxford OX1 3RE, UK
- Department of Molecular and Cell Biology, University of Leicester, Leicester LE1 7RH, UK
| |
Collapse
|
5
|
Bairoch A. Meet the Editorial Board Member. CURR PROTEOMICS 2022. [DOI: 10.2174/157016461904220907111423] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Affiliation(s)
- Amos Bairoch
- Department of Human Protein Sciences
Swiss-Prot Group
Swiss Institute of Bioinformatics
Geneva
Switzerland
| |
Collapse
|
6
|
Tsang O, Wong JWH. Proteogenomic interrogation of cancer cell lines: an overview of the field. Expert Rev Proteomics 2021; 18:221-232. [PMID: 33877947 DOI: 10.1080/14789450.2021.1914594] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
Introduction: Cancer cell lines (CCLs) have been a major resource for cancer research. Over the past couple of decades, they have been instrumental in omic profiling method development and as model systems to generate new knowledge in cell and cancer biology. More recently, with the increasing amount of genomic, transcriptomic and proteomic data being generated in hundreds of CCLs, there is growing potential for integrative proteogenomic data analyses to be performed.Areas covered: In this review, we first describe the most commonly used proteome profiling methods in CCLs. We then discuss how these proteomics data can be integrated with genomics data for proteogenomics analyses. Finally, we highlight some of the recent biological discoveries that have arisen from proteogenomics analyses of CCLs.Expert opinion: Protegeonomics analyses of CCLs have so far enabled the discovery of novel proteins and proteoforms. It has also improved our understanding of biological processes including post-transcriptional regulation of protein abundance and the presentation of antigens by major histocompatibility complex alleles. With proteomics data to be generated in hundreds to thousands of CCLs in coming years, there will be further potential for large-scale proteogenomics analyses and data integration with the phenotypically well-characterized CCLs.
Collapse
Affiliation(s)
- Olson Tsang
- Centre for PanorOmic Sciences, The University of Hong Kong, Pokfulam, Hong Kong SAR
| | - Jason W H Wong
- Centre for PanorOmic Sciences, The University of Hong Kong, Pokfulam, Hong Kong SAR.,School of Biomedical Sciences, The University of Hong Kong, Pokfulam, Hong Kong SAR
| |
Collapse
|
7
|
Comparison of different variant sequence types coupled with decoy generation methods used in concatenated target-decoy database searches for proteogenomic research. J Proteomics 2020; 231:104021. [PMID: 33148401 DOI: 10.1016/j.jprot.2020.104021] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2020] [Revised: 09/29/2020] [Accepted: 10/15/2020] [Indexed: 12/21/2022]
Abstract
Concatenated target-decoy database searches are commonly used in proteogenomic research for variant peptide identification. Currently, protein-based and peptide-based sequence databases are applied to store variant sequences for database searches. The protein-based database records a full-length wild-type protein sequence but using the given variant events to replace the original amino acids, whereas the peptide-based database retains only the in silico digested peptides containing the variants. However, the performance of applying various decoy generation methods on the peptide-based variant sequence database is still unclear, compared to the protein-based database. In this paper, we conduct a thorough comparison on target-decoy databases constructed by the above two types of databases coupled with various decoy generation methods for proteogenomic analyses. The results show that for the protein-based variant sequence database, using the reverse or the pseudo reverse method achieves similar performance for variant peptide identification. Furthermore, for the peptide-based database, the pseudo reverse method is more suitable than the widely used reverse method, as shown by identifying 6% more variant PSMs in a HEK293 cell line data set. SIGNIFICANCE: In our survey of publications on proteogenomic studies, 57% of the studies adopt the peptide-based variant sequence database coupled with the reverse method for decoy generation to construct a target-decoy database for searches. However, our results show that when using the peptide-based variant sequence database, it is better to adopt the pseudo reverse method for generating decoy sequences, to avoid leading to fewer variant peptides being identified.
Collapse
|
8
|
Omenn GS, Lane L, Overall CM, Cristea IM, Corrales FJ, Lindskog C, Paik YK, Van Eyk JE, Liu S, Pennington SR, Snyder MP, Baker MS, Bandeira N, Aebersold R, Moritz RL, Deutsch EW. Research on the Human Proteome Reaches a Major Milestone: >90% of Predicted Human Proteins Now Credibly Detected, According to the HUPO Human Proteome Project. J Proteome Res 2020; 19:4735-4746. [PMID: 32931287 DOI: 10.1021/acs.jproteome.0c00485] [Citation(s) in RCA: 32] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
According to the 2020 Metrics of the HUPO Human Proteome Project (HPP), expression has now been detected at the protein level for >90% of the 19 773 predicted proteins coded in the human genome. The HPP annually reports on progress made throughout the world toward credibly identifying and characterizing the complete human protein parts list and promoting proteomics as an integral part of multiomics studies in medicine and the life sciences. NeXtProt release 2020-01 classified 17 874 proteins as PE1, having strong protein-level evidence, up 180 from 17 694 one year earlier. These represent 90.4% of the 19 773 predicted coding genes (all PE1,2,3,4 proteins in neXtProt). Conversely, the number of neXtProt PE2,3,4 proteins, termed the "missing proteins" (MPs), was reduced by 230 from 2129 to 1899 since the neXtProt 2019-01 release. PeptideAtlas is the primary source of uniform reanalysis of raw mass spectrometry data for neXtProt, supplemented this year with extensive data from MassIVE. PeptideAtlas 2020-01 added 362 canonical proteins between 2019 and 2020 and MassIVE contributed 84 more, many of which converted PE1 entries based on non-MS evidence to the MS-based subgroup. The 19 Biology and Disease-driven B/D-HPP teams continue to pursue the identification of driver proteins that underlie disease states, the characterization of regulatory mechanisms controlling the functions of these proteins, their proteoforms, and their interactions, and the progression of transitions from correlation to coexpression to causal networks after system perturbations. And the Human Protein Atlas published Blood, Brain, and Metabolic Atlases.
Collapse
Affiliation(s)
- Gilbert S Omenn
- University of Michigan, Ann Arbor, Michigan 48109, United States.,Institute for Systems Biology, Seattle, Washington 98109, United States
| | - Lydie Lane
- CALIPHO Group, SIB Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| | | | - Ileana M Cristea
- Princeton University, Princeton, New Jersey 08544, United States
| | | | | | | | | | - Siqi Liu
- BGI Group, Shenzhen 518083, China
| | | | | | - Mark S Baker
- Macquarie University, Macquarie Park, NSW 2109, Australia
| | - Nuno Bandeira
- University of California, San Diego, La Jolla, California 92093, United States
| | - Ruedi Aebersold
- ETH-Zurich and University of Zurich, 8092 Zurich, Switzerland
| | - Robert L Moritz
- Institute for Systems Biology, Seattle, Washington 98109, United States
| | - Eric W Deutsch
- Institute for Systems Biology, Seattle, Washington 98109, United States
| |
Collapse
|
9
|
Choong WK, Wang JH, Sung TY. MinProtMaxVP: Generating a minimized number of protein variant sequences containing all possible variant peptides for proteogenomic analysis. J Proteomics 2020; 223:103819. [PMID: 32407886 DOI: 10.1016/j.jprot.2020.103819] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2019] [Revised: 05/04/2020] [Accepted: 05/09/2020] [Indexed: 12/12/2022]
Abstract
Identifying single-amino-acid variants (SAVs) from mass spectrometry-based experiments is critical for validating single-nucleotide variants (SNVs) at the protein level to facilitate biomedical research. Currently, two approaches are usually applied to convert SNV annotations into SAV-harboring protein sequences. One approach generates one sequence containing exactly one SAV, and the other all SAVs. However, they may neglect the possibility of SAV combinations, e.g., haplotypes, existing in bio-samples. Therefore, it is necessary to consider all SAV combinations of a protein when generating SAV-harboring protein sequences. In this paper, we propose MinProtMaxVP, a novel approach which selects a minimized number of SAV-harboring protein sequences generated from the exhaustive approach, while still accommodating all possible variant peptides, by solving a classic set covering problem. Our study on known haplotype variations of TAS2R38 justifies the necessity for MinProtMaxVP to consider all combinations of SAVs. The performance of MinProtMaxVP is demonstrated by an in silico study on OR2T27 with five SAVs and real experimental data of the HEK293 cell line. Furthermore, assuming simulated somatic and germline variants of OR2T27 in tumor and normal tissues demonstrates that when adopting the appropriate somatic and germline SAV integration strategy, MinProtMaxVP is adaptable to labeling and label-free mass spectrometry-based experiments. SIGNIFICANCE: We present MinProtMaxVP, a novel approach to generate SAV-harboring protein sequences for constructing a customized protein sequence database, which is used in database searching for variant peptide identification. This approach outperforms the existing approaches in generating all possible variant peptides to be included in protein sequences and possibly leading to identification of more variant peptides in proteogenomic analysis.
Collapse
Affiliation(s)
- Wai-Kok Choong
- Institute of Information Science, Academia Sinica, Nankang, Taipei 11529, Taiwan
| | - Jen-Hung Wang
- Institute of Information Science, Academia Sinica, Nankang, Taipei 11529, Taiwan
| | - Ting-Yi Sung
- Institute of Information Science, Academia Sinica, Nankang, Taipei 11529, Taiwan.
| |
Collapse
|
10
|
Searle BC, Swearingen KE, Barnes CA, Schmidt T, Gessulat S, Küster B, Wilhelm M. Generating high quality libraries for DIA MS with empirically corrected peptide predictions. Nat Commun 2020; 11:1548. [PMID: 32214105 PMCID: PMC7096433 DOI: 10.1038/s41467-020-15346-1] [Citation(s) in RCA: 125] [Impact Index Per Article: 31.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2019] [Accepted: 02/28/2020] [Indexed: 11/09/2022] Open
Abstract
Data-independent acquisition approaches typically rely on experiment-specific spectrum libraries, requiring offline fractionation and tens to hundreds of injections. We demonstrate a library generation workflow that leverages fragmentation and retention time prediction to build libraries containing every peptide in a proteome, and then refines those libraries with empirical data. Our method specifically enables rapid, experiment-specific library generation for non-model organisms, which we demonstrate using the malaria parasite Plasmodium falciparum, and non-canonical databases, which we show by detecting missense variants in HeLa.
Collapse
Affiliation(s)
- Brian C Searle
- Institute for Systems Biology, Seattle, WA, USA. .,Proteome Software, Inc., Portland, OR, USA.
| | | | | | | | - Siegfried Gessulat
- Technical University of Munich, Freising, Germany.,SAP SE, Potsdam, Germany
| | - Bernhard Küster
- Technical University of Munich, Freising, Germany.,Bavarian Center for Biomolecular Mass Spectrometry, Freising, Germany
| | | |
Collapse
|
11
|
Omenn GS, Lane L, Overall CM, Corrales FJ, Schwenk JM, Paik YK, Van Eyk JE, Liu S, Pennington S, Snyder MP, Baker MS, Deutsch EW. Progress on Identifying and Characterizing the Human Proteome: 2019 Metrics from the HUPO Human Proteome Project. J Proteome Res 2019; 18:4098-4107. [PMID: 31430157 PMCID: PMC6898754 DOI: 10.1021/acs.jproteome.9b00434] [Citation(s) in RCA: 36] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023]
Abstract
The Human Proteome Project (HPP) annually reports on progress made throughout the field in credibly identifying and characterizing the complete human protein parts list and making proteomics an integral part of multiomics studies in medicine and the life sciences. NeXtProt release 2019-01-11 contains 17 694 proteins with strong protein-level evidence (PE1), compliant with HPP Guidelines for Interpretation of MS Data v2.1; these represent 89% of all 19 823 neXtProt predicted coding genes (all PE1,2,3,4 proteins), up from 17 470 one year earlier. Conversely, the number of neXtProt PE2,3,4 proteins, termed the "missing proteins" (MPs), has been reduced from 2949 to 2129 since 2016 through efforts throughout the community, including the chromosome-centric HPP. PeptideAtlas is the source of uniformly reanalyzed raw mass spectrometry data for neXtProt; PeptideAtlas added 495 canonical proteins between 2018 and 2019, especially from studies designed to detect hard-to-identify proteins. Meanwhile, the Human Protein Atlas has released version 18.1 with immunohistochemical evidence of expression of 17 000 proteins and survival plots as part of the Pathology Atlas. Many investigators apply multiplexed SRM-targeted proteomics for quantitation of organ-specific popular proteins in studies of various human diseases. The 19 teams of the Biology and Disease-driven B/D-HPP published a total of 160 publications in 2018, bringing proteomics to a broad array of biomedical research.
Collapse
Affiliation(s)
- Gilbert S. Omenn
- Department of Computational Medicine and Bioinformatics, University of Michigan, 100 Washtenaw Avenue, Ann Arbor, Michigan 48109-2218, United States
- Institute for Systems Biology, 401 Terry Avenue North, Seattle, Washington 98109-5263, United States
| | - Lydie Lane
- CALIPHO Group, SIB Swiss Institute of Bioinformatics and Department of Microbiology and Molecular Medicine, Faculty of Medicine, University of Geneva, CMU, Michel-Servet 1, 1211 Geneva 4, Switzerland
| | - Christopher M. Overall
- Life Sciences Institute, Faculty of Dentistry, University of British Columbia, 2350 Health Sciences Mall, Room 4.401, Vancouver, British Columbia V6T 1Z3, Canada
| | | | - Jochen M. Schwenk
- Science for Life Laboratory, KTH Royal Institute of Technology, Tomtebodavägen 23A, 17165 Solna, Sweden
| | - Young-Ki Paik
- Yonsei Proteome Research Center, Yonsei University, Room 425, Building #114, 50 Yonsei-ro, Seodaemoon-ku, Seoul 120-749, South Korea
| | - Jennifer E. Van Eyk
- Advanced Clinical BioSystems Research Institute, Cedars Sinai Precision Biomarker Laboratories, Barbra Streisand Women’s Heart Center, Cedars-Sinai Medical Center, Los Angeles, California 90048, United States
| | - Siqi Liu
- BGI Group-Shenzhen, Yantian District, Shenzhen 518083, China
| | - Stephen Pennington
- School of Medicine, University College Dublin, Conway Institute Belfield, Dublin 4, Ireland
| | - Michael P. Snyder
- Department of Genetics, Stanford University, Alway Building, 300 Pasteur Drive and 3165 Porter Drive, Palo Alto, California 94304, United States
| | - Mark S. Baker
- Department of Biomedical Sciences, Faculty of Medicine & Health Sciences, Macquarie University, 75 Talavera Road, North Ryde, NSW 2109, Australia
| | - Eric W. Deutsch
- Institute for Systems Biology, 401 Terry Avenue North, Seattle, Washington 98109-5263, United States
| |
Collapse
|
12
|
Paik YK, Overall CM, Corrales F, Deutsch EW, Lane L, Omenn GS. Toward Completion of the Human Proteome Parts List: Progress Uncovering Proteins That Are Missing or Have Unknown Function and Developing Analytical Methods. J Proteome Res 2019; 17:4023-4030. [PMID: 30985145 PMCID: PMC6288998 DOI: 10.1021/acs.jproteome.8b00885] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Affiliation(s)
- Young-Ki Paik
- Yonsei Proteome Research Center, College of Life Science and Technology, Yonsei University
| | - Christopher M Overall
- Centre for Blood Research, Departments of Oral Biological & Medical Sciences and Biochemistry & Molecular Biology, Faculty of Dentistry, University of British Columbia
| | - Fernando Corrales
- Functional Proteomics Laboratory National Center of Biotechnology, CSIC
| | | | - Lydie Lane
- CALIPHO Group, SIB Swiss Institute of Bioinformatics and Department of Microbiology and Molecular Medicine, Faculty of Medicine, CMU, University of Geneva
| | - Gilbert S Omenn
- Institute for Systems Biology, Departments of Computational Medicine & Bioinformatics, Internal Medicine, and Human Genetics & School of Public Health, University of Michigan
| |
Collapse
|
13
|
Li N, Zhan X. Mitochondrial Dysfunction Pathway Networks and Mitochondrial Dynamics in the Pathogenesis of Pituitary Adenomas. Front Endocrinol (Lausanne) 2019; 10:690. [PMID: 31649621 PMCID: PMC6794370 DOI: 10.3389/fendo.2019.00690] [Citation(s) in RCA: 21] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/28/2018] [Accepted: 09/23/2019] [Indexed: 12/17/2022] Open
Abstract
Mitochondrion is a multi-functional organelle, which is associated with various signaling pathway networks, including energy metabolism, oxidative stress, cell apoptosis, cell cycles, autophagy, and immunity process. Mitochondrial proteins have been discovered to modulate these signaling pathway networks, and multiple biological behaviors to adapt to various internal environments or signaling events of human pathogenesis. Accordingly, mitochondrial dysfunction that alters the bioenergetic and biosynthetic state might contribute to multiple diseases, including cell transformation and tumor. Multiomics studies have revealed that mitochondrial dysfunction, oxidative stress, and cell cycle dysregulation signaling pathways operate in human pituitary adenomas, which suggest mitochondria play critical roles in pituitary adenomas. Some drugs targeting mitochondria are found as a therapeutic strategy for pituitary adenomas, including melatonin, melatonin inhibitors, temozolomide, pyrimethamine, 18 beta-glycyrrhetinic acid, gossypol acetate, Yougui pill, T-2 toxin, grifolic acid, cyclosporine A, dopamine agonists, and paeoniflorin. This article reviews the latest experimental evidence and potential biological roles of mitochondrial dysfunction and mitochondrial dynamics in pituitary adenoma progression, potential molecular mechanisms between mitochondria and pituitary adenoma progression, and current status and perspectives of mitochondria-based biomarkers and targeted drugs for effective management of pituitary adenomas.
Collapse
Affiliation(s)
- Na Li
- Key Laboratory of Cancer Proteomics of Chinese Ministry of Health, Xiangya Hospital, Central South University, Changsha, China
- Hunan Engineering Laboratory for Structural Biology and Drug Design, Xiangya Hospital, Central South University, Changsha, China
- State Local Joint Engineering Laboratory for Anticancer Drugs, Xiangya Hospital, Central South University, Changsha, China
| | - Xianquan Zhan
- Key Laboratory of Cancer Proteomics of Chinese Ministry of Health, Xiangya Hospital, Central South University, Changsha, China
- Hunan Engineering Laboratory for Structural Biology and Drug Design, Xiangya Hospital, Central South University, Changsha, China
- State Local Joint Engineering Laboratory for Anticancer Drugs, Xiangya Hospital, Central South University, Changsha, China
- National Clinical Research Center for Geriatric Disorders, Xiangya Hospital, Central South University, Changsha, China
- Department of Oncology, Xiangya Hospital, Central South University, Changsha, China
| |
Collapse
|
14
|
The function of histone acetylation in cervical cancer development. Biosci Rep 2019; 39:BSR20190527. [PMID: 30886064 PMCID: PMC6465204 DOI: 10.1042/bsr20190527] [Citation(s) in RCA: 23] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2019] [Revised: 03/14/2019] [Accepted: 03/15/2019] [Indexed: 12/19/2022] Open
Abstract
Cervical cancer is the fourth most common female cancer in the world. It is well known that cervical cancer is closely related to high-risk human papillomavirus (HPV) infection. However, epigenetics has increasingly been recognized for its role in tumorigenesis. Epigenetics refers to changes in gene expression levels based on non-gene sequence changes, primarily through transcription or translation of genes regulation, thus affecting its function and characteristics. Typical post-translational modifications (PTMs) include acetylation, propionylation, butyrylation, malonylation and succinylation, among which the acetylation modification of lysine sites has been studied more clearly so far. The acetylation modification of lysine residues in proteins is involved in many aspects of cellular life activities, including carbon metabolism, transcriptional regulation, amino acid metabolism and so on. In this review, we summarize the latest discoveries on cervical cancer development arising from the aspect of acetylation, especially histone acetylation.
Collapse
|
15
|
Kushner IK, Clair G, Purvine SO, Lee JY, Adkins JN, Payne SH. Individual Variability of Protein Expression in Human Tissues. J Proteome Res 2018; 17:3914-3922. [PMID: 30300549 DOI: 10.1021/acs.jproteome.8b00580] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Human tissues are known to exhibit interindividual variability, but a deeper understanding of the different factors affecting protein expression is necessary to further apply this knowledge. Our goal was to explore the proteomic variability between individuals as well as between healthy and diseased samples, and to test the efficacy of machine learning classifiers. In order to investigate whether disparate proteomics data sets may be combined, we performed a retrospective analysis of proteomics data from 9 different human tissues. These data sets represent several different sample prep methods, mass spectrometry instruments, and tissue health. Using these data, we examined interindividual and intertissue variability in peptide expression, and analyzed the methods required to build accurate tissue classifiers. We also evaluated the limits of tissue classification by downsampling the peptide data to simulate situations where less data is available, such as clinical biopsies, laser capture microdissection or potentially single-cell proteomics. Our findings reveal the strong potential for utilizing proteomics data to build robust tissue classifiers, which has many prospective clinical applications for evaluating the applicability of model clinical systems.
Collapse
Affiliation(s)
- Irena K Kushner
- Biological Sciences Division , Pacific Northwest National Laboratory , Richland , Washington 99336 , United States
| | - Geremy Clair
- Biological Sciences Division , Pacific Northwest National Laboratory , Richland , Washington 99336 , United States
| | - Samuel Owen Purvine
- Biological Sciences Division , Pacific Northwest National Laboratory , Richland , Washington 99336 , United States
| | - Joon-Yong Lee
- Biological Sciences Division , Pacific Northwest National Laboratory , Richland , Washington 99336 , United States
| | - Joshua N Adkins
- Biological Sciences Division , Pacific Northwest National Laboratory , Richland , Washington 99336 , United States
| | - Samuel H Payne
- Biological Sciences Division , Pacific Northwest National Laboratory , Richland , Washington 99336 , United States
| |
Collapse
|