1
|
Wei Q, Li J, He QY, Chen Y, Zhang G. Identifying PE2 and PE5 Proteins from Existing Mass Spectrometry Data Using pFind. J Proteome Res 2024; 23:2323-2331. [PMID: 38865581 DOI: 10.1021/acs.jproteome.3c00674] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/14/2024]
Abstract
The Chromosome-Centric Human Proteome Project (C-HPP) aims to identify all proteins encoded by the human genome. Currently, the human proteome still contains approximately 2000 PE2-PE5 proteins, referring to annotated coding genes that lack sufficient protein-level evidence. During the past 10 years, it has been increasingly difficult to identify PE2-PE5 proteins in C-HPP approaches due to the limited occurrence. Therefore, we proposed that reanalyzing massive MS data sets in repository with newly developed algorithms may increase the occurrence of the peptides of these proteins. In this study, we downloaded 1000 MS data sets via the ProteomeXchange database. Using pFind software, we identified peptides referring to 1788 PE2-PE5 proteins. Among them, 11 PE2 and 16 PE5 proteins were identified with at least 2 peptides, and 12 of them were identified using 2 peptides in a single data set, following the criteria of the HPP guidelines. We found translation evidence for 16 of the 11 PE2 and 16 PE5 proteins in our RNC-seq data, supporting their existence. The properties of the PE2 and PE5 proteins were similar to those of the PE1 proteins. Our approach demonstrated that mining PE2 and PE5 proteins in massive data repository is still worthy, and multidata set peptide identifications may support the presence of PE2 and PE5 proteins or at least prompt additional studies for validation. Extremely high throughput could be a solution to finding more PE2 and PE5 proteins.
Collapse
Affiliation(s)
- Qianzhou Wei
- Key Laboratory of Functional Protein Research of Guangdong Higher Education Institutes and MOE Key Laboratory of Tumor Molecular Biology, Institute of Life and Health Engineering, Jinan University, Guangzhou 510632, China
| | - Jiamin Li
- Key Laboratory of Functional Protein Research of Guangdong Higher Education Institutes and MOE Key Laboratory of Tumor Molecular Biology, Institute of Life and Health Engineering, Jinan University, Guangzhou 510632, China
| | - Qing-Yu He
- Key Laboratory of Functional Protein Research of Guangdong Higher Education Institutes and MOE Key Laboratory of Tumor Molecular Biology, Institute of Life and Health Engineering, Jinan University, Guangzhou 510632, China
| | - Yang Chen
- Key Laboratory of Functional Protein Research of Guangdong Higher Education Institutes and MOE Key Laboratory of Tumor Molecular Biology, Institute of Life and Health Engineering, Jinan University, Guangzhou 510632, China
| | - Gong Zhang
- Key Laboratory of Functional Protein Research of Guangdong Higher Education Institutes and MOE Key Laboratory of Tumor Molecular Biology, Institute of Life and Health Engineering, Jinan University, Guangzhou 510632, China
| |
Collapse
|
2
|
Deep coverage proteome analysis of hair shaft for forensic individual identification. Forensic Sci Int Genet 2022; 60:102742. [DOI: 10.1016/j.fsigen.2022.102742] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2021] [Revised: 06/13/2022] [Accepted: 06/15/2022] [Indexed: 11/18/2022]
|
3
|
A Strategy for Rapid Discovery of Marker Peptides Associated with Fibrinolytic Efficacy of Pheretima aspergillum Based on Bioinformatics Combined with Parallel Reaction Monitoring. Molecules 2022; 27:molecules27092651. [PMID: 35566002 PMCID: PMC9100157 DOI: 10.3390/molecules27092651] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2022] [Revised: 04/14/2022] [Accepted: 04/19/2022] [Indexed: 11/17/2022] Open
Abstract
Quality control of animal-derived traditional Chinese medicines has improved dramatically as proteomics research advanced in the past few decades. However, it remains challenging to identify quality attributes with routine proteomics approaches since protein with fibrinolytic activity is rarely reported in pheretima, a typical animal-derived traditional medicine. A novel strategy based on bioinformatics combined with parallel reaction monitoring (PRM) was developed here to rapidly discover the marker peptides associated with a fibrinolytic effect. Potential marker peptides were found by lumbrokinase sequences’ alignment and in silico digestion. The fibrinogen zymography was used to visually identify fibrinolytic proteins in pheretima. As a result, it was found that the fibrinolytic activity varied among different portions of pheretima. Fibrinolytic proteins were distributed regionally in the anterior and anterior-mid portion and there was no significant fibrinogenolytic activity observed in the mid-posterior and posterior portion. Finally, PRM experiments were deployed to validate and quantify selected marker peptides and a total of 11 peptides were identified as marker peptides, which could be potentially used in quality control of pheretima. This strategy provides a robust workflow to benefit the quality control of other animal-derived traditional medicines.
Collapse
|
4
|
Bu F, Cheng Q, Zhang Y, Zhang X, Yan K, Liu F, Li Z, Lu X, Ren Y, Liu S. Discovery of Missing Proteins from an Aneuploidy Cell Line Using a Proteogenomic Approach. J Proteome Res 2021; 20:5329-5339. [PMID: 34748338 DOI: 10.1021/acs.jproteome.1c00772] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
With the steadfast development of proteomic technology, the number of missing proteins (MPs) has been continuously shrinking, with approximately 1470 MPs that have not been explored yet. Due to this phenomenon, the discovery of MPs has been increasingly more difficult and elusive. In order to face this challenge, we have hypothesized that a stable aneuploid cell line with increased chromosomes serves as a useful material for assisting MP exploration. Ker-CT cell line with trisomy at chromosome 5 and 20 was selected for this purpose. With a combination strategy of RNA-Seq and LC-MS/MS, a total of 22 178 transcripts and 8846 proteins were identified in Ker-CT. Although the transcripts corresponding to 15 and 15 MP genes located at chromosome 5 and 20 were detected, none of the MPs were found in Ker-CT. Surprisingly, 3 MPs containing at least two unique non-nest peptides of length ≥9 amino acids were identified in Ker-CT, whose genes are located on chromosome 3 and 10, respectively. Furthermore, the 3 MPs were verified using the method of parallel reaction monitoring (PRM). These results suggest that the abnormal status of chromosomes may not only impact the expression of the corresponding genes in trisomy chromosomes, but also influence that of other chromosomes, which benefits MP discovery. The data obtained in this study are available via ProteomeXchange (PXD028647) and PeptideAtlas (PASS01700), respectively.
Collapse
Affiliation(s)
- Fanyu Bu
- BGI-Shenzhen, Beishan Industrial Zone 11th Building, Yantian District, Shenzhen, Guangdong 518083, China.,Department of BGI Education, School of Life Sciences, University of Chinese Academy of Sciences, Shenzhen, Guangdong 518083, China
| | - Qingqiu Cheng
- Clinical Laboratory Center of Dongguan Eighth People's Hospital, Dongguan 523325, China
| | - Yuxing Zhang
- BGI-Shenzhen, Beishan Industrial Zone 11th Building, Yantian District, Shenzhen, Guangdong 518083, China.,Department of BGI Education, School of Life Sciences, University of Chinese Academy of Sciences, Shenzhen, Guangdong 518083, China
| | - Xia Zhang
- BGI-Shenzhen, Beishan Industrial Zone 11th Building, Yantian District, Shenzhen, Guangdong 518083, China.,Department of BGI Education, School of Life Sciences, University of Chinese Academy of Sciences, Shenzhen, Guangdong 518083, China
| | - Keqiang Yan
- BGI-Shenzhen, Beishan Industrial Zone 11th Building, Yantian District, Shenzhen, Guangdong 518083, China.,Department of BGI Education, School of Life Sciences, University of Chinese Academy of Sciences, Shenzhen, Guangdong 518083, China
| | - Frank Liu
- BGI-Shenzhen, Beishan Industrial Zone 11th Building, Yantian District, Shenzhen, Guangdong 518083, China
| | - Zelong Li
- Biological Resource Center of Plants, Animals and Microorganisms, China National Gene Bank, BGI-Shenzhen, Guangdong 518120, China
| | - Xiaomei Lu
- Clinical Laboratory Center of Dongguan Eighth People's Hospital, Dongguan 523325, China
| | - Yan Ren
- BGI-Shenzhen, Beishan Industrial Zone 11th Building, Yantian District, Shenzhen, Guangdong 518083, China
| | - Siqi Liu
- BGI-Shenzhen, Beishan Industrial Zone 11th Building, Yantian District, Shenzhen, Guangdong 518083, China.,Department of BGI Education, School of Life Sciences, University of Chinese Academy of Sciences, Shenzhen, Guangdong 518083, China
| |
Collapse
|
5
|
[Advances in high-throughput proteomic analysis]. Se Pu 2021; 39:112-117. [PMID: 34227342 PMCID: PMC9274848 DOI: 10.3724/sp.j.1123.2020.08023] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022] Open
Abstract
基于质谱的蛋白质组学技术已经日趋成熟,可以对细胞和组织中的成千上万种蛋白质进行全面的定性和定量分析,逐步实现“深度覆盖”。随着生物医学日益增长的大队列蛋白质组学分析需求,如何在保持较为理想的覆盖深度下实现短时间、快速的“高通量”蛋白质组学分析已成为当前亟需解决的关键问题之一。常规的蛋白质组学分析流程通常包括样品前处理、色谱分离、质谱检测和数据分析。该文从以上4个方面展开介绍近10年以来高通量蛋白质组学分析技术取得的一系列研究进展,主要包括:(1)基于高通量、自动化移液工作站的蛋白质组样品前处理方法;(2)基于微升流速液相色谱与质谱联用的高通量蛋白质组检测方法;(3)利用灵敏度高、扫描速度快的质谱仪实现短色谱梯度分离下蛋白质组深度覆盖的分析方法;(4)基于人工智能、深度神经网络、机器学习等的蛋白质组学大数据分析方法。此外,对高通量蛋白质组学面临的挑战及其发展进行展望。总而言之,预期在不久的将来高通量蛋白质组学技术将会逐步“落地转化”,成为大队列蛋白质组学分析的利器。
Collapse
|
6
|
Smythers AL, Hicks LM. Mapping the plant proteome: tools for surveying coordinating pathways. Emerg Top Life Sci 2021; 5:203-220. [PMID: 33620075 PMCID: PMC8166341 DOI: 10.1042/etls20200270] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2021] [Revised: 02/07/2021] [Accepted: 02/09/2021] [Indexed: 12/14/2022]
Abstract
Plants rapidly respond to environmental fluctuations through coordinated, multi-scalar regulation, enabling complex reactions despite their inherently sessile nature. In particular, protein post-translational signaling and protein-protein interactions combine to manipulate cellular responses and regulate plant homeostasis with precise temporal and spatial control. Understanding these proteomic networks are essential to addressing ongoing global crises, including those of food security, rising global temperatures, and the need for renewable materials and fuels. Technological advances in mass spectrometry-based proteomics are enabling investigations of unprecedented depth, and are increasingly being optimized for and applied to plant systems. This review highlights recent advances in plant proteomics, with an emphasis on spatially and temporally resolved analysis of post-translational modifications and protein interactions. It also details the necessity for generation of a comprehensive plant cell atlas while highlighting recent accomplishments within the field.
Collapse
Affiliation(s)
- Amanda L Smythers
- Department of Chemistry, The University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, U.S.A
| | - Leslie M Hicks
- Department of Chemistry, The University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, U.S.A
| |
Collapse
|
7
|
Aggarwal S, Tolani P, Gupta S, Yadav AK. Posttranslational modifications in systems biology. ADVANCES IN PROTEIN CHEMISTRY AND STRUCTURAL BIOLOGY 2021; 127:93-126. [PMID: 34340775 DOI: 10.1016/bs.apcsb.2021.03.005] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
The biological complexity cannot be captured by genes or proteins alone. The protein posttranslational modifications (PTMs) impart functional diversity to the proteome and regulate protein structure, activity, localization and interactions. Their dynamics drive cellular signaling, growth and development while their dysregulation causes many diseases. Mass spectrometry based quantitative profiling of PTMs and bioinformatics analysis tools allow systems level insights into their network architecture. High-resolution profiling of PTM networks will advance disease understanding and precision medicine. It can accelerate the discovery of biomarkers and drug targets. This requires better tools for unbiased, high-throughput and accurate PTM identification, site localization and automated annotation on a systems level.
Collapse
Affiliation(s)
- Suruchi Aggarwal
- Translational Health Science and Technology Institute, NCR Biotech Science Cluster, Faridabad, Haryana, India; Department of Molecular Biology and Biotechnology, Cotton University, Guwahati, Assam, India
| | - Priya Tolani
- Translational Health Science and Technology Institute, NCR Biotech Science Cluster, Faridabad, Haryana, India
| | - Srishti Gupta
- Translational Health Science and Technology Institute, NCR Biotech Science Cluster, Faridabad, Haryana, India; School of Biosciences and Technology, Vellore Institute of Technology, Vellore, India
| | - Amit Kumar Yadav
- Translational Health Science and Technology Institute, NCR Biotech Science Cluster, Faridabad, Haryana, India.
| |
Collapse
|
8
|
González-Gomariz J, Serrano G, Tilve-Álvarez CM, Corrales FJ, Guruceaga E, Segura V. UPEFinder: A Bioinformatic Tool for the Study of Uncharacterized Proteins Based on Gene Expression Correlation and the PageRank Algorithm. J Proteome Res 2020; 19:4795-4807. [PMID: 33155801 DOI: 10.1021/acs.jproteome.0c00364] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022]
Abstract
The Human Proteome Project (HPP) is leading the international effort to characterize the human proteome. Although the main goal of this project was first focused on the detection of missing proteins, a new challenge arose from the need to assign biological functions to the uncharacterized human proteins and describe their implications in human diseases. Not only the proteins with experimental evidence (uPE1 proteins) but also the uncharacterized missing proteins (uMPs) were the objects of study in this challenge, neXt-CP50. In this work, we developed a new bioinformatic approach to infer biological annotations for the uPE1 proteins and uMPs based on a "guilt-by-association" analysis using public RNA-Seq data sets. We used the correlation of these proteins with the well-characterized PE1 proteins to construct a network. In this way, we applied the PageRank algorithm to this network to identify the most relevant nodes, which were the biological annotations of the uncharacterized proteins. All of the generated information was stored in a database. In addition, we implemented the web application UPEFinder (https://upefinder.proteored.org) to facilitate the access to this new resource. This information is especially relevant for the researchers of the HPP who are interested in the generation and validation of new hypotheses about the functions of these proteins. Both the database and the web application are publicly available (https://github.com/ubioinformat/UPEfinder).
Collapse
Affiliation(s)
| | - Guillermo Serrano
- Bioinformatics Platform, CIMA University of Navarra, Pamplona E-31008, Spain
| | - Carlos M Tilve-Álvarez
- Fundación Profesor Nóvoa-Santos, Instituto de Investigación Biomédica da Coruña, Coruña E-15006, Spain
| | - Fernando J Corrales
- Proteomics Unit, National Center for Biotechnology, CSIC, Madrid E-28049, Spain
| | - Elizabeth Guruceaga
- IdiSNA, Navarra Institute for Health Research, Pamplona E-31008, Spain.,Bioinformatics Platform, CIMA University of Navarra, Pamplona E-31008, Spain
| | - Victor Segura
- Tracasa Instrumental, Sarriguren E-31621, Spain.,Sección de Ingeniería del Dato, Dirección General de Telecomunicaciones y Digitalización, Gobierno de Navarra, Sarriguren E-31621, Spain
| |
Collapse
|
9
|
Zhang Y, Zhang K, Bu F, Hao P, Yang H, Liu S, Ren Y. D283 Med, a Cell Line Derived from Peritoneal Metastatic Medulloblastoma: A Good Choice for Missing Protein Discovery. J Proteome Res 2020; 19:4857-4866. [DOI: 10.1021/acs.jproteome.0c00743] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
Affiliation(s)
- Yuanliang Zhang
- BGI-Shenzhen, Beishan Industrial Zone 11th Building, Yantian District, Shenzhen, Guangdong 518083, China
- BGI-Genomics, BGI-Shenzhen, Shenzhen, Guangdong 518083, China
| | - Keren Zhang
- BGI-Shenzhen, Beishan Industrial Zone 11th Building, Yantian District, Shenzhen, Guangdong 518083, China
- BGI-Genomics, BGI-Shenzhen, Shenzhen, Guangdong 518083, China
| | - Fanyu Bu
- BGI Education Center, University of Chinese Academy of Sciences, Shenzhen, Guangdong 518083, China
| | - Piliang Hao
- School of Life Science and Technology, ShanghaiTech University, 393 Middle Huaxia Road, Shanghai 201210, China
| | - Huanming Yang
- BGI-Shenzhen, Beishan Industrial Zone 11th Building, Yantian District, Shenzhen, Guangdong 518083, China
- James D. Watson Institute of Genome Sciences, Hangzhou 310058, China
| | - Siqi Liu
- BGI-Shenzhen, Beishan Industrial Zone 11th Building, Yantian District, Shenzhen, Guangdong 518083, China
- BGI-Genomics, BGI-Shenzhen, Shenzhen, Guangdong 518083, China
| | - Yan Ren
- BGI-Shenzhen, Beishan Industrial Zone 11th Building, Yantian District, Shenzhen, Guangdong 518083, China
- BGI-Genomics, BGI-Shenzhen, Shenzhen, Guangdong 518083, China
| |
Collapse
|
10
|
Wu S, Sun J, Wang X, Xu F, Chi H, Li Y, Zhong B, Xie Y, Yan Z, Chang L, Wang D, He F, Wu J, Zhang Y, Xu P. Open-pFind Verified Four Missing Proteins from Multi-Tissues. J Proteome Res 2020; 19:4808-4814. [PMID: 33172275 DOI: 10.1021/acs.jproteome.0c00370] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
The Chromosome-Centric Human Proteome Project (C-HPP) was launched in 2012 to perfect the annotation of human protein existence by identifying stronger evidence of the expression of missing proteins (MPs) at the protein level. After an 8 year effort all over the world, the number of MPs in the neXtProt database significantly decreased from 5511 (2012-02-24) to 1899 (2020-01-17). It is now more difficult to provide confident evidence of the remaining MPs because of their specific characteristics, including low abundance, low molecular weight, unexpected modifications, transmembrane structure, tissue-expression specificity, and so on. A higher resolution mass spectrometry (MS) interpretation engine might provide an opportunity to identify these buried MPs in complex samples by the combination with multi-tissue large-scale proteomics. In this study, open-pFind was used to dig MPs from 20 pairs of healthy human tissues by Wang et al. ( Mol. Syst. Biol. 2019, 15 (2), e8503) combined with our large-scale testis data set digested by three enzymes (Glu-C, Lys-C, and trypsin) with specificity for different amino acid residues ( J. Proteme Res. 2019, 18 (12), 4189-4196). A total of 1 535 536 peptides with 17 283 477 peptide-spectrum matches (PSMs) were mapped to 14 279 protein entries at a false discovery rate of <1% at the PSM, peptide, and protein levels. A total of 103 MP candidates were identified, among which 86 candidates had more unique peptide numbers compared with our single testis tissue. After rigorous screening, manual checks, peptide synthesis, and matching with documented peptides from PeptideAtlas, we validated four MPs, P0C7T8 (duodenum and small intestine), Q8WWZ4 (stomach and rectum), Q8IV35 (fallopian tube), and O14921 (tonsil), at the protein level. All MS raw files have been deposited to the ProteomeXchange with identifier PXD021391.
Collapse
Affiliation(s)
- Shujia Wu
- School of Basic Medical Science, Key Laboratory of Combinational Biosynthesis and Drug Discovery of Ministry of Education, School of Basic Medical Sciences, Wuhan University, Wuhan 430072, China.,State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Research Unit of Proteomics & Research and Development of New Drug of Chinese Academy of Medical Sciences, Institute of Lifeomics, Beijing 102206, China
| | - Jinshuai Sun
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Research Unit of Proteomics & Research and Development of New Drug of Chinese Academy of Medical Sciences, Institute of Lifeomics, Beijing 102206, China.,Hebei Province Key Lab of Research and Application on Microbial Diversity, College of Life Sciences, Hebei University, Baoding, Hebei 071002, China
| | - Xi Wang
- Key Laboratory of Intelligent Information Processing of Chinese Academy of Sciences (CAS), Institute of Computing Technology, Beijing 100080, China
| | - Feng Xu
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Research Unit of Proteomics & Research and Development of New Drug of Chinese Academy of Medical Sciences, Institute of Lifeomics, Beijing 102206, China
| | - Hao Chi
- Key Laboratory of Intelligent Information Processing of Chinese Academy of Sciences (CAS), Institute of Computing Technology, Beijing 100080, China
| | - Yanchang Li
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Research Unit of Proteomics & Research and Development of New Drug of Chinese Academy of Medical Sciences, Institute of Lifeomics, Beijing 102206, China
| | - Bowen Zhong
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Research Unit of Proteomics & Research and Development of New Drug of Chinese Academy of Medical Sciences, Institute of Lifeomics, Beijing 102206, China
| | - Yuping Xie
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Research Unit of Proteomics & Research and Development of New Drug of Chinese Academy of Medical Sciences, Institute of Lifeomics, Beijing 102206, China
| | - Zhonghua Yan
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Research Unit of Proteomics & Research and Development of New Drug of Chinese Academy of Medical Sciences, Institute of Lifeomics, Beijing 102206, China
| | - Lei Chang
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Research Unit of Proteomics & Research and Development of New Drug of Chinese Academy of Medical Sciences, Institute of Lifeomics, Beijing 102206, China
| | - Dongxue Wang
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Research Unit of Proteomics & Research and Development of New Drug of Chinese Academy of Medical Sciences, Institute of Lifeomics, Beijing 102206, China
| | - Fuchu He
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Research Unit of Proteomics & Research and Development of New Drug of Chinese Academy of Medical Sciences, Institute of Lifeomics, Beijing 102206, China
| | - Junzhu Wu
- School of Basic Medical Science, Key Laboratory of Combinational Biosynthesis and Drug Discovery of Ministry of Education, School of Basic Medical Sciences, Wuhan University, Wuhan 430072, China
| | - Yao Zhang
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Research Unit of Proteomics & Research and Development of New Drug of Chinese Academy of Medical Sciences, Institute of Lifeomics, Beijing 102206, China
| | - Ping Xu
- School of Basic Medical Science, Key Laboratory of Combinational Biosynthesis and Drug Discovery of Ministry of Education, School of Basic Medical Sciences, Wuhan University, Wuhan 430072, China.,State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Research Unit of Proteomics & Research and Development of New Drug of Chinese Academy of Medical Sciences, Institute of Lifeomics, Beijing 102206, China.,Hebei Province Key Lab of Research and Application on Microbial Diversity, College of Life Sciences, Hebei University, Baoding, Hebei 071002, China
| |
Collapse
|
11
|
Omenn GS, Lane L, Overall CM, Cristea IM, Corrales FJ, Lindskog C, Paik YK, Van Eyk JE, Liu S, Pennington SR, Snyder MP, Baker MS, Bandeira N, Aebersold R, Moritz RL, Deutsch EW. Research on the Human Proteome Reaches a Major Milestone: >90% of Predicted Human Proteins Now Credibly Detected, According to the HUPO Human Proteome Project. J Proteome Res 2020; 19:4735-4746. [PMID: 32931287 DOI: 10.1021/acs.jproteome.0c00485] [Citation(s) in RCA: 32] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
According to the 2020 Metrics of the HUPO Human Proteome Project (HPP), expression has now been detected at the protein level for >90% of the 19 773 predicted proteins coded in the human genome. The HPP annually reports on progress made throughout the world toward credibly identifying and characterizing the complete human protein parts list and promoting proteomics as an integral part of multiomics studies in medicine and the life sciences. NeXtProt release 2020-01 classified 17 874 proteins as PE1, having strong protein-level evidence, up 180 from 17 694 one year earlier. These represent 90.4% of the 19 773 predicted coding genes (all PE1,2,3,4 proteins in neXtProt). Conversely, the number of neXtProt PE2,3,4 proteins, termed the "missing proteins" (MPs), was reduced by 230 from 2129 to 1899 since the neXtProt 2019-01 release. PeptideAtlas is the primary source of uniform reanalysis of raw mass spectrometry data for neXtProt, supplemented this year with extensive data from MassIVE. PeptideAtlas 2020-01 added 362 canonical proteins between 2019 and 2020 and MassIVE contributed 84 more, many of which converted PE1 entries based on non-MS evidence to the MS-based subgroup. The 19 Biology and Disease-driven B/D-HPP teams continue to pursue the identification of driver proteins that underlie disease states, the characterization of regulatory mechanisms controlling the functions of these proteins, their proteoforms, and their interactions, and the progression of transitions from correlation to coexpression to causal networks after system perturbations. And the Human Protein Atlas published Blood, Brain, and Metabolic Atlases.
Collapse
Affiliation(s)
- Gilbert S Omenn
- University of Michigan, Ann Arbor, Michigan 48109, United States.,Institute for Systems Biology, Seattle, Washington 98109, United States
| | - Lydie Lane
- CALIPHO Group, SIB Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
| | | | - Ileana M Cristea
- Princeton University, Princeton, New Jersey 08544, United States
| | | | | | | | | | - Siqi Liu
- BGI Group, Shenzhen 518083, China
| | | | | | - Mark S Baker
- Macquarie University, Macquarie Park, NSW 2109, Australia
| | - Nuno Bandeira
- University of California, San Diego, La Jolla, California 92093, United States
| | - Ruedi Aebersold
- ETH-Zurich and University of Zurich, 8092 Zurich, Switzerland
| | - Robert L Moritz
- Institute for Systems Biology, Seattle, Washington 98109, United States
| | - Eric W Deutsch
- Institute for Systems Biology, Seattle, Washington 98109, United States
| |
Collapse
|
12
|
Adhikari S, Nice EC, Deutsch EW, Lane L, Omenn GS, Pennington SR, Paik YK, Overall CM, Corrales FJ, Cristea IM, Van Eyk JE, Uhlén M, Lindskog C, Chan DW, Bairoch A, Waddington JC, Justice JL, LaBaer J, Rodriguez H, He F, Kostrzewa M, Ping P, Gundry RL, Stewart P, Srivastava S, Srivastava S, Nogueira FCS, Domont GB, Vandenbrouck Y, Lam MPY, Wennersten S, Vizcaino JA, Wilkins M, Schwenk JM, Lundberg E, Bandeira N, Marko-Varga G, Weintraub ST, Pineau C, Kusebauch U, Moritz RL, Ahn SB, Palmblad M, Snyder MP, Aebersold R, Baker MS. A high-stringency blueprint of the human proteome. Nat Commun 2020; 11:5301. [PMID: 33067450 PMCID: PMC7568584 DOI: 10.1038/s41467-020-19045-9] [Citation(s) in RCA: 131] [Impact Index Per Article: 32.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2019] [Accepted: 09/25/2020] [Indexed: 02/07/2023] Open
Abstract
The Human Proteome Organization (HUPO) launched the Human Proteome Project (HPP) in 2010, creating an international framework for global collaboration, data sharing, quality assurance and enhancing accurate annotation of the genome-encoded proteome. During the subsequent decade, the HPP established collaborations, developed guidelines and metrics, and undertook reanalysis of previously deposited community data, continuously increasing the coverage of the human proteome. On the occasion of the HPP's tenth anniversary, we here report a 90.4% complete high-stringency human proteome blueprint. This knowledge is essential for discerning molecular processes in health and disease, as we demonstrate by highlighting potential roles the human proteome plays in our understanding, diagnosis and treatment of cancers, cardiovascular and infectious diseases.
Collapse
Affiliation(s)
- Subash Adhikari
- Faculty of Medicine, Health and Human Sciences, Department of Biomedical Sciences, Macquarie University, North Ryde, NSW, 2109, Australia
| | - Edouard C Nice
- Faculty of Medicine, Health and Human Sciences, Department of Biomedical Sciences, Macquarie University, North Ryde, NSW, 2109, Australia
- Faculty of Medicine, Nursing and Health Sciences, Department of Biochemistry and Molecular Biology, Monash University, Melbourne, VIC, 3800, Australia
| | - Eric W Deutsch
- Institute for Systems Biology, 401 Terry Avenue North, Seattle, WA, 98109, USA
| | - Lydie Lane
- Faculty of Medicine, SIB-Swiss Institute of Bioinformatics and Department of Microbiology and Molecular Medicine, University of Geneva, CMU, Michel-Servet 1, 1211, Geneva, Switzerland
| | - Gilbert S Omenn
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, 48109-2218, USA
| | - Stephen R Pennington
- UCD Conway Institute of Biomolecular and Biomedical Research, School of Medicine, University College Dublin, Dublin, Ireland
| | - Young-Ki Paik
- Yonsei Proteome Research Center, 50 Yonsei-ro, Sudaemoon-ku, Seoul, 120-749, South Korea
| | | | - Fernando J Corrales
- Functional Proteomics Laboratory, Centro Nacional de Biotecnología-CSIC, Proteored-ISCIII, 28049, Madrid, Spain
| | - Ileana M Cristea
- Department of Molecular Biology, Princeton University, Princeton, NJ, 08544, USA
| | - Jennifer E Van Eyk
- Cedars Sinai Medical Center, Advanced Clinical Biosystems Research Institute, The Smidt Heart Institute, Los Angeles, CA, 90048, USA
| | - Mathias Uhlén
- Science for Life Laboratory, School of Engineering Sciences in Chemistry, Biotechnology and Health, KTH Royal Institute of Technology, 17121, Solna, Sweden
| | - Cecilia Lindskog
- Rudbeck Laboratory, Department of Immunology, Genetics and Pathology, Uppsala University, 75185, Uppsala, Sweden
| | - Daniel W Chan
- Department of Pathology and Oncology, Johns Hopkins University School of Medicine, Baltimore, MD, 21224, USA
| | - Amos Bairoch
- Faculty of Medicine, SIB-Swiss Institute of Bioinformatics and Department of Microbiology and Molecular Medicine, University of Geneva, CMU, Michel-Servet 1, 1211, Geneva, Switzerland
| | - James C Waddington
- UCD Conway Institute of Biomolecular and Biomedical Research, School of Medicine, University College Dublin, Dublin, Ireland
| | - Joshua L Justice
- Department of Molecular Biology, Princeton University, Princeton, NJ, 08544, USA
| | - Joshua LaBaer
- Biodesign Institute, Arizona State University, Tempe, AZ, USA
| | - Henry Rodriguez
- Office of Cancer Clinical Proteomics Research, National Cancer Institute, NIH, Bethesda, MD, 20892, USA
| | - Fuchu He
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing, 102206, China
| | - Markus Kostrzewa
- Bruker Daltonik GmbH, Microbiology and Diagnostics, Fahrenheitstrasse, 428359, Bremen, Germany
| | - Peipei Ping
- Cardiac Proteomics and Signaling Laboratory, Department of Physiology, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, CA, USA
| | - Rebekah L Gundry
- CardiOmics Program, Center for Heart and Vascular Research, Division of Cardiovascular Medicine and Department of Cellular and Integrative Physiology, University of Nebraska Medical Center, Omaha, NE, 68198, USA
| | - Peter Stewart
- Department of Chemical Pathology, Royal Prince Alfred Hospital, Camperdown, NSW, Australia
| | | | - Sudhir Srivastava
- Cancer Biomarkers Research Branch, National Cancer Institute, National Institutes of Health, 9609 Medical Center Drive, Suite 5E136, Rockville, MD, 20852, USA
| | - Fabio C S Nogueira
- Proteomics Unit and Laboratory of Proteomics, Institute of Chemistry, Federal University of Rio de Janeiro, Av Athos da Silveria Ramos, 149, 21941-909, Rio de Janeiro, RJ, Brazil
| | - Gilberto B Domont
- Proteomics Unit and Laboratory of Proteomics, Institute of Chemistry, Federal University of Rio de Janeiro, Av Athos da Silveria Ramos, 149, 21941-909, Rio de Janeiro, RJ, Brazil
| | - Yves Vandenbrouck
- University of Grenoble Alpes, Inserm, CEA, IRIG-BGE, U1038, 38000, Grenoble, France
| | - Maggie P Y Lam
- Departments of Medicine-Cardiology and Biochemistry and Molecular Genetics, University of Colorado, Anschutz Medical Campus, Aurora, CO, USA
- Consortium for Fibrosis Research and Translation, University of Colorado, Anschutz Medical Campus, Aurora, CO, USA
| | - Sara Wennersten
- Division of Cardiology, Department of Medicine, University of Colorado, Anschutz Medical Campus, Aurora, CO, USA
| | - Juan Antonio Vizcaino
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Marc Wilkins
- School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, NSW, Australia
| | - Jochen M Schwenk
- Science for Life Laboratory, School of Engineering Sciences in Chemistry, Biotechnology and Health, KTH Royal Institute of Technology, 17121, Solna, Sweden
| | - Emma Lundberg
- Science for Life Laboratory, School of Engineering Sciences in Chemistry, Biotechnology and Health, KTH Royal Institute of Technology, 17121, Solna, Sweden
| | - Nuno Bandeira
- Department of Computer Science and Engineering, University of California, San Diego, 9500 Gilman Drive, Mail Code 0404, La Jolla, CA, 92093-0404, USA
| | | | - Susan T Weintraub
- Department of Biochemistry and Structural Biology, University of Texas Health Science Center San Antonio, UT Health, 7703 Floyd Curl Drive, San Antonio, TX, 78229-3900, USA
| | - Charles Pineau
- University of Rennes, Inserm, EHESP, IREST, UMR_S 1085, F-35042, Rennes, France
| | - Ulrike Kusebauch
- Institute for Systems Biology, 401 Terry Avenue North, Seattle, WA, 98109, USA
| | - Robert L Moritz
- Institute for Systems Biology, 401 Terry Avenue North, Seattle, WA, 98109, USA
| | - Seong Beom Ahn
- Faculty of Medicine, Health and Human Sciences, Department of Biomedical Sciences, Macquarie University, North Ryde, NSW, 2109, Australia
| | - Magnus Palmblad
- Leiden University Medical Center, Leiden, 2333, The Netherlands
| | - Michael P Snyder
- Department of Genetics, Stanford School of Medicine, Stanford, CA, 94305, USA
| | - Ruedi Aebersold
- Institute for Systems Biology, 401 Terry Avenue North, Seattle, WA, 98109, USA
- Faculty of Science, University of Zurich, Zurich, Switzerland
| | - Mark S Baker
- Faculty of Medicine, Health and Human Sciences, Department of Biomedical Sciences, Macquarie University, North Ryde, NSW, 2109, Australia.
- Department of Genetics, Stanford School of Medicine, Stanford, CA, 94305, USA.
| |
Collapse
|
13
|
Xu R, Sheng J, Bai M, Shu K, Zhu Y, Chang C. A Comprehensive Evaluation of MS/MS Spectrum Prediction Tools for Shotgun Proteomics. Proteomics 2020; 20:e1900345. [DOI: 10.1002/pmic.201900345] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2019] [Revised: 04/29/2020] [Indexed: 01/27/2023]
Affiliation(s)
- Rui Xu
- State Key Laboratory of Proteomics Beijing Proteome Research Center National Center for Protein Sciences (Beijing) Beijing Institute of Lifeomics Beijing 102206 China
- Chongqing Key Laboratory on Big Data for Bio Intelligence Chongqing University of Posts and Telecommunications Chongqing 400065 China
| | - Jie Sheng
- State Key Laboratory of Proteomics Beijing Proteome Research Center National Center for Protein Sciences (Beijing) Beijing Institute of Lifeomics Beijing 102206 China
- Chongqing Key Laboratory on Big Data for Bio Intelligence Chongqing University of Posts and Telecommunications Chongqing 400065 China
| | - Mingze Bai
- Chongqing Key Laboratory on Big Data for Bio Intelligence Chongqing University of Posts and Telecommunications Chongqing 400065 China
| | - Kunxian Shu
- Chongqing Key Laboratory on Big Data for Bio Intelligence Chongqing University of Posts and Telecommunications Chongqing 400065 China
| | - Yunping Zhu
- State Key Laboratory of Proteomics Beijing Proteome Research Center National Center for Protein Sciences (Beijing) Beijing Institute of Lifeomics Beijing 102206 China
| | - Cheng Chang
- State Key Laboratory of Proteomics Beijing Proteome Research Center National Center for Protein Sciences (Beijing) Beijing Institute of Lifeomics Beijing 102206 China
| |
Collapse
|