1
|
Hwang H, Im JE, Yang Y, Kim H, Kwon KH, Kim YH, Kim JY, Yoo JS. Bioinformatic Prediction of Gene Ontology Terms of Uncharacterized Proteins from Chromosome 11. J Proteome Res 2020; 19:4907-4912. [PMID: 33089979 DOI: 10.1021/acs.jproteome.0c00482] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023]
Abstract
In chromosome 11, 71 out of its 1254 proteins remain functionally uncharacterized on the basis of their existence evidence (uPE1s) following the latest version of neXtProt (release 2020-01-17). Because in vivo and in vitro experimental strategies are often time-consuming and labor-intensive, there is a need for a bioinformatics tool to predict the function annotation. Here, we used I-TASSER/COFACTOR provided on the neXtProt web site, which predicts gene ontology (GO) terms based on the 3D structure of the protein. I-TASSER/COFACTOR predicted 2413 GO terms with a benchmark dataset of the 22 proteins belonging to PE1 of chromosome 11. In this study, we developed a filtering algorithm in order to select specific GO terms using the GO map generated by I-TASSER/COFACTOR. As a result, 187 specific GO terms showed a higher average precision-recall score at the least cellular component term compared to 2413 predicted GO terms. Next, we applied 65 proteins belonging to uPE1s of chromosome 11, and then 409 out of 6684 GO terms survived, where 103 and 142 GO terms of molecular function and biological process, respectively, were included. Representatively, the cellular component GO terms of CCDC90B, C11orf52, and the SMAP were predicted and validated using the overexpression system into 293T cells and immunofluorescence staining. We will further study their biological and molecular functions toward the goal of the neXt-CP50 project as a part of C-HPP. We shared all results and programs in Github (https://github.com/heeyounh/I-TASSER-COFACTOR-filtering.git).
Collapse
Affiliation(s)
- Heeyoun Hwang
- Research Center for Bioconvergence Analysis, Korea Basic Science Institute, Cheongju 28119, Republic of Korea
| | - Ji Eun Im
- Division of Convergence Technology, Research Institute of National Cancer Center, Goyang 10408, Republic of Korea
| | - Yeji Yang
- Research Center for Bioconvergence Analysis, Korea Basic Science Institute, Cheongju 28119, Republic of Korea
| | - Hyejin Kim
- Research Center for Bioconvergence Analysis, Korea Basic Science Institute, Cheongju 28119, Republic of Korea.,Graduate School of Analytical Science and Technology, Chungnam National University, Daejeon 34134, Republic of Korea
| | - Kyung-Hoon Kwon
- Research Center for Bioconvergence Analysis, Korea Basic Science Institute, Cheongju 28119, Republic of Korea
| | - Yun-Hee Kim
- Division of Convergence Technology, Research Institute of National Cancer Center, Goyang 10408, Republic of Korea.,Department of Cancer Biomedical Science, The National Cancer Center Graduate School of Cancer Science and Policy, Goyang 10408, Republic of Korea
| | - Jin Young Kim
- Research Center for Bioconvergence Analysis, Korea Basic Science Institute, Cheongju 28119, Republic of Korea
| | - Jong Shin Yoo
- Research Center for Bioconvergence Analysis, Korea Basic Science Institute, Cheongju 28119, Republic of Korea.,Graduate School of Analytical Science and Technology, Chungnam National University, Daejeon 34134, Republic of Korea
| |
Collapse
|
2
|
Wu S, Sun J, Wang X, Xu F, Chi H, Li Y, Zhong B, Xie Y, Yan Z, Chang L, Wang D, He F, Wu J, Zhang Y, Xu P. Open-pFind Verified Four Missing Proteins from Multi-Tissues. J Proteome Res 2020; 19:4808-4814. [PMID: 33172275 DOI: 10.1021/acs.jproteome.0c00370] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
The Chromosome-Centric Human Proteome Project (C-HPP) was launched in 2012 to perfect the annotation of human protein existence by identifying stronger evidence of the expression of missing proteins (MPs) at the protein level. After an 8 year effort all over the world, the number of MPs in the neXtProt database significantly decreased from 5511 (2012-02-24) to 1899 (2020-01-17). It is now more difficult to provide confident evidence of the remaining MPs because of their specific characteristics, including low abundance, low molecular weight, unexpected modifications, transmembrane structure, tissue-expression specificity, and so on. A higher resolution mass spectrometry (MS) interpretation engine might provide an opportunity to identify these buried MPs in complex samples by the combination with multi-tissue large-scale proteomics. In this study, open-pFind was used to dig MPs from 20 pairs of healthy human tissues by Wang et al. ( Mol. Syst. Biol. 2019, 15 (2), e8503) combined with our large-scale testis data set digested by three enzymes (Glu-C, Lys-C, and trypsin) with specificity for different amino acid residues ( J. Proteme Res. 2019, 18 (12), 4189-4196). A total of 1 535 536 peptides with 17 283 477 peptide-spectrum matches (PSMs) were mapped to 14 279 protein entries at a false discovery rate of <1% at the PSM, peptide, and protein levels. A total of 103 MP candidates were identified, among which 86 candidates had more unique peptide numbers compared with our single testis tissue. After rigorous screening, manual checks, peptide synthesis, and matching with documented peptides from PeptideAtlas, we validated four MPs, P0C7T8 (duodenum and small intestine), Q8WWZ4 (stomach and rectum), Q8IV35 (fallopian tube), and O14921 (tonsil), at the protein level. All MS raw files have been deposited to the ProteomeXchange with identifier PXD021391.
Collapse
Affiliation(s)
- Shujia Wu
- School of Basic Medical Science, Key Laboratory of Combinational Biosynthesis and Drug Discovery of Ministry of Education, School of Basic Medical Sciences, Wuhan University, Wuhan 430072, China.,State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Research Unit of Proteomics & Research and Development of New Drug of Chinese Academy of Medical Sciences, Institute of Lifeomics, Beijing 102206, China
| | - Jinshuai Sun
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Research Unit of Proteomics & Research and Development of New Drug of Chinese Academy of Medical Sciences, Institute of Lifeomics, Beijing 102206, China.,Hebei Province Key Lab of Research and Application on Microbial Diversity, College of Life Sciences, Hebei University, Baoding, Hebei 071002, China
| | - Xi Wang
- Key Laboratory of Intelligent Information Processing of Chinese Academy of Sciences (CAS), Institute of Computing Technology, Beijing 100080, China
| | - Feng Xu
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Research Unit of Proteomics & Research and Development of New Drug of Chinese Academy of Medical Sciences, Institute of Lifeomics, Beijing 102206, China
| | - Hao Chi
- Key Laboratory of Intelligent Information Processing of Chinese Academy of Sciences (CAS), Institute of Computing Technology, Beijing 100080, China
| | - Yanchang Li
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Research Unit of Proteomics & Research and Development of New Drug of Chinese Academy of Medical Sciences, Institute of Lifeomics, Beijing 102206, China
| | - Bowen Zhong
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Research Unit of Proteomics & Research and Development of New Drug of Chinese Academy of Medical Sciences, Institute of Lifeomics, Beijing 102206, China
| | - Yuping Xie
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Research Unit of Proteomics & Research and Development of New Drug of Chinese Academy of Medical Sciences, Institute of Lifeomics, Beijing 102206, China
| | - Zhonghua Yan
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Research Unit of Proteomics & Research and Development of New Drug of Chinese Academy of Medical Sciences, Institute of Lifeomics, Beijing 102206, China
| | - Lei Chang
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Research Unit of Proteomics & Research and Development of New Drug of Chinese Academy of Medical Sciences, Institute of Lifeomics, Beijing 102206, China
| | - Dongxue Wang
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Research Unit of Proteomics & Research and Development of New Drug of Chinese Academy of Medical Sciences, Institute of Lifeomics, Beijing 102206, China
| | - Fuchu He
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Research Unit of Proteomics & Research and Development of New Drug of Chinese Academy of Medical Sciences, Institute of Lifeomics, Beijing 102206, China
| | - Junzhu Wu
- School of Basic Medical Science, Key Laboratory of Combinational Biosynthesis and Drug Discovery of Ministry of Education, School of Basic Medical Sciences, Wuhan University, Wuhan 430072, China
| | - Yao Zhang
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Research Unit of Proteomics & Research and Development of New Drug of Chinese Academy of Medical Sciences, Institute of Lifeomics, Beijing 102206, China
| | - Ping Xu
- School of Basic Medical Science, Key Laboratory of Combinational Biosynthesis and Drug Discovery of Ministry of Education, School of Basic Medical Sciences, Wuhan University, Wuhan 430072, China.,State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Research Unit of Proteomics & Research and Development of New Drug of Chinese Academy of Medical Sciences, Institute of Lifeomics, Beijing 102206, China.,Hebei Province Key Lab of Research and Application on Microbial Diversity, College of Life Sciences, Hebei University, Baoding, Hebei 071002, China
| |
Collapse
|
3
|
Sun J, Shi J, Wang Y, Chen Y, Li Y, Kong D, Chang L, Liu F, Lv Z, Zhou Y, He F, Zhang Y, Xu P. Multiproteases Combined with High-pH Reverse-Phase Separation Strategy Verified Fourteen Missing Proteins in Human Testis Tissue. J Proteome Res 2018; 17:4171-4177. [PMID: 30280576 DOI: 10.1021/acs.jproteome.8b00397] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Subsequent to conducting the Chromosome-Centric Human Proteome Project, we have focused on human testis-enriched missing proteins (MPs) since 2015. For protein coverage to be enhanced, a multiprotease strategy was used for separation of samples by 10% SDS-PAGE. For the separating efficiency to be improved, a high-pH reverse phase (RP) separation strategy was applied to fractionate complex samples in this study. A total of 11,558 proteins was identified, which is the largest proteome data set for single human tissue sample so far. On the basis of this large-scale data set, we verified 14 MPs (PE2) in neXtProt (2018-01) after spectrum quality analysis, isobaric post-translational modification, and single amino acid variant filtering, and synthesized peptide matching. Tissue expression analysis showed that 3 of 14 MPs were testis-specific proteins. Functional analysis showed that 10 of 14 MPs were closely related to liver tumor, liver carcinoma, and hepatocellular carcinoma. Another 100 MPs were listed as candidates but required additional verification information. All MS data sets have been deposited into the ProteomeXchange with the identifier PXD009737.
Collapse
Affiliation(s)
- Jinshuai Sun
- Hebei Province Key Lab of Research and Application on Microbial Diversity, College of Life Sciences , Hebei University , Baoding , Hebei 071002 , China.,State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing) , Beijing Institute of Lifeomics , Beijing 102206 , China
| | - Jiahui Shi
- Hebei Province Key Lab of Research and Application on Microbial Diversity, College of Life Sciences , Hebei University , Baoding , Hebei 071002 , China.,State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing) , Beijing Institute of Lifeomics , Beijing 102206 , China
| | - Yihao Wang
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing) , Beijing Institute of Lifeomics , Beijing 102206 , China
| | - Yang Chen
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing) , Beijing Institute of Lifeomics , Beijing 102206 , China
| | - Yanchang Li
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing) , Beijing Institute of Lifeomics , Beijing 102206 , China
| | - Degang Kong
- Department of Hepatopancreatobiliary Surgery , The Second Affiliated Hospital of Tianjin Medical University , Tianjin 300211 , China
| | - Lei Chang
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing) , Beijing Institute of Lifeomics , Beijing 102206 , China
| | - Fengsong Liu
- Hebei Province Key Lab of Research and Application on Microbial Diversity, College of Life Sciences , Hebei University , Baoding , Hebei 071002 , China
| | - Zhitang Lv
- Hebei Province Key Lab of Research and Application on Microbial Diversity, College of Life Sciences , Hebei University , Baoding , Hebei 071002 , China
| | - Yue Zhou
- Demo Laboratory of Thermofisher Scientific China , Shanghai 200120 , China
| | - Fuchu He
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing) , Beijing Institute of Lifeomics , Beijing 102206 , China
| | - Yao Zhang
- State Key Laboratory of Biocontrol, Guangdong Key Laboratory of Plant Resources, School of Life Sciences , Sun Yat-Sen University , Guangzhou 510275 , China
| | - Ping Xu
- Hebei Province Key Lab of Research and Application on Microbial Diversity, College of Life Sciences , Hebei University , Baoding , Hebei 071002 , China.,State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing) , Beijing Institute of Lifeomics , Beijing 102206 , China.,Key Laboratory of Combinational Biosynthesis and Drug Discovery (Wuhan University), Ministry of Education, School of Pharmaceutical Science , Wuhan University , Wuhan 430072 , China
| |
Collapse
|
4
|
Abstract
INTRODUCTION The mission of the Chromosome-Centric Human Proteome Project (C-HPP), is to map and annotate the entire predicted human protein set (~20,000 proteins) encoded by each chromosome. The initial steps of the project are focused on 'missing proteins (MPs)', which lacked documented evidence for existence at protein level. In addition to remaining 2,579 MPs, we also target those annotated proteins having unknown functions, uPE1 proteins, alternative splice isoforms and post-translational modifications. We also consider how to investigate various protein functions involved in cis-regulatory phenomena, amplicons lncRNAs and smORFs. Areas covered: We will cover the scope, historic background, progress, challenges and future prospects of C-HPP. This review also addresses the question of how we can best improve the methodological approaches, select the optimal biological samples, and recommend stringent protocols for the identification and characterization of MPs. A new strategy for functional analysis of some of those annotated proteins having unknown function will also be discussed. Expert commentary: If the project moves well by reshaping the original goals, the current working modules and team work in the proposed extended planning period, it is anticipated that a progressively more detailed draft of an accurate chromosome-based proteome map will become available with functional information.
Collapse
Affiliation(s)
- Young-Ki Paik
- a Yonsei Proteome Research Center and Department of Biochemistry , Yonsei University , Seoul , Korea
| | - Gilbert S Omenn
- b Department of Computational Medicine & Bioinformatics , University of Michigan , Ann Arbor , MI , USA
| | - William S Hancock
- c Department of Chemical Biology , Northeastern University , Boston , Massachusetts 02115 , USA
| | - Lydie Lane
- d Department of Human Protein Sciences, Faculty of Medicine , University of Geneva , Geneva , Switzerland.,e Swiss Institute of Bioinformatics , Geneva , Switzerland
| | - Christopher M Overall
- f Centre for Blood Research, Departments of Oral Biological & Medical Sciences, and Biochemistry & Molecular Biology, Faculty of Dentistry , University of British Columbia , Vancouver , Canada
| |
Collapse
|
5
|
Wang Y, Chen Y, Zhang Y, Wei W, Li Y, Zhang T, He F, Gao Y, Xu P. Multi-Protease Strategy Identifies Three PE2 Missing Proteins in Human Testis Tissue. J Proteome Res 2017; 16:4352-4363. [PMID: 28959888 DOI: 10.1021/acs.jproteome.7b00340] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Although 5 years of the missing proteins (MPs) study have been completed, searching for MPs remains one of the core missions of the Chromosome-Centric Human Proteome Project (C-HPP). Following the next-50-MPs challenge of the C-HPP, we have focused on the testis-enriched MPs by various strategies since 2015. On the basis of the theoretical analysis of MPs (2017-01, neXtProt) using multiprotease digestion, we found that nonconventional proteases (e.g. LysargiNase, GluC) could improve the peptide diversity and sequence coverage compared with Trypsin. Therefore, a multiprotease strategy was used for searching more MPs in the same human testis tissues separated by 10% SDS-PAGE, followed by high resolution LC-MS/MS system (Q Exactive HF). A total of 7838 proteins were identified. Among them, three PE2 MPs in neXtProt 2017-01 have been identified: beta-defensin 123 ( Q8N688 , chr 20q), cancer/testis antigen family 45 member A10 ( P0DMU9 , chr Xq), and Histone H2A-Bbd type 2/3 ( P0C5Z0 , chr Xq). However, because only one unique peptide of ≥9 AA was identified in beta-defensin 123 and Histone H2A-Bbd type 2/3, respectively, further analysis indicates that each falls under the exceptions clause of the HPP Guidelines v2.1. After a spectrum quality check, isobaric PTM and single amino acid variant (SAAV) filtering, and verification with a synthesized peptide, and based on overlapping peptides from different proteases, these three MPs should be considered as exemplary examples of MPs found by exceptional criteria. Other MPs were considered as candidates but need further validation. All MS data sets have been deposited to the ProteomeXchange with identifier PXD006465.
Collapse
Affiliation(s)
- Yihao Wang
- State Key Laboratory of Proteomics, National Center for Protein Sciences Beijing, Beijing Proteome Research Center, Beijing Institute of Radiation Medicine , Beijing 102206, China.,Department of Pharmacology and Toxicology, Beijing Institute of Radiation Medicine , Beijing 100850, China
| | - Yang Chen
- State Key Laboratory of Proteomics, National Center for Protein Sciences Beijing, Beijing Proteome Research Center, Beijing Institute of Radiation Medicine , Beijing 102206, China
| | - Yao Zhang
- State Key Laboratory of Biocontrol and Guangdong Provincial Key Laboratory of Plant Resources, College of Ecology and Evolution, Sun Yat-Sen University , Guangzhou 510275, China
| | - Wei Wei
- State Key Laboratory of Proteomics, National Center for Protein Sciences Beijing, Beijing Proteome Research Center, Beijing Institute of Radiation Medicine , Beijing 102206, China
| | - Yanchang Li
- State Key Laboratory of Proteomics, National Center for Protein Sciences Beijing, Beijing Proteome Research Center, Beijing Institute of Radiation Medicine , Beijing 102206, China
| | - Tao Zhang
- State Key Laboratory of Proteomics, National Center for Protein Sciences Beijing, Beijing Proteome Research Center, Beijing Institute of Radiation Medicine , Beijing 102206, China
| | - Fuchu He
- State Key Laboratory of Proteomics, National Center for Protein Sciences Beijing, Beijing Proteome Research Center, Beijing Institute of Radiation Medicine , Beijing 102206, China
| | - Yue Gao
- Department of Pharmacology and Toxicology, Beijing Institute of Radiation Medicine , Beijing 100850, China
| | - Ping Xu
- State Key Laboratory of Proteomics, National Center for Protein Sciences Beijing, Beijing Proteome Research Center, Beijing Institute of Radiation Medicine , Beijing 102206, China.,Key Laboratory of Combinatorial Biosynthesis and Drug Discovery of Ministry of Education, School of Pharmaceutical Sciences, Wuhan University , Wuhan 430072, China.,Graduate School, Anhui Medical University , Hefei 230032, China.,Tianjin Baodi Hospital , Tianjin 301800, China
| |
Collapse
|
6
|
Peng X, Xu F, Liu S, Li S, Huang Q, Chang L, Wang L, Ma X, He F, Xu P. Identification of Missing Proteins in the Phosphoproteome of Kidney Cancer. J Proteome Res 2017; 16:4364-4373. [PMID: 28857561 DOI: 10.1021/acs.jproteome.7b00332] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023]
Abstract
Identifying missing proteins (MPs) has been one of the critical missions of the Chromosome-Centric Human Proteome Project (C-HPP). Since 2012, over 30 research teams from 17 countries have been trying to search adequate and accurate evidence of MPs through various biochemical strategies. MPs mainly fall into the following classes: (1) low-molecular-weight (LMW) proteins, (2) membrane proteins, (3) proteins that contained various post-translational modifications (PTMs), (4) nucleic acid-associated proteins, (5) low abundance, and (6) unexpressed genes. In this study, kidney cancer and adjacent tissues were used for phosphoproteomics research, and 8962 proteins were identified, including 6415 phosphoproteins, and 44 728 phosphosites, of which 10 266 were unreported previously. In total, 75 candidate detections were found, including 45 phoshoproteins. GO analysis for these 75 candidate detections revealed that these proteins mainly clustered as membrane proteins and took part in nephron and kidney development. After rigorous screening and manual check, 9 of them were verified with the synthesized peptides. Finally, only one missing protein was confirmed. All mass spectrometry data from this study have been deposited in the PRIDE with identifier PXD006482.
Collapse
Affiliation(s)
- Xuehui Peng
- Key Laboratory of Combinatorial Biosynthesis and Drug Discovery of the Ministry of Education, School of Pharmaceutical Sciences, Wuhan University , Wuhan 430072, China.,State Key Laboratory of Proteomics, National Center for Protein Sciences Beijing, Beijing Proteome Research Center, National Engineering Research Center for Protein Drugs, Beijing Institute of Radiation Medicine , Beijing 102206, China
| | - Feng Xu
- State Key Laboratory of Proteomics, National Center for Protein Sciences Beijing, Beijing Proteome Research Center, National Engineering Research Center for Protein Drugs, Beijing Institute of Radiation Medicine , Beijing 102206, China
| | - Shu Liu
- State Key Laboratory of Proteomics, National Center for Protein Sciences Beijing, Beijing Proteome Research Center, National Engineering Research Center for Protein Drugs, Beijing Institute of Radiation Medicine , Beijing 102206, China
| | - Suzhen Li
- Graduate School, Anhui Medical University , Hefei 230032, China
| | - Qingbo Huang
- Chinese PLA General Hospital , Urology Department, Beijing 100853, China
| | - Lei Chang
- State Key Laboratory of Proteomics, National Center for Protein Sciences Beijing, Beijing Proteome Research Center, National Engineering Research Center for Protein Drugs, Beijing Institute of Radiation Medicine , Beijing 102206, China
| | - Lei Wang
- Chinese PLA General Hospital , Urology Department, Beijing 100853, China
| | - Xin Ma
- Chinese PLA General Hospital , Urology Department, Beijing 100853, China
| | - Fuchu He
- State Key Laboratory of Proteomics, National Center for Protein Sciences Beijing, Beijing Proteome Research Center, National Engineering Research Center for Protein Drugs, Beijing Institute of Radiation Medicine , Beijing 102206, China
| | - Ping Xu
- Key Laboratory of Combinatorial Biosynthesis and Drug Discovery of the Ministry of Education, School of Pharmaceutical Sciences, Wuhan University , Wuhan 430072, China.,State Key Laboratory of Proteomics, National Center for Protein Sciences Beijing, Beijing Proteome Research Center, National Engineering Research Center for Protein Drugs, Beijing Institute of Radiation Medicine , Beijing 102206, China.,Graduate School, Anhui Medical University , Hefei 230032, China
| |
Collapse
|
7
|
Wei W, Luo W, Wu F, Peng X, Zhang Y, Zhang M, Zhao Y, Su N, Qi Y, Chen L, Zhang Y, Wen B, He F, Xu P. Deep Coverage Proteomics Identifies More Low-Abundance Missing Proteins in Human Testis Tissue with Q-Exactive HF Mass Spectrometer. J Proteome Res 2016; 15:3988-3997. [PMID: 27535590 DOI: 10.1021/acs.jproteome.6b00390] [Citation(s) in RCA: 35] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Since 2012, missing proteins (MPs) investigation has been one of the critical missions of Chromosome-Centric Human Proteome Project (C-HPP) through various biochemical strategies. On the basis of our previous testis MPs study, faster scanning and higher resolution mass-spectrometry-based proteomics might be conducive to MPs exploration, especially for low-abundance proteins. In this study, Q-Exactive HF (HF) was used to survey proteins from the same testis tissues separated by two separating methods (tricine- and glycine-SDS-PAGE), as previously described. A total of 8526 proteins were identified, of which more low-abundance proteins were uniquely detected in HF data but not in our previous LTQ Orbitrap Velos (Velos) reanalysis data. Further transcriptomics analysis showed that these uniquely identified proteins by HF also had lower expression at the mRNA level. Of the 81 total identified MPs, 74 and 39 proteins were listed as MPs in HF and Velos data sets, respectively. Among the above MPs, 47 proteins (43 neXtProt PE2 and 4 PE3) were ranked as confirmed MPs after verifying with the stringent spectra match and isobaric and single amino acid variants filtering. Functional investigation of these 47 MPs revealed that 11 MPs were testis-specific proteins and 7 MPs were involved in spermatogenesis process. Therefore, we concluded that higher scanning speed and resolution of HF might be factors for improving the low-abundance MP identification in future C-HPP studies. All mass-spectrometry data from this study have been deposited in the ProteomeXchange with identifier PXD004092.
Collapse
Affiliation(s)
- Wei Wei
- State Key Laboratory of Proteomics, National Center for Protein Sciences Beijing, Beijing Proteome Research Center, National Engineering Research Center for Protein Drugs, Beijing Institute of Radiation Medicine , Beijing 102206, China
| | - Weijia Luo
- Graduate School, Anhui Medical University , Hefei 230032, China
| | - Feilin Wu
- State Key Laboratory of Proteomics, National Center for Protein Sciences Beijing, Beijing Proteome Research Center, National Engineering Research Center for Protein Drugs, Beijing Institute of Radiation Medicine , Beijing 102206, China.,Life Science College, Southwest Forestry University , Kunming, 650224, China
| | - Xuehui Peng
- State Key Laboratory of Proteomics, National Center for Protein Sciences Beijing, Beijing Proteome Research Center, National Engineering Research Center for Protein Drugs, Beijing Institute of Radiation Medicine , Beijing 102206, China.,Key Laboratory of Combinatorial Biosynthesis and Drug Discovery of the Ministry of Education, School of Pharmaceutical Sciences, Wuhan University , Wuhan 430072, China
| | - Yao Zhang
- State Key Laboratory of Proteomics, National Center for Protein Sciences Beijing, Beijing Proteome Research Center, National Engineering Research Center for Protein Drugs, Beijing Institute of Radiation Medicine , Beijing 102206, China.,Institute of Microbiology , Chinese Academy of Science, Beijing 100101, China
| | - Manli Zhang
- State Key Laboratory of Proteomics, National Center for Protein Sciences Beijing, Beijing Proteome Research Center, National Engineering Research Center for Protein Drugs, Beijing Institute of Radiation Medicine , Beijing 102206, China
| | - Yan Zhao
- State Key Laboratory of Proteomics, National Center for Protein Sciences Beijing, Beijing Proteome Research Center, National Engineering Research Center for Protein Drugs, Beijing Institute of Radiation Medicine , Beijing 102206, China
| | - Na Su
- State Key Laboratory of Proteomics, National Center for Protein Sciences Beijing, Beijing Proteome Research Center, National Engineering Research Center for Protein Drugs, Beijing Institute of Radiation Medicine , Beijing 102206, China
| | - YingZi Qi
- State Key Laboratory of Proteomics, National Center for Protein Sciences Beijing, Beijing Proteome Research Center, National Engineering Research Center for Protein Drugs, Beijing Institute of Radiation Medicine , Beijing 102206, China
| | - Lingsheng Chen
- State Key Laboratory of Proteomics, National Center for Protein Sciences Beijing, Beijing Proteome Research Center, National Engineering Research Center for Protein Drugs, Beijing Institute of Radiation Medicine , Beijing 102206, China
| | - Yangjun Zhang
- State Key Laboratory of Proteomics, National Center for Protein Sciences Beijing, Beijing Proteome Research Center, National Engineering Research Center for Protein Drugs, Beijing Institute of Radiation Medicine , Beijing 102206, China
| | - Bo Wen
- BGI-Shenzhen , Shenzhen 518083, China
| | - Fuchu He
- State Key Laboratory of Proteomics, National Center for Protein Sciences Beijing, Beijing Proteome Research Center, National Engineering Research Center for Protein Drugs, Beijing Institute of Radiation Medicine , Beijing 102206, China
| | - Ping Xu
- State Key Laboratory of Proteomics, National Center for Protein Sciences Beijing, Beijing Proteome Research Center, National Engineering Research Center for Protein Drugs, Beijing Institute of Radiation Medicine , Beijing 102206, China.,Graduate School, Anhui Medical University , Hefei 230032, China.,Key Laboratory of Combinatorial Biosynthesis and Drug Discovery of the Ministry of Education, School of Pharmaceutical Sciences, Wuhan University , Wuhan 430072, China
| |
Collapse
|
8
|
Kong N, Zhou Y, Xu S, Deng Y, Fan Y, Zhang Y, Ren Z, Lin L, Ren Y, Wang Q, Zi J, Wen B, Liu S. Assessing Transcription Regulatory Elements To Evaluate the Expression Status of Missing Protein Genes on Chromosomes 11 and 19. J Proteome Res 2015; 14:4967-75. [PMID: 26456862 DOI: 10.1021/acs.jproteome.5b00567] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
During an investigation of missing proteins with the RNA-seq data acquired from three liver cancer cell lines, the majority of the missing protein coding genes (MPGs) located at chromosome 11 (chr11) had no corresponding mRNAs, while a high percentage of the MPGs on chr19 were detected at the mRNA level. The phenomenon, which was also observed in more than 40 cell lines, led to an inquiry of causation of the different transcriptional statuses of the MPGs in the two chromosomes. We hypothesized that the special chromatin structure was a key element to regulate MPG transcription. Upon a systematical comparison of the effects of DNase I hypersensitive sites (DHSs), transcription factors (TFs), and histone modifications toward these genes or MPGs with/without mRNA evidence in chr11 and 19, we attributed the poor transcription of the MPGs to the weak capacity of these transcription regulatory elements, regardless of which chromosome the MPGs were located. We further analyzed the gene contents in chr11 and found a number of genes related to sensory functions in the presence of chr11. We postulate that a high number of sensory-related genes, which are located within special chromatin structure, could bring a low detection rate of MPGs in chr11.
Collapse
Affiliation(s)
- Nannan Kong
- CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences , No. 1 BeiChen West Road, Beijing 100101, China.,BGI-Shenzhen , 11 Build, Beishan Industrial Zone, Yantian District, Shenzhen 518083, China.,Graduate University of the Chinese Academy of Sciences , 19A, Yuquan Road, Beijing 100049, China
| | - Yang Zhou
- CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences , No. 1 BeiChen West Road, Beijing 100101, China.,BGI-Shenzhen , 11 Build, Beishan Industrial Zone, Yantian District, Shenzhen 518083, China.,Graduate University of the Chinese Academy of Sciences , 19A, Yuquan Road, Beijing 100049, China
| | - Shaohang Xu
- BGI-Shenzhen , 11 Build, Beishan Industrial Zone, Yantian District, Shenzhen 518083, China
| | - Yamei Deng
- CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences , No. 1 BeiChen West Road, Beijing 100101, China.,BGI-Shenzhen , 11 Build, Beishan Industrial Zone, Yantian District, Shenzhen 518083, China.,Graduate University of the Chinese Academy of Sciences , 19A, Yuquan Road, Beijing 100049, China
| | - Yang Fan
- CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences , No. 1 BeiChen West Road, Beijing 100101, China.,BGI-Shenzhen , 11 Build, Beishan Industrial Zone, Yantian District, Shenzhen 518083, China.,Graduate University of the Chinese Academy of Sciences , 19A, Yuquan Road, Beijing 100049, China
| | - Yue Zhang
- CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences , No. 1 BeiChen West Road, Beijing 100101, China.,BGI-Shenzhen , 11 Build, Beishan Industrial Zone, Yantian District, Shenzhen 518083, China.,Graduate University of the Chinese Academy of Sciences , 19A, Yuquan Road, Beijing 100049, China
| | - Zhe Ren
- BGI-Shenzhen , 11 Build, Beishan Industrial Zone, Yantian District, Shenzhen 518083, China
| | - Liang Lin
- BGI-Shenzhen , 11 Build, Beishan Industrial Zone, Yantian District, Shenzhen 518083, China
| | - Yan Ren
- BGI-Shenzhen , 11 Build, Beishan Industrial Zone, Yantian District, Shenzhen 518083, China
| | - Quanhui Wang
- CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences , No. 1 BeiChen West Road, Beijing 100101, China.,BGI-Shenzhen , 11 Build, Beishan Industrial Zone, Yantian District, Shenzhen 518083, China.,Graduate University of the Chinese Academy of Sciences , 19A, Yuquan Road, Beijing 100049, China
| | - Jin Zi
- BGI-Shenzhen , 11 Build, Beishan Industrial Zone, Yantian District, Shenzhen 518083, China
| | - Bo Wen
- BGI-Shenzhen , 11 Build, Beishan Industrial Zone, Yantian District, Shenzhen 518083, China
| | - Siqi Liu
- CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences , No. 1 BeiChen West Road, Beijing 100101, China.,BGI-Shenzhen , 11 Build, Beishan Industrial Zone, Yantian District, Shenzhen 518083, China
| |
Collapse
|
9
|
Jeong SK, Hancock WS, Paik YK. GenomewidePDB 2.0: A Newly Upgraded Versatile Proteogenomic Database for the Chromosome-Centric Human Proteome Project. J Proteome Res 2015; 14:3710-9. [PMID: 26272709 DOI: 10.1021/acs.jproteome.5b00541] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/23/2023]
Abstract
Since the launch of the Chromosome-centric Human Proteome Project (C-HPP) in 2012, the number of "missing" proteins has fallen to 2932, down from ∼5932 since the number was first counted in 2011. We compared the characteristics of missing proteins with those of already annotated proteins with respect to transcriptional expression pattern and the time periods in which newly identified proteins were annotated. We learned that missing proteins commonly exhibit lower levels of transcriptional expression and less tissue-specific expression compared with already annotated proteins. This makes it more difficult to identify missing proteins as time goes on. One of the C-HPP goals is to identify alternative spliced product of proteins (ASPs), which are usually difficult to find by shot-gun proteomic methods due to their sequence similarities with the representative proteins. To resolve this problem, it may be necessary to use a targeted proteomics approach (e.g., selected and multiple reaction monitoring [S/MRM] assays) and an innovative bioinformatics platform that enables the selection of target peptides for rarely expressed missing proteins or ASPs. Given that the success of efforts to identify missing proteins may rely on more informative public databases, it was necessary to upgrade the available integrative databases. To this end, we attempted to improve the features and utility of GenomewidePDB by integrating transcriptomic information (e.g., alternatively spliced transcripts), annotated peptide information, and an advanced search interface that can find proteins of interest when applying a targeted proteomics strategy. This upgraded version of the database, GenomewidePDB 2.0, may not only expedite identification of the remaining missing proteins but also enhance the exchange of information among the proteome community. GenomewidePDB 2.0 is available publicly at http://genomewidepdb.proteomix.org/.
Collapse
Affiliation(s)
- Seul-Ki Jeong
- Yonsei Proteome Research Center and Biomedical Proteome Research Center , 50 Yonsei-Ro, Seodaemun-gu, Seoul 120-749, Korea
| | - William S Hancock
- Barnett Institute and Department of Chemistry and Chemical Biology, Northeastern University , 12 Oxford Street, Boston, Massachusetts 02115, United States
| | - Young-Ki Paik
- Yonsei Proteome Research Center and Biomedical Proteome Research Center , 50 Yonsei-Ro, Seodaemun-gu, Seoul 120-749, Korea.,Department of Biochemistry, Department of Integrated Omics for Biomedical Science (World Class University Graduate Program), Yonsei University , 50 Yonsei-Ro, Sudaemoon-ku, Seoul 120-749, Korea
| |
Collapse
|
10
|
Horvatovich P, Végvári Á, Saul J, Park JG, Qiu J, Syring M, Pirrotte P, Petritis K, Tegeler TJ, Aziz M, Fuentes M, Diez P, Gonzalez-Gonzalez M, Ibarrola N, Droste C, De Las Rivas J, Gil C, Clemente F, Hernaez ML, Corrales FJ, Nilsson CL, Berven FS, Bischoff R, Fehniger TE, LaBaer J, Marko-Varga G. In Vitro Transcription/Translation System: A Versatile Tool in the Search for Missing Proteins. J Proteome Res 2015; 14:3441-51. [PMID: 26155874 DOI: 10.1021/acs.jproteome.5b00486] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]
Abstract
Approximately 18% of all human genes purported to encode proteins have not been directly evidenced at the protein level, according to the validation criteria established by neXtProt, and are considered to be "missing" proteins. One of the goals of the Chromosome-Centric Human Proteome Project (C-HPP) is to identify as many of these missing proteins as possible in human samples using mass spectrometry-based methods. To further this goal, a consortium of C-HPP teams (chromosomes 5, 10, 16, and 19) has joined forces to devise new strategies to identify missing proteins by use of a cell-free in vitro transcription/translation system (IVTT). The proposed strategy employs LC-MS/MS data-dependent acquisition (DDA) and targeted selective reaction monitoring (SRM) methods to scrutinize low-complexity samples derived from IVTT. The optimized assays are then applied to identify missing proteins in human cells and tissues. We describe the approach and show proof-of-concept results for development of LC-SRM assays for identification of 18 missing proteins. We believe that the IVTT system, when coupled with downstream mass spectrometric identification, can be applied to identify proteins that have eluded more traditional methods of detection.
Collapse
Affiliation(s)
- Péter Horvatovich
- Analytical Biochemistry, Department of Pharmacy, University of Groningen , A. Deusinglaan 1, 9713 AV Groningen, The Netherlands
| | - Ákos Végvári
- Department of Pharmacology & Toxicology, The University of Texas Medical Branch , 301 University Boulevard, Galveston, Texas 77555-1074, United States
| | - Justin Saul
- Center for Personalized Diagnostics, Biodesign Institute, Arizona State University , Tempe, Arizona 85287, United States
| | - Jin G Park
- Center for Personalized Diagnostics, Biodesign Institute, Arizona State University , Tempe, Arizona 85287, United States
| | - Ji Qiu
- Center for Personalized Diagnostics, Biodesign Institute, Arizona State University , Tempe, Arizona 85287, United States
| | - Michael Syring
- Center for Proteomics, Translational Genomics Research Institute , Phoenix, Arizona 85004, United States
| | - Patrick Pirrotte
- Center for Proteomics, Translational Genomics Research Institute , Phoenix, Arizona 85004, United States
| | - Konstantinos Petritis
- Center for Proteomics, Translational Genomics Research Institute , Phoenix, Arizona 85004, United States.,Pathology Research, Phoenix Children's Hospital , 1919 East Thomas Road, Phoenix, Arizona 85016, United States
| | - Tony J Tegeler
- Center for Proteomics, Translational Genomics Research Institute , Phoenix, Arizona 85004, United States
| | - Meraj Aziz
- Center for Proteomics, Translational Genomics Research Institute , Phoenix, Arizona 85004, United States
| | | | | | | | | | | | | | - Concha Gil
- Department of Microbiology & Proteomics Unit, University Complutense , 28040 Madrid, Spain
| | - Felipe Clemente
- Department of Microbiology & Proteomics Unit, University Complutense , 28040 Madrid, Spain
| | - Maria Luisa Hernaez
- Department of Microbiology & Proteomics Unit, University Complutense , 28040 Madrid, Spain
| | - Fernando J Corrales
- Center for Applied Medical Research (CIMA), University of Navarra, PRB2-ProteoRed-ISCIII, IDISNA, Ciberhed , 31008 Pamplona, Spain
| | - Carol L Nilsson
- Department of Pharmacology & Toxicology, The University of Texas Medical Branch , 301 University Boulevard, Galveston, Texas 77555-1074, United States
| | - Frode S Berven
- Proteomics Unit (PROBE), Department of Biomedicine, University of Bergen , Postbox 7804, N-5009 Bergen, Norway.,The Norwegian Multiple Sclerosis Competence Centre, Department of Neurology, Haukeland University Hospital , Postbox 1400, 5021 Bergen, Norway
| | - Rainer Bischoff
- Analytical Biochemistry, Department of Pharmacy, University of Groningen , A. Deusinglaan 1, 9713 AV Groningen, The Netherlands
| | | | - Joshua LaBaer
- Center for Personalized Diagnostics, Biodesign Institute, Arizona State University , Tempe, Arizona 85287, United States
| | - György Marko-Varga
- First Department of Surgery, Tokyo Medical University , 6-7-1 Nishishinjuku Shinjuku-ku, 160-0023 Tokyo, Japan
| |
Collapse
|
11
|
Lichti CF, Mostovenko E, Wadsworth PA, Lynch GC, Pettitt BM, Sulman EP, Wang Q, Lang FF, Rezeli M, Marko-Varga G, Végvári Á, Nilsson CL. Systematic identification of single amino acid variants in glioma stem-cell-derived chromosome 19 proteins. J Proteome Res 2014; 14:778-86. [PMID: 25399873 PMCID: PMC4324435 DOI: 10.1021/pr500810g] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
![]()
Novel
proteoforms with single amino acid variations represent proteins
that often have altered biological functions but are less explored
in the human proteome. We have developed an approach, searching high
quality shotgun proteomic data against an extended protein database,
to identify expressed mutant proteoforms in glioma stem cell (GSC)
lines. The systematic search of MS/MS spectra using PEAKS 7.0 as the
search engine has recognized 17 chromosome 19 proteins in GSCs with
altered amino acid sequences. The results were further verified by
manual spectral examination, validating 19 proteoforms. One of the
novel findings, a mutant form of branched-chain aminotransferase 2
(p.Thr186Arg), was verified at the transcript level
and by targeted proteomics in several glioma stem cell lines. The
structure of this proteoform was examined by molecular modeling in
order to estimate conformational changes due to mutation that might
lead to functional modifications potentially linked to glioma. Based
on our initial findings, we believe that our approach presented could
contribute to construct a more complete map of the human functional
proteome.
Collapse
Affiliation(s)
- Cheryl F Lichti
- Department of Pharmacology and Toxicology and ‡Biochemistry and Molecular Biology, UTMB Cancer Center, University of Texas Medical Branch , Galveston, Texas 77555, United States
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|