1
|
Liu W, Xiang L, Zheng T, Jin J, Zhang G. TranslatomeDB: a comprehensive database and cloud-based analysis platform for translatome sequencing data. Nucleic Acids Res 2019; 46:D206-D212. [PMID: 29106630 PMCID: PMC5753366 DOI: 10.1093/nar/gkx1034] [Citation(s) in RCA: 30] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2017] [Accepted: 10/17/2017] [Indexed: 01/08/2023] Open
Abstract
Translation is a key regulatory step, linking transcriptome and proteome. Two major methods of translatome investigations are RNC-seq (sequencing of translating mRNA) and Ribo-seq (ribosome profiling). To facilitate the investigation of translation, we built a comprehensive database TranslatomeDB (http://www.translatomedb.net/) which provides collection and integrated analysis of published and user-generated translatome sequencing data. The current version includes 2453 Ribo-seq, 10 RNC-seq and their 1394 corresponding mRNA-seq datasets in 13 species. The database emphasizes the analysis functions in addition to the dataset collections. Differential gene expression (DGE) analysis can be performed between any two datasets of same species and type, both on transcriptome and translatome levels. The translation indices translation ratios, elongation velocity index and translational efficiency can be calculated to quantitatively evaluate translational initiation efficiency and elongation velocity, respectively. All datasets were analyzed using a unified, robust, accurate and experimentally-verifiable pipeline based on the FANSe3 mapping algorithm and edgeR for DGE analyzes. TranslatomeDB also allows users to upload their own datasets and utilize the identical unified pipeline to analyze their data. We believe that our TranslatomeDB is a comprehensive platform and knowledgebase on translatome and proteome research, releasing the biologists from complex searching, analyzing and comparing huge sequencing data without needing local computational power.
Collapse
Affiliation(s)
- Wanting Liu
- Key Laboratory of Functional Protein Research of Guangdong Higher Education Institutes, Institute of Life and Health Engineering, Jinan University, Guangzhou 510632, China
| | | | - Tingkai Zheng
- Key Laboratory of Functional Protein Research of Guangdong Higher Education Institutes, Institute of Life and Health Engineering, Jinan University, Guangzhou 510632, China
| | - Jingjie Jin
- Key Laboratory of Functional Protein Research of Guangdong Higher Education Institutes, Institute of Life and Health Engineering, Jinan University, Guangzhou 510632, China
| | - Gong Zhang
- Key Laboratory of Functional Protein Research of Guangdong Higher Education Institutes, Institute of Life and Health Engineering, Jinan University, Guangzhou 510632, China.,Chi-Biotech Co. Ltd., Shenzhen 518000, China
| |
Collapse
|
2
|
Paik YK, Lane L, Kawamura T, Chen YJ, Cho JY, LaBaer J, Yoo JS, Domont G, Corrales F, Omenn GS, Archakov A, Encarnación-Guevara S, Lui S, Salekdeh GH, Cho JY, Kim CY, Overall CM. Launching the C-HPP neXt-CP50 Pilot Project for Functional Characterization of Identified Proteins with No Known Function. J Proteome Res 2018; 17:4042-4050. [PMID: 30269496 PMCID: PMC6693327 DOI: 10.1021/acs.jproteome.8b00383] [Citation(s) in RCA: 37] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Abstract
An important goal of the Human Proteome Organization (HUPO) Chromosome-centric Human Proteome Project (C-HPP) is to correctly define the number of canonical proteins encoded by their cognate open reading frames on each chromosome in the human genome. When identified with high confidence of protein evidence (PE), such proteins are termed PE1 proteins in the online database resource, neXtProt. However, proteins that have not been identified unequivocally at the protein level but that have other evidence suggestive of their existence (PE2-4) are termed missing proteins (MPs). The number of MPs has been reduced from 5511 in 2012 to 2186 in 2018 (neXtProt 2018-01-17 release). Although the annotation of the human proteome has made significant progress, the "parts list" alone does not inform function. Indeed, 1937 proteins representing ∼10% of the human proteome have no function either annotated from experimental characterization or predicted by homology to other proteins. Specifically, these 1937 "dark proteins" of the so-called dark proteome are composed of 1260 functionally uncharacterized but identified PE1 proteins, designated as uPE1, plus 677 MPs from categories PE2-PE4, which also have no known or predicted function and are termed uMPs. At the HUPO-2017 Annual Meeting, the C-HPP officially adopted the uPE1 pilot initiative, with 14 participating international teams later committing to demonstrate the feasibility of the functional characterization of large numbers of dark proteins (CP), starting first with 50 uPE1 proteins, in a stepwise chromosome-centric organizational manner. The second aim of the feasibility phase to characterize protein (CP) functions of 50 uPE1 proteins, termed the neXt-CP50 initiative, is to utilize a variety of approaches and workflows according to individual team expertise, interest, and resources so as to enable the C-HPP to recommend experimentally proven workflows to the proteome community within 3 years. The results from this pilot will not only be the cornerstone of a larger characterization initiative but also enhance understanding of the human proteome and integrated cellular networks for the discovery of new mechanisms of pathology, mechanistically informative biomarkers, and rational drug targets.
Collapse
Affiliation(s)
- Young-Ki Paik
- Yonsei Proteome Research Center and Department of Integrative Omics, Yonsei University, Sudaemoon-ku, Seoul, Korea
| | - Lydie Lane
- CALIPHO group, Swiss Institute of Bioinformatics & Department of Microbiology and Molecular medicine, Faculty of Medicine, University of Geneva, Geneva, Switzerland
| | - Takeshi Kawamura
- Proteomics Laboratory, Isotope Science Center, The University of Tokyo, Bunkyo-Ku, Tokyo 113-0032 Japan
| | - Yu-Ju Chen
- Institute of Chemistry Academia Sinica, 128 Academia Road Sec. 2, Nankang Taipei 115 Taiwan
| | - Je-Yoel Cho
- Research Institute for Veterinary Science, College of Veterinary Medicine, Seoul University, 1 Gwanak-, Gwanak-gu, 151-742 Seoul, South Korea
| | - Joshua LaBaer
- McAllister Ave. Arizona State University, Tempe, Arizona, 85287-5001, USA
| | - Jong Shin Yoo
- Division of Mass Spectrometry Research, Korea Basic Science Institute, Ochang, Korea
| | - Gilberto Domont
- Federal University of Rio de Janeiro Institute of Chemistry, Rio de Janeiro, RJ Brazil
| | - Fernando Corrales
- Functional Proteomics Laboratory National Center of Biotechnology, CSIC 28049 Madrid, Spain
| | - Gilbert S. Omenn
- Center for Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, Michigan 48109-2218, United States
| | | | | | - Siqi Lui
- BGI-Shenzhen, Beishan Industrial Zone, Yantian District, Shenzhen, 518083, China
| | - Ghasem Hosseini Salekdeh
- Department of Molecular Systems Biology, Royan Institute for Stem Cell Biology and Technology, 1665659911, Tehran, Iran
- Department of Molecular Sciences, Macquarie University, Sydney, Australia
| | - Jin-Young Cho
- Yonsei Proteome Research Center and Department of Integrative Omics, Yonsei University, Sudaemoon-ku, Seoul, Korea
| | - Chae-Yeon Kim
- Yonsei Proteome Research Center and Department of Integrative Omics, Yonsei University, Sudaemoon-ku, Seoul, Korea
| | - Christopher M. Overall
- Centre for Blood Research, Departments of Oral Biological & Medical Sciences, and Biochemistry & Molecular Biology, Faculty of Dentistry, University of British Columbia, Vancouver, Canada
| |
Collapse
|
3
|
|
4
|
Yang MH, Chen WJ, Fu YS, Huang B, Tsai WC, Arthur Chen YM, Lin PC, Yuan CH, Tyan YC. Utilizing glycine N-methyltransferasegene knockout mice as a model for identification of missing proteins in hepatocellular carcinoma. Oncotarget 2017; 9:442-452. [PMID: 29416626 PMCID: PMC5787479 DOI: 10.18632/oncotarget.23064] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2017] [Accepted: 11/13/2017] [Indexed: 11/25/2022] Open
Abstract
Glycine N-methyltransferase is a tumor suppressor gene for hepatocellular carcinoma, which can activate DNA methylation by inducing the S-adenosylmethionine to S-adenosylhomocystine. Previous studies have indicated that the expression of Glycine N-methyltransferase is inhibited in hepatocellular carcinoma. To confirm and identify missing proteins, the pathologic analysis of the tumor-bearing mice will provide critical histologic information. Such a mouse model is applied as a screening tool for hepatocellular carcinoma as well as a strategy for missing protein discovery. In this study we designed an analysis platform using the human proteome atlas to compare the possible missing proteins to human whole chromosomes. This will integrate the information from animal studies to establish an optimal technique in the missing protein biomarker discovery.
Collapse
Affiliation(s)
- Ming-Hui Yang
- Center for Infectious Disease and Cancer Research, Kaohsiung Medical University, Kaohsiung, Taiwan
| | - Wan-Jou Chen
- Department of Medical Imaging and Radiological Sciences, Kaohsiung Medical University, Kaohsiung, Taiwan
| | - Yaw-Syan Fu
- Center for Infectious Disease and Cancer Research, Kaohsiung Medical University, Kaohsiung, Taiwan.,Department of Biomedical Science and Environmental Biology, Kaohsiung Medical University, Kaohsiung, Taiwan
| | - Bin Huang
- Center for Infectious Disease and Cancer Research, Kaohsiung Medical University, Kaohsiung, Taiwan.,Department of Biomedical Science and Environmental Biology, Kaohsiung Medical University, Kaohsiung, Taiwan
| | - Wan-Chi Tsai
- Department of Medical Laboratory Science and Biotechnology, Kaohsiung Medical University, Kaohsiung, Taiwan
| | - Yi-Ming Arthur Chen
- Center for Infectious Disease and Cancer Research, Kaohsiung Medical University, Kaohsiung, Taiwan.,Graduate Institute of Medicine, College of Medicine, Kaohsiung Medical University, Kaohsiung, Taiwan
| | - Po-Chiao Lin
- Department of Chemistry, National Sun Yat-sen University, Kaohsiung, Taiwan
| | - Cheng-Hui Yuan
- Mass Spectrometry Laboratory, Department of Chemistry, National University of Singapore, Singapore
| | - Yu-Chang Tyan
- Center for Infectious Disease and Cancer Research, Kaohsiung Medical University, Kaohsiung, Taiwan.,Department of Medical Imaging and Radiological Sciences, Kaohsiung Medical University, Kaohsiung, Taiwan.,Graduate Institute of Medicine, College of Medicine, Kaohsiung Medical University, Kaohsiung, Taiwan.,Institute of Medical Science and Technology, National Sun Yat-sen University, Kaohsiung, Taiwan.,Department of Medical Research, Kaohsiung Medical University Hospital, Kaohsiung, Taiwan
| |
Collapse
|
5
|
Schwenk JM, Omenn GS, Sun Z, Campbell DS, Baker MS, Overall CM, Aebersold R, Moritz RL, Deutsch EW. The Human Plasma Proteome Draft of 2017: Building on the Human Plasma PeptideAtlas from Mass Spectrometry and Complementary Assays. J Proteome Res 2017; 16:4299-4310. [PMID: 28938075 PMCID: PMC5864247 DOI: 10.1021/acs.jproteome.7b00467] [Citation(s) in RCA: 141] [Impact Index Per Article: 20.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
Abstract
Human blood plasma provides a highly accessible window to the proteome of any individual in health and disease. Since its inception in 2002, the Human Proteome Organization's Human Plasma Proteome Project (HPPP) has been promoting advances in the study and understanding of the full protein complement of human plasma and on determining the abundance and modifications of its components. In 2017, we review the history of the HPPP and the advances of human plasma proteomics in general, including several recent achievements. We then present the latest 2017-04 build of Human Plasma PeptideAtlas, which yields ∼43 million peptide-spectrum matches and 122,730 distinct peptide sequences from 178 individual experiments at a 1% protein-level FDR globally across all experiments. Applying the latest Human Proteome Project Data Interpretation Guidelines, we catalog 3509 proteins that have at least two non-nested uniquely mapping peptides of nine amino acids or more and >1300 additional proteins with ambiguous evidence. We apply the same two-peptide guideline to historical PeptideAtlas builds going back to 2006 and examine the progress made in the past ten years in plasma proteome coverage. We also compare the distribution of proteins in historical PeptideAtlas builds in various RNA abundance and cellular localization categories. We then discuss advances in plasma proteomics based on targeted mass spectrometry as well as affinity assays, which during early 2017 target ∼2000 proteins. Finally, we describe considerations about sample handling and study design, concluding with an outlook for future advances in deciphering the human plasma proteome.
Collapse
Affiliation(s)
- Jochen M. Schwenk
- Affinity Proteomics, SciLifeLab, School of Biotechnology, KTH Royal Institute of Technology, Tomtebodavägen 23, SE-171 65 Solna, Sweden
| | - Gilbert S. Omenn
- Departments of Computational Medicine & Bioinformatics, Internal Medicine, and Human Genetics and School of Public Health, University of Michigan, Ann Arbor, MI, 48109-2218, USA
- Institute for Systems Biology, Seattle, WA, USA
| | - Zhi Sun
- Institute for Systems Biology, Seattle, WA, USA
| | | | - Mark S. Baker
- Department of Biomedical Sciences, Faculty of Medicine and Health Science, Macquarie University, NSW, 2109. Australia
| | - Christopher M. Overall
- Centre for Blood Research, Departments of Oral Biological & Medical Sciences, and Biochemistry & Molecular Biology, Faculty of Dentistry, University of British Columbia, Vancouver, Canada
| | - Ruedi Aebersold
- Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, Zurich, Switzerland
- Faculty of Science, University of Zurich, 8006 Zurich, Switzerland
| | | | | |
Collapse
|
6
|
Na K, Shin H, Cho JY, Jung SH, Lim J, Lim JS, Kim EA, Kim HS, Kang AR, Kim JH, Shin JM, Jeong SK, Kim CY, Park JY, Chung HM, Omenn GS, Hancock WS, Paik YK. Systematic Proteogenomic Approach To Exploring a Novel Function for NHERF1 in Human Reproductive Disorder: Lessons for Exploring Missing Proteins. J Proteome Res 2017; 16:4455-4467. [PMID: 28960081 DOI: 10.1021/acs.jproteome.7b00146] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Abstract
One of the major goals of the Chromosome-Centric Human Proteome Project (C-HPP) is to fill the knowledge gaps between human genomic information and the corresponding proteomic information. These gaps are due to "missing" proteins (MPs)-predicted proteins with insufficient evidence from mass spectrometry (MS), biochemical, structural, or antibody analyses-that currently account for 2579 of the 19587 predicted human proteins (neXtProt, 2017-01). We address some of the lessons learned from the inconsistent annotations of missing proteins in databases (DB) and demonstrate a systematic proteogenomic approach designed to explore a potential new function of a known protein. To illustrate a cautious and strategic approach for characterization of novel function in vitro and in vivo, we present the case of Na(+)/H(+) exchange regulatory cofactor 1 (NHERF1/SLC9A3R1, located at chromosome 17q25.1; hereafter NHERF1), which was mistakenly labeled as an MP in one DB (Global Proteome Machine Database; GPMDB, 2011-09 release) but was well known in another public DB and in the literature. As a first step, NHERF1 was determined by MS and immunoblotting for its molecular identity. We next investigated the potential new function of NHERF1 by carrying out the quantitative MS profiling of placental trophoblasts (PXD004723) and functional study of cytotrophoblast JEG-3 cells. We found that NHERF1 was associated with trophoblast differentiation and motility. To validate this newly found cellular function of NHERF1, we used the Caenorhabditis elegans mutant of nrfl-1 (a nematode ortholog of NHERF1), which exhibits a protruding vulva (Pvl) and egg-laying-defective phenotype, and performed genetic complementation work. The nrfl-1 mutant was almost fully rescued by the transfection of the recombinant transgenic construct that contained human NHERF1. These results suggest that NHERF1 could have a previously unknown function in pregnancy and in the development of human embryos. Our study outlines a stepwise experimental platform to explore new functions of ambiguously denoted candidate proteins and scrutinizes the mandated DB search for the selection of MPs to study in the future.
Collapse
Affiliation(s)
- Keun Na
- Yonsei Proteome Research Center, Yonsei University , Seoul 03722, South Korea
| | - Heon Shin
- Department of Integrated OMICS for Biomedical Science, Yonsei University , Seoul 03722, South Korea
| | - Jin-Young Cho
- Yonsei Proteome Research Center, Yonsei University , Seoul 03722, South Korea
| | - Sang Hee Jung
- Department of Obstetrics and Gynecology, CHA Bundang Medical Center, CHA University , Seongnam 13496, South Korea
| | - Jaeseung Lim
- CHA Biotech Co., Ltd. , Seongnam 13488, South Korea
| | - Jong-Sun Lim
- Yonsei Proteome Research Center, Yonsei University , Seoul 03722, South Korea
| | - Eun Ah Kim
- Department of Obstetrics and Gynecology, CHA Bundang Medical Center, CHA University , Seongnam 13496, South Korea
| | - Hye Sun Kim
- CHA Biotech Co., Ltd. , Seongnam 13488, South Korea
| | - Ah Reum Kang
- CHA Biotech Co., Ltd. , Seongnam 13488, South Korea
| | - Ji Hye Kim
- CHA Biotech Co., Ltd. , Seongnam 13488, South Korea
| | - Jeong Min Shin
- Department of Biochemistry, CHA University , Seongnam 13488, South Korea
| | - Seul-Ki Jeong
- Yonsei Proteome Research Center, Yonsei University , Seoul 03722, South Korea
| | - Chae-Yeon Kim
- Department of Integrated OMICS for Biomedical Science, Yonsei University , Seoul 03722, South Korea
| | - Jun Young Park
- Department of Integrated OMICS for Biomedical Science, Yonsei University , Seoul 03722, South Korea
| | - Hyung-Min Chung
- Department of Medicine, School of Medicine, Konkuk University , Seoul 143701, South Korea
| | - Gilbert S Omenn
- Center for Computational Medicine and Bioinformatics, University of Michigan , Ann Arbor, Michigan 48109, United States
| | - William S Hancock
- Department of Chemical Biology, Northeastern University , Boston, Massachusetts 02115, United States
| | - Young-Ki Paik
- Yonsei Proteome Research Center, Yonsei University , Seoul 03722, South Korea.,Department of Integrated OMICS for Biomedical Science, Yonsei University , Seoul 03722, South Korea.,Department of Biochemistry, College of Life Science and Biotechnology, Yonsei University , Seoul 03722, South Korea
| |
Collapse
|
7
|
Zhang W, Chen X, Yan Z, Chen Y, Cui Y, Chen B, Huang C, Zhang W, Yin X, He QY, He F, Wang T. Detergent-Insoluble Proteome Analysis Revealed Aberrantly Aggregated Proteins in Human Preeclampsia Placentas. J Proteome Res 2017; 16:4468-4480. [PMID: 28965414 DOI: 10.1021/acs.jproteome.7b00352] [Citation(s) in RCA: 25] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]
Abstract
Preeclampsia (PE) is a placenta disease, featured by hypertension, proteinuria, and other multiorgan dysfunctions, and its etiology is unclear. We and others have shown that intensive endoplasmic reticulum (ER) stress and unfolded protein response (UPR) occur in the PE placenta. In this study, we isolated detergent-insoluble proteins (DIPs) from human placenta tissues, which were enriched with protein aggregates, to characterize the placenta UPR in PE. With data-independent acquisition (DIA) mass spectrometry, we identified 2066 DIPs across all normal (n = 10) and PE (n = 10) placenta samples, among which 110 and 108 DIPs were significantly up- and down-regulated in PE, respectively. Per clustering analysis, differential DIPs could generally distinguish PE from normal placentas. We verified the MS quantitation of endoglin and vimentin by immunoblotting. In addition, we observed that PE placenta tissues have remarkably more endoglin in the cytoplasm. Furthermore, we found that DIPs were evenly distributed across different chromosomes and could be enriched in diversified gene ontology terms, while differential DIPs avoided to distribute on X-chromosome. Significantly up-regulated DIPs in PE were focused on the top functions of lipid metabolism, while 23 of these DIPs could form the top network regulating cellular movement, development, growth, and proliferation. Our results implicate that human PE placentas have disease-relevant differential DIPs, which reflect aberrantly aggregated proteins of placental tissues. The mass spectrometry proteomics data have been deposited to ProteomeXchange consortium with the data set identifier PXD006654, and iProX database (accession number: IPX0000948000).
Collapse
Affiliation(s)
- Wanling Zhang
- Key Laboratory of Functional Protein Research of Guangdong Higher Education Institutes, Institute of Life and Health Engineering, College of Life Science and Technology, Jinan University , 601 Huangpu Avenue West, Guangzhou, Guangdong 510632, China
| | - Xing Chen
- Key Laboratory of Functional Protein Research of Guangdong Higher Education Institutes, Institute of Life and Health Engineering, College of Life Science and Technology, Jinan University , 601 Huangpu Avenue West, Guangzhou, Guangdong 510632, China
| | - Ziqi Yan
- Key Laboratory of Functional Protein Research of Guangdong Higher Education Institutes, Institute of Life and Health Engineering, College of Life Science and Technology, Jinan University , 601 Huangpu Avenue West, Guangzhou, Guangdong 510632, China
| | - Yang Chen
- Key Laboratory of Functional Protein Research of Guangdong Higher Education Institutes, Institute of Life and Health Engineering, College of Life Science and Technology, Jinan University , 601 Huangpu Avenue West, Guangzhou, Guangdong 510632, China
| | - Yizhi Cui
- Key Laboratory of Functional Protein Research of Guangdong Higher Education Institutes, Institute of Life and Health Engineering, College of Life Science and Technology, Jinan University , 601 Huangpu Avenue West, Guangzhou, Guangdong 510632, China
| | | | | | | | - Xingfeng Yin
- Key Laboratory of Functional Protein Research of Guangdong Higher Education Institutes, Institute of Life and Health Engineering, College of Life Science and Technology, Jinan University , 601 Huangpu Avenue West, Guangzhou, Guangdong 510632, China
| | - Qing-Yu He
- Key Laboratory of Functional Protein Research of Guangdong Higher Education Institutes, Institute of Life and Health Engineering, College of Life Science and Technology, Jinan University , 601 Huangpu Avenue West, Guangzhou, Guangdong 510632, China
| | | | - Tong Wang
- Key Laboratory of Functional Protein Research of Guangdong Higher Education Institutes, Institute of Life and Health Engineering, College of Life Science and Technology, Jinan University , 601 Huangpu Avenue West, Guangzhou, Guangdong 510632, China
| |
Collapse
|
8
|
Choong WK, Lih TSM, Chen YJ, Sung TY. Decoding the Effect of Isobaric Substitutions on Identifying Missing Proteins and Variant Peptides in Human Proteome. J Proteome Res 2017; 16:4415-4424. [DOI: 10.1021/acs.jproteome.7b00342] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- Wai-Kok Choong
- Institute of Information Science, Academia Sinica, Taipei 11529, Taiwan
- Institute of Chemistry, Academia Sinica, Taipei 11529, Taiwan
| | - Tung-Shing Mamie Lih
- Institute of Information Science, Academia Sinica, Taipei 11529, Taiwan
- Institute of Chemistry, Academia Sinica, Taipei 11529, Taiwan
| | - Yu-Ju Chen
- Institute of Chemistry, Academia Sinica, Taipei 11529, Taiwan
| | - Ting-Yi Sung
- Institute of Information Science, Academia Sinica, Taipei 11529, Taiwan
| |
Collapse
|
9
|
Segura V, Garin-Muga A, Guruceaga E, Corrales FJ. Progress and pitfalls in finding the 'missing proteins' from the human proteome map. Expert Rev Proteomics 2016; 14:9-14. [PMID: 27885863 DOI: 10.1080/14789450.2017.1265450] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]
Abstract
INTRODUCTION The Human Proteome Project was launched with two main goals: the comprehensive and systematic definition of the human proteome map and the development of ready to use analytical tools to measure relevant proteins in their biological context in health and disease. Despite the great progress in this endeavour, there is still a group of reluctant proteins with no, or scarce, experimental evidence supporting their existence. These are called the 'missing proteins' and represent one of the biggest challenges to complete the human proteome map. Areas covered: This review focuses on the description of the missing proteome based on the HUPO standards, the analysis of the reasons explaining the difficulty of detecting missing proteins and the strategies currently used in the search for missing proteins. The present and future of the quest for the missing proteins is critically revised hereafter. Expert commentary: An overarching multidisciplinary effort is currently being done under the HUPO umbrella to allow completion of the human proteome map. It is expected that the detection of missing proteins will grow in the coming years since the methods and the best tissue/cell type sample for their search are already on the table.
Collapse
Affiliation(s)
- Victor Segura
- a Proteomics and Bioinformatics Laboratory, CIMA , University of Navarra , Pamplona , Spain
| | - Alba Garin-Muga
- a Proteomics and Bioinformatics Laboratory, CIMA , University of Navarra , Pamplona , Spain
| | - Elizabeth Guruceaga
- a Proteomics and Bioinformatics Laboratory, CIMA , University of Navarra , Pamplona , Spain
| | - Fernando J Corrales
- a Proteomics and Bioinformatics Laboratory, CIMA , University of Navarra , Pamplona , Spain
| |
Collapse
|
10
|
Deutsch EW, Overall CM, Van Eyk JE, Baker MS, Paik YK, Weintraub ST, Lane L, Martens L, Vandenbrouck Y, Kusebauch U, Hancock WS, Hermjakob H, Aebersold R, Moritz RL, Omenn GS. Human Proteome Project Mass Spectrometry Data Interpretation Guidelines 2.1. J Proteome Res 2016; 15:3961-3970. [PMID: 27490519 DOI: 10.1021/acs.jproteome.6b00392] [Citation(s) in RCA: 134] [Impact Index Per Article: 16.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023]
Abstract
Every data-rich community research effort requires a clear plan for ensuring the quality of the data interpretation and comparability of analyses. To address this need within the Human Proteome Project (HPP) of the Human Proteome Organization (HUPO), we have developed through broad consultation a set of mass spectrometry data interpretation guidelines that should be applied to all HPP data contributions. For submission of manuscripts reporting HPP protein identification results, the guidelines are presented as a one-page checklist containing 15 essential points followed by two pages of expanded description of each. Here we present an overview of the guidelines and provide an in-depth description of each of the 15 elements to facilitate understanding of the intentions and rationale behind the guidelines, for both authors and reviewers. Broadly, these guidelines provide specific directions regarding how HPP data are to be submitted to mass spectrometry data repositories, how error analysis should be presented, and how detection of novel proteins should be supported with additional confirmatory evidence. These guidelines, developed by the HPP community, are presented to the broader scientific community for further discussion.
Collapse
Affiliation(s)
- Eric W Deutsch
- Institute for Systems Biology , 401 Terry Avenure North, Seattle, Washington 98109, United States
| | - Christopher M Overall
- Centre for Blood Research, Departments of Oral Biological & Medical Sciences, and Biochemistry & Molecular Biology, Faculty of Dentistry, University of British Columbia , Vancouver, British Columbia V6T 1Z3, Canada
| | - Jennifer E Van Eyk
- Advanced Clinical Biosystems Research Institute, Department of Medicine, Cedars Sinai Medical Center , Los Angeles, California 90048, United States
| | - Mark S Baker
- Department of Biomedical Sciences, Faculty of Medicine and Health Science, Macquarie University , Sydney, New South Wales 2109, Australia
| | - Young-Ki Paik
- Yonsei Proteome Research Center and Department of Biochemistry, Yonsei University , 50 Yonsei-ro, Sudaemoon-ku, Seoul 120-749, Korea
| | - Susan T Weintraub
- The University of Texas , Health Science Center at San Antonio, San Antonio, Texas 78229, United States
| | - Lydie Lane
- SIB Swiss Institute of Bioinformatics and Department of Human Protein Science, Faculty of Medicine, University of Geneva , CMU, Michel Servet 1, 1211 Geneva 4, Switzerland
| | - Lennart Martens
- Department of Medical Protein Research, VIB , Ghent 9052, Belgium.,Department of Biochemistry, Ghent University , Ghent B-9000, Belgium
| | - Yves Vandenbrouck
- French Proteomics Infrastructure, Biosciences and Biotechnology Institute of Grenoble (BIG), Université Grenoble Alpes, CEA, INSERM , U1038 Grenoble, France
| | - Ulrike Kusebauch
- Institute for Systems Biology , 401 Terry Avenure North, Seattle, Washington 98109, United States
| | - William S Hancock
- Department of Chemical Biology, Northeastern University , Boston, Massachusetts 02115, United States
| | - Henning Hermjakob
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus , Hinxton, Cambridge CB10 1SD, United Kingdom.,National Center for Protein Sciences , Beijing 102206, China
| | - Ruedi Aebersold
- Department of Biology, Institute of Molecular Systems Biology , ETH Zurich, Zurich 8093, Switzerland.,Faculty of Science, University of Zurich , 8006 Zurich, Switzerland
| | - Robert L Moritz
- Institute for Systems Biology , 401 Terry Avenure North, Seattle, Washington 98109, United States
| | - Gilbert S Omenn
- Institute for Systems Biology , 401 Terry Avenure North, Seattle, Washington 98109, United States.,Departments of Computational Medicine & Bioinformatics, Internal Medicine, and Human Genetics and School of Public Health, University of Michigan , Ann Arbor, Michigan 48109-2218, United States
| |
Collapse
|
11
|
Elguoshy A, Magdeldin S, Xu B, Hirao Y, Zhang Y, Kinoshita N, Takisawa Y, Nameta M, Yamamoto K, El-Refy A, El-Fiky F, Yamamoto T. Why are they missing? : Bioinformatics characterization of missing human proteins. J Proteomics 2016; 149:7-14. [PMID: 27535355 DOI: 10.1016/j.jprot.2016.08.005] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2015] [Revised: 07/17/2016] [Accepted: 08/08/2016] [Indexed: 12/19/2022]
Abstract
NeXtProt is a web-based protein knowledge platform that supports research on human proteins. NeXtProt (release 2015-04-28) lists 20,060 proteins, among them, 3373 canonical proteins (16.8%) lack credible experimental evidence at protein level (PE2:PE5). Therefore, they are considered as "missing proteins". A comprehensive bioinformatic workflow has been proposed to analyze these "missing" proteins. The aims of current study were to analyze physicochemical properties, existence and distribution of the tryptic cleavage sites, and to pinpoint the signature peptides of the missing proteins. Our findings showed that 23.7% of missing proteins were hydrophobic proteins possessing transmembrane domains (TMD). Also, forty missing entries generate tryptic peptides were either out of mass detection range (>30aa) or mapped to different proteins (<9aa). Additionally, 21% of missing entries didn't generate any unique tryptic peptides. In silico endopeptidase combination strategy increased the possibility of missing proteins identification. Coherently, using both mature protein database and signal peptidome database could be a promising option to identify some missing proteins by targeting their unique N-terminal tryptic peptide from mature protein database and or C-terminus tryptic peptide from signal peptidome database. In conclusion, Identification of missing protein requires additional consideration during sample preparation, extraction, digestion and data analysis to increase its incidence of identification.
Collapse
Affiliation(s)
- Amr Elguoshy
- Biofluid Biomarker Center, Institute of Social innovation and Co-operation, Niigata University, Niigata 951-2181, Japan; Biotechnology Department, Faculty of Agriculture, Al-Azhar University, Cairo 11682, Egypt
| | - Sameh Magdeldin
- Biofluid Biomarker Center, Institute of Social innovation and Co-operation, Niigata University, Niigata 951-2181, Japan; Department of Physiology, Faculty of Veterinary Medicine, Suez Canal University, Ismailia 41522, Egypt
| | - Bo Xu
- Biofluid Biomarker Center, Institute of Social innovation and Co-operation, Niigata University, Niigata 951-2181, Japan
| | - Yoshitoshi Hirao
- Biofluid Biomarker Center, Institute of Social innovation and Co-operation, Niigata University, Niigata 951-2181, Japan
| | - Ying Zhang
- Biofluid Biomarker Center, Institute of Social innovation and Co-operation, Niigata University, Niigata 951-2181, Japan
| | - Naohiko Kinoshita
- Biofluid Biomarker Center, Institute of Social innovation and Co-operation, Niigata University, Niigata 951-2181, Japan
| | - Yusuke Takisawa
- Biofluid Biomarker Center, Institute of Social innovation and Co-operation, Niigata University, Niigata 951-2181, Japan
| | - Masaaki Nameta
- Biofluid Biomarker Center, Institute of Social innovation and Co-operation, Niigata University, Niigata 951-2181, Japan
| | - Keiko Yamamoto
- Biofluid Biomarker Center, Institute of Social innovation and Co-operation, Niigata University, Niigata 951-2181, Japan
| | - Ali El-Refy
- Biotechnology Department, Faculty of Agriculture, Al-Azhar University, Cairo 11682, Egypt
| | - Fawzy El-Fiky
- Biotechnology Department, Faculty of Agriculture, Al-Azhar University, Cairo 11682, Egypt
| | - Tadashi Yamamoto
- Biofluid Biomarker Center, Institute of Social innovation and Co-operation, Niigata University, Niigata 951-2181, Japan.
| |
Collapse
|
12
|
Guo J, Cui Y, Yan Z, Luo Y, Zhang W, Deng S, Tang S, Zhang G, He QY, Wang T. Phosphoproteome Characterization of Human Colorectal Cancer SW620 Cell-Derived Exosomes and New Phosphosite Discovery for C-HPP. J Proteome Res 2016; 15:4060-4072. [PMID: 27470641 DOI: 10.1021/acs.jproteome.6b00391] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Affiliation(s)
- Jiahui Guo
- Key Laboratory of Functional
Protein Research of Guangdong Higher Education Institutes, Institute
of Life and Health Engineering, College of Life Science and Technology, Jinan University, 601 Huangpu Avenue West, Guangzhou 510632, China
| | - Yizhi Cui
- Key Laboratory of Functional
Protein Research of Guangdong Higher Education Institutes, Institute
of Life and Health Engineering, College of Life Science and Technology, Jinan University, 601 Huangpu Avenue West, Guangzhou 510632, China
| | - Ziqi Yan
- Key Laboratory of Functional
Protein Research of Guangdong Higher Education Institutes, Institute
of Life and Health Engineering, College of Life Science and Technology, Jinan University, 601 Huangpu Avenue West, Guangzhou 510632, China
| | - Yanzhang Luo
- Key Laboratory of Functional
Protein Research of Guangdong Higher Education Institutes, Institute
of Life and Health Engineering, College of Life Science and Technology, Jinan University, 601 Huangpu Avenue West, Guangzhou 510632, China
| | - Wanling Zhang
- Key Laboratory of Functional
Protein Research of Guangdong Higher Education Institutes, Institute
of Life and Health Engineering, College of Life Science and Technology, Jinan University, 601 Huangpu Avenue West, Guangzhou 510632, China
| | - Suyuan Deng
- Key Laboratory of Functional
Protein Research of Guangdong Higher Education Institutes, Institute
of Life and Health Engineering, College of Life Science and Technology, Jinan University, 601 Huangpu Avenue West, Guangzhou 510632, China
| | - Shengquan Tang
- Key Laboratory of Functional
Protein Research of Guangdong Higher Education Institutes, Institute
of Life and Health Engineering, College of Life Science and Technology, Jinan University, 601 Huangpu Avenue West, Guangzhou 510632, China
| | - Gong Zhang
- Key Laboratory of Functional
Protein Research of Guangdong Higher Education Institutes, Institute
of Life and Health Engineering, College of Life Science and Technology, Jinan University, 601 Huangpu Avenue West, Guangzhou 510632, China
| | - Qing-Yu He
- Key Laboratory of Functional
Protein Research of Guangdong Higher Education Institutes, Institute
of Life and Health Engineering, College of Life Science and Technology, Jinan University, 601 Huangpu Avenue West, Guangzhou 510632, China
| | - Tong Wang
- Key Laboratory of Functional
Protein Research of Guangdong Higher Education Institutes, Institute
of Life and Health Engineering, College of Life Science and Technology, Jinan University, 601 Huangpu Avenue West, Guangzhou 510632, China
| |
Collapse
|
13
|
Fishilevich S, Zimmerman S, Kohn A, Iny Stein T, Olender T, Kolker E, Safran M, Lancet D. Genic insights from integrated human proteomics in GeneCards. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2016; 2016:baw030. [PMID: 27048349 PMCID: PMC4820835 DOI: 10.1093/database/baw030] [Citation(s) in RCA: 102] [Impact Index Per Article: 12.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/29/2015] [Accepted: 02/23/2016] [Indexed: 11/15/2022]
Abstract
GeneCards is a one-stop shop for searchable human gene annotations (http://www.genecards.org/). Data are automatically mined from ∼120 sources and presented in an integrated web card for every human gene. We report the application of recent advances in proteomics to enhance gene annotation and classification in GeneCards. First, we constructed the Human Integrated Protein Expression Database (HIPED), a unified database of protein abundance in human tissues, based on the publically available mass spectrometry (MS)-based proteomics sources ProteomicsDB, Multi-Omics Profiling Expression Database, Protein Abundance Across Organisms and The MaxQuant DataBase. The integrated database, residing within GeneCards, compares favourably with its individual sources, covering nearly 90% of human protein-coding genes. For gene annotation and comparisons, we first defined a protein expression vector for each gene, based on normalized abundances in 69 normal human tissues. This vector is portrayed in the GeneCards expression section as a bar graph, allowing visual inspection and comparison. These data are juxtaposed with transcriptome bar graphs. Using the protein expression vectors, we further defined a pairwise metric that helps assess expression-based pairwise proximity. This new metric for finding functional partners complements eight others, including sharing of pathways, gene ontology (GO) terms and domains, implemented in the GeneCards Suite. In parallel, we calculated proteome-based differential expression, highlighting a subset of tissues that overexpress a gene and subserving gene classification. This textual annotation allows users of VarElect, the suite’s next-generation phenotyper, to more effectively discover causative disease variants. Finally, we define the protein–RNA expression ratio and correlation as yet another attribute of every gene in each tissue, adding further annotative information. The results constitute a significant enhancement of several GeneCards sections and help promote and organize the genome-wide structural and functional knowledge of the human proteome. Database URL: http://www.genecards.org/
Collapse
Affiliation(s)
- Simon Fishilevich
- Department of Molecular Genetics, Weizmann Institute of Science, Rehovot, 7610001, Israel
| | - Shahar Zimmerman
- Department of Molecular Genetics, Weizmann Institute of Science, Rehovot, 7610001, Israel
| | - Asher Kohn
- LifeMap Sciences Ltd., Tel Aviv 69710, Israel
| | - Tsippi Iny Stein
- Department of Molecular Genetics, Weizmann Institute of Science, Rehovot, 7610001, Israel
| | - Tsviya Olender
- Department of Molecular Genetics, Weizmann Institute of Science, Rehovot, 7610001, Israel
| | - Eugene Kolker
- CDO Analytics, Seattle Children's Hospital, Seattle, WA 98101 USA Bioinformatics and High-Throughput Analysis Laboratory, Seattle Children's Research Institute, Seattle, WA 98101 USA Data-Enabled Life Sciences Alliance (DELSA), Seattle, Washington, 98101, USA Departments of Biomedical Informatics and Medical Education and Pediatrics, University of Washington School of Medicine, Seattle, WA 98109, USA Department of Chemistry and Chemical Biology, Northeastern University College of Science, Boston, MA 02115 USA
| | - Marilyn Safran
- Department of Molecular Genetics, Weizmann Institute of Science, Rehovot, 7610001, Israel
| | - Doron Lancet
- Department of Molecular Genetics, Weizmann Institute of Science, Rehovot, 7610001, Israel
| |
Collapse
|
14
|
Proteogenomic Tools and Approaches to Explore Protein Coding Landscapes of Eukaryotic Genomes. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2016; 926:1-10. [DOI: 10.1007/978-3-319-42316-6_1] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
|