1
|
Zhu C, Liu LY, Ha A, Yamaguchi TN, Zhu H, Hugh-White R, Livingstone J, Patel Y, Kislinger T, Boutros PC. moPepGen: Rapid and Comprehensive Identification of Non-canonical Peptides. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.03.28.587261. [PMID: 38585946 PMCID: PMC10996593 DOI: 10.1101/2024.03.28.587261] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/09/2024]
Abstract
Gene expression is a multi-step transformation of biological information from its storage form (DNA) into functional forms (protein and some RNAs). Regulatory activities at each step of this transformation multiply a single gene into a myriad of proteoforms. Proteogenomics is the study of how genomic and transcriptomic variation creates this proteomic diversity, and is limited by the challenges of modeling the complexities of gene-expression. We therefore created moPepGen, a graph-based algorithm that comprehensively generates non-canonical peptides in linear time. moPepGen works with multiple technologies, in multiple species and on all types of genetic and transcriptomic data. In human cancer proteomes, it enumerates previously unobservable noncanonical peptides arising from germline and somatic genomic variants, noncoding open reading frames, RNA fusions and RNA circularization. By enabling efficient detection and quantitation of previously hidden proteins in both existing and new proteomic data, moPepGen facilitates all proteogenomics applications. It is available at: https://github.com/uclahs-cds/package-moPepGen.
Collapse
Affiliation(s)
- Chenghao Zhu
- Department of Human Genetics, University of California, Los Angeles, CA, USA
- Jonsson Comprehensive Cancer Center, University of California, Los Angeles, CA, USA
- Institute for Precision Health, University of California, Los Angeles, CA, USA
- Department of Urology, University of California, Los Angeles, CA, USA
| | - Lydia Y. Liu
- Department of Human Genetics, University of California, Los Angeles, CA, USA
- Jonsson Comprehensive Cancer Center, University of California, Los Angeles, CA, USA
- Department of Medical Biophysics, University of Toronto, Toronto, Canada
- Princess Margaret Cancer Centre, University Health Network, Toronto, Canada
- Vector Institute for Artificial Intelligence, Toronto, Canada
| | - Annie Ha
- Department of Medical Biophysics, University of Toronto, Toronto, Canada
- Princess Margaret Cancer Centre, University Health Network, Toronto, Canada
| | - Takafumi N. Yamaguchi
- Department of Human Genetics, University of California, Los Angeles, CA, USA
- Jonsson Comprehensive Cancer Center, University of California, Los Angeles, CA, USA
- Institute for Precision Health, University of California, Los Angeles, CA, USA
| | - Helen Zhu
- Department of Medical Biophysics, University of Toronto, Toronto, Canada
- Princess Margaret Cancer Centre, University Health Network, Toronto, Canada
- Vector Institute for Artificial Intelligence, Toronto, Canada
| | - Rupert Hugh-White
- Department of Human Genetics, University of California, Los Angeles, CA, USA
- Jonsson Comprehensive Cancer Center, University of California, Los Angeles, CA, USA
- Institute for Precision Health, University of California, Los Angeles, CA, USA
| | - Julie Livingstone
- Department of Human Genetics, University of California, Los Angeles, CA, USA
- Jonsson Comprehensive Cancer Center, University of California, Los Angeles, CA, USA
- Institute for Precision Health, University of California, Los Angeles, CA, USA
| | - Yash Patel
- Department of Human Genetics, University of California, Los Angeles, CA, USA
- Jonsson Comprehensive Cancer Center, University of California, Los Angeles, CA, USA
- Institute for Precision Health, University of California, Los Angeles, CA, USA
| | - Thomas Kislinger
- Department of Medical Biophysics, University of Toronto, Toronto, Canada
- Princess Margaret Cancer Centre, University Health Network, Toronto, Canada
| | - Paul C. Boutros
- Department of Human Genetics, University of California, Los Angeles, CA, USA
- Jonsson Comprehensive Cancer Center, University of California, Los Angeles, CA, USA
- Institute for Precision Health, University of California, Los Angeles, CA, USA
- Department of Urology, University of California, Los Angeles, CA, USA
- Department of Medical Biophysics, University of Toronto, Toronto, Canada
| |
Collapse
|
2
|
Sarygina EV, Kozlova AS, Ponomarenko EA, Ilgisonis EV. The human proteome size as a technological development function. BIOMEDITSINSKAIA KHIMIIA 2024; 70:364-373. [PMID: 39324201 DOI: 10.18097/pbmc20247005364] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/27/2024]
Abstract
Changes in information on the number of human proteoforms, post-translational modification (PTM) events, alternative splicing (AS), single-amino acid polymorphisms (SAP) associated with protein-coding genes in the neXtProt database have been retrospectively analyzed. In 2016, our group proposed three mathematical models for predicting the number of different proteins (proteoforms) in the human proteome. Eight years later, we compared the original data of the information resources and their contribution to the prediction results, correlating the differences with new approaches to experimental and bioinformatic analysis of protein modifications. The aim of this work is to update information on the status of records in the databases of identified proteoforms since 2016, as well as to identify trends in changes in the quantities of these records. According to various information models, modern experimental methods may identify from 5 to 125 million different proteoforms: the proteins formed due to alternative splicing, the implementation of single nucleotide polymorphisms at the proteomic level, and post-translational modifications in various combinations. This result reflects an increase in the size of the human proteome by 20 or more times over the past 8 years.
Collapse
Affiliation(s)
- E V Sarygina
- Institute of Biomedical Chemistry, Moscow, Russia
| | - A S Kozlova
- Institute of Biomedical Chemistry, Moscow, Russia
| | | | | |
Collapse
|
3
|
Kiseleva OI, Arzumanian VA, Kurbatov IY, Poverennaya EV. In silico and in cellulo approaches for functional annotation of human protein splice variants. BIOMEDITSINSKAIA KHIMIIA 2024; 70:315-328. [PMID: 39324196 DOI: 10.18097/pbmc20247005315] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/27/2024]
Abstract
The elegance of pre-mRNA splicing mechanisms continues to interest scientists even after over a half century, since the discovery of the fact that coding regions in genes are interrupted by non-coding sequences. The vast majority of human genes have several mRNA variants, coding structurally and functionally different protein isoforms in a tissue-specific manner and with a linkage to specific developmental stages of the organism. Alteration of splicing patterns shifts the balance of functionally distinct proteins in living systems, distorts normal molecular pathways, and may trigger the onset and progression of various pathologies. Over the past two decades, numerous studies have been conducted in various life sciences disciplines to deepen our understanding of splicing mechanisms and the extent of their impact on the functioning of living systems. This review aims to summarize experimental and computational approaches used to elucidate the functions of splice variants of a single gene based on our experience accumulated in the laboratory of interactomics of proteoforms at the Institute of Biomedical Chemistry (IBMC) and best global practices.
Collapse
Affiliation(s)
- O I Kiseleva
- Institute of Biomedical Chemistry, Moscow, Russia
| | | | | | | |
Collapse
|
4
|
Cai SH, Wang B, Zhang J, Guo J, Hu B. Wearable sampling of proteins from human exhaled aerosols for nano-liquid chromatography-tandem mass spectrometry analysis: A pilot study. RAPID COMMUNICATIONS IN MASS SPECTROMETRY : RCM 2024; 38:e9737. [PMID: 38533583 DOI: 10.1002/rcm.9737] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/05/2024] [Revised: 02/25/2024] [Accepted: 02/27/2024] [Indexed: 03/28/2024]
Abstract
RATIONALE Human exhaled breath usually contains unique proteins that may provide clues to characterize individual physiological activities and many diseases. However, the concentration of exhaled proteins in exhaled breath is extremely low and usually does not reach the detection limits of all online breath mass spectrometry instruments. Therefore, developing a new breath sampler for collecting and characterizing exhaled proteins is important. METHODS In this study, a new mask-based wearable sampler was developed by fixing metal materials into the inner surface of the KN95 mask. Human exhaled proteins could be directly adsorbed onto the metal material while wearing the mask. After sampling, the collected proteins were eluted, digested, and identified using nano-liquid chromatography-tandem mass spectrometry (nano-LC-MS/MS). RESULTS The adsorption of exhaled proteins was evaluated, showing that modified gold foil is an effective material for collecting exhaled proteins. Various endogenous proteins were successfully identified from exhaled breath, many of which can be potential biomarkers for disease diagnosis. CONCLUSIONS By coupling the newly developed mask sampler with nano-LC-MS/MS, human exhaled proteins were successfully collected and identified. Our results show that the mask sampler is wearable, simple, and convenient, and the method is noninvasive for investigating disease diagnosis and human health.
Collapse
Affiliation(s)
- Shen-Hui Cai
- Institute of Mass Spectrometry and Atmospheric Environment, Guangdong Provincial Engineering Research Center for On-line Source Apportionment System of Air Pollution, Guangdong Provincial Key Laboratory of Speed Capability Research, Jinan University, Guangzhou, China
| | - Baixue Wang
- Institute of Mass Spectrometry and Atmospheric Environment, Guangdong Provincial Engineering Research Center for On-line Source Apportionment System of Air Pollution, Guangdong Provincial Key Laboratory of Speed Capability Research, Jinan University, Guangzhou, China
| | - Jianfeng Zhang
- Institute of Mass Spectrometry and Atmospheric Environment, Guangdong Provincial Engineering Research Center for On-line Source Apportionment System of Air Pollution, Guangdong Provincial Key Laboratory of Speed Capability Research, Jinan University, Guangzhou, China
| | - Jiubiao Guo
- Clinical Research Center, The First Affiliated Hospital of Shantou University Medical College, Shantou, China
| | - Bin Hu
- Institute of Mass Spectrometry and Atmospheric Environment, Guangdong Provincial Engineering Research Center for On-line Source Apportionment System of Air Pollution, Guangdong Provincial Key Laboratory of Speed Capability Research, Jinan University, Guangzhou, China
| |
Collapse
|
5
|
van Rijn JPM, Martens M, Ammar A, Cimpan MR, Fessard V, Hoet P, Jeliazkova N, Murugadoss S, Vinković Vrček I, Willighagen EL. From papers to RDF-based integration of physicochemical data and adverse outcome pathways for nanomaterials. J Cheminform 2024; 16:49. [PMID: 38693555 PMCID: PMC11064368 DOI: 10.1186/s13321-024-00833-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2023] [Accepted: 03/23/2024] [Indexed: 05/03/2024] Open
Abstract
Adverse Outcome Pathways (AOPs) have been proposed to facilitate mechanistic understanding of interactions of chemicals/materials with biological systems. Each AOP starts with a molecular initiating event (MIE) and possibly ends with adverse outcome(s) (AOs) via a series of key events (KEs). So far, the interaction of engineered nanomaterials (ENMs) with biomolecules, biomembranes, cells, and biological structures, in general, is not yet fully elucidated. There is also a huge lack of information on which AOPs are ENMs-relevant or -specific, despite numerous published data on toxicological endpoints they trigger, such as oxidative stress and inflammation. We propose to integrate related data and knowledge recently collected. Our approach combines the annotation of nanomaterials and their MIEs with ontology annotation to demonstrate how we can then query AOPs and biological pathway information for these materials. We conclude that a FAIR (Findable, Accessible, Interoperable, Reusable) representation of the ENM-MIE knowledge simplifies integration with other knowledge. SCIENTIFIC CONTRIBUTION: This study introduces a new database linking nanomaterial stressors to the first known MIE or KE. Second, it presents a reproducible workflow to analyze and summarize this knowledge. Third, this work extends the use of semantic web technologies to the field of nanoinformatics and nanosafety.
Collapse
Affiliation(s)
- Jeaphianne P M van Rijn
- Dept of Bioinformatics, BiGCaT, NUTRIM, FHML, Maastricht University, Maastricht, The Netherlands
| | - Marvin Martens
- Dept of Bioinformatics, BiGCaT, NUTRIM, FHML, Maastricht University, Maastricht, The Netherlands
| | - Ammar Ammar
- Dept of Bioinformatics, BiGCaT, NUTRIM, FHML, Maastricht University, Maastricht, The Netherlands
| | - Mihaela Roxana Cimpan
- Department of Clinical Dentistry, Faculty of Medicine, University of Bergen, Bergen, Norway
| | - Valerie Fessard
- Fougères Laboratory, Anses, French Agency for Food, Environmental and Occupational Health and Safety, Toxicology of Contaminants Unit, Fougères, France
| | - Peter Hoet
- Laboratory of Toxicology, Unit of Environment and Health, Department of Public Health and Primary Care, KU Leuven, Leuven, Belgium
| | | | - Sivakumar Murugadoss
- Laboratory of Toxicology, Unit of Environment and Health, Department of Public Health and Primary Care, KU Leuven, Leuven, Belgium
- SD Chemical and Physical Health Risks, Brussels, Belgium
| | | | - Egon L Willighagen
- Dept of Bioinformatics, BiGCaT, NUTRIM, FHML, Maastricht University, Maastricht, The Netherlands.
| |
Collapse
|
6
|
Santos LGC, Parreira VDSC, da Silva EMG, Santos MDM, Fernandes ADF, Neves-Ferreira AGDC, Carvalho PC, Freitas FCDP, Passetti F. SpliceProt 2.0: A Sequence Repository of Human, Mouse, and Rat Proteoforms. Int J Mol Sci 2024; 25:1183. [PMID: 38256255 PMCID: PMC10816255 DOI: 10.3390/ijms25021183] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2023] [Revised: 12/15/2023] [Accepted: 01/03/2024] [Indexed: 01/24/2024] Open
Abstract
SpliceProt 2.0 is a public proteogenomics database that aims to list the sequence of known proteins and potential new proteoforms in human, mouse, and rat proteomes. This updated repository provides an even broader range of computationally translated proteins and serves, for example, to aid with proteomic validation of splice variants absent from the reference UniProtKB/SwissProt database. We demonstrate the value of SpliceProt 2.0 to predict orthologous proteins between humans and murines based on transcript reconstruction, sequence annotation and detection at the transcriptome and proteome levels. In this release, the annotation data used in the reconstruction of transcripts based on the methodology of ternary matrices were acquired from new databases such as Ensembl, UniProt, and APPRIS. Another innovation implemented in the pipeline is the exclusion of transcripts predicted to be susceptible to degradation through the NMD pathway. Taken together, our repository and its applications represent a valuable resource for the proteogenomics community.
Collapse
Affiliation(s)
- Letícia Graziela Costa Santos
- Instituto Carlos Chagas, Fundação Oswaldo Cruz (FIOCRUZ), Rua Professor Algacyr Munhoz Mader 3775, Cidade Industrial De Curitiba, Curitiba 81310-020, PR, Brazil
| | - Vinícius da Silva Coutinho Parreira
- Instituto Carlos Chagas, Fundação Oswaldo Cruz (FIOCRUZ), Rua Professor Algacyr Munhoz Mader 3775, Cidade Industrial De Curitiba, Curitiba 81310-020, PR, Brazil
| | - Esdras Matheus Gomes da Silva
- Instituto Carlos Chagas, Fundação Oswaldo Cruz (FIOCRUZ), Rua Professor Algacyr Munhoz Mader 3775, Cidade Industrial De Curitiba, Curitiba 81310-020, PR, Brazil
- Laboratory of Toxinology, Oswaldo Cruz Institute, Fundação Oswaldo Cruz (FIOCRUZ), Av. Brazil 4036, Campus Maré, Rio de Janeiro 21040-361, RJ, Brazil
| | - Marlon Dias Mariano Santos
- Instituto Carlos Chagas, Fundação Oswaldo Cruz (FIOCRUZ), Rua Professor Algacyr Munhoz Mader 3775, Cidade Industrial De Curitiba, Curitiba 81310-020, PR, Brazil
| | - Alexander da Franca Fernandes
- Instituto Carlos Chagas, Fundação Oswaldo Cruz (FIOCRUZ), Rua Professor Algacyr Munhoz Mader 3775, Cidade Industrial De Curitiba, Curitiba 81310-020, PR, Brazil
| | - Ana Gisele da Costa Neves-Ferreira
- Laboratory of Toxinology, Oswaldo Cruz Institute, Fundação Oswaldo Cruz (FIOCRUZ), Av. Brazil 4036, Campus Maré, Rio de Janeiro 21040-361, RJ, Brazil
| | - Paulo Costa Carvalho
- Instituto Carlos Chagas, Fundação Oswaldo Cruz (FIOCRUZ), Rua Professor Algacyr Munhoz Mader 3775, Cidade Industrial De Curitiba, Curitiba 81310-020, PR, Brazil
| | - Flávia Cristina de Paula Freitas
- Instituto Carlos Chagas, Fundação Oswaldo Cruz (FIOCRUZ), Rua Professor Algacyr Munhoz Mader 3775, Cidade Industrial De Curitiba, Curitiba 81310-020, PR, Brazil
- Departamento de Genética e Evolução, Universidade Federal de São Carlos (UFSCar), Rodovia Washington Luis, Km 235, São Carlos 13565-905, SP, Brazil
| | - Fabio Passetti
- Instituto Carlos Chagas, Fundação Oswaldo Cruz (FIOCRUZ), Rua Professor Algacyr Munhoz Mader 3775, Cidade Industrial De Curitiba, Curitiba 81310-020, PR, Brazil
| |
Collapse
|
7
|
Colaco V, Goswami N, Goel VK, Srivastava SK, Lalrohlua P, Senthil Kumar N, Borah P, Baruah R, Varma AK. In silico and structure-based evaluation of deleterious mutations identified in human Chk1, Chk2, and Wee1 protein kinase. J Cell Biochem 2024; 125:89-99. [PMID: 38047473 DOI: 10.1002/jcb.30508] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2023] [Revised: 11/16/2023] [Accepted: 11/20/2023] [Indexed: 12/05/2023]
Abstract
Checkpoint kinases Chk1, Chk2, Wee1 are playing a key role in DNA damage response and genomic integrity. Cancer-associated mutations identified in human Chk1, Chk2, and Wee1 were retrieved to understand the function associated with the mutation and also alterations in the folding pattern. Therefore, an attempt has been made to identify deleterious effect of variants using in silico and structure-based approach. Variants of uncertain significance for Chk1, Chk2, and Wee1 were retrieved from different databases and four prediction servers were employed to predict pathogenicity of mutations. Further, Interpro, I-Mutant 3.0, Consurf, TM-align, and have (y)our protein explained were used for comprehensive study of the deleterious effects of variants. The sequences of Chk1, Chk2, and Wee1 were analyzed using Clustal Omega, and the three-dimensional structures of the proteins were aligned using TM-align. The molecular dynamics simulations were performed to explore the differences in folding pattern between Chk1, Chk2, Wee1 wild-type, and mutant protein and also to evaluate the structural integrity. Thirty-six variants in Chk1, 250 Variants in Chk2, and 29 in Wee1 were categorized as pathogenic using in silico prediction tools. Furthermore, 25 mutations in Chk1, 189 in Chk2, and 14 in Wee1 were highly conserved, possessing deleterious effect and also influencing the protein structure and function. These identified mutations may provide underlying genetic intricacies to serve as potential targets for therapeutic inventions and clinical management.
Collapse
Affiliation(s)
- Venessa Colaco
- Advanced Centre for Treatment, Research and Education in Cancer, Kharghar, Navi Mumbai, Maharashtra, India
| | - Nabajyoti Goswami
- Advanced Centre for Treatment, Research and Education in Cancer, Kharghar, Navi Mumbai, Maharashtra, India
| | - Vijay Kumar Goel
- School of Physical Sciences, Jawaharlal Nehru University, New Delhi, India
| | | | | | | | - Probodh Borah
- College of Veterinary Science, Assam Agricultural University, Khanapara, Guwahati, Assam, India
| | - Reshita Baruah
- Advanced Centre for Treatment, Research and Education in Cancer, Kharghar, Navi Mumbai, Maharashtra, India
| | - Ashok K Varma
- Advanced Centre for Treatment, Research and Education in Cancer, Kharghar, Navi Mumbai, Maharashtra, India
- Homi Bhabha National Institute, Training School Complex, Anushaktinagar, Mumbai, India
| |
Collapse
|
8
|
Wang XY, Xu YM, Lau ATY. Proteogenomics in Cancer: Then and Now. J Proteome Res 2023; 22:3103-3122. [PMID: 37725793 DOI: 10.1021/acs.jproteome.3c00196] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/21/2023]
Abstract
For years, the paths of sequencing technologies and mass spectrometry have occurred in isolation, with each developing its own unique culture and expertise. These two technologies are crucial for inspecting complementary aspects of the molecular phenotype across the central dogma. Integrative multiomics strives to bridge the analysis gap among different fields to complete more comprehensive mechanisms of life events and diseases. Proteogenomics is one integrated multiomics field. Here in this review, we mainly summarize and discuss three aspects: workflow of proteogenomics, proteogenomics applications in cancer research, and the SWOT (Strengths, Weaknesses, Opportunities, Threats) analysis of proteogenomics in cancer research. In conclusion, proteogenomics has a promising future as it clarifies the functional consequences of many unannotated genomic abnormalities or noncanonical variants and identifies driver genes and novel therapeutic targets across cancers, which would substantially accelerate the development of precision oncology.
Collapse
Affiliation(s)
- Xiu-Yun Wang
- Laboratory of Cancer Biology and Epigenetics, Department of Cell Biology and Genetics, Shantou University Medical College, Shantou, Guangdong 515041, People's Republic of China
| | - Yan-Ming Xu
- Laboratory of Cancer Biology and Epigenetics, Department of Cell Biology and Genetics, Shantou University Medical College, Shantou, Guangdong 515041, People's Republic of China
| | - Andy T Y Lau
- Laboratory of Cancer Biology and Epigenetics, Department of Cell Biology and Genetics, Shantou University Medical College, Shantou, Guangdong 515041, People's Republic of China
| |
Collapse
|
9
|
Novikova SE, Tolstova TV, Soloveva NA, Farafonova TE, Tikhonova OV, Kurbatov LK, Rusanov AL, Zgoda VG. Proteomic Approach to Investigating Expression, Localization, and Functions of the SOWAHD Gene Protein Product during Granulocytic Differentiation. BIOCHEMISTRY. BIOKHIMIIA 2023; 88:1668-1682. [PMID: 38105032 DOI: 10.1134/s000629792310019x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/22/2023] [Revised: 08/23/2023] [Accepted: 08/26/2023] [Indexed: 12/19/2023]
Abstract
Cataloging human proteins and evaluation of their expression, cellular localization, functions, and potential medical significance are important tasks for the global proteomic community. At present, localization and functions of protein products for almost half of protein-coding genes remain unknown or poorly understood. Investigation of organelle proteomes is a promising approach to uncovering localization and functions of human proteins. Nuclear proteome is of particular interest because many nuclear proteins, e.g., transcription factors, regulate functions that determine cell fate. Meta-analysis of the nuclear proteome, or nucleome, of HL-60 cells treated with all-trans-retinoic acid (ATRA) has shown that the functions and localization of a protein product of the SOWAHD gene are poorly understood. Also, there is no comprehensive information on the SOWAHD gene expression at the protein level. In HL-60 cells, the number of mRNA transcripts of the SOWAHD gene was determined as 6.4 ± 0.7 transcripts per million molecules. Using targeted mass spectrometry, the content of the SOWAHD protein was measured as 0.27 to 1.25 fmol/μg total protein. The half-life for the protein product of the SOWAHD gene determined using stable isotope pulse-chase labeling was ~19 h. Proteomic profiling of the nuclear fraction of HL-60 cells showed that the content of the SOWAHD protein increased during the ATRA-induced granulocytic differentiation, reached the peak value at 9 h after ATRA addition, and then decreased. Nuclear location and involvement of the SOWAHD protein in the ATRA-induced granulocytic differentiation have been demonstrated for the first time.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | - Victor G Zgoda
- Institute of Biomedical Chemistry, Moscow, 119121, Russia.
| |
Collapse
|
10
|
Nasaev SS, Kopeykina AS, Kuznetsova KG, Levitsky LI, Moshkovskii SA. Proteomic Analysis of Zebrafish Protein Recoding via mRNA Editing by ADAR Enzymes. BIOCHEMISTRY. BIOKHIMIIA 2022; 87:1301-1309. [PMID: 36509721 DOI: 10.1134/s0006297922110098] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
RNA editing by adenosine deaminases of the ADAR family can lead to protein recoding, since inosine formed from adenosine in mRNA is complementary to cytosine; the resulting codon editing might introduce amino acid substitutions into translated proteins. Proteome recoding can have functional consequences which have been described in many animals including humans. Using protein recoding database derived from publicly available transcriptome data, we identified for the first time the recoding sites in the zebrafish shotgun proteomes. Out of more than a hundred predicted recoding events, ten substitutions were found in six used datasets. Seven of them were in the AMPA glutamate receptor subunits, whose recoding has been well described, and are conserved among vertebrates. Three sites were specific for zebrafish proteins and were found in the transmembrane receptors astrotactin 1 and neuregulin 3b (proteins involved in the neuronal adhesion and signaling) and in the rims2b gene product (presynaptic membrane protein participating in the neurotransmitter release), respectively. Further studies are needed to elucidate the role of recoding of the said three proteins in the zebrafish.
Collapse
Affiliation(s)
- Shamsudin S Nasaev
- Federal Research and Clinical Center of Physical-Chemical Medicine, Moscow, 119435, Russia.,Institute of Biomedical Chemistry, Moscow, 119121, Russia
| | - Anna S Kopeykina
- Pirogov Russian National Research Medical University, Moscow, 117997, Russia
| | | | - Lev I Levitsky
- Talrose Institute for Energy Problems of Chemical Physics, Semenov Federal Research Center for Chemical Physics, Russian Academy of Sciences, Moscow, 119334, Russia
| | - Sergei A Moshkovskii
- Federal Research and Clinical Center of Physical-Chemical Medicine, Moscow, 119435, Russia. .,Pirogov Russian National Research Medical University, Moscow, 117997, Russia
| |
Collapse
|
11
|
Zhu YH, Zhang C, Liu Y, Omenn GS, Freddolino PL, Yu DJ, Zhang Y. TripletGO: Integrating Transcript Expression Profiles with Protein Homology Inferences for Gene Function Prediction. GENOMICS, PROTEOMICS & BIOINFORMATICS 2022; 20:1013-1027. [PMID: 35568117 PMCID: PMC10025770 DOI: 10.1016/j.gpb.2022.03.001] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/20/2021] [Revised: 03/02/2022] [Accepted: 04/16/2022] [Indexed: 01/13/2023]
Abstract
Gene Ontology (GO) has been widely used to annotate functions of genes and gene products. Here, we proposed a new method, TripletGO, to deduce GO terms of protein-coding and non-coding genes, through the integration of four complementary pipelines built on transcript expression profile, genetic sequence alignment, protein sequence alignment, and naïve probability. TripletGO was tested on a large set of 5754 genes from 8 species (human, mouse, Arabidopsis, rat, fly, budding yeast, fission yeast, and nematoda) and 2433 proteins with available expression data from the third Critical Assessment of Protein Function Annotation challenge (CAFA3). Experimental results show that TripletGO achieves function annotation accuracy significantly beyond the current state-of-the-art approaches. Detailed analyses show that the major advantage of TripletGO lies in the coupling of a new triplet network-based profiling method with the feature space mapping technique, which can accurately recognize function patterns from transcript expression profiles. Meanwhile, the combination of multiple complementary models, especially those from transcript expression and protein-level alignments, improves the coverage and accuracy of the final GO annotation results. The standalone package and an online server of TripletGO are freely available at https://zhanggroup.org/TripletGO/.
Collapse
Affiliation(s)
- Yi-Heng Zhu
- School of Computer Science and Engineering, Nanjing University of Science and Technology, Nanjing 210094, China; Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA
| | - Chengxin Zhang
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA
| | - Yan Liu
- School of Computer Science and Engineering, Nanjing University of Science and Technology, Nanjing 210094, China
| | - Gilbert S Omenn
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA; Departments of Internal Medicine and Human Genetics, and School of Public Health, University of Michigan, Ann Arbor, MI 48109, USA
| | - Peter L Freddolino
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA; Department of Biological Chemistry, University of Michigan, Ann Arbor, MI 48109, USA
| | - Dong-Jun Yu
- School of Computer Science and Engineering, Nanjing University of Science and Technology, Nanjing 210094, China.
| | - Yang Zhang
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI 48109, USA; Department of Biological Chemistry, University of Michigan, Ann Arbor, MI 48109, USA.
| |
Collapse
|
12
|
Martyn GD, Veggiani G, Kusebauch U, Morrone SR, Yates BP, Singer AU, Tong J, Manczyk N, Gish G, Sun Z, Kurinov I, Sicheri F, Moran MF, Moritz RL, Sidhu SS. Engineered SH2 Domains for Targeted Phosphoproteomics. ACS Chem Biol 2022; 17:1472-1484. [PMID: 35613471 DOI: 10.1021/acschembio.2c00051] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
A comprehensive analysis of the phosphoproteome is essential for understanding molecular mechanisms of human diseases. However, current tools used to enrich phosphotyrosine (pTyr) are limited in their applicability and scope. Here, we engineered new superbinder Src-Homology 2 (SH2) domains that enrich diverse sets of pTyr-peptides. We used phage display to select a Fes-SH2 domain variant (superFes; sFes1) with high affinity for pTyr and solved its structure bound to a pTyr-peptide. We performed systematic structure-function analyses of the superbinding mechanisms of sFes1 and superSrc-SH2 (sSrc1), another SH2 superbinder. We grafted the superbinder motifs from sFes1 and sSrc1 into 17 additional SH2 domains and confirmed increased binding affinity for specific pTyr-peptides. Using mass spectrometry (MS), we demonstrated that SH2 superbinders have distinct specificity profiles and superior capabilities to enrich pTyr-peptides. Finally, using combinations of SH2 superbinders as affinity purification (AP) tools we showed that unique subsets of pTyr-peptides can be enriched with unparalleled depth and coverage.
Collapse
Affiliation(s)
- Gregory D. Martyn
- Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, 160 College Street, Toronto, Ontario M5S3E1, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario M5S 1A8, Canada
| | - Gianluca Veggiani
- Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, 160 College Street, Toronto, Ontario M5S3E1, Canada
| | - Ulrike Kusebauch
- Institute for Systems Biology, Seattle, Washington 98109, United States
| | - Seamus R. Morrone
- Institute for Systems Biology, Seattle, Washington 98109, United States
| | - Bradley P. Yates
- Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, 160 College Street, Toronto, Ontario M5S3E1, Canada
| | - Alex U. Singer
- Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, 160 College Street, Toronto, Ontario M5S3E1, Canada
| | - Jiefei Tong
- Program in Cell biology, Hospital for Sick Children, Toronto M5G 0A4, Canada
| | - Noah Manczyk
- Department of Biochemistry, University of Toronto, Toronto, Ontario M5S 1A8, Canada
- Lunenfeld-Tanenbaum Research Institute, Mount Sinai Hospital, Toronto, Ontario M5G 1X5, Canada
| | - Gerald Gish
- Lunenfeld-Tanenbaum Research Institute, Mount Sinai Hospital, Toronto, Ontario M5G 1X5, Canada
| | - Zhi Sun
- Institute for Systems Biology, Seattle, Washington 98109, United States
| | - Igor Kurinov
- Department of Chemistry and Chemical Biology, Cornell University, NE-CAT, Argonne, Illinois 60439, United States
| | - Frank Sicheri
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario M5S 1A8, Canada
- Department of Biochemistry, University of Toronto, Toronto, Ontario M5S 1A8, Canada
- Lunenfeld-Tanenbaum Research Institute, Mount Sinai Hospital, Toronto, Ontario M5G 1X5, Canada
| | - Michael F. Moran
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario M5S 1A8, Canada
- Program in Cell biology, Hospital for Sick Children, Toronto M5G 0A4, Canada
- The Hospital for Sick Children, SPARC Biocentre, Toronto, Ontario M5G 0A4, Canada
| | - Robert L. Moritz
- Institute for Systems Biology, Seattle, Washington 98109, United States
| | - Sachdev S. Sidhu
- Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, 160 College Street, Toronto, Ontario M5S3E1, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario M5S 1A8, Canada
| |
Collapse
|
13
|
Deutsch EW, Omenn GS, Sun Z, Maes M, Pernemalm M, Palaniappan KK, Letunica N, Vandenbrouck Y, Brun V, Tao SC, Yu X, Geyer PE, Ignjatovic V, Moritz RL, Schwenk JM. Advances and Utility of the Human Plasma Proteome. J Proteome Res 2021; 20:5241-5263. [PMID: 34672606 PMCID: PMC9469506 DOI: 10.1021/acs.jproteome.1c00657] [Citation(s) in RCA: 83] [Impact Index Per Article: 27.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/08/2023]
Abstract
The study of proteins circulating in blood offers tremendous opportunities to diagnose, stratify, or possibly prevent diseases. With recent technological advances and the urgent need to understand the effects of COVID-19, the proteomic analysis of blood-derived serum and plasma has become even more important for studying human biology and pathophysiology. Here we provide views and perspectives about technological developments and possible clinical applications that use mass-spectrometry(MS)- or affinity-based methods. We discuss examples where plasma proteomics contributed valuable insights into SARS-CoV-2 infections, aging, and hemostasis and the opportunities offered by combining proteomics with genetic data. As a contribution to the Human Proteome Organization (HUPO) Human Plasma Proteome Project (HPPP), we present the Human Plasma PeptideAtlas build 2021-07 that comprises 4395 canonical and 1482 additional nonredundant human proteins detected in 240 MS-based experiments. In addition, we report the new Human Extracellular Vesicle PeptideAtlas 2021-06, which comprises five studies and 2757 canonical proteins detected in extracellular vesicles circulating in blood, of which 74% (2047) are in common with the plasma PeptideAtlas. Our overview summarizes the recent advances, impactful applications, and ongoing challenges for translating plasma proteomics into utility for precision medicine.
Collapse
Affiliation(s)
- Eric W Deutsch
- Institute for Systems Biology, Seattle, Washington 98109, United States
| | - Gilbert S Omenn
- Institute for Systems Biology, Seattle, Washington 98109, United States.,Departments of Computational Medicine & Bioinformatics, Internal Medicine, and Human Genetics and School of Public Health, University of Michigan, Ann Arbor, Michigan 48109-2218, United States
| | - Zhi Sun
- Institute for Systems Biology, Seattle, Washington 98109, United States
| | - Michal Maes
- Institute for Systems Biology, Seattle, Washington 98109, United States
| | - Maria Pernemalm
- Department of Oncology and Pathology/Science for Life Laboratory, Karolinska Institutet, 171 65 Stockholm, Sweden
| | | | - Natasha Letunica
- Murdoch Children's Research Institute, 50 Flemington Road, Parkville 3052, Victoria, Australia
| | - Yves Vandenbrouck
- Université Grenoble Alpes, CEA, Inserm U1292, Grenoble 38000, France
| | - Virginie Brun
- Université Grenoble Alpes, CEA, Inserm U1292, Grenoble 38000, France
| | - Sheng-Ce Tao
- Key Laboratory of Systems Biomedicine (Ministry of Education), Shanghai Center for Systems Biomedicine, Shanghai Jiao Tong University, B207 SCSB Building, 800 Dongchuan Road, Shanghai 200240, China
| | - Xiaobo Yu
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences-Beijing (PHOENIX Center), Beijing Institute of Lifeomics, Beijing 102206, China
| | - Philipp E Geyer
- OmicEra Diagnostics GmbH, Behringstr. 6, 82152 Planegg, Germany
| | - Vera Ignjatovic
- Murdoch Children's Research Institute, 50 Flemington Road, Parkville 3052, Victoria, Australia.,Department of Paediatrics, The University of Melbourne, 50 Flemington Road, Parkville 3052, Victoria, Australia
| | - Robert L Moritz
- Institute for Systems Biology, Seattle, Washington 98109, United States
| | - Jochen M Schwenk
- Affinity Proteomics, Science for Life Laboratory, Department of Protein Science, KTH Royal Institute of Technology, Tomtebodavägen 23, SE-171 65 Solna, Sweden
| |
Collapse
|
14
|
Pailleux F, Maes P, Jaquinod M, Barthelon J, Darnaud M, Lacoste C, Vandenbrouck Y, Gilquin B, Louwagie M, Hesse AM, Kraut A, Garin J, Leroy V, Zarski JP, Bruley C, Couté Y, Samuel D, Ichai P, Faivre J, Brun V. Mass Spectrometry-Based Proteomics Reveal Alcohol Dehydrogenase 1B as a Blood Biomarker Candidate to Monitor Acetaminophen-Induced Liver Injury. Int J Mol Sci 2021; 22:ijms222011071. [PMID: 34681731 PMCID: PMC8540689 DOI: 10.3390/ijms222011071] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2021] [Revised: 10/03/2021] [Accepted: 10/11/2021] [Indexed: 12/18/2022] Open
Abstract
Acute liver injury (ALI) is a severe disorder resulting from excessive hepatocyte cell death, and frequently caused by acetaminophen intoxication. Clinical management of ALI progression is hampered by the dearth of blood biomarkers available. In this study, a bioinformatics workflow was developed to screen omics databases and identify potential biomarkers for hepatocyte cell death. Then, discovery proteomics was harnessed to select from among these candidates those that were specifically detected in the blood of acetaminophen-induced ALI patients. Among these candidates, the isoenzyme alcohol dehydrogenase 1B (ADH1B) was massively leaked into the blood. To evaluate ADH1B, we developed a targeted proteomics assay and quantified ADH1B in serum samples collected at different times from 17 patients admitted for acetaminophen-induced ALI. Serum ADH1B concentrations increased markedly during the acute phase of the disease, and dropped to undetectable levels during recovery. In contrast to alanine aminotransferase activity, the rapid drop in circulating ADH1B concentrations was followed by an improvement in the international normalized ratio (INR) within 10–48 h, and was associated with favorable outcomes. In conclusion, the combination of omics data exploration and proteomics revealed ADH1B as a new blood biomarker candidate that could be useful for the monitoring of acetaminophen-induced ALI.
Collapse
Affiliation(s)
- Floriane Pailleux
- Univ. Grenoble Alpes, INSERM, CEA, UMR BioSanté U1292, CNRS, CEA, FR2048, 38000 Grenoble, France; (F.P.); (P.M.); (M.J.); (J.B.); (Y.V.); (B.G.); (M.L.); (A.-M.H.); (A.K.); (J.G.); (C.B.); (Y.C.)
| | - Pauline Maes
- Univ. Grenoble Alpes, INSERM, CEA, UMR BioSanté U1292, CNRS, CEA, FR2048, 38000 Grenoble, France; (F.P.); (P.M.); (M.J.); (J.B.); (Y.V.); (B.G.); (M.L.); (A.-M.H.); (A.K.); (J.G.); (C.B.); (Y.C.)
| | - Michel Jaquinod
- Univ. Grenoble Alpes, INSERM, CEA, UMR BioSanté U1292, CNRS, CEA, FR2048, 38000 Grenoble, France; (F.P.); (P.M.); (M.J.); (J.B.); (Y.V.); (B.G.); (M.L.); (A.-M.H.); (A.K.); (J.G.); (C.B.); (Y.C.)
| | - Justine Barthelon
- Univ. Grenoble Alpes, INSERM, CEA, UMR BioSanté U1292, CNRS, CEA, FR2048, 38000 Grenoble, France; (F.P.); (P.M.); (M.J.); (J.B.); (Y.V.); (B.G.); (M.L.); (A.-M.H.); (A.K.); (J.G.); (C.B.); (Y.C.)
- Clinique Universitaire d’Hépato-gastroentérologie, Centre Hospitalier Universitaire Grenoble, 38000 Grenoble, France; (V.L.); (J.-P.Z.)
| | - Marion Darnaud
- Hepatobiliary Centre, Paul-Brousse University Hospital, INSERM U1193, 94800 Villejuif, France; (M.D.); (C.L.); (D.S.); (P.I.)
- Faculté de Médecine, Université Paris-Sud, Université Paris-Saclay, 94270 Le Kremlin-Bicêtre, France
| | - Claire Lacoste
- Hepatobiliary Centre, Paul-Brousse University Hospital, INSERM U1193, 94800 Villejuif, France; (M.D.); (C.L.); (D.S.); (P.I.)
- Faculté de Médecine, Université Paris-Sud, Université Paris-Saclay, 94270 Le Kremlin-Bicêtre, France
| | - Yves Vandenbrouck
- Univ. Grenoble Alpes, INSERM, CEA, UMR BioSanté U1292, CNRS, CEA, FR2048, 38000 Grenoble, France; (F.P.); (P.M.); (M.J.); (J.B.); (Y.V.); (B.G.); (M.L.); (A.-M.H.); (A.K.); (J.G.); (C.B.); (Y.C.)
| | - Benoît Gilquin
- Univ. Grenoble Alpes, INSERM, CEA, UMR BioSanté U1292, CNRS, CEA, FR2048, 38000 Grenoble, France; (F.P.); (P.M.); (M.J.); (J.B.); (Y.V.); (B.G.); (M.L.); (A.-M.H.); (A.K.); (J.G.); (C.B.); (Y.C.)
- Univ. Grenoble Alpes, CEA, LETI, Clinatec, 38000 Grenoble, France
| | - Mathilde Louwagie
- Univ. Grenoble Alpes, INSERM, CEA, UMR BioSanté U1292, CNRS, CEA, FR2048, 38000 Grenoble, France; (F.P.); (P.M.); (M.J.); (J.B.); (Y.V.); (B.G.); (M.L.); (A.-M.H.); (A.K.); (J.G.); (C.B.); (Y.C.)
| | - Anne-Marie Hesse
- Univ. Grenoble Alpes, INSERM, CEA, UMR BioSanté U1292, CNRS, CEA, FR2048, 38000 Grenoble, France; (F.P.); (P.M.); (M.J.); (J.B.); (Y.V.); (B.G.); (M.L.); (A.-M.H.); (A.K.); (J.G.); (C.B.); (Y.C.)
| | - Alexandra Kraut
- Univ. Grenoble Alpes, INSERM, CEA, UMR BioSanté U1292, CNRS, CEA, FR2048, 38000 Grenoble, France; (F.P.); (P.M.); (M.J.); (J.B.); (Y.V.); (B.G.); (M.L.); (A.-M.H.); (A.K.); (J.G.); (C.B.); (Y.C.)
| | - Jérôme Garin
- Univ. Grenoble Alpes, INSERM, CEA, UMR BioSanté U1292, CNRS, CEA, FR2048, 38000 Grenoble, France; (F.P.); (P.M.); (M.J.); (J.B.); (Y.V.); (B.G.); (M.L.); (A.-M.H.); (A.K.); (J.G.); (C.B.); (Y.C.)
| | - Vincent Leroy
- Clinique Universitaire d’Hépato-gastroentérologie, Centre Hospitalier Universitaire Grenoble, 38000 Grenoble, France; (V.L.); (J.-P.Z.)
- Institute for Advanced Biosciences, Université Grenoble Alpes, CNRS, INSERM U1209, 38000 Grenoble, France
| | - Jean-Pierre Zarski
- Clinique Universitaire d’Hépato-gastroentérologie, Centre Hospitalier Universitaire Grenoble, 38000 Grenoble, France; (V.L.); (J.-P.Z.)
- Institute for Advanced Biosciences, Université Grenoble Alpes, CNRS, INSERM U1209, 38000 Grenoble, France
| | - Christophe Bruley
- Univ. Grenoble Alpes, INSERM, CEA, UMR BioSanté U1292, CNRS, CEA, FR2048, 38000 Grenoble, France; (F.P.); (P.M.); (M.J.); (J.B.); (Y.V.); (B.G.); (M.L.); (A.-M.H.); (A.K.); (J.G.); (C.B.); (Y.C.)
| | - Yohann Couté
- Univ. Grenoble Alpes, INSERM, CEA, UMR BioSanté U1292, CNRS, CEA, FR2048, 38000 Grenoble, France; (F.P.); (P.M.); (M.J.); (J.B.); (Y.V.); (B.G.); (M.L.); (A.-M.H.); (A.K.); (J.G.); (C.B.); (Y.C.)
| | - Didier Samuel
- Hepatobiliary Centre, Paul-Brousse University Hospital, INSERM U1193, 94800 Villejuif, France; (M.D.); (C.L.); (D.S.); (P.I.)
- Faculté de Médecine, Université Paris-Sud, Université Paris-Saclay, 94270 Le Kremlin-Bicêtre, France
| | - Philippe Ichai
- Hepatobiliary Centre, Paul-Brousse University Hospital, INSERM U1193, 94800 Villejuif, France; (M.D.); (C.L.); (D.S.); (P.I.)
- Faculté de Médecine, Université Paris-Sud, Université Paris-Saclay, 94270 Le Kremlin-Bicêtre, France
| | - Jamila Faivre
- Hepatobiliary Centre, Paul-Brousse University Hospital, INSERM U1193, 94800 Villejuif, France; (M.D.); (C.L.); (D.S.); (P.I.)
- Faculté de Médecine, Université Paris-Sud, Université Paris-Saclay, 94270 Le Kremlin-Bicêtre, France
- Assistance Publique-Hôpitaux de Paris (AP-HP), Pôle de Biologie Médicale, Paul-Brousse University Hospital, 94800 Villejuif, France
- Correspondence: (J.F.); (V.B.)
| | - Virginie Brun
- Univ. Grenoble Alpes, INSERM, CEA, UMR BioSanté U1292, CNRS, CEA, FR2048, 38000 Grenoble, France; (F.P.); (P.M.); (M.J.); (J.B.); (Y.V.); (B.G.); (M.L.); (A.-M.H.); (A.K.); (J.G.); (C.B.); (Y.C.)
- Univ. Grenoble Alpes, CEA, LETI, Clinatec, 38000 Grenoble, France
- Correspondence: (J.F.); (V.B.)
| |
Collapse
|
15
|
Shao D, Huang L, Wang Y, Cui X, Li Y, Wang Y, Ma Q, Du W, Cui J. HBFP: a new repository for human body fluid proteome. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2021; 2021:6395039. [PMID: 34642750 PMCID: PMC8516408 DOI: 10.1093/database/baab065] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/12/2021] [Revised: 09/23/2021] [Accepted: 09/28/2021] [Indexed: 12/15/2022]
Abstract
Body fluid proteome has been intensively studied as a primary source for disease
biomarker discovery. Using advanced proteomics technologies, early research
success has resulted in increasingly accumulated proteins detected in different
body fluids, among which many are promising biomarkers. However, despite a
handful of small-scale and specific data resources, current research is clearly
lacking effort compiling published body fluid proteins into a centralized and
sustainable repository that can provide users with systematic analytic tools. In
this study, we developed a new database of human body fluid proteome (HBFP) that
focuses on experimentally validated proteome in 17 types of human body fluids.
The current database archives 11 827 unique proteins reported by 164
scientific publications, with a maximal false discovery rate of 0.01 on both the
peptide and protein levels since 2001, and enables users to query, analyze and
download protein entries with respect to each body fluid. Three unique features
of this new system include the following: (i) the protein annotation page
includes detailed abundance information based on relative qualitative measures
of peptides reported in the original references, (ii) a new score is calculated
on each reported protein to indicate the discovery confidence and (iii) HBFP
catalogs 7354 proteins with at least two non-nested uniquely mapping peptides of
nine amino acids according to the Human Proteome Project Data Interpretation
Guidelines, while the remaining 4473 proteins have more than two unique peptides
without given sequence information. As an important resource for human protein
secretome, we anticipate that this new HBFP database can be a powerful tool that
facilitates research in clinical proteomics and biomarker discovery. Database URL:https://bmbl.bmi.osumc.edu/HBFP/
Collapse
Affiliation(s)
- Dan Shao
- Department of Computer Science and Engineering, University of Nebraska-Lincoln, 122E Avery Hall, 1144 T St., Lincoln, NE 68588, USA.,Key Laboratory of Symbol Computation and Knowledge Engineering of Ministry of Education, College of Computer Science and Technology, Jilin University, 2699 Qianjin Street, Changchun 130012, China.,Department of Computer Science and Technology, Changchun University, 6543 Weixing Road, Changchun 130022, China
| | - Lan Huang
- Key Laboratory of Symbol Computation and Knowledge Engineering of Ministry of Education, College of Computer Science and Technology, Jilin University, 2699 Qianjin Street, Changchun 130012, China
| | - Yan Wang
- Key Laboratory of Symbol Computation and Knowledge Engineering of Ministry of Education, College of Computer Science and Technology, Jilin University, 2699 Qianjin Street, Changchun 130012, China
| | - Xueteng Cui
- Department of Computer Science and Technology, Changchun University, 6543 Weixing Road, Changchun 130022, China
| | - Yufei Li
- Department of Computer Science and Technology, Changchun University, 6543 Weixing Road, Changchun 130022, China
| | - Yao Wang
- Key Laboratory of Symbol Computation and Knowledge Engineering of Ministry of Education, College of Computer Science and Technology, Jilin University, 2699 Qianjin Street, Changchun 130012, China
| | - Qin Ma
- Department of Biomedical Informatics, College of Medicine, The Ohio State University, 310G Lincoln tower, 1800 cannon drive, Columbus, OH 43210, USA
| | - Wei Du
- Key Laboratory of Symbol Computation and Knowledge Engineering of Ministry of Education, College of Computer Science and Technology, Jilin University, 2699 Qianjin Street, Changchun 130012, China
| | - Juan Cui
- Department of Computer Science and Engineering, University of Nebraska-Lincoln, 122E Avery Hall, 1144 T St., Lincoln, NE 68588, USA
| |
Collapse
|
16
|
Proteomic Tools for the Analysis of Cytoskeleton Proteins. METHODS IN MOLECULAR BIOLOGY (CLIFTON, N.J.) 2021; 2364:363-425. [PMID: 34542864 DOI: 10.1007/978-1-0716-1661-1_19] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
Abstract
Proteomic analyses have become an essential part of the toolkit of the molecular biologist, given the widespread availability of genomic data and open source or freely accessible bioinformatics software. Tools are available for detecting homologous sequences, recognizing functional domains, and modeling the three-dimensional structure for any given protein sequence, as well as for predicting interactions with other proteins or macromolecules. Although a wealth of structural and functional information is available for many cytoskeletal proteins, with representatives spanning all of the major subfamilies, the majority of cytoskeletal proteins remain partially or totally uncharacterized. Moreover, bioinformatics tools provide a means for studying the effects of synthetic mutations or naturally occurring variants of these cytoskeletal proteins. This chapter discusses various freely available proteomic analysis tools, with a focus on in silico prediction of protein structure and function. The selected tools are notable for providing an easily accessible interface for the novice while retaining advanced functionality for more experienced computational biologists.
Collapse
|
17
|
Baltoumas FA, Zafeiropoulou S, Karatzas E, Koutrouli M, Thanati F, Voutsadaki K, Gkonta M, Hotova J, Kasionis I, Hatzis P, Pavlopoulos GA. Biomolecule and Bioentity Interaction Databases in Systems Biology: A Comprehensive Review. Biomolecules 2021; 11:1245. [PMID: 34439912 PMCID: PMC8391349 DOI: 10.3390/biom11081245] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2021] [Revised: 08/16/2021] [Accepted: 08/18/2021] [Indexed: 02/06/2023] Open
Abstract
Technological advances in high-throughput techniques have resulted in tremendous growth of complex biological datasets providing evidence regarding various biomolecular interactions. To cope with this data flood, computational approaches, web services, and databases have been implemented to deal with issues such as data integration, visualization, exploration, organization, scalability, and complexity. Nevertheless, as the number of such sets increases, it is becoming more and more difficult for an end user to know what the scope and focus of each repository is and how redundant the information between them is. Several repositories have a more general scope, while others focus on specialized aspects, such as specific organisms or biological systems. Unfortunately, many of these databases are self-contained or poorly documented and maintained. For a clearer view, in this article we provide a comprehensive categorization, comparison and evaluation of such repositories for different bioentity interaction types. We discuss most of the publicly available services based on their content, sources of information, data representation methods, user-friendliness, scope and interconnectivity, and we comment on their strengths and weaknesses. We aim for this review to reach a broad readership varying from biomedical beginners to experts and serve as a reference article in the field of Network Biology.
Collapse
Affiliation(s)
- Fotis A. Baltoumas
- Institute for Fundamental Biomedical Research, Biomedical Sciences Research Center “Alexander Fleming”, 16672 Vari, Greece; (S.Z.); (E.K.); (M.K.); (F.T.); (K.V.); (M.G.); (J.H.); (I.K.); (P.H.)
| | - Sofia Zafeiropoulou
- Institute for Fundamental Biomedical Research, Biomedical Sciences Research Center “Alexander Fleming”, 16672 Vari, Greece; (S.Z.); (E.K.); (M.K.); (F.T.); (K.V.); (M.G.); (J.H.); (I.K.); (P.H.)
| | - Evangelos Karatzas
- Institute for Fundamental Biomedical Research, Biomedical Sciences Research Center “Alexander Fleming”, 16672 Vari, Greece; (S.Z.); (E.K.); (M.K.); (F.T.); (K.V.); (M.G.); (J.H.); (I.K.); (P.H.)
| | - Mikaela Koutrouli
- Institute for Fundamental Biomedical Research, Biomedical Sciences Research Center “Alexander Fleming”, 16672 Vari, Greece; (S.Z.); (E.K.); (M.K.); (F.T.); (K.V.); (M.G.); (J.H.); (I.K.); (P.H.)
- Novo Nordisk Foundation Center for Protein Research, University of Copenhagen, 2200 Copenhagen, Denmark
| | - Foteini Thanati
- Institute for Fundamental Biomedical Research, Biomedical Sciences Research Center “Alexander Fleming”, 16672 Vari, Greece; (S.Z.); (E.K.); (M.K.); (F.T.); (K.V.); (M.G.); (J.H.); (I.K.); (P.H.)
| | - Kleanthi Voutsadaki
- Institute for Fundamental Biomedical Research, Biomedical Sciences Research Center “Alexander Fleming”, 16672 Vari, Greece; (S.Z.); (E.K.); (M.K.); (F.T.); (K.V.); (M.G.); (J.H.); (I.K.); (P.H.)
| | - Maria Gkonta
- Institute for Fundamental Biomedical Research, Biomedical Sciences Research Center “Alexander Fleming”, 16672 Vari, Greece; (S.Z.); (E.K.); (M.K.); (F.T.); (K.V.); (M.G.); (J.H.); (I.K.); (P.H.)
| | - Joana Hotova
- Institute for Fundamental Biomedical Research, Biomedical Sciences Research Center “Alexander Fleming”, 16672 Vari, Greece; (S.Z.); (E.K.); (M.K.); (F.T.); (K.V.); (M.G.); (J.H.); (I.K.); (P.H.)
| | - Ioannis Kasionis
- Institute for Fundamental Biomedical Research, Biomedical Sciences Research Center “Alexander Fleming”, 16672 Vari, Greece; (S.Z.); (E.K.); (M.K.); (F.T.); (K.V.); (M.G.); (J.H.); (I.K.); (P.H.)
| | - Pantelis Hatzis
- Institute for Fundamental Biomedical Research, Biomedical Sciences Research Center “Alexander Fleming”, 16672 Vari, Greece; (S.Z.); (E.K.); (M.K.); (F.T.); (K.V.); (M.G.); (J.H.); (I.K.); (P.H.)
- Center for New Biotechnologies and Precision Medicine, School of Medicine, National and Kapodistrian University of Athens, 11527 Athens, Greece
| | - Georgios A. Pavlopoulos
- Institute for Fundamental Biomedical Research, Biomedical Sciences Research Center “Alexander Fleming”, 16672 Vari, Greece; (S.Z.); (E.K.); (M.K.); (F.T.); (K.V.); (M.G.); (J.H.); (I.K.); (P.H.)
- Center for New Biotechnologies and Precision Medicine, School of Medicine, National and Kapodistrian University of Athens, 11527 Athens, Greece
| |
Collapse
|
18
|
Omenn GS. Reflections on the HUPO Human Proteome Project, the Flagship Project of the Human Proteome Organization, at 10 Years. Mol Cell Proteomics 2021; 20:100062. [PMID: 33640492 PMCID: PMC8058560 DOI: 10.1016/j.mcpro.2021.100062] [Citation(s) in RCA: 22] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2020] [Revised: 02/04/2021] [Accepted: 02/05/2021] [Indexed: 02/08/2023] Open
Abstract
We celebrate the 10th anniversary of the launch of the HUPO Human Proteome Project (HPP) and its major milestone of confident detection of at least one protein from each of 90% of the predicted protein-coding genes, based on the output of the entire proteomics community. The Human Genome Project reached a similar decadal milestone 20 years ago. The HPP has engaged proteomics teams around the world, strongly influenced data-sharing, enhanced quality assurance, and issued stringent guidelines for claims of detecting previously "missing proteins." This invited perspective complements papers on "A High-Stringency Blueprint of the Human Proteome" and "The Human Proteome Reaches a Major Milestone" in special issues of Nature Communications and Journal of Proteome Research, respectively, released in conjunction with the October 2020 virtual HUPO Congress and its celebration of the 10th anniversary of the HUPO HPP.
Collapse
Affiliation(s)
- Gilbert S Omenn
- University of Michigan Medical School, Departments of Computational Medicine & Bioinformatics, Internal Medicine, Human Genetics, and School of Public Health, Ann Arbor, Michigan, USA.
| |
Collapse
|
19
|
Kamdar MR, Musen MA. An empirical meta-analysis of the life sciences linked open data on the web. Sci Data 2021; 8:24. [PMID: 33479214 PMCID: PMC7819992 DOI: 10.1038/s41597-021-00797-y] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2020] [Accepted: 12/04/2020] [Indexed: 01/29/2023] Open
Abstract
While the biomedical community has published several "open data" sources in the last decade, most researchers still endure severe logistical and technical challenges to discover, query, and integrate heterogeneous data and knowledge from multiple sources. To tackle these challenges, the community has experimented with Semantic Web and linked data technologies to create the Life Sciences Linked Open Data (LSLOD) cloud. In this paper, we extract schemas from more than 80 biomedical linked open data sources into an LSLOD schema graph and conduct an empirical meta-analysis to evaluate the extent of semantic heterogeneity across the LSLOD cloud. We observe that several LSLOD sources exist as stand-alone data sources that are not inter-linked with other sources, use unpublished schemas with minimal reuse or mappings, and have elements that are not useful for data integration from a biomedical perspective. We envision that the LSLOD schema graph and the findings from this research will aid researchers who wish to query and integrate data and knowledge from multiple biomedical sources simultaneously on the Web.
Collapse
Affiliation(s)
- Maulik R Kamdar
- Center for Biomedical Informatics Research, Stanford University, Stanford, CA, USA.
- Elsevier Health Markets, Philadelphia, PA, USA.
| | - Mark A Musen
- Center for Biomedical Informatics Research, Stanford University, Stanford, CA, USA
| |
Collapse
|
20
|
Combes F, Loux V, Vandenbrouck Y. GO Enrichment Analysis for Differential Proteomics Using ProteoRE. Methods Mol Biol 2021; 2361:179-196. [PMID: 34236662 DOI: 10.1007/978-1-0716-1641-3_11] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
With the increased simplicity of producing proteomics data, the bottleneck has now shifted to the functional analysis of large lists of proteins to translate this primary level of information into meaningful biological knowledge. Tools implementing such approach are a powerful way to gain biological insights related to their samples, provided that biologists/clinicians have access to computational solutions even when they have little programming experience or bioinformatics support. To achieve this goal, we designed ProteoRE (Proteomics Research Environment), a unified online research service that provides end-users with a set of tools to interpret their proteomics data in a collaborative and reproducible manner. ProteoRE is built upon the Galaxy framework, a workflow system allowing for data and analysis persistence, and providing user interfaces to facilitate the interaction with tools dedicated to the functional and the visual analysis of proteomics datasets. A set of tools relying on computational methods selected for their complementarity in terms of functional analysis was developed and made accessible via the ProteoRE web portal. In this chapter, a step-by-step protocol linking these tools is designed to perform a functional annotation and GO-based enrichment analyses applied to a set of differentially expressed proteins as a use case. Analytical practices, guidelines as well as tips related to this strategy are also provided. Tools, datasets, and results are freely available at http://www.proteore.org , allowing researchers to reuse them.
Collapse
Affiliation(s)
- Florence Combes
- Université Grenoble Alpes, INSERM, CEA, UMR BioSanté U1292, Grenoble, France
| | - Valentin Loux
- Université Paris-Saclay, INRAE, MaIAGE, Jouy-en-Josas, France
- Université Paris-Saclay, INRAE, BioinfOmics, MIGALE bioinformatics facility, Jouy-en-Josas, France
| | - Yves Vandenbrouck
- Université Grenoble Alpes, INSERM, CEA, UMR BioSanté U1292, Grenoble, France.
| |
Collapse
|
21
|
Kaushal P, Lee C. N-terminomics - its past and recent advancements. J Proteomics 2020; 233:104089. [PMID: 33359939 DOI: 10.1016/j.jprot.2020.104089] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2020] [Revised: 07/22/2020] [Accepted: 12/20/2020] [Indexed: 02/06/2023]
Abstract
N-terminomics is a rapidly evolving branch of proteomics that encompasses the study of protein N-terminal sequence. A proteome-wide collection of such sequences has been widely used to understand the proteolytic cascades and in annotating the genome. Over the last two decades, various N-terminomic strategies have been developed for achieving high sensitivity, greater depth of coverage, and high-throughputness. We, in this review, cover how the field of N-terminomics has evolved to date, including discussion on various sample preparation and N-terminal peptide enrichment strategies. We also compare different N-terminomic methods and highlight their relative benefits and shortcomings in their implementation. In addition, an overview of the currently available bioinformatics tools and data analysis pipelines for the annotation of N-terminomic datasets is also included. SIGNIFICANCE: It has been recognized that proteins undergo several post-translational modifications (PTM), and a number of perturbed biological pathways are directly associated with modifications at the terminal sites of a protein. In this regard, N-terminomics can be applied to generate a proteome-wide landscape of mature N-terminal sequences, annotate their source of generation, and recognize their significance in the biological pathways. Besides, a system-wide study can be used to study complicated proteolytic machinery and protease cleavage patterns for potential therapeutic targets. Moreover, due to unprecedented improvements in the analytical methods and mass spectrometry instrumentation in recent times, the N-terminomic methodologies now offers an unparalleled ability to study proteoforms and their implications in clinical conditions. Such approaches can further be applied for the detection of low abundant proteoforms, annotation of non-canonical protein coding sites, identification of candidate disease biomarkers, and, last but not least, the discovery of novel drug targets.
Collapse
Affiliation(s)
- Prashant Kaushal
- Center for Theragnosis, Korea Institute of Science and Technology, Seoul 02792, Republic of Korea; Division of Bio-Medical Science & Technology, KIST School, Korea University of Science and Technology, Seoul 02792, Republic of Korea
| | - Cheolju Lee
- Center for Theragnosis, Korea Institute of Science and Technology, Seoul 02792, Republic of Korea; Division of Bio-Medical Science & Technology, KIST School, Korea University of Science and Technology, Seoul 02792, Republic of Korea; KHU-KIST Department of Converging Science and Technology, Kyung Hee University, 26 Kyunghee-daero, Dongdaemun-gu, Seoul 02447, Republic of Korea.
| |
Collapse
|
22
|
Affiliation(s)
- Christopher M Overall
- Centre for Blood Research, Departments of Oral Biological & Medical Sciences, and Biochemistry & Molecular Biology, Faculty of Dentistry, The University of British Columbia, Vancouver, British Columbia V6T 1Z3, Canada
| |
Collapse
|
23
|
Overall CM. The HUPO High-Stringency Inventory of Humanity's Shared Human Proteome Revealed. J Proteome Res 2020; 19:4211-4214. [PMID: 33074000 DOI: 10.1021/acs.jproteome.0c00794] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Affiliation(s)
- Christopher M Overall
- Centre for Blood Research, Departments of Oral Biological & Medical Sciences and Biochemistry & Molecular Biology, Faculty of Dentistry, The University of British Columbia, Vancouver, British Columbia V6T 1Z3, Canada
| |
Collapse
|
24
|
Adhikari S, Nice EC, Deutsch EW, Lane L, Omenn GS, Pennington SR, Paik YK, Overall CM, Corrales FJ, Cristea IM, Van Eyk JE, Uhlén M, Lindskog C, Chan DW, Bairoch A, Waddington JC, Justice JL, LaBaer J, Rodriguez H, He F, Kostrzewa M, Ping P, Gundry RL, Stewart P, Srivastava S, Srivastava S, Nogueira FCS, Domont GB, Vandenbrouck Y, Lam MPY, Wennersten S, Vizcaino JA, Wilkins M, Schwenk JM, Lundberg E, Bandeira N, Marko-Varga G, Weintraub ST, Pineau C, Kusebauch U, Moritz RL, Ahn SB, Palmblad M, Snyder MP, Aebersold R, Baker MS. A high-stringency blueprint of the human proteome. Nat Commun 2020; 11:5301. [PMID: 33067450 PMCID: PMC7568584 DOI: 10.1038/s41467-020-19045-9] [Citation(s) in RCA: 134] [Impact Index Per Article: 33.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2019] [Accepted: 09/25/2020] [Indexed: 02/07/2023] Open
Abstract
The Human Proteome Organization (HUPO) launched the Human Proteome Project (HPP) in 2010, creating an international framework for global collaboration, data sharing, quality assurance and enhancing accurate annotation of the genome-encoded proteome. During the subsequent decade, the HPP established collaborations, developed guidelines and metrics, and undertook reanalysis of previously deposited community data, continuously increasing the coverage of the human proteome. On the occasion of the HPP's tenth anniversary, we here report a 90.4% complete high-stringency human proteome blueprint. This knowledge is essential for discerning molecular processes in health and disease, as we demonstrate by highlighting potential roles the human proteome plays in our understanding, diagnosis and treatment of cancers, cardiovascular and infectious diseases.
Collapse
Affiliation(s)
- Subash Adhikari
- Faculty of Medicine, Health and Human Sciences, Department of Biomedical Sciences, Macquarie University, North Ryde, NSW, 2109, Australia
| | - Edouard C Nice
- Faculty of Medicine, Health and Human Sciences, Department of Biomedical Sciences, Macquarie University, North Ryde, NSW, 2109, Australia
- Faculty of Medicine, Nursing and Health Sciences, Department of Biochemistry and Molecular Biology, Monash University, Melbourne, VIC, 3800, Australia
| | - Eric W Deutsch
- Institute for Systems Biology, 401 Terry Avenue North, Seattle, WA, 98109, USA
| | - Lydie Lane
- Faculty of Medicine, SIB-Swiss Institute of Bioinformatics and Department of Microbiology and Molecular Medicine, University of Geneva, CMU, Michel-Servet 1, 1211, Geneva, Switzerland
| | - Gilbert S Omenn
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, 48109-2218, USA
| | - Stephen R Pennington
- UCD Conway Institute of Biomolecular and Biomedical Research, School of Medicine, University College Dublin, Dublin, Ireland
| | - Young-Ki Paik
- Yonsei Proteome Research Center, 50 Yonsei-ro, Sudaemoon-ku, Seoul, 120-749, South Korea
| | | | - Fernando J Corrales
- Functional Proteomics Laboratory, Centro Nacional de Biotecnología-CSIC, Proteored-ISCIII, 28049, Madrid, Spain
| | - Ileana M Cristea
- Department of Molecular Biology, Princeton University, Princeton, NJ, 08544, USA
| | - Jennifer E Van Eyk
- Cedars Sinai Medical Center, Advanced Clinical Biosystems Research Institute, The Smidt Heart Institute, Los Angeles, CA, 90048, USA
| | - Mathias Uhlén
- Science for Life Laboratory, School of Engineering Sciences in Chemistry, Biotechnology and Health, KTH Royal Institute of Technology, 17121, Solna, Sweden
| | - Cecilia Lindskog
- Rudbeck Laboratory, Department of Immunology, Genetics and Pathology, Uppsala University, 75185, Uppsala, Sweden
| | - Daniel W Chan
- Department of Pathology and Oncology, Johns Hopkins University School of Medicine, Baltimore, MD, 21224, USA
| | - Amos Bairoch
- Faculty of Medicine, SIB-Swiss Institute of Bioinformatics and Department of Microbiology and Molecular Medicine, University of Geneva, CMU, Michel-Servet 1, 1211, Geneva, Switzerland
| | - James C Waddington
- UCD Conway Institute of Biomolecular and Biomedical Research, School of Medicine, University College Dublin, Dublin, Ireland
| | - Joshua L Justice
- Department of Molecular Biology, Princeton University, Princeton, NJ, 08544, USA
| | - Joshua LaBaer
- Biodesign Institute, Arizona State University, Tempe, AZ, USA
| | - Henry Rodriguez
- Office of Cancer Clinical Proteomics Research, National Cancer Institute, NIH, Bethesda, MD, 20892, USA
| | - Fuchu He
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Beijing Institute of Lifeomics, Beijing, 102206, China
| | - Markus Kostrzewa
- Bruker Daltonik GmbH, Microbiology and Diagnostics, Fahrenheitstrasse, 428359, Bremen, Germany
| | - Peipei Ping
- Cardiac Proteomics and Signaling Laboratory, Department of Physiology, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, CA, USA
| | - Rebekah L Gundry
- CardiOmics Program, Center for Heart and Vascular Research, Division of Cardiovascular Medicine and Department of Cellular and Integrative Physiology, University of Nebraska Medical Center, Omaha, NE, 68198, USA
| | - Peter Stewart
- Department of Chemical Pathology, Royal Prince Alfred Hospital, Camperdown, NSW, Australia
| | | | - Sudhir Srivastava
- Cancer Biomarkers Research Branch, National Cancer Institute, National Institutes of Health, 9609 Medical Center Drive, Suite 5E136, Rockville, MD, 20852, USA
| | - Fabio C S Nogueira
- Proteomics Unit and Laboratory of Proteomics, Institute of Chemistry, Federal University of Rio de Janeiro, Av Athos da Silveria Ramos, 149, 21941-909, Rio de Janeiro, RJ, Brazil
| | - Gilberto B Domont
- Proteomics Unit and Laboratory of Proteomics, Institute of Chemistry, Federal University of Rio de Janeiro, Av Athos da Silveria Ramos, 149, 21941-909, Rio de Janeiro, RJ, Brazil
| | - Yves Vandenbrouck
- University of Grenoble Alpes, Inserm, CEA, IRIG-BGE, U1038, 38000, Grenoble, France
| | - Maggie P Y Lam
- Departments of Medicine-Cardiology and Biochemistry and Molecular Genetics, University of Colorado, Anschutz Medical Campus, Aurora, CO, USA
- Consortium for Fibrosis Research and Translation, University of Colorado, Anschutz Medical Campus, Aurora, CO, USA
| | - Sara Wennersten
- Division of Cardiology, Department of Medicine, University of Colorado, Anschutz Medical Campus, Aurora, CO, USA
| | - Juan Antonio Vizcaino
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Marc Wilkins
- School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, NSW, Australia
| | - Jochen M Schwenk
- Science for Life Laboratory, School of Engineering Sciences in Chemistry, Biotechnology and Health, KTH Royal Institute of Technology, 17121, Solna, Sweden
| | - Emma Lundberg
- Science for Life Laboratory, School of Engineering Sciences in Chemistry, Biotechnology and Health, KTH Royal Institute of Technology, 17121, Solna, Sweden
| | - Nuno Bandeira
- Department of Computer Science and Engineering, University of California, San Diego, 9500 Gilman Drive, Mail Code 0404, La Jolla, CA, 92093-0404, USA
| | | | - Susan T Weintraub
- Department of Biochemistry and Structural Biology, University of Texas Health Science Center San Antonio, UT Health, 7703 Floyd Curl Drive, San Antonio, TX, 78229-3900, USA
| | - Charles Pineau
- University of Rennes, Inserm, EHESP, IREST, UMR_S 1085, F-35042, Rennes, France
| | - Ulrike Kusebauch
- Institute for Systems Biology, 401 Terry Avenue North, Seattle, WA, 98109, USA
| | - Robert L Moritz
- Institute for Systems Biology, 401 Terry Avenue North, Seattle, WA, 98109, USA
| | - Seong Beom Ahn
- Faculty of Medicine, Health and Human Sciences, Department of Biomedical Sciences, Macquarie University, North Ryde, NSW, 2109, Australia
| | - Magnus Palmblad
- Leiden University Medical Center, Leiden, 2333, The Netherlands
| | - Michael P Snyder
- Department of Genetics, Stanford School of Medicine, Stanford, CA, 94305, USA
| | - Ruedi Aebersold
- Institute for Systems Biology, 401 Terry Avenue North, Seattle, WA, 98109, USA
- Faculty of Science, University of Zurich, Zurich, Switzerland
| | - Mark S Baker
- Faculty of Medicine, Health and Human Sciences, Department of Biomedical Sciences, Macquarie University, North Ryde, NSW, 2109, Australia.
- Department of Genetics, Stanford School of Medicine, Stanford, CA, 94305, USA.
| |
Collapse
|
25
|
Cerciello F, Choi M, Sinicropi-Yao SL, Lomeo K, Amann JM, Felley-Bosco E, Stahel RA, Robinson BWS, Creaney J, Pass HI, Vitek O, Carbone DP. Verification of a Blood-Based Targeted Proteomics Signature for Malignant Pleural Mesothelioma. Cancer Epidemiol Biomarkers Prev 2020; 29:1973-1982. [PMID: 32732250 PMCID: PMC7541795 DOI: 10.1158/1055-9965.epi-20-0543] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2020] [Revised: 06/18/2020] [Accepted: 07/27/2020] [Indexed: 11/16/2022] Open
Abstract
BACKGROUND We have verified a mass spectrometry (MS)-based targeted proteomics signature for the detection of malignant pleural mesothelioma (MPM) from the blood. METHODS A seven-peptide biomarker MPM signature by targeted proteomics in serum was identified in a previous independent study. Here, we have verified the predictive accuracy of a reduced version of that signature, now composed of six-peptide biomarkers. We have applied liquid chromatography-selected reaction monitoring (LC-SRM), also known as multiple-reaction monitoring (MRM), for the investigation of 402 serum samples from 213 patients with MPM and 189 cancer-free asbestos-exposed donors from the United States, Australia, and Europe. RESULTS Each of the biomarkers composing the signature was independently informative, with no apparent functional or physical relation to each other. The multiplexing possibility offered by MS proteomics allowed their integration into a single signature with a higher discriminating capacity than that of the single biomarkers alone. The strategy allowed in this way to increase their potential utility for clinical decisions. The signature discriminated patients with MPM and asbestos-exposed donors with AUC of 0.738. For early-stage MPM, AUC was 0.765. This signature was also prognostic, and Kaplan-Meier analysis showed a significant difference between high- and low-risk groups with an HR of 1.659 (95% CI, 1.075-2.562; P = 0.021). CONCLUSIONS Targeted proteomics allowed the development of a multianalyte signature with diagnostic and prognostic potential for MPM from the blood. IMPACT The proteomic signature represents an additional diagnostic approach for informing clinical decisions for patients at risk for MPM.
Collapse
Affiliation(s)
- Ferdinando Cerciello
- James Thoracic Center, James Cancer Center, The Ohio State University Medical Center, Columbus, Ohio.
| | - Meena Choi
- College of Computer and Information Science, Northeastern University, Boston, Massachusetts
| | - Sara L Sinicropi-Yao
- James Thoracic Center, James Cancer Center, The Ohio State University Medical Center, Columbus, Ohio
| | - Katie Lomeo
- James Thoracic Center, James Cancer Center, The Ohio State University Medical Center, Columbus, Ohio
| | - Joseph M Amann
- James Thoracic Center, James Cancer Center, The Ohio State University Medical Center, Columbus, Ohio
| | - Emanuela Felley-Bosco
- Laboratory of Molecular Oncology, Division of Thoracic Surgery, University Hospital Zürich, Zürich, Switzerland
| | - Rolf A Stahel
- Department of Oncology, Center of Hematology and Oncology, Comprehensive Cancer Center Zürich, University Hospital Zürich, Zürich, Switzerland
| | - Bruce W S Robinson
- National Centre for Asbestos Related Disease, University of Western Australia, School of Medicine and Pharmacology, Nedlands, Western Australia
| | - Jenette Creaney
- National Centre for Asbestos Related Disease, University of Western Australia, School of Medicine and Pharmacology, Nedlands, Western Australia
| | - Harvey I Pass
- New York University, Langone Medical Center, New York, New York
| | - Olga Vitek
- College of Computer and Information Science, Northeastern University, Boston, Massachusetts
| | - David P Carbone
- James Thoracic Center, James Cancer Center, The Ohio State University Medical Center, Columbus, Ohio.
| |
Collapse
|
26
|
Aggarwal S, Banerjee SK, Talukdar NC, Yadav AK. Post-translational Modification Crosstalk and Hotspots in Sirtuin Interactors Implicated in Cardiovascular Diseases. Front Genet 2020; 11:356. [PMID: 32425973 PMCID: PMC7204943 DOI: 10.3389/fgene.2020.00356] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2019] [Accepted: 03/24/2020] [Indexed: 01/07/2023] Open
Abstract
Sirtuins are protein deacetylases that play a protective role in cardiovascular diseases (CVDs), as well as many other diseases. Absence of sirtuins can lead to hyperacetylation of both nuclear and mitochondrial proteins leading to metabolic dysregulation. The protein post-translational modifications (PTMs) are known to crosstalk among each other to bring about complex phenotypic outcomes. Various PTM types such as acetylation, ubiquitination, and phosphorylation, and so on, drive transcriptional regulation and metabolism, but such crosstalks are poorly understood. We integrated protein–protein interactions (PPI) and PTMs from several databases to integrate information on 1,251 sirtuin-interacting proteins, of which 544 are associated with cardiac diseases. Based on the ∼100,000 PTM sites obtained for sirtuin interactors, we observed that the frequency of PTM sites (83 per protein), as well as PTM types (five per protein), is higher than the global average for human proteome. We found that ∼60–70% PTM sites fall into ordered regions. Approximately 83% of the sirtuin interactors contained at least one competitive crosstalk (in situ) site, with half of the sites occurring in CVD-associated proteins. A large proportion of identified crosstalk sites were observed for acetylation and ubiquitination competition. We identified 614 proteins containing PTM hotspots (≥5 PTM sites) and 133 proteins containing crosstalk hotspots (≥3 crosstalk sites). We observed that a large proportion of disease-associated sequence variants were found in PTM motifs of CVD proteins. We identified seven proteins (TP53, LMNA, MAPT, ATP2A2, NCL, APEX1, and HIST1H3A) containing disease-associated variants in PTM and crosstalk hotspots. This is the first comprehensive bioinformatics analysis on sirtuin interactors with respect to PTMs and their crosstalks. This study forms a platform for generating interesting hypotheses that can be tested for a deeper mechanistic understanding gained or derived from big-data analytics.
Collapse
Affiliation(s)
- Suruchi Aggarwal
- Translational Health Science and Technology Institute, NCR Biotech Science Cluster, Faridabad, India.,Division of Life Sciences, Institute of Advanced Study in Science and Technology, Guwahati, India.,Department of Molecular Biology and Biotechnology, Cotton University, Guwahati, India
| | - Sanjay K Banerjee
- Translational Health Science and Technology Institute, NCR Biotech Science Cluster, Faridabad, India
| | - Narayan Chandra Talukdar
- Division of Life Sciences, Institute of Advanced Study in Science and Technology, Guwahati, India.,Department of Molecular Biology and Biotechnology, Cotton University, Guwahati, India
| | - Amit Kumar Yadav
- Translational Health Science and Technology Institute, NCR Biotech Science Cluster, Faridabad, India
| |
Collapse
|
27
|
Naryzhny S, Klopov N, Ronzhina N, Zorina E, Zgoda V, Kleyst O, Belyakova N, Legina O. A database for inventory of proteoform profiles: "2DE-pattern". Electrophoresis 2020; 41:1118-1124. [PMID: 32307725 DOI: 10.1002/elps.201900468] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2019] [Revised: 03/23/2020] [Accepted: 03/26/2020] [Indexed: 01/01/2023]
Abstract
The human proteome is composed of a diverse and heterogeneous range of gene products/proteoforms/protein species. Because of the growing amount of information about proteoforms generated by different methods, we need a convenient approach to make an inventory of the data. Here, we present a database of proteoforms that is based on information obtained by separation of proteoforms using 2DE followed by shotgun ESI-LC-MS/MS. The database's principles and structure are described. The database is called "2DE-pattern" as it contains multiple isoform-centric patterns of proteoforms separated according to 2DE principles. The database can be freely used at http://2de-pattern.pnpi.nrcki.ru.
Collapse
Affiliation(s)
- Stanislav Naryzhny
- Orekhovich Institute of Biomedical Chemistry of Russian Academy of Medical Sciences, Moscow, Russia.,B.P. Konstantinov Petersburg Nuclear Physics Institute, National Research Center "Kurchatov Institute", Gatchina, Russia
| | - Nikolay Klopov
- B.P. Konstantinov Petersburg Nuclear Physics Institute, National Research Center "Kurchatov Institute", Gatchina, Russia
| | - Natalia Ronzhina
- B.P. Konstantinov Petersburg Nuclear Physics Institute, National Research Center "Kurchatov Institute", Gatchina, Russia
| | - Elena Zorina
- Orekhovich Institute of Biomedical Chemistry of Russian Academy of Medical Sciences, Moscow, Russia
| | - Victor Zgoda
- Orekhovich Institute of Biomedical Chemistry of Russian Academy of Medical Sciences, Moscow, Russia
| | - Olga Kleyst
- B.P. Konstantinov Petersburg Nuclear Physics Institute, National Research Center "Kurchatov Institute", Gatchina, Russia
| | - Natalia Belyakova
- B.P. Konstantinov Petersburg Nuclear Physics Institute, National Research Center "Kurchatov Institute", Gatchina, Russia
| | - Olga Legina
- B.P. Konstantinov Petersburg Nuclear Physics Institute, National Research Center "Kurchatov Institute", Gatchina, Russia
| |
Collapse
|
28
|
Cascarina SM, Ross ED. Natural and pathogenic protein sequence variation affecting prion-like domains within and across human proteomes. BMC Genomics 2020; 21:23. [PMID: 31914925 PMCID: PMC6947906 DOI: 10.1186/s12864-019-6425-3] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2019] [Accepted: 12/23/2019] [Indexed: 12/12/2022] Open
Abstract
BACKGROUND Impaired proteostatic regulation of proteins with prion-like domains (PrLDs) is associated with a variety of human diseases including neurodegenerative disorders, myopathies, and certain forms of cancer. For many of these disorders, current models suggest a prion-like molecular mechanism of disease, whereby proteins aggregate and spread to neighboring cells in an infectious manner. The development of prion prediction algorithms has facilitated the large-scale identification of PrLDs among "reference" proteomes for various organisms. However, the degree to which intraspecies protein sequence diversity influences predicted prion propensity has not been systematically examined. RESULTS Here, we explore protein sequence variation introduced at genetic, post-transcriptional, and post-translational levels, and its influence on predicted aggregation propensity for human PrLDs. We find that sequence variation is relatively common among PrLDs and in some cases can result in relatively large differences in predicted prion propensity. Sequence variation introduced at the post-transcriptional level (via alternative splicing) also commonly affects predicted aggregation propensity, often by direct inclusion or exclusion of a PrLD. Finally, analysis of a database of sequence variants associated with human disease reveals a number of mutations within PrLDs that are predicted to increase prion propensity. CONCLUSIONS Our analyses expand the list of candidate human PrLDs, quantitatively estimate the effects of sequence variation on the aggregation propensity of PrLDs, and suggest the involvement of prion-like mechanisms in additional human diseases.
Collapse
Affiliation(s)
- Sean M Cascarina
- Department of Biochemistry and Molecular Biology, Colorado State University, Fort Collins, CO, 80523, USA
| | - Eric D Ross
- Department of Biochemistry and Molecular Biology, Colorado State University, Fort Collins, CO, 80523, USA.
| |
Collapse
|
29
|
Sima AC, Stockinger K, de Farias TM, Gil M. Semantic Integration and Enrichment of Heterogeneous Biological Databases. Methods Mol Biol 2020; 1910:655-690. [PMID: 31278681 DOI: 10.1007/978-1-4939-9074-0_22] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/25/2023]
Abstract
Biological databases are growing at an exponential rate, currently being among the major producers of Big Data, almost on par with commercial generators, such as YouTube or Twitter. While traditionally biological databases evolved as independent silos, each purposely built by a different research group in order to answer specific research questions; more recently significant efforts have been made toward integrating these heterogeneous sources into unified data access systems or interoperable systems using the FAIR principles of data sharing. Semantic Web technologies have been key enablers in this process, opening the path for new insights into the unified data, which were not visible at the level of each independent database. In this chapter, we first provide an introduction into two of the most used database models for biological data: relational databases and RDF stores. Next, we discuss ontology-based data integration, which serves to unify and enrich heterogeneous data sources. We present an extensive timeline of milestones in data integration based on Semantic Web technologies in the field of life sciences. Finally, we discuss some of the remaining challenges in making ontology-based data access (OBDA) systems easily accessible to a larger audience. In particular, we introduce natural language search interfaces, which alleviate the need for database users to be familiar with technical query languages. We illustrate the main theoretical concepts of data integration through concrete examples, using two well-known biological databases: a gene expression database, Bgee, and an orthology database, OMA.
Collapse
Affiliation(s)
- Ana Claudia Sima
- ZHAW Zurich University of Applied Sciences, Winterthur, Switzerland. .,University of Lausanne, Lausanne, Switzerland.
| | - Kurt Stockinger
- ZHAW Zurich University of Applied Sciences, Winterthur, Switzerland
| | - Tarcisio Mendes de Farias
- University of Lausanne, Lausanne, Switzerland.,SIB Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Manuel Gil
- ZHAW Zurich University of Applied Sciences, Winterthur, Switzerland.,SIB Swiss Institute of Bioinformatics, Lausanne, Switzerland
| |
Collapse
|
30
|
Vandenbrouck Y, Christiany D, Combes F, Loux V, Brun V. Bioinformatics Tools and Workflow to Select Blood Biomarkers for Early Cancer Diagnosis: An Application to Pancreatic Cancer. Proteomics 2019; 19:e1800489. [DOI: 10.1002/pmic.201800489] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2019] [Revised: 08/11/2019] [Indexed: 12/28/2022]
Affiliation(s)
- Yves Vandenbrouck
- University of Grenoble Alpes, INSERM, CEA, IRIG‐BGE, U1038 Grenoble 38000 France
| | - David Christiany
- University of Grenoble Alpes, INSERM, CEA, IRIG‐BGE, U1038 Grenoble 38000 France
- MaIAGE, INRA, Université Paris‐Saclay Jouy‐en‐Josas 78350 France
| | - Florence Combes
- University of Grenoble Alpes, INSERM, CEA, IRIG‐BGE, U1038 Grenoble 38000 France
| | - Valentin Loux
- MaIAGE, INRA, Université Paris‐Saclay Jouy‐en‐Josas 78350 France
| | - Virginie Brun
- University of Grenoble Alpes, INSERM, CEA, IRIG‐BGE, U1038 Grenoble 38000 France
| |
Collapse
|
31
|
Kamdar MR, Fernández JD, Polleres A, Tudorache T, Musen MA. Enabling Web-scale data integration in biomedicine through Linked Open Data. NPJ Digit Med 2019; 2:90. [PMID: 31531395 PMCID: PMC6736878 DOI: 10.1038/s41746-019-0162-5] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2019] [Accepted: 08/06/2019] [Indexed: 01/17/2023] Open
Abstract
The biomedical data landscape is fragmented with several isolated, heterogeneous data and knowledge sources, which use varying formats, syntaxes, schemas, and entity notations, existing on the Web. Biomedical researchers face severe logistical and technical challenges to query, integrate, analyze, and visualize data from multiple diverse sources in the context of available biomedical knowledge. Semantic Web technologies and Linked Data principles may aid toward Web-scale semantic processing and data integration in biomedicine. The biomedical research community has been one of the earliest adopters of these technologies and principles to publish data and knowledge on the Web as linked graphs and ontologies, hence creating the Life Sciences Linked Open Data (LSLOD) cloud. In this paper, we provide our perspective on some opportunities proffered by the use of LSLOD to integrate biomedical data and knowledge in three domains: (1) pharmacology, (2) cancer research, and (3) infectious diseases. We will discuss some of the major challenges that hinder the wide-spread use and consumption of LSLOD by the biomedical research community. Finally, we provide a few technical solutions and insights that can address these challenges. Eventually, LSLOD can enable the development of scalable, intelligent infrastructures that support artificial intelligence methods for augmenting human intelligence to achieve better clinical outcomes for patients, to enhance the quality of biomedical research, and to improve our understanding of living systems.
Collapse
Affiliation(s)
- Maulik R. Kamdar
- Center for Biomedical Informatics Research, Stanford University, Stanford, CA USA
| | - Javier D. Fernández
- Vienna University of Economics & Business, Vienna, Austria
- Complexity Science Hub Vienna, Vienna, Austria
| | - Axel Polleres
- Vienna University of Economics & Business, Vienna, Austria
- Complexity Science Hub Vienna, Vienna, Austria
| | - Tania Tudorache
- Center for Biomedical Informatics Research, Stanford University, Stanford, CA USA
| | - Mark A. Musen
- Center for Biomedical Informatics Research, Stanford University, Stanford, CA USA
| |
Collapse
|
32
|
Minervini G, Quaglia F, Tabaro F, Tosatto SCE. Insights into the molecular features of the von Hippel-Lindau-like protein. Amino Acids 2019; 51:1461-1474. [PMID: 31485743 DOI: 10.1007/s00726-019-02781-8] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2019] [Accepted: 08/28/2019] [Indexed: 12/19/2022]
Abstract
We present an in silico characterization of the von Hippel-Lindau-like protein (VLP), the only known human paralog of the von Hippel-Lindau tumor suppressor protein (pVHL). Phylogenetic investigation showed VLP to be mostly conserved in upper mammals and specifically expressed in brain and testis. Structural analysis and molecular dynamics simulations show VLP to be very similar to pVHL three-dimensional organization and binding dynamics. In particular, conservation of elements at the protein interfaces suggests VLP to be a functional pVHL homolog potentially possessing multiple functions beyond HIF-1α-dependent binding activity. Our findings show that VLP may share at least seven interactors with pVHL, suggesting novel functional roles for this understudied human protein. These may occur at precise hypoxia levels where functional overlap with pVHL may permit a finer modulation of pVHL functions.
Collapse
Affiliation(s)
- Giovanni Minervini
- Department of Biomedical Sciences, University of Padova, Viale G. Colombo 3, 35121, Padua, Italy
| | - Federica Quaglia
- Department of Biomedical Sciences, University of Padova, Viale G. Colombo 3, 35121, Padua, Italy
| | - Francesco Tabaro
- Department of Biomedical Sciences, University of Padova, Viale G. Colombo 3, 35121, Padua, Italy.,Institute of Biosciences and Medical Technology, Tampere, Finland
| | - Silvio C E Tosatto
- Department of Biomedical Sciences, University of Padova, Viale G. Colombo 3, 35121, Padua, Italy. .,CNR Institute of Neuroscience, Padua, Italy.
| |
Collapse
|
33
|
Choong WK, Chen CT, Wang JH, Sung TY. iHPDM: In Silico Human Proteome Digestion Map with Proteolytic Peptide Analysis and Graphical Visualizations. J Proteome Res 2019; 18:4124-4132. [PMID: 31429573 DOI: 10.1021/acs.jproteome.9b00350] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022]
Abstract
When conducting proteomics experiments to detect missing proteins and protein isoforms in the human proteome, it is desirable to use a protease that can yield more unique peptides with properties amenable for mass spectrometry analysis. Though trypsin is currently the most widely used protease, some proteins can yield only a limited number of unique peptides by trypsin digestion. Other proteases and multiple proteases have been applied in reported studies to increase the number of identified proteins and protein sequence coverage. To facilitate the selection of proteases, we developed a web-based resource, called in silico Human Proteome Digestion Map (iHPDM), which contains a comprehensive proteolytic peptide database constructed from human proteins, including isoforms, in neXtProt digested by 15 protease combinations of one or two proteases. iHPDM provides convenient functions and graphical visualizations for users to examine and compare the digestion results of different proteases. Notably, it also supports users to input filtering criteria on digested peptides, e.g., peptide length and uniqueness, to select suitable proteases. iHPDM can facilitate protease selection for shotgun proteomics experiments to identify missing proteins, protein isoforms, and single amino acid variant peptides.
Collapse
Affiliation(s)
- Wai-Kok Choong
- Institute of Information Science , Academia Sinica , Taipei 11529 , Taiwan
| | - Ching-Tai Chen
- Institute of Information Science , Academia Sinica , Taipei 11529 , Taiwan
| | - Jen-Hung Wang
- Institute of Information Science , Academia Sinica , Taipei 11529 , Taiwan
| | - Ting-Yi Sung
- Institute of Information Science , Academia Sinica , Taipei 11529 , Taiwan
| |
Collapse
|
34
|
Kim CY, Na K, Park S, Jeong SK, Cho JY, Shin H, Lee MJ, Han G, Paik YK. FusionPro, a Versatile Proteogenomic Tool for Identification of Novel Fusion Transcripts and Their Potential Translation Products in Cancer Cells. Mol Cell Proteomics 2019; 18:1651-1668. [PMID: 31208993 PMCID: PMC6683003 DOI: 10.1074/mcp.ra119.001456] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2019] [Revised: 05/23/2019] [Indexed: 01/21/2023] Open
Abstract
Fusion proteoforms are translation products derived from gene fusion. Although very rare, the fusion proteoforms play important roles in biomedical science. For example, fusion proteoforms influence the development of tumors by serving as cancer markers or cell cycle regulators. Although numerous studies have reported bioinformatics tools that can predict fusion transcripts, few proteogenomic tools are available that can predict and identify proteoforms. In this study, we develop a versatile proteogenomic tool "FusionPro," which facilitates the identification of fusion transcripts and their potential translatable peptides. FusionPro provides an independent gene fusion prediction module and can build sequence databases for annotated fusion proteoforms. FusionPro shows greater sensitivity than the available fusion finders when analyzing simulated or real RNA sequencing data sets. We use FusionPro to identify 18 fusion junction peptides and three potential fusion-derived peptides by MS/MS-based analysis of leukemia cell lines (Jurkat and K562) and ovarian cancer tissues from the Clinical Proteomic Tumor Analysis Consortium. Among the identified fusion proteins, we molecularly validate two fusion junction isoforms and a translation product of FAM133B:CDK6. Moreover, sequence analysis suggests that the fusion protein participates in the cell cycle progression. In addition, our prediction results indicate that fusion transcripts often have multiple fusion junctions and that these fusion junctions tend to be distributed in a nonrandom pattern at both the chromosome and gene levels. Thus, FusionPro allows users to detect various types of fusion translation products using a transcriptome-informed approach and to gain a comprehensive understanding of the formation and biological roles of fusion proteoforms.
Collapse
Affiliation(s)
- Chae-Yeon Kim
- ‡Interdisciplinary Program of Integrated OMICS for Biomedical Science, The Graduate School, Yonsei University, 50 Yonsei-ro, Seodaemun-gu, Seoul 03722, Republic of Korea; §Yonsei Proteome Research Center, Yonsei University, 50 Yonsei-ro, Seodaemun-gu, Seoul 03722, Republic of Korea
| | - Keun Na
- §Yonsei Proteome Research Center, Yonsei University, 50 Yonsei-ro, Seodaemun-gu, Seoul 03722, Republic of Korea
| | - Saeram Park
- §Yonsei Proteome Research Center, Yonsei University, 50 Yonsei-ro, Seodaemun-gu, Seoul 03722, Republic of Korea
| | - Seul-Ki Jeong
- §Yonsei Proteome Research Center, Yonsei University, 50 Yonsei-ro, Seodaemun-gu, Seoul 03722, Republic of Korea
| | - Jin-Young Cho
- §Yonsei Proteome Research Center, Yonsei University, 50 Yonsei-ro, Seodaemun-gu, Seoul 03722, Republic of Korea
| | - Heon Shin
- §Yonsei Proteome Research Center, Yonsei University, 50 Yonsei-ro, Seodaemun-gu, Seoul 03722, Republic of Korea
| | - Min Jung Lee
- §Yonsei Proteome Research Center, Yonsei University, 50 Yonsei-ro, Seodaemun-gu, Seoul 03722, Republic of Korea
| | - Gyoonhee Han
- ¶Department of Pharmacy, College of Pharmacy, Yonsei University, 50 Yonsei-ro, Seodaemun-gu, Seoul 03722, Republic of Korea
| | - Young-Ki Paik
- §Yonsei Proteome Research Center, Yonsei University, 50 Yonsei-ro, Seodaemun-gu, Seoul 03722, Republic of Korea.
| |
Collapse
|
35
|
Peng H, Chen R, Brentnall TA, Eng JK, Picozzi VJ, Pan S. Predictive proteomic signatures for response of pancreatic cancer patients receiving chemotherapy. Clin Proteomics 2019; 16:31. [PMID: 31346328 PMCID: PMC6636003 DOI: 10.1186/s12014-019-9251-3] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2018] [Accepted: 07/10/2019] [Indexed: 01/12/2023] Open
Abstract
BACKGROUND Pancreatic ductal adenocarcinoma (PDAC) is a lethal cancer that is characterized by its poor prognosis, rapid progression and development of drug resistance. Chemotherapy is a vital treatment option for most of PDAC patients. Stratification of PDAC patients, who would have a higher likelihood of responding to chemotherapy, could facilitate treatment selection and patient management. METHODS A quantitative proteomic study was performed to characterize the protein profiles in the plasma of PDAC patients undergoing chemotherapy to determine if specific biomarkers could be used to predict likelihood of therapeutic response. RESULTS By comparing the plasma proteome of the PDAC patients with positive therapeutic response and longer overall survival (Good-responders) to those who did not respond as well with shorter survival time (Limited-responders), we identified differential proteins and protein variants that could effectively segregate Good-responders from Limited-responders. Functional clustering and pathway analysis further suggested that many of these differential proteins were involved in pancreatic tumorigenesis. Four proteins, including vitamin-K dependent protein Z (PZ), sex hormone-binding globulin (SHBG), von Willebrand factor (VWF) and zinc-alpha-2-glycoprotein (AZGP1), were further evaluated as single or composite predictive biomarker with/without inclusion of CA 19-9. A composite biomarker panel that consists of PZ, SHBG, VWF and CA 19-9 demonstrated the best performance in distinguishing Good-responders from Limited-responders. CONCLUSION Based on the cohort investigated, our data suggested that systemic proteome alterations involved in pathways associated with inflammation, immunoresponse, coagulation and complement cascades may be reporters of chemo-treatment outcome in PDAC patients. For the majority of the patients involved, the panel consisting of PZ, SHBG, VWF and CA 19-9 was able to segregate Good-responders from Limited-responders effectively. Our data also showed that dramatic fluctuations of biomarker concentration in the circulating system of a PDAC patient, which might result from biological heterogeneity or confounding complications, could diminish the performance of a biomarker. Categorization of PDAC patients in terms of their tumor stages and histological types could potentially facilitate patient stratification for treatment.
Collapse
Affiliation(s)
- Hong Peng
- 0000 0000 9206 2401grid.267308.8Institute of Molecular Medicine, the University of Texas Health Science Center at Houston, Houston, TX 77030 USA
| | - Ru Chen
- 0000 0001 2160 926Xgrid.39382.33Division of Gastroenterology, Department of Medicine, Baylor College of Medicine, Houston, TX 77030 USA
| | - Teresa A. Brentnall
- 0000000122986657grid.34477.33Division of Gastroenterology, Department of Medicine, The University of Washington, Seattle, WA 98195 USA
| | - Jimmy K. Eng
- 0000000122986657grid.34477.33Proteomics Resource, The University of Washington, Seattle, WA 98109 USA
| | - Vincent J. Picozzi
- 0000 0001 2219 0587grid.416879.5Virginia Mason Medical Center, Seattle, WA 98101 USA
| | - Sheng Pan
- 0000 0000 9206 2401grid.267308.8Institute of Molecular Medicine, the University of Texas Health Science Center at Houston, Houston, TX 77030 USA
| |
Collapse
|
36
|
Laying the foundation for genomically-based risk assessment in chronic myeloid leukemia. Leukemia 2019; 33:1835-1850. [PMID: 31209280 DOI: 10.1038/s41375-019-0512-y] [Citation(s) in RCA: 87] [Impact Index Per Article: 17.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2019] [Accepted: 04/23/2019] [Indexed: 12/16/2022]
Abstract
Outcomes for patients with chronic myeloid leukemia (CML) have substantially improved due to advances in drug development and rational treatment intervention strategies. Despite these significant advances there are still unanswered questions on patient management regarding how to more reliably predict treatment failure at the time of diagnosis and how to select frontline tyrosine kinase inhibitor (TKI) therapy for optimal outcome. The BCR-ABL1 transcript level at diagnosis has no established prognostic impact and cannot guide frontline TKI selection. BCR-ABL1 mutations are detected in ~50% of TKI resistant patients but are rarely responsible for primary resistance. Other resistance mechanisms are largely uncharacterized and there are no other routine molecular testing strategies to facilitate the evaluation and further stratification of TKI resistance. Advances in next-generation sequencing technology has aided the management of a growing number of other malignancies, enabling the incorporation of somatic mutation profiles in diagnosis, classification, and prognostication. A largely unexplored area in CML research is whether expanded genomic analysis at diagnosis, resistance, and disease transformation can enhance patient management decisions, as has occurred for other cancers. The aim of this article is to review publications that reported mutated cancer-associated genes in CML patients at various disease phases. We discuss the frequency and type of such variants at initial diagnosis and at the time of treatment failure and transformation. Current limitations in the evaluation of mutants and recommendations for future reporting are outlined. The collective evaluation of mutational studies over more than a decade suggests a limited set of cancer-associated genes are indeed recurrently mutated in CML and some at a relatively high frequency. Genomic studies have the potential to lay the foundation for improved diagnostic risk classification according to clinical and genomic risk, and to enable more precise early identification of TKI resistance.
Collapse
|
37
|
Sachs J, Page R, Baskauf SJ, Pender J, Lujan-Toro B, Macklin J, Comspon Z. Training and hackathon on building biodiversity knowledge graphs. RESEARCH IDEAS AND OUTCOMES 2019. [DOI: 10.3897/rio.5.e36152] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Knowledge graphs have the potential to unite disconnected digitized biodiversity data, and there are a number of efforts underway to build biodiversity knowledge graphs. More generally, the recent popularity of knowledge graphs, driven in part by the advent and success of the Google Knowledge Graph, has breathed life into the ongoing development of semantic web infrastructure and prototypes in the biodiversity informatics community. We describe a one week training event and hackathon that focused on applying three specific knowledge graph technologies – the Neptune graph database; Metaphactory; and Wikidata - to a diverse set of biodiversity use cases.
We give an overview of the training, the projects that were advanced throughout the week, and the critical discussions that emerged. We believe that the main barriers towards adoption of biodiversity knowledge graphs are the lack of understanding of knowledge graphs and the lack of adoption of shared unique identifiers. Furthermore, we believe an important advancement in the outlook of knowledge graph development is the emergence of Wikidata as an identifier broker and as a scoping tool. To remedy the current barriers towards biodiversity knowledge graph development, we recommend continued discussions at workshops and at conferences, which we expect to increase awareness and adoption of knowledge graph technologies.
Collapse
|
38
|
Wang X, Diao L, Sun D, Wang D, Zhu J, He Y, Liu Y, Xu H, Zhang Y, Liu J, Wang Y, He F, Li Y, Li D. OsteoporosAtlas: a human osteoporosis-related gene database. PeerJ 2019; 7:e6778. [PMID: 31086734 PMCID: PMC6487800 DOI: 10.7717/peerj.6778] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2018] [Accepted: 03/13/2019] [Indexed: 01/12/2023] Open
Abstract
Background Osteoporosis is a common, complex disease of bone with a strong heritable component, characterized by low bone mineral density, microarchitectural deterioration of bone tissue and an increased risk of fracture. Due to limited drug selection for osteoporosis and increasing morbidity, mortality of osteoporotic fractures, osteoporosis has become a major health burden in aging societies. Current researches for identifying specific loci or genes involved in osteoporosis contribute to a greater understanding of the pathogenesis of osteoporosis and the development of better diagnosis, prevention and treatment strategies. However, little is known about how most causal genes work and interact to influence osteoporosis. Therefore, it is greatly significant to collect and analyze the studies involved in osteoporosis-related genes. Unfortunately, the information about all these osteoporosis-related genes is scattered in a large amount of extensive literature. Currently, there is no specialized database for easily accessing relevant information about osteoporosis-related genes and miRNAs. Methods We extracted data from literature abstracts in PubMed by text-mining and manual curation. Moreover, a local MySQL database containing all the data was developed with PHP on a Windows server. Results OsteoporosAtlas (http://biokb.ncpsb.org/osteoporosis/), the first specialized database for easily accessing relevant information such as osteoporosis-related genes and miRNAs, was constructed and served for researchers. OsteoporosAtlas enables users to retrieve, browse and download osteoporosis-related genes and miRNAs. Gene ontology and pathway analyses were integrated into OsteoporosAtlas. It currently includes 617 human encoding genes, 131 human non-coding miRNAs, and 128 functional roles. We think that OsteoporosAtlas will be an important bioinformatics resource to facilitate a better understanding of the pathogenesis of osteoporosis and developing better diagnosis, prevention and treatment strategies.
Collapse
Affiliation(s)
- Xun Wang
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences, Beijing Institute of Lifeomics, Beijing, China
| | - Lihong Diao
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences, Beijing Institute of Lifeomics, Beijing, China
| | - Dezhi Sun
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences, Beijing Institute of Lifeomics, Beijing, China
| | - Dan Wang
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences, Beijing Institute of Lifeomics, Beijing, China
| | - Jiarun Zhu
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences, Beijing Institute of Lifeomics, Beijing, China.,College of life Sciences, Hebei University, Baoding, China
| | - Yangzhige He
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences, Beijing Institute of Lifeomics, Beijing, China.,Central Research Laboratory, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, China
| | - Yuan Liu
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences, Beijing Institute of Lifeomics, Beijing, China
| | - Hao Xu
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences, Beijing Institute of Lifeomics, Beijing, China
| | - Yi Zhang
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences, Beijing Institute of Lifeomics, Beijing, China.,College of life Sciences, Hebei University, Baoding, China
| | - Jinying Liu
- School of Traditional Chinese Medicine, Beijing University of Chinese Medicine, Beijing, China
| | - Yan Wang
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences, Beijing Institute of Lifeomics, Beijing, China
| | - Fuchu He
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences, Beijing Institute of Lifeomics, Beijing, China
| | - Yang Li
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences, Beijing Institute of Lifeomics, Beijing, China
| | - Dong Li
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences, Beijing Institute of Lifeomics, Beijing, China
| |
Collapse
|
39
|
Fernández-Irigoyen J, Corrales F, Santamaría E. The Human Brain Proteome Project: Biological and Technological Challenges. Methods Mol Biol 2019; 2044:3-23. [PMID: 31432403 DOI: 10.1007/978-1-4939-9706-0_1] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
Brain proteomics has become a method of choice that allows zooming-in where neuropathophysiological alterations are taking place, detecting protein mediators that might eventually be measured in cerebrospinal fluid (CSF) as potential neuropathologically derived biomarkers. Following this hypothesis, mass spectrometry-based neuroproteomics has emerged as a powerful approach to profile neural proteomes derived from brain structures and CSF in order to map the extensive protein catalog of the human brain. This chapter provides a historical perspective on the Human Brain Proteome Project (HBPP), some recommendation to the experimental design in neuroproteomic projects, and a brief description of relevant technological and computational innovations that are emerging in the neurobiology field thanks to the proteomics community. Importantly, this chapter highlights recent discoveries from the biology- and disease-oriented branch of the HBPP (B/D-HBPP) focused on spatiotemporal proteomic characterizations of mouse models of neurodegenerative diseases, elucidation of proteostatic networks in different types of dementia, the characterization of unresolved clinical phenotypes, and the discovery of novel biomarker candidates in CSF.
Collapse
Affiliation(s)
- Joaquín Fernández-Irigoyen
- Proteomics Unit, Clinical Neuroproteomics Laboratory, Navarrabiomed, Complejo Hospitalario de Navarra (CHN), Universidad Pública de Navarra (UPNA), IdiSNA, Proteored-ISCIII, Pamplona, Spain
| | - Fernando Corrales
- Functional Proteomics Laboratory,, Proteored-ISCIII, CIBERehd, Madrid, Spain
| | - Enrique Santamaría
- Proteomics Unit, Clinical Neuroproteomics Laboratory, Navarrabiomed, Complejo Hospitalario de Navarra (CHN), Universidad Pública de Navarra (UPNA), IdiSNA, Proteored-ISCIII, Pamplona, Spain.
| |
Collapse
|
40
|
Roy S, Kuruc M. Methods to Monitor the Functional Subproteomes of SERPIN Protease Inhibitors. Methods Mol Biol 2019; 1871:41-54. [PMID: 30276730 DOI: 10.1007/978-1-4939-8814-3_2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]
Abstract
Conformational variants of the unique family of protease inhibitors annotated as SERPINs are most often underrepresented in proteomic analyses. This limits understanding the complex regulation that this family of proteins presents to the networks within the protease web of interactions. Using bead-based separation provided by a family of proteomic enrichment products-notably AlbuVoid™ and AlbuSorb™, we demonstrate their utility to satisfy investigations of serum SERPINs. We also suggest their use to develop functional profiles of the SERPIN proteoforms, and how those can establish relationships to disease phenotypes, gene mutations, and dysregulated mechanisms.
Collapse
Affiliation(s)
- Swapan Roy
- Biotech Support Group LLC, Monmouth Junction, NJ, USA
| | - Matthew Kuruc
- Biotech Support Group LLC, Monmouth Junction, NJ, USA.
| |
Collapse
|
41
|
Nguyen L, Brun V, Combes F, Loux V, Vandenbrouck Y. Designing an In Silico Strategy to Select Tissue-Leakage Biomarkers Using the Galaxy Framework. Methods Mol Biol 2019; 1959:275-289. [PMID: 30852829 DOI: 10.1007/978-1-4939-9164-8_18] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/09/2023]
Abstract
Knowledge-based approaches using large-scale biological ("omics") data are a powerful way to identify mechanistic biomarkers, provided that scientists have access to computational solutions even when they have little programming experience or bioinformatics support. To achieve this goal, we designed a set of tools under the Galaxy framework to allow biologists to define their own strategy for reproducible biomarker selection. These tools rely on retrieving experimental data from public databases, and applying successive filters derived from information relating to disease pathophysiology. A step-by-step protocol linking these tools was implemented to select tissue-leakage biomarker candidates of myocardial infarction. A list of 24 candidates suitable for experimental assessment by MS-based proteomics is proposed. These tools have been made publicly available at http://www.proteore.org , allowing researchers to reuse them in their quest for biomarker discovery.
Collapse
Affiliation(s)
- Lien Nguyen
- Université Grenoble Alpes, CEA, Inserm, BGE U1038, Grenoble, France
- INRA, MAIAGE Unit, University Paris-Saclay, Jouy-en-Josas, France
| | - Virginie Brun
- Université Grenoble Alpes, CEA, Inserm, BGE U1038, Grenoble, France
| | - Florence Combes
- Université Grenoble Alpes, CEA, Inserm, BGE U1038, Grenoble, France
| | - Valentin Loux
- INRA, MAIAGE Unit, University Paris-Saclay, Jouy-en-Josas, France
| | - Yves Vandenbrouck
- Université Grenoble Alpes, CEA, Inserm, BGE U1038, Grenoble, France.
| |
Collapse
|
42
|
Liu J, Liu Y, Wang D, He M, Diao L, Liu Z, Li Y, Tang L, He F, Li D, Guo S. AllerGAtlas 1.0: a human allergy-related genes database. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2018; 2018:4904120. [PMID: 29688358 PMCID: PMC5824776 DOI: 10.1093/database/bay010] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/09/2017] [Accepted: 01/15/2018] [Indexed: 12/25/2022]
Abstract
Allergy is a detrimental hypersensitive response to innocuous environmental antigen, which is caused by the effect of interaction between environmental factors and multiple genetic pre-disposition. In the past decades, hundreds of allergy-related genes have been identified to illustrate the epidemiology and pathogenesis of allergic diseases, which are associated with better endophenotype, novel biomarkers, early-life risk factors and individual differences in treatment responses. However, the information of all these allergy-related genes is dispersed in thousands of publications. Here, we present a manually curated human allergy-related gene database of AllerGAtlas, which contained 1195 well-annotated human allergy-related genes, determined by text-mining and manual curation. AllerGAtlas will be a valuable bioinformatics resource to search human allergy-related genes and explore their functions in allergy for experimental research. Database URL: http://biokb.ncpsb.org/AlleRGatlas/
Collapse
Affiliation(s)
- Jinying Liu
- School of Chinese Medicine, Beijing University of Chinese Medicine, Beijing, China
| | - Yuan Liu
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences-Beijing (PHOENIX Center), Beijing Institute of Lifeomics, Beijing, China
| | - Dan Wang
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences-Beijing (PHOENIX Center), Beijing Institute of Lifeomics, Beijing, China
| | - Mengqi He
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences-Beijing (PHOENIX Center), Beijing Institute of Lifeomics, Beijing, China
| | - Lihong Diao
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences-Beijing (PHOENIX Center), Beijing Institute of Lifeomics, Beijing, China
| | - Zhongyang Liu
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences-Beijing (PHOENIX Center), Beijing Institute of Lifeomics, Beijing, China
| | - Yang Li
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences-Beijing (PHOENIX Center), Beijing Institute of Lifeomics, Beijing, China
| | - Li Tang
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences-Beijing (PHOENIX Center), Beijing Institute of Lifeomics, Beijing, China
| | - Fuchu He
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences-Beijing (PHOENIX Center), Beijing Institute of Lifeomics, Beijing, China
| | - Dong Li
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences-Beijing (PHOENIX Center), Beijing Institute of Lifeomics, Beijing, China
| | - Shuzhen Guo
- School of Chinese Medicine, Beijing University of Chinese Medicine, Beijing, China
| |
Collapse
|
43
|
Robin T, Bairoch A, Müller M, Lisacek F, Lane L. Large-Scale Reanalysis of Publicly Available HeLa Cell Proteomics Data in the Context of the Human Proteome Project. J Proteome Res 2018; 17:4160-4170. [DOI: 10.1021/acs.jproteome.8b00392] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Affiliation(s)
- Thibault Robin
- CALIPHO Group, SIB Swiss Institute of Bioinformatics, CMU, Rue Michel-Servet 1, CH-1211 Geneva, Switzerland
- Proteome Informatics Group, SIB Swiss Institute of Bioinformatics, CMU, Rue Michel-Servet 1, CH-1211 Geneva, Switzerland
- Computer Science Department, University of Geneva, CH-1211 Geneva, Switzerland
- Department of Microbiology and Molecular Medicine, Faculty of Medicine, University of Geneva, CH-1211 Geneva, Switzerland
| | - Amos Bairoch
- CALIPHO Group, SIB Swiss Institute of Bioinformatics, CMU, Rue Michel-Servet 1, CH-1211 Geneva, Switzerland
- Department of Microbiology and Molecular Medicine, Faculty of Medicine, University of Geneva, CH-1211 Geneva, Switzerland
| | - Markus Müller
- Vital-IT Group, SIB Swiss Institute of Bioinformatics, Genopode Building, Quartier Sorge, CH-1015 Lausanne, Switzerland
| | - Frédérique Lisacek
- Proteome Informatics Group, SIB Swiss Institute of Bioinformatics, CMU, Rue Michel-Servet 1, CH-1211 Geneva, Switzerland
- Computer Science Department, University of Geneva, CH-1211 Geneva, Switzerland
- Section of Biology, University of Geneva, CH-1211 Geneva, Switzerland
| | - Lydie Lane
- CALIPHO Group, SIB Swiss Institute of Bioinformatics, CMU, Rue Michel-Servet 1, CH-1211 Geneva, Switzerland
- Department of Microbiology and Molecular Medicine, Faculty of Medicine, University of Geneva, CH-1211 Geneva, Switzerland
| |
Collapse
|
44
|
Ding Y, Li N, Chang G, Li J, Yao R, Shen Y, Wang J, Huang X, Wang X. Clinical and molecular genetic characterization of two patients with mutations in the phosphoglucomutase 1 (PGM1) gene. J Pediatr Endocrinol Metab 2018; 31:781-788. [PMID: 29858906 DOI: 10.1515/jpem-2017-0551] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/26/2017] [Accepted: 04/16/2018] [Indexed: 01/21/2023]
Abstract
Background The phosphoglucomutase 1 (PGM1) enzyme plays a central role in glucose homeostasis by catalyzing the inter-conversion of glucose 1-phosphate and glucose 6-phosphate. Recently, PGM1 deficiency has been recognized as a cause of the congenital disorders of glycosylation (CDGs). Methods Two Chinese Han pediatric patients with recurrent hypoglycemia, hepatopathy and growth retardation are described in this study. Targeted gene sequencing (TGS) was performed to screen for causal genetic variants in the genome of the patients and their parents to determine the genetic basis of the phenotype. Results DNA sequencing identified three variations of the PGM1 gene (NM_002633.2). Patient 1 had a novel homozygous mutation (c.119delT, p.Ile40Thrfs*28). In patient 2, we found a compound heterozygous mutation of c.1172G>T(p.Gly391Val) (novel) and c.1507C>T(p.Arg503*) (known pathogenic). Conclusions This report deepens our understanding of the clinical features of PGM1 mutation. The early molecular genetic analysis and multisystem assessment were here found to be essential to the diagnosis of PGM1-CDG and the provision of timely and proper treatment.
Collapse
Affiliation(s)
- Yu Ding
- Department of Endocrinology and Metabolism, Shanghai Children's Medical Center, Shanghai Jiaotong University School of Medicine, Shanghai, P.R. China
| | - Niu Li
- Institute of Pediatric Translational Medicine, Shanghai Children's Medical Center, Shanghai Jiaotong University School of Medicine, Shanghai, P.R. China
| | - Gouying Chang
- Department of Endocrinology and Metabolism, Shanghai Children's Medical Center, Shanghai Jiaotong University School of Medicine, Shanghai, P.R. China
| | - Juan Li
- Department of Endocrinology and Metabolism, Shanghai Children's Medical Center, Shanghai Jiaotong University School of Medicine, Shanghai, P.R. China
| | - Ruen Yao
- Institute of Pediatric Translational Medicine, Shanghai Children's Medical Center, Shanghai Jiaotong University School of Medicine, Shanghai, P.R. China
| | - Yiping Shen
- Institute of Pediatric Translational Medicine, Shanghai Children's Medical Center, Shanghai Jiaotong University School of Medicine, Shanghai, P.R. China.,Boston Children's Hospital, Boston, MA, USA
| | - Jian Wang
- Institute of Pediatric Translational Medicine, Shanghai Children's Medical Center, Shanghai Jiaotong University School of Medicine, Shanghai, P.R. China
| | - Xiaodong Huang
- Department of Endocrinology and Metabolism, Shanghai Children's Medical Center, Shanghai Jiaotong University School of Medicine, Shanghai, P.R. China
| | - Xiumin Wang
- Department of Endocrinology and Metabolism, Shanghai Children's Medical Center, Shanghai Jiaotong University School of Medicine, Shanghai, P.R. China
| |
Collapse
|
45
|
Bell MJ, Lord P. On patterns and re-use in bioinformatics databases. Bioinformatics 2018; 33:2731-2736. [PMID: 28525546 PMCID: PMC5860070 DOI: 10.1093/bioinformatics/btx310] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2016] [Accepted: 05/16/2017] [Indexed: 11/20/2022] Open
Abstract
Motivation As the quantity of data being depositing into biological databases continues to increase, it becomes ever more vital to develop methods that enable us to understand this data and ensure that the knowledge is correct. It is widely-held that data percolates between different databases, which causes particular concerns for data correctness; if this percolation occurs, incorrect data in one database may eventually affect many others while, conversely, corrections in one database may fail to percolate to others. In this paper, we test this widely-held belief by directly looking for sentence reuse both within and between databases. Further, we investigate patterns of how sentences are reused over time. Finally, we consider the limitations of this form of analysis and the implications that this may have for bioinformatics database design. Results We show that reuse of annotation is common within many different databases, and that also there is a detectable level of reuse between databases. In addition, we show that there are patterns of reuse that have previously been shown to be associated with percolation errors. Availability and implementation Analytical software is available on request.
Collapse
Affiliation(s)
- Michael J Bell
- School of Computing Science, Newcastle University, Newcastle-upon-Tyne, UK
| | - Phillip Lord
- School of Computing Science, Newcastle University, Newcastle-upon-Tyne, UK
| |
Collapse
|
46
|
Lee SE, Song J, Bösl K, Müller AC, Vitko D, Bennett KL, Superti-Furga G, Pandey A, Kandasamy RK, Kim MS. Proteogenomic Analysis to Identify Missing Proteins from Haploid Cell Lines. Proteomics 2018; 18:e1700386. [DOI: 10.1002/pmic.201700386] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2017] [Revised: 02/14/2018] [Indexed: 12/29/2022]
Affiliation(s)
- Seung-Eun Lee
- Department of Applied Chemistry College of Applied Science; Kyung Hee University; Yongin-si Republic of Korea
- Department of Biomedical Science and TechnologyKyung Hee Medical Science Research Institute; Kyung Hee University; Yongin-si Republic of Korea
| | - JongKeon Song
- Department of Applied Chemistry College of Applied Science; Kyung Hee University; Yongin-si Republic of Korea
| | - Korbinian Bösl
- Centre of Molecular Inflammation Research (SFF-CEMIR), and Department of Clinical and Molecular Medicine (IKOM); Norwegian University of Science and Technology; Trondheim Norway
| | - André C. Müller
- CeMM Research Center for Molecular Medicine of the Austrian Academy of Sciences; Vienna Austria
| | - Dijana Vitko
- CeMM Research Center for Molecular Medicine of the Austrian Academy of Sciences; Vienna Austria
| | - Keiryn L. Bennett
- CeMM Research Center for Molecular Medicine of the Austrian Academy of Sciences; Vienna Austria
| | - Giulio Superti-Furga
- CeMM Research Center for Molecular Medicine of the Austrian Academy of Sciences; Vienna Austria
- Center for Physiology and Pharmacology; Medical University of Vienna; Vienna Austria
| | - Akhilesh Pandey
- Department of Applied Chemistry College of Applied Science; Kyung Hee University; Yongin-si Republic of Korea
- Department of Biological Chemistry; Johns Hopkins University School of Medicine; Baltimore MD USA
| | - Richard K. Kandasamy
- Centre of Molecular Inflammation Research (SFF-CEMIR), and Department of Clinical and Molecular Medicine (IKOM); Norwegian University of Science and Technology; Trondheim Norway
- CeMM Research Center for Molecular Medicine of the Austrian Academy of Sciences; Vienna Austria
- Centre for Molecular Medicine Norway (NCMM) Nordic EMBL Partnership; University of Oslo and Oslo University Hospital; Oslo Norway
| | - Min-Sik Kim
- Department of Applied Chemistry College of Applied Science; Kyung Hee University; Yongin-si Republic of Korea
- Department of Biomedical Science and TechnologyKyung Hee Medical Science Research Institute; Kyung Hee University; Yongin-si Republic of Korea
- Global Center for Pharmaceutical Ingredient Materials; Kyung Hee University; Yongin-si Republic of Korea
| |
Collapse
|
47
|
Lacombe M, Marie-Desvergne C, Combes F, Kraut A, Bruley C, Vandenbrouck Y, Chamel Mossuz V, Couté Y, Brun V. Proteomic characterization of human exhaled breath condensate. J Breath Res 2018; 12:021001. [PMID: 29189203 DOI: 10.1088/1752-7163/aa9e71] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
To improve biomedical knowledge and to support biomarker discovery studies, it is essential to establish comprehensive proteome maps for human tissues and biofluids, and to make them publicly accessible. In this study, we performed an in-depth proteomics characterization of exhaled breath condensate (EBC), a sample obtained non-invasively by condensation of exhaled air that contains submicron droplets of airway lining fluid. Two pooled samples of EBC, each obtained from 10 healthy donors, were processed using a straightforward protocol based on sample lyophilization, in-gel digestion and liquid chromatography tandem-mass spectrometry analysis. Two 'technical' control samples were processed in parallel to the pooled samples to correct for exogenous protein contamination. A total of 229 unique proteins were identified in EBC among which 153 proteins were detected in both EBC pooled samples. A detailed bioinformatics analysis of these 153 proteins showed that most of the proteins identified corresponded to proteins secreted in the respiratory tract (lung, bronchi). Eight proteins were salivary proteins. Our dataset is described and has been made accessible through the ProteomeXchange database (dataset identifier: PXD007591) and is expected to be useful for future MS-based biomarker studies using EBC as the diagnostic specimen.
Collapse
Affiliation(s)
- Maud Lacombe
- Université Grenoble-Alpes, F-38000, Grenoble, France. CEA, BIG, Biologie à Grande Echelle, F-38054, Grenoble, France. Inserm, Unité 1038, F-38054, Grenoble, France
| | | | | | | | | | | | | | | | | |
Collapse
|
48
|
Segura V, Valero ML, Cantero L, Muñoz J, Zarzuela E, García F, Aloria K, Beaskoetxea J, Arizmendi JM, Navajas R, Paradela A, Díez P, Dégano RM, Fuentes M, Orfao A, Montero AG, Garin-Muga A, Corrales FJ, Pino MMSD. In-Depth Proteomic Characterization of Classical and Non-Classical Monocyte Subsets. Proteomes 2018; 6:proteomes6010008. [PMID: 29401756 PMCID: PMC5874767 DOI: 10.3390/proteomes6010008] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2017] [Revised: 01/24/2018] [Accepted: 02/01/2018] [Indexed: 01/02/2023] Open
Abstract
Monocytes are bone marrow-derived leukocytes that are part of the innate immune system. Monocytes are divided into three subsets: classical, intermediate and non-classical, which can be differentiated by their expression of some surface antigens, mainly CD14 and CD16. These cells are key players in the inflammation process underlying the mechanism of many diseases. Thus, the molecular characterization of these cells may provide very useful information for understanding their biology in health and disease. We performed a multicentric proteomic study with pure classical and non-classical populations derived from 12 healthy donors. The robust workflow used provided reproducible results among the five participating laboratories. Over 5000 proteins were identified, and about half of them were quantified using a spectral counting approach. The results represent the protein abundance catalogue of pure classical and enriched non-classical blood peripheral monocytes, and could serve as a reference dataset of the healthy population. The functional analysis of the differences between cell subsets supports the consensus roles assigned to human monocytes.
Collapse
Affiliation(s)
- Víctor Segura
- Proteomics, Genomics and Bioinformatics Unit, Center for Applied Medical Research, University of Navarra, Pamplona 31008, Spain.
| | - M Luz Valero
- Proteomics Unit; Central Service for Experimental Research (SCSIE), University of Valencia. Dr Moliner 50, 46100 Burjassot, Spain.
| | - Laura Cantero
- Proteomics Unit; Central Service for Experimental Research (SCSIE), University of Valencia. Dr Moliner 50, 46100 Burjassot, Spain.
| | - Javier Muñoz
- Spanish National Cancer Research Centre (CNIO), Melchor Férnandez Almagro, 3, 28029 Madrid. Spain.
| | - Eduardo Zarzuela
- Spanish National Cancer Research Centre (CNIO), Melchor Férnandez Almagro, 3, 28029 Madrid. Spain.
| | - Fernando García
- Spanish National Cancer Research Centre (CNIO), Melchor Férnandez Almagro, 3, 28029 Madrid. Spain.
| | - Kerman Aloria
- Proteomics Core Facility-SGIKER, University of the Basque Country, UPV/EHU, 48940 Leioa, Spain.
| | - Javier Beaskoetxea
- Department of Biochemistry and Molecular Biology, University of the Basque Country, UPV/EHU, 48940 Leioa, Spain.
| | - Jesús M Arizmendi
- Department of Biochemistry and Molecular Biology, University of the Basque Country, UPV/EHU, 48940 Leioa, Spain.
| | - Rosana Navajas
- Proteomics Unit, Centro Nacional de Biotecnología-CSIC, Darwin 3, 28049 Madrid, Spain.
| | - Alberto Paradela
- Proteomics Unit, Centro Nacional de Biotecnología-CSIC, Darwin 3, 28049 Madrid, Spain.
| | - Paula Díez
- Department of Medicine and General Cytometry Service-Nucleus, Cancer Research Centre (IBMCC/CSIC/USAL/IBSAL), 37007 Salamanca, Spain.
- Proteomics Unit. Cancer Research Centre (IBMCC/CSIC/USAL/IBSAL), 37007 Salamanca, Spain.
| | - Rosa Mª Dégano
- Department of Medicine and General Cytometry Service-Nucleus, Cancer Research Centre (IBMCC/CSIC/USAL/IBSAL), 37007 Salamanca, Spain.
- Proteomics Unit. Cancer Research Centre (IBMCC/CSIC/USAL/IBSAL), 37007 Salamanca, Spain.
| | - Manuel Fuentes
- Department of Medicine and General Cytometry Service-Nucleus, Cancer Research Centre (IBMCC/CSIC/USAL/IBSAL), 37007 Salamanca, Spain.
- Proteomics Unit. Cancer Research Centre (IBMCC/CSIC/USAL/IBSAL), 37007 Salamanca, Spain.
| | - Alberto Orfao
- Cancer Research Center. University of Salamanca-CSIC, IBSAL, 37007 Salamanca, Spain.
| | - Andrés García Montero
- Spanish National DNA Bank Carlos III, University of Salamanca, 37007 Salamanca, Spain.
| | - Alba Garin-Muga
- Proteomics, Genomics and Bioinformatics Unit, Center for Applied Medical Research, University of Navarra, Pamplona 31008, Spain.
| | - Fernando J Corrales
- Proteomics Unit, Centro Nacional de Biotecnología-CSIC, Darwin 3, 28049 Madrid, Spain.
| | - Manuel M Sánchez Del Pino
- Department of Biochemistry and Molecular Biology, University of Valencia. Dr Moliner 50, 46100 Burjassot, Spain.
- Biotechnology and Biomedicine Interdisciplinary Research Unit (ERI BIOTECMED), University of Valencia. Dr Moliner 50, 46100 Burjassot, Spain.
| |
Collapse
|
49
|
Ellens KW, Christian N, Singh C, Satagopam VP, May P, Linster CL. Confronting the catalytic dark matter encoded by sequenced genomes. Nucleic Acids Res 2017; 45:11495-11514. [PMID: 29059321 PMCID: PMC5714238 DOI: 10.1093/nar/gkx937] [Citation(s) in RCA: 52] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2017] [Accepted: 10/03/2017] [Indexed: 01/02/2023] Open
Abstract
The post-genomic era has provided researchers with a deluge of protein sequences. However, a significant fraction of the proteins encoded by sequenced genomes remains without an identified function. Here, we aim at determining how many enzymes of uncertain or unknown function are still present in the Saccharomyces cerevisiae and human proteomes. Using information available in the Swiss-Prot, BRENDA and KEGG databases in combination with a Hidden Markov Model-based method, we estimate that >600 yeast and 2000 human proteins (>30% of their proteins of unknown function) are enzymes whose precise function(s) remain(s) to be determined. This illustrates the impressive scale of the ‘unknown enzyme problem’. We extensively review classical biochemical as well as more recent systematic experimental and computational approaches that can be used to support enzyme function discovery research. Finally, we discuss the possible roles of the elusive catalysts in light of recent developments in the fields of enzymology and metabolism as well as the significance of the unknown enzyme problem in the context of metabolic modeling, metabolic engineering and rare disease research.
Collapse
Affiliation(s)
- Kenneth W Ellens
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, L-4362 Esch-sur-Alzette, Luxembourg
| | - Nils Christian
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, L-4362 Esch-sur-Alzette, Luxembourg
| | - Charandeep Singh
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, L-4362 Esch-sur-Alzette, Luxembourg
| | - Venkata P Satagopam
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, L-4362 Esch-sur-Alzette, Luxembourg
| | - Patrick May
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, L-4362 Esch-sur-Alzette, Luxembourg
| | - Carole L Linster
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, L-4362 Esch-sur-Alzette, Luxembourg
| |
Collapse
|
50
|
Wippel HH, Santos MDM, Clasen MA, Kurt LU, Nogueira FCS, Carvalho CE, McCormick TM, Neto GPB, Alves LR, da Gloria da Costa Carvalho M, Carvalho PC, Fischer JDSDG. Comparing intestinal versus diffuse gastric cancer using a PEFF-oriented proteomic pipeline. J Proteomics 2017; 171:63-72. [PMID: 29032071 DOI: 10.1016/j.jprot.2017.10.005] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2017] [Revised: 09/29/2017] [Accepted: 10/06/2017] [Indexed: 12/19/2022]
Abstract
Gastric cancer is the fifth most common malignant neoplasia and the third leading cause of cancer death worldwide. Mac-Cormick et al. recently showed the importance of considering the anatomical region of the tumor in proteomic gastric cancer studies; more differences were found between distinct anatomical regions than when comparing healthy versus diseased tissue. Thus, failing to consider the anatomical region could lead to differential proteins that are not disease specific. With this as motivation, we compared the proteomic profiles of intestinal and diffuse adenocarcinoma from the same anatomical region, the corpus. To achieve this, we used isobaric labeling (iTRAQ) of peptides, a 10-step HILIC fractionation, and reversed-phase nano-chromatography coupled online with a Q-Exactive Plus mass spectrometer. We updated PatternLab to take advantage of the new Comet-PEFF search engine that enables identifying post-translational modifications and mutations included in neXtProt's PSI Extended FASTA Format (PEFF) metadata. Our pipeline then uses a text-mining tool that automatically extracts PubMed IDs from the proteomic result metadata and drills down keywords from manuscripts related with the biological processes at hand. Our results disclose important proteins such as apolipoprotein B-100, S100 and 14-3-3 proteins, among many others, highlighting the different pathways enriched by each cancer type. SIGNIFICANCE Gastric cancer is a heterogeneous and multifactorial disease responsible for a significant number of deaths every year. Despite the constant improvement of surgical techniques and multimodal treatments, survival rates are low, mostly due to limited diagnostic techniques and late symptoms. Intestinal and diffuse types of gastric cancer have distinct clinical and pathological characteristics; yet little is known about the molecular mechanisms regulating these two types of gastric tumors. Here we compared the proteomic profile of diffuse and intestinal types of gastric cancer from the same anatomical location, the corpus, from four male patients. This methodological design aimed to eliminate proteomic variations resulting from comparison of tumors from distinct anatomical regions. Our PEFF-tailored proteomic pipeline significantly increased the identifications as when compared to previous versions of PatternLab.
Collapse
Affiliation(s)
- Helisa Helena Wippel
- Computational Mass Spectrometry & Proteomics Group, Carlos Chagas Institute, Fiocruz - Paraná, Brazil
| | | | - Milan Avila Clasen
- Computational Mass Spectrometry & Proteomics Group, Carlos Chagas Institute, Fiocruz - Paraná, Brazil
| | - Louise Ulrich Kurt
- Computational Mass Spectrometry & Proteomics Group, Carlos Chagas Institute, Fiocruz - Paraná, Brazil
| | - Fabio Cesar Sousa Nogueira
- Laboratory of Proteomics, Institute of Chemistry, Federal University of Rio de Janeiro, Rio de Janeiro, Brazil; Laboratory of Protein Chemistry, Proteomic Unit, Institute of Chemistry, Federal University of Rio de Janeiro, Rio de Janeiro, Brazil
| | - Carlos Eduardo Carvalho
- Pathology Service of the Clementino Fraga Filho University Hospital (HUCFF-UFRJ), Rio de Janeiro, Brazil
| | | | - Guilherme Pinto Bravo Neto
- Division of Esophageal and Gastric Surgery, General Surgery Service of the HUCFF-UFRJ, Rio de Janeiro, Brazil
| | | | | | - Paulo Costa Carvalho
- Computational Mass Spectrometry & Proteomics Group, Carlos Chagas Institute, Fiocruz - Paraná, Brazil.
| | | |
Collapse
|