1
|
Madej D, Lam H. On the use of tandem mass spectra acquired from samples of evolutionarily distant organisms to validate methods for false discovery rate estimation. Proteomics 2024; 24:e2300398. [PMID: 38491400 DOI: 10.1002/pmic.202300398] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2023] [Revised: 03/01/2024] [Accepted: 03/06/2024] [Indexed: 03/18/2024]
Abstract
Estimating the false discovery rate (FDR) of peptide identifications is a key step in proteomics data analysis, and many methods have been proposed for this purpose. Recently, an entrapment-inspired protocol to validate methods for FDR estimation appeared in articles showcasing new spectral library search tools. That validation approach involves generating incorrect spectral matches by searching spectra from evolutionarily distant organisms (entrapment queries) against the original target search space. Although this approach may appear similar to the solutions using entrapment databases, it represents a distinct conceptual framework whose correctness has not been verified yet. In this viewpoint, we first discussed the background of the entrapment-based validation protocols and then conducted a few simple computational experiments to verify the assumptions behind them. The results reveal that entrapment databases may, in some implementations, be a reasonable choice for validation, while the assumptions underpinning validation protocols based on entrapment queries are likely to be violated in practice. This article also highlights the need for well-designed frameworks for validating FDR estimation methods in proteomics.
Collapse
Affiliation(s)
- Dominik Madej
- Department of Chemical and Biological Engineering, The Hong Kong University of Science and Technology, Clear Water Bay, Kowloon, Hong Kong, China
| | - Henry Lam
- Department of Chemical and Biological Engineering, The Hong Kong University of Science and Technology, Clear Water Bay, Kowloon, Hong Kong, China
| |
Collapse
|
2
|
Li Y, Liao Z, Fan X, Wang Y, Liu F, Zhang X, He J, Buttino I, Yan X, Tang C. The molecular response of Mytilus coruscus mantle to shell damage under acute acidified sea water revealed by iTRAQ based quantitative proteomic analysis. J Proteomics 2024; 294:105062. [PMID: 38158015 DOI: 10.1016/j.jprot.2023.105062] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2023] [Revised: 11/29/2023] [Accepted: 12/05/2023] [Indexed: 01/03/2024]
Abstract
Mytilus coruscus is an economically important marine bivalve that lives in estuarine sea areas with seasonal coastal acidification and frequently suffers shell injury in the natural environment. However, the molecular responses and biochemical properties of Mytilus under these conditions are not fully understood. In the present study, we employed tandem mass spectrometry combined with isobaric tagging to identify differentially expressed proteins in the mantle tissue of M. coruscus under different short-term treatments, including shell-complete mussels raised in normal seawater (pH 8.1), shell-damaged mussels raised in normal seawater (pH 8.1), and acidified seawater (pH 7.4). A total of 2694 proteins were identified in the mantle, and analysis of their relative abundance from the three different treatments revealed alterations in the proteins involved in immune regulation, oxidation-reduction processes, protein folding and processing, energy provision, and cytoskeleton. The results obtained by quantitative proteomic analysis of the mantle allowed us to delineate the molecular strategies adopted by M. coruscus in the shell repair process in acidified environments, including an increase in proteins involved in oxidation-reduction processes, protein processing, and cell growth at the expense of proteins involved in immune capacity and energy metabolism. SIGNIFICANCE: The impact of global ocean acidification on calcifying organisms has become a major ecological and environmental problem in the world. Mytilus coruscus is an economically important marine bivalve living in estuary sea area with seasonal coastal acidification, and frequently suffering shell injury in natural environment. Molecular responses of M coruscus under the shell damage and acute acidification is still largely unknown. For this reason, iTRAQ based quantitative proteomic and histological analysis of the mantle from M. coruscus under shell damage and acute acidification were performed, for revealing the proteomic response and possible adaptation mechanism of Mytilus under combined shell damage and acidified sea water, and understanding how the mussel mantle implement a shell-repair process under acidified sea water. Our study provides important data for understanding the shell repair process and proteomic response of Mytilus under ocean acidification, and providing insights into potential adaptation of mussels to future global change.
Collapse
Affiliation(s)
- Yingao Li
- Laboratory of Marine Biology Protein Engineering, Marine Science and Technical College, Zhejiang Ocean University, Zhoushan City 316022, Zhejiang, China
| | - Zhi Liao
- Laboratory of Marine Biology Protein Engineering, Marine Science and Technical College, Zhejiang Ocean University, Zhoushan City 316022, Zhejiang, China.
| | - Xiaojun Fan
- Laboratory of Marine Biology Protein Engineering, Marine Science and Technical College, Zhejiang Ocean University, Zhoushan City 316022, Zhejiang, China
| | - Ying Wang
- Laboratory of Marine Biology Protein Engineering, Marine Science and Technical College, Zhejiang Ocean University, Zhoushan City 316022, Zhejiang, China
| | - Fei Liu
- Laboratory of Marine Biology Protein Engineering, Marine Science and Technical College, Zhejiang Ocean University, Zhoushan City 316022, Zhejiang, China
| | - Xiaolin Zhang
- Laboratory of Marine Biology Protein Engineering, Marine Science and Technical College, Zhejiang Ocean University, Zhoushan City 316022, Zhejiang, China
| | - Jianyu He
- Laboratory of Marine Biology Protein Engineering, Marine Science and Technical College, Zhejiang Ocean University, Zhoushan City 316022, Zhejiang, China
| | - Isabella Buttino
- Italian Institute for Environmental Protection and Research (ISPRA), Via Vitaliano Brancati 48, 00144 Rome, Italy
| | - Xiaojun Yan
- Laboratory of Marine Biology Protein Engineering, Marine Science and Technical College, Zhejiang Ocean University, Zhoushan City 316022, Zhejiang, China
| | - Changsheng Tang
- Laboratory of Marine Biology Protein Engineering, Marine Science and Technical College, Zhejiang Ocean University, Zhoushan City 316022, Zhejiang, China.
| |
Collapse
|
3
|
Zou H, Zhang B, Liang H, Li C, Chen J, Wu Y. Defence mechanisms of Pinctada fucata martensii to Vibrio parahaemolyticus infection: Insights from proteomics and metabolomics. FISH & SHELLFISH IMMUNOLOGY 2023; 143:109204. [PMID: 37931889 DOI: 10.1016/j.fsi.2023.109204] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/06/2023] [Revised: 10/28/2023] [Accepted: 10/30/2023] [Indexed: 11/08/2023]
Abstract
Survival of pearl oysters is not only challenged by coastal pollution, but also pathogen infection that may eventually incur substantial economic losses in the pearl farming industry. Yet, whether pearl oysters can defend themselves against pathogen infection through molecular mechanisms remains largely unexplored. By using iTRAQ proteomic and metabolomic analyses, we analysed the proteins and metabolites in the serum of pearl oysters (Pinctada fucata martensii) when stimulated by pathogenic bacteria (Vibrio parahaemolyticus). Proteomic results found that a total of 2,242 proteins were identified in the experimental (i.e., Vibrio-stimulated) and control groups, where 166 of them were differentially expressed (120 upregulated and 46 downregulated in the experimental group). Regarding the immune response enrichment results, the pathway of signal transduction was significantly enriched, such as cytoskeleton and calcium signalling pathways. Proteins, including cathepsin L, heat shock protein 20, myosin and astacin-like protein, also contributed to the immune response of oysters. Pathogen stimulation also altered the metabolite profile of oysters, where 49 metabolites associated with metabolism of energy, fatty acids and amino acids were found. Integrated analysis suggests that the oysters could respond to pathogen infection by coordinating multiple cellular processes. Thus, the proteins and metabolites identified herein not only represent valuable genetic resources for developing molecular biomarkers and genetic breeding research, but also open new avenues for studies on the molecular defence mechanisms of pearl oysters to pathogen infection.
Collapse
Affiliation(s)
- Hexin Zou
- Fisheries College of Guangdong Ocean University, Zhanjiang, Guangdong, 524088, China
| | - Bin Zhang
- Fisheries College of Guangdong Ocean University, Zhanjiang, Guangdong, 524088, China
| | - Haiying Liang
- Fisheries College of Guangdong Ocean University, Zhanjiang, Guangdong, 524088, China; Guangdong Provincial Key Laboratory of Aquatic Animal Disease Control and Healthy Culture, Zhanjiang, Guangdong, 524088, China.
| | - Chaojie Li
- Fisheries College of Guangdong Ocean University, Zhanjiang, Guangdong, 524088, China
| | - Jie Chen
- Fisheries College of Guangdong Ocean University, Zhanjiang, Guangdong, 524088, China
| | - Yifan Wu
- Fisheries College of Guangdong Ocean University, Zhanjiang, Guangdong, 524088, China
| |
Collapse
|
4
|
Xie ZX, He YB, Zhang SF, Lin L, Wang MH, Wang DZ. Metaexoproteomics Reveals Microbial Behavior in the Ocean's Interior. Front Microbiol 2022; 13:749874. [PMID: 35250917 PMCID: PMC8889253 DOI: 10.3389/fmicb.2022.749874] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2021] [Accepted: 01/10/2022] [Indexed: 11/13/2022] Open
Abstract
The proteins present in the extracellular environment of cells, named the "exoproteome," are critical for microbial survival, growth, and interaction with their surroundings. However, little is known about microbial exoproteomes in natural marine environments. Here, we used a metaproteomic approach to characterize the exoprotein profiles (10 kDa-0.2 μm) throughout a water column in the South China Sea. Viruses, together with Alpha- and Gammaproteobacteria were the predominant contributors. However, the exoprotein-producing microbial communities varied with depth: SAR11 in the shallow waters, Pseudomonadales and Nitrososphaeria in the mesopelagic layer, and Alteromonadales, Rhizobiales, and Betaproteobacteria in the bathypelagic layer. Besides viral and unknown proteins, diverse transporters contributed substantially to the exoproteomes and varied vertically in their microbial origins, but presented similar patterns in their predicted substrate identities throughout the water column. Other microbial metabolic processes subject to vertical zonation included proteolysis, the oxidation of ammonia, nitrite and carbon monoxide, C1 metabolism, and the degradation of sulfur-containing dissolved organic matter (DOM). Our metaexoproteomic study provides insights into the depth-variable trends in the in situ ecological traits of the marine microbial community hidden in the non-cellular world, including nutrient cycling, niche partitioning and DOM remineralization.
Collapse
Affiliation(s)
- Zhang-Xian Xie
- State Key Laboratory of Marine Environmental Science/College of the Environment and Ecology, Xiamen University, Xiamen, China.,College of Ocean and Earth Sciences, Xiamen University, Xiamen, China.,Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai), Sun Yat-sen University, Zhuhai, China
| | | | - Shu-Feng Zhang
- State Key Laboratory of Marine Environmental Science/College of the Environment and Ecology, Xiamen University, Xiamen, China
| | - Lin Lin
- State Key Laboratory of Marine Environmental Science/College of the Environment and Ecology, Xiamen University, Xiamen, China.,Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai), Sun Yat-sen University, Zhuhai, China
| | - Ming-Hua Wang
- State Key Laboratory of Marine Environmental Science/College of the Environment and Ecology, Xiamen University, Xiamen, China
| | - Da-Zhi Wang
- State Key Laboratory of Marine Environmental Science/College of the Environment and Ecology, Xiamen University, Xiamen, China.,Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai), Sun Yat-sen University, Zhuhai, China
| |
Collapse
|
5
|
Effects of DISC1 on Alzheimer's disease cell models assessed by iTRAQ proteomics analysis. Biosci Rep 2022; 42:230594. [PMID: 34981809 PMCID: PMC8753346 DOI: 10.1042/bsr20211150] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2021] [Revised: 12/21/2021] [Accepted: 12/23/2021] [Indexed: 11/17/2022] Open
Abstract
Alzheimer’s disease (AD) is a form of neurodegenerative disease in the elderly with no cure at present. In a previous study, we found that the scaffold protein, disrupted in Schizophrenia 1 (DISC1) is down-regulated in the AD brains, and ectopic expression of DISC1 can delay the progression of AD by protecting synaptic plasticity and down-regulating BACE1. However, the underlying mechanisms remain not to be elucidated. In the present study, we compared the proteomes of normal and DISC1high AD cells expressing the amyloid precursor protein (APP) using isobaric tag for relative and absolute quantitation (iTRAQ) and mass spectrometry (MS). The differentially expressed proteins (DEPs) were identified, and the protein–protein interaction (PPI) network was constructed to identify the interacting partners of DISC1. Based on the interaction scores, NDE1, GRM3, PTGER3 and KATNA1 were identified as functionally or physically related to DISC1, and may therefore regulate AD development. The DEPs were functionally annotated by Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) databases with the DAVID software, and the Non-supervised Orthologous Groups (eggNOG) database was used to determine their evolutionary relationships. The DEPs were significantly enriched in microtubules and mitochondria-related pathways. Gene set enrichment analysis (GSEA) was performed to identify genes and pathways that are activated when DISC1 is overexpressed. Our findings provide novel insights into the regulatory mechanisms underlying DISC1 function in AD.
Collapse
|
6
|
Cassidy L, Kaulich PT, Maaß S, Bartel J, Becher D, Tholey A. Bottom-up and top-down proteomic approaches for the identification, characterization, and quantification of the low molecular weight proteome with focus on short open reading frame-encoded peptides. Proteomics 2021; 21:e2100008. [PMID: 34145981 DOI: 10.1002/pmic.202100008] [Citation(s) in RCA: 23] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2021] [Revised: 06/09/2021] [Accepted: 06/09/2021] [Indexed: 01/14/2023]
Abstract
The recent discovery of alternative open reading frames creates a need for suitable analytical approaches to verify their translation and to characterize the corresponding gene products at the molecular level. As the analysis of small proteins within a background proteome by means of classical bottom-up proteomics is challenging, method development for the analysis of small open reading frame encoded peptides (SEPs) have become a focal point for research. Here, we highlight bottom-up and top-down proteomics approaches established for the analysis of SEPs in both pro- and eukaryotes. Major steps of analysis, including sample preparation and (small) proteome isolation, separation and mass spectrometry, data interpretation and quality control, quantification, the analysis of post-translational modifications, and exploration of functional aspects of the SEPs by means of proteomics technologies are described. These methods do not exclusively cover the analytics of SEPs but simultaneously include the low molecular weight proteome, and moreover, can also be used for the proteome-wide analysis of proteolytic processing events.
Collapse
Affiliation(s)
- Liam Cassidy
- Systematic Proteome Research & Bioanalytics, Institute for Experimental Medicine, Christian-Albrechts-Universität zu Kiel, Kiel, Germany
| | - Philipp T Kaulich
- Systematic Proteome Research & Bioanalytics, Institute for Experimental Medicine, Christian-Albrechts-Universität zu Kiel, Kiel, Germany
| | - Sandra Maaß
- Department of Microbial Proteomics, Institute of Microbiology, University of Greifswald, Greifswald, Germany
| | - Jürgen Bartel
- Department of Microbial Proteomics, Institute of Microbiology, University of Greifswald, Greifswald, Germany
| | - Dörte Becher
- Department of Microbial Proteomics, Institute of Microbiology, University of Greifswald, Greifswald, Germany
| | - Andreas Tholey
- Systematic Proteome Research & Bioanalytics, Institute for Experimental Medicine, Christian-Albrechts-Universität zu Kiel, Kiel, Germany
| |
Collapse
|
7
|
Wang C, Feng Y, Zhou WJ, Cheng ZJ, Jiang MY, Zhou Y, Fei XY. Screening and identification of endometrial proteins as novel potential biomarkers for repeated implantation failure. PeerJ 2021; 9:e11009. [PMID: 33763303 PMCID: PMC7958897 DOI: 10.7717/peerj.11009] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2020] [Accepted: 02/03/2021] [Indexed: 01/11/2023] Open
Abstract
Inadequate endometrial receptivity may be responsible for the low implantation rate of transferred embryos in in vitro fertilization (IVF) treatments. Patients with repeated implantation failure (RIF) impact the clinical pregnancy rate for IVF. We collected endometrial tissue during the implantation window of hysteroscopy biopsies from September 2016 to December 2019 and clinical data were collected simultaneously. Patients were divided into RIF and pregnant controls group according to pregnancy outcomes. A total of 82 differentially expressed endometrial proteins were identified, including 55 up-regulated proteins (>1.50-fold, P < 0.05) and 27 down-regulated proteins (<0.67-fold, P < 0.05) by iTRAQ labeling coupled with the 2D LC MS/MS technique in the RIF group. String analysis found interactions between these proteins which assembled in two bunches: ribosomal proteins and blood homeostasis proteins. The most significant enriched Gene Ontology terms were negative regulation of hydrolase activity, blood microparticle, and enzyme inhibitor activity. Our results emphasized the corticosteroid-binding globulin and fetuin-A as the specific proteins of endometrial receptivity by Western-blot. Our study provided experimental data to establish the objective indicator of endometrial receptivity, and also provided new insight into the pathogenesis of RIF.
Collapse
Affiliation(s)
- Chong Wang
- Hangzhou Women's Hospital (Hangzhou Maternity and Child Health Care Hospital), Hangzhou, China.,Affiliated Hangzhou First People's Hospital, Zhejiang University School of Medicine, Hangzhou, China
| | - Ying Feng
- Hangzhou Women's Hospital (Hangzhou Maternity and Child Health Care Hospital), Hangzhou, China
| | - Wen-Jing Zhou
- Hangzhou Women's Hospital (Hangzhou Maternity and Child Health Care Hospital), Hangzhou, China
| | - Zhao-Jun Cheng
- Hangzhou Women's Hospital (Hangzhou Maternity and Child Health Care Hospital), Hangzhou, China
| | - Mei-Yan Jiang
- Hangzhou Women's Hospital (Hangzhou Maternity and Child Health Care Hospital), Hangzhou, China
| | - Yan Zhou
- Hangzhou Women's Hospital (Hangzhou Maternity and Child Health Care Hospital), Hangzhou, China
| | - Xiao-Yang Fei
- Hangzhou Women's Hospital (Hangzhou Maternity and Child Health Care Hospital), Hangzhou, China
| |
Collapse
|
8
|
Wang C, Fei X, Zhang H, Zhou W, Cheng Z, Feng Y. Proteomic Analysis of the Alterations in Follicular Fluid Proteins During Oocyte Maturation in Humans. Front Endocrinol (Lausanne) 2021; 12:830691. [PMID: 35185790 PMCID: PMC8850365 DOI: 10.3389/fendo.2021.830691] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/07/2021] [Accepted: 12/31/2021] [Indexed: 11/13/2022] Open
Abstract
Many components in ovarian follicles (follicular fluid, cumulus cells, granular cells, etc.) dynamically change during folliculogenesis and play a positive or negative role in oocyte maturation. Infertile women who underwent intracytoplasmic sperm injection (ICSI) treatment in the reproductive medicine centre of Hangzhou Women's Hospital between October 2018 and October 2021 were included. The ovarian follicular fluid and cumulus cells of diminished ovarian response (DOR) patients and control subjects with medical records of clinical data were collected. In total, 31 differentially expressed proteins, including 10 upregulated proteins (>1.50-fold, P<0.05) and 21 downregulated proteins (<0.67-fold, P<0.05), were identified in mature vs. immature oocytes by iTRAQ labelling coupled with 2D LC-MS/MS. GO analysis revealed that 'cell population proliferation' was the most diverse enrichment trend between up/downregulated proteins, while phagosome process and the PI3K-Akt signaling pathway were the two most significant pathways revealed by KEGG enrichment classification. Human prostatic acid phosphatase (PAP, ACPP) and CD5 antigen-like (CD5L) were two proteins verified by ELISA to be differentially expressed between MII and Gv oocytes (P<0.0001 and P<0.0001, respectively). Further measurement found significantly lower level of ACPP in follicular fluids and cumulus cells of DOR patients (P=0.028 and P=0.004, respectively), as an indicator of oocyte quality. Otherwise, CD5L level is upregulated in follicular fluid of DOR patients (P<0.0001). Our study provided experimental data to establish the objective indicator of oocyte maturation in the microenvironment of ovarian follicles, and also provided new insight into the measurement of oocyte quality.
Collapse
Affiliation(s)
- Chong Wang
- Reproductive Medicine Center, Hangzhou Women’s Hospital (Hangzhou Maternity and Child Health Care Hospital), Hangzhou, China
- Affiliated Hangzhou First People’s Hospital, Zhejiang University School of Medicine, Hangzhou, China
| | - Xiaoyang Fei
- Reproductive Medicine Center, Hangzhou Women’s Hospital (Hangzhou Maternity and Child Health Care Hospital), Hangzhou, China
| | - Hongyan Zhang
- Reproductive Medicine Center, Hangzhou Women’s Hospital (Hangzhou Maternity and Child Health Care Hospital), Hangzhou, China
| | - Wenjing Zhou
- Reproductive Medicine Center, Hangzhou Women’s Hospital (Hangzhou Maternity and Child Health Care Hospital), Hangzhou, China
| | - Zhaojun Cheng
- Reproductive Medicine Center, Hangzhou Women’s Hospital (Hangzhou Maternity and Child Health Care Hospital), Hangzhou, China
| | - Ying Feng
- Reproductive Medicine Center, Hangzhou Women’s Hospital (Hangzhou Maternity and Child Health Care Hospital), Hangzhou, China
- *Correspondence: Ying Feng,
| |
Collapse
|
9
|
Li K, Jain A, Malovannaya A, Wen B, Zhang B. DeepRescore: Leveraging Deep Learning to Improve Peptide Identification in Immunopeptidomics. Proteomics 2020; 20:e1900334. [PMID: 32864883 PMCID: PMC7718998 DOI: 10.1002/pmic.201900334] [Citation(s) in RCA: 28] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2020] [Revised: 08/27/2020] [Indexed: 12/23/2022]
Abstract
The identification of major histocompatibility complex (MHC)-binding peptides in mass spectrometry (MS)-based immunopeptideomics relies largely on database search engines developed for proteomics data analysis. However, because immunopeptidomics experiments do not involve enzymatic digestion at specific residues, an inflated search space leads to a high false positive rate and low sensitivity in peptide identification. In order to improve the sensitivity and reliability of peptide identification, a post-processing tool named DeepRescore is developed. DeepRescore combines peptide features derived from deep learning predictions, namely accurate retention timeand MS/MS spectra predictions, with previously used features to rescore peptide-spectrum matches. Using two public immunopeptidomics datasets, it is shown that rescoring by DeepRescore increases both the sensitivity and reliability of MHC-binding peptide and neoantigen identifications compared to existing methods. It is also shown that the performance improvement is, to a large extent, driven by the deep learning-derived features. DeepRescore is developed using NextFlow and Docker and is available at https://github.com/bzhanglab/DeepRescore.
Collapse
Affiliation(s)
- Kai Li
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX 77030, USA
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Antrix Jain
- Mass Spectrometry Proteomics Core, Baylor College of Medicine, Houston, TX 77030, USA
| | - Anna Malovannaya
- Mass Spectrometry Proteomics Core, Baylor College of Medicine, Houston, TX 77030, USA
- Verna and Marrs McLean Department of Biochemistry and Molecular Biology, Baylor College of Medicine, Houston, TX 77030, USA
| | - Bo Wen
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX 77030, USA
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Bing Zhang
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX 77030, USA
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| |
Collapse
|
10
|
Quan M, Peng J, Zhu Z, Zhou P, Luo S, Xie J, Xia L, Sun Y, Ding X. Construction of a Conditionally Asporogenous Bacillus thuringiensis Recombinant Strain Overproducing Cry Protein by Deletion of the leuB Gene. Front Microbiol 2020; 11:1769. [PMID: 32849393 PMCID: PMC7396631 DOI: 10.3389/fmicb.2020.01769] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2019] [Accepted: 07/06/2020] [Indexed: 11/13/2022] Open
Abstract
One of the common shortcomings with Bacillus thuringiensis (Bt) biopesticides in field application is their instability under UV irradiation. In Bt, the leuB gene encodes the 3-isopropylmalate dehydrogenase. In addition to its role in leucine biosynthesis, LeuB would be likely recruited to catalyze the dehydrogenation of malate in the final step of tricarboxylic acid cycle during sporulation. In this study, we constructed a Bt recombinant strain in which the gene leuB was deleted by using the markerless gene deletion system. The ΔleuB mutant strain showed a conditionally asporogenous phenotype while overproducing insecticidal crystal proteins and retaining its insecticidal activity well in both fermentation and LB media. Furthermore, the metabolic regulation mechanisms of LeuB was elucidated by iTRAQ-based quantitative proteomics approach. Evidences from proteomics data suggested that the inhibited supply of pyruvate (carbon source) was an important factor related to the conditionally asporogenous feature of the mutant. Consistently, the mutant regained its ability to sporulate in LB medium by adding 1% glucose or 1% sodium pyruvate. Taken together, our study demonstrated that deletion of the leuB gene resulted in delayed or completely blocked mother cell lysis, allowing the crystals encapsulated within cells, which makes this recombinant strain a good candidate for developing Bt preparations with better UV-stability.
Collapse
Affiliation(s)
- Meifang Quan
- Hunan Provincial Key Laboratory of Microbial Molecular Biology, State Key Laboratory of Developmental Biology of Freshwater Fish, College of Life Science, Hunan Normal University, Changsha, China.,Key Laboratory of Molecular Epidemiology of Hunan Province, School of Medicine, Hunan Normal University, Changsha, China
| | - Jinli Peng
- Hunan Provincial Key Laboratory of Microbial Molecular Biology, State Key Laboratory of Developmental Biology of Freshwater Fish, College of Life Science, Hunan Normal University, Changsha, China
| | - Zirong Zhu
- Hunan Provincial Key Laboratory of Microbial Molecular Biology, State Key Laboratory of Developmental Biology of Freshwater Fish, College of Life Science, Hunan Normal University, Changsha, China
| | - Pengji Zhou
- Hunan Provincial Key Laboratory of Microbial Molecular Biology, State Key Laboratory of Developmental Biology of Freshwater Fish, College of Life Science, Hunan Normal University, Changsha, China
| | - Sisi Luo
- Hunan Provincial Key Laboratory of Microbial Molecular Biology, State Key Laboratory of Developmental Biology of Freshwater Fish, College of Life Science, Hunan Normal University, Changsha, China
| | - Junyan Xie
- Hunan Provincial Key Laboratory of Microbial Molecular Biology, State Key Laboratory of Developmental Biology of Freshwater Fish, College of Life Science, Hunan Normal University, Changsha, China
| | - Liqiu Xia
- Hunan Provincial Key Laboratory of Microbial Molecular Biology, State Key Laboratory of Developmental Biology of Freshwater Fish, College of Life Science, Hunan Normal University, Changsha, China
| | - Yunjun Sun
- Hunan Provincial Key Laboratory of Microbial Molecular Biology, State Key Laboratory of Developmental Biology of Freshwater Fish, College of Life Science, Hunan Normal University, Changsha, China
| | - Xuezhi Ding
- Hunan Provincial Key Laboratory of Microbial Molecular Biology, State Key Laboratory of Developmental Biology of Freshwater Fish, College of Life Science, Hunan Normal University, Changsha, China
| |
Collapse
|
11
|
Li H, Zhou R, Xu S, Chen X, Hong Y, Lu Q, Liu H, Zhou B, Liang X. Improving Gene Annotation of the Peanut Genome by Integrated Proteogenomics Workflow. J Proteome Res 2020; 19:2226-2235. [DOI: 10.1021/acs.jproteome.9b00723] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022]
Affiliation(s)
- Haifen Li
- Crops Research Institute, Guangdong Academy of Agricultural Sciences, Guangdong Key Laboratory for Crops Genetic Improvement, South China Peanut Sub-Center of National Center of Oilseed Crops Improvement, Guangzhou 510640, China
| | - Ruo Zhou
- Deepxomics Co., Ltd., Shenzhen 518000, China
| | - Shaohang Xu
- Deepxomics Co., Ltd., Shenzhen 518000, China
| | - Xiaoping Chen
- Crops Research Institute, Guangdong Academy of Agricultural Sciences, Guangdong Key Laboratory for Crops Genetic Improvement, South China Peanut Sub-Center of National Center of Oilseed Crops Improvement, Guangzhou 510640, China
| | - Yanbin Hong
- Crops Research Institute, Guangdong Academy of Agricultural Sciences, Guangdong Key Laboratory for Crops Genetic Improvement, South China Peanut Sub-Center of National Center of Oilseed Crops Improvement, Guangzhou 510640, China
| | - Qing Lu
- Crops Research Institute, Guangdong Academy of Agricultural Sciences, Guangdong Key Laboratory for Crops Genetic Improvement, South China Peanut Sub-Center of National Center of Oilseed Crops Improvement, Guangzhou 510640, China
| | - Hao Liu
- Crops Research Institute, Guangdong Academy of Agricultural Sciences, Guangdong Key Laboratory for Crops Genetic Improvement, South China Peanut Sub-Center of National Center of Oilseed Crops Improvement, Guangzhou 510640, China
| | - Baojin Zhou
- Deepxomics Co., Ltd., Shenzhen 518000, China
| | - Xuanqiang Liang
- Crops Research Institute, Guangdong Academy of Agricultural Sciences, Guangdong Key Laboratory for Crops Genetic Improvement, South China Peanut Sub-Center of National Center of Oilseed Crops Improvement, Guangzhou 510640, China
| |
Collapse
|
12
|
Liu H, Hong Y, Lu Q, Li H, Gu J, Ren L, Deng L, Zhou B, Chen X, Liang X. Integrated Analysis of Comparative Lipidomics and Proteomics Reveals the Dynamic Changes of Lipid Molecular Species in High-Oleic Acid Peanut Seed. JOURNAL OF AGRICULTURAL AND FOOD CHEMISTRY 2020; 68:426-438. [PMID: 31855429 DOI: 10.1021/acs.jafc.9b04179] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
Modern peanut contains fatty acid desaturase 2 (FAD2) mutation, which is capable of producing high oleic acid for human health. However, the dynamic changes of the lipidome regarding fad2 remain elusive in peanut seed. In the present study, 547 lipid features were identified in high- and normal-oleic peanut seeds by utilizing the mass spectrometric approach. The fad2-induced differently expressed lipids (DELs) were polarly distributed at early and maturation stages during high-oleic acid (OA) seed development. Subsequently, integration of previously published proteomic data and lipidomic data revealed that 21 proteins and 149 DELs were annotated into the triacylglycerol assembly map, of which nine enzymes and 31 lipid species shared similar variation tendencies. Additionally, the variation tendencies of 17 acyl fatty acids were described in a hypothetical biosynthetic pathway. Collectively, the understanding of the lipid composition correlated with fad2 established a foundation for future high-OA peanut breeding based on lipidomic data.
Collapse
Affiliation(s)
- Hao Liu
- Guangdong Provincial Key Laboratory of Crop Genetic Improvement, South China Peanut Sub-Center of National Center of Oilseed Crops Improvement , Crops Research Institute, Guangdong Academy of Agricultural Sciences , Guangzhou 510640 , China
| | - Yanbin Hong
- Guangdong Provincial Key Laboratory of Crop Genetic Improvement, South China Peanut Sub-Center of National Center of Oilseed Crops Improvement , Crops Research Institute, Guangdong Academy of Agricultural Sciences , Guangzhou 510640 , China
| | - Qing Lu
- Guangdong Provincial Key Laboratory of Crop Genetic Improvement, South China Peanut Sub-Center of National Center of Oilseed Crops Improvement , Crops Research Institute, Guangdong Academy of Agricultural Sciences , Guangzhou 510640 , China
| | - Haifen Li
- Guangdong Provincial Key Laboratory of Crop Genetic Improvement, South China Peanut Sub-Center of National Center of Oilseed Crops Improvement , Crops Research Institute, Guangdong Academy of Agricultural Sciences , Guangzhou 510640 , China
| | - Jianzhong Gu
- Peanut Research Institute , Kaifeng Academy of Agriculture and Forestry , Kaifeng 475004 , China
| | - Li Ren
- Peanut Research Institute , Kaifeng Academy of Agriculture and Forestry , Kaifeng 475004 , China
| | - Li Deng
- Peanut Research Institute , Kaifeng Academy of Agriculture and Forestry , Kaifeng 475004 , China
| | - Baojin Zhou
- Shenzhen Deepxomics Biotechnology Co. Ltd. , Shenzhen 518000 , China
| | - Xiaoping Chen
- Guangdong Provincial Key Laboratory of Crop Genetic Improvement, South China Peanut Sub-Center of National Center of Oilseed Crops Improvement , Crops Research Institute, Guangdong Academy of Agricultural Sciences , Guangzhou 510640 , China
| | - Xuanqiang Liang
- Guangdong Provincial Key Laboratory of Crop Genetic Improvement, South China Peanut Sub-Center of National Center of Oilseed Crops Improvement , Crops Research Institute, Guangdong Academy of Agricultural Sciences , Guangzhou 510640 , China
| |
Collapse
|
13
|
Halloran JT, Zhang H, Kara K, Renggli C, The M, Zhang C, Rocke DM, Käll L, Noble WS. Speeding Up Percolator. J Proteome Res 2019; 18:3353-3359. [PMID: 31407580 PMCID: PMC6884961 DOI: 10.1021/acs.jproteome.9b00288] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
The processing of peptide tandem mass spectrometry data involves matching observed spectra against a sequence database. The ranking and calibration of these peptide-spectrum matches can be improved substantially using a machine learning postprocessor. Here, we describe our efforts to speed up one widely used postprocessor, Percolator. The improved software is dramatically faster than the previous version of Percolator, even when using relatively few processors. We tested the new version of Percolator on a data set containing over 215 million spectra and recorded an overall reduction to 23% of the running time as compared to the unoptimized code. We also show that the memory footprint required by these speedups is modest relative to that of the original version of Percolator.
Collapse
Affiliation(s)
- John T. Halloran
- Department of Public Health Sciences, University of California, Davis, Davis, CA, USA
| | - Hantian Zhang
- Department of Computer Science, ETH Zurich, Zurich, Switzerland
| | - Kaan Kara
- Department of Computer Science, ETH Zurich, Zurich, Switzerland
| | - Cédric Renggli
- Department of Computer Science, ETH Zurich, Zurich, Switzerland
| | - Matthew The
- Science for Life Laboratory, KTH — Royal Institute of Technology, Solna, Sweden
| | - Ce Zhang
- Department of Computer Science, ETH Zurich, Zurich, Switzerland
| | - David M. Rocke
- Department of Public Health Sciences, University of California, Davis, Davis, CA, USA
| | - Lukas Käll
- Science for Life Laboratory, KTH — Royal Institute of Technology, Solna, Sweden
| | - William Stafford Noble
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
- Paul Allen School of Computer Science and Engineering, University of Washington, Seattle, WA, USA
| |
Collapse
|
14
|
Fan L, Wang L, Wang Z. Proteomic characterization of the hepatopancreas in the Pacific white shrimp Litopenaeus vannamei under cold stress: Revealing the organism homeostasis mechanism. FISH & SHELLFISH IMMUNOLOGY 2019; 92:438-449. [PMID: 31229644 DOI: 10.1016/j.fsi.2019.06.037] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/23/2019] [Revised: 05/25/2019] [Accepted: 06/17/2019] [Indexed: 06/09/2023]
Abstract
To understand the homeostasis mechanism of crustacean hepatopancreas to cold stress, iTRAQ proteomics based on the genome database of Litopenaeus vannamei (L. vannamei) was applied to investigate proteins changes and variety of the hepatopancreas during cold stress stage in this study. A total of 4062 distinct proteins were identified, 137 differentially expressed proteins (DEPs) including 62 differentially up-regulated proteins (DUPs) and 75 differentially down-regulated proteins (DDPs) were identified in G1 (18 °C) compared with CK (28 °C), 359 DEPs including 131 DUPs and 228 DDPs were identified in G2 (13 °C for 24 h) compared with CK. Based on bioinformatics analysis, the cold tolerance of L. vannamei might be related to energy metabolism such as amino acid, carbohydrate, lipid, and oxidative phosphorylation. Moreover, shrimp immunity was declined during cold stress stage. However, L. vannamei could cope with cold stress by enhancing the production of ATP and UFA. Notably, arginine kinase, heat shock proteins, and histones may act as positive regulators in L. vannamei under cold stress. Ten randomly selected proteins were used for validation using qRT-PCR and the expressions on the transcription level for most of the genes were similar to the results of iTRAQ. These results indicated that L. vannamei can maintain the organism homeostasis by a series of orderly regulatory process during cold stress. Furthermore, the results can provide guidance for shrimp farming.
Collapse
Affiliation(s)
- Lanfen Fan
- College of Marine Sciences, South China Agricultural University, Guangzhou, 510642, China; Integrative Microbiology Research Centre, South China Agricultural University, Guangzhou, 510642, China.
| | - Lei Wang
- College of Marine Sciences, South China Agricultural University, Guangzhou, 510642, China; Department of Pharmaceutical Engineering, College of Materials and Energy, South China Agricultural University, Guangzhou, 510642, China
| | - Zhenlu Wang
- College of Marine Sciences, South China Agricultural University, Guangzhou, 510642, China
| |
Collapse
|
15
|
Vasaikar S, Huang C, Wang X, Petyuk VA, Savage SR, Wen B, Dou Y, Zhang Y, Shi Z, Arshad OA, Gritsenko MA, Zimmerman LJ, McDermott JE, Clauss TR, Moore RJ, Zhao R, Monroe ME, Wang YT, Chambers MC, Slebos RJC, Lau KS, Mo Q, Ding L, Ellis M, Thiagarajan M, Kinsinger CR, Rodriguez H, Smith RD, Rodland KD, Liebler DC, Liu T, Zhang B. Proteogenomic Analysis of Human Colon Cancer Reveals New Therapeutic Opportunities. Cell 2019; 177:1035-1049.e19. [PMID: 31031003 DOI: 10.1016/j.cell.2019.03.030] [Citation(s) in RCA: 429] [Impact Index Per Article: 85.8] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2018] [Revised: 11/22/2018] [Accepted: 03/12/2019] [Indexed: 12/12/2022]
Abstract
We performed the first proteogenomic study on a prospectively collected colon cancer cohort. Comparative proteomic and phosphoproteomic analysis of paired tumor and normal adjacent tissues produced a catalog of colon cancer-associated proteins and phosphosites, including known and putative new biomarkers, drug targets, and cancer/testis antigens. Proteogenomic integration not only prioritized genomically inferred targets, such as copy-number drivers and mutation-derived neoantigens, but also yielded novel findings. Phosphoproteomics data associated Rb phosphorylation with increased proliferation and decreased apoptosis in colon cancer, which explains why this classical tumor suppressor is amplified in colon tumors and suggests a rationale for targeting Rb phosphorylation in colon cancer. Proteomics identified an association between decreased CD8 T cell infiltration and increased glycolysis in microsatellite instability-high (MSI-H) tumors, suggesting glycolysis as a potential target to overcome the resistance of MSI-H tumors to immune checkpoint blockade. Proteogenomics presents new avenues for biological discoveries and therapeutic development.
Collapse
Affiliation(s)
- Suhas Vasaikar
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX 77030, USA; Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Chen Huang
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX 77030, USA; Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Xiaojing Wang
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX 77030, USA; Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Vladislav A Petyuk
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA 99352, USA
| | - Sara R Savage
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN 37232, USA
| | - Bo Wen
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX 77030, USA; Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Yongchao Dou
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX 77030, USA; Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Yun Zhang
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX 77030, USA
| | - Zhiao Shi
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX 77030, USA; Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Osama A Arshad
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA 99352, USA
| | - Marina A Gritsenko
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA 99352, USA
| | - Lisa J Zimmerman
- Department of Biochemistry, Vanderbilt University, Nashville, TN 37232, USA
| | - Jason E McDermott
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA 99352, USA
| | - Therese R Clauss
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA 99352, USA
| | - Ronald J Moore
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA 99352, USA
| | - Rui Zhao
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA 99352, USA
| | - Matthew E Monroe
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA 99352, USA
| | - Yi-Ting Wang
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA 99352, USA
| | - Matthew C Chambers
- Department of Biochemistry, Vanderbilt University, Nashville, TN 37232, USA
| | - Robbert J C Slebos
- Department of Biochemistry, Vanderbilt University, Nashville, TN 37232, USA
| | - Ken S Lau
- Department of Cell and Developmental Biology, Vanderbilt University, Nashville, TN 37232, USA
| | - Qianxing Mo
- Dan L Duncan Comprehensive Cancer Center, Baylor College of Medicine, Houston, TX 77030, USA
| | - Li Ding
- The McDonnell Genome Institute, Washington University in St. Louis, Forest Park Avenue, Campus Box 8501, St. Louis, MO 63108, USA
| | - Matthew Ellis
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX 77030, USA; Dan L Duncan Comprehensive Cancer Center, Baylor College of Medicine, Houston, TX 77030, USA
| | - Mathangi Thiagarajan
- Leidos Biomedical Research Inc., Frederick National Laboratory for Cancer Research, Frederick, MD 21702, USA
| | - Christopher R Kinsinger
- Office of Cancer Clinical Proteomics Research, National Cancer Institute, Bethesda, MD 20892, USA
| | - Henry Rodriguez
- Office of Cancer Clinical Proteomics Research, National Cancer Institute, Bethesda, MD 20892, USA
| | - Richard D Smith
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA 99352, USA
| | - Karin D Rodland
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA 99352, USA; Department of Cell, Developmental, and Cancer Biology, Oregon Health & Science University, Portland, OR 97221, USA.
| | - Daniel C Liebler
- Department of Biochemistry, Vanderbilt University, Nashville, TN 37232, USA.
| | - Tao Liu
- Biological Sciences Division, Pacific Northwest National Laboratory, Richland, WA 99352, USA.
| | - Bing Zhang
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, TX 77030, USA; Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA; Dan L Duncan Comprehensive Cancer Center, Baylor College of Medicine, Houston, TX 77030, USA.
| | | |
Collapse
|
16
|
Rapid evolution of protein diversity by de novo origination in Oryza. Nat Ecol Evol 2019; 3:679-690. [PMID: 30858588 DOI: 10.1038/s41559-019-0822-5] [Citation(s) in RCA: 85] [Impact Index Per Article: 17.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2018] [Accepted: 01/23/2019] [Indexed: 12/22/2022]
Abstract
New protein-coding genes that arise de novo from non-coding DNA sequences contribute to protein diversity. However, de novo gene origination is challenging to study as it requires high-quality reference genomes for closely related species, evidence for ancestral non-coding sequences, and transcription and translation of the new genes. High-quality genomes of 13 closely related Oryza species provide unprecedented opportunities to understand de novo origination events. Here, we identify a large number of young de novo genes with discernible recent ancestral non-coding sequences and evidence of translation. Using pipelines examining the synteny relationship between genomes and reciprocal-best whole-genome alignments, we detected at least 175 de novo open reading frames in the focal species O. sativa subspecies japonica, which were all detected in RNA sequencing-based transcriptomes. Mass spectrometry-based targeted proteomics and ribosomal profiling show translational evidence for 57% of the de novo genes. In recent divergence of Oryza, an average of 51.5 de novo genes per million years were generated and retained. We observed evolutionary patterns in which excess indels and early transcription were favoured in origination with a stepwise formation of gene structure. These data reveal that de novo genes contribute to the rapid evolution of protein diversity under positive selection.
Collapse
|
17
|
Wen B, Wang X, Zhang B. PepQuery enables fast, accurate, and convenient proteomic validation of novel genomic alterations. Genome Res 2019; 29:485-493. [PMID: 30610011 PMCID: PMC6396417 DOI: 10.1101/gr.235028.118] [Citation(s) in RCA: 57] [Impact Index Per Article: 11.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2018] [Accepted: 12/28/2018] [Indexed: 12/20/2022]
Abstract
Massively parallel or second-generation sequencing-based genomic studies continuously identify new genomic alterations that may lead to novel protein sequences, which are attractive candidates for disease biomarkers and therapeutic targets after proteomic validation. Integrative proteogenomic methods have been developed to use mass spectrometry (MS)-based proteomics data for such validation. These methods replace the reference sequence database in proteomic database searching with a customized protein database that incorporates sample- or disease-specific sequences derived from DNA or RNA sequencing, thus enabling the identification of novel protein sequences. Although useful, this spectrum-centric approach requires a full evaluation of all possible spectrum-peptide pairs, which is time-consuming, error-prone, and difficult to apply. Here, we present PepQuery, a peptide-centric approach that focuses on only novel DNA or protein sequences of interest. PepQuery allows quick and easy proteomic validation of genomic alterations without customized database construction. We demonstrated the sensitivity and specificity of the approach in validating completely novel proteins, novel splice junctions, and single amino acid variants using simulations and experimental data. Notably, enabling unrestricted modification searching in PepQuery reduced false positives by up to 95%. We implemented PepQuery as both web-based and stand-alone applications. The web version provides direct access to more than half a billion MS/MS spectra from the Clinical Proteomic Tumor Analysis Consortium (CPTAC) and other cancer proteomic studies. The stand-alone version supports batch analysis and user-provided MS/MS data. PepQuery will increase the usage of proteogenomics beyond the proteomics community and will broaden the application of proteogenomics in personalized medicine.
Collapse
Affiliation(s)
- Bo Wen
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, Texas 77030, USA
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Xiaojing Wang
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, Texas 77030, USA
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas 77030, USA
| | - Bing Zhang
- Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, Texas 77030, USA
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas 77030, USA
| |
Collapse
|
18
|
Rong M, Zhou B, Zhou R, Liao Q, Zeng Y, Xu S, Liu Z. PPIP: Automated Software for Identification of Bioactive Endogenous Peptides. J Proteome Res 2019; 18:721-727. [PMID: 30540478 DOI: 10.1021/acs.jproteome.8b00718] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023]
Abstract
Endogenous peptides play an important role in multiple biological processes in many species. Liquid chromatography coupled to tandem mass spectrometry (LC-MS/MS) is an important technique for detecting these peptides on a large scale. We present PPIP, which is a dedicated peptidogenomics software for identifying endogenous peptides based on peptidomics and RNA-Seq data. This software automates the de novo transcript assembly based on RNA-Seq data, construction of a protein reference database based on the de novo assembled transcripts, peptide identification, function analysis, and HTML-based report generation. Different function components are integrated using Docker technology. The Docker image of PPIP is available at https://hub.docker.com/r/shawndp/ppip , and the source code under GPL-3 license is available at https://github.com/Shawn-Xu/PPIP . A user manual of PPIP is available at https://shawn-xu.github.io/PPIP .
Collapse
Affiliation(s)
- Mingqiang Rong
- The National & Local Joint Engineering Laboratory of Animal Peptide Drug Development, College of Life Sciences , Hunan Normal University , Changsha 410081 , Hunan , China
| | - Baojin Zhou
- Deepxomics Co., Ltd. , Shenzhen 518000 , China
| | - Ruo Zhou
- Deepxomics Co., Ltd. , Shenzhen 518000 , China
| | - Qiong Liao
- The National & Local Joint Engineering Laboratory of Animal Peptide Drug Development, College of Life Sciences , Hunan Normal University , Changsha 410081 , Hunan , China
| | - Yong Zeng
- The National & Local Joint Engineering Laboratory of Animal Peptide Drug Development, College of Life Sciences , Hunan Normal University , Changsha 410081 , Hunan , China
| | - Shaohang Xu
- Deepxomics Co., Ltd. , Shenzhen 518000 , China
| | - Zhonghua Liu
- The National & Local Joint Engineering Laboratory of Animal Peptide Drug Development, College of Life Sciences , Hunan Normal University , Changsha 410081 , Hunan , China
| |
Collapse
|
19
|
Yao B, Zhang M, Leng X, Zhao D. Proteomic analysis of the effects of antler extract on chondrocyte proliferation, differentiation and apoptosis. Mol Biol Rep 2019; 46:1635-1648. [DOI: 10.1007/s11033-019-04612-1] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2018] [Accepted: 01/17/2019] [Indexed: 01/09/2023]
|
20
|
Wang X, Li M, Liu X, Zhang L, Duan Q, Zhang J. Quantitative Proteomic Analysis of Castor ( Ricinus communis L.) Seeds During Early Imbibition Provided Novel Insights into Cold Stress Response. Int J Mol Sci 2019; 20:E355. [PMID: 30654474 PMCID: PMC6359183 DOI: 10.3390/ijms20020355] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2018] [Revised: 01/08/2019] [Accepted: 01/08/2019] [Indexed: 12/23/2022] Open
Abstract
Early planting is one of the strategies used to increase grain yield in temperate regions. However, poor cold tolerance in castor inhibits seed germination, resulting in lower seedling emergence and biomass. Here, the elite castor variety Tongbi 5 was used to identify the differential abundance protein species (DAPS) between cold stress (4 °C) and control conditions (30 °C) imbibed seeds. As a result, 127 DAPS were identified according to isobaric tag for relative and absolute quantification (iTRAQ) strategy. These DAPS were mainly involved in carbohydrate and energy metabolism, translation and posttranslational modification, stress response, lipid transport and metabolism, and signal transduction. Enzyme-linked immunosorbent assays (ELISA) demonstrated that the quantitative proteomics data collected here were reliable. This study provided some invaluable insights into the cold stress responses of early imbibed castor seeds: (1) up-accumulation of all DAPS involved in translation might confer cold tolerance by promoting protein synthesis; (2) stress-related proteins probably protect the cell against damage caused by cold stress; (3) up-accumulation of key DAPS associated with fatty acid biosynthesis might facilitate resistance or adaptation of imbibed castor seeds to cold stress by the increased content of unsaturated fatty acid (UFA). The data has been deposited to the ProteomeXchange with identifier PXD010043.
Collapse
Affiliation(s)
- Xiaoyu Wang
- College of Life Science, Inner Mongolia University for Nationalities, Tongliao 028000, China.
- Inner Mongolia Key Laboratory for Castor, Tongliao 028000, China.
- Inner Mongolia Industrial Engineering Research Center of Universities for Castor, Tongliao 028000, China.
- Inner Mongolia Collaborate Innovation Cultivate Center for Castor, Tongliao 028000, China.
- Horqin Plant Stress Biology Research Institute of Inner Mongolia University for Nationalities, Tongliao 028000, China.
| | - Min Li
- College of Agriculture, Inner Mongolia University for Nationalities, Tongliao 028000, China.
| | - Xuming Liu
- College of Life Science, Inner Mongolia University for Nationalities, Tongliao 028000, China.
- Inner Mongolia Key Laboratory for Castor, Tongliao 028000, China.
- Inner Mongolia Industrial Engineering Research Center of Universities for Castor, Tongliao 028000, China.
- Inner Mongolia Collaborate Innovation Cultivate Center for Castor, Tongliao 028000, China.
- Horqin Plant Stress Biology Research Institute of Inner Mongolia University for Nationalities, Tongliao 028000, China.
| | - Lixue Zhang
- College of Life Science, Inner Mongolia University for Nationalities, Tongliao 028000, China.
- Inner Mongolia Key Laboratory for Castor, Tongliao 028000, China.
- Inner Mongolia Industrial Engineering Research Center of Universities for Castor, Tongliao 028000, China.
- Inner Mongolia Collaborate Innovation Cultivate Center for Castor, Tongliao 028000, China.
- Horqin Plant Stress Biology Research Institute of Inner Mongolia University for Nationalities, Tongliao 028000, China.
| | - Qiong Duan
- College of Life Science, Inner Mongolia University for Nationalities, Tongliao 028000, China.
- Inner Mongolia Key Laboratory for Castor, Tongliao 028000, China.
- Inner Mongolia Industrial Engineering Research Center of Universities for Castor, Tongliao 028000, China.
- Inner Mongolia Collaborate Innovation Cultivate Center for Castor, Tongliao 028000, China.
- Horqin Plant Stress Biology Research Institute of Inner Mongolia University for Nationalities, Tongliao 028000, China.
| | - Jixing Zhang
- College of Life Science, Inner Mongolia University for Nationalities, Tongliao 028000, China.
- Inner Mongolia Key Laboratory for Castor, Tongliao 028000, China.
- Inner Mongolia Industrial Engineering Research Center of Universities for Castor, Tongliao 028000, China.
- Inner Mongolia Collaborate Innovation Cultivate Center for Castor, Tongliao 028000, China.
- Horqin Plant Stress Biology Research Institute of Inner Mongolia University for Nationalities, Tongliao 028000, China.
| |
Collapse
|
21
|
Ren Z, Qi D, Pugh N, Li K, Wen B, Zhou R, Xu S, Liu S, Jones AR. Improvements to the Rice Genome Annotation Through Large-Scale Analysis of RNA-Seq and Proteomics Data Sets. Mol Cell Proteomics 2019; 18:86-98. [PMID: 30293062 PMCID: PMC6317475 DOI: 10.1074/mcp.ra118.000832] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2018] [Revised: 08/31/2018] [Indexed: 01/22/2023] Open
Abstract
Rice (Oryza sativa) is one of the most important worldwide crops. The genome has been available for over 10 years and has undergone several rounds of annotation. We created a comprehensive database of transcripts from 29 public RNA sequencing data sets, officially predicted genes from Ensembl plants, and common contaminants in which to search for protein-level evidence. We re-analyzed nine publicly accessible rice proteomics data sets. In total, we identified 420K peptide spectrum matches from 47K peptides and 8,187 protein groups. 4168 peptides were initially classed as putative novel peptides (not matching official genes). Following a strict filtration scheme to rule out other possible explanations, we discovered 1,584 high confidence novel peptides. The novel peptides were clustered into 692 genomic loci where our results suggest annotation improvements. 80% of the novel peptides had an ortholog match in the curated protein sequence set from at least one other plant species. For the peptides clustering in intergenic regions (and thus potentially new genes), 101 loci were identified, for which 43 had a high-confidence hit for a protein domain. Our results can be displayed as tracks on the Ensembl genome or other browsers supporting Track Hubs, to support re-annotation of the rice genome.
Collapse
Affiliation(s)
- Zhe Ren
- From the ‡BGI-Shenzhen, Shenzhen 518083, China
| | - Da Qi
- From the ‡BGI-Shenzhen, Shenzhen 518083, China
| | - Nina Pugh
- §Institute of Integrative Biology, University of Liverpool, Liverpool, L69 7ZB, UK
| | - Kai Li
- From the ‡BGI-Shenzhen, Shenzhen 518083, China
| | - Bo Wen
- ‖Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas 77030;; ¶Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, Texas 77030
| | - Ruo Zhou
- From the ‡BGI-Shenzhen, Shenzhen 518083, China
| | - Shaohang Xu
- From the ‡BGI-Shenzhen, Shenzhen 518083, China
| | - Siqi Liu
- From the ‡BGI-Shenzhen, Shenzhen 518083, China;.
| | - Andrew R Jones
- §Institute of Integrative Biology, University of Liverpool, Liverpool, L69 7ZB, UK;.
| |
Collapse
|
22
|
The Chinese Medicinal Formulation Guzhi Zengsheng Zhitongwan Modulates Chondrocyte Structure, Dynamics, and Metabolism by Controlling Multiple Functional Proteins. BIOMED RESEARCH INTERNATIONAL 2019; 2018:9847286. [PMID: 30596102 PMCID: PMC6282133 DOI: 10.1155/2018/9847286] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/25/2018] [Accepted: 11/08/2018] [Indexed: 12/15/2022]
Abstract
Traditional Chinese medicine is one of the oldest medical systems in the world and has its unique principles and theories in the prevention and treatment of human diseases, which are achieved through the interactions of different types of materia medica in the form of Chinese medicinal formulations. GZZSZTW, a classical and effective Chinese medicinal formulation, was designed and created by professor Bailing Liu who is the only national medical master professor in the clinical research field of traditional Chinese medicine and skeletal diseases. GZZSZTW has been widely used in clinical settings for several decades for the treatment of joint diseases. However, the underlying molecular mechanisms are still largely unknown. In the present study, we performed quantitative proteomic analysis to investigate the effects of GZZSZTW on mouse primary chondrocytes using state-of-the-art iTRAQ technology. We demonstrated that the Chinese medicinal formulation GZZSZTW modulates chondrocyte structure, dynamics, and metabolism by controlling multiple functional proteins that are involved in the cellular processes of DNA replication and transcription, protein synthesis and degradation, cytoskeleton dynamics, and signal transduction. Thus, this study has expanded the current knowledge of the molecular mechanism of GZZSZTW treatment on chondrocytes. It has also shed new light on possible strategies to further prevent and treat cartilage-related diseases using traditional Chinese medicinal formulations.
Collapse
|
23
|
Ren Y, He Y, Lin Z, Zi J, Yang H, Zhang S, Lou X, Wang Q, Li S, Liu S. Reagents for Isobaric Labeling Peptides in Quantitative Proteomics. Anal Chem 2018; 90:12366-12371. [PMID: 30260629 DOI: 10.1021/acs.analchem.8b00321] [Citation(s) in RCA: 26] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/27/2023]
Abstract
Currently, the commercial reagents for isobaric peptides labeling (TMT and iTRAQ) have some drawbacks, such as high cost in experiments, especially in quantitation for the modified peptides, and inconvenient handling for variable sizes of samples. Herein, we developed a set of 10-plex isobaric tags (IBT) with high stability and low cost. The labeled peptides were sensitively detected on Orbitrap Q Exactive MS with an MS2 resolution of 35 000 at 30% NCE, while the peptides were efficiently labeled over 97% by IBT at a ratio of 10:1 of reagent/peptide (w/w) in 200 mM TEAB buffer for 2 h. The IBT labeling was demonstrated with a wide dynamic range of 50-fold without obvious matrix effects on quantification. Importantly, there was little quantification bias found among the individual IBT tags, indicating that the peptides labeled by different tags were quantitatively comparable. The IBT 10-plex reagents were applied for dynamically monitoring the quantitative responses of phosphoproteome stimulated by EGF treatment in HeLa cells. In total, 5 361 unique phosphopeptides were identified, which reached a similar conclusion as others reported. The IBT reagents were therefore experimentally proven as a new type of reagents for isobaric peptides labeling and useful in a large quantity peptides of quantitative proteomics.
Collapse
Affiliation(s)
- Yan Ren
- BGI-Shenzhen , Beishan Industrial Zone 11th Building , Yantian District, Shenzhen , Guangdong 518083 , China
| | - Yanbin He
- BGI-Shenzhen , Beishan Industrial Zone 11th Building , Yantian District, Shenzhen , Guangdong 518083 , China
| | - Zhilong Lin
- BGI-Shenzhen , Beishan Industrial Zone 11th Building , Yantian District, Shenzhen , Guangdong 518083 , China
| | - Jin Zi
- BGI-Shenzhen , Beishan Industrial Zone 11th Building , Yantian District, Shenzhen , Guangdong 518083 , China
| | - Huanming Yang
- BGI-Shenzhen , Beishan Industrial Zone 11th Building , Yantian District, Shenzhen , Guangdong 518083 , China.,James D. Watson Institute of Genome Sciences , Hangzhou 310008 , China
| | - Shenyan Zhang
- BGI-Shenzhen , Beishan Industrial Zone 11th Building , Yantian District, Shenzhen , Guangdong 518083 , China
| | - Xiaomin Lou
- Beijing Institute of Genomics , Chinese Academy of Sciences , Beijing , 100101 , China
| | - Quanhui Wang
- Beijing Protein Innovation , B-8 Beijing Airport Industrial Zone , Beijing , 101318 , China
| | - Shuwei Li
- Institute for Bioscience and Biotechnology Research , University of Maryland College Park , Rockville , Maryland 20850 , United States
| | - Siqi Liu
- BGI-Shenzhen , Beishan Industrial Zone 11th Building , Yantian District, Shenzhen , Guangdong 518083 , China
| |
Collapse
|
24
|
Wu Y, Xiong Q, Li S, Yang X, Ge F. Integrated Proteomic and Transcriptomic Analysis Reveals Long Noncoding RNA HOX Transcript Antisense Intergenic RNA (HOTAIR) Promotes Hepatocellular Carcinoma Cell Proliferation by Regulating Opioid Growth Factor Receptor (OGFr). Mol Cell Proteomics 2018; 17:146-159. [PMID: 29079719 PMCID: PMC5750844 DOI: 10.1074/mcp.ra117.000277] [Citation(s) in RCA: 32] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2017] [Indexed: 01/17/2023] Open
Abstract
Long noncoding RNA HOX transcript antisense RNA (HOTAIR) is involved in human tumorigenesis and is dysregulated in hepatocellular carcinoma (HCC). However, the molecular mechanisms underlying HOTAIR functions in HCC are largely unknown. Here, we employed an integrated transcriptomic and quantitative proteomic analysis to systematically explore the regulatory role of HOTAIR in HCC. A total of 673 transcripts and 293 proteins were found to be dysregulated after HOTAIR inhibition. Bioinformatics studies indicated that differentially expressed genes (DEGs) and differentially expressed proteins (DEPs) are involved in many biological processes, especially cancer-related signaling pathways. A set of DEGs and DEPs were validated by quantitative RT-PCR, Western blot and parallel reaction monitoring (PRM) analysis, respectively. Further functional studies of the opioid growth factor receptor (OGFr), a negative biological regulator of cell proliferation in HCC, revealed that HOTAIR exerts its effects on cell proliferation, at least in part, through the regulation of OGFr expression. By correlating the omics data with functional studies, the current results provide novel insights into the functional mechanisms of HOTAIR in HCC cells.
Collapse
Affiliation(s)
- Ying Wu
- From the ‡Key Laboratory of Algal Biology, Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan 430072, China
- §University of Chinese Academy of Sciences, Beijing 100049, China
| | - Qian Xiong
- From the ‡Key Laboratory of Algal Biology, Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan 430072, China
| | - Siting Li
- From the ‡Key Laboratory of Algal Biology, Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan 430072, China
- §University of Chinese Academy of Sciences, Beijing 100049, China
| | - Xue Yang
- From the ‡Key Laboratory of Algal Biology, Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan 430072, China
- §University of Chinese Academy of Sciences, Beijing 100049, China
| | - Feng Ge
- From the ‡Key Laboratory of Algal Biology, Institute of Hydrobiology, Chinese Academy of Sciences, Wuhan 430072, China;
| |
Collapse
|
25
|
Kiseleva O, Poverennaya E, Shargunov A, Lisitsa A. Proteomic Cinderella: Customized analysis of bulky MS/MS data in one night. J Bioinform Comput Biol 2017; 16:1740011. [PMID: 29216772 DOI: 10.1142/s021972001740011x] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Proteomic challenges, stirred up by the advent of high-throughput technologies, produce large amount of MS data. Nowadays, the routine manual search does not satisfy the "speed" of modern science any longer. In our work, the necessity of single-thread analysis of bulky data emerged during interpretation of HepG2 proteome profiling results for proteoforms searching. We compared the contribution of each of the eight search engines (X!Tandem, MS-GF[Formula: see text], MS Amanda, MyriMatch, Comet, Tide, Andromeda, and OMSSA) integrated in an open-source graphical user interface SearchGUI ( http://searchgui.googlecode.com ) into total result of proteoforms identification and optimized set of engines working simultaneously. We also compared the results of our search combination with Mascot results using protein kit UPS2, containing 48 human proteins. We selected combination of X!Tandem, MS-GF[Formula: see text] and OMMSA as the most time-efficient and productive combination of search. We added homemade java-script to automatize pipeline from file picking to report generation. These settings resulted in rise of the efficiency of our customized pipeline unobtainable by manual scouting: the analysis of 192 files searched against human proteome (42153 entries) downloaded from UniProt took 11[Formula: see text]h.
Collapse
Affiliation(s)
- Olga Kiseleva
- 1 Department of Bioinformatics, Institute of Biomedical Chemistry, 10/8 Pogodinskaya str., Moscow 119121, Russia
| | - Ekaterina Poverennaya
- 1 Department of Bioinformatics, Institute of Biomedical Chemistry, 10/8 Pogodinskaya str., Moscow 119121, Russia
| | - Alexander Shargunov
- 1 Department of Bioinformatics, Institute of Biomedical Chemistry, 10/8 Pogodinskaya str., Moscow 119121, Russia
| | - Andrey Lisitsa
- 1 Department of Bioinformatics, Institute of Biomedical Chemistry, 10/8 Pogodinskaya str., Moscow 119121, Russia
| |
Collapse
|
26
|
Ma C, Xu S, Liu G, Liu X, Xu X, Wen B, Liu S. Improvement of peptide identification with considering the abundance of mRNA and peptide. BMC Bioinformatics 2017; 18:109. [PMID: 28201984 PMCID: PMC5311845 DOI: 10.1186/s12859-017-1491-5] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2016] [Accepted: 01/20/2017] [Indexed: 01/15/2023] Open
Abstract
BACKGROUND Tandem mass spectrometry (MS/MS) followed by database search is a main approach to identify peptides/proteins in proteomic studies. A lot of effort has been devoted to improve the identification accuracy and sensitivity for peptides/proteins, such as developing advanced algorithms and expanding protein databases. RESULTS Herein, we described a new strategy for enhancing the sensitivity of protein/peptide identification through combination of mRNA and peptide abundance in Percolator. In our strategy, a new workflow for peptide identification is established on the basis of the abundance of transcripts and potential novel transcripts derived from RNA-Seq and abundance of peptides towards the same life species. We demonstrate the utility of this strategy by two MS/MS datasets and the results indicate that about 5% ~ 8% improvement of peptide identification can be achieved with 1% FDR in peptide level by integrating the peptide abundance, the transcript abundance and potential novel transcripts from RNA-Seq data. Meanwhile, 181 and 154 novel peptides were identified in the two datasets, respectively. CONCLUSIONS We have demonstrated that this strategy could enable improvement of peptide/protein identification and discovery of novel peptides, as compared with the traditional search methods.
Collapse
Affiliation(s)
| | | | - Geng Liu
- BGI-Shenzhen, Shenzhen, 518083, China
| | - Xin Liu
- BGI-Shenzhen, Shenzhen, 518083, China
| | - Xun Xu
- BGI-Shenzhen, Shenzhen, 518083, China
| | - Bo Wen
- BGI-Shenzhen, Shenzhen, 518083, China.
| | - Siqi Liu
- BGI-Shenzhen, Shenzhen, 518083, China.
| |
Collapse
|
27
|
Wen B, Xu S, Zhou R, Zhang B, Wang X, Liu X, Xu X, Liu S. PGA: an R/Bioconductor package for identification of novel peptides using a customized database derived from RNA-Seq. BMC Bioinformatics 2016; 17:244. [PMID: 27316337 PMCID: PMC4912784 DOI: 10.1186/s12859-016-1133-3] [Citation(s) in RCA: 42] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2015] [Accepted: 06/09/2016] [Indexed: 11/27/2022] Open
Abstract
Background Peptide identification based upon mass spectrometry (MS) is generally achieved by comparison of the experimental mass spectra with the theoretically digested peptides derived from a reference protein database. Obviously, this strategy could not identify peptide and protein sequences that are absent from a reference database. A customized protein database on the basis of RNA-Seq data is thus proposed to assist with and improve the identification of novel peptides. Correspondingly, development of a comprehensive pipeline, which provides an end-to-end solution for novel peptide detection with the customized protein database, is necessary. Results A pipeline with an R package, assigned as a PGA utility, was developed that enables automated treatment to the tandem mass spectrometry (MS/MS) data acquired from different MS platforms and construction of customized protein databases based on RNA-Seq data with or without a reference genome guide. Hence, PGA can identify novel peptides and generate an HTML-based report with a visualized interface. On the basis of a published dataset, PGA was employed to identify peptides, resulting in 636 novel peptides, including 510 single amino acid polymorphism (SAP) peptides, 2 INDEL peptides, 49 splice junction peptides, and 75 novel transcript-derived peptides. The software is freely available from http://bioconductor.org/packages/PGA/, and the example reports are available at http://wenbostar.github.io/PGA/. Conclusions The pipeline of PGA, aimed at being platform-independent and easy-to-use, was successfully developed and shown to be capable of identifying novel peptides by searching the customized protein database derived from RNA-Seq data. Electronic supplementary material The online version of this article (doi:10.1186/s12859-016-1133-3) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Bo Wen
- BGI-Shenzhen, Shenzhen, 518083, China
| | | | - Ruo Zhou
- BGI-Shenzhen, Shenzhen, 518083, China
| | - Bing Zhang
- Department of Biomedical Informatics, Vanderbilt University School of Medicine, Nashville, TN, 37232, USA
| | - Xiaojing Wang
- Department of Biomedical Informatics, Vanderbilt University School of Medicine, Nashville, TN, 37232, USA
| | - Xin Liu
- BGI-Shenzhen, Shenzhen, 518083, China
| | - Xun Xu
- BGI-Shenzhen, Shenzhen, 518083, China
| | - Siqi Liu
- BGI-Shenzhen, Shenzhen, 518083, China. .,Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing, 100101, China.
| |
Collapse
|
28
|
Sanchez-Lucas R, Mehta A, Valledor L, Cabello-Hurtado F, Romero-Rodrıguez MC, Simova-Stoilova L, Demir S, Rodriguez-de-Francisco LE, Maldonado-Alconada AM, Jorrin-Prieto AL, Jorrín-Novo JV. A year (2014-2015) of plants in Proteomics journal. Progress in wet and dry methodologies, moving from protein catalogs, and the view of classic plant biochemists. Proteomics 2016; 16:866-76. [PMID: 26621614 DOI: 10.1002/pmic.201500351] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2015] [Revised: 10/26/2015] [Accepted: 11/04/2015] [Indexed: 12/23/2022]
Abstract
The present review is an update of the previous one published in Proteomics 2015 Reviews special issue [Jorrin-Novo, J. V. et al., Proteomics 2015, 15, 1089-1112] covering the July 2014-2015 period. It has been written on the bases of the publications that appeared in Proteomics journal during that period and the most relevant ones that have been published in other high-impact journals. Methodological advances and the contribution of the field to the knowledge of plant biology processes and its translation to agroforestry and environmental sectors will be discussed. This review has been organized in four blocks, with a starting general introduction (literature survey) followed by sections focusing on the methodology (in vitro, in vivo, wet, and dry), proteomics integration with other approaches (systems biology and proteogenomics), biological information, and knowledge (cell communication, receptors, and signaling), ending with a brief mention of some other biological and translational topics to which proteomics has made some contribution.
Collapse
Affiliation(s)
- Rosa Sanchez-Lucas
- Agroforestry and Plant Biochemistry and Proteomics Research Group, Department of Biochemistry and Molecular Biology, University of Córdoba-CeiA3, Córdoba, Spain
| | - Angela Mehta
- Embrapa Recursos Genéticos e Biotecnologia (CENARGEN), Brasília, DF, Brazil
| | - Luis Valledor
- Department of Biology of Organisms and Systems (BOS), University of Oviedo, Oviedo, Spain
| | | | - M Cristina Romero-Rodrıguez
- Centro Multidisciplinario de Investigaciones Tecnológicas, and Departamento de Fitoquímica, Facultad de Ciencias Químicas, Universidad Nacional de Asunción, San Lorenzo, Paraguay
| | - Lyudmila Simova-Stoilova
- Plant Molecular Biology Department, Institute of Plant Physiology and Genetics, Bulgarian Academy of Sciences, Sofia, Bulgaria
| | - Sekvan Demir
- Agroforestry and Plant Biochemistry and Proteomics Research Group, Department of Biochemistry and Molecular Biology, University of Córdoba-CeiA3, Córdoba, Spain
| | - Luis E Rodriguez-de-Francisco
- Agroforestry and Plant Biochemistry and Proteomics Research Group, Department of Biochemistry and Molecular Biology, University of Córdoba-CeiA3, Córdoba, Spain.,INTEC-Sto. Domingo, Santo Domingo, República Dominicana
| | - Ana M Maldonado-Alconada
- Agroforestry and Plant Biochemistry and Proteomics Research Group, Department of Biochemistry and Molecular Biology, University of Córdoba-CeiA3, Córdoba, Spain
| | - Ana L Jorrin-Prieto
- Agroforestry and Plant Biochemistry and Proteomics Research Group, Department of Biochemistry and Molecular Biology, University of Córdoba-CeiA3, Córdoba, Spain
| | - Jesus V Jorrín-Novo
- Agroforestry and Plant Biochemistry and Proteomics Research Group, Department of Biochemistry and Molecular Biology, University of Córdoba-CeiA3, Córdoba, Spain
| |
Collapse
|
29
|
Kremer LPM, Leufken J, Oyunchimeg P, Schulze S, Fufezan C. Ursgal, Universal Python Module Combining Common Bottom-Up Proteomics Tools for Large-Scale Analysis. J Proteome Res 2016; 15:788-94. [PMID: 26709623 DOI: 10.1021/acs.jproteome.5b00860] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022]
Abstract
Proteomics data integration has become a broad field with a variety of programs offering innovative algorithms to analyze increasing amounts of data. Unfortunately, this software diversity leads to many problems as soon as the data is analyzed using more than one algorithm for the same task. Although it was shown that the combination of multiple peptide identification algorithms yields more robust results, it is only recently that unified approaches are emerging; however, workflows that, for example, aim to optimize search parameters or that employ cascaded style searches can only be made accessible if data analysis becomes not only unified but also and most importantly scriptable. Here we introduce Ursgal, a Python interface to many commonly used bottom-up proteomics tools and to additional auxiliary programs. Complex workflows can thus be composed using the Python scripting language using a few lines of code. Ursgal is easily extensible, and we have made several database search engines (X!Tandem, OMSSA, MS-GF+, Myrimatch, MS Amanda), statistical postprocessing algorithms (qvality, Percolator), and one algorithm that combines statistically postprocessed outputs from multiple search engines ("combined FDR") accessible as an interface in Python. Furthermore, we have implemented a new algorithm ("combined PEP") that combines multiple search engines employing elements of "combined FDR", PeptideShaker, and Bayes' theorem.
Collapse
Affiliation(s)
- Lukas P M Kremer
- Institute of Plant Biology and Biotechnology, University of Muenster , Schlossplatz 8, 48143 Münster, Germany
| | - Johannes Leufken
- Institute of Plant Biology and Biotechnology, University of Muenster , Schlossplatz 8, 48143 Münster, Germany
| | - Purevdulam Oyunchimeg
- Institute of Plant Biology and Biotechnology, University of Muenster , Schlossplatz 8, 48143 Münster, Germany
| | - Stefan Schulze
- Institute of Plant Biology and Biotechnology, University of Muenster , Schlossplatz 8, 48143 Münster, Germany
| | - Christian Fufezan
- Institute of Plant Biology and Biotechnology, University of Muenster , Schlossplatz 8, 48143 Münster, Germany
| |
Collapse
|
30
|
Zhang Y, Li Q, Wu F, Zhou R, Qi Y, Su N, Chen L, Xu S, Jiang T, Zhang C, Cheng G, Chen X, Kong D, Wang Y, Zhang T, Zi J, Wei W, Gao Y, Zhen B, Xiong Z, Wu S, Yang P, Wang Q, Wen B, He F, Xu P, Liu S. Tissue-Based Proteogenomics Reveals that Human Testis Endows Plentiful Missing Proteins. J Proteome Res 2015; 14:3583-94. [DOI: 10.1021/acs.jproteome.5b00435] [Citation(s) in RCA: 43] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Affiliation(s)
- Yao Zhang
- State
Key Laboratory of Proteomics, Beijing Proteome Research Center, National
Engineering Research Center for Protein Drugs, National Center for
Protein Sciences, Beijing Institute of Radiation Medicine, Beijing 102206, China
- Institute of Microbiology, Chinese Academy of Science, Beijing 100101, China
- Graduate University of the Chinese Academy of Sciences, Beijing 100049, China
| | - Qidan Li
- CAS
Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China
- BGI-Shenzhen, Shenzhen 518083, China
- Graduate University of the Chinese Academy of Sciences, Beijing 100049, China
| | - Feilin Wu
- State
Key Laboratory of Proteomics, Beijing Proteome Research Center, National
Engineering Research Center for Protein Drugs, National Center for
Protein Sciences, Beijing Institute of Radiation Medicine, Beijing 102206, China
- Life Science
College, Southwest Forestry University, Kunming 650224, P. R, China
| | - Ruo Zhou
- BGI-Shenzhen, Shenzhen 518083, China
| | - Yingzi Qi
- State
Key Laboratory of Proteomics, Beijing Proteome Research Center, National
Engineering Research Center for Protein Drugs, National Center for
Protein Sciences, Beijing Institute of Radiation Medicine, Beijing 102206, China
| | - Na Su
- State
Key Laboratory of Proteomics, Beijing Proteome Research Center, National
Engineering Research Center for Protein Drugs, National Center for
Protein Sciences, Beijing Institute of Radiation Medicine, Beijing 102206, China
| | - Lingsheng Chen
- State
Key Laboratory of Proteomics, Beijing Proteome Research Center, National
Engineering Research Center for Protein Drugs, National Center for
Protein Sciences, Beijing Institute of Radiation Medicine, Beijing 102206, China
- State
Key Laboratory for Conservation and Utilization of Subtropical Agro-Bioresources, Guangxi University, Nanning 530005, China
| | | | - Tao Jiang
- BGI-Shenzhen, Shenzhen 518083, China
| | - Chengpu Zhang
- State
Key Laboratory of Proteomics, Beijing Proteome Research Center, National
Engineering Research Center for Protein Drugs, National Center for
Protein Sciences, Beijing Institute of Radiation Medicine, Beijing 102206, China
| | | | - Xinguo Chen
- Institute of Organ Transportation, General Hospital of Chinese People’s Armed Police Forces, Beijing 100039, China
| | - Degang Kong
- General
Surgery Dept., Capital Medical University Affiliated Beijing YouAn Hospital, Beijing 100069, China
| | | | - Tao Zhang
- State
Key Laboratory of Proteomics, Beijing Proteome Research Center, National
Engineering Research Center for Protein Drugs, National Center for
Protein Sciences, Beijing Institute of Radiation Medicine, Beijing 102206, China
| | - Jin Zi
- BGI-Shenzhen, Shenzhen 518083, China
| | - Wei Wei
- State
Key Laboratory of Proteomics, Beijing Proteome Research Center, National
Engineering Research Center for Protein Drugs, National Center for
Protein Sciences, Beijing Institute of Radiation Medicine, Beijing 102206, China
| | - Yuan Gao
- State
Key Laboratory of Proteomics, Beijing Proteome Research Center, National
Engineering Research Center for Protein Drugs, National Center for
Protein Sciences, Beijing Institute of Radiation Medicine, Beijing 102206, China
| | - Bei Zhen
- State
Key Laboratory of Proteomics, Beijing Proteome Research Center, National
Engineering Research Center for Protein Drugs, National Center for
Protein Sciences, Beijing Institute of Radiation Medicine, Beijing 102206, China
| | - Zhi Xiong
- Life Science
College, Southwest Forestry University, Kunming 650224, P. R, China
| | - Songfeng Wu
- State
Key Laboratory of Proteomics, Beijing Proteome Research Center, National
Engineering Research Center for Protein Drugs, National Center for
Protein Sciences, Beijing Institute of Radiation Medicine, Beijing 102206, China
| | - Pengyuan Yang
- Institutes
of Biomedical Sciences, Department of Chemistry and Zhongshan Hospital, Fudan University, 130 DongAn Road, Shanghai 200032, China
| | - Quanhui Wang
- CAS
Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China
- BGI-Shenzhen, Shenzhen 518083, China
- Graduate University of the Chinese Academy of Sciences, Beijing 100049, China
| | - Bo Wen
- BGI-Shenzhen, Shenzhen 518083, China
| | - Fuchu He
- State
Key Laboratory of Proteomics, Beijing Proteome Research Center, National
Engineering Research Center for Protein Drugs, National Center for
Protein Sciences, Beijing Institute of Radiation Medicine, Beijing 102206, China
| | - Ping Xu
- State
Key Laboratory of Proteomics, Beijing Proteome Research Center, National
Engineering Research Center for Protein Drugs, National Center for
Protein Sciences, Beijing Institute of Radiation Medicine, Beijing 102206, China
- Key
Laboratory of Combinatorial Biosynthesis and Drug Discovery (Wuhan
University), Ministry of Education, and Wuhan University School of Pharmaceutical Sciences, Wuhan 430071, China
| | - Siqi Liu
- CAS
Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China
- BGI-Shenzhen, Shenzhen 518083, China
- Graduate University of the Chinese Academy of Sciences, Beijing 100049, China
| |
Collapse
|