1
|
Hosseini SM, Nemati S, Karimi-Abdolrezaee S. Astrocytes originated from neural stem cells drive the regenerative remodeling of pathologic CSPGs in spinal cord injury. Stem Cell Reports 2024; 19:1451-1473. [PMID: 39303705 DOI: 10.1016/j.stemcr.2024.08.007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2024] [Revised: 08/20/2024] [Accepted: 08/21/2024] [Indexed: 09/22/2024] Open
Abstract
Neural degeneration is a hallmark of spinal cord injury (SCI). Multipotent neural precursor cells (NPCs) have the potential to reconstruct the damaged neuron-glia network due to their tri-lineage capacity to generate neurons, astrocytes, and oligodendrocytes. However, astrogenesis is the predominant fate of resident or transplanted NPCs in the SCI milieu adding to the abundant number of resident astrocytes in the lesion. How NPC-derived astrocytes respond to the inflammatory milieu of SCI and the mechanisms by which they contribute to the post-injury recovery processes remain largely unknown. Here, we uncover that activated NPC-derived astrocytes exhibit distinct molecular signature that is immune modulatory and foster neurogenesis, neuronal maturity, and synaptogenesis. Mechanistically, NPC-derived astrocytes perform regenerative matrix remodeling by clearing inhibitory chondroitin sulfate proteoglycans (CSPGs) from the injury milieu through LAR and PTP-σ receptor-mediated endocytosis and the production of ADAMTS1 and ADAMTS9, while most resident astrocytes are pro-inflammatory and contribute to the pathologic deposition of CSPGs. These novel findings unravel critical mechanisms of NPC-mediated astrogenesis in SCI repair.
Collapse
Affiliation(s)
- Seyed Mojtaba Hosseini
- Department of Physiology and Pathophysiology, Spinal Cord Research Centre, Rady Faculty of Health Sciences, University of Manitoba, Winnipeg, MB, Canada; Manitoba Multiple Sclerosis Research Center, Winnipeg, MB, Canada
| | - Shiva Nemati
- Department of Physiology and Pathophysiology, Spinal Cord Research Centre, Rady Faculty of Health Sciences, University of Manitoba, Winnipeg, MB, Canada; Manitoba Multiple Sclerosis Research Center, Winnipeg, MB, Canada
| | - Soheila Karimi-Abdolrezaee
- Department of Physiology and Pathophysiology, Spinal Cord Research Centre, Rady Faculty of Health Sciences, University of Manitoba, Winnipeg, MB, Canada; Manitoba Multiple Sclerosis Research Center, Winnipeg, MB, Canada; Children Hospital Research Institute of Manitoba, Winnipeg, MB, Canada.
| |
Collapse
|
2
|
Jiao X, Li X, Zhang N, Zhang W, Yan B, Huang J, Zhao J, Zhang H, Chen W, Fan D. Postmortem Muscle Proteome Characteristics of Silver Carp ( Hypophthalmichthys molitrix): Insights from Full-Length Transcriptome and Deep 4D Label-Free Proteomic. JOURNAL OF AGRICULTURAL AND FOOD CHEMISTRY 2024; 72:1376-1390. [PMID: 38165648 DOI: 10.1021/acs.jafc.3c06902] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/04/2024]
Abstract
The coverage of the protein database directly determines the results of shotgun proteomics. In this study, PacBio single-molecule real-time sequencing technology was performed on postmortem silver carp muscle transcripts. A total of 42.43 Gb clean data, 35,834 nonredundant transcripts, and 15,413 unigenes were obtained. In total, 99.32% of the unigenes were successfully annotated and assigned specific functions. PacBio long-read isoform sequencing (Iso-Seq) analysis can provide more accurate protein information with a higher proportion of complete coding sequences and longer lengths. Subsequently, 2671 proteins were identified in deep 4D proteomics informed by a full-length transcriptomics technique, which has been shown to improve the identification of low-abundance muscle proteins and potential protein isoforms. The feature of the sarcomeric protein profile and information on more than 30 major proteins in the white dorsal muscle of silver carp were reported here for the first time. Overall, this study provides valuable transcriptome data resources and the comprehensive muscle protein information detected to date for further study into the processing characteristic of early postmortem fish muscle, as well as a spectral library for data-independent acquisition and data processing. This batch of muscle-specific dependent acquisition data is available via PRIDE with identifier PXD043702.
Collapse
Affiliation(s)
- Xidong Jiao
- State Key Laboratory of Food Science and Resources, Jiangnan University, Wuxi 214122, China
- School of Food Science and Technology, Jiangnan University, Wuxi 214122, China
| | - Xingying Li
- State Key Laboratory of Food Science and Resources, Jiangnan University, Wuxi 214122, China
- School of Food Science and Technology, Jiangnan University, Wuxi 214122, China
| | - Nana Zhang
- State Key Laboratory of Food Science and Resources, Jiangnan University, Wuxi 214122, China
- Key Laboratory of Refrigeration and Conditioning Aquatic Products Processing, Ministry of Agriculture and Rural Affairs, Xiamen 361022, China
- School of Food Science and Technology, Jiangnan University, Wuxi 214122, China
| | - Wenhai Zhang
- Key Laboratory of Refrigeration and Conditioning Aquatic Products Processing, Ministry of Agriculture and Rural Affairs, Xiamen 361022, China
- Fujian Provincial Key Laboratory of Refrigeration and Conditioning Aquatic Products Processing, Xiamen 361022, China
- Anjoy Foods Group Co., Ltd., Xiamen 361022, China
| | - Bowen Yan
- State Key Laboratory of Food Science and Resources, Jiangnan University, Wuxi 214122, China
- Key Laboratory of Refrigeration and Conditioning Aquatic Products Processing, Ministry of Agriculture and Rural Affairs, Xiamen 361022, China
- School of Food Science and Technology, Jiangnan University, Wuxi 214122, China
| | - Jianlian Huang
- Key Laboratory of Refrigeration and Conditioning Aquatic Products Processing, Ministry of Agriculture and Rural Affairs, Xiamen 361022, China
- Fujian Provincial Key Laboratory of Refrigeration and Conditioning Aquatic Products Processing, Xiamen 361022, China
- Anjoy Foods Group Co., Ltd., Xiamen 361022, China
| | - Jianxin Zhao
- State Key Laboratory of Food Science and Resources, Jiangnan University, Wuxi 214122, China
- School of Food Science and Technology, Jiangnan University, Wuxi 214122, China
| | - Hao Zhang
- State Key Laboratory of Food Science and Resources, Jiangnan University, Wuxi 214122, China
- School of Food Science and Technology, Jiangnan University, Wuxi 214122, China
| | - Wei Chen
- State Key Laboratory of Food Science and Resources, Jiangnan University, Wuxi 214122, China
- School of Food Science and Technology, Jiangnan University, Wuxi 214122, China
| | - Daming Fan
- State Key Laboratory of Food Science and Resources, Jiangnan University, Wuxi 214122, China
- Key Laboratory of Refrigeration and Conditioning Aquatic Products Processing, Ministry of Agriculture and Rural Affairs, Xiamen 361022, China
- School of Food Science and Technology, Jiangnan University, Wuxi 214122, China
| |
Collapse
|
3
|
Mehta S, Bernt M, Chambers M, Fahrner M, Föll MC, Gruening B, Horro C, Johnson JE, Loux V, Rajczewski AT, Schilling O, Vandenbrouck Y, Gustafsson OJR, Thang WCM, Hyde C, Price G, Jagtap PD, Griffin TJ. A Galaxy of informatics resources for MS-based proteomics. Expert Rev Proteomics 2023; 20:251-266. [PMID: 37787106 DOI: 10.1080/14789450.2023.2265062] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2023] [Accepted: 09/06/2023] [Indexed: 10/04/2023]
Abstract
INTRODUCTION Continuous advances in mass spectrometry (MS) technologies have enabled deeper and more reproducible proteome characterization and a better understanding of biological systems when integrated with other 'omics data. Bioinformatic resources meeting the analysis requirements of increasingly complex MS-based proteomic data and associated multi-omic data are critically needed. These requirements included availability of software that would span diverse types of analyses, scalability for large-scale, compute-intensive applications, and mechanisms to ease adoption of the software. AREAS COVERED The Galaxy ecosystem meets these requirements by offering a multitude of open-source tools for MS-based proteomics analyses and applications, all in an adaptable, scalable, and accessible computing environment. A thriving global community maintains these software and associated training resources to empower researcher-driven analyses. EXPERT OPINION The community-supported Galaxy ecosystem remains a crucial contributor to basic biological and clinical studies using MS-based proteomics. In addition to the current status of Galaxy-based resources, we describe ongoing developments for meeting emerging challenges in MS-based proteomic informatics. We hope this review will catalyze increased use of Galaxy by researchers employing MS-based proteomics and inspire software developers to join the community and implement new tools, workflows, and associated training content that will add further value to this already rich ecosystem.
Collapse
Affiliation(s)
- Subina Mehta
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, Minneapolis, MN, USA
| | - Matthias Bernt
- Helmholtz Centre for Environmental Research - UFZ, Department Computational Biology, Leipzig, Germany
| | | | - Matthias Fahrner
- Institute for Surgical Pathology, Medical Center - University of Freiburg, Freiburg, Germany
- German Cancer Consortium (DKTK) and German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Melanie Christine Föll
- Institute for Surgical Pathology, Medical Center - University of Freiburg, Freiburg, Germany
- German Cancer Consortium (DKTK) and German Cancer Research Center (DKFZ), Heidelberg, Germany
- Khoury College of Computer Sciences, Northeastern University, Boston, MA, USA
| | - Bjoern Gruening
- Bioinformatics Group, Department of Computer Science, Albert-Ludwigs-University Freiburg, Freiburg, Germany
| | - Carlos Horro
- Proteomics Unit, Department of Biomedicine, University of Bergen, Bergen, Norway
- Computational Biology Unit, Department of Informatics, University of Bergen, Bergen, Norway
| | - James E Johnson
- Minnesota Supercomputing Institute, University of Minnesota, Minneapolis, MN, USA
| | - Valentin Loux
- Université Paris-Saclay, INRAE, MaIAGE, Jouy-en-Josas, France
- Université Paris-Saclay, INRAE, BioinfOmics, MIGALE bioinformatics facility, Jouy-en-Josas, France
| | - Andrew T Rajczewski
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, Minneapolis, MN, USA
| | - Oliver Schilling
- Institute for Surgical Pathology, Medical Center - University of Freiburg, Freiburg, Germany
- German Cancer Consortium (DKTK) and German Cancer Research Center (DKFZ), Heidelberg, Germany
| | | | | | - W C Mike Thang
- Queensland Cyber Infrastructure Foundation (QCIF), Australia
- Institute of Molecular Bioscience, University of Queensland, St Lucia, Australia
| | - Cameron Hyde
- Queensland Cyber Infrastructure Foundation (QCIF), Australia
- Sippy Downs, University of the Sunshine Coast, Australia
| | - Gareth Price
- Queensland Cyber Infrastructure Foundation (QCIF), Australia
- Institute of Molecular Bioscience, University of Queensland, St Lucia, Australia
| | - Pratik D Jagtap
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, Minneapolis, MN, USA
| | - Timothy J Griffin
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, Minneapolis, MN, USA
| |
Collapse
|
4
|
Bai M, Deng J, Dai C, Pfeuffer J, Sachsenberg T, Perez-Riverol Y. LFQ-Based Peptide and Protein Intensity Differential Expression Analysis. J Proteome Res 2023. [PMID: 37220883 DOI: 10.1021/acs.jproteome.2c00812] [Citation(s) in RCA: 10] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/25/2023]
Abstract
Testing for significant differences in quantities at the protein level is a common goal of many LFQ-based mass spectrometry proteomics experiments. Starting from a table of protein and/or peptide quantities from a given proteomics quantification software, many tools and R packages exist to perform the final tasks of imputation, summarization, normalization, and statistical testing. To evaluate the effects of packages and settings in their substeps on the final list of significant proteins, we studied several packages on three public data sets with known expected protein fold changes. We found that the results between packages and even across different parameters of the same package can vary significantly. In addition to usability aspects and feature/compatibility lists of different packages, this paper highlights sensitivity and specificity trade-offs that come with specific packages and settings.
Collapse
Affiliation(s)
- Mingze Bai
- Chongqing Key Laboratory of Big Data for Bio Intelligence, Chongqing University of Posts and Telecommunications, Chongqing 400065, China
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Beijing Institute of Life Omics, Beijing 102206, China
| | - Jingwen Deng
- Chongqing Key Laboratory of Big Data for Bio Intelligence, Chongqing University of Posts and Telecommunications, Chongqing 400065, China
| | - Chengxin Dai
- Chongqing Key Laboratory of Big Data for Bio Intelligence, Chongqing University of Posts and Telecommunications, Chongqing 400065, China
- State Key Laboratory of Proteomics, Beijing Proteome Research Center, National Center for Protein Sciences (Beijing), Beijing Institute of Life Omics, Beijing 102206, China
| | - Julianus Pfeuffer
- Algorithmic Bioinformatics, Freie Universität Berlin, Berlin 14195, Germany
- Visualization and Data Analysis, Zuse Institute Berlin, Berlin 14195, Germany
| | - Timo Sachsenberg
- Institute for Bioinformatics and Medical Informatics, University of Tübingen, Tübingen 72076, Germany
| | - Yasset Perez-Riverol
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hixton, Cambridge CB10 1SD, United Kingdom
| |
Collapse
|
5
|
Karadağ Gürel A, Gürel S. To detect potential pathways and target genes in infantile Pompe patients using computational analysis. BIOIMPACTS 2022; 12:89-105. [PMID: 35411297 PMCID: PMC8905584 DOI: 10.34172/bi.2022.23467] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/20/2020] [Revised: 02/08/2021] [Accepted: 02/13/2021] [Indexed: 11/21/2022]
Abstract
![]()
Introduction: Pompe disease (PD) is a disease caused by pathogenic variations in the GAA gene known as glycogen storage disease type II, characterized by heart hypertrophy, respiratory failure, and muscle hypotonia, leading to premature death if not treated early. The only treatment option, enzyme replacement therapy (ERT), significantly improves the prognosis for some patients while failing to help others. In this study, the determination of key genes involved in the response to ERT and potential molecular mechanisms were investigated.
Methods: Gene Expression Omnibus (GEO) data, accession number GSE38680, containing samples of biceps and quadriceps muscles was used. Expression array data were analyzed using BRB-Array Tools. Biceps group patients did not receive ERT, while quadriceps received treatment with rhGAA at 0, 12, and 52 weeks. Differentially expressed genes (DEGs) were deeply analyzed by DAVID, GO, KEGG and STRING online analyses, respectively.
Results: A total of 1727 genes in the biceps group and 1198 genes in the quadriceps group are expressed differently. It was observed that DEGs were enriched in the group that responded poorly to ERT in the 52nd week. Genes frequently changed in the weak response group; the expression of 530 genes increased and 1245 genes decreased compared to 0 and 12 weeks. The GO analysis demonstrated that the DEGs were mainly involved in vascular smooth muscle contraction, lysosomes, autophagy, regulation of actin cytoskeleton, inflammatory response, and the WNT signaling pathway. We also discovered that the WNT signaling pathway is highly correlated with DEGs. Several DEGs, such as WNT11, WNT5A, CTNNB1, M6PR, MYL12A, VCL, TLN, FYN, YES1, and BCL2, may be important in elucidating the mechanisms underlying poor response to ERT.
Conclusion: Early diagnosis and treatment of PD are very important for the clinic of the disease. As a result, it suggests that the enriched genes and new pathways emerging as a result of the analysis may help identify the group that responds poorly to treatment and the outcome of the treatment. Obtained genes and pathways in neonatal screening will guide diagnosis and treatment.
Collapse
Affiliation(s)
- Aynur Karadağ Gürel
- Department of Medical Biology, School of Medicine, Usak University, Usak, Turkey
| | - Selçuk Gürel
- Department of Pediatrics, School of Medicine, Bahcesehir University, İstanbul, Turkey
| |
Collapse
|
6
|
Keller C, Wei P, Wancewicz B, Cross TWL, Rey FE, Li L. Extraction optimization for combined metabolomics, peptidomics, and proteomics analysis of gut microbiota samples. JOURNAL OF MASS SPECTROMETRY : JMS 2021; 56:e4625. [PMID: 32885503 PMCID: PMC7855350 DOI: 10.1002/jms.4625] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/28/2020] [Revised: 06/28/2020] [Accepted: 07/11/2020] [Indexed: 05/02/2023]
Abstract
Multiomic studies are increasingly performed to gain a deeper understanding of molecular processes occurring in a biological system, such as the complex microbial communities (i.e., microbiota) that reside the distal gut. While a combination of metabolomics and proteomics is more commonly used, multiomics studies including peptidomcis characterization are less frequently undertaken. Here, we investigated three different extraction methods, chosen for their previous use in extracting metabolites, peptides, and proteins, and compared their ability to perform metabolomic, peptidomic, and proteomic analysis of mouse cecum content. The methanol/chloroform/water extraction performed the best for metabolomic and peptidomic analysis as it detected the largest number of small molecules and identified the largest number of peptides, but the acidified methanol extraction performed best for proteomics analysis as it had the highest number of protein identifications. The methanol/chloroform/water extraction was further analyzed by identifying metabolites with tandem mass spectrometry (MS/MS) analysis and by gene ontology analysis for the peptide and protein results to provide a multiomics analysis of the gut microbiota.
Collapse
Affiliation(s)
- Caitlin Keller
- Department of Chemistry, University of Wisconsin-Madison, Madison WI, 53705
| | - Pingli Wei
- Department of Chemistry, University of Wisconsin-Madison, Madison WI, 53705
| | - Benjamin Wancewicz
- Department of Cell and Regenerative Biology, University of Wisconsin-Madison, Madison WI, 53705
| | - Tzu-Wen L Cross
- Department of Bacteriology, University of Wisconsin-Madison, Madison WI, 53705
| | - Federico E. Rey
- Department of Bacteriology, University of Wisconsin-Madison, Madison WI, 53705
| | - Lingjun Li
- Department of Chemistry, University of Wisconsin-Madison, Madison WI, 53705
- School of Pharmacy, University of Wisconsin-Madison, Madison WI, 53705
| |
Collapse
|
7
|
Khodadadi E, Zeinalzadeh E, Taghizadeh S, Mehramouz B, Kamounah FS, Khodadadi E, Ganbarov K, Yousefi B, Bastami M, Kafil HS. Proteomic Applications in Antimicrobial Resistance and Clinical Microbiology Studies. Infect Drug Resist 2020; 13:1785-1806. [PMID: 32606829 PMCID: PMC7305820 DOI: 10.2147/idr.s238446] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2019] [Accepted: 05/23/2020] [Indexed: 12/11/2022] Open
Abstract
Sequences of the genomes of all-important bacterial pathogens of man, plants, and animals have been completed. Still, it is not enough to achieve complete information of all the mechanisms controlling the biological processes of an organism. Along with all advances in different proteomics technologies, proteomics has completed our knowledge of biological processes all around the world. Proteomics is a valuable technique to explain the complement of proteins in any organism. One of the fields that has been notably benefited from other systems approaches is bacterial pathogenesis. An emerging field is to use proteomics to examine the infectious agents in terms of, among many, the response the host and pathogen to the infection process, which leads to a deeper knowledge of the mechanisms of bacterial virulence. This trend also enables us to identify quantitative measurements for proteins extracted from microorganisms. The present review study is an attempt to summarize a variety of different proteomic techniques and advances. The significant applications in bacterial pathogenesis studies are also covered. Moreover, the areas where proteomics may lead the future studies are introduced.
Collapse
Affiliation(s)
- Ehsaneh Khodadadi
- Drug Applied Research Center, Faculty of Medicine, Tabriz University of Medical Sciences, Tabriz, Iran
| | - Elham Zeinalzadeh
- Student Research Committee, Tabriz University of Medical Sciences, Tabriz, Iran.,Hematology and Oncology Research Center, Tabriz University of Medical Sciences, Tabriz, Iran
| | - Sepehr Taghizadeh
- Drug Applied Research Center, Faculty of Medicine, Tabriz University of Medical Sciences, Tabriz, Iran
| | - Bahareh Mehramouz
- Hematology and Oncology Research Center, Tabriz University of Medical Sciences, Tabriz, Iran
| | - Fadhil S Kamounah
- Department of Chemistry, University of Copenhagen, Copenhagen, DK 2100, Denmark
| | - Ehsan Khodadadi
- Department of Biology, Tabriz Branch, Islamic Azad University, Tabriz, Iran
| | | | - Bahman Yousefi
- Immunology Research Center, Tabriz University of Medical Sciences, Tabriz, Iran
| | - Milad Bastami
- Immunology Research Center, Tabriz University of Medical Sciences, Tabriz, Iran
| | - Hossein Samadi Kafil
- Drug Applied Research Center, Faculty of Medicine, Tabriz University of Medical Sciences, Tabriz, Iran
| |
Collapse
|
8
|
Saha S, Chatzimichali EA, Matthews DA, Bessant C. PITDB: a database of translated genomic elements. Nucleic Acids Res 2019; 46:D1223-D1228. [PMID: 30053269 PMCID: PMC5753392 DOI: 10.1093/nar/gkx906] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2016] [Accepted: 09/28/2017] [Indexed: 12/02/2022] Open
Abstract
PITDB is a freely available database of translated genomic elements (TGEs) that have been observed in PIT (proteomics informed by transcriptomics) experiments. In PIT, a sample is analyzed using both RNA-seq transcriptomics and proteomic mass spectrometry. Transcripts assembled from RNA-seq reads are used to create a library of sample-specific amino acid sequences against which the acquired mass spectra are searched, permitting detection of any TGE, not just those in canonical proteome databases. At the time of writing, PITDB contains over 74 000 distinct TGEs from four species, supported by more than 600 000 peptide spectrum matches. The database, accessible via http://pitdb.org, provides supporting evidence for each TGE, often from multiple experiments and an indication of the confidence in the TGE’s observation and its type, ranging from known protein (exact match to a UniProt protein sequence), through multiple types of protein variant including various splice isoforms, to a putative novel molecule. PITDB’s modern web interface allows TGEs to be viewed individually or by species or experiment, and downloaded for further analysis. PITDB is for bench scientists seeking to share their PIT results, for researchers investigating novel genome products in model organisms and for those wishing to construct proteomes for lesser studied species.
Collapse
Affiliation(s)
- Shyamasree Saha
- School of Biological and Chemical Sciences, Queen Mary University of London, Mile End, London E1 4NS, UK
| | - Eleni A Chatzimichali
- School of Biological and Chemical Sciences, Queen Mary University of London, Mile End, London E1 4NS, UK
| | - David A Matthews
- School of Cellular and Molecular Medicine, University of Bristol, University Walk, Bristol BS8 1TD, UK
| | - Conrad Bessant
- School of Biological and Chemical Sciences, Queen Mary University of London, Mile End, London E1 4NS, UK.,Centre for Computational Biology, Life Science Institute, Queen Mary University of London, Mile End, London E1 4NS, UK
| |
Collapse
|
9
|
Saha S, Matthews DA, Bessant C. High throughput discovery of protein variants using proteomics informed by transcriptomics. Nucleic Acids Res 2019; 46:4893-4902. [PMID: 29718325 PMCID: PMC6007231 DOI: 10.1093/nar/gky295] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2017] [Accepted: 04/11/2018] [Indexed: 11/13/2022] Open
Abstract
Proteomics informed by transcriptomics (PIT), in which proteomic MS/MS spectra are searched against open reading frames derived from de novo assembled transcripts, can reveal previously unknown translated genomic elements (TGEs). However, determining which TGEs are truly novel, which are variants of known proteins, and which are simply artefacts of poor sequence assembly, is challenging. We have designed and implemented an automated solution that classifies putative TGEs by comparing to reference proteome sequences. This allows large-scale identification of sequence polymorphisms, splice isoforms and novel TGEs supported by presence or absence of variant-specific peptide evidence. Unlike previously reported methods, ours does not require a catalogue of known variants, making it more applicable to non-model organisms. The method was validated on human PIT data, then applied to Mus musculus, Pteropus alecto and Aedes aegypti. Novel discoveries included 60 human protein isoforms, 32 392 polymorphisms in P. alecto, and TGEs with non-methionine start sites including tyrosine.
Collapse
Affiliation(s)
- Shyamasree Saha
- School of Biological and Chemical Sciences, Queen Mary University of London, Mile End, London E1 4NS, UK
| | - David A Matthews
- School of Cellular and Molecular Medicine, University of Bristol, University Walk, Bristol BS8 1TD, UK
| | - Conrad Bessant
- School of Biological and Chemical Sciences, Queen Mary University of London, Mile End, London E1 4NS, UK.,Centre for Computational Biology, Life Sciences Initiative, Queen Mary University of London, Mile End, London E1 4NS, UK
| |
Collapse
|
10
|
Guillot L, Delage L, Viari A, Vandenbrouck Y, Com E, Ritter A, Lavigne R, Marie D, Peterlongo P, Potin P, Pineau C. Peptimapper: proteogenomics workflow for the expert annotation of eukaryotic genomes. BMC Genomics 2019; 20:56. [PMID: 30654742 PMCID: PMC6337836 DOI: 10.1186/s12864-019-5431-9] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2018] [Accepted: 01/03/2019] [Indexed: 01/02/2023] Open
Abstract
Background Accurate structural annotation of genomes is still a challenge, despite the progress made over the past decade. The prediction of gene structure remains difficult, especially for eukaryotic species, and is often erroneous and incomplete. We used a proteogenomics strategy, taking advantage of the combination of proteomics datasets and bioinformatics tools, to identify novel protein coding-genes and splice isoforms, assign correct start sites, and validate predicted exons and genes. Results Our proteogenomics workflow, Peptimapper, was applied to the genome annotation of Ectocarpus sp., a key reference genome for both the brown algal lineage and stramenopiles. We generated proteomics data from various life cycle stages of Ectocarpus sp. strains and sub-cellular fractions using a shotgun approach. First, we directly generated peptide sequence tags (PSTs) from the proteomics data. Second, we mapped PSTs onto the translated genomic sequence. Closely located hits (i.e., PSTs locations on the genome) were then clustered to detect potential coding regions based on parameters optimized for the organism. Third, we evaluated each cluster and compared it to gene predictions from existing conventional genome annotation approaches. Finally, we integrated cluster locations into GFF files to use a genome viewer. We identified two potential novel genes, a ribosomal protein L22 and an aryl sulfotransferase and corrected the gene structure of a dihydrolipoamide acetyltransferase. We experimentally validated the results by RT-PCR and using transcriptomics data. Conclusions Peptimapper is a complementary tool for the expert annotation of genomes. It is suitable for any organism and is distributed through a Docker image available on two public bioinformatics docker repositories: Docker Hub and BioShaDock. This workflow is also accessible through the Galaxy framework and for use by non-computer scientists at https://galaxy.protim.eu. Data are available via ProteomeXchange under identifier PXD010618. Electronic supplementary material The online version of this article (10.1186/s12864-019-5431-9) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Laetitia Guillot
- Univ Rennes, Inserm, EHESP, Irset (Institut de recherche en santé, environnement et travail) - UMR_S 1085, F-35042, Rennes cedex, France.,Protim, Univ Rennes, F-35042, Rennes cedex, France
| | - Ludovic Delage
- Sorbonne Université, UPMC, CNRS, UMR 8227, Integrative Biology of Marine Models, Biological Station, CS 90074, F-29688, Roscoff, France
| | - Alain Viari
- INRIA Grenoble-Rhône-Alpes, F-38330, Montbonnot-Saint-Martin, France
| | - Yves Vandenbrouck
- University Grenoble Alpes, CEA, Inserm, BIG-BGE, 38000, Grenoble, France
| | - Emmanuelle Com
- Univ Rennes, Inserm, EHESP, Irset (Institut de recherche en santé, environnement et travail) - UMR_S 1085, F-35042, Rennes cedex, France.,Protim, Univ Rennes, F-35042, Rennes cedex, France
| | - Andrés Ritter
- Sorbonne Université, UPMC, CNRS, UMR 8227, Integrative Biology of Marine Models, Biological Station, CS 90074, F-29688, Roscoff, France.,Present address: Sorbonne Université, CNRS, Institut de Biologie Paris-Seine, Laboratory of Computational and Quantitative Biology, F-75005, Paris, France
| | - Régis Lavigne
- Univ Rennes, Inserm, EHESP, Irset (Institut de recherche en santé, environnement et travail) - UMR_S 1085, F-35042, Rennes cedex, France.,Protim, Univ Rennes, F-35042, Rennes cedex, France
| | - Dominique Marie
- Sorbonne Université, UPMC, CNRS, UMR 8227, Integrative Biology of Marine Models, Biological Station, CS 90074, F-29688, Roscoff, France
| | | | - Philippe Potin
- Sorbonne Université, UPMC, CNRS, UMR 8227, Integrative Biology of Marine Models, Biological Station, CS 90074, F-29688, Roscoff, France
| | - Charles Pineau
- Univ Rennes, Inserm, EHESP, Irset (Institut de recherche en santé, environnement et travail) - UMR_S 1085, F-35042, Rennes cedex, France. .,Protim, Univ Rennes, F-35042, Rennes cedex, France.
| |
Collapse
|
11
|
Sajulga R, Mehta S, Kumar P, Johnson JE, Guerrero CR, Ryan MC, Karchin R, Jagtap PD, Griffin TJ. Bridging the Chromosome-centric and Biology/Disease-driven Human Proteome Projects: Accessible and Automated Tools for Interpreting the Biological and Pathological Impact of Protein Sequence Variants Detected via Proteogenomics. J Proteome Res 2018; 17:4329-4336. [DOI: 10.1021/acs.jproteome.8b00404] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023]
Affiliation(s)
- Ray Sajulga
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, Minneapolis, Minnesota 55455, United States
| | - Subina Mehta
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, Minneapolis, Minnesota 55455, United States
| | - Praveen Kumar
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, Minneapolis, Minnesota 55455, United States
- Bioinformatics and Computational Biology Program, University of Minnesota-Rochester, Rochester, Minnesota 55904, United States
| | - James E. Johnson
- Minnesota Supercomputing Institute, University of Minnesota, Minneapolis, Minnesota 55455, United States
| | - Candace R. Guerrero
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, Minneapolis, Minnesota 55455, United States
| | - Michael C. Ryan
- In-Silico Solutions, Falls Church, Virginia 22043, United States
| | - Rachel Karchin
- Department of Biomedical Engineering, The Johns Hopkins University, Baltimore, Maryland 21218, United States
- The Institute for Computational Medicine, The Johns Hopkins University, Baltimore, Maryland 21218, United States
- Department of Oncology, The Johns Hopkins University School of Medicine, Baltimore, Maryland 21217, United States
| | - Pratik D. Jagtap
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, Minneapolis, Minnesota 55455, United States
| | - Timothy J. Griffin
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, Minneapolis, Minnesota 55455, United States
| |
Collapse
|
12
|
Misra BB, Langefeld CD, Olivier M, Cox LA. Integrated Omics: Tools, Advances, and Future Approaches. J Mol Endocrinol 2018; 62:JME-18-0055. [PMID: 30006342 DOI: 10.1530/jme-18-0055] [Citation(s) in RCA: 220] [Impact Index Per Article: 36.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 02/24/2018] [Revised: 07/02/2018] [Accepted: 07/12/2018] [Indexed: 12/13/2022]
Abstract
With the rapid adoption of high-throughput omic approaches to analyze biological samples such as genomics, transcriptomics, proteomics, and metabolomics, each analysis can generate tera- to peta-byte sized data files on a daily basis. These data file sizes, together with differences in nomenclature among these data types, make the integration of these multi-dimensional omics data into biologically meaningful context challenging. Variously named as integrated omics, multi-omics, poly-omics, trans-omics, pan-omics, or shortened to just 'omics', the challenges include differences in data cleaning, normalization, biomolecule identification, data dimensionality reduction, biological contextualization, statistical validation, data storage and handling, sharing, and data archiving. The ultimate goal is towards the holistic realization of a 'systems biology' understanding of the biological question in hand. Commonly used approaches in these efforts are currently limited by the 3 i's - integration, interpretation, and insights. Post integration, these very large datasets aim to yield unprecedented views of cellular systems at exquisite resolution for transformative insights into processes, events, and diseases through various computational and informatics frameworks. With the continued reduction in costs and processing time for sample analyses, and increasing types of omics datasets generated such as glycomics, lipidomics, microbiomics, and phenomics, an increasing number of scientists in this interdisciplinary domain of bioinformatics face these challenges. We discuss recent approaches, existing tools, and potential caveats in the integration of omics datasets for development of standardized analytical pipelines that could be adopted by the global omics research community.
Collapse
Affiliation(s)
- Biswapriya B Misra
- B Misra, Internal Medicine, Wake Forest University School of Medicine, Winston-Salem, United States
| | - Carl D Langefeld
- C Langefeld, Biostatistical Sciences, Wake Forest University School of Medicine, Winston-Salem, United States
| | - Michael Olivier
- M Olivier, Internal Medicine, Wake Forest University School of Medicine, Winston-Salem, United States
| | - Laura A Cox
- L Cox, Internal Medicine, Wake Forest University School of Medicine, Winston-Salem, United States
| |
Collapse
|
13
|
Guitton Y, Tremblay-Franco M, Le Corguillé G, Martin JF, Pétéra M, Roger-Mele P, Delabrière A, Goulitquer S, Monsoor M, Duperier C, Canlet C, Servien R, Tardivel P, Caron C, Giacomoni F, Thévenot EA. Create, run, share, publish, and reference your LC–MS, FIA–MS, GC–MS, and NMR data analysis workflows with the Workflow4Metabolomics 3.0 Galaxy online infrastructure for metabolomics. Int J Biochem Cell Biol 2017; 93:89-101. [DOI: 10.1016/j.biocel.2017.07.002] [Citation(s) in RCA: 65] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2017] [Revised: 06/14/2017] [Accepted: 07/10/2017] [Indexed: 12/11/2022]
|
14
|
Chambers MC, Jagtap PD, Johnson JE, McGowan T, Kumar P, Onsongo G, Guerrero CR, Barsnes H, Vaudel M, Martens L, Grüning B, Cooke IR, Heydarian M, Reddy KL, Griffin TJ. An Accessible Proteogenomics Informatics Resource for Cancer Researchers. Cancer Res 2017; 77:e43-e46. [PMID: 29092937 DOI: 10.1158/0008-5472.can-17-0331] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2017] [Revised: 04/07/2017] [Accepted: 06/30/2017] [Indexed: 11/16/2022]
Abstract
Proteogenomics has emerged as a valuable approach in cancer research, which integrates genomic and transcriptomic data with mass spectrometry-based proteomics data to directly identify expressed, variant protein sequences that may have functional roles in cancer. This approach is computationally intensive, requiring integration of disparate software tools into sophisticated workflows, challenging its adoption by nonexpert, bench scientists. To address this need, we have developed an extensible, Galaxy-based resource aimed at providing more researchers access to, and training in, proteogenomic informatics. Our resource brings together software from several leading research groups to address two foundational aspects of proteogenomics: (i) generation of customized, annotated protein sequence databases from RNA-Seq data; and (ii) accurate matching of tandem mass spectrometry data to putative variants, followed by filtering to confirm their novelty. Directions for accessing software tools and workflows, along with instructional documentation, can be found at z.umn.edu/canresgithub. Cancer Res; 77(21); e43-46. ©2017 AACR.
Collapse
Affiliation(s)
| | - Pratik D Jagtap
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, Minneapolis, Minnesota
| | - James E Johnson
- Minnesota Supercomputing Institute, University of Minnesota, Minneapolis, Minnesota
| | - Thomas McGowan
- Minnesota Supercomputing Institute, University of Minnesota, Minneapolis, Minnesota
| | - Praveen Kumar
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, Minneapolis, Minnesota.,Bioinformatics and Computational Biology Program, University of Minnesota-Rochester, Rochester, Minnesota
| | - Getiria Onsongo
- Minnesota Supercomputing Institute, University of Minnesota, Minneapolis, Minnesota
| | - Candace R Guerrero
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, Minneapolis, Minnesota
| | - Harald Barsnes
- Proteomics Unit, Department of Biomedicine, University of Bergen, Bergen, Norway.,Computational Biology Unit, Department of Informatics, University of Bergen, Bergen, Norway
| | - Marc Vaudel
- KG Jebsen Center for Diabetes Research, Department of Clinical Science, University of Bergen, Bergen, Norway.,Center for Medical Genetics and Molecular Medicine, Haukeland University Hospital, Bergen, Norway
| | - Lennart Martens
- VIB-UGent Center for Medical Biotechnology, VIB, Ghent, Belgium.,Department of Biochemistry, Ghent University, Ghent, Belgium.,Bioinformatics Institute Ghent, Ghent University, Ghent, Belgium
| | - Björn Grüning
- Department of Computer Science, Albert-Ludwigs-University, Freiburg, Freiburg, Germany.,Center for Biological Systems Analysis (ZBSA), University of Freiburg, Freiburg, Germany
| | - Ira R Cooke
- Comparative Genomics Centre and Department of Molecular and Cell Biology, James Cook University, Queensland, Australia
| | | | - Karen L Reddy
- Department of Biological Chemistry, Center for Epigenetics and Sidney Kimmel Comprehensive Cancer Center, Johns Hopkins University, Baltimore, Maryland
| | - Timothy J Griffin
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, Minneapolis, Minnesota.
| |
Collapse
|
15
|
Starr AE, Deeke SA, Li L, Zhang X, Daoud R, Ryan J, Ning Z, Cheng K, Nguyen LVH, Abou-Samra E, Lavallée-Adam M, Figeys D. Proteomic and Metaproteomic Approaches to Understand Host–Microbe Interactions. Anal Chem 2017; 90:86-109. [DOI: 10.1021/acs.analchem.7b04340] [Citation(s) in RCA: 30] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Affiliation(s)
- Amanda E. Starr
- Ottawa Institute of Systems Biology and Department of Biochemistry, Microbiology and Immunology, University of Ottawa, Ottawa, Ontario, K1N 6N5, Canada
| | - Shelley A. Deeke
- Ottawa Institute of Systems Biology and Department of Biochemistry, Microbiology and Immunology, University of Ottawa, Ottawa, Ontario, K1N 6N5, Canada
| | - Leyuan Li
- Ottawa Institute of Systems Biology and Department of Biochemistry, Microbiology and Immunology, University of Ottawa, Ottawa, Ontario, K1N 6N5, Canada
| | - Xu Zhang
- Ottawa Institute of Systems Biology and Department of Biochemistry, Microbiology and Immunology, University of Ottawa, Ottawa, Ontario, K1N 6N5, Canada
| | - Rachid Daoud
- Ottawa Institute of Systems Biology and Department of Biochemistry, Microbiology and Immunology, University of Ottawa, Ottawa, Ontario, K1N 6N5, Canada
| | - James Ryan
- Ottawa Institute of Systems Biology and Department of Biochemistry, Microbiology and Immunology, University of Ottawa, Ottawa, Ontario, K1N 6N5, Canada
| | - Zhibin Ning
- Ottawa Institute of Systems Biology and Department of Biochemistry, Microbiology and Immunology, University of Ottawa, Ottawa, Ontario, K1N 6N5, Canada
| | - Kai Cheng
- Ottawa Institute of Systems Biology and Department of Biochemistry, Microbiology and Immunology, University of Ottawa, Ottawa, Ontario, K1N 6N5, Canada
| | - Linh V. H. Nguyen
- Ottawa Institute of Systems Biology and Department of Biochemistry, Microbiology and Immunology, University of Ottawa, Ottawa, Ontario, K1N 6N5, Canada
| | - Elias Abou-Samra
- Ottawa Institute of Systems Biology and Department of Biochemistry, Microbiology and Immunology, University of Ottawa, Ottawa, Ontario, K1N 6N5, Canada
| | - Mathieu Lavallée-Adam
- Ottawa Institute of Systems Biology and Department of Biochemistry, Microbiology and Immunology, University of Ottawa, Ottawa, Ontario, K1N 6N5, Canada
| | - Daniel Figeys
- Ottawa Institute of Systems Biology and Department of Biochemistry, Microbiology and Immunology, University of Ottawa, Ottawa, Ontario, K1N 6N5, Canada
- Department of Chemistry and Biomolecular Sciences, University of Ottawa, Ottawa, Ontario, K1N 6N5, Canada
- Molecular Architecture of Life Program, Canadian Institute for Advanced Research, Toronto, Ontario, M5G 1M1, Canada
| |
Collapse
|
16
|
Davidson AD, Matthews DA, Maringer K. Proteomics technique opens new frontiers in mobilome research. Mob Genet Elements 2017; 7:1-9. [PMID: 28932623 PMCID: PMC5599074 DOI: 10.1080/2159256x.2017.1362494] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2017] [Revised: 07/25/2017] [Accepted: 07/28/2017] [Indexed: 12/22/2022] Open
Abstract
A large proportion of the genome of most eukaryotic organisms consists of highly repetitive mobile genetic elements. The sum of these elements is called the "mobilome," which in eukaryotes is made up mostly of transposons. Transposable elements contribute to disease, evolution, and normal physiology by mediating genetic rearrangement, and through the "domestication" of transposon proteins for cellular functions. Although 'omics studies of mobilome genomes and transcriptomes are common, technical challenges have hampered high-throughput global proteomics analyses of transposons. In a recent paper, we overcame these technical hurdles using a technique called "proteomics informed by transcriptomics" (PIT), and thus published the first unbiased global mobilome-derived proteome for any organism (using cell lines derived from the mosquito Aedes aegypti). In this commentary, we describe our methods in more detail, and summarise our major findings. We also use new genome sequencing data to show that, in many cases, the specific genomic element expressing a given protein can be identified using PIT. This proteomic technique therefore represents an important technological advance that will open new avenues of research into the role that proteins derived from transposons and other repetitive and sequence diverse genetic elements, such as endogenous retroviruses, play in health and disease.
Collapse
Affiliation(s)
- Andrew D. Davidson
- School of Cellular and Molecular Medicine, University of Bristol, Bristol, UK
| | - David A. Matthews
- School of Cellular and Molecular Medicine, University of Bristol, Bristol, UK
| | - Kevin Maringer
- Department of Microbial Sciences, Faculty of Health and Medical Sciences, University of Surrey, Guildford, UK
| |
Collapse
|
17
|
Ruggles KV, Krug K, Wang X, Clauser KR, Wang J, Payne SH, Fenyö D, Zhang B, Mani DR. Methods, Tools and Current Perspectives in Proteogenomics. Mol Cell Proteomics 2017; 16:959-981. [PMID: 28456751 DOI: 10.1074/mcp.mr117.000024] [Citation(s) in RCA: 95] [Impact Index Per Article: 13.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2017] [Indexed: 12/20/2022] Open
Abstract
With combined technological advancements in high-throughput next-generation sequencing and deep mass spectrometry-based proteomics, proteogenomics, i.e. the integrative analysis of proteomic and genomic data, has emerged as a new research field. Early efforts in the field were focused on improving protein identification using sample-specific genomic and transcriptomic sequencing data. More recently, integrative analysis of quantitative measurements from genomic and proteomic studies have identified novel insights into gene expression regulation, cell signaling, and disease. Many methods and tools have been developed or adapted to enable an array of integrative proteogenomic approaches and in this article, we systematically classify published methods and tools into four major categories, (1) Sequence-centric proteogenomics; (2) Analysis of proteogenomic relationships; (3) Integrative modeling of proteogenomic data; and (4) Data sharing and visualization. We provide a comprehensive review of methods and available tools in each category and highlight their typical applications.
Collapse
Affiliation(s)
- Kelly V Ruggles
- From the ‡Department of Medicine, New York University School of Medicine, New York, New York 10016
| | - Karsten Krug
- §The Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142
| | - Xiaojing Wang
- ¶Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, Texas 77030.,‖Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas 77030
| | - Karl R Clauser
- §The Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142
| | - Jing Wang
- ¶Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, Texas 77030.,‖Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas 77030
| | - Samuel H Payne
- **Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington 99354
| | - David Fenyö
- ‡‡Department of Biochemistry and Molecular Pharmacology, New York University School of Medicine, New York, New York 10016; .,§§Institute for Systems Genetics, New York University School of Medicine, New York, New York 10016
| | - Bing Zhang
- ¶Lester and Sue Smith Breast Center, Baylor College of Medicine, Houston, Texas 77030; .,‖Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas 77030
| | - D R Mani
- §The Broad Institute of MIT and Harvard, Cambridge, Massachusetts 02142;
| |
Collapse
|
18
|
Jean Beltran PM, Federspiel JD, Sheng X, Cristea IM. Proteomics and integrative omic approaches for understanding host-pathogen interactions and infectious diseases. Mol Syst Biol 2017; 13:922. [PMID: 28348067 PMCID: PMC5371729 DOI: 10.15252/msb.20167062] [Citation(s) in RCA: 127] [Impact Index Per Article: 18.1] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
Organisms are constantly exposed to microbial pathogens in their environments. When a pathogen meets its host, a series of intricate intracellular interactions shape the outcome of the infection. The understanding of these host–pathogen interactions is crucial for the development of treatments and preventive measures against infectious diseases. Over the past decade, proteomic approaches have become prime contributors to the discovery and understanding of host–pathogen interactions that represent anti‐ and pro‐pathogenic cellular responses. Here, we review these proteomic methods and their application to studying viral and bacterial intracellular pathogens. We examine approaches for defining spatial and temporal host–pathogen protein interactions upon infection of a host cell. Further expanding the understanding of proteome organization during an infection, we discuss methods that characterize the regulation of host and pathogen proteomes through alterations in protein abundance, localization, and post‐translational modifications. Finally, we highlight bioinformatic tools available for analyzing such proteomic datasets, as well as novel strategies for integrating proteomics with other omic tools, such as genomics, transcriptomics, and metabolomics, to obtain a systems‐level understanding of infectious diseases.
Collapse
Affiliation(s)
- Pierre M Jean Beltran
- Department of Molecular Biology, Lewis Thomas Laboratory, Princeton University, Princeton, NJ, USA
| | - Joel D Federspiel
- Department of Molecular Biology, Lewis Thomas Laboratory, Princeton University, Princeton, NJ, USA
| | - Xinlei Sheng
- Department of Molecular Biology, Lewis Thomas Laboratory, Princeton University, Princeton, NJ, USA
| | - Ileana M Cristea
- Department of Molecular Biology, Lewis Thomas Laboratory, Princeton University, Princeton, NJ, USA
| |
Collapse
|
19
|
Maringer K, Yousuf A, Heesom KJ, Fan J, Lee D, Fernandez-Sesma A, Bessant C, Matthews DA, Davidson AD. Proteomics informed by transcriptomics for characterising active transposable elements and genome annotation in Aedes aegypti. BMC Genomics 2017; 18:101. [PMID: 28103802 PMCID: PMC5248466 DOI: 10.1186/s12864-016-3432-5] [Citation(s) in RCA: 33] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2016] [Accepted: 12/19/2016] [Indexed: 12/21/2022] Open
Abstract
BACKGROUND Aedes aegypti is a vector for the (re-)emerging human pathogens dengue, chikungunya, yellow fever and Zika viruses. Almost half of the Ae. aegypti genome is comprised of transposable elements (TEs). Transposons have been linked to diverse cellular processes, including the establishment of viral persistence in insects, an essential step in the transmission of vector-borne viruses. However, up until now it has not been possible to study the overall proteome derived from an organism's mobile genetic elements, partly due to the highly divergent nature of TEs. Furthermore, as for many non-model organisms, incomplete genome annotation has hampered proteomic studies on Ae. aegypti. RESULTS We analysed the Ae. aegypti proteome using our new proteomics informed by transcriptomics (PIT) technique, which bypasses the need for genome annotation by identifying proteins through matched transcriptomic (rather than genomic) data. Our data vastly increase the number of experimentally confirmed Ae. aegypti proteins. The PIT analysis also identified hotspots of incomplete genome annotation, and showed that poor sequence and assembly quality do not explain all annotation gaps. Finally, in a proof-of-principle study, we developed criteria for the characterisation of proteomically active TEs. Protein expression did not correlate with a TE's genomic abundance at different levels of classification. Most notably, long terminal repeat (LTR) retrotransposons were markedly enriched compared to other elements. PIT was superior to 'conventional' proteomic approaches in both our transposon and genome annotation analyses. CONCLUSIONS We present the first proteomic characterisation of an organism's repertoire of mobile genetic elements, which will open new avenues of research into the function of transposon proteins in health and disease. Furthermore, our study provides a proof-of-concept that PIT can be used to evaluate a genome's annotation to guide annotation efforts which has the potential to improve the efficiency of annotation projects in non-model organisms. PIT therefore represents a valuable new tool to study the biology of the important vector species Ae. aegypti, including its role in transmitting emerging viruses of global public health concern.
Collapse
Affiliation(s)
- Kevin Maringer
- School of Cellular and Molecular Medicine, University of Bristol, Bristol, BS8 1TD, UK.
- Department of Microbiology, Icahn School of Medicine at Mount Sinai, New York, 10029, NY, USA.
- Present address: Department of Microbial Sciences, University of Surrey, Guildford, GU2 7XH, UK.
| | - Amjad Yousuf
- School of Cellular and Molecular Medicine, University of Bristol, Bristol, BS8 1TD, UK
- College of Applied Medical Sciences, Taibah University, Medina, Kingdom of Saudi Arabia
| | - Kate J Heesom
- School of Biochemistry, University of Bristol, Bristol, BS8 1TD, UK
| | - Jun Fan
- School of Biological and Chemical Sciences, Queen Mary University of London, London, E1 4NS, UK
| | - David Lee
- School of Cellular and Molecular Medicine, University of Bristol, Bristol, BS8 1TD, UK
| | - Ana Fernandez-Sesma
- Department of Microbiology, Icahn School of Medicine at Mount Sinai, New York, 10029, NY, USA
| | - Conrad Bessant
- School of Biological and Chemical Sciences, Queen Mary University of London, London, E1 4NS, UK
| | - David A Matthews
- School of Cellular and Molecular Medicine, University of Bristol, Bristol, BS8 1TD, UK
| | - Andrew D Davidson
- School of Cellular and Molecular Medicine, University of Bristol, Bristol, BS8 1TD, UK.
| |
Collapse
|
20
|
Maringer K, Yousuf A, Heesom KJ, Fan J, Lee D, Fernandez-Sesma A, Bessant C, Matthews DA, Davidson AD. Proteomics informed by transcriptomics for characterising active transposable elements and genome annotation in Aedes aegypti. BMC Genomics 2017. [PMID: 28103802 DOI: 10.1186/s12864-016-3432-5+10.1186/s12864-016-3432-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Aedes aegypti is a vector for the (re-)emerging human pathogens dengue, chikungunya, yellow fever and Zika viruses. Almost half of the Ae. aegypti genome is comprised of transposable elements (TEs). Transposons have been linked to diverse cellular processes, including the establishment of viral persistence in insects, an essential step in the transmission of vector-borne viruses. However, up until now it has not been possible to study the overall proteome derived from an organism's mobile genetic elements, partly due to the highly divergent nature of TEs. Furthermore, as for many non-model organisms, incomplete genome annotation has hampered proteomic studies on Ae. aegypti. RESULTS We analysed the Ae. aegypti proteome using our new proteomics informed by transcriptomics (PIT) technique, which bypasses the need for genome annotation by identifying proteins through matched transcriptomic (rather than genomic) data. Our data vastly increase the number of experimentally confirmed Ae. aegypti proteins. The PIT analysis also identified hotspots of incomplete genome annotation, and showed that poor sequence and assembly quality do not explain all annotation gaps. Finally, in a proof-of-principle study, we developed criteria for the characterisation of proteomically active TEs. Protein expression did not correlate with a TE's genomic abundance at different levels of classification. Most notably, long terminal repeat (LTR) retrotransposons were markedly enriched compared to other elements. PIT was superior to 'conventional' proteomic approaches in both our transposon and genome annotation analyses. CONCLUSIONS We present the first proteomic characterisation of an organism's repertoire of mobile genetic elements, which will open new avenues of research into the function of transposon proteins in health and disease. Furthermore, our study provides a proof-of-concept that PIT can be used to evaluate a genome's annotation to guide annotation efforts which has the potential to improve the efficiency of annotation projects in non-model organisms. PIT therefore represents a valuable new tool to study the biology of the important vector species Ae. aegypti, including its role in transmitting emerging viruses of global public health concern.
Collapse
Affiliation(s)
- Kevin Maringer
- School of Cellular and Molecular Medicine, University of Bristol, Bristol, BS8 1TD, UK. .,Department of Microbiology, Icahn School of Medicine at Mount Sinai, New York, 10029, NY, USA. .,Present address: Department of Microbial Sciences, University of Surrey, Guildford, GU2 7XH, UK.
| | - Amjad Yousuf
- School of Cellular and Molecular Medicine, University of Bristol, Bristol, BS8 1TD, UK.,College of Applied Medical Sciences, Taibah University, Medina, Kingdom of Saudi Arabia
| | - Kate J Heesom
- School of Biochemistry, University of Bristol, Bristol, BS8 1TD, UK
| | - Jun Fan
- School of Biological and Chemical Sciences, Queen Mary University of London, London, E1 4NS, UK
| | - David Lee
- School of Cellular and Molecular Medicine, University of Bristol, Bristol, BS8 1TD, UK
| | - Ana Fernandez-Sesma
- Department of Microbiology, Icahn School of Medicine at Mount Sinai, New York, 10029, NY, USA
| | - Conrad Bessant
- School of Biological and Chemical Sciences, Queen Mary University of London, London, E1 4NS, UK
| | - David A Matthews
- School of Cellular and Molecular Medicine, University of Bristol, Bristol, BS8 1TD, UK
| | - Andrew D Davidson
- School of Cellular and Molecular Medicine, University of Bristol, Bristol, BS8 1TD, UK.
| |
Collapse
|
21
|
Maringer K, Yousuf A, Heesom KJ, Fan J, Lee D, Fernandez-Sesma A, Bessant C, Matthews DA, Davidson AD. Proteomics informed by transcriptomics for characterising active transposable elements and genome annotation in Aedes aegypti. BMC Genomics 2017. [PMID: 28103802 DOI: 10.1186/s12864-016-3432-5 10.1186/s12864-016-3432-5] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open
Abstract
BACKGROUND Aedes aegypti is a vector for the (re-)emerging human pathogens dengue, chikungunya, yellow fever and Zika viruses. Almost half of the Ae. aegypti genome is comprised of transposable elements (TEs). Transposons have been linked to diverse cellular processes, including the establishment of viral persistence in insects, an essential step in the transmission of vector-borne viruses. However, up until now it has not been possible to study the overall proteome derived from an organism's mobile genetic elements, partly due to the highly divergent nature of TEs. Furthermore, as for many non-model organisms, incomplete genome annotation has hampered proteomic studies on Ae. aegypti. RESULTS We analysed the Ae. aegypti proteome using our new proteomics informed by transcriptomics (PIT) technique, which bypasses the need for genome annotation by identifying proteins through matched transcriptomic (rather than genomic) data. Our data vastly increase the number of experimentally confirmed Ae. aegypti proteins. The PIT analysis also identified hotspots of incomplete genome annotation, and showed that poor sequence and assembly quality do not explain all annotation gaps. Finally, in a proof-of-principle study, we developed criteria for the characterisation of proteomically active TEs. Protein expression did not correlate with a TE's genomic abundance at different levels of classification. Most notably, long terminal repeat (LTR) retrotransposons were markedly enriched compared to other elements. PIT was superior to 'conventional' proteomic approaches in both our transposon and genome annotation analyses. CONCLUSIONS We present the first proteomic characterisation of an organism's repertoire of mobile genetic elements, which will open new avenues of research into the function of transposon proteins in health and disease. Furthermore, our study provides a proof-of-concept that PIT can be used to evaluate a genome's annotation to guide annotation efforts which has the potential to improve the efficiency of annotation projects in non-model organisms. PIT therefore represents a valuable new tool to study the biology of the important vector species Ae. aegypti, including its role in transmitting emerging viruses of global public health concern.
Collapse
Affiliation(s)
- Kevin Maringer
- School of Cellular and Molecular Medicine, University of Bristol, Bristol, BS8 1TD, UK. .,Department of Microbiology, Icahn School of Medicine at Mount Sinai, New York, 10029, NY, USA. .,Present address: Department of Microbial Sciences, University of Surrey, Guildford, GU2 7XH, UK.
| | - Amjad Yousuf
- School of Cellular and Molecular Medicine, University of Bristol, Bristol, BS8 1TD, UK.,College of Applied Medical Sciences, Taibah University, Medina, Kingdom of Saudi Arabia
| | - Kate J Heesom
- School of Biochemistry, University of Bristol, Bristol, BS8 1TD, UK
| | - Jun Fan
- School of Biological and Chemical Sciences, Queen Mary University of London, London, E1 4NS, UK
| | - David Lee
- School of Cellular and Molecular Medicine, University of Bristol, Bristol, BS8 1TD, UK
| | - Ana Fernandez-Sesma
- Department of Microbiology, Icahn School of Medicine at Mount Sinai, New York, 10029, NY, USA
| | - Conrad Bessant
- School of Biological and Chemical Sciences, Queen Mary University of London, London, E1 4NS, UK
| | - David A Matthews
- School of Cellular and Molecular Medicine, University of Bristol, Bristol, BS8 1TD, UK
| | - Andrew D Davidson
- School of Cellular and Molecular Medicine, University of Bristol, Bristol, BS8 1TD, UK.
| |
Collapse
|
22
|
Guerrero CR, Jagtap PD, Johnson JE, Griffin TJ. Using Galaxy for Proteomics. PROTEOME INFORMATICS 2016. [DOI: 10.1039/9781782626732-00289] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/28/2023]
Abstract
The area of informatics for mass spectrometry (MS)-based proteomics data has steadily grown over the last two decades. Numerous, effective software programs now exist for various aspects of proteomic informatics. However, many researchers still have difficulties in using these software. These difficulties arise from problems with running and integrating disparate software programs, scalability issues when dealing with large data volumes, and lack of ability to share and reproduce workflows comprised of different software. The Galaxy framework for bioinformatics provides an attractive option for solving many of these current issues in proteomic informatics. Originally developed as a workbench to enable genomic data analysis, numerous researchers are now turning to Galaxy to implement software for MS-based proteomics applications. Here, we provide an introduction to Galaxy and its features, and describe how software tools are deployed, published and shared via the scalable framework. We also describe some of the existing tools in Galaxy for basic MS-based proteomics data analysis and informatics. Finally, we describe how proteomics tools in Galaxy can be combined with other existing tools for genomic and transcriptomic data analysis to enable powerful multi-omic data analysis applications.
Collapse
Affiliation(s)
- Candace R. Guerrero
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota 321 Church St SE/6-155 Jackson Hall Minneapolis MN 55455 USA
| | - Pratik D. Jagtap
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota 321 Church St SE/6-155 Jackson Hall Minneapolis MN 55455 USA
- Center for Mass Spectrometry and Proteomics, University of Minnesota 1479 Gortner Avenue, St. Paul MN 55108 USA
| | - James E. Johnson
- Minnesota Supercomputing Institute, University of Minnesota 512 Walter Library, 117 Pleasant Street SE Minneapolis MN 55455 USA
| | - Timothy J. Griffin
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota 321 Church St SE/6-155 Jackson Hall Minneapolis MN 55455 USA
- Center for Mass Spectrometry and Proteomics, University of Minnesota 1479 Gortner Avenue, St. Paul MN 55108 USA
| |
Collapse
|
23
|
Prasad TSK, Mohanty AK, Kumar M, Sreenivasamurthy SK, Dey G, Nirujogi RS, Pinto SM, Madugundu AK, Patil AH, Advani J, Manda SS, Gupta MK, Dwivedi SB, Kelkar DS, Hall B, Jiang X, Peery A, Rajagopalan P, Yelamanchi SD, Solanki HS, Raja R, Sathe GJ, Chavan S, Verma R, Patel KM, Jain AP, Syed N, Datta KK, Khan AA, Dammalli M, Jayaram S, Radhakrishnan A, Mitchell CJ, Na CH, Kumar N, Sinnis P, Sharakhov IV, Wang C, Gowda H, Tu Z, Kumar A, Pandey A. Integrating transcriptomic and proteomic data for accurate assembly and annotation of genomes. Genome Res 2016; 27:133-144. [PMID: 28003436 PMCID: PMC5204337 DOI: 10.1101/gr.201368.115] [Citation(s) in RCA: 48] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2015] [Accepted: 11/10/2016] [Indexed: 01/05/2023]
Abstract
Complementing genome sequence with deep transcriptome and proteome data could enable more accurate assembly and annotation of newly sequenced genomes. Here, we provide a proof-of-concept of an integrated approach for analysis of the genome and proteome of Anopheles stephensi, which is one of the most important vectors of the malaria parasite. To achieve broad coverage of genes, we carried out transcriptome sequencing and deep proteome profiling of multiple anatomically distinct sites. Based on transcriptomic data alone, we identified and corrected 535 events of incomplete genome assembly involving 1196 scaffolds and 868 protein-coding gene models. This proteogenomic approach enabled us to add 365 genes that were missed during genome annotation and identify 917 gene correction events through discovery of 151 novel exons, 297 protein extensions, 231 exon extensions, 192 novel protein start sites, 19 novel translational frames, 28 events of joining of exons, and 76 events of joining of adjacent genes as a single gene. Incorporation of proteomic evidence allowed us to change the designation of more than 87 predicted “noncoding RNAs” to conventional mRNAs coded by protein-coding genes. Importantly, extension of the newly corrected genome assemblies and gene models to 15 other newly assembled Anopheline genomes led to the discovery of a large number of apparent discrepancies in assembly and annotation of these genomes. Our data provide a framework for how future genome sequencing efforts should incorporate transcriptomic and proteomic analysis in combination with simultaneous manual curation to achieve near complete assembly and accurate annotation of genomes.
Collapse
Affiliation(s)
- T S Keshava Prasad
- Institute of Bioinformatics, International Technology Park, Bangalore, Karnataka 560066, India.,YU-IOB Center for Systems Biology and Molecular Medicine, Yenepoya University, Mangalore 575018, India.,NIMHANS-IOB Proteomics and Bioinformatics Laboratory, Neurobiology Research Centre, National Institute of Mental Health and Neuro Sciences, Bangalore, Karnataka 560029, India
| | - Ajeet Kumar Mohanty
- National Institute of Malaria Research, Field Station, Goa 403001, India.,Department of Zoology, Goa University, Taleigao Plateau, Goa 403206, India
| | - Manish Kumar
- Institute of Bioinformatics, International Technology Park, Bangalore, Karnataka 560066, India.,Manipal University, Madhav Nagar, Manipal, Karnataka 576104, India
| | - Sreelakshmi K Sreenivasamurthy
- Institute of Bioinformatics, International Technology Park, Bangalore, Karnataka 560066, India.,Manipal University, Madhav Nagar, Manipal, Karnataka 576104, India
| | - Gourav Dey
- Institute of Bioinformatics, International Technology Park, Bangalore, Karnataka 560066, India.,Manipal University, Madhav Nagar, Manipal, Karnataka 576104, India
| | - Raja Sekhar Nirujogi
- Institute of Bioinformatics, International Technology Park, Bangalore, Karnataka 560066, India.,Centre for Bioinformatics, Pondicherry University, Puducherry 605014, India
| | - Sneha M Pinto
- Institute of Bioinformatics, International Technology Park, Bangalore, Karnataka 560066, India.,YU-IOB Center for Systems Biology and Molecular Medicine, Yenepoya University, Mangalore 575018, India
| | - Anil K Madugundu
- Institute of Bioinformatics, International Technology Park, Bangalore, Karnataka 560066, India.,Centre for Bioinformatics, Pondicherry University, Puducherry 605014, India
| | - Arun H Patil
- Institute of Bioinformatics, International Technology Park, Bangalore, Karnataka 560066, India.,School of Biotechnology, KIIT University, Bhubaneswar, Odisha 751024, India
| | - Jayshree Advani
- Institute of Bioinformatics, International Technology Park, Bangalore, Karnataka 560066, India.,Manipal University, Madhav Nagar, Manipal, Karnataka 576104, India
| | - Srikanth S Manda
- Institute of Bioinformatics, International Technology Park, Bangalore, Karnataka 560066, India.,Centre for Bioinformatics, Pondicherry University, Puducherry 605014, India
| | - Manoj Kumar Gupta
- Institute of Bioinformatics, International Technology Park, Bangalore, Karnataka 560066, India.,Manipal University, Madhav Nagar, Manipal, Karnataka 576104, India
| | - Sutopa B Dwivedi
- Institute of Bioinformatics, International Technology Park, Bangalore, Karnataka 560066, India
| | - Dhanashree S Kelkar
- Institute of Bioinformatics, International Technology Park, Bangalore, Karnataka 560066, India
| | - Brantley Hall
- Department of Biochemistry, Virginia Polytechnic Institute and State University, Blacksburg, Virginia 24061, USA
| | - Xiaofang Jiang
- Department of Biochemistry, Virginia Polytechnic Institute and State University, Blacksburg, Virginia 24061, USA
| | - Ashley Peery
- Department of Entomology, Virginia Polytechnic Institute and State University, Blacksburg, Virginia 24061, USA
| | - Pavithra Rajagopalan
- Institute of Bioinformatics, International Technology Park, Bangalore, Karnataka 560066, India.,School of Biotechnology, KIIT University, Bhubaneswar, Odisha 751024, India
| | - Soujanya D Yelamanchi
- Institute of Bioinformatics, International Technology Park, Bangalore, Karnataka 560066, India.,School of Biotechnology, KIIT University, Bhubaneswar, Odisha 751024, India
| | - Hitendra S Solanki
- Institute of Bioinformatics, International Technology Park, Bangalore, Karnataka 560066, India.,School of Biotechnology, KIIT University, Bhubaneswar, Odisha 751024, India
| | - Remya Raja
- Institute of Bioinformatics, International Technology Park, Bangalore, Karnataka 560066, India
| | - Gajanan J Sathe
- Institute of Bioinformatics, International Technology Park, Bangalore, Karnataka 560066, India.,Manipal University, Madhav Nagar, Manipal, Karnataka 576104, India
| | - Sandip Chavan
- Institute of Bioinformatics, International Technology Park, Bangalore, Karnataka 560066, India.,Manipal University, Madhav Nagar, Manipal, Karnataka 576104, India
| | - Renu Verma
- Institute of Bioinformatics, International Technology Park, Bangalore, Karnataka 560066, India.,School of Biotechnology, KIIT University, Bhubaneswar, Odisha 751024, India
| | - Krishna M Patel
- Institute of Bioinformatics, International Technology Park, Bangalore, Karnataka 560066, India
| | - Ankit P Jain
- Institute of Bioinformatics, International Technology Park, Bangalore, Karnataka 560066, India.,School of Biotechnology, KIIT University, Bhubaneswar, Odisha 751024, India
| | - Nazia Syed
- Institute of Bioinformatics, International Technology Park, Bangalore, Karnataka 560066, India.,Department of Biochemistry and Molecular Biology, Pondicherry University, Puducherry 605014, India
| | - Keshava K Datta
- Institute of Bioinformatics, International Technology Park, Bangalore, Karnataka 560066, India.,School of Biotechnology, KIIT University, Bhubaneswar, Odisha 751024, India
| | - Aafaque Ahmed Khan
- Institute of Bioinformatics, International Technology Park, Bangalore, Karnataka 560066, India.,School of Biotechnology, KIIT University, Bhubaneswar, Odisha 751024, India
| | - Manjunath Dammalli
- Institute of Bioinformatics, International Technology Park, Bangalore, Karnataka 560066, India.,Department of Biotechnology, Siddaganga Institute of Technology, Tumkur, Karnataka 572103, India
| | - Savita Jayaram
- Institute of Bioinformatics, International Technology Park, Bangalore, Karnataka 560066, India.,Manipal University, Madhav Nagar, Manipal, Karnataka 576104, India
| | - Aneesha Radhakrishnan
- Institute of Bioinformatics, International Technology Park, Bangalore, Karnataka 560066, India.,Department of Biochemistry and Molecular Biology, Pondicherry University, Puducherry 605014, India
| | - Christopher J Mitchell
- McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins University School of Medicine, Baltimore, Maryland 21205, USA
| | - Chan-Hyun Na
- Department of Neurology, Johns Hopkins University, Baltimore, Maryland 21205, USA
| | - Nirbhay Kumar
- Department of Tropical Medicine, Tulane University School of Public Health and Tropical Medicine, New Orleans, Louisiana 70112, USA
| | - Photini Sinnis
- Malaria Research Institute, Johns Hopkins Bloomberg School of Public Health, Baltimore, Maryland 21205, USA
| | - Igor V Sharakhov
- Department of Entomology, Virginia Polytechnic Institute and State University, Blacksburg, Virginia 24061, USA
| | - Charles Wang
- Center for Genomics and Department of Basic Sciences, School of Medicine, Loma Linda University, Loma Linda, California 92350, USA
| | - Harsha Gowda
- Institute of Bioinformatics, International Technology Park, Bangalore, Karnataka 560066, India.,YU-IOB Center for Systems Biology and Molecular Medicine, Yenepoya University, Mangalore 575018, India
| | - Zhijian Tu
- Department of Biochemistry, Virginia Polytechnic Institute and State University, Blacksburg, Virginia 24061, USA
| | - Ashwani Kumar
- National Institute of Malaria Research, Field Station, Goa 403001, India
| | - Akhilesh Pandey
- Institute of Bioinformatics, International Technology Park, Bangalore, Karnataka 560066, India.,McKusick-Nathans Institute of Genetic Medicine, Johns Hopkins University School of Medicine, Baltimore, Maryland 21205, USA.,Department of Biological Chemistry, Johns Hopkins University School of Medicine, Baltimore, Maryland 21205, USA.,Department of Oncology, Johns Hopkins University School of Medicine, Baltimore, Maryland 21205, USA.,Department of Pathology, Johns Hopkins University School of Medicine, Baltimore, Maryland 21205, USA
| |
Collapse
|
24
|
Van Eyk JE, Corrales FJ, Aebersold R, Cerciello F, Deutsch EW, Roncada P, Sanchez JC, Yamamoto T, Yang P, Zhang H, Omenn GS. Highlights of the Biology and Disease-driven Human Proteome Project, 2015-2016. J Proteome Res 2016; 15:3979-3987. [PMID: 27573249 DOI: 10.1021/acs.jproteome.6b00444] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
The Biology and Disease-driven Human Proteome Project (B/D-HPP) is aimed at supporting and enhancing the broad use of state-of-the-art proteomic methods to characterize and quantify proteins for in-depth understanding of the molecular mechanisms of biological processes and human disease. Based on a foundation of the pre-existing HUPO initiatives begun in 2002, the B/D-HPP is designed to provide standardized methods and resources for mass spectrometry and specific protein affinity reagents and facilitate accessibility of these resources to the broader life sciences research and clinical communities. Currently there are 22 B/D-HPP initiatives and 3 closely related HPP resource pillars. The B/D-HPP groups are working to define sets of protein targets that are highly relevant to each particular field to deliver relevant assays for the measurement of these selected targets and to disseminate and make publicly accessible the information and tools generated. Major developments are the 2016 publications of the Human SRM Atlas and of "popular protein sets" for six organ systems. Here we present the current activities and plans of the BD-HPP initiatives as highlighted in numerous B/D-HPP workshops at the 14th annual HUPO 2015 World Congress of Proteomics in Vancouver, Canada.
Collapse
Affiliation(s)
- Jennifer E Van Eyk
- Advanced Clinical BioSystems Research Institute, Department of Medicine, Cedars-Sinai Medical Centre , Los Angeles, California 90038, United States
| | - Fernando J Corrales
- Department of Hepatology, Proteomics Laboratory, CIMA, University of Navarra; Ciberhed; PRB2, ProteoRed-ISCIII, 31008 Pamplona, Spain
| | - Ruedi Aebersold
- Department of Biology, Institute of Molecular Systems Biology, ETH Zürich , 8093 Zürich, Switzerland
| | - Ferdinando Cerciello
- Department of Biology, Institute of Molecular Systems Biology, ETH Zürich , 8093 Zürich, Switzerland
| | - Eric W Deutsch
- Institute for Systems Biology , Seattle, Washington 98109, United States
| | - Paola Roncada
- Istituto Sperimentale Italiano L. Spallanzani , 20133 Milano, Italy
| | - Jean-Charles Sanchez
- Centre Medicale Universitaire , Human Protein Sciences Department, CH-1211 Geneva, Switzerland
| | - Tadashi Yamamoto
- Niigata University , Department of Structural Pathology, Institute of Nephrology, Medical and Dental School, Asachimachi-dori Niigata 951-8510, Japan
| | - Pengyuan Yang
- Fudan University , Department of Chemistry, Shanghai 200433, P.R. China
| | - Hui Zhang
- Johns Hopkins University , Department of Pathology, Baltimore, Maryland 21287, United States
| | - Gilbert S Omenn
- Center for Computational Medicine and Bioinformatics, University of Michigan , Ann Arbor, Michigan 48109, United States
| |
Collapse
|
25
|
Luge T, Fischer C, Sauer S. Efficient Application of De Novo RNA Assemblers for Proteomics Informed by Transcriptomics. J Proteome Res 2016; 15:3938-3943. [DOI: 10.1021/acs.jproteome.6b00301] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Affiliation(s)
- Toni Luge
- Otto-Warburg-Laboratory, Max Planck Institute for Molecular Genetics, Ihnestraße 63-73, 14195 Berlin, Germany
| | - Cornelius Fischer
- Otto-Warburg-Laboratory, Max Planck Institute for Molecular Genetics, Ihnestraße 63-73, 14195 Berlin, Germany
- BIMSB
and BIH Genomics Platforms, Laboratory of Functional Genomics, Nutrigenomics
and Systems Biology, Max-Delbrück-Center for Molecular Medicine, Robert-Rössle-Straße
10, 13125 Berlin, Germany
| | - Sascha Sauer
- Otto-Warburg-Laboratory, Max Planck Institute for Molecular Genetics, Ihnestraße 63-73, 14195 Berlin, Germany
- BIMSB
and BIH Genomics Platforms, Laboratory of Functional Genomics, Nutrigenomics
and Systems Biology, Max-Delbrück-Center for Molecular Medicine, Robert-Rössle-Straße
10, 13125 Berlin, Germany
- CU Systems
Medicine, University of Würzburg, Josef-Schneider-Straße 2, 97080 Würzburg, Germany
| |
Collapse
|
26
|
Vaudel M, Barsnes H, Ræder H, Berven FS. Using Proteomics Bioinformatics Tools and Resources in Proteogenomic Studies. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2016; 926:65-75. [DOI: 10.1007/978-3-319-42316-6_5] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
|