1
|
Martínez-Bartolomé S, Binz PA, Albar JP. The Minimal Information about a Proteomics Experiment (MIAPE) from the Proteomics Standards Initiative. Methods Mol Biol 2014; 1072:765-80. [PMID: 24136562 DOI: 10.1007/978-1-62703-631-3_53] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
During the last 10 years, the Proteomics Standards Initiative from the Human Proteome Organization (HUPO-PSI) has worked on defining standards for proteomics data representation as well as guidelines that state the minimum information that should be included when reporting a proteomics experiment (MIAPE). Such minimum information must describe the complete experiment, including both experimental protocols and data processing methods, allowing a critical evaluation of the whole process and the potential recreation of the work. In this chapter we describe the standardization work performed by the HUPO-PSI, and then we concentrate on the MIAPE guidelines, highlighting its importance when publishing proteomics experiments particularly in specialized proteomics journals. Finally, we describe existing bioinformatics resources that generate MIAPE compliant reports or that check proteomics data files for MIAPE compliance.
Collapse
|
2
|
Medina-Aunon JA, Krishna R, Ghali F, Albar JP, Jones AJ. A guide for integration of proteomic data standards into laboratory workflows. Proteomics 2013; 13:480-92. [DOI: 10.1002/pmic.201200268] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2012] [Revised: 08/14/2012] [Accepted: 09/10/2012] [Indexed: 01/28/2023]
Affiliation(s)
| | - Ritesh Krishna
- Institute of Integrative Biology; University of Liverpool; Liverpool; UK
| | - Fawaz Ghali
- Institute of Integrative Biology; University of Liverpool; Liverpool; UK
| | - Juan P. Albar
- Centro Nacional de Biotecnología; CSIC; Madrid; Spain
| | - Andrew J. Jones
- Institute of Integrative Biology; University of Liverpool; Liverpool; UK
| |
Collapse
|
3
|
Van Riper SK, de Jong EP, Carlis JV, Griffin TJ. Mass Spectrometry-Based Proteomics: Basic Principles and Emerging Technologies and Directions. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2013; 990:1-35. [DOI: 10.1007/978-94-007-5896-4_1] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
|
4
|
Medina-Aunon JA, Martínez-Bartolomé S, López-García MA, Salazar E, Navajas R, Jones AR, Paradela A, Albar JP. The ProteoRed MIAPE web toolkit: a user-friendly framework to connect and share proteomics standards. Mol Cell Proteomics 2012; 10:M111.008334. [PMID: 21983993 DOI: 10.1074/mcp.m111.008334] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
The development of the HUPO-PSI's (Proteomics Standards Initiative) standard data formats and MIAPE (Minimum Information About a Proteomics Experiment) guidelines should improve proteomics data sharing within the scientific community. Proteomics journals have encouraged the use of these standards and guidelines to improve the quality of experimental reporting and ease the evaluation and publication of manuscripts. However, there is an evident lack of bioinformatics tools specifically designed to create and edit standard file formats and reports, or embed them within proteomics workflows. In this article, we describe a new web-based software suite (The ProteoRed MIAPE web toolkit) that performs several complementary roles related to proteomic data standards. First, it can verify that the reports fulfill the minimum information requirements of the corresponding MIAPE modules, highlighting inconsistencies or missing information. Second, the toolkit can convert several XML-based data standards directly into human readable MIAPE reports stored within the ProteoRed MIAPE repository. Finally, it can also perform the reverse operation, allowing users to export from MIAPE reports into XML files for computational processing, data sharing, or public database submission. The toolkit is thus the first application capable of automatically linking the PSI's MIAPE modules with the corresponding XML data exchange standards, enabling bidirectional conversions. This toolkit is freely available at http://www.proteored.org/MIAPE/.
Collapse
|
5
|
Beynon RJ. Enabling proteomics: the need for an extendable 'workbench' for user-configurable solutions. Comp Funct Genomics 2010; 5:52-5. [PMID: 18629039 PMCID: PMC2447322 DOI: 10.1002/cfg.372] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2003] [Revised: 11/18/2003] [Accepted: 11/26/2003] [Indexed: 11/08/2022] Open
Abstract
Proteomics has the capability to generate overwhelming quantities of data in relatively short timescales, and it is not uncommon to see experimenters investing substantially more time in data analysis than in data gathering. Although several sophisticated tools for data reduction and analysis are available, they lack the flexibility to cope with increasingly innovative experimental strategies and new database resources that encode both qualitative and quantitative data. I will outline a specification of a flexible proteomics tool that could address many current bottlenecks and deficiencies.
Collapse
Affiliation(s)
- Robert J Beynon
- Department of Veterinary Preclinical Sciences, University of Liverpool, Crown Street, Liverpool L69 7ZJ, UK.
| |
Collapse
|
6
|
Jones A, Wastling J, Hunt E. Proposal for a standard representation of two-dimensional gel electrophoresis data. Comp Funct Genomics 2010; 4:492-501. [PMID: 18628860 PMCID: PMC2447283 DOI: 10.1002/cfg.323] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022] Open
Abstract
The global analysis of proteins is now feasible due to improvements in techniques such as two-dimensional gel electrophoresis (2-DE), mass spectrometry, yeast two-hybrid
systems and the development of bioinformatics applications. The experiments form
the basis of proteomics, and present significant challenges in data analysis, storage and
querying. We argue that a standard format for proteome data is required to enable
the storage, exchange and subsequent re-analysis of large datasets. We describe the
criteria that must be met for the development of a standard for proteomics. We have
developed a model to represent data from 2-DE experiments, including difference
gel electrophoresis along with image analysis and statistical analysis across multiple
gels. This part of proteomics analysis is not represented in current proposals for
proteomics standards. We are working with the Proteomics Standards Initiative to
develop a model encompassing biological sample origin, experimental protocols, a
number of separation techniques and mass spectrometry. The standard format will
facilitate the development of central repositories of data, enabling results to be verified
or re-analysed, and the correlation of results produced by different research groups
using a variety of laboratory techniques.
Collapse
Affiliation(s)
- Andrew Jones
- Department of Computing Science, University of Glasgow, 17 Lilybank Gardens, Glasgow G12 8QQ, UK.
| | | | | |
Collapse
|
7
|
Abstract
Proteome--the protein complement of a genome--has become the protein renaissance and a key research tool in the post-genomic era. The basic technology involves the routine usage of gel electrophoresis and spectrometry procedures for deciphering the primary protein sequence/structure as well as knowing certain unique post-translational modifications that a particular protein has undergone to perform a specific function in the cell. However, the recent advancements in protein analysis have ushered this science to provide deeper, bigger and more valuable perspectives regarding performance of subtle protein-protein interactions. Applications of this branch of molecular biology are as vast as the subject is and include clinical diagnostics, pharmaceutical and biotechnological industries. The 21st century hails the use of products, procedures and advancements of this science as finer touches required for the grooming of fast-paced technology.
Collapse
Affiliation(s)
- Anu Kalia
- Department of Microbiology Punjab Agricultural University, Ludhiana, Punjab, India.
| | | |
Collapse
|
8
|
Hermjakob H, Montecchi-Palazzi L, Bader G, Wojcik J, Salwinski L, Ceol A, Moore S, Orchard S, Sarkans U, von Mering C, Roechert B, Poux S, Jung E, Mersch H, Kersey P, Lappe M, Li Y, Zeng R, Rana D, Nikolski M, Husi H, Brun C, Shanker K, Grant SGN, Sander C, Bork P, Zhu W, Pandey A, Brazma A, Jacq B, Vidal M, Sherman D, Legrain P, Cesareni G, Xenarios I, Eisenberg D, Steipe B, Hogue C, Apweiler R. The HUPO PSI's molecular interaction format--a community standard for the representation of protein interaction data. Nat Biotechnol 2004; 22:177-83. [PMID: 14755292 DOI: 10.1038/nbt926] [Citation(s) in RCA: 469] [Impact Index Per Article: 23.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
A major goal of proteomics is the complete description of the protein interaction network underlying cell physiology. A large number of small scale and, more recently, large-scale experiments have contributed to expanding our understanding of the nature of the interaction network. However, the necessary data integration across experiments is currently hampered by the fragmentation of publicly available protein interaction data, which exists in different formats in databases, on authors' websites or sometimes only in print publications. Here, we propose a community standard data model for the representation and exchange of protein interaction data. This data model has been jointly developed by members of the Proteomics Standards Initiative (PSI), a work group of the Human Proteome Organization (HUPO), and is supported by major protein interaction data providers, in particular the Biomolecular Interaction Network Database (BIND), Cellzome (Heidelberg, Germany), the Database of Interacting Proteins (DIP), Dana Farber Cancer Institute (Boston, MA, USA), the Human Protein Reference Database (HPRD), Hybrigenics (Paris, France), the European Bioinformatics Institute's (EMBL-EBI, Hinxton, UK) IntAct, the Molecular Interactions (MINT, Rome, Italy) database, the Protein-Protein Interaction Database (PPID, Edinburgh, UK) and the Search Tool for the Retrieval of Interacting Genes/Proteins (STRING, EMBL, Heidelberg, Germany).
Collapse
Affiliation(s)
- Henning Hermjakob
- European Bioinformatics Institute, EBI-Hinxton, Wellcome Trust Genome Campus, Hinxton, Cambridge, UK.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
9
|
Warnock DE, Fahy E, Taylor SW. Identification of protein associations in organelles, using mass spectrometry-based proteomics. MASS SPECTROMETRY REVIEWS 2004; 23:259-280. [PMID: 15133837 DOI: 10.1002/mas.10077] [Citation(s) in RCA: 38] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]
Abstract
Recent literature that highlights the power of using mass spectrometry (MS) for protein identification from preparations of highly purified organelles and other large subcellular structures is covered in this review with an emphasis on techniques that preserve the integrity of the functional protein complexes. Recent advances in distinguishing contaminant proteins from "bonafide" organelle-localized proteins and the affinity capture of protein complexes are reviewed, as well as bioinformatic strategies to predict protein organellar localization and to integrate protein-protein interaction maps obtained from MS-affinity capture methods with data obtained from other techniques. Those developments demonstrate that a revolution in cellular biology, fueled by technical advances in MS-based proteomic techniques, is well underway.
Collapse
Affiliation(s)
- Dale E Warnock
- MitoKor, Inc., 11494 Sorrento Valley Road, San Diego, California 92121, USA
| | | | | |
Collapse
|
10
|
Navarro JD, Talreja N, Peri S, Vrushabendra BM, Rashmi BP, Padma N, Surendranath V, Jonnalagadda CK, Kousthub PS, Deshpande N, Shanker K, Pandey A. BioBuilder as a database development and functional annotation platform for proteins. BMC Bioinformatics 2004; 5:43. [PMID: 15099404 PMCID: PMC406495 DOI: 10.1186/1471-2105-5-43] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2003] [Accepted: 04/20/2004] [Indexed: 11/10/2022] Open
Abstract
Background The explosion in biological information creates the need for databases that are easy to develop, easy to maintain and can be easily manipulated by annotators who are most likely to be biologists. However, deployment of scalable and extensible databases is not an easy task and generally requires substantial expertise in database development. Results BioBuilder is a Zope-based software tool that was developed to facilitate intuitive creation of protein databases. Protein data can be entered and annotated through web forms along with the flexibility to add customized annotation features to protein entries. A built-in review system permits a global team of scientists to coordinate their annotation efforts. We have already used BioBuilder to develop Human Protein Reference Database , a comprehensive annotated repository of the human proteome. The data can be exported in the extensible markup language (XML) format, which is rapidly becoming as the standard format for data exchange. Conclusions As the proteomic data for several organisms begins to accumulate, BioBuilder will prove to be an invaluable platform for functional annotation and development of customizable protein centric databases. BioBuilder is open source and is available under the terms of LGPL.
Collapse
Affiliation(s)
- J Daniel Navarro
- McKusick-Nathans Institute of Genetic Medicine and the Department of Biological Chemistry, Johns Hopkins University School of Medicine, Baltimore, Maryland, U.S.A
- Departamento de Automática y Computación, Área de Ciencias de la Computación e Inteligencia Artificial, Universidad Pública de Navarra, 31006, Pamplona, Spain
| | - Naveen Talreja
- Institute of Bioinformatics, Discoverer 7Floor, International Technology Park Ltd., Bangalore 560 066, India
| | - Suraj Peri
- McKusick-Nathans Institute of Genetic Medicine and the Department of Biological Chemistry, Johns Hopkins University School of Medicine, Baltimore, Maryland, U.S.A
- Department of Biochemistry and Molecular Biology, University of Southern Denmark, Odense, Denmark
| | - BM Vrushabendra
- Institute of Bioinformatics, Discoverer 7Floor, International Technology Park Ltd., Bangalore 560 066, India
| | - BP Rashmi
- Institute of Bioinformatics, Discoverer 7Floor, International Technology Park Ltd., Bangalore 560 066, India
| | - N Padma
- Institute of Bioinformatics, Discoverer 7Floor, International Technology Park Ltd., Bangalore 560 066, India
| | - Vineeth Surendranath
- Institute of Bioinformatics, Discoverer 7Floor, International Technology Park Ltd., Bangalore 560 066, India
| | - Chandra Kiran Jonnalagadda
- Institute of Bioinformatics, Discoverer 7Floor, International Technology Park Ltd., Bangalore 560 066, India
| | - PS Kousthub
- Institute of Bioinformatics, Discoverer 7Floor, International Technology Park Ltd., Bangalore 560 066, India
| | - Nandan Deshpande
- Institute of Bioinformatics, Discoverer 7Floor, International Technology Park Ltd., Bangalore 560 066, India
| | - K Shanker
- Institute of Bioinformatics, Discoverer 7Floor, International Technology Park Ltd., Bangalore 560 066, India
| | - Akhilesh Pandey
- McKusick-Nathans Institute of Genetic Medicine and the Department of Biological Chemistry, Johns Hopkins University School of Medicine, Baltimore, Maryland, U.S.A
| |
Collapse
|
11
|
Lappe M, Holm L. Unraveling protein interaction networks with near-optimal efficiency. Nat Biotechnol 2003; 22:98-103. [PMID: 14661027 DOI: 10.1038/nbt921] [Citation(s) in RCA: 64] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2003] [Accepted: 10/10/2003] [Indexed: 11/08/2022]
Abstract
The functional characterization of genes and their gene products is the main challenge of the genomic era. Examining interaction information for every gene product is a direct way to assemble the jigsaw puzzle of proteins into a functional map. Here we demonstrate a method in which the information gained from pull-down experiments, in which single proteins act as baits to detect interactions with other proteins, is maximized by using a network-based strategy to select the baits. Because of the scale-free distribution of protein interaction networks, we were able to obtain fast coverage by focusing on highly connected nodes (hubs) first. Unfortunately, locating hubs requires prior global information about the network one is trying to unravel. Here, we present an optimized 'pay-as-you-go' strategy that identifies highly connected nodes using only local information that is collected as successive pull-down experiments are performed. Using this strategy, we estimate that 90% of the human interactome can be covered by 10,000 pull-down experiments, with 50% of the interactions confirmed by reciprocal pull-down experiments.
Collapse
Affiliation(s)
- Michael Lappe
- EMBL Outstation, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK.
| | | |
Collapse
|
12
|
Abstract
Currently, relatively few proteomics studies of chloroplast have been published, but the field has just started emerging and is likely to develop more rapidly in the future. While the complex membrane structure of the chloroplast makes it difficult to study its entire proteome by global approaches, proteomics has considerably increased our knowledge of the proteins of single compartments such as, for instance, the envelope and the thylakoid lumen. Proteomics has also succeeded in the subunit characterisation of select protein complexes such as the ribosomes and the cytochrome b (6)f complex. In addition, proteomics was successfully applied to find new potential target pathways for thioredoxin-mediated signal transduction. In this review, we present an overview of the latest developments in the field of chloroplast proteomics and discuss their impact on photosynthesis research. In addition, we summarise the current state of research in proteomics of the photosynthetic cyanobactrium Synechocystis sp. PCC 6803.
Collapse
Affiliation(s)
- Wolfgang P Schröder
- Departments of Chemistry and Biochemistry, Umeå University, 901 87, Umeå, Sweden
| | | |
Collapse
|