1
|
Mou M, Pan Z, Lu M, Sun H, Wang Y, Luo Y, Zhu F. Application of Machine Learning in Spatial Proteomics. J Chem Inf Model 2022; 62:5875-5895. [PMID: 36378082 DOI: 10.1021/acs.jcim.2c01161] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Spatial proteomics is an interdisciplinary field that investigates the localization and dynamics of proteins, and it has gained extensive attention in recent years, especially the subcellular proteomics. Numerous evidence indicate that the subcellular localization of proteins is associated with various cellular processes and disease progression. Mass spectrometry (MS)-based and imaging-based experimental approaches have been developed to acquire large-scale spatial proteomic data. To allow the reliable analysis of increasingly complex spatial proteomics data, machine learning (ML) methods have been widely used in both MS-based and imaging-based spatial proteomic data analysis pipelines. Here, we comprehensively survey the applications of ML in spatial proteomics from following aspects: (1) data resources for spatial proteome are comprehensively introduced; (2) the roles of different ML algorithms in data analysis pipelines are elaborated; (3) successful applications of spatial proteomics and several analytical tools integrating ML methods are presented; (4) challenges existing in modern ML-based spatial proteomics studies are discussed. This review provides guidelines for researchers seeking to apply ML methods to analyze spatial proteomic data and can facilitate insightful understanding of cell biology as well as the future research in medical and drug discovery communities.
Collapse
Affiliation(s)
- Minjie Mou
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, China
| | - Ziqi Pan
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, China
| | - Mingkun Lu
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, China
| | - Huaicheng Sun
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, China
| | - Yunxia Wang
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, China
| | - Yongchao Luo
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, China
| | - Feng Zhu
- College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, China
| |
Collapse
|
2
|
Weke K, Singh A, Uwugiaren N, Alfaro JA, Wang T, Hupp TR, O'Neill JR, Vojtesek B, Goodlett DR, Williams SM, Zhou M, Kelly RT, Zhu Y, Dapic I. MicroPOTS Analysis of Barrett's Esophageal Cell Line Models Identifies Proteomic Changes after Physiologic and Radiation Stress. J Proteome Res 2021; 20:2195-2205. [PMID: 33491460 PMCID: PMC8155554 DOI: 10.1021/acs.jproteome.0c00629] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
![]()
Moving from macroscale
preparative systems in proteomics to micro-
and nanotechnologies offers researchers the ability to deeply profile
smaller numbers of cells that are more likely to be encountered in
clinical settings. Herein a recently developed microscale proteomic
method, microdroplet processing in one pot for trace samples (microPOTS),
was employed to identify proteomic changes in ∼200 Barrett’s
esophageal cells following physiologic and radiation stress exposure.
From this small population of cells, microPOTS confidently identified
>1500 protein groups, and achieved a high reproducibility with
a Pearson’s
correlation coefficient value of R > 0.9 and over
50% protein overlap from replicates. A Barrett’s cell line
model treated with either lithocholic acid (LCA) or X-ray had 21 (e.g.,
ASNS, RALY, FAM120A, UBE2M, IDH1, ESD) and 32 (e.g., GLUL, CALU, SH3BGRL3,
S100A9, FKBP3, AGR2) overexpressed proteins, respectively, compared
to the untreated set. These results demonstrate the ability of microPOTS
to routinely identify and quantify differentially expressed proteins
from limited numbers of cells.
Collapse
Affiliation(s)
- Kenneth Weke
- University of Gdansk, International Centre for Cancer Vaccine Science, ul. Kładki 24, 80-822 Gdansk, Poland
| | - Ashita Singh
- Institute of Genetics and Molecular Medicine, University of Edinburgh, Edinburgh, Scotland EH4 2XR, U.K.,Research Centre for Applied Molecular Oncology (RECAMO), Masaryk Memorial Cancer Institute, 656 53 Brno, Czech Republic
| | - Naomi Uwugiaren
- University of Gdansk, International Centre for Cancer Vaccine Science, ul. Kładki 24, 80-822 Gdansk, Poland
| | - Javier A Alfaro
- University of Gdansk, International Centre for Cancer Vaccine Science, ul. Kładki 24, 80-822 Gdansk, Poland.,Institute of Genetics and Molecular Medicine, University of Edinburgh, Edinburgh, Scotland EH4 2XR, U.K
| | - Tongjie Wang
- Institute of Genetics and Molecular Medicine, University of Edinburgh, Edinburgh, Scotland EH4 2XR, U.K
| | - Ted R Hupp
- University of Gdansk, International Centre for Cancer Vaccine Science, ul. Kładki 24, 80-822 Gdansk, Poland.,Institute of Genetics and Molecular Medicine, University of Edinburgh, Edinburgh, Scotland EH4 2XR, U.K
| | - J Robert O'Neill
- Institute of Genetics and Molecular Medicine, University of Edinburgh, Edinburgh, Scotland EH4 2XR, U.K.,Cambridge Oesophagogastric Centre, Cambridge University Hospitals NHS Foundation Trust, Cambridge CB2 0QQ, U.K
| | - Borek Vojtesek
- Research Centre for Applied Molecular Oncology (RECAMO), Masaryk Memorial Cancer Institute, 656 53 Brno, Czech Republic
| | - David R Goodlett
- University of Gdansk, International Centre for Cancer Vaccine Science, ul. Kładki 24, 80-822 Gdansk, Poland.,University of Victoria - Genome British Columbia Proteomics Centre, Victoria, BC V8Z 7X8, Canada.,Department of Biochemistry and Microbiology, University of Victoria, Victoria, BC V8P 5C2, Canada
| | - Sarah M Williams
- Environmental Molecular Sciences Laboratory, Pacific Northwest National Laboratory, Richland, Washington 99354, United States
| | - Mowei Zhou
- Environmental Molecular Sciences Laboratory, Pacific Northwest National Laboratory, Richland, Washington 99354, United States
| | - Ryan T Kelly
- Environmental Molecular Sciences Laboratory, Pacific Northwest National Laboratory, Richland, Washington 99354, United States
| | - Ying Zhu
- Environmental Molecular Sciences Laboratory, Pacific Northwest National Laboratory, Richland, Washington 99354, United States
| | - Irena Dapic
- University of Gdansk, International Centre for Cancer Vaccine Science, ul. Kładki 24, 80-822 Gdansk, Poland
| |
Collapse
|
3
|
Eriksson O, Asplund A, Hegde G, Edqvist PH, Navani S, Pontén F, Siegbahn A. A stromal cell population in the large intestine identified by tissue factor expression that is lost during colorectal cancer progression. Thromb Haemost 2016; 116:1050-1059. [PMID: 27656710 DOI: 10.1160/th16-04-0267] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2016] [Accepted: 09/07/2016] [Indexed: 01/27/2023]
Abstract
Colorectal cancer (CRC) is a major cause of morbidity and mortality, and the composition of the tumour stroma is a strong predictor of survival in this cancer type. Tissue factor (TF) functions as the trigger of haemostasis together with its ligand coagulation factor VII/VIIa, and TF expression has been found in tumour cells of colorectal tumours. However, TF expression in the CRC tumour stroma or its relationship to patient outcome has not yet been studied. To address this question we developed and validated a specific anti-TF antibody using standardised methods within the Human Protein Atlas project. We used this antibody to investigate TF expression in normal colorectal tissue and CRC using immunofluorescence and immunohistochemistry in two patient cohorts. TF was strongly expressed in a cell population immediately adjacent to the colorectal epithelium. These TF-positive cells were ACTA2-negative but weakly vimentin-positive, defining a specific population of pericryptal sheath cells. In colorectal tumours, TF-positive sheath cells were progressively lost after the adenoma-to-carcinoma transition, demonstrating downregulation of this source of TF in CRC. Furthermore, loss of sheath cell TF was significantly associated with poor overall and disease-specific survival in rectal but not colon cancers. In conclusion, we demonstrate that TF is a marker of a specific cell population in the large intestine, which is lost during CRC progression. Our results highlight the role of the tumour stroma in this cancer type and suggest TF to be a potential prognostic biomarker in rectal cancers through the identification of pericryptal sheath cells.
Collapse
Affiliation(s)
- Oskar Eriksson
- Oskar Eriksson, Department of Medical Sciences, Clinical Chemistry, University Hospital, Entr. 61, 3rd floor, S-751 85 Uppsala, Sweden, Tel.: +46 186114251, Fax: +46 18552562, E-mail:
| | | | | | | | | | | | - Agneta Siegbahn
- Agneta Siegbahn, Department of Medical Sciences, Clinical Chemistry, University Hospital, Entr. 61, 3rd floor, S-751 85 Uppsala, Sweden, Tel.: +46 186114251, Fax: +46 18552562, E-mail:
| |
Collapse
|
4
|
Breckels LM, Holden SB, Wojnar D, Mulvey CM, Christoforou A, Groen A, Trotter MWB, Kohlbacher O, Lilley KS, Gatto L. Learning from Heterogeneous Data Sources: An Application in Spatial Proteomics. PLoS Comput Biol 2016; 12:e1004920. [PMID: 27175778 PMCID: PMC4866734 DOI: 10.1371/journal.pcbi.1004920] [Citation(s) in RCA: 35] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2015] [Accepted: 04/16/2016] [Indexed: 11/19/2022] Open
Abstract
Sub-cellular localisation of proteins is an essential post-translational regulatory mechanism that can be assayed using high-throughput mass spectrometry (MS). These MS-based spatial proteomics experiments enable us to pinpoint the sub-cellular distribution of thousands of proteins in a specific system under controlled conditions. Recent advances in high-throughput MS methods have yielded a plethora of experimental spatial proteomics data for the cell biology community. Yet, there are many third-party data sources, such as immunofluorescence microscopy or protein annotations and sequences, which represent a rich and vast source of complementary information. We present a unique transfer learning classification framework that utilises a nearest-neighbour or support vector machine system, to integrate heterogeneous data sources to considerably improve on the quantity and quality of sub-cellular protein assignment. We demonstrate the utility of our algorithms through evaluation of five experimental datasets, from four different species in conjunction with four different auxiliary data sources to classify proteins to tens of sub-cellular compartments with high generalisation accuracy. We further apply the method to an experiment on pluripotent mouse embryonic stem cells to classify a set of previously unknown proteins, and validate our findings against a recent high resolution map of the mouse stem cell proteome. The methodology is distributed as part of the open-source Bioconductor pRoloc suite for spatial proteomics data analysis. Sub-cellular localisation of proteins is critical to their function in all cellular processes; proteins localising to their intended micro-environment, e.g organelles, vesicles or macro-molecular complexes, will meet the interaction partners and biochemical conditions suitable to pursue their molecular function. Therefore, sound data and methods to reliably and systematically study protein localisation, and hence their mis-localisation and the disruption of protein trafficking, that are relied upon by the cell biology community, are essential. Here we present a method to infer protein localisation relying on the optimal integration of experimental mass spectrometry-based data and auxiliary sources, such as GO annotation, outputs from third-party software, protein-protein interactions or immunocytochemistry data. We found that the application of transfer learning algorithms across these diverse data sources considerably improves on the quantity and reliability of sub-cellular protein assignment, compared to single data classifiers previously applied to infer sub-cellular localisation using experimental data only. We show how our method does not compromise biologically relevant experimental-specific signal after integration with heterogeneous freely available third-party resources. The integration of different data sources is an important challenge in the data intensive world of biology and we anticipate the transfer learning methods presented here will prove useful to many areas of biology, to unify data obtained from different but complimentary sources.
Collapse
Affiliation(s)
- Lisa M. Breckels
- Computational Proteomics Unit, Department of Biochemistry, University of Cambridge, Cambridge, United Kingdom
- Cambridge Centre for Proteomics, Department of Biochemistry, University of Cambridge, Cambridge, United Kingdom
| | - Sean B. Holden
- Computer Laboratory, University of Cambridge, Cambridge, United Kingdom
| | - David Wojnar
- Quantitative Biology Center, Universität Tübingen, Tübingen, Germany
| | - Claire M. Mulvey
- Cambridge Centre for Proteomics, Department of Biochemistry, University of Cambridge, Cambridge, United Kingdom
| | - Andy Christoforou
- Cambridge Centre for Proteomics, Department of Biochemistry, University of Cambridge, Cambridge, United Kingdom
| | - Arnoud Groen
- Cambridge Centre for Proteomics, Department of Biochemistry, University of Cambridge, Cambridge, United Kingdom
| | | | - Oliver Kohlbacher
- Quantitative Biology Center, Universität Tübingen, Tübingen, Germany
- Center for Bioinformatics, Universität Tübingen, Tübingen, Germany
- Biomolecular Interactions, Max Planck Institute for Developmental Biology, Tübingen, Germany
| | - Kathryn S. Lilley
- Cambridge Centre for Proteomics, Department of Biochemistry, University of Cambridge, Cambridge, United Kingdom
| | - Laurent Gatto
- Computational Proteomics Unit, Department of Biochemistry, University of Cambridge, Cambridge, United Kingdom
- Cambridge Centre for Proteomics, Department of Biochemistry, University of Cambridge, Cambridge, United Kingdom
- * E-mail:
| |
Collapse
|
5
|
Integrating phenotypic small-molecule profiling and human genetics: the next phase in drug discovery. Trends Genet 2014; 31:16-23. [PMID: 25498789 DOI: 10.1016/j.tig.2014.11.002] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2014] [Revised: 11/14/2014] [Accepted: 11/17/2014] [Indexed: 12/11/2022]
Abstract
Over the past decade, tremendous progress in high-throughput small molecule-screening methods has facilitated the rapid expansion of phenotype-based data. Parallel advances in genomic characterization methods have complemented these efforts by providing a growing list of annotated cell line features. Together, these developments have paved the way for feature-based identification of novel, exploitable cellular dependencies, subsequently expanding our therapeutic toolkit in cancer and other diseases. Here, we provide an overview of the evolution of phenotypic small-molecule profiling and discuss the most significant and recent profiling and analytical efforts, their impact on the field, and their clinical ramifications. We additionally provide a perspective for future developments in phenotypic profiling efforts guided by genomic science.
Collapse
|
6
|
Targeted development of specific biomarkers of endometrial stromal cell differentiation using bioinformatics: the IFITM1 model. Mod Pathol 2014; 27:569-79. [PMID: 24072182 DOI: 10.1038/modpathol.2013.123] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2013] [Revised: 05/13/2013] [Accepted: 05/14/2013] [Indexed: 12/24/2022]
Abstract
When classifying cellular uterine mesenchymal neoplasms, histological distinction of endometrial stromal from smooth muscle neoplasms can be difficult. The only widely established marker of endometrial stromal differentiation, CD10, has marginal specificity. We took a bioinformatics approach to identify more specific markers of endometrial stromal differentiation by searching the Human Protein Atlas, a public database of protein expression profiles. After screening the database using different methods, interferon-induced transmembrane protein 1 (IFITM1) was selected for further analysis. Immunohistochemistry for IFITM1 was performed using tissue sections from the selected cases of proliferative endometrium (22), secretory endometrium (6), inactive endometrium (19), adenomyosis (10), conventional leiomyoma (11), cellular leiomyoma (16), endometrial stromal nodule (2), low-grade endometrial stromal sarcoma (16), high-grade endometrial stromal sarcoma (2) and undifferentiated uterine sarcoma (2). Stained slides were scored in terms of intensity and distribution. Normal endometrial samples uniformly showed diffuse and strong IFITM1 staining. Endometrial stromal neoplasms, particularly low-grade endometrial stromal sarcoma, showed higher IFITM1 expression compared with smooth muscle neoplasms (P<0.0001). IFITM1 immunohistochemistry has high sensitivity and specificity, particularly in the distinction between low-grade endometrial stromal sarcoma and leiomyoma (81.2 and 86.7%, respectively). Our results indicate that IFITM1 is a sensitive and specific marker of endometrial stromal differentiation across the spectrum from proliferative endometrium to metastatic stromal sarcoma. IFITM1 is a potential valuable addition to immunohistochemical panels used in the diagnosis of cellular mesenchymal uterine tumors. Further studies with larger number of cases are necessary to corroborate this impression and determine the utility of IFITM1 in routine practice. This study is a clear example of how bioinformatics, particularly tools for mining genomic and proteomic databases, can enhance and accelerate biomarker development in diagnostic pathology.
Collapse
|
7
|
The application of high-throughput analyses to cancer diagnosis and prognosis. Mol Oncol 2013. [DOI: 10.1017/cbo9781139046947.007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
|
8
|
Laser microdissection and two-dimensional difference gel electrophoresis with alkaline isoelectric point immobiline gel reveals proteomic intra-tumor heterogeneity in colorectal cancer. EUPA OPEN PROTEOMICS 2013. [DOI: 10.1016/j.euprot.2013.08.002] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
|
9
|
Danielsson F, Wiking M, Mahdessian D, Skogs M, Ait Blal H, Hjelmare M, Stadler C, Uhlén M, Lundberg E. RNA Deep Sequencing as a Tool for Selection of Cell Lines for Systematic Subcellular Localization of All Human Proteins. J Proteome Res 2012; 12:299-307. [DOI: 10.1021/pr3009308] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Affiliation(s)
- Frida Danielsson
- Science for Life Laboratory,
KTH - Royal Institute of Technology, SE-171
21 Stockholm, Sweden
| | - Mikaela Wiking
- Science for Life Laboratory,
KTH - Royal Institute of Technology, SE-171
21 Stockholm, Sweden
| | - Diana Mahdessian
- Science for Life Laboratory,
KTH - Royal Institute of Technology, SE-171
21 Stockholm, Sweden
| | - Marie Skogs
- Science for Life Laboratory,
KTH - Royal Institute of Technology, SE-171
21 Stockholm, Sweden
| | - Hammou Ait Blal
- Science for Life Laboratory,
KTH - Royal Institute of Technology, SE-171
21 Stockholm, Sweden
| | - Martin Hjelmare
- Science for Life Laboratory,
KTH - Royal Institute of Technology, SE-171
21 Stockholm, Sweden
| | - Charlotte Stadler
- Science for Life Laboratory,
KTH - Royal Institute of Technology, SE-171
21 Stockholm, Sweden
| | - Mathias Uhlén
- Science for Life Laboratory,
KTH - Royal Institute of Technology, SE-171
21 Stockholm, Sweden
- Department
of Proteomics, KTH
- Royal Institute of Technology, SE-106
91 Stockholm, Sweden
| | - Emma Lundberg
- Science for Life Laboratory,
KTH - Royal Institute of Technology, SE-171
21 Stockholm, Sweden
| |
Collapse
|
10
|
Label-free quantitative proteomics trends for protein-protein interactions. J Proteomics 2012; 81:91-101. [PMID: 23153790 DOI: 10.1016/j.jprot.2012.10.027] [Citation(s) in RCA: 54] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2012] [Revised: 10/24/2012] [Accepted: 10/31/2012] [Indexed: 12/14/2022]
Abstract
Understanding protein interactions within the complexity of a living cell is challenging, but techniques coupling affinity purification and mass spectrometry have enabled important progress to be made in the past 15 years. As identification of protein-protein interactions is becoming easier, the quantification of the interaction dynamics is the next frontier. Several quantitative mass spectrometric approaches have been developed to address this issue that vary in their strengths and weaknesses. While isotopic labeling approaches continue to contribute to the identification of regulated interactions, techniques that do not require labeling are becoming increasingly used in the field. Here, we describe the major types of label-free quantification used in interaction proteomics, and discuss the relative merits of data dependent and data independent acquisition approaches in label-free quantification. This article is part of a Special Issue entitled: From protein structures to clinical applications.
Collapse
|
11
|
Primig M. The bioinformatics tool box for reproductive biology. Biochim Biophys Acta Mol Basis Dis 2012; 1822:1880-95. [PMID: 22687534 DOI: 10.1016/j.bbadis.2012.05.018] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2011] [Revised: 05/04/2012] [Accepted: 05/28/2012] [Indexed: 10/28/2022]
Abstract
Genetics and molecular biology have been instrumental for a better understanding of heritable defects causing human infertility over the past decades. More recently, the field of reproductive biology has harnessed genome biological approaches to gain insight into molecular processes underlying normal and pathological gametogenesis and gamete function. We are currently witnessing yet another quantum leap in our ability to monitor the flow of information from the genome via the transcriptome to the proteome: tiling arrays that cover both strands of a given target genome and RNA-Seq, a method based on ultra-high throughput DNA sequencing, enable us to study noncoding and protein-coding transcripts with unprecedented precision and depth at a reasonable cost. These technologies have spawned a thriving discipline within the bioinformatics field that employs information technology for managing and interpreting biological high-throughput data. This review outlines database projects and online analysis tools useful for life scientists in general and discusses in detail selected projects that have specifically been developed for researchers and clinicians in the field of reproductive biology. This article is part of a Special Issue entitled: Molecular Genetics of Human Reproductive Failure.
Collapse
Affiliation(s)
- Michael Primig
- Inserm UMR1085-Irset, Université de Rennes 1, Rennes, France.
| |
Collapse
|
12
|
Bozzacco L, Yu H, Zebroski HA, Dengjel J, Deng H, Mojsov S, Steinman RM. Mass spectrometry analysis and quantitation of peptides presented on the MHC II molecules of mouse spleen dendritic cells. J Proteome Res 2011; 10:5016-30. [PMID: 21913724 PMCID: PMC3270889 DOI: 10.1021/pr200503g] [Citation(s) in RCA: 50] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
Major histocompatibility complex class II (MHC II) molecules are expressed on the surface of antigen-presenting cells and display short bound peptide fragments derived from self- and nonself antigens. These peptide-MHC complexes function to maintain immunological tolerance in the case of self-antigens and initiate the CD4(+) T cell response in the case of foreign proteins. Here we report the application of LC-MS/MS analysis to identify MHC II peptides derived from endogenous proteins expressed in freshly isolated murine splenic DCs. The cell number was enriched in vivo upon treatment with Flt3L-B16 melanoma cells. In a typical experiment, starting with about 5 × 10(8) splenic DCs, we were able to reliably identify a repertoire of over 100 MHC II peptides originating from about 55 proteins localized in membrane (23%), intracellular (26%), endolysosomal (12%), nuclear (14%), and extracellular (25%) compartments. Using synthetic isotopically labeled peptides corresponding to the sequences of representative bound MHC II peptides, we quantified by LC-MS relative peptide abundance. In a single experiment, peptides were detected in a wide concentration range spanning from 2.5 fmol/μL to 12 pmol/μL or from approximately 13 to 2 × 10(5) copies per DC. These peptides were found in similar amounts on B cells where we detected about 80 peptides originating from 55 proteins distributed homogenously within the same cellular compartments as in DCs. About 90 different binding motifs predicted by the epitope prediction algorithm were found within the sequences of the identified MHC II peptides. These results set a foundation for future studies to quantitatively investigate the MHC II repertoire on DCs generated under different immunization conditions.
Collapse
|
13
|
Pontén F, Schwenk JM, Asplund A, Edqvist PHD. The Human Protein Atlas as a proteomic resource for biomarker discovery. J Intern Med 2011; 270:428-46. [PMID: 21752111 DOI: 10.1111/j.1365-2796.2011.02427.x] [Citation(s) in RCA: 193] [Impact Index Per Article: 14.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
The analysis of tissue-specific expression at both the gene and protein levels is vital for understanding human biology and disease. Antibody-based proteomics provides a strategy for the systematic generation of antibodies against all human proteins to combine with protein profiling in tissues and cells using tissue microarrays, immunohistochemistry and immunofluorescence. The Human Protein Atlas project was launched in 2003 with the aim of creating a map of protein expression patterns in normal cells, tissues and cancer. At present, 11,200 unique proteins corresponding to over 50% of all human protein-encoding genes have been analysed. All protein expression data, including underlying high-resolution images, are published on the free and publically available Human Protein Atlas portal (http://www.proteinatlas.org). This database provides an important source of information for numerous biomedical research projects, including biomarker discovery efforts. Moreover, the global analysis of how our genome is expressed at the protein level has provided basic knowledge on the ubiquitous expression of a large proportion of our proteins and revealed the paucity of cell- and tissue-type-specific proteins.
Collapse
Affiliation(s)
- F Pontén
- Department of Genetics and Pathology, Uppsala University, Uppsala, Sweden.
| | | | | | | |
Collapse
|