1
|
Koca MB, Sevilgen FE. Integration of single-cell proteomic datasets through distinctive proteins in cell clusters. Proteomics 2024; 24:e2300282. [PMID: 38135888 DOI: 10.1002/pmic.202300282] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2023] [Revised: 11/01/2023] [Accepted: 12/04/2023] [Indexed: 12/24/2023]
Abstract
The use of mass spectrometry and antibody-based sequencing technologies at the single-cell level has led to an increase in single-cell proteomic datasets. Integrating these datasets is crucial to eliminate the batch effect that often arises due to their limited sequencing molecules. Although methods for horizontally integrating high-dimensional single-cell transcriptomic datasets can also be applied to single-cell proteomic datasets, a specialized approach explicitly tailored for low-dimensional proteomic datasets may enhance the integration process. Here, we introduce SCPRO-HI, an algorithm for the horizontal integration of antibody-based single-cell proteomic datasets. It utilizes a hierarchical cell anchoring technique to match cells based on the similarity of distinctive proteins for constituting cell clusters. A novel variational auto-encoder model is employed for correcting batch effects on the protein abundances, eliminating the need for mapping them into a new domain. Moreover, we propose a technique for extending the algorithm to high-dimensional datasets. The performance of the SCPRO-HI algorithm is evaluated using simulated and real-world single-cell proteomic datasets. The findings demonstrate our algorithm outperforms state-of-the-art methods, achieving a 75% higher silhouette score while preserving HVPs 13% better. Furthermore, the algorithm shows competitive performance in transcriptomic datasets, suggesting potential for integrating high-dimensional mass-spectrometry-based proteomic datasets.
Collapse
Affiliation(s)
- Mehmet Burak Koca
- Computer Engineering Department, Gebze Technical University, Kocaeli, Türkiye
| | - Fatih Erdoğan Sevilgen
- Institute for Data Science and Artificial Intelligence, Boğaziçi University, İstanbul, Türkiye
| |
Collapse
|
2
|
Arevalo J, Su E, van Dijk R, Carpenter AE, Singh S. Evaluating batch correction methods for image-based cell profiling. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.09.15.558001. [PMID: 37745478 PMCID: PMC10516049 DOI: 10.1101/2023.09.15.558001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/26/2023]
Abstract
High-throughput image-based profiling platforms are powerful technologies capable of collecting data from billions of cells exposed to thousands of perturbations in a time- and cost-effective manner. Therefore, image-based profiling data has been increasingly used for diverse biological applications, such as predicting drug mechanism of action or gene function. However, batch effects pose severe limitations to community-wide efforts to integrate and interpret image-based profiling data collected across different laboratories and equipment. To address this problem, we benchmarked seven high-performing scRNA-seq batch correction techniques, representing diverse approaches, using a newly released Cell Painting dataset, the largest publicly accessible image-based dataset. We focused on five different scenarios with varying complexity, and we found that Harmony, a mixture-model based method, consistently outperformed the other tested methods. Our proposed framework, benchmark, and metrics can additionally be used to assess new batch correction methods in the future. Overall, this work paves the way for improvements that allow the community to make best use of public Cell Painting data for scientific discovery.
Collapse
Affiliation(s)
- John Arevalo
- Imaging Platform, Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA
| | - Ellen Su
- Imaging Platform, Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA
| | - Robert van Dijk
- Imaging Platform, Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA
| | - Anne E Carpenter
- Imaging Platform, Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA
| | - Shantanu Singh
- Imaging Platform, Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA
| |
Collapse
|
3
|
Dowling P, Trollet C, Negroni E, Swandulla D, Ohlendieck K. How Can Proteomics Help to Elucidate the Pathophysiological Crosstalk in Muscular Dystrophy and Associated Multi-System Dysfunction? Proteomes 2024; 12:4. [PMID: 38250815 PMCID: PMC10801633 DOI: 10.3390/proteomes12010004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2023] [Revised: 01/09/2024] [Accepted: 01/12/2024] [Indexed: 01/23/2024] Open
Abstract
This perspective article is concerned with the question of how proteomics, which is a core technique of systems biology that is deeply embedded in the multi-omics field of modern bioresearch, can help us better understand the molecular pathogenesis of complex diseases. As an illustrative example of a monogenetic disorder that primarily affects the neuromuscular system but is characterized by a plethora of multi-system pathophysiological alterations, the muscle-wasting disease Duchenne muscular dystrophy was examined. Recent achievements in the field of dystrophinopathy research are described with special reference to the proteome-wide complexity of neuromuscular changes and body-wide alterations/adaptations. Based on a description of the current applications of top-down versus bottom-up proteomic approaches and their technical challenges, future systems biological approaches are outlined. The envisaged holistic and integromic bioanalysis would encompass the integration of diverse omics-type studies including inter- and intra-proteomics as the core disciplines for systematic protein evaluations, with sophisticated biomolecular analyses, including physiology, molecular biology, biochemistry and histochemistry. Integrated proteomic findings promise to be instrumental in improving our detailed knowledge of pathogenic mechanisms and multi-system dysfunction, widening the available biomarker signature of dystrophinopathy for improved diagnostic/prognostic procedures, and advancing the identification of novel therapeutic targets to treat Duchenne muscular dystrophy.
Collapse
Affiliation(s)
- Paul Dowling
- Department of Biology, Maynooth University, National University of Ireland, W23 F2H6 Maynooth, Co. Kildare, Ireland;
- Kathleen Lonsdale Institute for Human Health Research, Maynooth University, W23 F2H6 Maynooth, Co. Kildare, Ireland
| | - Capucine Trollet
- Center for Research in Myology U974, Sorbonne Université, INSERM, Myology Institute, 75013 Paris, France; (C.T.); (E.N.)
| | - Elisa Negroni
- Center for Research in Myology U974, Sorbonne Université, INSERM, Myology Institute, 75013 Paris, France; (C.T.); (E.N.)
| | - Dieter Swandulla
- Institute of Physiology, Faculty of Medicine, University of Bonn, D53115 Bonn, Germany;
| | - Kay Ohlendieck
- Department of Biology, Maynooth University, National University of Ireland, W23 F2H6 Maynooth, Co. Kildare, Ireland;
- Kathleen Lonsdale Institute for Human Health Research, Maynooth University, W23 F2H6 Maynooth, Co. Kildare, Ireland
| |
Collapse
|
4
|
Dowling P, Swandulla D, Ohlendieck K. Mass Spectrometry-Based Proteomic Technology and Its Application to Study Skeletal Muscle Cell Biology. Cells 2023; 12:2560. [PMID: 37947638 PMCID: PMC10649384 DOI: 10.3390/cells12212560] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2023] [Revised: 10/27/2023] [Accepted: 10/31/2023] [Indexed: 11/12/2023] Open
Abstract
Voluntary striated muscles are characterized by a highly complex and dynamic proteome that efficiently adapts to changed physiological demands or alters considerably during pathophysiological dysfunction. The skeletal muscle proteome has been extensively studied in relation to myogenesis, fiber type specification, muscle transitions, the effects of physical exercise, disuse atrophy, neuromuscular disorders, muscle co-morbidities and sarcopenia of old age. Since muscle tissue accounts for approximately 40% of body mass in humans, alterations in the skeletal muscle proteome have considerable influence on whole-body physiology. This review outlines the main bioanalytical avenues taken in the proteomic characterization of skeletal muscle tissues, including top-down proteomics focusing on the characterization of intact proteoforms and their post-translational modifications, bottom-up proteomics, which is a peptide-centric method concerned with the large-scale detection of proteins in complex mixtures, and subproteomics that examines the protein composition of distinct subcellular fractions. Mass spectrometric studies over the last two decades have decisively improved our general cell biological understanding of protein diversity and the heterogeneous composition of individual myofibers in skeletal muscles. This detailed proteomic knowledge can now be integrated with findings from other omics-type methodologies to establish a systems biological view of skeletal muscle function.
Collapse
Affiliation(s)
- Paul Dowling
- Department of Biology, Maynooth University, National University of Ireland, W23 F2H6 Maynooth, Co. Kildare, Ireland;
- Kathleen Lonsdale Institute for Human Health Research, Maynooth University, W23 F2H6 Maynooth, Co. Kildare, Ireland
| | - Dieter Swandulla
- Institute of Physiology, Faculty of Medicine, University of Bonn, D53115 Bonn, Germany;
| | - Kay Ohlendieck
- Department of Biology, Maynooth University, National University of Ireland, W23 F2H6 Maynooth, Co. Kildare, Ireland;
- Kathleen Lonsdale Institute for Human Health Research, Maynooth University, W23 F2H6 Maynooth, Co. Kildare, Ireland
| |
Collapse
|
5
|
van den Berg PR, Bérenger-Currias NMLP, Budnik B, Slavov N, Semrau S. Integration of a multi-omics stem cell differentiation dataset using a dynamical model. PLoS Genet 2023; 19:e1010744. [PMID: 37167320 DOI: 10.1371/journal.pgen.1010744] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2022] [Revised: 05/23/2023] [Accepted: 04/14/2023] [Indexed: 05/13/2023] Open
Abstract
Stem cell differentiation is a highly dynamic process involving pervasive changes in gene expression. The large majority of existing studies has characterized differentiation at the level of individual molecular profiles, such as the transcriptome or the proteome. To obtain a more comprehensive view, we measured protein, mRNA and microRNA abundance during retinoic acid-driven differentiation of mouse embryonic stem cells. We found that mRNA and protein abundance are typically only weakly correlated across time. To understand this finding, we developed a hierarchical dynamical model that allowed us to integrate all data sets. This model was able to explain mRNA-protein discordance for most genes and identified instances of potential microRNA-mediated regulation. Overexpression or depletion of microRNAs identified by the model, followed by RNA sequencing and protein quantification, were used to follow up on the predictions of the model. Overall, our study shows how multi-omics integration by a dynamical model could be used to nominate candidate regulators.
Collapse
Affiliation(s)
| | | | - Bogdan Budnik
- Mass Spectrometry and Proteomics Resource Laboratory, Harvard University, Cambridge, Massachusetts, United States of America
| | - Nikolai Slavov
- Department of Bioengineering, Northeastern University, Boston, Massachusetts, United States of America
| | - Stefan Semrau
- Leiden Institute of Physics, Leiden University, Leiden, Zuid-Holland, The Netherlands
| |
Collapse
|
6
|
Arnett LP, Rana R, Chung WWY, Li X, Abtahi M, Majonis D, Bassan J, Nitz M, Winnik MA. Reagents for Mass Cytometry. Chem Rev 2023; 123:1166-1205. [PMID: 36696538 DOI: 10.1021/acs.chemrev.2c00350] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/26/2023]
Abstract
Mass cytometry (cytometry by time-of-flight detection [CyTOF]) is a bioanalytical technique that enables the identification and quantification of diverse features of cellular systems with single-cell resolution. In suspension mass cytometry, cells are stained with stable heavy-atom isotope-tagged reagents, and then the cells are nebulized into an inductively coupled plasma time-of-flight mass spectrometry (ICP-TOF-MS) instrument. In imaging mass cytometry, a pulsed laser is used to ablate ca. 1 μm2 spots of a tissue section. The plume is then transferred to the CyTOF, generating an image of biomarker expression. Similar measurements are possible with multiplexed ion bean imaging (MIBI). The unit mass resolution of the ICP-TOF-MS detector allows for multiparametric analysis of (in principle) up to 130 different parameters. Currently available reagents, however, allow simultaneous measurement of up to 50 biomarkers. As new reagents are developed, the scope of information that can be obtained by mass cytometry continues to increase, particularly due to the development of new small molecule reagents which enable monitoring of active biochemistry at the cellular level. This review summarizes the history and current state of mass cytometry reagent development and elaborates on areas where there is a need for new reagents. Additionally, this review provides guidelines on how new reagents should be tested and how the data should be presented to make them most meaningful to the mass cytometry user community.
Collapse
Affiliation(s)
- Loryn P Arnett
- Department of Chemistry, University of Toronto, 80 St. George Street, Toronto, OntarioM5S 3H6, Canada
| | - Rahul Rana
- Department of Chemistry, University of Toronto, 80 St. George Street, Toronto, OntarioM5S 3H6, Canada
| | - Wilson Wai-Yip Chung
- Department of Chemistry, University of Toronto, 80 St. George Street, Toronto, OntarioM5S 3H6, Canada
| | - Xiaochong Li
- Department of Chemistry, University of Toronto, 80 St. George Street, Toronto, OntarioM5S 3H6, Canada
| | - Mahtab Abtahi
- Department of Chemistry, University of Toronto, 80 St. George Street, Toronto, OntarioM5S 3H6, Canada
| | - Daniel Majonis
- Standard BioTools Canada Inc. (formerly Fluidigm Canada Inc.), 1380 Rodick Road, Suite 400, Markham, OntarioL3R 4G5, Canada
| | - Jay Bassan
- Department of Chemistry, University of Toronto, 80 St. George Street, Toronto, OntarioM5S 3H6, Canada
| | - Mark Nitz
- Department of Chemistry, University of Toronto, 80 St. George Street, Toronto, OntarioM5S 3H6, Canada
| | - Mitchell A Winnik
- Department of Chemistry, University of Toronto, 80 St. George Street, Toronto, OntarioM5S 3H6, Canada.,Department of Chemical Engineering and Applied Chemistry, 200 College Street, Toronto, OntarioM5S 3E5, Canada
| |
Collapse
|
7
|
Sussman JH, Xu J, Amankulor N, Tan K. Dissecting the tumor microenvironment of epigenetically driven gliomas: Opportunities for single-cell and spatial multiomics. Neurooncol Adv 2023; 5:vdad101. [PMID: 37706202 PMCID: PMC10496944 DOI: 10.1093/noajnl/vdad101] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/15/2023] Open
Abstract
Malignant gliomas are incurable brain neoplasms with dismal prognoses and near-universal fatality, with minimal therapeutic progress despite billions of dollars invested in research and clinical trials over the last 2 decades. Many glioma studies have utilized disparate histologic and genomic platforms to characterize the stunning genomic, transcriptomic, and immunologic heterogeneity found in gliomas. Single-cell and spatial omics technologies enable unprecedented characterization of heterogeneity in solid malignancies and provide a granular annotation of transcriptional, epigenetic, and microenvironmental states with limited resected tissue. Heterogeneity in gliomas may be defined, at the broadest levels, by tumors ostensibly driven by epigenetic alterations (IDH- and histone-mutant) versus non-epigenetic tumors (IDH-wild type). Epigenetically driven tumors are defined by remarkable transcriptional programs, immunologically distinct microenvironments, and incompletely understood topography (unique cellular neighborhoods and cell-cell interactions). Thus, these tumors are the ideal substrate for single-cell multiomic technologies to disentangle the complex intra-tumoral features, including differentiation trajectories, tumor-immune cell interactions, and chromatin dysregulation. The current review summarizes the applications of single-cell multiomics to existing datasets of epigenetically driven glioma. More importantly, we discuss future capabilities and applications of novel multiomic strategies to answer outstanding questions, enable the development of potent therapeutic strategies, and improve personalized diagnostics and treatment via digital pathology.
Collapse
Affiliation(s)
- Jonathan H Sussman
- Graduate Group in Genomics and Computational Biology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, USA
- Medical Scientist Training Program, University of Pennsylvania, Philadelphia, Pennsylvania, USA
| | - Jason Xu
- Graduate Group in Genomics and Computational Biology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, USA
- Medical Scientist Training Program, University of Pennsylvania, Philadelphia, Pennsylvania, USA
| | - Nduka Amankulor
- Department of Neurosurgery, Perelman School of Medicine, Philadelphia, Pennsylvania, USA
| | - Kai Tan
- Department of Pediatrics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, USA
- Center for Childhood Cancer Research, Children’s Hospital of Philadelphia, Philadelphia, Pennsylvania, USA
| |
Collapse
|
8
|
Abstract
Single-cell proteomics is a promising field to provide direct yet comprehensive molecular insights into cellular functions without averaging effects. Here, we address a grand technical challenge impeding the maturation of single-cell proteomics─protein adsorption loss (PAL). Even though widely known, there is currently no quantitation on how profoundly and selectively PAL has affected single-cell proteomics. Therefore, the mitigations to this challenge have been generic, and their efficacy was only evaluated by the size of the resolved proteome with no specificity on individual proteins. We use the existing knowledge of PAL, protein expression, and the typical surface area used in single-cell proteomics to discuss the severity of protein loss. We also summarize the current solutions to this challenge and briefly review the available methods to characterize the physical and chemical properties of protein surface adsorption. By citing successful strategies in single-cell genomics for measurement errors in individual transcripts, we pinpoint the urgency to benchmark PAL at the proteome scale with individual protein resolution. Finally, orthogonal single-cell proteomic techniques that have the potential to cross validate PAL are proposed. We hope these efforts can promote the fruition of single-cell proteomics in the near future.
Collapse
Affiliation(s)
- Bingyun Sun
- Department of Chemistry, Simon Fraser University, Burnaby, British Columbia V5A 1S6, Canada.,Department of Molecular Biology and Biochemistry, Simon Fraser University, Burnaby, British Columbia V5A 1S6, Canada
| | - Sharwan Kumar
- Department of Molecular Biology and Biochemistry, Simon Fraser University, Burnaby, British Columbia V5A 1S6, Canada
| |
Collapse
|
9
|
Single-cell multiomics in neuroinflammation. Curr Opin Immunol 2022; 76:102180. [DOI: 10.1016/j.coi.2022.102180] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2022] [Accepted: 03/21/2022] [Indexed: 11/17/2022]
|