1
|
Csikász-Nagy A, Fichó E, Noto S, Reguly I. Computational tools to predict context-specific protein complexes. Curr Opin Struct Biol 2024; 88:102883. [PMID: 38986166 DOI: 10.1016/j.sbi.2024.102883] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2024] [Revised: 05/21/2024] [Accepted: 06/19/2024] [Indexed: 07/12/2024]
Abstract
Interactions between thousands of proteins define cells' protein-protein interaction (PPI) network. Some of these interactions lead to the formation of protein complexes. It is challenging to identify a protein complex in a haystack of protein-protein interactions, and it is even more difficult to predict all protein complexes of the complexome. Simulations and machine learning approaches try to crack these problems by looking at the PPI network or predicted protein structures. Clustering of PPI networks led to the first protein complex predictions, while most recently, atomistic models of protein complexes and deep-learning-based structure prediction methods have also emerged. The simulation of PPI level interactions even enables the quantitative prediction of protein complexes. These methods, the required data sources, and their potential future developments are discussed in this review.
Collapse
Affiliation(s)
- Attila Csikász-Nagy
- Cytocast Hungary Kft, Budapest, Hungary; Faculty of Information Technology and Bionics, Pázmány Péter Catholic University, Budapest, Hungary.
| | | | - Santiago Noto
- Cytocast Hungary Kft, Budapest, Hungary; Escola de Matemática Aplicada, Fundação Getúlio Vargas, Rio de Janeiro, Brazil
| | - István Reguly
- Cytocast Hungary Kft, Budapest, Hungary; Faculty of Information Technology and Bionics, Pázmány Péter Catholic University, Budapest, Hungary
| |
Collapse
|
2
|
Miski M, Weber Á, Fekete-Molnár K, Keömley-Horváth BM, Csikász-Nagy A, Gáspári Z. Simulated complexes formed from a set of postsynaptic proteins suggest a localised effect of a hypomorphic Shank mutation. BMC Neurosci 2024; 25:32. [PMID: 38971749 PMCID: PMC11227168 DOI: 10.1186/s12868-024-00880-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2024] [Accepted: 06/28/2024] [Indexed: 07/08/2024] Open
Abstract
BACKGROUND The postsynaptic density is an elaborate protein network beneath the postsynaptic membrane involved in the molecular processes underlying learning and memory. The postsynaptic density is built up from the same major proteins but its exact composition and organization differs between synapses. Mutations perturbing protein: protein interactions generally occurring in this network might lead to effects specific for cell types or processes, the understanding of which can be especially challenging. RESULTS In this work we use systems biology-based modeling of protein complex distributions in a simplified set of major postsynaptic proteins to investigate the effect of a hypomorphic Shank mutation perturbing a single well-defined interaction. We use data sets with widely variable abundances of the constituent proteins. Our results suggest that the effect of the mutation is heavily dependent on the overall availability of all the protein components of the whole network and no trivial correspondence between the expression level of the directly affected proteins and overall complex distribution can be observed. CONCLUSIONS Our results stress the importance of context-dependent interpretation of mutations. Even the weakening of a generally occurring protein: protein interaction might have well-defined effects, and these can not easily be predicted based only on the abundance of the proteins directly affected. Our results provide insight on how cell-specific effects can be exerted by a mutation perturbing a generally occurring interaction even when the wider interaction network is largely similar.
Collapse
Affiliation(s)
- Marcell Miski
- Faculty of Information Technology and Bionics, Pázmány Péter Catholic University, Budapest, Hungary
| | - Áron Weber
- Faculty of Information Technology and Bionics, Pázmány Péter Catholic University, Budapest, Hungary
- Cytocast Hungary Kft, Budapest, Hungary
| | - Krisztina Fekete-Molnár
- Faculty of Information Technology and Bionics, Pázmány Péter Catholic University, Budapest, Hungary
| | - Bence Márk Keömley-Horváth
- Faculty of Information Technology and Bionics, Pázmány Péter Catholic University, Budapest, Hungary
- Cytocast Hungary Kft, Budapest, Hungary
| | - Attila Csikász-Nagy
- Faculty of Information Technology and Bionics, Pázmány Péter Catholic University, Budapest, Hungary.
- Cytocast Hungary Kft, Budapest, Hungary.
| | - Zoltán Gáspári
- Faculty of Information Technology and Bionics, Pázmány Péter Catholic University, Budapest, Hungary.
| |
Collapse
|
3
|
Miski M, Keömley-Horváth BM, Rákóczi Megyeriné D, Csikász-Nagy A, Gáspári Z. Diversity of synaptic protein complexes as a function of the abundance of their constituent proteins: A modeling approach. PLoS Comput Biol 2022; 18:e1009758. [PMID: 35041658 PMCID: PMC8797218 DOI: 10.1371/journal.pcbi.1009758] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2021] [Revised: 01/28/2022] [Accepted: 12/15/2021] [Indexed: 11/18/2022] Open
Abstract
The postsynaptic density (PSD) is a dense protein network playing a key role in information processing during learning and memory, and is also indicated in a number of neurological disorders. Efforts to characterize its detailed molecular organization are encumbered by the large variability of the abundance of its constituent proteins both spatially, in different brain areas, and temporally, during development, circadian rhythm, and also in response to various stimuli. In this study we ran large-scale stochastic simulations of protein binding events to predict the presence and distribution of PSD complexes. We simulated the interactions of seven major PSD proteins (NMDAR, AMPAR, PSD-95, SynGAP, GKAP, Shank3, Homer1) based on previously published, experimentally determined protein abundance data from 22 different brain areas and 42 patients (altogether 524 different simulations). Our results demonstrate that the relative ratio of the emerging protein complexes can be sensitive to even subtle changes in protein abundances and thus explicit simulations are invaluable to understand the relationships between protein availability and complex formation. Our observations are compatible with a scenario where larger supercomplexes are formed from available smaller binary and ternary associations of PSD proteins. Specifically, Homer1 and Shank3 self-association reactions substantially promote the emergence of very large protein complexes. The described simulations represent a first approximation to assess PSD complex abundance, and as such, use significant simplifications. Therefore, their direct biological relevance might be limited but we believe that the major qualitative findings can contribute to the understanding of the molecular features of the postsynapse. Chemical and electrical synapses connect neurons in the brain. In chemical synapses the information is sent via molecules from one neuron (presynaptic one) to the other neuron (postsynaptic one). The messenger molecule called neurotransmitter is released from the presynaptic neuron’s active zone and binds to receptor molecules sitting on the postsynaptic neuron’s cell surface. This part of the postsynaptic neuron is the dendrite. Inside the dendrite there is an electron dense region full of proteins binding to each other forming large protein complexes. These complexes make sure that the receptor molecules are on the right place usually in front of the active zone. The protein dense region of the postsynaptic cell in the dendrites is called the postsynaptic density. We have performed extensive simulations on the formation of postsynaptic protein complexes using a well-defined set of proteins and a large number of publicly available input data sets on protein abundance. We used a simulator implementing the Gillespie algorithm to simulate binding and unbinding events proteins. We found that the relationship between single protein and protein complex abundances can be non-trivial, since similar complex distributions can emerge from distinct relative protein abundances and quite different protein complexes can be formed from almost similar initial protein abundances. Our results are compatible with the idea that the association-dissociation of smaller subcomplexes lead to the formation of large supercomplexes. The emergence of supercomplexes is largely facilitated by the self-association of Homer1 and Shank3 proteins. Our results are qualitatively in agreement with the formation of the experimentally observed ‘nanodomains’ in the postsynaptic density.
Collapse
Affiliation(s)
- Marcell Miski
- Faculty of Information Technology and Bionics, Pázmány Péter Catholic University, Budapest, Hungary
| | - Bence Márk Keömley-Horváth
- Faculty of Information Technology and Bionics, Pázmány Péter Catholic University, Budapest, Hungary
- Cytocast Ltd., Vecsés, Hungary
| | - Dorina Rákóczi Megyeriné
- Faculty of Information Technology and Bionics, Pázmány Péter Catholic University, Budapest, Hungary
| | - Attila Csikász-Nagy
- Faculty of Information Technology and Bionics, Pázmány Péter Catholic University, Budapest, Hungary
- Cytocast Ltd., Vecsés, Hungary
- Randall Centre for Cell and Molecular Biophysics, King’s College London, London, United Kingdom
- * E-mail: (AC-N); (ZG)
| | - Zoltán Gáspári
- Faculty of Information Technology and Bionics, Pázmány Péter Catholic University, Budapest, Hungary
- * E-mail: (AC-N); (ZG)
| |
Collapse
|
4
|
Will T, Helms V. Differential analysis of combinatorial protein complexes with CompleXChange. BMC Bioinformatics 2019; 20:300. [PMID: 31159772 PMCID: PMC6547514 DOI: 10.1186/s12859-019-2852-z] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2019] [Accepted: 04/26/2019] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Although a considerable number of proteins operate as multiprotein complexes and not on their own, organism-wide studies so far are only able to quantify individual proteins or protein-coding genes in a condition-specific manner for a sizeable number of samples, but not their assemblies. Consequently, there exist large amounts of transcriptomic data and an increasing amount of data on proteome abundance, but quantitative knowledge on complexomes is missing. This deficiency impedes the applicability of the powerful tool of differential analysis in the realm of macromolecular complexes. Here, we present a pipeline for differential analysis of protein complexes based on predicted or manually assigned complexes and inferred complex abundances, which can be easily applied on a whole-genome scale. RESULTS We observed for simulated data that results obtained by our complex abundance estimation algorithm were in better agreement with the ground truth and physicochemically more reasonable compared to previous efforts that used linear programming while running in a fraction of the time. The practical usability of the method was assessed in the context of transcription factor complexes in human monocyte and lymphoblastoid samples. We demonstrated that our new method is robust against false-positive detection and reports deregulated complexomes that can only be partially explained by differential analysis of individual protein-coding genes. Furthermore we showed that deregulated complexes identified by the tool potentially harbor significant yet unused information content. CONCLUSIONS CompleXChange allows to analyze deregulation of the protein complexome on a whole-genome scale by integrating a plethora of input data that is already available. A platform-independent Java binary, a user guide with example data and the source code are freely available at https://sourceforge.net/projects/complexchange/ .
Collapse
Affiliation(s)
- Thorsten Will
- Center for Bioinformatics, Saarland University, Campus E2.1, Saarbrücken, 66123, Germany.,Graduate School of Computer Science, Saarland University, Campus E1.3, Saarbrücken, 66123, Germany
| | - Volkhard Helms
- Center for Bioinformatics, Saarland University, Campus E2.1, Saarbrücken, 66123, Germany.
| |
Collapse
|
5
|
Context-dependent prediction of protein complexes by SiComPre. NPJ Syst Biol Appl 2018; 4:37. [PMID: 30245847 PMCID: PMC6141528 DOI: 10.1038/s41540-018-0073-0] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2018] [Revised: 08/21/2018] [Accepted: 08/29/2018] [Indexed: 11/09/2022] Open
Abstract
Most cellular processes are regulated by groups of proteins interacting together to form protein complexes. Protein compositions vary between different tissues or disease conditions enabling or preventing certain protein-protein interactions and resulting in variations in the complexome. Quantitative and qualitative characterization of context-specific protein complexes will help to better understand context-dependent variations in the physiological behavior of cells. Here, we present SiComPre 1.0, a computational tool that predicts context-specific protein complexes by integrating multi-omics sources. SiComPre outperforms other protein complex prediction tools in qualitative predictions and is unique in giving quantitative predictions on the complexome depending on the specific interactions and protein abundances defined by the user. We provide tutorials and examples on the complexome prediction of common model organisms, various human tissues and how the complexome is affected by drug treatment.
Collapse
|
6
|
Abstract
Cellular functions are often performed by multiprotein structures called protein complexes. These complexes are dynamic structures that evolve during the cell cycle or in response to external and internal stimuli, and are tightly regulated by protein expression in different tissues resulting in quantitative and qualitative variation of protein complexes. Advances in high-throughput techniques, such as mass-spectrometry and yeast two-hybrid provided a large amount of data on protein-protein interactions. This sparked the development of computational methods able to predict protein complex formation under a variety of biological and clinical conditions. However, the challenges that need to be addressed for successful computational protein complex prediction are highly complex.The post-genomic era saw an emerging number of algorithms and software, which are able to predict protein complexes from protein-protein interaction networks and a variety of other sources. Despite the high capacity of these methods to qualitatively predict protein complexes, they could provide only limited or no quantitative information of the predicted complexes. Recently, a new large-scale simulation of protein complexes was able to achieve this task by simulating protein complex formation on the proteome scale.In this chapter, we review representative methods that can predict multiple protein complexes at different scales and discuss how these can be combined with emerging sources of data in order to improve protein complex characterization.
Collapse
|
7
|
Lam MPY, Ping P, Murphy E. Proteomics Research in Cardiovascular Medicine and Biomarker Discovery. J Am Coll Cardiol 2016; 68:2819-2830. [PMID: 28007144 PMCID: PMC5189682 DOI: 10.1016/j.jacc.2016.10.031] [Citation(s) in RCA: 57] [Impact Index Per Article: 7.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/03/2016] [Revised: 10/20/2016] [Accepted: 10/21/2016] [Indexed: 11/21/2022]
Abstract
Proteomics is a systems physiology discipline to address the large-scale characterization of protein species within a biological system, be it a cell, a tissue, a body biofluid, an organism, or a cohort population. Building on advances from chemical analytical platforms (e.g., mass spectrometry and other technologies), proteomics approaches have contributed powerful applications in cardiovascular biomedicine, most notably in: 1) the discovery of circulating protein biomarkers of heart diseases from plasma samples; and 2) the identification of disease mechanisms and potential therapeutic targets in cardiovascular tissues, in both preclinical models and translational studies. Contemporary proteomics investigations offer powerful means to simultaneously examine tens of thousands of proteins in various samples, and understand their molecular phenotypes in health and disease. This concise review introduces study design considerations, example applications and use cases, as well as interpretation and analysis of proteomics data in cardiovascular biomedicine.
Collapse
Affiliation(s)
- Maggie P Y Lam
- NIH BD2K Center of Excellence and Department of Physiology, Medicine and Bioinformatics, University of California, Los Angeles, California.
| | - Peipei Ping
- NIH BD2K Center of Excellence and Department of Physiology, Medicine and Bioinformatics, University of California, Los Angeles, California
| | - Elizabeth Murphy
- Systems Biology Center, National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, Maryland.
| |
Collapse
|
8
|
Rudashevskaya EL, Sickmann A, Markoutsa S. Global profiling of protein complexes: current approaches and their perspective in biomedical research. Expert Rev Proteomics 2016; 13:951-964. [PMID: 27602509 DOI: 10.1080/14789450.2016.1233064] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Abstract
INTRODUCTION Despite the rapid evolution of proteomic methods, protein interactions and their participation in protein complexes - an important aspect of their function - has rarely been investigated on the proteome-wide level. Disease states, such as muscular dystrophy or viral infection, are induced by interference in protein-protein interactions within complexes. The purpose of this review is to describe the current methods for global complexome analysis and to critically discuss the challenges and opportunities for the application of these methods in biomedical research. Areas covered: We discuss advancements in experimental techniques and computational tools that facilitate profiling of the complexome. The main focus is on the separation of native protein complexes via size exclusion chromatography and gel electrophoresis, which has recently been combined with quantitative mass spectrometry, for a global protein-complex profiling. The development of this approach has been supported by advanced bioinformatics strategies and fast and sensitive mass spectrometers that have allowed the analysis of whole cell lysates. The application of this technique to biomedical research is assessed, and future directions are anticipated. Expert commentary: The methodology is quite new, and has already shown great potential when combined with complementary methods for detection of protein complexes.
Collapse
Affiliation(s)
- Elena L Rudashevskaya
- a Department of Bioanalytics , Leibniz-Institut für Analytische Wissenschaften - ISAS eV , Dortmund , Germany
| | - Albert Sickmann
- a Department of Bioanalytics , Leibniz-Institut für Analytische Wissenschaften - ISAS eV , Dortmund , Germany.,b Medizinisches Proteom-Center , Ruhr-Universität Bochum , Bochum , Germany.,c School of Natural & Computing Sciences, Department of Chemistry , University of Aberdeen , Aberdeen , UK
| | - Stavroula Markoutsa
- a Department of Bioanalytics , Leibniz-Institut für Analytische Wissenschaften - ISAS eV , Dortmund , Germany
| |
Collapse
|