1
|
Nandigrami P, Fiser A. Assessing the functional impact of protein binding site definition. Protein Sci 2024; 33:e5026. [PMID: 38757384 PMCID: PMC11099757 DOI: 10.1002/pro.5026] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2023] [Revised: 05/01/2024] [Accepted: 05/03/2024] [Indexed: 05/18/2024]
Abstract
Many biomedical applications, such as classification of binding specificities or bioengineering, depend on the accurate definition of protein binding interfaces. Depending on the choice of method used, substantially different sets of residues can be classified as belonging to the interface of a protein. A typical approach used to verify these definitions is to mutate residues and measure the impact of these changes on binding. Besides the lack of exhaustive data, this approach also suffers from the fundamental problem that a mutation introduces an unknown amount of alteration into an interface, which potentially alters the binding characteristics of the interface. In this study we explore the impact of alternative binding site definitions on the ability of a protein to recognize its cognate ligand using a pharmacophore approach, which does not affect the interface. The study also shows that methods for protein binding interface predictions should perform above approximately F-score = 0.7 accuracy level to capture the biological function of a protein.
Collapse
Affiliation(s)
- Prithviraj Nandigrami
- Departments of Systems and Computational Biology, and BiochemistryAlbert Einstein College of MedicineBronxNew YorkUSA
| | - Andras Fiser
- Departments of Systems and Computational Biology, and BiochemistryAlbert Einstein College of MedicineBronxNew YorkUSA
| |
Collapse
|
2
|
Grudman S, Fajardo JE, Fiser A. Optimal selection of suitable templates in protein interface prediction. Bioinformatics 2023; 39:btad510. [PMID: 37603727 PMCID: PMC10491951 DOI: 10.1093/bioinformatics/btad510] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2023] [Revised: 07/11/2023] [Accepted: 08/18/2023] [Indexed: 08/23/2023] Open
Abstract
MOTIVATION Molecular-level classification of protein-protein interfaces can greatly assist in functional characterization and rational drug design. The most accurate protein interface predictions rely on finding homologous proteins with known interfaces since most interfaces are conserved within the same protein family. The accuracy of these template-based prediction approaches depends on the correct choice of suitable templates. Choosing the right templates in the immunoglobulin superfamily (IgSF) is challenging because its members share low sequence identity and display a wide range of alternative binding sites despite structural homology. RESULTS We present a new approach to predict protein interfaces. First, template-specific, informative evolutionary profiles are established using a mutual information-based approach. Next, based on the similarity of residue level conservation scores derived from the evolutionary profiles, a query protein is hierarchically clustered with all available template proteins in its superfamily with known interface definitions. Once clustered, a subset of the most closely related templates is selected, and an interface prediction is made. These initial interface predictions are subsequently refined by extensive docking. This method was benchmarked on 51 IgSF proteins and can predict nontrivial interfaces of IgSF proteins with an average and median F-score of 0.64 and 0.78, respectively. We also provide a way to assess the confidence of the results. The average and median F-scores increase to 0.8 and 0.81, respectively, if 27% of low confidence cases and 17% of medium confidence cases are removed. Lastly, we provide residue level interface predictions, protein complexes, and confidence measurements for singletons in the IgSF. AVAILABILITY AND IMPLEMENTATION Source code is freely available at: https://gitlab.com/fiserlab.org/interdct_with_refinement.
Collapse
Affiliation(s)
- Steven Grudman
- Department of Systems and Computational Biology, Albert Einstein College of Medicine, Bronx, NY 10461, USA
| | - J Eduardo Fajardo
- Department of Systems and Computational Biology, Albert Einstein College of Medicine, Bronx, NY 10461, USA
| | - Andras Fiser
- Department of Systems and Computational Biology, Albert Einstein College of Medicine, Bronx, NY 10461, USA
| |
Collapse
|
3
|
Nandigrami P, Fiser A. Assessing the functional impact of protein binding site definition. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.01.26.525812. [PMID: 36747792 PMCID: PMC9900911 DOI: 10.1101/2023.01.26.525812] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 02/03/2023]
Abstract
Many biomedical applications, such as classification of binding specificities or bioengineering, depend on the accurate definition of protein binding interfaces. Depending on the choice of method used, substantially different sets of residues can be classified as belonging to the interface of a protein. A typical approach used to verify these definitions is to mutate residues and measure the impact of these changes on binding. Besides the lack of exhaustive data this approach generates, it also suffers from the fundamental problem that a mutation introduces an unknown amount of alteration into an interface, which potentially alters the binding characteristics of the interface. In this study we explore the impact of alternative binding site definitions on the ability of a protein to recognize its cognate ligand using a pharmacophore approach, which does not affect the interface. The study also provides guidance on the minimum expected accuracy of interface definition that is required to capture the biological function of a protein.
Collapse
Affiliation(s)
- Prithviraj Nandigrami
- Departments of Systems & Computational Biology, and Biochemistry, Albert Einstein College of Medicine 1300 Morris Park Ave, Bronx, NY 10461, USA
| | - Andras Fiser
- Departments of Systems & Computational Biology, and Biochemistry, Albert Einstein College of Medicine 1300 Morris Park Ave, Bronx, NY 10461, USA
| |
Collapse
|
4
|
Shrestha R, Garrett-Thomson S, Liu W, Almo SC, Fiser A. Allosteric regulation of binding specificity of HVEM for CD160 and BTLA ligands upon G89F mutation. Curr Res Struct Biol 2021; 3:337-345. [PMID: 34917954 PMCID: PMC8666650 DOI: 10.1016/j.crstbi.2021.11.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2021] [Revised: 10/08/2021] [Accepted: 11/01/2021] [Indexed: 11/25/2022] Open
Abstract
Molecular interactions mediated by engagement of the Herpes virus entry mediator (HVEM) with members of TNF and Ig superfamily generate distinct signals in T cell activation pathways that modulate inflammatory and inhibitory responses. HVEM interacts with CD160 and B and T lymphocyte attenuator (BTLA), both members of the immunoglobulin (Ig) superfamily, which share a common binding site that is unique from that of LIGHT, a TNF ligand. BTLA or CD160 engagement with HVEM deliver inhibitory or stimulatory signals to the host immune response in a context dependent fashion, whereas HVEM engagement with LIGHT results in pro-inflammatory responses. We identified a mutation in human HVEM, G89F, which directly interferes with the human LIGHT interaction, but interestingly, also differentially modulates the binding of human BTLA and CD160 via an apparent allosteric mechanism involving recognition surfaces remote from the site of the mutation. Specifically, the G89F mutation enhances binding of CD160, while decreasing that of BTLA to HVEM in cell-based assays. Molecular dynamics simulations for wild-type and G89F mutant HVEM, bound to different sets of ligands, were performed to define the molecular basis of this unexpected allosteric effect. These results were leveraged to design additional human HVEM mutants with altered binding specificities.
Collapse
Affiliation(s)
- Rojan Shrestha
- Department of Systems and Computational Biology, USA
- Department of Biochemistry, Albert Einstein College of Medicine, 1300 Morris Park Avenue, Bronx, NY, 10461, USA
| | - Sarah Garrett-Thomson
- Department of Biochemistry, Albert Einstein College of Medicine, 1300 Morris Park Avenue, Bronx, NY, 10461, USA
| | - Weifeng Liu
- Department of Biochemistry, Albert Einstein College of Medicine, 1300 Morris Park Avenue, Bronx, NY, 10461, USA
| | - Steven C. Almo
- Department of Biochemistry, Albert Einstein College of Medicine, 1300 Morris Park Avenue, Bronx, NY, 10461, USA
| | - Andras Fiser
- Department of Systems and Computational Biology, USA
- Department of Biochemistry, Albert Einstein College of Medicine, 1300 Morris Park Avenue, Bronx, NY, 10461, USA
| |
Collapse
|
5
|
Gil N, Shrestha R, Fiser A. Estimating the accuracy of pharmacophore-based detection of cognate receptor-ligand pairs in the immunoglobulin superfamily. Proteins 2021; 89:632-638. [PMID: 33483991 DOI: 10.1002/prot.26046] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2020] [Revised: 10/26/2020] [Accepted: 12/27/2020] [Indexed: 11/11/2022]
Abstract
Secreted and membrane-bound members of the immunoglobulin superfamily (IgSF) encompass a large, diverse array of proteins that play central roles in immune response and neural development, and are implicated in diseases ranging from cancer to rheumatoid arthritis. Despite the potential biomedical benefits of understanding IgSF:IgSF cognate receptor-ligand interactions, relatively little about them is known at a molecular level, and experimentally probing all possible receptor-ligand pairs is prohibitively costly. The Protein Ligand Interface Design (ProtLID) algorithm is a computational pharmacophore-based approach to identify cognate receptor-ligand pairs that was recently validated in a pilot study on a small set of IgSF complexes. Although ProtLID has shown a success rate of 61% at identifying at least one cognate ligand for a given receptor, it currently lacks any form of confidence measure that can prioritize individual receptor-ligand predictions to pursue experimentally. In this study, we expanded the application of ProtLID to cover all IgSF complexes with available structural data. In addition, we introduced an approach to estimate the confidence of predictions made by ProtLID based on a statistical analysis of how the ProtLID-constructed pharmacophore matches the structures of candidate ligands. The confidence score combines the physicochemical compatibility, spatial consistency, and mathematical skewness of the distribution of matches throughout a set of candidate ligands. Our results suggest that a subset of cases meeting stringent confidence criteria will always have at least one successful receptor-ligand prediction.
Collapse
Affiliation(s)
- Nelson Gil
- Department of Systems and Computational Biology, Department of Biochemistry, Albert Einstein College of Medicine, Bronx, New York, USA
| | - Rojan Shrestha
- Department of Systems and Computational Biology, Department of Biochemistry, Albert Einstein College of Medicine, Bronx, New York, USA
| | - Andras Fiser
- Department of Systems and Computational Biology, Department of Biochemistry, Albert Einstein College of Medicine, Bronx, New York, USA
| |
Collapse
|
6
|
Residue-based pharmacophore approaches to study protein-protein interactions. Curr Opin Struct Biol 2021; 67:205-211. [PMID: 33486430 DOI: 10.1016/j.sbi.2020.12.016] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2020] [Revised: 12/04/2020] [Accepted: 12/28/2020] [Indexed: 01/22/2023]
Abstract
This review focuses on pharmacophore approaches in researching protein interfaces that bind protein ligands. Pharmacophore descriptions of binding interfaces that employ molecular dynamics simulation can account for effects of solvation and conformational flexibility. In addition, these calculations provide an approximation to entropic considerations and as such, a better approximation of the free energy of binding. Residue-based pharmacophore approaches can facilitate a variety of drug discovery tasks such as the identification of receptor-ligand partners, identifying their binding poses, designing protein interfaces for selectivity, or defining a reduced mutational combinatorial exploration for subsequent experimental engineering techniques by orders of magnitudes.
Collapse
|
7
|
Shrestha R, Garrett-Thomson SC, Liu W, Almo SC, Fiser A. Redesigning HVEM Interface for Selective Binding to LIGHT, BTLA, and CD160. Structure 2020; 28:1197-1205.e2. [PMID: 32795404 DOI: 10.1016/j.str.2020.07.013] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2019] [Revised: 06/01/2020] [Accepted: 07/23/2020] [Indexed: 10/23/2022]
Abstract
Herpes virus entry mediator (HVEM) regulates positive and negative signals for T cell activation through co-signaling pathways. Dysfunction of the HVEM co-signaling network is associated with multiple pathologies related to autoimmunity, infectious disease, and cancer, making the associated molecules biologically and therapeutically attractive targets. HVEM interacts with three ligands from two different superfamilies using two different binding interfaces. The engagement with ligands CD160 and B- and T-lymphocyte attenuator (BTLA), members of immunoglobulin superfamily, is associated with inhibitory signals, whereas inflammatory responses are regulated through the interaction with LIGHT from the TNF superfamily. We computationally redesigned the HVEM recognition interfaces using a residue-specific pharmacophore approach, ProtLID, to achieve switchable-binding specificity. In subsequent cell-based binding assays the new interfaces, designed with only single or double mutations, exhibited selective binding to only one or two out of the three cognate ligands.
Collapse
Affiliation(s)
- Rojan Shrestha
- Department of Systems and Computational Biology, Albert Einstein College of Medicine, 1300 Morris Park Avenue, Bronx, NY 10461, USA; Department of Biochemistry, Albert Einstein College of Medicine, 1300 Morris Park Avenue, Bronx, NY 10461, USA
| | - Sarah C Garrett-Thomson
- Department of Biochemistry, Albert Einstein College of Medicine, 1300 Morris Park Avenue, Bronx, NY 10461, USA
| | - Weifeng Liu
- Department of Biochemistry, Albert Einstein College of Medicine, 1300 Morris Park Avenue, Bronx, NY 10461, USA
| | - Steven C Almo
- Department of Biochemistry, Albert Einstein College of Medicine, 1300 Morris Park Avenue, Bronx, NY 10461, USA.
| | - Andras Fiser
- Department of Systems and Computational Biology, Albert Einstein College of Medicine, 1300 Morris Park Avenue, Bronx, NY 10461, USA; Department of Biochemistry, Albert Einstein College of Medicine, 1300 Morris Park Avenue, Bronx, NY 10461, USA.
| |
Collapse
|
8
|
Gil N, Fajardo EJ, Fiser A. Discovery of receptor-ligand interfaces in the immunoglobulin superfamily. Proteins 2019; 88:135-142. [PMID: 31298437 DOI: 10.1002/prot.25778] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2019] [Revised: 06/21/2019] [Accepted: 07/06/2019] [Indexed: 12/13/2022]
Abstract
Cell-surface-anchored immunoglobulin superfamily (IgSF) proteins are widespread throughout the human proteome, forming crucial components of diverse biological processes including immunity, cell-cell adhesion, and carcinogenesis. IgSF proteins generally function through protein-protein interactions carried out between extracellular, membrane-bound proteins on adjacent cells, known as trans-binding interfaces. These protein-protein interactions constitute a class of pharmaceutical targets important in the treatment of autoimmune diseases, chronic infections, and cancer. A molecular-level understanding of IgSF protein-protein interactions would greatly benefit further drug development. A critical step toward this goal is the reliable identification of IgSF trans-binding interfaces. We propose a novel combination of structure and sequence information to identify trans-binding interfaces in IgSF proteins. We developed a structure-based binding interface prediction approach that can identify broad regions of the protein surface that encompass the binding interfaces and suggests that IgSF proteins possess binding supersites. These interfaces could theoretically be pinpointed using sequence-based conservation analysis, with performance approaching the theoretical upper limit of binding interface prediction accuracy, but achieving this in practice is limited by the current ability to identify an appropriate multiple sequence alignment for conservation analysis. However, an important contribution of combining the two orthogonal methods is that agreement between these approaches can estimate the reliability of the predictions. This approach was benchmarked on the set of 22 IgSF proteins with experimentally solved structures in complex with their ligands. Additionally, we provide structure-based predictions and reliability scores for the 62 IgSF proteins with known structure but yet uncharacterized binding interfaces.
Collapse
Affiliation(s)
- Nelson Gil
- Department of Systems and Computational Biology, Albert Einstein College of Medicine, Bronx, New York.,Department of Biochemistry, Albert Einstein College of Medicine, Bronx, New York
| | - Eduardo J Fajardo
- Department of Systems and Computational Biology, Albert Einstein College of Medicine, Bronx, New York.,Department of Biochemistry, Albert Einstein College of Medicine, Bronx, New York
| | - Andras Fiser
- Department of Systems and Computational Biology, Albert Einstein College of Medicine, Bronx, New York.,Department of Biochemistry, Albert Einstein College of Medicine, Bronx, New York
| |
Collapse
|
9
|
Computational Redesign of PD-1 Interface for PD-L1 Ligand Selectivity. Structure 2019; 27:829-836.e3. [PMID: 30930066 PMCID: PMC6745709 DOI: 10.1016/j.str.2019.03.006] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2018] [Revised: 11/16/2018] [Accepted: 03/07/2019] [Indexed: 12/31/2022]
Abstract
Chronic or persistent stimulation of the programmed cell death-1 (PD-1) pathway prevents T cells from mounting anti-tumor and anti-viral immune responses. Blockade of this inhibitory checkpoint pathway has shown therapeutic importance by rescuing T cells from their exhausted state. Cognate ligands of the PD-1 receptor include the tissue-specific PD-L1 and PD-L2 proteins. Engineering a human PD-1 interface specific for PD-L1 or PD-L2 can provide a specific reagent and therapeutic advantage for tissue-specific disruption of the PD-1 pathway. We utilized ProtLID, a computational framework, which constitutes a residue-based pharmacophore approach, to custom-design a human PD-1 interface specific to human PD-L1 without any significant affinity to PD-L2. In subsequent cell assay experiments, half of all single-point mutant designs proved to introduce a statistically significant selectivity, with nine of these maintaining a close to wild-type affinity to PD-L1. This proof-of-concept study suggests a general approach to re-engineer protein interfaces for specificity.
Collapse
|
10
|
Viswanathan R, Fajardo E, Steinberg G, Haller M, Fiser A. Protein-protein binding supersites. PLoS Comput Biol 2019; 15:e1006704. [PMID: 30615604 PMCID: PMC6336348 DOI: 10.1371/journal.pcbi.1006704] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2018] [Revised: 01/17/2019] [Accepted: 12/05/2018] [Indexed: 11/19/2022] Open
Abstract
The lack of a deep understanding of how proteins interact remains an important roadblock in advancing efforts to identify binding partners and uncover the corresponding regulatory mechanisms of the functions they mediate. Understanding protein-protein interactions is also essential for designing specific chemical modifications to develop new reagents and therapeutics. We explored the hypothesis of whether protein interaction sites serve as generic biding sites for non-cognate protein ligands, just as it has been observed for small-molecule-binding sites in the past. Using extensive computational docking experiments on a test set of 241 protein complexes, we found that indeed there is a strong preference for non-cognate ligands to bind to the cognate binding site of a receptor. This observation appears to be robust to variations in docking programs, types of non-cognate protein probes, sizes of binding patches, relative sizes of binding patches and full-length proteins, and the exploration of obligate and non-obligate complexes. The accuracy of the docking scoring function appears to play a role in defining the correct site. The frequency of interaction of unrelated probes recognizing the binding interface was utilized in a simple prediction algorithm that showed accuracy competitive with other state of the art methods.
Collapse
Affiliation(s)
- Raji Viswanathan
- Department of Chemistry, Yeshiva University, New York, NY, United States of America
| | - Eduardo Fajardo
- Departments of Systems & Computational Biology, and Biochemistry, Albert Einstein College of Medicine, Bronx, NY, United States of America
| | - Gabriel Steinberg
- Department of Chemistry, Yeshiva University, New York, NY, United States of America
| | - Matthew Haller
- Department of Chemistry, Yeshiva University, New York, NY, United States of America
| | - Andras Fiser
- Departments of Systems & Computational Biology, and Biochemistry, Albert Einstein College of Medicine, Bronx, NY, United States of America
- * E-mail:
| |
Collapse
|
11
|
Chen J, Wang B, Wu Y. Structural Characterization and Function Prediction of Immunoglobulin-like Fold in Cell Adhesion and Cell Signaling. J Chem Inf Model 2018; 58:532-542. [PMID: 29356528 DOI: 10.1021/acs.jcim.7b00580] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Domains that belong to an immunoglobulin (Ig) fold are extremely abundant in cell surface receptors, which play significant roles in cell-cell adhesion and signaling. Although the structures of domains in an Ig fold share common topology of β-barrels, functions of receptors in adhesion and signaling are regulated by the very heterogeneous binding between these domains. Additionally, only a small number of domains are directly involved in the binding between two multidomain receptors. It is challenging and time consuming to experimentally detect the binding partners of a given receptor and further determine which specific domains in this receptor are responsible for binding. Therefore, current knowledge in the binding mechanism of Ig-fold domains and their impacts on cell adhesion and signaling is very limited. A bioinformatics study can shed light on this topic from a systematic point of view. However, there is so far no computational analysis on the structural and functional characteristics of the entire Ig fold. We constructed nonredundant structural data sets for all domains in Ig fold, depending on their functions in cell adhesion and signaling. We found that data sets of domains in adhesion receptors show different binding preference from domains in signaling receptors. Using structural alignment, we further built a common structural template for each group of a domain data set. By mapping the protein-protein binding interface of each domain in a group onto the surface of its structural template, we found binding interfaces are highly overlapped within each specific group. These overlapped interfaces, we called consensus binding interfaces, are distinguishable among different data sets of domains. Finally, the residue compositions on the consensus interfaces were used as indicators for multiple machine learning algorithms to predict if they can form homotypic interactions with each other. The overall performance of the cross-validation tests shows that our prediction accuracies ranged between 0.6 and 0.8.
Collapse
Affiliation(s)
- Jiawen Chen
- Department of Systems and Computational Biology, Albert Einstein College of Medicine , 1300 Morris Park Avenue, Bronx, New York 10461, United States
| | - Bo Wang
- Department of Systems and Computational Biology, Albert Einstein College of Medicine , 1300 Morris Park Avenue, Bronx, New York 10461, United States
| | - Yinghao Wu
- Department of Systems and Computational Biology, Albert Einstein College of Medicine , 1300 Morris Park Avenue, Bronx, New York 10461, United States
| |
Collapse
|