1
|
Xiao Y, Yuan Y, Liu Y, Lin Z, Zheng G, Zhou D, Lv D. Targeted Protein Degradation: Current and Emerging Approaches for E3 Ligase Deconvolution. J Med Chem 2024; 67:11580-11596. [PMID: 38981094 DOI: 10.1021/acs.jmedchem.4c00723] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/11/2024]
Abstract
Targeted protein degradation (TPD), including the use of proteolysis-targeting chimeras (PROTACs) and molecular glue degraders (MGDs) to degrade proteins, is an emerging strategy to develop novel therapies for cancer and beyond. PROTACs or MGDs function by inducing the proximity between an E3 ligase and a protein of interest (POI), leading to ubiquitination and consequent proteasomal degradation of the POI. Notably, one major issue in TPD is the lack of ligandable E3 ligases, as current studies predominantly use CUL4CRBN and CUL2VHL. The TPD community is seeking to expand the landscape of ligandable E3 ligases, but most discoveries rely on phenotypic screens or serendipity, necessitating systematic target deconvolution. Here, we examine and discuss both current and emerging E3 ligase deconvolution approaches for degraders discovered from phenotypic screens or monovalent glue chemistry campaigns, highlighting future prospects for identifying more ligandable E3 ligases.
Collapse
Affiliation(s)
- Yufeng Xiao
- Department of Medicinal Chemistry, College of Pharmacy, University of Florida, 1333 Center Drive, Gainesville, Florida 32610, United States
| | - Yaxia Yuan
- Department of Biochemistry and Structural Biology and Center for Innovative Drug Discovery, School of Medicine, University of Texas Health San Antonio, 7703 Floyd Curl Drive, San Antonio, Texas 78229, United States
- Mays Cancer Center, University of Texas Health Science Center at San Antonio, 7703 Floyd Curl Drive, San Antonio, Texas 78229, United States
| | - Yi Liu
- Department of Medicinal Chemistry, College of Pharmacy, University of Florida, 1333 Center Drive, Gainesville, Florida 32610, United States
| | - Zongtao Lin
- Department of Biochemistry and Molecular Biophysics, Washington University in St. Louis, St. Louis, Missouri 63110, United States
| | - Guangrong Zheng
- Department of Medicinal Chemistry, College of Pharmacy, University of Florida, 1333 Center Drive, Gainesville, Florida 32610, United States
| | - Daohong Zhou
- Department of Biochemistry and Structural Biology and Center for Innovative Drug Discovery, School of Medicine, University of Texas Health San Antonio, 7703 Floyd Curl Drive, San Antonio, Texas 78229, United States
- Mays Cancer Center, University of Texas Health Science Center at San Antonio, 7703 Floyd Curl Drive, San Antonio, Texas 78229, United States
| | - Dongwen Lv
- Department of Biochemistry and Structural Biology and Center for Innovative Drug Discovery, School of Medicine, University of Texas Health San Antonio, 7703 Floyd Curl Drive, San Antonio, Texas 78229, United States
- Mays Cancer Center, University of Texas Health Science Center at San Antonio, 7703 Floyd Curl Drive, San Antonio, Texas 78229, United States
| |
Collapse
|
2
|
Zhao S, Cui Z, Zhang G, Gong Y, Su L. MGPPI: multiscale graph neural networks for explainable protein-protein interaction prediction. Front Genet 2024; 15:1440448. [PMID: 39076171 PMCID: PMC11284081 DOI: 10.3389/fgene.2024.1440448] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2024] [Accepted: 06/24/2024] [Indexed: 07/31/2024] Open
Abstract
Protein-Protein Interactions (PPIs) involves in various biological processes, which are of significant importance in cancer diagnosis and drug development. Computational based PPI prediction methods are more preferred due to their low cost and high accuracy. However, existing protein structure based methods are insufficient in the extraction of protein structural information. Furthermore, most methods are less interpretable, which hinder their practical application in the biomedical field. In this paper, we propose MGPPI, which is a Multiscale graph convolutional neural network model for PPI prediction. By incorporating multiscale module into the Graph Neural Network (GNN) and constructing multi convolutional layers, MGPPI can effectively capture both local and global protein structure information. For model interpretability, we introduce a novel visual explanation method named Gradient Weighted interaction Activation Mapping (Grad-WAM), which can highlight key binding residue sites. We evaluate the performance of MGPPI by comparing with state-of-the-arts methods on various datasets. Results shows that MGPPI outperforms other methods significantly and exhibits strong generalization capabilities on the multi-species dataset. As a practical case study, we predicted the binding affinity between the spike (S) protein of SARS-COV-2 and the human ACE2 receptor protein, and successfully identified key binding sites with known binding functions. Key binding sites mutation in PPIs can affect cancer patient survival statues. Therefore, we further verified Grad-WAM highlighted residue sites in separating patients survival groups in several different cancer type datasets. According to our results, some of the highlighted residues can be used as biomarkers in predicting patients survival probability. All these results together demonstrate the high accuracy and practical application value of MGPPI. Our method not only addresses the limitations of existing approaches but also can assists researchers in identifying crucial drug targets and help guide personalized cancer treatment.
Collapse
Affiliation(s)
| | | | | | | | - Lingtao Su
- College of Computer Science and Engineering, Shandong University of Science and Technology, Qingdao, China
| |
Collapse
|
3
|
Su Z, Griffin B, Emmons S, Wu Y. Prediction of interactions between cell surface proteins by machine learning. Proteins 2024; 92:567-580. [PMID: 38050713 DOI: 10.1002/prot.26648] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2023] [Revised: 11/15/2023] [Accepted: 11/20/2023] [Indexed: 12/06/2023]
Abstract
Cells detect changes in their external environments or communicate with each other through proteins on their surfaces. These cell surface proteins form a complicated network of interactions in order to fulfill their functions. The interactions between cell surface proteins are highly dynamic and, thus, challenging to detect using traditional experimental techniques. Here, we tackle this challenge using a computational framework. The primary focus of the framework is to develop new tools to identify interactions between domains in the immunoglobulin (Ig) fold, which is the most abundant domain family in cell surface proteins. These interactions could be formed between ligands and receptors from different cells or between proteins on the same cell surface. In practice, we collected all structural data on Ig domain interactions and transformed them into an interface fragment pair library. A high-dimensional profile can then be constructed from the library for a given pair of query protein sequences. Multiple machine learning models were used to read this profile so that the probability of interaction between the query proteins could be predicted. We tested our models on an experimentally derived dataset that contains 564 cell surface proteins in humans. The cross-validation results show that we can achieve higher than 70% accuracy in identifying the PPIs within this dataset. We then applied this method to a group of 46 cell surface proteins in Caenorhabditis elegans. We screened every possible interaction between these proteins. Many interactions recognized by our machine learning classifiers have been experimentally confirmed in the literature. In conclusion, our computational platform serves as a useful tool to help identify potential new interactions between cell surface proteins in addition to current state-of-the-art experimental techniques. The tool is freely accessible for use by the scientific community. Moreover, the general framework of the machine learning classification can also be extended to study the interactions of proteins in other domain superfamilies.
Collapse
Affiliation(s)
- Zhaoqian Su
- Department of Systems and Computational Biology, Albert Einstein College of Medicine, Bronx, New York, USA
| | - Brian Griffin
- Department of Genetics, Albert Einstein College of Medicine, Bronx, New York, USA
| | - Scott Emmons
- Department of Genetics, Albert Einstein College of Medicine, Bronx, New York, USA
| | - Yinghao Wu
- Department of Systems and Computational Biology, Albert Einstein College of Medicine, Bronx, New York, USA
| |
Collapse
|
4
|
Yuan W, Qiu ZM, Li H, Huang M, Yuan JJ, Niu SL, Chen Q, Yang QW, Ouyang Q. Investigation of the Binding Interaction of Mfsd2a with NEDD4-2 via Molecular Dynamics Simulations. ACS Chem Neurosci 2024; 15:382-393. [PMID: 38155530 DOI: 10.1021/acschemneuro.3c00791] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2023] Open
Abstract
Major facilitator superfamily domain-containing 2a (Mfsd2a) is a sodium-dependent lysophosphatidylcholine cotransporter that plays an important role in maintaining the integrity of the blood-brain barrier and neurological function. Abnormal degradation of Mfsd2a often leads to dysfunction of the blood-brain barrier, while upregulation of Mfsd2a can retrieve neurological damage. It has been reported that Mfsd2a can be specifically recognized and ubiquitinated by neural precursor cell-expressed developmentally downregulated gene 4 type 2 (NEDD4-2) ubiquitin ligase and finally degraded through the proteasome pathway. However, the structural basis for the specific binding of Mfsd2a to NEDD4-2 is unclear. In this work, we combined deep learning and molecular dynamics simulations to obtain a Mfsd2a structure with high quality and a stable Mfsd2a/NEDD4-2-WW3 interaction model. Moreover, molecular mechanics generalized Born surface area (MM-GBSA) methods coupled with per-residue energy decomposition studies were carried out to analyze the key residues that dominate the binding interaction. Based on these results, we designed three peptides containing the key residues by truncating the Mfsd2a sequences. One of them was found to significantly inhibit Mfsd2a ubiquitination, which was further validated in an oxygen-glucose deprivation (OGD) model in a human microvascular endothelial cell line. This work provides some new insights into the understanding of Mfsd2a and NEDD4-2 interaction and might promote further development of drugs targeting Mfsd2a ubiquitination.
Collapse
Affiliation(s)
- Wen Yuan
- Department of Medicinal Chemistry, Third Military Medical University, Chongqing 400038, China
| | - Zhong-Ming Qiu
- Department of Neurology, Xinqiao Hospital, Third Military Medical University, Chongqing 400037, China
| | - Hongwei Li
- Department of Medicinal Chemistry, Third Military Medical University, Chongqing 400038, China
| | - Mouxin Huang
- Department of Medicinal Chemistry, Third Military Medical University, Chongqing 400038, China
| | - Jun-Jie Yuan
- Department of Neurology, Xinqiao Hospital, Third Military Medical University, Chongqing 400037, China
| | - Sheng-Li Niu
- Department of Medicinal Chemistry, Third Military Medical University, Chongqing 400038, China
| | - Qiong Chen
- Department of Neurology, Xinqiao Hospital, Third Military Medical University, Chongqing 400037, China
| | - Qing-Wu Yang
- Department of Neurology, Xinqiao Hospital, Third Military Medical University, Chongqing 400037, China
| | - Qin Ouyang
- Department of Medicinal Chemistry, Third Military Medical University, Chongqing 400038, China
| |
Collapse
|
5
|
Aijaz J. Why medical professionals must learn mathematics and computing? Pak J Med Sci 2024; 40:S106. [PMID: 38328646 PMCID: PMC10844920 DOI: 10.12669/pjms.40.2(icon).8952] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2023] [Accepted: 10/28/2023] [Indexed: 02/09/2024] Open
Abstract
doi: https://doi.org/10.12669/pjms.40.2(ICON).8952
How to cite this: Aijaz J. Why medical professionals must learn mathematics and computing?. Pak J Med Sci. 2024;40(2):-S106. doi: https://doi.org/10.12669/pjms.40.2(ICON).8952
Collapse
Affiliation(s)
- Javeria Aijaz
- Javeria Aijaz, FCPS, PhD. Section Head, Molecular Biology, Pathology Department Indus Hospital & Health Network, Karachi, Pakistan.
| |
Collapse
|
6
|
Santos TG, Silva KS, Lima RM, Silva LC, Pereira M. State of the art in protein-protein interactions within the fungi kingdom. Future Microbiol 2023; 18:1119-1131. [PMID: 37540069 DOI: 10.2217/fmb-2022-0274] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/05/2023] Open
Abstract
Proteins rarely exert their function by themselves. Protein-protein interactions (PPIs) regulate virtually every biological process that takes place in a cell. Such interactions are targets for new therapeutic agents against all sorts of diseases, through the screening and design of a variety of inhibitors. Here we discuss several aspects of PPIs that contribute to prediction of protein function and drug discovery. As the high-throughput techniques continue to release biological data, targets for fungal therapeutics that rely on PPIs are being proposed worldwide. Computational approaches have reduced the time taken to develop new therapeutic approaches. The near future brings the possibility of developing new PPI and interaction network inhibitors and a revolution in the way we treat fungal diseases.
Collapse
Affiliation(s)
- Thaynara G Santos
- Laboratório de Biologia Molecular, Instituto de Ciências Biológicas, Universidade Federal de Goiás, Goiânia, Goiás, 74 000, Brazil
| | - Kleber Sf Silva
- Laboratório de Biologia Molecular, Instituto de Ciências Biológicas, Universidade Federal de Goiás, Goiânia, Goiás, 74 000, Brazil
| | - Raisa M Lima
- Laboratório de Biologia Molecular, Instituto de Ciências Biológicas, Universidade Federal de Goiás, Goiânia, Goiás, 74 000, Brazil
| | - Lívia C Silva
- Laboratório de Biologia Molecular, Instituto de Ciências Biológicas, Universidade Federal de Goiás, Goiânia, Goiás, 74 000, Brazil
| | - Maristela Pereira
- Laboratório de Biologia Molecular, Instituto de Ciências Biológicas, Universidade Federal de Goiás, Goiânia, Goiás, 74 000, Brazil
| |
Collapse
|
7
|
Xie S, Xie X, Zhao X, Liu F, Wang Y, Ping J, Ji Z. HNSPPI: a hybrid computational model combing network and sequence information for predicting protein-protein interaction. Brief Bioinform 2023; 24:bbad261. [PMID: 37480553 DOI: 10.1093/bib/bbad261] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2023] [Revised: 06/24/2023] [Accepted: 06/26/2023] [Indexed: 07/24/2023] Open
Abstract
Most life activities in organisms are regulated through protein complexes, which are mainly controlled via Protein-Protein Interactions (PPIs). Discovering new interactions between proteins and revealing their biological functions are of great significance for understanding the molecular mechanisms of biological processes and identifying the potential targets in drug discovery. Current experimental methods only capture stable protein interactions, which lead to limited coverage. In addition, expensive cost and time consuming are also the obvious shortcomings. In recent years, various computational methods have been successfully developed for predicting PPIs based only on protein homology, primary sequences of protein or gene ontology information. Computational efficiency and data complexity are still the main bottlenecks for the algorithm generalization. In this study, we proposed a novel computational framework, HNSPPI, to predict PPIs. As a hybrid supervised learning model, HNSPPI comprehensively characterizes the intrinsic relationship between two proteins by integrating amino acid sequence information and connection properties of PPI network. The experimental results show that HNSPPI works very well on six benchmark datasets. Moreover, the comparison analysis proved that our model significantly outperforms other five existing algorithms. Finally, we used the HNSPPI model to explore the SARS-CoV-2-Human interaction system and found several potential regulations. In summary, HNSPPI is a promising model for predicting new protein interactions from known PPI data.
Collapse
Affiliation(s)
- Shijie Xie
- College of Artificial Intelligence, Nanjing Agricultural University, No. 1 Weigang Rd, Nanjing, Jiangsu 210095, China
| | - Xiaojun Xie
- College of Artificial Intelligence, Nanjing Agricultural University, No. 1 Weigang Rd, Nanjing, Jiangsu 210095, China
| | - Xin Zhao
- Department of Hepatobiliary Surgery, Beijing Chaoyang Hospital affiliated to Capital Medical University, Beijing 100020, China
| | - Fei Liu
- Joint International Research Laboratory of Animal Health and Food Safety of Ministry of Education & Single Molecule Nanometry Laboratory (Sinmolab), Nanjing Agricultural University, Nanjing, Jiangsu 210095, China
| | - Yiming Wang
- Key Laboratory of Biological Interactions and Crop Health, Department of Plant Pathology, Nanjing Agricultural University, 210095, Nanjing, China
| | - Jihui Ping
- MOE International Joint Collaborative Research Laboratory for Animal Health and Food Safety & Jiangsu Engineering Laboratory of Animal Immunology, College of Veterinary Medicine, Nanjing Agricultural University, Nanjing, Jiangsu 210095, China
| | - Zhiwei Ji
- College of Artificial Intelligence, Nanjing Agricultural University, No. 1 Weigang Rd, Nanjing, Jiangsu 210095, China
| |
Collapse
|
8
|
Su Z, Griffin B, Emmons S, Wu Y. Prediction of Interactions between Cell Surface Proteins by Machine Learning. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.09.12.557337. [PMID: 37745607 PMCID: PMC10515853 DOI: 10.1101/2023.09.12.557337] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/26/2023]
Abstract
Cells detect changes of external environments or communicate with each other through proteins on their surfaces. These cell surface proteins form a complicated network of interactions in order to fulfill their functions. The interactions between cell surface proteins are highly dynamic and thus challenging to detect using traditional experimental techniques. Here we tackle this challenge by a computational framework. The primary focus of the framework is to develop new tools to identify interactions between domains in immunoglobulin (Ig) fold, which is the most abundant domain family in cell surface proteins. These interactions could be formed between ligands and receptors from different cells, or between proteins on the same cell surface. In practice, we collected all structural data of Ig domain interactions and transformed them into an interface fragment pair library. A high dimensional profile can be then constructed from the library for a given pair of query protein sequences. Multiple machine learning models were used to read this profile, so that the probability of interaction between the query proteins can be predicted. We tested our models to an experimentally derived dataset which contains 564 cell surface proteins in human. The cross-validation results show that we can achieve higher than 70% accuracy in identifying the PPIs within this dataset. We then applied this method to a group of 46 cell surface proteins in C elegans. We screened every possible interaction between these proteins. Many interactions recognized by our machine learning classifiers have been experimentally confirmed in the literatures. In conclusion, our computational platform serves a useful tool to help identifying potential new interactions between cell surface proteins in addition to current state-of-the-art experimental techniques. The tool is freely accessible for use by the scientific community. Moreover, the general framework of the machine learning classification can also be extended to study interactions of proteins in other domain superfamilies.
Collapse
Affiliation(s)
- Zhaoqian Su
- Department of Systems and Computational Biology, Albert Einstein College of Medicine, 1300 Morris Park Avenue, Bronx, NY, 10461
| | - Brian Griffin
- Department of Genetics, Albert Einstein College of Medicine, 1300 Morris Park Avenue, Bronx, NY, 10461
| | - Scott Emmons
- Department of Genetics, Albert Einstein College of Medicine, 1300 Morris Park Avenue, Bronx, NY, 10461
| | - Yinghao Wu
- Department of Systems and Computational Biology, Albert Einstein College of Medicine, 1300 Morris Park Avenue, Bronx, NY, 10461
| |
Collapse
|
9
|
Jha K, Saha S, Karmakar S. Prediction of Protein-Protein Interactions Using Vision Transformer and Language Model. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2023; 20:3215-3225. [PMID: 37027644 DOI: 10.1109/tcbb.2023.3248797] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/19/2023]
Abstract
The knowledge of protein-protein interaction (PPI) helps us to understand proteins' functions, the causes and growth of several diseases, and can aid in designing new drugs. The majority of existing PPI research has relied mainly on sequence-based approaches. With the availability of multi-omics datasets (sequence, 3D structure) and advancements in deep learning techniques, it is feasible to develop a deep multi-modal framework that fuses the features learned from different sources of information to predict PPI. In this work, we propose a multi-modal approach utilizing protein sequence and 3D structure. To extract features from the 3D structure of proteins, we use a pre-trained vision transformer model that has been fine-tuned on the structural representation of proteins. The protein sequence is encoded into a feature vector using a pre-trained language model. The feature vectors extracted from the two modalities are fused and then fed to the neural network classifier to predict the protein interactions. To showcase the effectiveness of the proposed methodology, we conduct experiments on two popular PPI datasets, namely, the human dataset and the S. cerevisiae dataset. Our approach outperforms the existing methodologies to predict PPI, including multi-modal approaches. We also evaluate the contributions of each modality by designing uni-modal baselines. We perform experiments with three modalities as well, having gene ontology as the third modality.
Collapse
|
10
|
A KK, Shayez Karim SM, Kumar M, Ravindranath Singh R. Prediction of transient and permanent protein interactions using AI methods. Bioinformation 2023; 19:749-753. [PMID: 37885791 PMCID: PMC10598364 DOI: 10.6026/97320630019749] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2023] [Revised: 06/30/2023] [Accepted: 06/30/2023] [Indexed: 10/28/2023] Open
Abstract
Protein-protein interactions (PPIs) can be classified as permanent or transient interactions based on their stability or lifetime. Understanding the precise details of such protein interactions will pave the way for the discovery of inhibitors and for understanding the nature and function of PPIs. In the present work, 43 relevant physicochemical, geometrical and structural features were calculated for a curated dataset from the literature, comprising of 402 protein-protein complexes of permanent and transient categories, and 5 different Supervised Machine Learning models were developed with Scikit-learn to predict transient and permanent PPI. Additionally, deep learning method with Artificial Neural Network was also performed using Tensor Flow and Keras. Predicted models achieved accuracy ranging from 76.54% to 82.71% and k-NN has achieved the highest accuracy. Detailed analysis of these methods revealed that Interface areas such as Percent interface accessible area, Interface accessible area and Total interface area and the parameters defining the shape of the PPI interface such as Planarity, Eccentricity and Circularity are the most discriminating factors between these two categories. The present method could serve as an effective tool to understand the mechanism of protein association and to predict the transient and permanent interactions, which could supplement the costly and time-consuming experimental techniques.
Collapse
Affiliation(s)
- Kiran Kumar A
- Department of Bioinformatics, Central University of South Bihar, Gaya, Bihar-824236, India
| | | | - Mayank Kumar
- Department of Bioinformatics, Central University of South Bihar, Gaya, Bihar-824236, India
| | | |
Collapse
|
11
|
Ye J, Li A, Zheng H, Yang B, Lu Y. Machine Learning Advances in Predicting Peptide/Protein-Protein Interactions Based on Sequence Information for Lead Peptides Discovery. Adv Biol (Weinh) 2023; 7:e2200232. [PMID: 36775876 DOI: 10.1002/adbi.202200232] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2022] [Revised: 12/30/2022] [Indexed: 02/14/2023]
Abstract
Peptides have shown increasing advantages and significant clinical value in drug discovery and development. With the development of high-throughput technologies and artificial intelligence (AI), machine learning (ML) methods for discovering new lead peptides have been expanded and incorporated into rational drug design. Predictions of peptide-protein interactions (PepPIs) and protein-protein interactions (PPIs) are both opportunities and challenges in computational biology, which will help to better understand the mechanisms of disease and provide the impetus for the discovery of lead peptides. This paper comprehensively reviews computational models for PepPI and PPI predictions. It begins with an introduction of various databases of peptide ligands and target proteins. Then it discusses data formats and feature representations for proteins and peptides. Furthermore, classical ML methods and emerging deep learning (DL) methods that can be used to train prediction models of PepPI and PPI are classified into four categories, and their advantages and disadvantages are analyzed. To assess the relative performance of different models, different validation protocols and evaluation indexes are discussed. The goal of this review is to help researchers quickly get started to develop computational frameworks using these integrated resources and eventually promote the discovery of lead peptides.
Collapse
Affiliation(s)
- Jiahao Ye
- School of Medicine, Shanghai University, Shanghai, 200444, China
| | - An Li
- Department of Critical Care Medicine, Shanghai Tenth People's Hospital, School of Medicine, Tongji University, Shanghai, 200072, China
- Department of Biochemical Pharmacy, School of Pharmacy, Second Military Medical University, Shanghai, 200433, China
| | - Hao Zheng
- School of Medicine, Shanghai University, Shanghai, 200444, China
| | - Banghua Yang
- School of Medicine, Shanghai University, Shanghai, 200444, China
| | - Yiming Lu
- School of Medicine, Shanghai University, Shanghai, 200444, China
- Department of Critical Care Medicine, Shanghai Tenth People's Hospital, School of Medicine, Tongji University, Shanghai, 200072, China
- Department of Biochemical Pharmacy, School of Pharmacy, Second Military Medical University, Shanghai, 200433, China
| |
Collapse
|
12
|
STARK RYAN. Protein-mediated interactions in the dynamic regulation of acute inflammation. BIOCELL 2023; 47:1191-1198. [PMID: 37261220 PMCID: PMC10231872 DOI: 10.32604/biocell.2023.027838] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2022] [Accepted: 02/09/2023] [Indexed: 06/02/2023]
Abstract
Protein-mediated interactions are the fundamental mechanism through which cells regulate health and disease. These interactions require physical contact between proteins and their respective targets of interest. These targets include not only other proteins but also nucleic acids and other important molecules as well. These proteins are often involved in multibody complexes that work dynamically to regulate cellular health and function. Various techniques have been adapted to study these important interactions, such as affinity-based assays, mass spectrometry, and fluorescent detection. The application of these techniques has led to a greater understanding of how protein interactions are responsible for both the instigation and resolution of acute inflammatory diseases. These pursuits aim to provide opportunities to target specific protein interactions to alleviate acute inflammation.
Collapse
Affiliation(s)
- RYAN STARK
- Department of Pediatric Critical Care Medicine, Vanderbilt University Medical Center, 2200 Children’s Way, 5121 Doctors’ Office Tower, Nashville, TN 37232-9075
| |
Collapse
|
13
|
Jha K, Karmakar S, Saha S. Graph-BERT and language model-based framework for protein-protein interaction identification. Sci Rep 2023; 13:5663. [PMID: 37024543 PMCID: PMC10079975 DOI: 10.1038/s41598-023-31612-w] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2022] [Accepted: 03/14/2023] [Indexed: 04/08/2023] Open
Abstract
Identification of protein-protein interactions (PPI) is among the critical problems in the domain of bioinformatics. Previous studies have utilized different AI-based models for PPI classification with advances in artificial intelligence (AI) techniques. The input to these models is the features extracted from different sources of protein information, mainly sequence-derived features. In this work, we present an AI-based PPI identification model utilizing a PPI network and protein sequences. The PPI network is represented as a graph where each node is a protein pair, and an edge is defined between two nodes if there exists a common protein between these nodes. Each node in a graph has a feature vector. In this work, we have used the language model to extract feature vectors directly from protein sequences. The feature vectors for protein in pairs are concatenated and used as a node feature vector of a PPI network graph. Finally, we have used the Graph-BERT model to encode the PPI network graph with sequence-based features and learn the hidden representation of the feature vector for each node. The next step involves feeding the learned representations of nodes to the fully connected layer, the output of which is fed into the softmax layer to classify the protein interactions. To assess the efficacy of the proposed PPI model, we have performed experiments on several PPI datasets. The experimental results demonstrate that the proposed approach surpasses the existing PPI works and designed baselines in classifying PPI.
Collapse
Affiliation(s)
- Kanchan Jha
- Department of Computer Science and Engineering, Indian Institute of Technology Patna, Patna, Bihar, 801103, India.
| | - Sourav Karmakar
- Department of Computer Science and Engineering, National Institute of Technology Durgapur, Durgapur, West Bengal, 713209, India
| | - Sriparna Saha
- Department of Computer Science and Engineering, Indian Institute of Technology Patna, Patna, Bihar, 801103, India
| |
Collapse
|
14
|
Gouesbet G. Deciphering Macromolecular Interactions Involved in Abiotic Stress Signaling: A Review of Bioinformatics Analysis. Methods Mol Biol 2023; 2642:257-294. [PMID: 36944884 DOI: 10.1007/978-1-0716-3044-0_15] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/23/2023]
Abstract
Plant functioning and responses to abiotic stresses largely involve regulations at the transcriptomic level via complex interactions of signal molecules, signaling cascades, and regulators. Nevertheless, all the signaling networks involved in responses to abiotic stresses have not yet been fully established. The in-depth analysis of transcriptomes in stressed plants has become a relevant state-of-the-art methodology to study these regulations and signaling pathways that allow plants to cope with or attempt to survive abiotic stresses. The plant science and molecular biology community has developed databases about genes, proteins, protein-protein interactions, protein-DNA interactions and ontologies, which are valuable sources of knowledge for deciphering such regulatory and signaling networks. The use of these data and the development of bioinformatics tools help to make sense of transcriptomic data in specific contexts, such as that of abiotic stress signaling, using functional biological approaches. The aim of this chapter is to present and assess some of the essential online tools and resources that will allow novices in bioinformatics to decipher transcriptomic data in order to characterize the cellular processes and functions involved in abiotic stress responses and signaling. The analysis of case studies further describes how these tools can be used to conceive signaling networks on the basis of transcriptomic data. In these case studies, particular attention was paid to the characterization of abiotic stress responses and signaling related to chemical and xenobiotic stressors.
Collapse
Affiliation(s)
- Gwenola Gouesbet
- University of Rennes, CNRS, ECOBIO [(Ecosystèmes, Biodiversité, Evolution)] - UMR 6553, Rennes, France.
| |
Collapse
|
15
|
Couée I, Gouesbet G. Protein-Protein Interactions in Abiotic Stress Signaling: An Overview of Biochemical and Biophysical Methods of Characterization. Methods Mol Biol 2023; 2642:319-330. [PMID: 36944886 DOI: 10.1007/978-1-0716-3044-0_17] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/23/2023]
Abstract
The identification and characterization of bona fide abiotic stress signaling proteins can occur at different levels of the complete in vivo signaling cascade or network. Knowledge of a particular abiotic stress signaling protein could theoretically lead to the characterization of complete networks through the analysis of unknown proteins that interact with the previously known protein. Such signaling proteins of interest can indeed be experimentally used as bait proteins to catch interacting prey proteins, provided that the association of bait proteins and prey proteins should yield a biochemical or biophysical signal that can be detected. To this end, several biochemical and biophysical techniques are available to provide experimental evidence for specific protein-protein interactions, such as co-immunoprecipitation, bimolecular fluorescence complementation, tandem affinity purification coupled to mass spectrometry, yeast two hybrid, protein microarrays, Förster resonance energy transfer, or fluorescence correlation spectroscopy. This array of methods can be implemented to establish the biochemical reality of putative protein-protein interactions between two proteins of interest or to identify previously unknown partners related to an initially known protein of interest. The ultimate validity of these methods however depends on the in vitro/in vivo nature of the approach and on the heterologous/homologous context of the analysis. This chapter will review the application and success of some classical methods of protein-protein interaction analysis in the field of plant abiotic stress signaling.
Collapse
Affiliation(s)
- Ivan Couée
- UMR 6553 ECOBIO (Ecosystems-Biodiversity-Evolution), CNRS, Université de Rennes, Brittany, France.
| | - Gwenola Gouesbet
- UMR 6553 ECOBIO (Ecosystems-Biodiversity-Evolution), CNRS, Université de Rennes, Brittany, France
| |
Collapse
|
16
|
Jha K, Saha S. Analyzing Effect of Multi-Modality in Predicting Protein-Protein Interactions. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2023; 20:162-173. [PMID: 35259112 DOI: 10.1109/tcbb.2022.3157531] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]
Abstract
Nowadays, multiple sources of information about proteins are available such as protein sequences, 3D structures, Gene Ontology (GO), etc. Most of the works on protein-protein interaction (PPI) identification had utilized these information about proteins, mainly sequence-based, but individually. The new advances in deep learning techniques allow us to leverage multiple sources/modalities of proteins, which complement each other. Some recent works have shown that multi-modal PPI models perform better than uni-modal approaches. This paper aims to investigate whether the performance of multi-modal PPI models is always consistent or depends on other factors such as dataset distribution, algorithms used to learn features, etc. We have used three modalities for this study: Protein sequence, 3D structure, and GO. Various techniques, including deep learning algorithms, are employed to extract features from multiple sources of proteins. These feature vectors from different modalities are then integrated in several combinations (bi-modal and tri-modal) to predict PPI. To conduct this study, we have used Human and S. cerevisiae PPI datasets. The obtained results demonstrate the potentiality of a multi-modal approach and deep learning techniques in predicting protein interactions. However, the predictive capability of a model for PPI depends on feature extraction methods as well. Also, increasing the modality does not always ensure performance improvement. In this study, the PPI model integrating two modalities outperforms the designed uni-modal and tri-modal PPI models.
Collapse
|
17
|
Lim H, Tsai CJ, Keskin O, Nussinov R, Gursoy A. HMI-PRED 2.0: a biologist-oriented web application for prediction of host-microbe protein-protein interaction by interface mimicry. Bioinformatics 2022; 38:4962-4965. [PMID: 36124958 PMCID: PMC9620825 DOI: 10.1093/bioinformatics/btac633] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2022] [Revised: 08/05/2022] [Accepted: 09/15/2022] [Indexed: 11/19/2022] Open
Abstract
SUMMARY HMI-PRED 2.0 is a publicly available web service for the prediction of host-microbe protein-protein interaction by interface mimicry that is intended to be used without extensive computational experience. A microbial protein structure is screened against a database covering the entire available structural space of complexes of known human proteins. AVAILABILITY AND IMPLEMENTATION HMI-PRED 2.0 provides user-friendly graphic interfaces for predicting, visualizing and analyzing host-microbe interactions. HMI-PRED 2.0 is available at https://hmipred.org/.
Collapse
Affiliation(s)
- Hansaim Lim
- Computational Structural Biology Section, Frederick National Laboratory for Cancer Research in the Cancer Innovation Laboratory, NCI-Frederick, Frederick, MD 21702, USA
| | - Chung-Jung Tsai
- Computational Structural Biology Section, Frederick National Laboratory for Cancer Research in the Cancer Innovation Laboratory, NCI-Frederick, Frederick, MD 21702, USA
| | - Ozlem Keskin
- Department of Chemical and Biological Engineering, Koç University, Istanbul 34450, Turkey
| | - Ruth Nussinov
- Computational Structural Biology Section, Frederick National Laboratory for Cancer Research in the Cancer Innovation Laboratory, NCI-Frederick, Frederick, MD 21702, USA
- Department of Human Molecular Genetics and Biochemistry, Sackler School of Medicine, Tel Aviv University, Tel Aviv 69978, Israel
| | - Attila Gursoy
- Department of Computer Engineering, Koç University, Istanbul 34450, Turkey
| |
Collapse
|
18
|
Soleymani F, Paquet E, Viktor H, Michalowski W, Spinello D. Protein-protein interaction prediction with deep learning: A comprehensive review. Comput Struct Biotechnol J 2022; 20:5316-5341. [PMID: 36212542 PMCID: PMC9520216 DOI: 10.1016/j.csbj.2022.08.070] [Citation(s) in RCA: 34] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2022] [Revised: 08/29/2022] [Accepted: 08/30/2022] [Indexed: 11/15/2022] Open
Abstract
Most proteins perform their biological function by interacting with themselves or other molecules. Thus, one may obtain biological insights into protein functions, disease prevalence, and therapy development by identifying protein-protein interactions (PPI). However, finding the interacting and non-interacting protein pairs through experimental approaches is labour-intensive and time-consuming, owing to the variety of proteins. Hence, protein-protein interaction and protein-ligand binding problems have drawn attention in the fields of bioinformatics and computer-aided drug discovery. Deep learning methods paved the way for scientists to predict the 3-D structure of proteins from genomes, predict the functions and attributes of a protein, and modify and design new proteins to provide desired functions. This review focuses on recent deep learning methods applied to problems including predicting protein functions, protein-protein interaction and their sites, protein-ligand binding, and protein design.
Collapse
Affiliation(s)
- Farzan Soleymani
- Department of Mechanical Engineering, University of Ottawa, Ottawa, ON, Canada
| | - Eric Paquet
- National Research Council, 1200 Montreal Road, Ottawa, ON K1A 0R6, Canada
| | - Herna Viktor
- School of Electrical Engineering and Computer Science, University of Ottawa, ON, Canada
| | | | - Davide Spinello
- Department of Mechanical Engineering, University of Ottawa, Ottawa, ON, Canada
| |
Collapse
|
19
|
Robin V, Bodein A, Scott-Boyer MP, Leclercq M, Périn O, Droit A. Overview of methods for characterization and visualization of a protein-protein interaction network in a multi-omics integration context. Front Mol Biosci 2022; 9:962799. [PMID: 36158572 PMCID: PMC9494275 DOI: 10.3389/fmolb.2022.962799] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2022] [Accepted: 08/16/2022] [Indexed: 11/26/2022] Open
Abstract
At the heart of the cellular machinery through the regulation of cellular functions, protein-protein interactions (PPIs) have a significant role. PPIs can be analyzed with network approaches. Construction of a PPI network requires prediction of the interactions. All PPIs form a network. Different biases such as lack of data, recurrence of information, and false interactions make the network unstable. Integrated strategies allow solving these different challenges. These approaches have shown encouraging results for the understanding of molecular mechanisms, drug action mechanisms, and identification of target genes. In order to give more importance to an interaction, it is evaluated by different confidence scores. These scores allow the filtration of the network and thus facilitate the representation of the network, essential steps to the identification and understanding of molecular mechanisms. In this review, we will discuss the main computational methods for predicting PPI, including ones confirming an interaction as well as the integration of PPIs into a network, and we will discuss visualization of these complex data.
Collapse
Affiliation(s)
- Vivian Robin
- Molecular Medicine Department, CHU de Québec Research Center, Université Laval, Québec, QC, Canada
| | - Antoine Bodein
- Molecular Medicine Department, CHU de Québec Research Center, Université Laval, Québec, QC, Canada
| | - Marie-Pier Scott-Boyer
- Molecular Medicine Department, CHU de Québec Research Center, Université Laval, Québec, QC, Canada
| | - Mickaël Leclercq
- Molecular Medicine Department, CHU de Québec Research Center, Université Laval, Québec, QC, Canada
| | - Olivier Périn
- Digital Sciences Department, L'Oréal Advanced Research, Aulnay-sous-bois, France
| | - Arnaud Droit
- Molecular Medicine Department, CHU de Québec Research Center, Université Laval, Québec, QC, Canada
| |
Collapse
|
20
|
Sen N, Anishchenko I, Bordin N, Sillitoe I, Velankar S, Baker D, Orengo C. Characterizing and explaining the impact of disease-associated mutations in proteins without known structures or structural homologs. Brief Bioinform 2022; 23:bbac187. [PMID: 35641150 PMCID: PMC9294430 DOI: 10.1093/bib/bbac187] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2021] [Revised: 04/23/2022] [Accepted: 04/27/2022] [Indexed: 12/12/2022] Open
Abstract
Mutations in human proteins lead to diseases. The structure of these proteins can help understand the mechanism of such diseases and develop therapeutics against them. With improved deep learning techniques, such as RoseTTAFold and AlphaFold, we can predict the structure of proteins even in the absence of structural homologs. We modeled and extracted the domains from 553 disease-associated human proteins without known protein structures or close homologs in the Protein Databank. We noticed that the model quality was higher and the Root mean square deviation (RMSD) lower between AlphaFold and RoseTTAFold models for domains that could be assigned to CATH families as compared to those which could only be assigned to Pfam families of unknown structure or could not be assigned to either. We predicted ligand-binding sites, protein-protein interfaces and conserved residues in these predicted structures. We then explored whether the disease-associated missense mutations were in the proximity of these predicted functional sites, whether they destabilized the protein structure based on ddG calculations or whether they were predicted to be pathogenic. We could explain 80% of these disease-associated mutations based on proximity to functional sites, structural destabilization or pathogenicity. When compared to polymorphisms, a larger percentage of disease-associated missense mutations were buried, closer to predicted functional sites, predicted as destabilizing and pathogenic. Usage of models from the two state-of-the-art techniques provide better confidence in our predictions, and we explain 93 additional mutations based on RoseTTAFold models which could not be explained based solely on AlphaFold models.
Collapse
Affiliation(s)
- Neeladri Sen
- Institute of Structural and Molecular Biology, University College London, London, WC1E 6BT, UK
| | - Ivan Anishchenko
- Department of Biochemistry, University of Washington, Seattle, WA 98195, USA
- Institute for Protein Design, University of Washington, Seattle, WA 98195, USA
| | - Nicola Bordin
- Institute of Structural and Molecular Biology, University College London, London, WC1E 6BT, UK
| | - Ian Sillitoe
- Institute of Structural and Molecular Biology, University College London, London, WC1E 6BT, UK
| | - Sameer Velankar
- Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - David Baker
- Department of Biochemistry, University of Washington, Seattle, WA 98195, USA
- Institute for Protein Design, University of Washington, Seattle, WA 98195, USA
- Howard Hughes Medical Institute, University of Washington, Seattle, WA 98195, USA
| | - Christine Orengo
- Institute of Structural and Molecular Biology, University College London, London, WC1E 6BT, UK
| |
Collapse
|
21
|
La Serra MA, Vidossich P, Acquistapace I, Ganesan AK, De Vivo M. Alchemical Free Energy Calculations to Investigate Protein-Protein Interactions: the Case of the CDC42/PAK1 Complex. J Chem Inf Model 2022; 62:3023-3033. [PMID: 35679463 PMCID: PMC9241073 DOI: 10.1021/acs.jcim.2c00348] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2023]
Abstract
![]()
Here, we show that
alchemical free energy calculations can quantitatively
compute the effect of mutations at the protein–protein interface.
As a test case, we have used the protein complex formed by the small
Rho-GTPase CDC42 and its downstream effector PAK1, a serine/threonine
kinase. Notably, the CDC42/PAK1 complex offers a wealth of structural,
mutagenesis, and binding affinity data because of its central role
in cellular signaling and cancer progression. In this context, we
have considered 16 mutations in the CDC42/PAK1 complex and obtained
excellent agreement between computed and experimental data on binding
affinity. Importantly, we also show that a careful analysis of the
side-chain conformations in the mutated amino acids can considerably
improve the computed estimates, solving issues related to sampling
limitations. Overall, this study demonstrates that alchemical free
energy calculations can conveniently be integrated into the design
of experimental mutagenesis studies.
Collapse
Affiliation(s)
- Maria Antonietta La Serra
- Laboratory of Molecular Modeling and Drug Discovery, Istituto Italiano di Tecnologia, via Morego 30, Genoa 16163, Italy
| | - Pietro Vidossich
- Laboratory of Molecular Modeling and Drug Discovery, Istituto Italiano di Tecnologia, via Morego 30, Genoa 16163, Italy
| | - Isabella Acquistapace
- Laboratory of Molecular Modeling and Drug Discovery, Istituto Italiano di Tecnologia, via Morego 30, Genoa 16163, Italy
| | - Anand K Ganesan
- Department of Dermatology, University of California, Irvine, Irvine, California 92697, United States.,Department of Biological Chemistry, University of California, Irvine, Irvine, California 92697, United States
| | - Marco De Vivo
- Laboratory of Molecular Modeling and Drug Discovery, Istituto Italiano di Tecnologia, via Morego 30, Genoa 16163, Italy
| |
Collapse
|
22
|
Jha K, Saha S, Singh H. Prediction of protein-protein interaction using graph neural networks. Sci Rep 2022; 12:8360. [PMID: 35589837 PMCID: PMC9120162 DOI: 10.1038/s41598-022-12201-9] [Citation(s) in RCA: 52] [Impact Index Per Article: 17.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2022] [Accepted: 04/18/2022] [Indexed: 01/09/2023] Open
Abstract
Proteins are the essential biological macromolecules required to perform nearly all biological processes, and cellular functions. Proteins rarely carry out their tasks in isolation but interact with other proteins (known as protein-protein interaction) present in their surroundings to complete biological activities. The knowledge of protein-protein interactions (PPIs) unravels the cellular behavior and its functionality. The computational methods automate the prediction of PPI and are less expensive than experimental methods in terms of resources and time. So far, most of the works on PPI have mainly focused on sequence information. Here, we use graph convolutional network (GCN) and graph attention network (GAT) to predict the interaction between proteins by utilizing protein's structural information and sequence features. We build the graphs of proteins from their PDB files, which contain 3D coordinates of atoms. The protein graph represents the amino acid network, also known as residue contact network, where each node is a residue. Two nodes are connected if they have a pair of atoms (one from each node) within the threshold distance. To extract the node/residue features, we use the protein language model. The input to the language model is the protein sequence, and the output is the feature vector for each amino acid of the underlying sequence. We validate the predictive capability of the proposed graph-based approach on two PPI datasets: Human and S. cerevisiae. Obtained results demonstrate the effectiveness of the proposed approach as it outperforms the previous leading methods. The source code for training and data to train the model are available at https://github.com/JhaKanchan15/PPI_GNN.git .
Collapse
Affiliation(s)
- Kanchan Jha
- Department of Computer Science and Engineering, Indian Institute of Technology Patna, Patna, Bihar, 801103, India.
| | - Sriparna Saha
- Department of Computer Science and Engineering, Indian Institute of Technology Patna, Patna, Bihar, 801103, India
| | - Hiteshi Singh
- Department of Electrical Engineering, Indian Institute of Technology Jodhpur, Jodhpur, Rajasthan, 342030, India
| |
Collapse
|
23
|
Wang S, Wu R, Lu J, Jiang Y, Huang T, Cai YD. Protein-protein interaction networks as miners of biological discovery. Proteomics 2022; 22:e2100190. [PMID: 35567424 DOI: 10.1002/pmic.202100190] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2021] [Revised: 03/28/2022] [Accepted: 04/29/2022] [Indexed: 11/12/2022]
Abstract
Protein-protein interactions (PPIs) form the basis of a myriad of biological pathways and mechanism, such as the formation of protein-complexes or the components of signaling cascades. Here, we reviewed experimental methods for identifying PPI pairs, including yeast two-hybrid, mass spectrometry, co-localization, and co-immunoprecipitation. Furthermore, a range of computational methods leveraging biochemical properties, evolution history, protein structures and more have enabled identification of additional PPIs. Given the wealth of known PPIs, we reviewed important network methods to construct and analyze networks of PPIs. These methods aid biological discovery through identifying hub genes and dynamic changes in the network, and have been thoroughly applied in various fields of biological research. Lastly, we discussed the challenges and future direction of research utilizing the power of PPI networks. This article is protected by copyright. All rights reserved.
Collapse
Affiliation(s)
- Steven Wang
- Department of Biological Sciences, Columbia University, New York, NY, USA
| | - Runxin Wu
- Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD, USA
| | - Jiaqi Lu
- Department of Chemistry and Biochemistry, University of Notre Dame, Notre Dame, IN, USA
| | - Yijia Jiang
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - Tao Huang
- Bio-Med Big Data Center, Shanghai Institute of Nutrition and Health, Chinese Academy of Sciences, Shanghai, China
| | - Yu-Dong Cai
- School of Life Sciences, Shanghai University, Shanghai, China
| |
Collapse
|
24
|
Hu X, Feng C, Ling T, Chen M. Deep learning frameworks for protein–protein interaction prediction. Comput Struct Biotechnol J 2022; 20:3223-3233. [PMID: 35832624 PMCID: PMC9249595 DOI: 10.1016/j.csbj.2022.06.025] [Citation(s) in RCA: 22] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2022] [Revised: 05/27/2022] [Accepted: 06/12/2022] [Indexed: 11/26/2022] Open
|
25
|
Wu K, Nie B, Li L, Yang X, Yang J, He Z, Li Y, Cheng S, Shi M, Zeng Y. Bioinformatics analysis of high frequency mutations in myelodysplastic syndrome-related patients. ANNALS OF TRANSLATIONAL MEDICINE 2021; 9:1491. [PMID: 34805353 PMCID: PMC8573449 DOI: 10.21037/atm-21-4094] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/09/2021] [Accepted: 09/24/2021] [Indexed: 11/06/2022]
Abstract
Background Myelodysplastic syndrome (MDS) is a group of hematological malignancies that may progress to acute myeloid leukemia (AML). Bioinformatics-based analysis of high-frequency mutation genes in MDS-related patients is still relatively rare, so we conducted our research to explore whether high-frequency mutation genes in MDS-related patients can play a reference role in clinical guidance and prognosis. Methods Next generation sequencing (NGS) technology was used to detect 32 mutations in 64 MDS-related patients. We classified the patients' genes and analyzed them by Gene Ontology (GO) analysis, Kyoto Encyclopedia of Genes and Genomes (KEGG) analysis, protein-protein interaction (PPI) analysis, and then calculated the gene survival curve of high-frequency mutations. Results We discovered 32 mutant genes such as ASXL1, DNMT3A, KRAS, NRAS, TP53, SF3B1, and SRSF2. The overall survival (OS) of these genes decreased significantly after DNMT3A, ASXL1, RUNX1, and U2AF1 occurred mutation. These genes play a significant role in biological processes, not only in MDS but also in the occurrence and development of other diseases. Through retrospective analysis, genes associated with MDS-related diseases were identified, and their effects on the disease were predicted. Conclusions Thirty-two mutant genes were determined in MDS and when mutations occur in DNMT3A, ASXL1, RUNX1, and U2AF1, their survival time decreases significantly. This results providing a theoretical basis for clinical and scientific research and broadening the scope of research on MDS.
Collapse
Affiliation(s)
- Kun Wu
- Department of Clinical Laboratory, First Affiliated Hospital of Kunming Medical University, Kunming, China.,Yunnan Key Laboratory of Laboratory Medicine, Kunming, China.,Yunnan Innovation Team of Clinical Laboratory and Diagnosis, Kunming, China
| | - Bo Nie
- Department of Hematology, First Affiliated Hospital of Kunming Medical University, Hematology Research Center of Yunnan Province, Kunming, China
| | - Liyin Li
- Department of Hematology, First Affiliated Hospital of Kunming Medical University, Hematology Research Center of Yunnan Province, Kunming, China
| | - Xin Yang
- Department of Hematology, First Affiliated Hospital of Kunming Medical University, Hematology Research Center of Yunnan Province, Kunming, China
| | - Jinrong Yang
- Department of Hematology, First Affiliated Hospital of Kunming Medical University, Hematology Research Center of Yunnan Province, Kunming, China
| | - Zhenxin He
- Department of Hematology, First Affiliated Hospital of Kunming Medical University, Hematology Research Center of Yunnan Province, Kunming, China
| | - Yanhong Li
- Department of Clinical Laboratory, First Affiliated Hospital of Kunming Medical University, Kunming, China.,Yunnan Key Laboratory of Laboratory Medicine, Kunming, China.,Yunnan Innovation Team of Clinical Laboratory and Diagnosis, Kunming, China
| | - Shenju Cheng
- Department of Clinical Laboratory, First Affiliated Hospital of Kunming Medical University, Kunming, China.,Yunnan Key Laboratory of Laboratory Medicine, Kunming, China.,Yunnan Innovation Team of Clinical Laboratory and Diagnosis, Kunming, China
| | - Mingxia Shi
- Department of Hematology, First Affiliated Hospital of Kunming Medical University, Hematology Research Center of Yunnan Province, Kunming, China
| | - Yun Zeng
- Department of Hematology, First Affiliated Hospital of Kunming Medical University, Hematology Research Center of Yunnan Province, Kunming, China
| |
Collapse
|
26
|
Srdic-Rajic T, Metlas R. Antibody VH domain sequence analysis by a bioinformatics approach based on electronic amino acid properties may help to predict paratop location. Immunol Lett 2021; 241:55-57. [PMID: 34785254 DOI: 10.1016/j.imlet.2021.11.003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2021] [Revised: 10/15/2021] [Accepted: 11/11/2021] [Indexed: 11/28/2022]
Abstract
Gene as the basic functional unit of DNA encodes information about the product such as protein. The majority of proteins realize function through protein-protein interactions involving short protein motifs. However, some proteins such as antibodies are established by the rearrangement of several (V-D-J) gene segments with the potential addition of nontemplated nucleotides that may change information encoded by the respective gene segment used. Antibody VH domain sequence analysis by ISM bioinformatics approach that is based on amino acids physicochemical features, enable to distinguish the contribution of the information encoded by VH gene or generated during VDJ gene recombination for antibody-antigen interaction. The data presented in this report revealed the significance of CDRH3 for the interaction of antibody specific for immunogenic molecules while CDRH3 contribution is minor for antibody interaction with nonimmunogenic molecules such as haptens and native mammalian dsDNA. Thus, paratopes might be located in the CDRH3 or VH regions.
Collapse
Affiliation(s)
- Tatjana Srdic-Rajic
- Department of Experimental Oncology, Institute for oncology and radiology of Serbia, Belgrade,Serbia
| | - Radmila Metlas
- Center for Multidisciplinary Research, Institute of Nuclear Sciences VINCA, Belgrade, Serbia.
| |
Collapse
|
27
|
Selvaraj C, Chandra I, Singh SK. Artificial intelligence and machine learning approaches for drug design: challenges and opportunities for the pharmaceutical industries. Mol Divers 2021; 26:1893-1913. [PMID: 34686947 PMCID: PMC8536481 DOI: 10.1007/s11030-021-10326-z] [Citation(s) in RCA: 37] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2021] [Accepted: 09/24/2021] [Indexed: 12/27/2022]
Abstract
The global spread of COVID-19 has raised the importance of pharmaceutical drug development as intractable and hot research. Developing new drug molecules to overcome any disease is a costly and lengthy process, but the process continues uninterrupted. The critical point to consider the drug design is to use the available data resources and to find new and novel leads. Once the drug target is identified, several interdisciplinary areas work together with artificial intelligence (AI) and machine learning (ML) methods to get enriched drugs. These AI and ML methods are applied in every step of the computer-aided drug design, and integrating these AI and ML methods results in a high success rate of hit compounds. In addition, this AI and ML integration with high-dimension data and its powerful capacity have taken a step forward. Clinical trials output prediction through the AI/ML integrated models could further decrease the clinical trials cost by also improving the success rate. Through this review, we discuss the backend of AI and ML methods in supporting the computer-aided drug design, along with its challenge and opportunity for the pharmaceutical industry. From the available information or data, the AI and ML based prediction for the high throughput virtual screening. After this integration of AI and ML, the success rate of hit identification has gained a momentum with huge success by providing novel drugs.
Collapse
Affiliation(s)
- Chandrabose Selvaraj
- CADD and Molecular Modelling Lab, Department of Bioinformatics, Alagappa University, Science Block, Karaikudi, Tamil Nadu, 630004, India.
| | - Ishwar Chandra
- CADD and Molecular Modelling Lab, Department of Bioinformatics, Alagappa University, Science Block, Karaikudi, Tamil Nadu, 630004, India
| | - Sanjeev Kumar Singh
- CADD and Molecular Modelling Lab, Department of Bioinformatics, Alagappa University, Science Block, Karaikudi, Tamil Nadu, 630004, India.
| |
Collapse
|
28
|
Tuncel G, Akcan N, Gul S, Sag SO, Bundak R, Mocan G, Temel SG, Ergoren MC. Identification of a Novel De Novo COMP Gene Variant as a Likely Cause of Pseudoachondroplasia. Appl Immunohistochem Mol Morphol 2021; 29:546-550. [PMID: 33595934 DOI: 10.1097/pai.0000000000000914] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2020] [Accepted: 12/28/2020] [Indexed: 11/25/2022]
Abstract
Next-generation sequencing technology and advanced sequence analysis techniques are markedly speeding up the identification of gene variants causing rare genetic diseases. Pseudoachondroplasia (PSACH, MIM 177170) is a rare disease inherited in an autosomal dominant manner. It is known that variations in the cartilage oligomeric matrix protein (COMP) gene are associated with the disease. Here, we report a 39-month-old boy with short stature. He gave visible growth and development delayed phenotype after 12 months. Further genetic resequencing analysis was carried out to identified the disease-causing variant. Furthermore, computational approaches were used to characterize the effect of the variant. In this study, we identify and report a novel variation in the COMP gene, c.1420_1422del (p.Asn47del), causing a spontaneous form of PSACH in our patient. Our in silico model indicated that any mutational changes in this region are very susceptible to PASCH phenotype. Overall, this study is the first PSACH case in the Turkish Cypriot population. Moreover, this finding contributes to the concept that the genotype-phenotype correlation in COMP is still unknown and also improves our understanding of this complex disorder.
Collapse
Affiliation(s)
- Gulten Tuncel
- Departments of Medical Genetics
- DESAM Insitute, Near East University
| | | | - Seref Gul
- Department of Chemical and Biological Engineering, Faculty of Engineering, Koc University, Istanbul
| | | | - Ruveyde Bundak
- Department of Pediatrics, Faculty of Medicine, Kyrenia University, Nicosia, Cyprus
| | - Gamze Mocan
- Departments of Medical Genetics
- Medical Pathology, Faculty of Medicine
| | - Sehime G Temel
- Departments of Medical Genetics
- Histology and Embryology, Faculty of Medicine
- Department of Translational Medicine, Institute of Health Sciences, Bursa Uludag University, Bursa, Turkey
| | - Mahmut C Ergoren
- Departments of Medical Genetics
- DESAM Insitute, Near East University
| |
Collapse
|
29
|
Databases for Protein-Protein Interactions. Methods Mol Biol 2021; 2361:229-248. [PMID: 34236665 DOI: 10.1007/978-1-0716-1641-3_14] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/24/2023]
Abstract
Protein-protein interaction networks have a crucial role in biological processes. Proteins perform multiple functions in forming physical and functional interactions in cellular systems. Information concerning an enormous number of protein interactions in a wide range of species has accumulated and has been integrated into various resources for molecular biology and systems biology. This chapter provides a review of the representative databases and the major computational methods used for protein-protein interactions.
Collapse
|
30
|
Rauer C, Sen N, Waman VP, Abbasian M, Orengo CA. Computational approaches to predict protein functional families and functional sites. Curr Opin Struct Biol 2021; 70:108-122. [PMID: 34225010 DOI: 10.1016/j.sbi.2021.05.012] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2021] [Revised: 05/13/2021] [Accepted: 05/25/2021] [Indexed: 01/06/2023]
Abstract
Understanding the mechanisms of protein function is indispensable for many biological applications, such as protein engineering and drug design. However, experimental annotations are sparse, and therefore, theoretical strategies are needed to fill the gap. Here, we present the latest developments in building functional subclassifications of protein superfamilies and using evolutionary conservation to detect functional determinants, for example, catalytic-, binding- and specificity-determining residues important for delineating the functional families. We also briefly review other features exploited for functional site detection and new machine learning strategies for combining multiple features.
Collapse
Affiliation(s)
- Clemens Rauer
- Institute of Structural and Molecular Biology, University College London, London, WC1E 6BT, UK
| | - Neeladri Sen
- Institute of Structural and Molecular Biology, University College London, London, WC1E 6BT, UK
| | - Vaishali P Waman
- Institute of Structural and Molecular Biology, University College London, London, WC1E 6BT, UK
| | - Mahnaz Abbasian
- Institute of Structural and Molecular Biology, University College London, London, WC1E 6BT, UK
| | - Christine A Orengo
- Institute of Structural and Molecular Biology, University College London, London, WC1E 6BT, UK.
| |
Collapse
|
31
|
Sudhakar P, Machiels K, Verstockt B, Korcsmaros T, Vermeire S. Computational Biology and Machine Learning Approaches to Understand Mechanistic Microbiome-Host Interactions. Front Microbiol 2021; 12:618856. [PMID: 34046017 PMCID: PMC8148342 DOI: 10.3389/fmicb.2021.618856] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2020] [Accepted: 03/19/2021] [Indexed: 12/11/2022] Open
Abstract
The microbiome, by virtue of its interactions with the host, is implicated in various host functions including its influence on nutrition and homeostasis. Many chronic diseases such as diabetes, cancer, inflammatory bowel diseases are characterized by a disruption of microbial communities in at least one biological niche/organ system. Various molecular mechanisms between microbial and host components such as proteins, RNAs, metabolites have recently been identified, thus filling many gaps in our understanding of how the microbiome modulates host processes. Concurrently, high-throughput technologies have enabled the profiling of heterogeneous datasets capturing community level changes in the microbiome as well as the host responses. However, due to limitations in parallel sampling and analytical procedures, big gaps still exist in terms of how the microbiome mechanistically influences host functions at a system and community level. In the past decade, computational biology and machine learning methodologies have been developed with the aim of filling the existing gaps. Due to the agnostic nature of the tools, they have been applied in diverse disease contexts to analyze and infer the interactions between the microbiome and host molecular components. Some of these approaches allow the identification and analysis of affected downstream host processes. Most of the tools statistically or mechanistically integrate different types of -omic and meta -omic datasets followed by functional/biological interpretation. In this review, we provide an overview of the landscape of computational approaches for investigating mechanistic interactions between individual microbes/microbiome and the host and the opportunities for basic and clinical research. These could include but are not limited to the development of activity- and mechanism-based biomarkers, uncovering mechanisms for therapeutic interventions and generating integrated signatures to stratify patients.
Collapse
Affiliation(s)
- Padhmanand Sudhakar
- Department of Chronic Diseases, Metabolism and Ageing, Translational Research Center for Gastrointestinal Disorders (TARGID), KU Leuven, Leuven, Belgium
- Earlham Institute, Norwich, United Kingdom
- Quadram Institute Bioscience, Norwich, United Kingdom
| | - Kathleen Machiels
- Department of Chronic Diseases, Metabolism and Ageing, Translational Research Center for Gastrointestinal Disorders (TARGID), KU Leuven, Leuven, Belgium
| | - Bram Verstockt
- Department of Chronic Diseases, Metabolism and Ageing, Translational Research Center for Gastrointestinal Disorders (TARGID), KU Leuven, Leuven, Belgium
- Department of Gastroenterology and Hepatology, University Hospitals Leuven, KU Leuven, Leuven, Belgium
| | - Tamas Korcsmaros
- Earlham Institute, Norwich, United Kingdom
- Quadram Institute Bioscience, Norwich, United Kingdom
| | - Séverine Vermeire
- Department of Chronic Diseases, Metabolism and Ageing, Translational Research Center for Gastrointestinal Disorders (TARGID), KU Leuven, Leuven, Belgium
- Department of Gastroenterology and Hepatology, University Hospitals Leuven, KU Leuven, Leuven, Belgium
| |
Collapse
|
32
|
Hu L, Wang X, Huang YA, Hu P, You ZH. A survey on computational models for predicting protein-protein interactions. Brief Bioinform 2021; 22:6159365. [PMID: 33693513 DOI: 10.1093/bib/bbab036] [Citation(s) in RCA: 58] [Impact Index Per Article: 14.5] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2020] [Revised: 12/31/2020] [Indexed: 12/24/2022] Open
Abstract
Proteins interact with each other to play critical roles in many biological processes in cells. Although promising, laboratory experiments usually suffer from the disadvantages of being time-consuming and labor-intensive. The results obtained are often not robust and considerably uncertain. Due recently to advances in high-throughput technologies, a large amount of proteomics data has been collected and this presents a significant opportunity and also a challenge to develop computational models to predict protein-protein interactions (PPIs) based on these data. In this paper, we present a comprehensive survey of the recent efforts that have been made towards the development of effective computational models for PPI prediction. The survey introduces the algorithms that can be used to learn computational models for predicting PPIs, and it classifies these models into different categories. To understand their relative merits, the paper discusses different validation schemes and metrics to evaluate the prediction performance. Biological databases that are commonly used in different experiments for performance comparison are also described and their use in a series of extensive experiments to compare different prediction models are discussed. Finally, we present some open issues in PPI prediction for future work. We explain how the performance of PPI prediction can be improved if these issues are effectively tackled.
Collapse
Affiliation(s)
- Lun Hu
- Xinjiang Technical Institute of Physics and Chemistry, Chinese Academy of Sciences, 830011, Urumqi, China
| | - Xiaojuan Wang
- School of Computer Science and Technology, Wuhan University of Technology, 430070, Wuhan, China
| | - Yu-An Huang
- College of Computer Science and Software Engineering, Shenzhen University, 518060, Shenzhen, China
| | | | - Zhu-Hong You
- Xinjiang Technical Institute of Physics and Chemistry, Chinese Academy of Sciences, 830011, Urumqi, China
| |
Collapse
|
33
|
Ruiz-Blanco YB, Ávila-Barrientos LP, Hernández-García E, Antunes A, Agüero-Chapin G, García-Hernández E. Engineering protein fragments via evolutionary and protein-protein interaction algorithms: de novo design of peptide inhibitors for F O F 1 -ATP synthase. FEBS Lett 2020; 595:183-194. [PMID: 33151544 DOI: 10.1002/1873-3468.13988] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2020] [Revised: 10/23/2020] [Accepted: 10/30/2020] [Indexed: 11/08/2022]
Abstract
Enzyme subunit interfaces have remarkable potential in drug design as both target and scaffold for their own inhibitors. We show an evolution-driven strategy for the de novo design of peptide inhibitors targeting interfaces of the Escherichia coli FoF1-ATP synthase as a case study. The evolutionary algorithm ROSE was applied to generate diversity-oriented peptide libraries by engineering peptide fragments from ATP synthase interfaces. The resulting peptides were scored with PPI-Detect, a sequence-based predictor of protein-protein interactions. Two selected peptides were confirmed by in vitro inhibition and binding tests. The proposed methodology can be widely applied to design peptides targeting relevant interfaces of enzymatic complexes.
Collapse
Affiliation(s)
| | | | | | - Agostinho Antunes
- CIMAR/CIIMAR, Centro Interdisciplinar de Investigação Marinha e Ambiental, Universidade do Porto, Terminal de Cruzeiros do Porto de Leixões, Portugal.,Departamento de Biologia, Faculdade de Ciências, Universidade do Porto, Portugal
| | - Guillermin Agüero-Chapin
- CIMAR/CIIMAR, Centro Interdisciplinar de Investigação Marinha e Ambiental, Universidade do Porto, Terminal de Cruzeiros do Porto de Leixões, Portugal.,Departamento de Biologia, Faculdade de Ciências, Universidade do Porto, Portugal
| | | |
Collapse
|
34
|
Amalgamation of 3D structure and sequence information for protein-protein interaction prediction. Sci Rep 2020; 10:19171. [PMID: 33154416 PMCID: PMC7645622 DOI: 10.1038/s41598-020-75467-x] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2020] [Accepted: 09/17/2020] [Indexed: 11/08/2022] Open
Abstract
Protein is the primary building block of living organisms. It interacts with other proteins and is then involved in various biological processes. Protein-protein interactions (PPIs) help in predicting and hence help in understanding the functionality of the proteins, causes and growth of diseases, and designing new drugs. However, there is a vast gap between the available protein sequences and the identification of protein-protein interactions. To bridge this gap, researchers proposed several computational methods to reveal the interactions between proteins. These methods merely depend on sequence-based information of proteins. With the advancement of technology, different types of information related to proteins are available such as 3D structure information. Nowadays, deep learning techniques are adopted successfully in various domains, including bioinformatics. So, current work focuses on the utilization of different modalities, such as 3D structures and sequence-based information of proteins, and deep learning algorithms to predict PPIs. The proposed approach is divided into several phases. We first get several illustrations of proteins using their 3D coordinates information, and three attributes, such as hydropathy index, isoelectric point, and charge of amino acids. Amino acids are the building blocks of proteins. A pre-trained ResNet50 model, a subclass of a convolutional neural network, is utilized to extract features from these representations of proteins. Autocovariance and conjoint triad are two widely used sequence-based methods to encode proteins, which are used here as another modality of protein sequences. A stacked autoencoder is utilized to get the compact form of sequence-based information. Finally, the features obtained from different modalities are concatenated in pairs and fed into the classifier to predict labels for protein pairs. We have experimented on the human PPIs dataset and Saccharomyces cerevisiae PPIs dataset and compared our results with the state-of-the-art deep-learning-based classifiers. The results achieved by the proposed method are superior to those obtained by the existing methods. Extensive experimentations on different datasets indicate that our approach to learning and combining features from two different modalities is useful in PPI prediction.
Collapse
|
35
|
Beg AZ, Khan AU. Motifs and interface amino acid-mediated regulation of amyloid biogenesis in microbes to humans: potential targets for intervention. Biophys Rev 2020; 12:1249-1256. [PMID: 32930961 DOI: 10.1007/s12551-020-00759-5] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2020] [Accepted: 09/04/2020] [Indexed: 02/08/2023] Open
Abstract
Amyloids are linked to many debilitating diseases in mammals. Some organisms produce amyloids that have a functional role in the maintenance of their biological processes. Microbes utilize functional bacterial amyloids (FuBA) for pathogenicity and infections. Amyloid biogenesis is regulated differentially in various systems to avoid its toxic accumulation. A familiar feature in the process of amyloid biogenesis from humans to microbes is its regulation by protein-protein interactions (PPI). The spatial arrangement of amino acid residues in proteins generates topologies like flat interface and linear motif, which participate in protein interactions. Motifs and interface residue-mediated interactions have a direct or an indirect impact on amyloid secretion and assembly. Some motifs undergo post-translational modifications (PTM), which effects interactions and dynamics of the amyloid biogenesis cascade. Interaction-induced local changes stimulate global conformational transitions in the PPI complex, which indirectly affects amyloid formation. Perturbation of such motifs and interface residues results in amyloid abolishment. Interface residues, motifs and their respective interactive protein partners could serve as potential targets for intervention to inhibit amyloid biogenesis.
Collapse
Affiliation(s)
- Ayesha Z Beg
- Medical Microbiology and Molecular Biology, Interdisciplinary Biotechnology Unit, Aligarh Muslim University, Aligarh, 202002, India
| | - Asad U Khan
- Medical Microbiology and Molecular Biology, Interdisciplinary Biotechnology Unit, Aligarh Muslim University, Aligarh, 202002, India.
| |
Collapse
|
36
|
Poot Velez AH, Fontove F, Del Rio G. Protein-Protein Interactions Efficiently Modeled by Residue Cluster Classes. Int J Mol Sci 2020; 21:E4787. [PMID: 32640745 PMCID: PMC7370293 DOI: 10.3390/ijms21134787] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2020] [Revised: 06/20/2020] [Accepted: 06/28/2020] [Indexed: 01/22/2023] Open
Abstract
Predicting protein-protein interactions (PPI) represents an important challenge in structural bioinformatics. Current computational methods display different degrees of accuracy when predicting these interactions. Different factors were proposed to help improve these predictions, including choosing the proper descriptors of proteins to represent these interactions, among others. In the current work, we provide a representative protein structure that is amenable to PPI classification using machine learning approaches, referred to as residue cluster classes. Through sampling and optimization, we identified the best algorithm-parameter pair to classify PPI from more than 360 different training sets. We tested these classifiers against PPI datasets that were not included in the training set but shared sequence similarity with proteins in the training set to reproduce the situation of most proteins sharing sequence similarity with others. We identified a model with almost no PPI error (96-99% of correctly classified instances) and showed that residue cluster classes of protein pairs displayed a distinct pattern between positive and negative protein interactions. Our results indicated that residue cluster classes are structural features relevant to model PPI and provide a novel tool to mathematically model the protein structure/function relationship.
Collapse
Affiliation(s)
- Albros Hermes Poot Velez
- Department of biochemistry and structural biology, Instituto de fisiologia celular, UNAM Mexico City 04510, Mexico;
| | | | - Gabriel Del Rio
- Department of biochemistry and structural biology, Instituto de fisiologia celular, UNAM Mexico City 04510, Mexico;
| |
Collapse
|
37
|
Temel SG, Ergoren MC, Manara E, Paolacci S, Tuncel G, Gul S, Bertelli M. Unique combination and in silico modeling of biallelic POLR3A variants as a cause of Wiedemann-Rautenstrauch syndrome. Eur J Hum Genet 2020; 28:1675-1680. [PMID: 32555393 DOI: 10.1038/s41431-020-0673-1] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2019] [Revised: 05/19/2020] [Accepted: 06/02/2020] [Indexed: 11/10/2022] Open
Abstract
Neonatal progeroid syndrome or Wiedemann-Rautenstrauch syndrome (WRS; MIM 264090) is a rare genetic disorder that has clinical symptoms including premature aging, lipodystrophy, and variable mental impairment. Until recently genetic background of the disease was unclear. However, recent studies have indicated that WRS patients have compound heterozygote variations in the POLR3A (RNA polymerase III subunit 3A; MIM 614258) gene that might be responsible for the disease phenotype. In this study we report a WRS patient that has compound heterozygote variations in the POLR3A gene. One of the reported variations in our patient, c.3568C>T, p.(Gln1190Ter), is a novel variation that was not reported before. The other variant, c.3337-11T>C, was previously shown in WRS patients in trans with other variations.
Collapse
Affiliation(s)
- Sehime Gulsun Temel
- Bursa Uludag University, Faculty of Medicine, Department of Medical Genetics, Bursa, Turkey. .,Bursa Uludag University, Faculty of Medicine, Department of Histology and Embryology, Bursa, Turkey. .,Bursa Uludag University, Institute of Health Sciences, Department of Translational Medicine, Bursa, Turkey.
| | - Mahmut Cerkez Ergoren
- Near East University, Faculty of Medicine, Department of Medical Biology, 99138, Nicosia, Cyprus.,Near East University, DESAM Insitute, 99138, Nicosia, Cyprus
| | | | | | - Gulten Tuncel
- Near East University, Faculty of Medicine, Department of Medical Biology, 99138, Nicosia, Cyprus.,Near East University, DESAM Insitute, 99138, Nicosia, Cyprus
| | - Seref Gul
- Koc University, Faculty of Engineering, Department of Chemical and Biological Engineering, Istanbul, Turkey
| | | |
Collapse
|
38
|
Rosa N, Campos B, Esteves AC, Duarte AS, Correia MJ, Silva RM, Barros M. Tracking the functional meaning of the human oral-microbiome protein-protein interactions. ADVANCES IN PROTEIN CHEMISTRY AND STRUCTURAL BIOLOGY 2020; 121:199-235. [PMID: 32312422 DOI: 10.1016/bs.apcsb.2019.11.014] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/27/2022]
Abstract
The interactome - the network of protein-protein interactions (PPIs) within a cell or organism - is technically difficult to assess. Bioinformatic tools can, not only, identify potential PPIs that can be later experimentally validated, but also be used to assign functional meaning to PPIs. Saliva's potential as a non-invasive diagnostic fluid is currently being explored by several research groups. But, in order to fully attain its potential, it is necessary to achieve the full characterization of the mechanisms that take place within this ecosystem. The onset of omics technologies, and specifically of proteomics, delivered a huge set of data that is largely underexplored. Quantitative information relative to proteins within a given context (for example a given disease) can be used by computational algorithms to generate information regarding PPIs. These PPIs can be further analyzed concerning their functional meaning and used to identify potential biomarkers, therapeutic targets, defense and pathogenicity mechanisms. We describe a computational pipeline that can be used to identify and analyze PPIs between human and microbial proteins. The pipeline was tested within the scenario of human PPIs of systemic (Zika Virus infection) and of oral conditions (Periodontal disease) and also in the context of microbial interactions (Candida-Streptococcus) and showed to successfully predict functionally relevant PPIs. The pipeline can be applied to different scientific areas, such as pharmacological research, since a functional meaningful PPI network can provide insights on potential drug targets, and even new uses for existing drugs on the market.
Collapse
Affiliation(s)
- Nuno Rosa
- Universidade Católica Portuguesa, Faculty of Dental Medicine, Center for Interdisciplinary Research in Health (CIIS), Viseu, Portugal
| | - Bruno Campos
- Universidade Católica Portuguesa, Faculty of Dental Medicine, Center for Interdisciplinary Research in Health (CIIS), Viseu, Portugal
| | - Ana Cristina Esteves
- Universidade Católica Portuguesa, Faculty of Dental Medicine, Center for Interdisciplinary Research in Health (CIIS), Viseu, Portugal
| | - Ana Sofia Duarte
- Universidade Católica Portuguesa, Faculty of Dental Medicine, Center for Interdisciplinary Research in Health (CIIS), Viseu, Portugal
| | - Maria José Correia
- Universidade Católica Portuguesa, Faculty of Dental Medicine, Center for Interdisciplinary Research in Health (CIIS), Viseu, Portugal
| | - Raquel M Silva
- Universidade Católica Portuguesa, Faculty of Dental Medicine, Center for Interdisciplinary Research in Health (CIIS), Viseu, Portugal
| | - Marlene Barros
- Universidade Católica Portuguesa, Faculty of Dental Medicine, Center for Interdisciplinary Research in Health (CIIS), Viseu, Portugal
| |
Collapse
|
39
|
Starokadomskyy P, Wilton KM, Krzewski K, Lopez A, Sifuentes-Dominguez L, Overlee B, Chen Q, Ray A, Gil-Krzewska A, Peterson M, Kinch LN, Rohena L, Grunebaum E, Zinn AR, Grishin NV, Billadeau DD, Burstein E. NK cell defects in X-linked pigmentary reticulate disorder. JCI Insight 2019; 4:125688. [PMID: 31672938 PMCID: PMC6948767 DOI: 10.1172/jci.insight.125688] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2018] [Accepted: 10/02/2019] [Indexed: 01/16/2023] Open
Abstract
X-linked reticulate pigmentary disorder (XLPDR, Mendelian Inheritance in Man #301220) is a rare syndrome characterized by recurrent infections and sterile multiorgan inflammation. The syndrome is caused by an intronic mutation in POLA1, the gene encoding the catalytic subunit of DNA polymerase-α (Pol-α), which is responsible for Okazaki fragment synthesis during DNA replication. Reduced POLA1 expression in this condition triggers spontaneous type I interferon expression, which can be linked to the autoinflammatory manifestations of the disease. However, the history of recurrent infections in this syndrome is as yet unexplained. Here we report that patients with XLPDR have reduced NK cell cytotoxic activity and decreased numbers of NK cells, particularly differentiated, stage V, cells (CD3–CD56dim). This phenotype is reminiscent of hypomorphic mutations in MCM4, which encodes a component of the minichromosome maintenance (MCM) helicase complex that is functionally linked to Pol-α during the DNA replication process. We find that POLA1 deficiency leads to MCM4 depletion and that both can impair NK cell natural cytotoxicity and show that this is due to a defect in lytic granule polarization. Altogether, our study provides mechanistic connections between Pol-α and the MCM complex and demonstrates their relevance in NK cell function. X-linked reticulate pigmentary disorder is associated with functional NK cell defect due to abnormal lytic granule polarization.
Collapse
Affiliation(s)
- Petro Starokadomskyy
- Department of Internal Medicine, University of Texas Southwestern Medical Center, Dallas, Texas, USA
| | - Katelynn M Wilton
- Department of Immunology and Department of Biochemistry and Molecular Biology, Mayo Clinic College of Medicine, Mayo Clinic, Rochester, Minnesota, USA
| | - Konrad Krzewski
- Receptor Cell Biology Section, Laboratory of Immunogenetics, National Institute of Allergy and Infectious Diseases, NIH, Rockville, Maryland, USA
| | - Adam Lopez
- Department of Internal Medicine, University of Texas Southwestern Medical Center, Dallas, Texas, USA
| | | | - Brittany Overlee
- Department of Immunology and Department of Biochemistry and Molecular Biology, Mayo Clinic College of Medicine, Mayo Clinic, Rochester, Minnesota, USA
| | - Qing Chen
- Department of Internal Medicine, University of Texas Southwestern Medical Center, Dallas, Texas, USA.,Department of Surgery, Tongji University affiliated Tongji Hospital, Shanghai, China
| | - Ann Ray
- Department of Microbiology, University of Texas Southwestern Medical Center, Dallas, Texas, USA
| | - Aleksandra Gil-Krzewska
- Receptor Cell Biology Section, Laboratory of Immunogenetics, National Institute of Allergy and Infectious Diseases, NIH, Rockville, Maryland, USA
| | - Mary Peterson
- Molecular and Cellular Immunology Section, Laboratory of Immunogenetics, National Institute of Allergy and Infectious Diseases, NIH, Rockville, Maryland, USA
| | - Lisa N Kinch
- Howard Hughes Medical Institute, University of Texas Southwestern Medical Center, Dallas, Texas, USA
| | - Luis Rohena
- Division of Genetics, Department of Pediatrics, San Antonio Military Medical Center, San Antonio, Texas, USA
| | - Eyal Grunebaum
- Division of Immunology and Allergy and Department of Pediatrics, Developmental and Stem Cell Biology Program, Research Institute, Hospital for Sick Children, Toronto, Ontario, Canada
| | - Andrew R Zinn
- Department of Internal Medicine, University of Texas Southwestern Medical Center, Dallas, Texas, USA.,Eugene McDermott Center for Human Growth and Development
| | - Nick V Grishin
- Howard Hughes Medical Institute, University of Texas Southwestern Medical Center, Dallas, Texas, USA.,Department of Biochemistry.,Department of Biophysics, and
| | - Daniel D Billadeau
- Department of Immunology and Department of Biochemistry and Molecular Biology, Mayo Clinic College of Medicine, Mayo Clinic, Rochester, Minnesota, USA
| | - Ezra Burstein
- Department of Internal Medicine, University of Texas Southwestern Medical Center, Dallas, Texas, USA.,Department of Molecular Biology, University of Texas Southwestern Medical Center, Dallas, Texas, USA
| |
Collapse
|
40
|
Badal VD, Wright D, Katsis Y, Kim HC, Swafford AD, Knight R, Hsu CN. Challenges in the construction of knowledge bases for human microbiome-disease associations. MICROBIOME 2019; 7:129. [PMID: 31488215 PMCID: PMC6728997 DOI: 10.1186/s40168-019-0742-2] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/03/2019] [Accepted: 08/20/2019] [Indexed: 05/05/2023]
Abstract
The last few years have seen tremendous growth in human microbiome research, with a particular focus on the links to both mental and physical health and disease. Medical and experimental settings provide initial sources of information about these links, but individual studies produce disconnected pieces of knowledge bounded in context by the perspective of expert researchers reading full-text publications. Building a knowledge base (KB) consolidating these disconnected pieces is an essential first step to democratize and accelerate the process of accessing the collective discoveries of human disease connections to the human microbiome. In this article, we survey the existing tools and development efforts that have been produced to capture portions of the information needed to construct a KB of all known human microbiome-disease associations and highlight the need for additional innovations in natural language processing (NLP), text mining, taxonomic representations, and field-wide vocabulary standardization in human microbiome research. Addressing these challenges will enable the construction of KBs that help identify new insights amenable to experimental validation and potentially clinical decision support.
Collapse
Affiliation(s)
- Varsha Dave Badal
- Center for Microbiome Innovation, Jacobs School of Engineering, University of California, San Diego, 9500 Gilman Drive, La Jolla, CA 92093 USA
| | - Dustin Wright
- Center for Microbiome Innovation, Jacobs School of Engineering, University of California, San Diego, 9500 Gilman Drive, La Jolla, CA 92093 USA
- Department of Computer Science and Engineering, University of California, San Diego, 9500 Gilman Drive, La Jolla, CA 92093 USA
| | - Yannis Katsis
- Scalable Knowledge Intelligence, IBM Research-Almaden, 650 Harry Road, San Jose, CA 95120 USA
| | - Ho-Cheol Kim
- Scalable Knowledge Intelligence, IBM Research-Almaden, 650 Harry Road, San Jose, CA 95120 USA
| | - Austin D. Swafford
- Center for Microbiome Innovation, Jacobs School of Engineering, University of California, San Diego, 9500 Gilman Drive, La Jolla, CA 92093 USA
| | - Rob Knight
- Center for Microbiome Innovation, Jacobs School of Engineering, University of California, San Diego, 9500 Gilman Drive, La Jolla, CA 92093 USA
- Department of Computer Science and Engineering, University of California, San Diego, 9500 Gilman Drive, La Jolla, CA 92093 USA
- UCSD Health Department of Pediatrics, University of California, San Diego, 9500 Gilman Drive, La Jolla, CA 92093 USA
- Department of Bioengineering, University of California, San Diego, 9500 Gilman Drive, La Jolla, CA 92093 USA
| | - Chun-Nan Hsu
- Center for Microbiome Innovation, Jacobs School of Engineering, University of California, San Diego, 9500 Gilman Drive, La Jolla, CA 92093 USA
- Department of Neurosciences and Center for Research in Biological Systems, University of California, San Diego, 9500 Gilman Drive, La Jolla, CA 92093 USA
| |
Collapse
|
41
|
Tiwari S, Dwivedi UN. Discovering Innovative Drugs Targeting Both Cancer and Cardiovascular Disease by Shared Protein-Protein Interaction Network Analyses. OMICS-A JOURNAL OF INTEGRATIVE BIOLOGY 2019; 23:417-425. [PMID: 31329050 DOI: 10.1089/omi.2019.0095] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Abstract
Cancer and cardiovascular disease (CVD) have a common co-occurrence. Both diseases display overlapping pathophysiology and risk factors, suggesting shared biological mechanisms. Conditions such as obesity, diabetes, hypertension, smoking, poor diet, and inadequate physical activity can cause both heart disease and cancer. The burgeoning field of onco-cardiology aims to develop diagnostics and innovative therapeutics for both diseases through targeting shared mechanisms and molecular targets. In this overarching context, this expert review presents an analysis of the protein-protein interaction (PPI) networks for onco-cardiology drug discovery. Several PPI complexes such as MDM2-TP53 and CDK4-pRB have been studied for their tumor-suppressive functions. In addition, XIAP-SMAC, RAC1-GEF, Sur-2ESX, and TP53-BRCA1 are other PPI complexes that offer potential breakthrough for onco-cardiology therapeutics innovation. As both cancer and CVD share biological mechanisms to a certain degree, the PPI network analyses for onco-cardiology drug discovery are promising for addressing comorbid diseases in the spirit of systems medicine. We discuss the emerging architecture of PPI networks in cancer and CVD and prospects and challenges for their exploitation toward therapeutics applications. Finally, we emphasize that PPIs that were once thought to be undruggable have become potential new class of innovative drug targets.
Collapse
Affiliation(s)
- Sameeksha Tiwari
- Bioinformatics Infrastructure Facility, Department of Biochemistry, Centre of Excellence in Bioinformatics, University of Lucknow, Lucknow, Uttar Pradesh, India
| | - Upendra N Dwivedi
- Bioinformatics Infrastructure Facility, Department of Biochemistry, Centre of Excellence in Bioinformatics, University of Lucknow, Lucknow, Uttar Pradesh, India.,Institute for Development of Advanced Computing, ONGC Centre for Advanced Studies, University of Lucknow, Lucknow, Uttar Pradesh, India
| |
Collapse
|
42
|
Ding Z, Kihara D. Computational identification of protein-protein interactions in model plant proteomes. Sci Rep 2019; 9:8740. [PMID: 31217453 PMCID: PMC6584649 DOI: 10.1038/s41598-019-45072-8] [Citation(s) in RCA: 41] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/02/2019] [Accepted: 05/30/2019] [Indexed: 12/12/2022] Open
Abstract
Protein-protein interactions (PPIs) play essential roles in many biological processes. A PPI network provides crucial information on how biological pathways are structured and coordinated from individual protein functions. In the past two decades, large-scale PPI networks of a handful of organisms were determined by experimental techniques. However, these experimental methods are time-consuming, expensive, and are not easy to perform on new target organisms. Large-scale PPI data is particularly sparse in plant organisms. Here, we developed a computational approach for detecting PPIs trained and tested on known PPIs of Arabidopsis thaliana and applied to three plants, Arabidopsis thaliana, Glycine max (soybean), and Zea mays (maize) to discover new PPIs on a genome-scale. Our method considers a variety of features including protein sequences, gene co-expression, functional association, and phylogenetic profiles. This is the first work where a PPI prediction method was developed for is the first PPI prediction method applied on benchmark datasets of Arabidopsis. The method showed a high prediction accuracy of over 90% and very high precision of close to 1.0. We predicted 50,220 PPIs in Arabidopsis thaliana, 13,175,414 PPIs in corn, and 13,527,834 PPIs in soybean. Newly predicted PPIs were classified into three confidence levels according to the availability of existing supporting evidence and discussed. Predicted PPIs in the three plant genomes are made available for future reference.
Collapse
Affiliation(s)
- Ziyun Ding
- Department of Biological Sciences, Purdue University, West Lafayette, IN, 47907, USA.
| | - Daisuke Kihara
- Department of Biological Sciences, Purdue University, West Lafayette, IN, 47907, USA.
- Department of Computer Science, Purdue University, West Lafayette, IN, 47907, USA.
- Department of Pediatrics, University of Cincinnati, Cincinnati, OH, 45229, USA.
| |
Collapse
|
43
|
Aryal UK, Ding Z, Hedrick V, Sobreira TJP, Kihara D, Sherman LA. Analysis of Protein Complexes in the Unicellular Cyanobacterium Cyanothece ATCC 51142. J Proteome Res 2018; 17:3628-3643. [PMID: 30216071 DOI: 10.1021/acs.jproteome.8b00170] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2023]
Abstract
The unicellular cyanobacterium Cyanothece ATCC 51142 is capable of oxygenic photosynthesis and biological N2 fixation (BNF), a process highly sensitive to oxygen. Previous work has focused on determining protein expression levels under different growth conditions. A major gap of our knowledge is an understanding on how these expressed proteins are assembled into complexes and organized into metabolic pathways, an area that has not been thoroughly investigated. Here, we combined size-exclusion chromatography (SEC) with label-free quantitative mass spectrometry (MS) and bioinformatics to characterize many protein complexes from Cyanothece 51142 cells grown under a 12 h light-dark cycle. We identified 1386 proteins in duplicate biological replicates, and 64% of those proteins were identified as putative complexes. Pairwise computational prediction of protein-protein interaction (PPI) identified 74 822 putative interactions, of which 2337 interactions were highly correlated with published protein coexpressions. Many sequential glycolytic and TCA cycle enzymes were identified as putative complexes. We also identified many membrane complexes that contain cytoplasmic domains. Subunits of NDH-1 complex eluted in a fraction with an approximate mass of ∼669 kDa, and subunits composition revealed coexistence of distinct forms of NDH-1 complex subunits responsible for respiration, electron flow, and CO2 uptake. The complex form of the phycocyanin beta subunit was nonphosphorylated, and the monomer form was phosphorylated at Ser20, suggesting phosphorylation-dependent deoligomerization of the phycocyanin beta subunit. This study provides an analytical platform for future studies to reveal how these complexes assemble and disassemble as a function of diurnal and circadian rhythms.
Collapse
|