1
|
Wang Y, Mei C, Zhou Y, Wang Y, Zheng C, Zhen X, Xiong Y, Chen P, Zhang J, Wang B. Semi-supervised prediction of protein interaction sites from unlabeled sample information. BMC Bioinformatics 2019; 20:699. [PMID: 31874616 PMCID: PMC6929468 DOI: 10.1186/s12859-019-3274-7] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022] Open
Abstract
Background The recognition of protein interaction sites is of great significance in many biological processes, signaling pathways and drug designs. However, most sites on protein sequences cannot be defined as interface or non-interface sites because only a small part of protein interactions had been identified, which will cause the lack of prediction accuracy and generalization ability of predictors in protein interaction sites prediction. Therefore, it is necessary to effectively improve prediction performance of protein interaction sites using large amounts of unlabeled data together with small amounts of labeled data and background knowledge today. Results In this work, three semi-supervised support vector machine–based methods are proposed to improve the performance in the protein interaction sites prediction, in which the information of unlabeled protein sites can be involved. Herein, five features related with the evolutionary conservation of amino acids are extracted from HSSP database and Consurf Sever, i.e., residue spatial sequence spectrum, residue sequence information entropy and relative entropy, residue sequence conserved weight and residual Base evolution rate, to represent the residues within the protein sequence. Then three predictors are built for identifying the interface residues from protein surface using three types of semi-supervised support vector machine algorithms. Conclusion The experimental results demonstrated that the semi-supervised approaches can effectively improve prediction performance of protein interaction sites when unlabeled information is involved into the predictors and one of them can achieve the best prediction performance, i.e., the accuracy of 70.7%, the sensitivity of 62.67% and the specificity of 78.72%, respectively. With comparison to the existing studies, the semi-supervised models show the improvement of the predication performance.
Collapse
Affiliation(s)
- Ye Wang
- School of Electrical and Information Engineering, Anhui University of Technology, Maanshan, 243002, Anhui, China
| | - Changqing Mei
- School of Electrical and Information Engineering, Anhui University of Technology, Maanshan, 243002, Anhui, China
| | - Yuming Zhou
- School of Electrical and Information Engineering, Anhui University of Technology, Maanshan, 243002, Anhui, China
| | - Yan Wang
- School of Electrical and Information Engineering, Anhui University of Technology, Maanshan, 243002, Anhui, China
| | - Chunhou Zheng
- Co-Innovation Center for Information Supply & Assurance Technology, Anhui University, Hefei, 230601, Anhui, China
| | - Xiao Zhen
- School of Computer Science and Technology, Anhui University of Technology, Maanshan, 243002, Anhui, China
| | - Yan Xiong
- School of Computer Science and Technology, University of Science & Technology, Hefei, 230026, Anhui, China
| | - Peng Chen
- Institute of Health Sciences, Anhui University, Hefei, 230601, Anhui, China.
| | - Jun Zhang
- College of Electrical Engineering and Automation, Anhui University, Hefei, 230601, Anhui, China
| | - Bing Wang
- School of Electrical and Information Engineering, Anhui University of Technology, Maanshan, 243002, Anhui, China. .,Co-Innovation Center for Information Supply & Assurance Technology, Anhui University, Hefei, 230601, Anhui, China.
| |
Collapse
|
2
|
Zahiri J, Emamjomeh A, Bagheri S, Ivazeh A, Mahdevar G, Sepasi Tehrani H, Mirzaie M, Fakheri BA, Mohammad-Noori M. Protein complex prediction: A survey. Genomics 2019; 112:174-183. [PMID: 30660789 DOI: 10.1016/j.ygeno.2019.01.011] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2018] [Revised: 11/27/2018] [Accepted: 01/15/2019] [Indexed: 02/08/2023]
Abstract
Protein complexes are one of the most important functional units for deriving biological processes within the cell. Experimental methods have provided valuable data to infer protein complexes. However, these methods have inherent limitations. Considering these limitations, many computational methods have been proposed to predict protein complexes, in the last decade. Almost all of these in-silico methods predict protein complexes from the ever-increasing protein-protein interaction (PPI) data. These computational approaches usually use the PPI data in the format of a huge protein-protein interaction network (PPIN) as input and output various sub-networks of the given PPIN as the predicted protein complexes. Some of these methods have already reached a promising efficiency in protein complex detection. Nonetheless, there are challenges in prediction of other types of protein complexes, specially sparse and small ones. New methods should further incorporate the knowledge of biological properties of proteins to improve the performance. Additionally, there are several challenges that should be considered more effectively in designing the new complex prediction algorithms in the future. This article not only reviews the history of computational protein complex prediction but also provides new insight for improvement of new methodologies. In this article, most important computational methods for protein complex prediction are evaluated and compared. In addition, some of the challenges in the reconstruction of the protein complexes are discussed. Finally, various tools for protein complex prediction and PPIN analysis as well as the current high-throughput databases are reviewed.
Collapse
Affiliation(s)
- Javad Zahiri
- Bioinformatics and Computational Omics Lab (BioCOOL), Department of Biophysics, Faculty of Biological Sciences, Tarbiat Modares University, Tehran, Iran
| | - Abbasali Emamjomeh
- Laboratory of Computational Biotechnology and Bioinformatics (CBB), Department of Plant Breeding and Biotechnology, University of Zabol, Zabol, Iran.
| | - Samaneh Bagheri
- Department of Plant Breeding and Biotechnology (PBB), Faculty of Agriculture, University of Zabol, Zabol, Iran
| | - Asma Ivazeh
- Database Research Group (DBRG), Control and intelligent Processing Center of Excellence (CIPCE), School of Electrical and Computer Engineering, College of Engineering, University of Tehran, Tehran, Iran
| | - Ghasem Mahdevar
- Department of Mathematics, Faculty of Sciences, University of Isfahan, Isfahan, Iran
| | - Hessam Sepasi Tehrani
- Department of Biology, Science and Research Branch, Islamic Azad University, Tehran, Iran
| | - Mehdi Mirzaie
- Department of Applied Mathematics, Faculty of Mathematical Sciences, Tarbiat Modares University, Tehran, Iran
| | - Barat Ali Fakheri
- Department of Plant Breeding and Biotechnology (PBB), Faculty of Agriculture, University of Zabol, Zabol, Iran
| | - Morteza Mohammad-Noori
- School of Mathematics, Statistics, and Computer Science, College of Science, University of Tehran, Tehran, Iran
| |
Collapse
|
3
|
Hamzeh-Mivehroud M, Sokouti B, Dastmalchi S. Molecular Docking at a Glance. Oncology 2017. [DOI: 10.4018/978-1-5225-0549-5.ch030] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
The current chapter introduces different aspects of molecular docking technique in order to give an overview to the readers about the topics which will be dealt with throughout this volume. Like many other fields of science, molecular docking studies has experienced a lagging period of slow and steady increase in terms of acquiring attention of scientific community as well as its frequency of application, followed by a pronounced era of exponential expansion in theory, methodology, areas of application and performance due to developments in related technologies such as computational resources and theoretical as well as experimental biophysical methods. In the following sections the evolution of molecular docking will be reviewed and its different components including methods, search algorithms, scoring functions, validation of the methods, and area of applications plus few case studies will be touched briefly.
Collapse
Affiliation(s)
| | | | - Siavoush Dastmalchi
- Biotechnology Research Center, Tabriz University of Medical Sciences, Iran & School of Pharmacy, Tabriz University of Medical Sciences, Iran
| |
Collapse
|
4
|
Chang CW, Chou CW, Chang DTH. CCProf: exploring conformational change profile of proteins. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2016; 2016:baw029. [PMID: 27016699 PMCID: PMC4808249 DOI: 10.1093/database/baw029] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/14/2015] [Accepted: 02/23/2016] [Indexed: 12/18/2022]
Abstract
In many biological processes, proteins have important interactions with various molecules such as proteins, ions or ligands. Many proteins undergo conformational changes upon these interactions, where regions with large conformational changes are critical to the interactions. This work presents the CCProf platform, which provides conformational changes of entire proteins, named conformational change profile (CCP) in the context. CCProf aims to be a platform where users can study potential causes of novel conformational changes. It provides 10 biological features, including conformational change, potential binding target site, secondary structure, conservation, disorder propensity, hydropathy propensity, sequence domain, structural domain, phosphorylation site and catalytic site. All these information are integrated into a well-aligned view, so that researchers can capture important relevance between different biological features visually. The CCProf contains 986 187 protein structure pairs for 3123 proteins. In addition, CCProf provides a 3D view in which users can see the protein structures before and after conformational changes as well as binding targets that induce conformational changes. All information (e.g. CCP, binding targets and protein structures) shown in CCProf, including intermediate data are available for download to expedite further analyses. Database URL: http://zoro.ee.ncku.edu.tw/ccprof/
Collapse
Affiliation(s)
- Che-Wei Chang
- Department of Electrical Engineering, National Cheng Kung University, Tainan, 70101, Taiwan
| | - Chai-Wei Chou
- Department of Electrical Engineering, National Cheng Kung University, Tainan, 70101, Taiwan
| | - Darby Tien-Hao Chang
- Department of Electrical Engineering, National Cheng Kung University, Tainan, 70101, Taiwan
| |
Collapse
|
5
|
Spread of Mutant Middle East Respiratory Syndrome Coronavirus with Reduced Affinity to Human CD26 during the South Korean Outbreak. mBio 2016; 7:e00019. [PMID: 26933050 PMCID: PMC4810480 DOI: 10.1128/mbio.00019-16] [Citation(s) in RCA: 71] [Impact Index Per Article: 8.9] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
The newly emerging Middle East respiratory syndrome coronavirus (MERS-CoV) causes a severe respiratory infection with a high mortality rate (~35%). MERS-CoV has been a global threat due to continuous outbreaks in the Arabian peninsula and international spread by infected travelers since 2012. From May to July 2015, a large outbreak initiated by an infected traveler from the Arabian peninsula swept South Korea and resulted in 186 confirmed cases with 38 deaths (case fatality rate, 20.4%). Here, we show the rapid emergence and spread of a mutant MERS-CoV with reduced affinity to the human CD26 receptor during the South Korean outbreak. We isolated 13 new viral genomes from 14 infected patients treated at a hospital and found that 12 of these genomes possess a point mutation in the receptor-binding domain (RBD) of viral spike (S) protein. Specifically, 11 of these genomes have an I529T mutation in RBD, and 1 has a D510G mutation. Strikingly, both mutations result in reduced affinity of RBD to human CD26 compared to wild-type RBD, as measured by surface plasmon resonance analysis and cellular binding assay. Additionally, pseudotyped virus bearing an I529T mutation in S protein showed reduced entry into host cells compared to virus with wild-type S protein. These unexpected findings suggest that MERS-CoV adaptation during human-to-human spread may be driven by host immunological pressure such as neutralizing antibodies, resulting in reduced affinity to host receptor, and thereby impairs viral fitness and virulence, rather than positive selection for a better affinity to CD26. Recently, a large outbreak initiated by an MERS-CoV-infected traveler from the Middle East swept South Korea and resulted in 186 confirmed cases with 38 deaths. This is the largest outbreak outside the Middle East, and it raised strong concerns about the possible emergence of MERS-CoV mutations. Here, we isolated 13 new viral genomes and found that 12 of them possess a point mutation in the receptor-binding domain of viral spike protein, resulting in reduced affinity to the human cognate receptor, CD26, compared to the wild-type virus. These unexpected findings suggest that MERS-CoV adaptation in humans may be driven by host immunological pressure.
Collapse
|
6
|
Choi YS, Yoon S, Kim KL, Yoo J, Song P, Kim M, Shin YE, Yang WJ, Noh JE, Cho HS, Kim S, Chung J, Ryu SH. Computational design of binding proteins to EGFR domain II. PLoS One 2014; 9:e92513. [PMID: 24710267 PMCID: PMC3977815 DOI: 10.1371/journal.pone.0092513] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2013] [Accepted: 02/24/2014] [Indexed: 12/03/2022] Open
Abstract
We developed a process to produce novel interactions between two previously unrelated proteins. This process selects protein scaffolds and designs protein interfaces that bind to a surface patch of interest on a target protein. Scaffolds with shapes complementary to the target surface patch were screened using an exhaustive computational search of the human proteome and optimized by directed evolution using phage display. This method was applied to successfully design scaffolds that bind to epidermal growth factor receptor (EGFR) domain II, the interface of EGFR dimerization, with high reactivity toward the target surface patch of EGFR domain II. One potential application of these tailor-made protein interactions is the development of therapeutic agents against specific protein targets.
Collapse
Affiliation(s)
- Yoon Sup Choi
- Cancer Research Institute, Seoul National University School of Medicine, Seoul, Republic of Korea
- School of Interdisciplinary Bioscience and Bioengineering (I-Bio), Pohang University of Science and Technology (POSTECH), Pohang, Republic of Korea
- KT Institute of Convergence Technology, Seocho-gu, Seoul, Korea
| | - Soomin Yoon
- Cancer Research Institute, Seoul National University School of Medicine, Seoul, Republic of Korea
- Department of Biochemistry and Molecular Biology, Seoul National University School of Medicine, Seoul, Republic of Korea
| | - Kyung-Lock Kim
- Division of Integrative Bioscience and Biotechnology, Pohang University of Science and Technology (POSTECH), Pohang, Republic of Korea
| | - Jiho Yoo
- Department of Systems Biology, College of Life Science and Biotechnology, Yonsei University, Seoul, Republic of Korea
| | - Parkyong Song
- Division of Integrative Bioscience and Biotechnology, Pohang University of Science and Technology (POSTECH), Pohang, Republic of Korea
| | - Minsoo Kim
- Cancer Research Institute, Seoul National University School of Medicine, Seoul, Republic of Korea
- Scripps Korea Antibody Institute, Chuncheon, Republic of Korea
| | - Young-Eun Shin
- Division of Integrative Bioscience and Biotechnology, Pohang University of Science and Technology (POSTECH), Pohang, Republic of Korea
| | - Won Jun Yang
- Cancer Research Institute, Seoul National University School of Medicine, Seoul, Republic of Korea
- Department of Biochemistry and Molecular Biology, Seoul National University School of Medicine, Seoul, Republic of Korea
| | - Jung-eun Noh
- Division of Integrative Bioscience and Biotechnology, Pohang University of Science and Technology (POSTECH), Pohang, Republic of Korea
| | - Hyun-soo Cho
- Department of Systems Biology, College of Life Science and Biotechnology, Yonsei University, Seoul, Republic of Korea
| | - Sanguk Kim
- School of Interdisciplinary Bioscience and Bioengineering (I-Bio), Pohang University of Science and Technology (POSTECH), Pohang, Republic of Korea
- Division of IT Convergence Engineering, Pohang University of Science and Technology (POSTECH), Pohang, Republic of Korea
- * E-mail: (SK); (JC); (SHR)
| | - Junho Chung
- Cancer Research Institute, Seoul National University School of Medicine, Seoul, Republic of Korea
- Department of Biochemistry and Molecular Biology, Seoul National University School of Medicine, Seoul, Republic of Korea
- * E-mail: (SK); (JC); (SHR)
| | - Sung Ho Ryu
- School of Interdisciplinary Bioscience and Bioengineering (I-Bio), Pohang University of Science and Technology (POSTECH), Pohang, Republic of Korea
- Division of Integrative Bioscience and Biotechnology, Pohang University of Science and Technology (POSTECH), Pohang, Republic of Korea
- * E-mail: (SK); (JC); (SHR)
| |
Collapse
|
7
|
Verma R, Schwaneberg U, Roccatano D. Computer-Aided Protein Directed Evolution: a Review of Web Servers, Databases and other Computational Tools for Protein Engineering. Comput Struct Biotechnol J 2012; 2:e201209008. [PMID: 24688649 PMCID: PMC3962222 DOI: 10.5936/csbj.201209008] [Citation(s) in RCA: 48] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2012] [Revised: 10/07/2012] [Accepted: 10/12/2012] [Indexed: 12/01/2022] Open
Abstract
The combination of computational and directed evolution methods has proven a winning strategy for protein engineering. We refer to this approach as computer-aided protein directed evolution (CAPDE) and the review summarizes the recent developments in this rapidly growing field. We will restrict ourselves to overview the availability, usability and limitations of web servers, databases and other computational tools proposed in the last five years. The goal of this review is to provide concise information about currently available computational resources to assist the design of directed evolution based protein engineering experiment.
Collapse
Affiliation(s)
- Rajni Verma
- School of Engineering and Science, Jacobs University Bremen, Campus Ring 1, 28759 Bremen, Germany ; Department of Biotechnology, RWTH Aachen University, Worringer Weg 1, 52074 Aachen, Germany
| | - Ulrich Schwaneberg
- Department of Biotechnology, RWTH Aachen University, Worringer Weg 1, 52074 Aachen, Germany
| | - Danilo Roccatano
- School of Engineering and Science, Jacobs University Bremen, Campus Ring 1, 28759 Bremen, Germany
| |
Collapse
|
8
|
Biophysical characterization of Entamoeba histolytica phosphoserine aminotransferase (EhPSAT): role of cofactor and domains in stability and subunit assembly. EUROPEAN BIOPHYSICS JOURNAL: EBJ 2010; 40:599-610. [DOI: 10.1007/s00249-010-0654-3] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/03/2010] [Revised: 11/21/2010] [Accepted: 11/25/2010] [Indexed: 10/18/2022]
|