201
|
Fu H, Cao Z, Li M, Wang S. ACEP: improving antimicrobial peptides recognition through automatic feature fusion and amino acid embedding. BMC Genomics 2020; 21:597. [PMID: 32859150 PMCID: PMC7455913 DOI: 10.1186/s12864-020-06978-0] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2020] [Accepted: 08/11/2020] [Indexed: 12/14/2022] Open
Abstract
BACKGROUND Antimicrobial resistance is one of our most serious health threats. Antimicrobial peptides (AMPs), effecter molecules of innate immune system, can defend host organisms against microbes and most have shown a lowered likelihood for bacteria to form resistance compared to many conventional drugs. Thus, AMPs are gaining popularity as better substitute to antibiotics. To aid researchers in novel AMPs discovery, we design computational approaches to screen promising candidates. RESULTS In this work, we design a deep learning model that can learn amino acid embedding patterns, automatically extract sequence features, and fuse heterogeneous information. Results show that the proposed model outperforms state-of-the-art methods on recognition of AMPs. By visualizing data in some layers of the model, we overcome the black-box nature of deep learning, explain the working mechanism of the model, and find some import motifs in sequences. CONCLUSIONS ACEP model can capture similarity between amino acids, calculate attention scores for different parts of a peptide sequence in order to spot important parts that significantly contribute to final predictions, and automatically fuse a variety of heterogeneous information or features. For high-throughput AMPs recognition, open source software and datasets are made freely available at https://github.com/Fuhaoyi/ACEP .
Collapse
Affiliation(s)
- Haoyi Fu
- School of Information Science and Engineering, Yunnan University, Kunming, 650500, China
| | - Zicheng Cao
- School of Public Health (Shenzhen), Sun Yat-sen University, Guangzhou, 510006, China
| | - Mingyuan Li
- School of Information Science and Engineering, Yunnan University, Kunming, 650500, China
| | - Shunfang Wang
- School of Information Science and Engineering, Yunnan University, Kunming, 650500, China.
| |
Collapse
|
202
|
Jing R, Li Y, Xue L, Liu F, Li M, Luo J. autoBioSeqpy: A Deep Learning Tool for the Classification of Biological Sequences. J Chem Inf Model 2020; 60:3755-3764. [PMID: 32786512 DOI: 10.1021/acs.jcim.0c00409] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Deep learning has proven to be a powerful method with applications in various fields including image, language, and biomedical data. Thanks to the libraries and toolkits such as TensorFlow, PyTorch, and Keras, researchers can use different deep learning architectures and data sets for rapid modeling. However, the available implementations of neural networks using these toolkits are usually designed for a specific research and are difficult to transfer to other work. Here, we present autoBioSeqpy, a tool that uses deep learning for biological sequence classification. The advantage of this tool is its simplicity. Users only need to prepare the input data set and then use a command line interface. Then, autoBioSeqpy automatically executes a series of customizable steps including text reading, parameter initialization, sequence encoding, model loading, training, and evaluation. In addition, the tool provides various ready-to-apply and adapt model templates to improve the usability of these networks. We introduce the application of autoBioSeqpy on three biological sequence problems: the prediction of type III secreted proteins, protein subcellular localization, and CRISPR/Cas9 sgRNA activity. autoBioSeqpy is freely available with examples at https://github.com/jingry/autoBioSeqpy.
Collapse
Affiliation(s)
- Runyu Jing
- College of Cybersecurity, Sichuan University, Chengdu 610065, China
| | - Yizhou Li
- College of Cybersecurity, Sichuan University, Chengdu 610065, China
| | - Li Xue
- School of Public Health, Southwest Medical University, Luzhou, Sichuan 646000, China
| | - Fengjuan Liu
- School of Geography and Resources, Guizhou Education University, Guiyang 550018, China
| | - Menglong Li
- College of Chemistry, Sichuan University, Chengdu 610065, China
| | - Jiesi Luo
- Department of Pharmacology, School of Pharmacy, Southwest Medical University, Luzhou, Sichuan 646000, China
| |
Collapse
|
203
|
Carpenter K, Pilozzi A, Huang X. A Pilot Study of Multi-Input Recurrent Neural Networks for Drug-Kinase Binding Prediction. Molecules 2020; 25:E3372. [PMID: 32722290 PMCID: PMC7435591 DOI: 10.3390/molecules25153372] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2020] [Revised: 07/20/2020] [Accepted: 07/24/2020] [Indexed: 01/01/2023] Open
Abstract
The use of virtual drug screening can be beneficial to research teams, enabling them to narrow down potentially useful compounds for further study. A variety of virtual screening methods have been developed, typically with machine learning classifiers at the center of their design. In the present study, we created a virtual screener for protein kinase inhibitors. Experimental compound-target interaction data were obtained from the IDG-DREAM Drug-Kinase Binding Prediction Challenge. These data were converted and fed as inputs into two multi-input recurrent neural networks (RNNs). The first network utilized data encoded in one-hot representation, while the other incorporated embedding layers. The models were developed in Python, and were designed to output the IC50 of the target compounds. The performance of the models was assessed primarily through analysis of the Q2 values produced from runs of differing sample and epoch size; recorded loss values were also reported and graphed. The performance of the models was limited, though multiple changes are proposed for potential improvement of a multi-input recurrent neural network-based screening tool.
Collapse
Affiliation(s)
| | | | - Xudong Huang
- Neurochemistry Laboratory, Department of Psychiatry, Massachusetts General Hospital and Harvard Medical School, Charlestown, MA 02129, USA; (K.C.); (A.P.)
| |
Collapse
|
204
|
Liu Z, Chen Q, Lan W, Liang J, Chen YPP, Chen B. A Survey of Network Embedding for Drug Analysis and Prediction. Curr Protein Pept Sci 2020; 22:CPPS-EPUB-107859. [PMID: 32614745 DOI: 10.2174/1389203721666200702145701] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/01/2020] [Revised: 04/05/2020] [Accepted: 05/21/2020] [Indexed: 11/22/2022]
Abstract
Traditional network-based computational methods have shown good results in drug analysis and prediction. However, these methods are time consuming and lack universality, and it is difficult to exploit the auxiliary information of nodes and edges. Network embedding provides a promising way for alleviating the above problems by transforming network into a low-dimensional space while preserving network structure and auxiliary information. This thus facilitates the application of machine learning algorithms for subsequent processing. Network embedding has been introduced into drug analysis and prediction in the last few years, and has shown superior performance over traditional methods. However, there is no systematic review of this issue. This article offers a comprehensive survey of the primary network embedding methods and their applications in drug analysis and prediction. The network embedding technologies applied in homogeneous network and heterogeneous network are investigated and compared, including matrix decomposition, random walk, and deep learning. Especially, the Graph neural network (GNN) methods in deep learning are highlighted. Further, the applications of network embedding in drug similarity estimation, drug-target interaction prediction, adverse drug reactions prediction, protein function and therapeutic peptides prediction are discussed. Several future potential research directions are also discussed.
Collapse
Affiliation(s)
- Zhixian Liu
- School of Medical, Guangxi University, Nanning. China
| | - Qingfeng Chen
- School of Computer, Electronic and Information, Guangxi University, Nanning. China
| | - Wei Lan
- School of Computer, Electronic and Information, Guangxi University, Nanning. China
| | - Jiahai Liang
- School of Electronics and Information Engineering, Beibu Gulf University, Qinzhou. China
| | - Yi-Ping Phoebe Chen
- Department of Computer Science and Information Technology, La Trobe University, Melbourne. Australia
| | - Baoshan Chen
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-bioresources, Guangxi University, Nanning. China
| |
Collapse
|
205
|
Timmons PB, Hewage CM. HAPPENN is a novel tool for hemolytic activity prediction for therapeutic peptides which employs neural networks. Sci Rep 2020; 10:10869. [PMID: 32616760 PMCID: PMC7331684 DOI: 10.1038/s41598-020-67701-3] [Citation(s) in RCA: 42] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2020] [Accepted: 06/09/2020] [Indexed: 12/11/2022] Open
Abstract
The growing prevalence of resistance to antibiotics motivates the search for new antibacterial agents. Antimicrobial peptides are a diverse class of well-studied membrane-active peptides which function as part of the innate host defence system, and form a promising avenue in antibiotic drug research. Some antimicrobial peptides exhibit toxicity against eukaryotic membranes, typically characterised by hemolytic activity assays, but currently, the understanding of what differentiates hemolytic and non-hemolytic peptides is limited. This study leverages advances in machine learning research to produce a novel artificial neural network classifier for the prediction of hemolytic activity from a peptide's primary sequence. The classifier achieves best-in-class performance, with cross-validated accuracy of [Formula: see text] and Matthews correlation coefficient of 0.71. This innovative classifier is available as a web server at https://research.timmons.eu/happenn , allowing the research community to utilise it for in silico screening of peptide drug candidates for high therapeutic efficacies.
Collapse
Affiliation(s)
- Patrick Brendan Timmons
- UCD School of Biomolecular and Biomedical Science, UCD Centre for Synthesis and Chemical Biology, UCD Conway Institute, University College Dublin, Dublin 4, Ireland
| | - Chandralal M Hewage
- UCD School of Biomolecular and Biomedical Science, UCD Centre for Synthesis and Chemical Biology, UCD Conway Institute, University College Dublin, Dublin 4, Ireland.
| |
Collapse
|
206
|
Shi Q, Chen W, Huang S, Jin F, Dong Y, Wang Y, Xue Z. DNN-Dom: predicting protein domain boundary from sequence alone by deep neural network. Bioinformatics 2020; 35:5128-5136. [PMID: 31197306 DOI: 10.1093/bioinformatics/btz464] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2019] [Revised: 05/07/2019] [Accepted: 06/05/2019] [Indexed: 11/13/2022] Open
Abstract
MOTIVATION Accurate delineation of protein domain boundary plays an important role for protein engineering and structure prediction. Although machine-learning methods are widely used to predict domain boundary, these approaches often ignore long-range interactions among residues, which have been proven to improve the prediction performance. However, how to simultaneously model the local and global interactions to further improve domain boundary prediction is still a challenging problem. RESULTS This article employs a hybrid deep learning method that combines convolutional neural network and gate recurrent units' models for domain boundary prediction. It not only captures the local and non-local interactions, but also fuses these features for prediction. Additionally, we adopt balanced Random Forest for classification to deal with high imbalance of samples and high dimensions of deep features. Experimental results show that our proposed approach (DNN-Dom) outperforms existing machine-learning-based methods for boundary prediction. We expect that DNN-Dom can be useful for assisting protein structure and function prediction. AVAILABILITY AND IMPLEMENTATION The method is available as DNN-Dom Server at http://isyslab.info/DNN-Dom/. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Qiang Shi
- School of Software Engineering and College of Life Science & Technology, Huazhong University of Science and Technology, Wuhan 430074, China
| | - Weiya Chen
- School of Software Engineering and College of Life Science & Technology, Huazhong University of Science and Technology, Wuhan 430074, China
| | - Siqi Huang
- School of Software Engineering and College of Life Science & Technology, Huazhong University of Science and Technology, Wuhan 430074, China
| | - Fanglin Jin
- School of Software Engineering and College of Life Science & Technology, Huazhong University of Science and Technology, Wuhan 430074, China
| | - Yinghao Dong
- School of Software Engineering and College of Life Science & Technology, Huazhong University of Science and Technology, Wuhan 430074, China
| | - Yan Wang
- School of Software Engineering and College of Life Science & Technology, Huazhong University of Science and Technology, Wuhan 430074, China
| | - Zhidong Xue
- School of Software Engineering and College of Life Science & Technology, Huazhong University of Science and Technology, Wuhan 430074, China
| |
Collapse
|
207
|
Xie R, Li J, Wang J, Dai W, Leier A, Marquez-Lago TT, Akutsu T, Lithgow T, Song J, Zhang Y. DeepVF: a deep learning-based hybrid framework for identifying virulence factors using the stacking strategy. Brief Bioinform 2020; 22:5864586. [PMID: 32599617 DOI: 10.1093/bib/bbaa125] [Citation(s) in RCA: 37] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2019] [Revised: 05/22/2020] [Accepted: 05/22/2020] [Indexed: 12/14/2022] Open
Abstract
Virulence factors (VFs) enable pathogens to infect their hosts. A wealth of individual, disease-focused studies has identified a wide variety of VFs, and the growing mass of bacterial genome sequence data provides an opportunity for computational methods aimed at predicting VFs. Despite their attractive advantages and performance improvements, the existing methods have some limitations and drawbacks. Firstly, as the characteristics and mechanisms of VFs are continually evolving with the emergence of antibiotic resistance, it is more and more difficult to identify novel VFs using existing tools that were previously developed based on the outdated data sets; secondly, few systematic feature engineering efforts have been made to examine the utility of different types of features for model performances, as the majority of tools only focused on extracting very few types of features. By addressing the aforementioned issues, the accuracy of VF predictors can likely be significantly improved. This, in turn, would be particularly useful in the context of genome wide predictions of VFs. In this work, we present a deep learning (DL)-based hybrid framework (termed DeepVF) that is utilizing the stacking strategy to achieve more accurate identification of VFs. Using an enlarged, up-to-date dataset, DeepVF comprehensively explores a wide range of heterogeneous features with popular machine learning algorithms. Specifically, four classical algorithms, including random forest, support vector machines, extreme gradient boosting and multilayer perceptron, and three DL algorithms, including convolutional neural networks, long short-term memory networks and deep neural networks are employed to train 62 baseline models using these features. In order to integrate their individual strengths, DeepVF effectively combines these baseline models to construct the final meta model using the stacking strategy. Extensive benchmarking experiments demonstrate the effectiveness of DeepVF: it achieves a more accurate and stable performance compared with baseline models on the benchmark dataset and clearly outperforms state-of-the-art VF predictors on the independent test. Using the proposed hybrid ensemble model, a user-friendly online predictor of DeepVF (http://deepvf.erc.monash.edu/) is implemented. Furthermore, its utility, from the user's viewpoint, is compared with that of existing toolkits. We believe that DeepVF will be exploited as a useful tool for screening and identifying potential VFs from protein-coding gene sequences in bacterial genomes.
Collapse
Affiliation(s)
- Ruopeng Xie
- Bioinformatics Lab at Guilin University of Electronic Technology
| | - Jiahui Li
- Bioinformatics Lab at Guilin University of Electronic Technology
| | - Jiawei Wang
- Biomedicine Discovery Institute and the Department of Microbiology at Monash University, Australia
| | - Wei Dai
- School of Computer Science and Information Security, Guilin University of Electronic Technology, China
| | - André Leier
- Department of Genetics and the Department of Cell, Developmental and Integrative Biology, University of Alabama at Birmingham (UAB) School of Medicine, USA
| | - Tatiana T Marquez-Lago
- Department of Genetics and the Department of Cell, Developmental and Integrative Biology, University of Alabama at Birmingham (UAB) School of Medicine, USA
| | | | - Trevor Lithgow
- Biomedicine Discovery Institute and the Director of the Centre to Impact AMR at Monash University, Australia
| | - Jiangning Song
- Group Leader in the Biomedicine Discovery Institute and the Department of Biochemistry and Molecular Biology, Monash University, Melbourne, Australia
| | - Yanju Zhang
- Leiden Institute of Advanced Computer Science, Leiden University
| |
Collapse
|
208
|
Proteomic Screening for Prediction and Design of Antimicrobial Peptides with AmpGram. Int J Mol Sci 2020; 21:ijms21124310. [PMID: 32560350 PMCID: PMC7352166 DOI: 10.3390/ijms21124310] [Citation(s) in RCA: 41] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2020] [Revised: 06/10/2020] [Accepted: 06/11/2020] [Indexed: 01/12/2023] Open
Abstract
Antimicrobial peptides (AMPs) are molecules widespread in all branches of the tree of life that participate in host defense and/or microbial competition. Due to their positive charge, hydrophobicity and amphipathicity, they preferentially disrupt negatively charged bacterial membranes. AMPs are considered an important alternative to traditional antibiotics, especially at the time when multidrug-resistant bacteria being on the rise. Therefore, to reduce the costs of experimental research, robust computational tools for AMP prediction and identification of the best AMP candidates are essential. AmpGram is our novel tool for AMP prediction; it outperforms top-ranking AMP classifiers, including AMPScanner, CAMPR3R and iAMPpred. It is the first AMP prediction tool created for longer AMPs and for high-throughput proteomic screening. AmpGram prediction reliability was confirmed on the example of lactoferrin and thrombin. The former is a well known antimicrobial protein and the latter a cryptic one. Both proteins produce (after protease treatment) functional AMPs that have been experimentally validated at molecular level. The lactoferrin and thrombin AMPs were located in the antimicrobial regions clearly detected by AmpGram. Moreover, AmpGram also provides a list of shot 10 amino acid fragments in the antimicrobial regions, along with their probability predictions; these can be used for further studies and the rational design of new AMPs. AmpGram is available as a web-server, and an easy-to-use R package for proteomic analysis at CRAN repository.
Collapse
|
209
|
Maurya NS, Kushwaha S, Mani A. Recent Advances and Computational Approaches in Peptide Drug Discovery. Curr Pharm Des 2020; 25:3358-3366. [PMID: 31544714 DOI: 10.2174/1381612825666190911161106] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2019] [Accepted: 09/05/2019] [Indexed: 12/19/2022]
Abstract
BACKGROUND Drug design and development is a vast field that requires huge investment along with a long duration for providing approval to suitable drug candidates. With the advancement in the field of genomics, the information about druggable targets is being updated at a fast rate which is helpful in finding a cure for various diseases. METHODS There are certain biochemicals as well as physiological advantages of using peptide-based therapeutics. Additionally, the limitations of peptide-based drugs can be overcome by modulating the properties of peptide molecules through various biomolecular engineering techniques. Recent advances in computational approaches have been helpful in studying the effect of peptide drugs on the biomolecular targets. Receptor - ligand-based molecular docking studies have made it easy to screen compatible inhibitors against a target.Furthermore, there are simulation tools available to evaluate stability of complexes at the molecular level. Machine learning methods have added a new edge by enabling accurate prediction of therapeutic peptides. RESULTS Peptide-based drugs are expected to take over many popular drugs in the near future due to their biosafety, lower off-target binding chances and multifunctional properties. CONCLUSION This article summarises the latest developments in the field of peptide-based therapeutics related to their usage, tools, and databases.
Collapse
Affiliation(s)
- Neha S Maurya
- Department of Biotechnology, Motilal Nehru National Institute of Technology, Allahabad, India
| | - Sandeep Kushwaha
- Department of Plant Breeding, Sveriges lantbruksuniversitet, Swedish University of Agricultural Sciences, Uppsala, Sweden
| | - Ashutosh Mani
- Department of Biotechnology, Motilal Nehru National Institute of Technology, Allahabad, India
| |
Collapse
|
210
|
Divyashree M, Mani MK, Reddy D, Kumavath R, Ghosh P, Azevedo V, Barh D. Clinical Applications of Antimicrobial Peptides (AMPs): Where do we Stand Now? Protein Pept Lett 2020; 27:120-134. [PMID: 31553285 DOI: 10.2174/0929866526666190925152957] [Citation(s) in RCA: 54] [Impact Index Per Article: 13.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2018] [Revised: 04/24/2019] [Accepted: 08/04/2019] [Indexed: 12/12/2022]
Abstract
In this era of multi-drug resistance (MDR), antimicrobial peptides (AMPs) are one of the most promising classes of potential drug candidates to combat communicable as well as noncommunicable diseases such as cancers and diabetes. AMPs show a wide spectrum of biological activities which include antiviral, antifungal, anti-mitogenic, anticancer, and anti-inflammatory properties. Apart from these prospective therapeutic potentials, the AMPs can act as food preservatives and immune modulators. Therefore, AMPs have the potential to replace conventional drugs and may gain a significant global drug market share. Although several AMPs have shown therapeutic potential in vitro or in vivo, in most cases they have failed the clinical trial owing to various issues. In this review, we discuss in brief (i) molecular mechanisms of AMPs in various diseases, (ii) importance of AMPs in pharmaceutical industries, (iii) the challenges in using AMPs as therapeutics and how to overcome, (iv) available AMP therapeutics in market, and (v) AMPs under clinical trials. Here, we specifically focus on the therapeutic AMPs in the areas of dermatology, surgery, oncology and metabolic diseases.
Collapse
Affiliation(s)
- Mithoor Divyashree
- Nitte University Centre for Science Education & Research (NUCSER), NITTE (Deemed to be University), Paneer campus, Deralakatte, Mangalore - 575018, Karnataka,India
| | - Madhu K Mani
- Nitte University Centre for Science Education & Research (NUCSER), NITTE (Deemed to be University), Paneer campus, Deralakatte, Mangalore - 575018, Karnataka,India
| | - Dhanasekhar Reddy
- Department of Genomic Science, School of Biological Sciences, Central University of Kerala, Tejaswini Hills, Periya (P.O) Kasaragod, Kerala-671316,India
| | - Ranjith Kumavath
- Department of Genomic Science, School of Biological Sciences, Central University of Kerala, Tejaswini Hills, Periya (P.O) Kasaragod, Kerala-671316,India
| | - Preetam Ghosh
- Department of Computer Science, Virginia Commonwealth University, Richmond, VA 23284,United States
| | - Vasco Azevedo
- Laboratório de GenéticaCelular e Molecular, Programa de Pós-graduaçãoemBioinformática, Instituto de CiênciasBiológicas (ICB), Universidade Federal de Minas Gerais, Av. Antonio Carlos 6627, Pampulha, Belo Horizonte, CEP 31270-901,Brazil
| | - Debmalya Barh
- Nitte University Centre for Science Education & Research (NUCSER), NITTE (Deemed to be University), Paneer campus, Deralakatte, Mangalore - 575018, Karnataka,India.,Centre for Genomics and Applied Gene Technology, Institute of Integrative Omics and Applied Biotechnology (IIOAB), Nonakuri, PurbaMedinipur, West Bengal, India
| |
Collapse
|
211
|
Antimicrobial peptide CGA-N12 decreases the Candida tropicalis mitochondrial membrane potential via mitochondrial permeability transition pore. Biosci Rep 2020; 40:223802. [PMID: 32368781 PMCID: PMC7225414 DOI: 10.1042/bsr20201007] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2020] [Revised: 04/20/2020] [Accepted: 05/04/2020] [Indexed: 12/17/2022] Open
Abstract
Amino acid sequence from 65th to 76th residue of the N-terminus of Chromogranin A (CGA-N12) is an antimicrobial peptide (AMP). Our previous studies showed that CGA-N12 reduces Candida tropicalis mitochondrial membrane potential. Here, we explored the mechanism that CGA-N12 collapsed the mitochondrial membrane potential by investigations of its action on the mitochondrial permeability transition pore (mPTP) complex of C. tropicalis. The results showed that CGA-N12 induced cytochrome c (Cyt c) leakage, mitochondria swelling and led to polyethylene glycol (PEG) of molecular weight 1000 Da penetrate mitochondria. mPTP opening inhibitors bongkrekic acid (BA) could contract the mitochondrial swelling induced by CGA-N12, but cyclosporin A (CsA) could not. Therefore, we speculated that CGA-N12 could induce C. tropicolis mPTP opening by preventing the matrix-facing (m) conformation of adenine nucleotide transporter (ANT), thereby increasing the permeability of the mitochondrial membrane and resulted in the mitochondrial potential dissipation.
Collapse
|
212
|
Zheng L, Huang S, Mu N, Zhang H, Zhang J, Chang Y, Yang L, Zuo Y. RAACBook: a web server of reduced amino acid alphabet for sequence-dependent inference by using Chou's five-step rule. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2020; 2019:5650975. [PMID: 31802128 PMCID: PMC6893003 DOI: 10.1093/database/baz131] [Citation(s) in RCA: 41] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/20/2019] [Revised: 10/16/2019] [Accepted: 10/17/2019] [Indexed: 12/12/2022]
Abstract
By reducing amino acid alphabet, the protein complexity can be significantly simplified, which could improve computational efficiency, decrease information redundancy and reduce chance of overfitting. Although some reduced alphabets have been proposed, different classification rules could produce distinctive results for protein sequence analysis. Thus, it is urgent to construct a systematical frame for reduced alphabets. In this work, we constructed a comprehensive web server called RAACBook for protein sequence analysis and machine learning application by integrating reduction alphabets. The web server contains three parts: (i) 74 types of reduced amino acid alphabet were manually extracted to generate 673 reduced amino acid clusters (RAACs) for dealing with unique protein problems. It is easy for users to select desired RAACs from a multilayer browser tool. (ii) An online tool was developed to analyze primary sequence of protein. The tool could produce K-tuple reduced amino acid composition by defining three correlation parameters (K-tuple, g-gap, λ-correlation). The results are visualized as sequence alignment, mergence of RAA composition, feature distribution and logo of reduced sequence. (iii) The machine learning server is provided to train the model of protein classification based on K-tuple RAAC. The optimal model could be selected according to the evaluation indexes (ROC, AUC, MCC, etc.). In conclusion, RAACBook presents a powerful and user-friendly service in protein sequence analysis and computational proteomics. RAACBook can be freely available at http://bioinfor.imu.edu.cn/raacbook. Database URL: http://bioinfor.imu.edu.cn/raacbook
Collapse
Affiliation(s)
- Lei Zheng
- State Key Laboratory of Reproductive Regulation and Breeding of Grassland Livestock, College of Life Sciences, Inner Mongolia University, Zhaojun Road No.24, Hohhot, 010070, China
| | - Shenghui Huang
- State Key Laboratory of Reproductive Regulation and Breeding of Grassland Livestock, College of Life Sciences, Inner Mongolia University, Zhaojun Road No.24, Hohhot, 010070, China
| | - Nengjiang Mu
- State Key Laboratory of Reproductive Regulation and Breeding of Grassland Livestock, College of Life Sciences, Inner Mongolia University, Zhaojun Road No.24, Hohhot, 010070, China
| | - Haoyue Zhang
- State Key Laboratory of Reproductive Regulation and Breeding of Grassland Livestock, College of Life Sciences, Inner Mongolia University, Zhaojun Road No.24, Hohhot, 010070, China
| | - Jiayu Zhang
- State Key Laboratory of Reproductive Regulation and Breeding of Grassland Livestock, College of Life Sciences, Inner Mongolia University, Zhaojun Road No.24, Hohhot, 010070, China
| | - Yu Chang
- State Key Laboratory of Reproductive Regulation and Breeding of Grassland Livestock, College of Life Sciences, Inner Mongolia University, Zhaojun Road No.24, Hohhot, 010070, China
| | - Lei Yang
- College of Bioinformatics Science and Technology, Harbin Medical University, Baojian Road No.157, Harbin 150081, China
| | - Yongchun Zuo
- State Key Laboratory of Reproductive Regulation and Breeding of Grassland Livestock, College of Life Sciences, Inner Mongolia University, Zhaojun Road No.24, Hohhot, 010070, China
| |
Collapse
|
213
|
Yan J, Bhadra P, Li A, Sethiya P, Qin L, Tai HK, Wong KH, Siu SWI. Deep-AmPEP30: Improve Short Antimicrobial Peptides Prediction with Deep Learning. MOLECULAR THERAPY. NUCLEIC ACIDS 2020; 20:882-894. [PMID: 32464552 PMCID: PMC7256447 DOI: 10.1016/j.omtn.2020.05.006] [Citation(s) in RCA: 101] [Impact Index Per Article: 25.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/21/2020] [Revised: 04/08/2020] [Accepted: 05/06/2020] [Indexed: 12/12/2022]
Abstract
Antimicrobial peptides (AMPs) are a valuable source of antimicrobial agents and a potential solution to the multi-drug resistance problem. In particular, short-length AMPs have been shown to have enhanced antimicrobial activities, higher stability, and lower toxicity to human cells. We present a short-length (≤30 aa) AMP prediction method, Deep-AmPEP30, developed based on an optimal feature set of PseKRAAC reduced amino acids composition and convolutional neural network. On a balanced benchmark dataset of 188 samples, Deep-AmPEP30 yields an improved performance of 77% in accuracy, 85% in the area under the receiver operating characteristic curve (AUC-ROC), and 85% in area under the precision-recall curve (AUC-PR) over existing machine learning-based methods. To demonstrate its power, we screened the genome sequence of Candida glabrata—a gut commensal fungus expected to interact with and/or inhibit other microbes in the gut—for potential AMPs and identified a peptide of 20 aa (P3, FWELWKFLKSLWSIFPRRRP) with strong anti-bacteria activity against Bacillus subtilis and Vibrio parahaemolyticus. The potency of the peptide is remarkably comparable to that of ampicillin. Therefore, Deep-AmPEP30 is a promising prediction tool to identify short-length AMPs from genomic sequences for drug discovery. Our method is available at https://cbbio.cis.um.edu.mo/AxPEP for both individual sequence prediction and genome screening for AMPs.
Collapse
Affiliation(s)
- Jielu Yan
- Department of Computer and Information Science, University of Macau, Macau, China
| | - Pratiti Bhadra
- Department of Computer and Information Science, University of Macau, Macau, China
| | - Ang Li
- Faculty of Health Sciences, University of Macau, Macau, China
| | - Pooja Sethiya
- Faculty of Health Sciences, University of Macau, Macau, China
| | - Longguang Qin
- Faculty of Health Sciences, University of Macau, Macau, China
| | - Hio Kuan Tai
- Department of Computer and Information Science, University of Macau, Macau, China
| | - Koon Ho Wong
- Faculty of Health Sciences, University of Macau, Macau, China; Institute of Translational Medicines, University of Macau, Macau, China
| | - Shirley W I Siu
- Department of Computer and Information Science, University of Macau, Macau, China.
| |
Collapse
|
214
|
Ao C, Zhang Y, Li D, Zhao Y, Zou Q. Progress in the development of antimicrobial peptide prediction tools. Curr Protein Pept Sci 2020; 22:CPPS-EPUB-103746. [PMID: 31957609 DOI: 10.2174/1389203721666200117163802] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2019] [Revised: 06/12/2019] [Accepted: 07/15/2019] [Indexed: 11/22/2022]
Abstract
Antimicrobial peptides (AMPs) are natural polypeptides with antimicrobial activities and are found in most organisms. AMPs are evolutionarily conservative components that belong to the innate immune system and show potent activity against bacteria, fungi, viruses and in some cases display antitumor activity. Thus, AMPs are major candidates in the development of new antibacterial reagents. In the last few decades, AMPs have attracted significant attention from the research community. During the early stages of the development of this research field, AMPs were experimentally identified, which is an expensive and time-consuming procedure. Therefore, research and development (R&D) of fast, highly efficient computational tools for predicting AMPs has enabled the rapid identification and analysis of new AMPs from a wide range of organisms. Moreover, these computational tools have allowed researchers to better understand the activities of AMPs, which has promoted R&D of antibacterial drugs. In this review, we systematically summarize AMP prediction tools and their corresponding algorithms used.
Collapse
Affiliation(s)
- Chunyan Ao
- Institute of Fundamental and Frontier Sciences - University of Electronic Science and Technology of China Chengdu. China
| | - Yu Zhang
- Department of neurosurgery - Heilongjiang Province Land Reclamation Headquarters General Hospital Harbin. China
| | - Dapeng Li
- Department of Internal Medicine-Oncology - The Fourth Hospital in Qinhuangdao Hebei. China
| | - Yuming Zhao
- Information and Computer Engineering College - Northeast Forestry University Harbin. China
| | - Quan Zou
- Institute of Fundamental and Frontier Sciences - University of Electronic Science and Technology of China Chengdu. China
| |
Collapse
|
215
|
Bakare OO, Fadaka AO, Klein A, Keyster M, Pretorius A. Diagnostic approaches of pneumonia for commercial-scale biomedical applications: an overview. ALL LIFE 2020. [DOI: 10.1080/26895293.2020.1826363] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022] Open
Affiliation(s)
- Olalekan Olanrewaju Bakare
- Bioinformatics Research Group, Department of Biotechnology, Faculty of Natural Sciences, University of the Western Cape, Bellville, South Africa
- Environmental Biotechnology Laboratory, Department of Biotechnology, Faculty of Natural Sciences, University of the Western Cape, Bellville, South Africa
| | - Adewale Oluwaseun Fadaka
- Bioinformatics Research Group, Department of Biotechnology, Faculty of Natural Sciences, University of the Western Cape, Bellville, South Africa
- Department of Science and Technology/Mintek Nanotechnology Innovation Centre, Bio-labels Node, Department of Biotechnology, Faculty of Natural Sciences, University of the Western Cape, Bellville, South Africa
| | - Ashwil Klein
- Environmental Biotechnology Laboratory, Department of Biotechnology, Faculty of Natural Sciences, University of the Western Cape, Bellville, South Africa
| | - Marshall Keyster
- Environmental Biotechnology Laboratory, Department of Biotechnology, Faculty of Natural Sciences, University of the Western Cape, Bellville, South Africa
| | - Ashley Pretorius
- Bioinformatics Research Group, Department of Biotechnology, Faculty of Natural Sciences, University of the Western Cape, Bellville, South Africa
| |
Collapse
|
216
|
Guo L, Wang S, Li M, Cao Z. Accurate classification of membrane protein types based on sequence and evolutionary information using deep learning. BMC Bioinformatics 2019; 20:700. [PMID: 31874615 PMCID: PMC6929490 DOI: 10.1186/s12859-019-3275-6] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022] Open
Abstract
Background Membrane proteins play an important role in the life activities of organisms. Knowing membrane protein types provides clues for understanding the structure and function of proteins. Though various computational methods for predicting membrane protein types have been developed, the results still do not meet the expectations of researchers. Results We propose two deep learning models to process sequence information and evolutionary information, respectively. Both models obtained better results than traditional machine learning models. Furthermore, to improve the performance of the sequence information model, we also provide a new vector representation method to replace the one-hot encoding, whose overall success rate improved by 3.81% and 6.55% on two datasets. Finally, a more effective model is obtained by fusing the above two models, whose overall success rate reached 95.68% and 92.98% on two datasets. Conclusion The final experimental results show that our method is more effective than existing methods for predicting membrane protein types, which can help laboratory researchers to identify the type of novel membrane proteins.
Collapse
Affiliation(s)
- Lei Guo
- Department of Computer Science and Engineering, School of Information Science and Engineering, Yunnan University, Kunming, 650504, People's Republic of China
| | - Shunfang Wang
- Department of Computer Science and Engineering, School of Information Science and Engineering, Yunnan University, Kunming, 650504, People's Republic of China.
| | - Mingyuan Li
- Department of Computer Science and Engineering, School of Information Science and Engineering, Yunnan University, Kunming, 650504, People's Republic of China
| | - Zicheng Cao
- School of Public Health (Shenzhen), Sun Yat-sen University, Guangzhou, 510006, People's Republic of China
| |
Collapse
|
217
|
Su X, Xu J, Yin Y, Quan X, Zhang H. Antimicrobial peptide identification using multi-scale convolutional network. BMC Bioinformatics 2019; 20:730. [PMID: 31870282 PMCID: PMC6929291 DOI: 10.1186/s12859-019-3327-y] [Citation(s) in RCA: 32] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2019] [Accepted: 12/16/2019] [Indexed: 01/14/2023] Open
Abstract
Background Antibiotic resistance has become an increasingly serious problem in the past decades. As an alternative choice, antimicrobial peptides (AMPs) have attracted lots of attention. To identify new AMPs, machine learning methods have been commonly used. More recently, some deep learning methods have also been applied to this problem. Results In this paper, we designed a deep learning model to identify AMP sequences. We employed the embedding layer and the multi-scale convolutional network in our model. The multi-scale convolutional network, which contains multiple convolutional layers of varying filter lengths, could utilize all latent features captured by the multiple convolutional layers. To further improve the performance, we also incorporated additional information into the designed model and proposed a fusion model. Results showed that our model outperforms the state-of-the-art models on two AMP datasets and the Antimicrobial Peptide Database (APD)3 benchmark dataset. The fusion model also outperforms the state-of-the-art model on an anti-inflammatory peptides (AIPs) dataset at the accuracy. Conclusions Multi-scale convolutional network is a novel addition to existing deep neural network (DNN) models. The proposed DNN model and the modified fusion model outperform the state-of-the-art models for new AMP discovery. The source code and data are available at https://github.com/zhanglabNKU/APIN.
Collapse
Affiliation(s)
- Xin Su
- College of Artificial Intelligence, Nankai University, Tongyan Road, Tianjin, 300350, China
| | - Jing Xu
- College of Computer Science, Nankai University, Tongyan Road, Tianjin, 300350, China
| | - Yanbin Yin
- Nebraska Food for Health Center, Department of Food Science and Technology, University of Nebraska-Lincoln, 1400 R Street, Lincoln, NE, 68588, USA
| | - Xiongwen Quan
- College of Artificial Intelligence, Nankai University, Tongyan Road, Tianjin, 300350, China
| | - Han Zhang
- College of Artificial Intelligence, Nankai University, Tongyan Road, Tianjin, 300350, China.
| |
Collapse
|
218
|
Shi Q, Chen W, Huang S, Wang Y, Xue Z. Deep learning for mining protein data. Brief Bioinform 2019; 22:194-218. [PMID: 31867611 DOI: 10.1093/bib/bbz156] [Citation(s) in RCA: 31] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2019] [Revised: 10/21/2019] [Accepted: 11/07/2019] [Indexed: 01/16/2023] Open
Abstract
The recent emergence of deep learning to characterize complex patterns of protein big data reveals its potential to address the classic challenges in the field of protein data mining. Much research has revealed the promise of deep learning as a powerful tool to transform protein big data into valuable knowledge, leading to scientific discoveries and practical solutions. In this review, we summarize recent publications on deep learning predictive approaches in the field of mining protein data. The application architectures of these methods include multilayer perceptrons, stacked autoencoders, deep belief networks, two- or three-dimensional convolutional neural networks, recurrent neural networks, graph neural networks, and complex neural networks and are described from five perspectives: residue-level prediction, sequence-level prediction, three-dimensional structural analysis, interaction prediction, and mass spectrometry data mining. The advantages and deficiencies of these architectures are presented in relation to various tasks in protein data mining. Additionally, some practical issues and their future directions are discussed, such as robust deep learning for protein noisy data, architecture optimization for specific tasks, efficient deep learning for limited protein data, multimodal deep learning for heterogeneous protein data, and interpretable deep learning for protein understanding. This review provides comprehensive perspectives on general deep learning techniques for protein data analysis.
Collapse
Affiliation(s)
- Qiang Shi
- School of Software Engineering, Huazhong University of Science and Technology. His main interests cover machine learning especially deep learning, protein data analysis, and big data mining
| | - Weiya Chen
- School of Software Engineering, Huazhong University of Science & Technology, Wuhan, China. His research interests cover bioinformatics, virtual reality, and data visualization
| | - Siqi Huang
- Software Engineering at Huazhong University of science and technology, focusing on Machine learning and data mining
| | - Yan Wang
- School of life, University of Science & Technology; her main interests cover protein structure and function prediction and big data mining
| | - Zhidong Xue
- School of Software Engineering, Huazhong University of Science & Technology, Wuhan, China. His research interests cover bioinformatics, machine learning, and image processing
| |
Collapse
|
219
|
iQSP: A Sequence-Based Tool for the Prediction and Analysis of Quorum Sensing Peptides via Chou's 5-Steps Rule and Informative Physicochemical Properties. Int J Mol Sci 2019; 21:ijms21010075. [PMID: 31861928 PMCID: PMC6981611 DOI: 10.3390/ijms21010075] [Citation(s) in RCA: 46] [Impact Index Per Article: 9.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2019] [Revised: 12/13/2019] [Accepted: 12/18/2019] [Indexed: 01/18/2023] Open
Abstract
Understanding of quorum-sensing peptides (QSPs) in their functional mechanism plays an essential role in finding new opportunities to combat bacterial infections by designing drugs. With the avalanche of the newly available peptide sequences in the post-genomic age, it is highly desirable to develop a computational model for efficient, rapid and high-throughput QSP identification purely based on the peptide sequence information alone. Although, few methods have been developed for predicting QSPs, their prediction accuracy and interpretability still requires further improvements. Thus, in this work, we proposed an accurate sequence-based predictor (called iQSP) and a set of interpretable rules (called IR-QSP) for predicting and analyzing QSPs. In iQSP, we utilized a powerful support vector machine (SVM) cooperating with 18 informative features from physicochemical properties (PCPs). Rigorous independent validation test showed that iQSP achieved maximum accuracy and MCC of 93.00% and 0.86, respectively. Furthermore, a set of interpretable rules IR-QSP was extracted by using random forest model and the 18 informative PCPs. Finally, for the convenience of experimental scientists, the iQSP web server was established and made freely available online. It is anticipated that iQSP will become a useful tool or at least as a complementary existing method for predicting and analyzing QSPs.
Collapse
|
220
|
Fukunaga I, Sawada R, Shibata T, Kaitoh K, Sakai Y, Yamanishi Y. Prediction of the Health Effects of Food Peptides and Elucidation of the Mode-of-action Using Multi-task Graph Convolutional Neural Network. Mol Inform 2019; 39:e1900134. [PMID: 31778042 DOI: 10.1002/minf.201900134] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2019] [Accepted: 11/13/2019] [Indexed: 12/29/2022]
Abstract
Food proteins work not only as nutrients but also modulators for the physiological functions of the human body. The physiological functions of food proteins are basically regulated by peptides encrypted in food protein sequences (food peptides). In this study, we propose a novel deep learning-based method to predict the health effects of food peptides and elucidate the mode-of-action. In the algorithm, we estimate potential target proteins of food peptides using a multi-task graph convolutional neural network, and predict its health effects using information about therapeutic targets for diseases. We constructed predictive models based on 21,103 peptide-protein interactions involving 10,950 peptides and 2,533 proteins, and applied the models to food peptides (e. g., lactotripeptide, isoleucyltyrosine and sardine peptide) defined in food for specified health use. The models suggested potential effects such as blood-pressure lowering effects, blood glucose level lowering effects, and anti-cancer effects for several food peptides. The interactions of food peptides with target proteins were confirmed by docking simulations.
Collapse
Affiliation(s)
- Itsuki Fukunaga
- Department of Bioscience and Bioinformatics, Faculty of Computer Science and Systems Engineering, Kyushu Institute of Technology, 680-4 Kawazu, Iizuka, Fukuoka, 820-8502, Japan
| | - Ryusuke Sawada
- Department of Bioscience and Bioinformatics, Faculty of Computer Science and Systems Engineering, Kyushu Institute of Technology, 680-4 Kawazu, Iizuka, Fukuoka, 820-8502, Japan
| | - Tomokazu Shibata
- Department of Bioscience and Bioinformatics, Faculty of Computer Science and Systems Engineering, Kyushu Institute of Technology, 680-4 Kawazu, Iizuka, Fukuoka, 820-8502, Japan
| | - Kazuma Kaitoh
- Department of Bioscience and Bioinformatics, Faculty of Computer Science and Systems Engineering, Kyushu Institute of Technology, 680-4 Kawazu, Iizuka, Fukuoka, 820-8502, Japan
| | - Yukie Sakai
- Department of Bioscience and Bioinformatics, Faculty of Computer Science and Systems Engineering, Kyushu Institute of Technology, 680-4 Kawazu, Iizuka, Fukuoka, 820-8502, Japan
| | - Yoshihiro Yamanishi
- Department of Bioscience and Bioinformatics, Faculty of Computer Science and Systems Engineering, Kyushu Institute of Technology, 680-4 Kawazu, Iizuka, Fukuoka, 820-8502, Japan
| |
Collapse
|
221
|
Torres MD, Sothiselvam S, Lu TK, de la Fuente-Nunez C. Peptide Design Principles for Antimicrobial Applications. J Mol Biol 2019; 431:3547-3567. [DOI: 10.1016/j.jmb.2018.12.015] [Citation(s) in RCA: 184] [Impact Index Per Article: 36.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2018] [Revised: 12/19/2018] [Accepted: 12/22/2018] [Indexed: 02/08/2023]
|
222
|
Zuo Y, Chang Y, Huang S, Zheng L, Yang L, Cao G. iDEF-PseRAAC: Identifying the Defensin Peptide by Using Reduced Amino Acid Composition Descriptor. Evol Bioinform Online 2019; 15:1176934319867088. [PMID: 31391777 PMCID: PMC6669840 DOI: 10.1177/1176934319867088] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2019] [Accepted: 07/08/2019] [Indexed: 11/18/2022] Open
Abstract
Defensins as 1 of major classes of host defense peptides play a significant role in the innate immunity, which are extremely evolved in almost all living organisms. Developing high-throughput computational methods can accurately help in designing drugs or medical means to defense against pathogens. To take up such a challenge, an up-to-date server based on rigorous benchmark dataset, referred to as iDEF-PseRAAC, was designed for predicting the defensin family in this study. By extracting primary sequence compositions based on different types of reduced amino acid alphabet, it was calculated that the best overall accuracy of the selected feature subset was achieved to 92.38%. Therefore, we can conclude that the information provided by abundant types of amino acid reduction will provide efficient and rational methodology for defensin identification. And, a free online server is freely available for academic users at http://bioinfor.imu.edu.cn/idpf. We hold expectations that iDEF-PseRAAC may be a promising weapon for the function annotation about the defensins protein.
Collapse
Affiliation(s)
- Yongchun Zuo
- College of Veterinary Medicine, Inner Mongolia Agricultural University, Hohhot, China.,State Key Laboratory of Reproductive Regulation and Breeding of Grassland Livestock, College of Life Sciences, Inner Mongolia University, Hohhot, China
| | - Yu Chang
- State Key Laboratory of Reproductive Regulation and Breeding of Grassland Livestock, College of Life Sciences, Inner Mongolia University, Hohhot, China
| | - Shenghui Huang
- State Key Laboratory of Reproductive Regulation and Breeding of Grassland Livestock, College of Life Sciences, Inner Mongolia University, Hohhot, China
| | - Lei Zheng
- State Key Laboratory of Reproductive Regulation and Breeding of Grassland Livestock, College of Life Sciences, Inner Mongolia University, Hohhot, China
| | - Lei Yang
- College of Bioinformatics Science and Technology, Harbin Medical University, Harbin, China
| | - Guifang Cao
- College of Veterinary Medicine, Inner Mongolia Agricultural University, Hohhot, China
| |
Collapse
|
223
|
Redesigning Arenicin-1, an Antimicrobial Peptide from the Marine Polychaeta Arenicola marina, by Strand Rearrangement or Branching, Substitution of Specific Residues, and Backbone Linearization or Cyclization. Mar Drugs 2019; 17:md17060376. [PMID: 31234579 PMCID: PMC6627698 DOI: 10.3390/md17060376] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2019] [Revised: 06/19/2019] [Accepted: 06/21/2019] [Indexed: 12/17/2022] Open
Abstract
Arenicin-1, a β-sheet antimicrobial peptide isolated from the marine polychaeta Arenicola marina coelomocytes, has a potent, broad-spectrum microbicidal activity and also shows significant toxicity towards mammalian cells. Several variants were rationally designed to elucidate the role of structural features such as cyclization, a certain symmetry of the residue arrangement, or the presence of specific residues in the sequence, in its membranolytic activity and the consequent effect on microbicidal efficacy and toxicity. The effect of variations on the structure was probed using molecular dynamics simulations, which indicated a significant stability of the β-hairpin scaffold and showed that modifying residue symmetry and β-strand arrangement affected both the twist and the kink present in the native structure. In vitro assays against a panel of Gram-negative and Gram-positive bacteria, including drug-resistant clinical isolates, showed that inversion of the residue arrangement improved the activity against Gram-negative strains but decreased it towards Gram-positive ones. Variants with increased symmetry were somewhat less active, whereas both backbone-cyclized and linear versions of the peptides, as well as variants with R→K and W→F replacement, showed antimicrobial activity comparable with that of the native peptide. All these variants permeabilized both the outer and the inner membranes of Escherichia coli, suggesting that a membranolytic mechanism of action was maintained. Our results indicate that the arenicin scaffold can support a considerable degree of variation while maintaining useful biological properties and can thus serve as a template for the elaboration of novel anti-infective agents.
Collapse
|
224
|
Lin Y, Cai Y, Liu J, Lin C, Liu X. An advanced approach to identify antimicrobial peptides and their function types for penaeus through machine learning strategies. BMC Bioinformatics 2019; 20:291. [PMID: 31182007 PMCID: PMC6557738 DOI: 10.1186/s12859-019-2766-9] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
BACKGROUND Antimicrobial peptides (AMPs) are essential components of the innate immune system and can protect the host from various pathogenic bacteria. The marine environment is known to be one of the richest sources for AMPs. Effective usage of AMPs and their derivatives can greatly improve the immunity and breeding survival rate of aquatic products. It is highly desirable to develop computational tools for rapidly and accurately identifying AMPs and their functional types, for the purpose of helping design new and more effective antimicrobial agents. RESULTS In this study, we made an attempt to develop an advanced machine learning based computational approach, MAMPs-Pred, for identification of AMPs and its function types. Initially, SVM-prot 188-D features were extracted that were subsequently used as input to a two-layer multi-label classifier. In specific, the first layer is to identify whether it is an AMP by applying RF classifier, and the second layer addresses the multi-type problem by identifying the activites or function types of AMPs by applying PS-RF and LC-RF classifiers. To benchmark the methods,the MAMPs-Pred method is also compared with existing best-performing methods in literature and has shown an improved identification accuracy. CONCLUSIONS The results reported in this study indicate that the MAMP-Pred method achieves high performance for identifying AMPs and its functional types.The proposed approach is believed to supplement the tools and techniques that have been developed in the past for predicting AMPs and their function types.
Collapse
Affiliation(s)
- Yuan Lin
- Department of Computer Science, School of Information Science and Technology, Xiamen University, Xiamen, 361005 China
- Sparebanken Vest, Jonsvollsgaten 2, 5011 Bergen, Bergen, 5058 Norway
| | - Yinyin Cai
- Department of Computer Science, School of Information Science and Technology, Xiamen University, Xiamen, 361005 China
| | - Juan Liu
- Department of Instrumental and Electrical Engineering, School of Aerospace Engineering, Xiamen University, Xiamen, 361005 China
| | - Chen Lin
- Department of Computer Science, School of Information Science and Technology, Xiamen University, Xiamen, 361005 China
| | - Xiangrong Liu
- Department of Computer Science, School of Information Science and Technology, Xiamen University, Xiamen, 361005 China
| |
Collapse
|
225
|
Chung CR, Kuo TR, Wu LC, Lee TY, Horng JT. Characterization and identification of antimicrobial peptides with different functional activities. Brief Bioinform 2019; 21:bbz043. [PMID: 31155657 DOI: 10.1093/bib/bbz043] [Citation(s) in RCA: 74] [Impact Index Per Article: 14.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2019] [Revised: 03/20/2019] [Accepted: 03/20/2019] [Indexed: 02/28/2024] Open
Abstract
In recent years, antimicrobial peptides (AMPs) have become an emerging area of focus when developing therapeutics hot spot residues of proteins are dominant against infections. Importantly, AMPs are produced by virtually all known living organisms and are able to target a wide range of pathogenic microorganisms, including viruses, parasites, bacteria and fungi. Although several studies have proposed different machine learning methods to predict peptides as being AMPs, most do not consider the diversity of AMP activities. On this basis, we specifically investigated the sequence features of AMPs with a range of functional activities, including anti-parasitic, anti-viral, anti-cancer and anti-fungal activities and those that target mammals, Gram-positive and Gram-negative bacteria. A new scheme is proposed to systematically characterize and identify AMPs and their functional activities. The 1st stage of the proposed approach is to identify the AMPs, while the 2nd involves further characterization of their functional activities. Sequential forward selection was employed to extract potentially informative features that are possibly associated with the functional activities of the AMPs. These features include hydrophobicity, the normalized van der Waals volume, polarity, charge and solvent accessibility-all of which are essential attributes in classifying between AMPs and non-AMPs. The results revealed the 1st stage AMP classifier was able to achieve an area under the receiver operating characteristic curve (AUC) value of 0.9894. During the 2nd stage, we found pseudo amino acid composition to be an informative attribute when differentiating between AMPs in terms of their functional activities. The independent testing results demonstrated that the AUCs of the multi-class models were 0.7773, 0.9404, 0.8231, 0.8578, 0.8648, 0.8745 and 0.8672 for anti-parasitic, anti-viral, anti-cancer, anti-fungal AMPs and those that target mammals, Gram-positive and Gram-negative bacteria, respectively. The proposed scheme helps facilitate biological experiments related to the functional analysis of AMPs. Additionally, it was implemented as a user-friendly web server (AMPfun, http://fdblab.csie.ncu.edu.tw/AMPfun/index.html) that allows individuals to explore the antimicrobial functions of peptides of interest.
Collapse
Affiliation(s)
- Chia-Ru Chung
- Department of Computer Science and Information Engineering, National Central University, Taoyuan, Taiwan
| | - Ting-Rung Kuo
- Department of Computer Science and Information Engineering, National Central University, Taoyuan, Taiwan
| | - Li-Ching Wu
- Department of Biomedical Sciences and Engineering, National Central University, Taoyuan, Taiwan
| | - Tzong-Yi Lee
- School of Science and Engineering, The Chinese University of Hong Kong, Shenzhen, China
- School of Life and Health Sciences, The Chinese University of Hong Kong, Shenzhen, China
- Warshel Institute for Computational Biology, The Chinese University of Hong Kong, Shenzhen, China
| | - Jorng-Tzong Horng
- Department of Computer Science and Information Engineering, National Central University, Taoyuan, Taiwan
- Department of Bioinformatics and Medical Engineering, Asia University, Taichung, Taiwan
| |
Collapse
|
226
|
Torres MDT, de la Fuente-Nunez C. Toward computer-made artificial antibiotics. Curr Opin Microbiol 2019; 51:30-38. [PMID: 31082661 DOI: 10.1016/j.mib.2019.03.004] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2019] [Accepted: 03/12/2019] [Indexed: 02/07/2023]
Abstract
Merging concepts from synthetic biology and computational biology may yield antibiotics that are less likely to elicit resistance than existing drugs and that yet can fight drug-resistant infections. Indeed, computer-guided strategies coupled with massively parallel high-throughput experimental methods represent a new paradigm for antibiotic discovery. Infections caused by multidrug-resistant microorganisms are increasingly deadly. In the current post-antibiotic era, many of these infections cannot be treated with our existing antimicrobial arsenal. Furthermore, we may have already exhausted the category of large molecules produced in nature having antimicrobial activity: the antibiotic scaffolds we have discovered so far may represent the majority of those that exist. The rise in drug-resistant bacteria and lack of new antibiotic classes clearly call for out-of-the-box strategies. Recent advances in computational synthetic biology have enabled the development of antimicrobials. New molecular descriptors and genetic and pattern recognition algorithms are powerful tools that bring us a step closer to developing efficient antibiotics. We review several computational tools for drug design and a number of recently generated antibiotic candidates, with an emphasis on peptide-based molecules. Design strategies can generate a diversity of synthetic antimicrobial peptides, which may help to mitigate the spread of resistance and combat multidrug-resistant microorganisms.
Collapse
Affiliation(s)
- Marcelo Der Torossian Torres
- Machine Biology Group, Departments of Psychiatry and Microbiology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, United States of America; Department of Bioengineering, University of Pennsylvania, Philadelphia, Pennsylvania, United States of America
| | - Cesar de la Fuente-Nunez
- Machine Biology Group, Departments of Psychiatry and Microbiology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, United States of America; Department of Bioengineering, University of Pennsylvania, Philadelphia, Pennsylvania, United States of America.
| |
Collapse
|
227
|
Kalkatawi M, Magana-Mora A, Jankovic B, Bajic VB. DeepGSR: an optimized deep-learning structure for the recognition of genomic signals and regions. Bioinformatics 2019; 35:1125-1132. [PMID: 30184052 PMCID: PMC6449759 DOI: 10.1093/bioinformatics/bty752] [Citation(s) in RCA: 41] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2017] [Revised: 07/15/2018] [Accepted: 08/31/2018] [Indexed: 01/05/2023] Open
Abstract
MOTIVATION Recognition of different genomic signals and regions (GSRs) in DNA is crucial for understanding genome organization, gene regulation, and gene function, which in turn generate better genome and gene annotations. Although many methods have been developed to recognize GSRs, their pure computational identification remains challenging. Moreover, various GSRs usually require a specialized set of features for developing robust recognition models. Recently, deep-learning (DL) methods have been shown to generate more accurate prediction models than 'shallow' methods without the need to develop specialized features for the problems in question. Here, we explore the potential use of DL for the recognition of GSRs. RESULTS We developed DeepGSR, an optimized DL architecture for the prediction of different types of GSRs. The performance of the DeepGSR structure is evaluated on the recognition of polyadenylation signals (PAS) and translation initiation sites (TIS) of different organisms: human, mouse, bovine and fruit fly. The results show that DeepGSR outperformed the state-of-the-art methods, reducing the classification error rate of the PAS and TIS prediction in the human genome by up to 29% and 86%, respectively. Moreover, the cross-organisms and genome-wide analyses we performed, confirmed the robustness of DeepGSR and provided new insights into the conservation of examined GSRs across species. AVAILABILITY AND IMPLEMENTATION DeepGSR is implemented in Python using Keras API; it is available as open-source software and can be obtained at https://doi.org/10.5281/zenodo.1117159. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Manal Kalkatawi
- Computational Bioscience Research Center, King Abdullah University of Science and Technology, Thuwal, Saudi Arabia
- Faculty of Computing and Information Technology, King Abdulaziz University, Jeddah, Saudi Arabia
| | - Arturo Magana-Mora
- Computational Bioscience Research Center, King Abdullah University of Science and Technology, Thuwal, Saudi Arabia
- Drilling Technology Team, EXPEC-ARC, Saudi Aramco, Dhahran, Saudi Arabia
| | - Boris Jankovic
- Computational Bioscience Research Center, King Abdullah University of Science and Technology, Thuwal, Saudi Arabia
| | - Vladimir B Bajic
- Computational Bioscience Research Center, King Abdullah University of Science and Technology, Thuwal, Saudi Arabia
| |
Collapse
|
228
|
Ryskaliyeva A, Henry C, Miranda G, Faye B, Konuspayeva G, Martin P. Alternative splicing events expand molecular diversity of camel CSN1S2 increasing its ability to generate potentially bioactive peptides. Sci Rep 2019; 9:5243. [PMID: 30918277 PMCID: PMC6437144 DOI: 10.1038/s41598-019-41649-5] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2018] [Accepted: 03/14/2019] [Indexed: 02/08/2023] Open
Abstract
In a previous study on camel milk from Kazakhstan, we reported the occurrence of two unknown proteins (UP1 and UP2) with different levels of phosphorylation. Here we show that UP1 and UP2 are isoforms of camel αs2-CN (αs2-CNsv1 and αs2-CNsv2, respectively) arising from alternative splicing events. First described as a 178 amino-acids long protein carrying eight phosphate groups, the major camel αs2-CN isoform (called here αs2-CN) has a molecular mass of 21,906 Da. αs2-CNsv1, a rather frequent (35%) isoform displaying a higher molecular mass (+1,033 Da), is present at four phosphorylation levels (8P to 11P). Using cDNA-sequencing, αs2-CNsv1 was shown to be a variant arising from the splicing-in of an in-frame 27-nucleotide sequence encoding the nonapeptide ENSKKTVDM, for which the presence at the genome level was confirmed. αs2-CNsv2, which appeared to be present at 8P to 12P, was shown to include an additional decapeptide (VKAYQIIPNL) revealed by LC-MS/MS, encoded by a 3'-extension of exon 16. Since milk proteins represent a reservoir of biologically active peptides, the molecular diversity generated by differential splicing might increase its content. To evaluate this possibility, we searched for bioactive peptides encrypted in the different camel αs2-CN isoforms, using an in silico approach. Several peptides, putatively released from the C-terminal part of camel αs2-CN isoforms after in silico digestion by proteases from the digestive tract, were predicted to display anti-bacterial and antihypertensive activities.
Collapse
Affiliation(s)
- Alma Ryskaliyeva
- INRA, UMR GABI, AgroParisTech, Université Paris-Saclay, 78350, Jouy-en-Josas, France
| | - Céline Henry
- INRA, MICALIS Institute, Plateforme d'Analyse Protéomique Paris Sud-Ouest (PAPPSO), Université Paris-Saclay, 78350, Jouy-en-Josas, France
| | - Guy Miranda
- INRA, UMR GABI, AgroParisTech, Université Paris-Saclay, 78350, Jouy-en-Josas, France
| | - Bernard Faye
- CIRAD, UMR SELMET, 34398, Montpellier Cedex 5, France
| | - Gaukhar Konuspayeva
- Al-Farabi Kazakh National University, Biotechnology department, 050040, Almaty, Kazakhstan
| | - Patrice Martin
- INRA, UMR GABI, AgroParisTech, Université Paris-Saclay, 78350, Jouy-en-Josas, France.
| |
Collapse
|
229
|
Wu Q, Ke H, Li D, Wang Q, Fang J, Zhou J. Recent Progress in Machine Learning-based Prediction of Peptide Activity for Drug Discovery. Curr Top Med Chem 2019; 19:4-16. [DOI: 10.2174/1568026619666190122151634] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2018] [Revised: 11/14/2018] [Accepted: 11/16/2018] [Indexed: 12/25/2022]
Abstract
Over the past decades, peptide as a therapeutic candidate has received increasing attention in
drug discovery, especially for antimicrobial peptides (AMPs), anticancer peptides (ACPs) and antiinflammatory
peptides (AIPs). It is considered that the peptides can regulate various complex diseases
which are previously untouchable. In recent years, the critical problem of antimicrobial resistance drives
the pharmaceutical industry to look for new therapeutic agents. Compared to organic small drugs, peptide-
based therapy exhibits high specificity and minimal toxicity. Thus, peptides are widely recruited in
the design and discovery of new potent drugs. Currently, large-scale screening of peptide activity with
traditional approaches is costly, time-consuming and labor-intensive. Hence, in silico methods, mainly
machine learning approaches, for their accuracy and effectiveness, have been introduced to predict the
peptide activity. In this review, we document the recent progress in machine learning-based prediction
of peptides which will be of great benefit to the discovery of potential active AMPs, ACPs and AIPs.
Collapse
Affiliation(s)
- Qihui Wu
- Institute of Clinical Pharmacology, Guangzhou University of Chinese Medicine, Guangzhou 510405, China
| | - Hanzhong Ke
- Department of Pathobiology, University of Illinois at Urbana-Champaign, Urbana, Illinois 61802, United States
| | - Dongli Li
- Institute of Clinical Pharmacology, Guangzhou University of Chinese Medicine, Guangzhou 510405, China
| | - Qi Wang
- Institute of Clinical Pharmacology, Guangzhou University of Chinese Medicine, Guangzhou 510405, China
| | - Jiansong Fang
- Institute of Clinical Pharmacology, Guangzhou University of Chinese Medicine, Guangzhou 510405, China
| | - Jingwei Zhou
- Institute of Clinical Pharmacology, Guangzhou University of Chinese Medicine, Guangzhou 510405, China
| |
Collapse
|
230
|
Ayala‐Ruano S, Santander‐Gordón D, Tejera E, Perez‐Castillo Y, Armijos-Jaramillo V. A putative antimicrobial peptide from Hymenoptera in the megaplasmid pSCL4 of Streptomyces clavuligerus ATCC 27064 reveals a singular case of horizontal gene transfer with potential applications. Ecol Evol 2019; 9:2602-2614. [PMID: 30891203 PMCID: PMC6406012 DOI: 10.1002/ece3.4924] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2018] [Revised: 12/31/2018] [Accepted: 01/02/2019] [Indexed: 11/06/2022] Open
Abstract
Streptomyces clavuligerus is a Gram-positive bacterium that is a high producer of secondary metabolites with industrial applications. The production of antibiotics such as clavulanic acid or cephamycin has been extensively studied in this species; nevertheless, other aspects, such as evolution or ecology, have received less attention. Furthermore, genes that arise from ancient events of lateral transfer have been demonstrated to be implicated in important functions of host species. This approximation discovered relevant genes that genomic analyses overlooked. Thus, we studied the impact of horizontal gene transfer in the S. clavuligerus genome. To perform this task, we applied whole-genome analysis to identify a laterally transferred sequence from different domains. The most relevant result was a putative antimicrobial peptide (AMP) with a clear origin in the Hymenoptera order of insects. Next, we determined that two copies of these genes were present in the megaplasmid pSCL4 but absent in the S. clavuligerus ATCC 27064 chromosome. Additionally, we found that these sequences were exclusive to the ATCC 27064 strain (and so were not present in any other bacteria) and we also verified the expression of the genes using RNAseq data. Next, we used several AMP predictors to validate the original annotation extracted from Hymenoptera sequences and explored the possibility that these proteins had post-translational modifications using peptidase cleavage prediction. We suggest that Hymenoptera AMP-like proteins of S. clavuligerus ATCC 27064 may be useful for both species adaptation and as an antimicrobial molecule with industrial applications.
Collapse
Affiliation(s)
- Sebastián Ayala‐Ruano
- Universidad San Francisco de Quito, Colegio de Ciencias Biológicas y Ambientales (COCIBA‐USFQ)QuitoEcuador
| | - Daniela Santander‐Gordón
- Carrera de Ingeniería en Biotecnología, Facultad de Ingeniería y Ciencias AplicadasUniversidad de Las AméricasQuitoEcuador
| | - Eduardo Tejera
- Carrera de Ingeniería en Biotecnología, Facultad de Ingeniería y Ciencias AplicadasUniversidad de Las AméricasQuitoEcuador
- Grupo de Bio‐QuimioinformáticaUniversidad de Las AméricasQuitoEcuador
| | - Yunierkis Perez‐Castillo
- Grupo de Bio‐QuimioinformáticaUniversidad de Las AméricasQuitoEcuador
- Ciencias Físicas y Matemáticas‐Facultad de Formación GeneralUniversidad de Las AméricasQuitoEcuador
| | - Vinicio Armijos-Jaramillo
- Carrera de Ingeniería en Biotecnología, Facultad de Ingeniería y Ciencias AplicadasUniversidad de Las AméricasQuitoEcuador
- Grupo de Bio‐QuimioinformáticaUniversidad de Las AméricasQuitoEcuador
| |
Collapse
|
231
|
Nussinov R, Tsai CJ, Shehu A, Jang H. Computational Structural Biology: Successes, Future Directions, and Challenges. Molecules 2019; 24:molecules24030637. [PMID: 30759724 PMCID: PMC6384756 DOI: 10.3390/molecules24030637] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2019] [Revised: 02/05/2019] [Accepted: 02/10/2019] [Indexed: 02/06/2023] Open
Abstract
Computational biology has made powerful advances. Among these, trends in human health have been uncovered through heterogeneous 'big data' integration, and disease-associated genes were identified and classified. Along a different front, the dynamic organization of chromatin is being elucidated to gain insight into the fundamental question of genome regulation. Powerful conformational sampling methods have also been developed to yield a detailed molecular view of cellular processes. when combining these methods with the advancements in the modeling of supramolecular assemblies, including those at the membrane, we are finally able to get a glimpse into how cells' actions are regulated. Perhaps most intriguingly, a major thrust is on to decipher the mystery of how the brain is coded. Here, we aim to provide a broad, yet concise, sketch of modern aspects of computational biology, with a special focus on computational structural biology. We attempt to forecast the areas that computational structural biology will embrace in the future and the challenges that it may face. We skirt details, highlight successes, note failures, and map directions.
Collapse
Affiliation(s)
- Ruth Nussinov
- Computational Structural Biology Section, Basic Science Program, Frederick National Laboratory for Cancer Research, Frederick, MD 21702, USA.
- Sackler Institute of Molecular Medicine, Department of Human Genetics and Molecular Medicine, Sackler School of Medicine, Tel Aviv University, Tel Aviv 69978, Israel.
| | - Chung-Jung Tsai
- Computational Structural Biology Section, Basic Science Program, Frederick National Laboratory for Cancer Research, Frederick, MD 21702, USA.
| | - Amarda Shehu
- Departments of Computer Science, Department of Bioengineering, and School of Systems Biology, George Mason University, Fairfax, VA 22030, USA.
| | - Hyunbum Jang
- Computational Structural Biology Section, Basic Science Program, Frederick National Laboratory for Cancer Research, Frederick, MD 21702, USA.
| |
Collapse
|
232
|
Xue L, Tang B, Chen W, Luo J. DeepT3: deep convolutional neural networks accurately identify Gram-negative bacterial type III secreted effectors using the N-terminal sequence. Bioinformatics 2018; 35:2051-2057. [DOI: 10.1093/bioinformatics/bty931] [Citation(s) in RCA: 28] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2018] [Revised: 10/22/2018] [Accepted: 11/07/2018] [Indexed: 11/12/2022] Open
Affiliation(s)
- Li Xue
- School of Public Health, Southwest Medical University, Luzhou, Sichuan, PR, China
| | - Bin Tang
- Basic Medical College of Southwest Medical University, Luzhou, Sichuan, PR, China
| | - Wei Chen
- Integrative Genomics Core, City of Hope National Medical Center, Duarte, CA, USA
| | - Jiesi Luo
- Key Laboratory for Aging and Regenerative Medicine, Department of Pharmacology, School of Pharmacy, Southwest Medical University, Luzhou, Sichuan, China
| |
Collapse
|
233
|
Agrawal P, Raghava GPS. Prediction of Antimicrobial Potential of a Chemically Modified Peptide From Its Tertiary Structure. Front Microbiol 2018; 9:2551. [PMID: 30416494 PMCID: PMC6212470 DOI: 10.3389/fmicb.2018.02551] [Citation(s) in RCA: 29] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2018] [Accepted: 10/05/2018] [Indexed: 12/14/2022] Open
Abstract
Designing novel antimicrobial peptides is a hot area of research in the field of therapeutics especially after the emergence of resistant strains against the conventional antibiotics. In the past number of in silico methods have been developed for predicting the antimicrobial property of the peptide containing natural residues. This study describes models developed for predicting the antimicrobial property of a chemically modified peptide. Our models have been trained, tested and evaluated on a dataset that contains 948 antimicrobial and 931 non-antimicrobial peptides, containing chemically modified and natural residues. Firstly, the tertiary structure of all peptides has been predicted using software PEPstrMOD. Structure analysis indicates that certain type of modifications enhance the antimicrobial property of peptides. Secondly, a wide range of features was computed from the structure of these peptides using software PaDEL. Finally, models were developed for predicting the antimicrobial potential of chemically modified peptides using a wide range of structural features of these peptides. Our best model based on support vector machine achieve maximum MCC of 0.84 with an accuracy of 91.62% on training dataset and MCC of 0.80 with an accuracy of 89.89% on validation dataset. To assist the scientific community, we have developed a web server called "AntiMPmod" which predicts the antimicrobial property of the chemically modified peptide. The web server is present at the following link (http://webs.iiitd.edu.in/raghava/antimpmod/).
Collapse
Affiliation(s)
- Piyush Agrawal
- CSIR-Institute of Microbial Technology, Chandigarh, India.,Center for Computational Biology, Indraprastha Institute of Information Technology, Delhi, New Delhi, India
| | - Gajendra P S Raghava
- Center for Computational Biology, Indraprastha Institute of Information Technology, Delhi, New Delhi, India
| |
Collapse
|
234
|
Thurston BA, Ferguson AL. Machine learning and molecular design of self-assembling -conjugated oligopeptides. MOLECULAR SIMULATION 2018. [DOI: 10.1080/08927022.2018.1469754] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/16/2022]
Affiliation(s)
- Bryce A. Thurston
- Department of Physics, University of Illinois at Urbana-Champaign , Urbana, IL, USA
| | - Andrew L. Ferguson
- Department of Physics, University of Illinois at Urbana-Champaign , Urbana, IL, USA
- Department of Materials Science and Engineering, University of Illinois at Urbana-Champaign , Urbana, IL, USA
- Department of Chemical and Biomolecular Engineering, University of Illinois at Urbana-Champaign , Urbana, IL, USA
| |
Collapse
|