1
|
Angaitkar P, Aljrees T, Kumar Pandey S, Kumar A, Janghel RR, Sahu TP, Singh KU, Singh T. Inferring linear-B cell epitopes using 2-step metaheuristic variant-feature selection using genetic algorithm. Sci Rep 2023; 13:14593. [PMID: 37670007 PMCID: PMC10480427 DOI: 10.1038/s41598-023-41179-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2023] [Accepted: 08/23/2023] [Indexed: 09/07/2023] Open
Abstract
Linear-B cell epitopes (LBCE) play a vital role in vaccine design; thus, efficiently detecting them from protein sequences is of primary importance. These epitopes consist of amino acids arranged in continuous or discontinuous patterns. Vaccines employ attenuated viruses and purified antigens. LBCE stimulate humoral immunity in the body, where B and T cells target circulating infections. To predict LBCE, the underlying protein sequences undergo a process of feature extraction, feature selection, and classification. Various system models have been proposed for this purpose, but their classification accuracy is only moderate. In order to enhance the accuracy of LBCE classification, this paper presents a novel 2-step metaheuristic variant-feature selection method that combines a linear support vector classifier (LSVC) with a Modified Genetic Algorithm (MGA). The feature selection model employs mono-peptide, dipeptide, and tripeptide features, focusing on the most diverse ones. These selected features are fed into a machine learning (ML)-based parallel ensemble classifier. The ensemble classifier combines correctly classified instances from various classifiers, including k-Nearest Neighbor (kNN), random forest (RF), logistic regression (LR), and support vector machine (SVM). The ensemble classifier came up with an impressively high accuracy of 99.3% as a result of its work. This accuracy is superior to the most recent models that are considered to be state-of-the-art for linear B-cell classification. As a direct consequence of this, the entire system model can now be utilised effectively in real-time clinical settings.
Collapse
Affiliation(s)
- Pratik Angaitkar
- Department of Information Technology, National Institute of Technology, Raipur, G.E. Road, Raipur, 492010, Chhattisgarh, India
| | - Turki Aljrees
- College of Computer Science and Engineering, University of Hafr Al Batin, 39524, Hafar Al Batin, Saudi Arabia
| | - Saroj Kumar Pandey
- Department of Computer Engineering & Applications, GLA University, Mathura, India
| | - Ankit Kumar
- Department of Computer Engineering & Applications, GLA University, Mathura, India.
| | - Rekh Ram Janghel
- Department of Information Technology, National Institute of Technology, Raipur, G.E. Road, Raipur, 492010, Chhattisgarh, India
| | - Tirath Prasad Sahu
- Department of Information Technology, National Institute of Technology, Raipur, G.E. Road, Raipur, 492010, Chhattisgarh, India
| | | | - Teekam Singh
- Department of Computer Science and Engineering, Graphic Era Deemed to be University, Dehradun, 248002, Uttarakhand, India
| |
Collapse
|
2
|
Abstract
Antibody-protein interactions play a critical role in the humoral immune response. B-cells secrete antibodies, which bind antigens (e.g., cell surface proteins of pathogens). The specific parts of antigens that are recognized by antibodies are called B-cell epitopes. These epitopes can be linear, corresponding to a contiguous amino acid sequence fragment of an antigen, or conformational, in which residues critical for recognition may not be contiguous in the primary sequence, but are in close proximity within the folded protein 3D structure.Identification of B-cell epitopes in target antigens is one of the key steps in epitope-driven subunit vaccine design, immunodiagnostic tests, and antibody production. In silico bioinformatics techniques offer a promising and cost-effective approach for identifying potential B-cell epitopes in a target vaccine candidate. In this chapter, we show how to utilize online B-cell epitope prediction tools to identify linear B-cell epitopes from the primary amino acid sequence of proteins.
Collapse
|
3
|
Abstract
Immunomics is a relatively new field of research which integrates the disciplines of immunology, genomics, proteomics, transcriptomics and bioinformatics to characterize the host-pathogen interface. Herein, we discuss how rapid advances in molecular immunology, sophisticated tools and molecular databases are facilitating in-depth exploration of the immunome. In our opinion, an immunomics-based approach presides over traditional antigen and vaccine discovery methods that have proved ineffective for highly complex pathogens such as the causative agents of malaria, tuberculosis and schistosomiasis that have evolved genetic and immunological host-parasite adaptations over time. By using an integrative multidisciplinary approach, immunomics offers enormous potential to advance 21st century antigen discovery and rational vaccine design against complex pathogens such as the Plasmodium parasite.
Collapse
|
4
|
Gupta S, Madhu MK, Sharma AK, Sharma VK. ProInflam: a webserver for the prediction of proinflammatory antigenicity of peptides and proteins. J Transl Med 2016; 14:178. [PMID: 27301453 PMCID: PMC4908730 DOI: 10.1186/s12967-016-0928-3] [Citation(s) in RCA: 41] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2016] [Accepted: 05/30/2016] [Indexed: 12/12/2022] Open
Abstract
Background Proinflammatory immune response involves a complex series of molecular events leading to inflammatory reaction at a site, which enables host to combat plurality of infectious agents. It can be initiated by specific stimuli such as viral, bacterial, parasitic or allergenic antigens, or by non-specific stimuli such as LPS. On counter with such antigens, the complex interaction of antigen presenting cells, T cells and inflammatory mediators like IL1α, IL1β, TNFα, IL12, IL18 and IL23 lead to proinflammatory immune response and further clearance of infection. In this study, we have tried to establish a relation between amino acid sequence of antigen and induction of proinflammatory response. Results A total of 729 experimentally-validated proinflammatory and 171 non-proinflammatory epitopes were obtained from IEDB database. The A, F, I, L and V amino acids and AF, FA, FF, PF, IV, IN dipeptides were observed as preferred residues in proinflammatory epitopes. Using the compositional and motif-based features of proinflammatory and non-proinflammatory epitopes, we have developed machine learning-based models for prediction of proinflammatory response of peptides. The hybrid of motifs and dipeptide-based features displayed best performance with MCC = 0.58 and an accuracy of 87.6 %. Conclusion The amino acid sequence-based features of peptides were used to develop a machine learning-based prediction tool for the prediction of proinflammatory epitopes. This is a unique tool for the computational identification of proinflammatory peptide antigen/candidates and provides leads for experimental validations. The prediction model and tools for epitope mapping and similarity search are provided as a comprehensive web server which is freely available at http://metagenomics.iiserb.ac.in/proinflam/ and http://metabiosys.iiserb.ac.in/proinflam/. Electronic supplementary material The online version of this article (doi:10.1186/s12967-016-0928-3) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Sudheer Gupta
- Metagenomics and Systems Biology Group, Department of Biological Sciences, Indian Institute of Science Education and Research Bhopal, Bhopal, Madhya Pradesh, India
| | - Midhun K Madhu
- Metagenomics and Systems Biology Group, Department of Biological Sciences, Indian Institute of Science Education and Research Bhopal, Bhopal, Madhya Pradesh, India
| | - Ashok K Sharma
- Metagenomics and Systems Biology Group, Department of Biological Sciences, Indian Institute of Science Education and Research Bhopal, Bhopal, Madhya Pradesh, India
| | - Vineet K Sharma
- Metagenomics and Systems Biology Group, Department of Biological Sciences, Indian Institute of Science Education and Research Bhopal, Bhopal, Madhya Pradesh, India.
| |
Collapse
|
5
|
Computational Prediction of Immunodominant Epitopes on Outer Membrane Protein (Omp) H of Pasteurella multocida Toward Designing of a Peptide Vaccine. Methods Mol Biol 2016. [PMID: 27076289 DOI: 10.1007/978-1-4939-3389-1_3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/09/2023]
Abstract
Contemporary vaccine design necessitates discrimination between the immunogenic and non-immunogenic components within a pathogen. To successfully target a humoral immune response, the vaccine antigen should contain not only B-cell epitopes but abounding Th-cell agretopes and MHC-II binding regions as well. No single computational method is available that allows the identification of such regions on antigens with good reliability. A consensus approach based on several prediction methods can be adopted to overcome this problem.Targeting the outer membrane protein (Omp) H as a candidate, a comprehensive work flow is described for the computational identification of immunodominant epitopes toward the designing of a peptide vaccine against Pasteurella multocida.
Collapse
|
6
|
Caoili SEC. An integrative structure-based framework for predicting biological effects mediated by antipeptide antibodies. J Immunol Methods 2015; 427:19-29. [PMID: 26410103 DOI: 10.1016/j.jim.2015.09.002] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2015] [Revised: 08/30/2015] [Accepted: 09/20/2015] [Indexed: 01/18/2023]
Abstract
A general framework is presented for predicting quantitative biological effects mediated by antipeptide antibodies, primarily on the basis of antigen structure (possibly featuring intrinsic disorder) analyzed to estimate epitope-paratope binding affinities, which in turn is considered within the context of dose-response relationships as regards antibody concentration. This is illustrated mainly using an approach based on protein structural energetics, whereby expected amounts of solvent-accessible surface area buried upon epitope-paratope binding are related to the corresponding binding affinity, which is estimated from putative B-cell epitope structure with implicit treatment of paratope structure, for antipeptide antibodies either reacting with peptides or cross-reacting with cognate protein antigens. Key methods described are implemented in SAPPHIRE/SUITE (Structural-energetic Analysis Program for Predicting Humoral Immune Response Epitopes/SAPPHIRE User Interface Tool Ensemble; publicly accessible via http://freeshell.de/~badong/suite.htm). Representative results thus obtained are compared with published experimental data on binding affinities and quantitative biological effects, with special attention to loss of paratope sidechain conformational entropy (neglected in previous analyses) and in light of key in-vivo constraints on antigen-antibody binding affinity and antibody-mediated effects. Implications for further refinement of B-cell epitope prediction methods are discussed as regards envisioned biomedical applications including the development of prophylactic and therapeutic antibodies, peptide-based vaccines and immunodiagnostics.
Collapse
Affiliation(s)
- Salvador Eugenio C Caoili
- Department of Biochemistry and Molecular Biology, College of Medicine, University of the Philippines Manila, Manila, Philippines.
| |
Collapse
|
7
|
Lian Y, Ge M, Pan XM. EPMLR: sequence-based linear B-cell epitope prediction method using multiple linear regression. BMC Bioinformatics 2014; 15:414. [PMID: 25523327 PMCID: PMC4307399 DOI: 10.1186/s12859-014-0414-y] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2014] [Accepted: 12/09/2014] [Indexed: 11/10/2022] Open
Abstract
Background B-cell epitopes have been studied extensively due to their immunological applications, such as peptide-based vaccine development, antibody production, and disease diagnosis and therapy. Despite several decades of research, the accurate prediction of linear B-cell epitopes has remained a challenging task. Results In this work, based on the antigen’s primary sequence information, a novel linear B-cell epitope prediction model was developed using the multiple linear regression (MLR). A 10-fold cross-validation test on a large non-redundant dataset was performed to evaluate the performance of our model. To alleviate the problem caused by the noise of negative dataset, 300 experiments utilizing 300 sub-datasets were performed. We achieved overall sensitivity of 81.8%, precision of 64.1% and area under the receiver operating characteristic curve (AUC) of 0.728. Conclusions We have presented a reliable method for the identification of linear B cell epitope using antigen’s primary sequence information. Moreover, a web server EPMLR has been developed for linear B-cell epitope prediction: http://www.bioinfo.tsinghua.edu.cn/epitope/EPMLR/. Electronic supplementary material The online version of this article (doi:10.1186/s12859-014-0414-y) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Yao Lian
- The Key Laboratory of Bioinformatics, Ministry of Education, School of Life Sciences, Tsinghua University, Beijing, 100084, China.
| | - Meng Ge
- CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing, China.
| | - Xian-Ming Pan
- The Key Laboratory of Bioinformatics, Ministry of Education, School of Life Sciences, Tsinghua University, Beijing, 100084, China.
| |
Collapse
|
8
|
Soria-Guerra RE, Nieto-Gomez R, Govea-Alonso DO, Rosales-Mendoza S. An overview of bioinformatics tools for epitope prediction: implications on vaccine development. J Biomed Inform 2014; 53:405-14. [PMID: 25464113 DOI: 10.1016/j.jbi.2014.11.003] [Citation(s) in RCA: 254] [Impact Index Per Article: 25.4] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2014] [Revised: 09/16/2014] [Accepted: 11/03/2014] [Indexed: 10/24/2022]
Abstract
Exploitation of recombinant DNA and sequencing technologies has led to a new concept in vaccination in which isolated epitopes, capable of stimulating a specific immune response, have been identified and used to achieve advanced vaccine formulations; replacing those constituted by whole pathogen-formulations. In this context, bioinformatics approaches play a critical role on analyzing multiple genomes to select the protective epitopes in silico. It is conceived that cocktails of defined epitopes or chimeric protein arrangements, including the target epitopes, may provide a rationale design capable to elicit convenient humoral or cellular immune responses. This review presents a comprehensive compilation of the most advantageous online immunological software and searchable, in order to facilitate the design and development of vaccines. An outlook on how these tools are supporting vaccine development is presented. HIV and influenza have been taken as examples of promising developments on vaccination against hypervariable viruses. Perspectives in this field are also envisioned.
Collapse
Affiliation(s)
- Ruth E Soria-Guerra
- Laboratorio de Ingeniería de Biorreactores, Facultad de Ciencias Químicas, Universidad Autónoma de San Luis Potosí, Av. Dr. Manuel Nava 6, SLP 78210, Mexico
| | - Ricardo Nieto-Gomez
- Laboratorio de Biofarmacéuticos Recombinantes, Facultad de Ciencias Químicas, Universidad Autónoma de San Luis Potosí, Av. Dr. Manuel Nava 6, SLP 78210, Mexico
| | - Dania O Govea-Alonso
- Laboratorio de Biofarmacéuticos Recombinantes, Facultad de Ciencias Químicas, Universidad Autónoma de San Luis Potosí, Av. Dr. Manuel Nava 6, SLP 78210, Mexico
| | - Sergio Rosales-Mendoza
- Laboratorio de Biofarmacéuticos Recombinantes, Facultad de Ciencias Químicas, Universidad Autónoma de San Luis Potosí, Av. Dr. Manuel Nava 6, SLP 78210, Mexico.
| |
Collapse
|