1
|
Ananya, Panchariya DC, Karthic A, Singh SP, Mani A, Chawade A, Kushwaha S. Vaccine design and development: Exploring the interface with computational biology and AI. Int Rev Immunol 2024:1-20. [PMID: 38982912 DOI: 10.1080/08830185.2024.2374546] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2024] [Accepted: 06/26/2024] [Indexed: 07/11/2024]
Abstract
Computational biology involves applying computer science and informatics techniques in biology to understand complex biological data. It allows us to collect, connect, and analyze biological data at a large scale and build predictive models. In the twenty first century, computational resources along with Artificial Intelligence (AI) have been widely used in various fields of biological sciences such as biochemistry, structural biology, immunology, microbiology, and genomics to handle massive data for decision-making, including in applications such as drug design and vaccine development, one of the major areas of focus for human and animal welfare. The knowledge of available computational resources and AI-enabled tools in vaccine design and development can improve our ability to conduct cutting-edge research. Therefore, this review article aims to summarize important computational resources and AI-based tools. Further, the article discusses the various applications and limitations of AI tools in vaccine development.
Collapse
Affiliation(s)
- Ananya
- National Institute of Animal Biotechnology, Hyderabad, India
| | | | | | | | - Ashutosh Mani
- Motilal Nehru National Institute of Technology, Prayagraj, India
| | - Aakash Chawade
- Swedish University of Agricultural Sciences, Alnarp, Sweden
| | | |
Collapse
|
2
|
Deng N, Sinha KM, Vilar E. MONET: a database for prediction of neoantigens derived from microsatellite loci. Front Immunol 2024; 15:1394593. [PMID: 38835776 PMCID: PMC11148240 DOI: 10.3389/fimmu.2024.1394593] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2024] [Accepted: 05/03/2024] [Indexed: 06/06/2024] Open
Abstract
Background Microsatellite instability (MSI) secondary to mismatch repair (MMR) deficiency is characterized by insertions and deletions (indels) in short DNA sequences across the genome. These indels can generate neoantigens, which are ideal targets for precision immune interception. However, current neoantigen databases lack information on neoantigens arising from coding microsatellites. To address this gap, we introduce The MicrOsatellite Neoantigen Discovery Tool (MONET). Method MONET identifies potential mutated tumor-specific neoantigens (neoAgs) by predicting frameshift mutations in coding microsatellite sequences of the human genome. Then MONET annotates these neoAgs with key features such as binding affinity, stability, expression, frequency, and potential pathogenicity using established algorithms, tools, and public databases. A user-friendly web interface (https://monet.mdanderson.org/) facilitates access to these predictions. Results MONET predicts over 4 million and 15 million Class I and Class II potential frameshift neoAgs, respectively. Compared to existing databases, MONET demonstrates superior coverage (>85% vs. <25%) using a set of experimentally validated neoAgs. Conclusion MONET is a freely available, user-friendly web tool that leverages publicly available resources to identify neoAgs derived from microsatellite loci. This systems biology approach empowers researchers in the field of precision immune interception.
Collapse
Affiliation(s)
- Nan Deng
- Department of Clinical Cancer Prevention, The University of Texas MD Anderson Cancer Center, Houston, TX, United States
| | - Krishna M Sinha
- Department of Clinical Cancer Prevention, The University of Texas MD Anderson Cancer Center, Houston, TX, United States
| | - Eduardo Vilar
- Department of Clinical Cancer Prevention, The University of Texas MD Anderson Cancer Center, Houston, TX, United States
- Department of Gastrointestinal Medical Oncology, The University of Texas MD Anderson Cancer Center, Houston, TX, United States
- Department of Clinical Cancer Genetics Program, The University of Texas MD Anderson Cancer Center, Houston, TX, United States
| |
Collapse
|
3
|
Barra C, Nilsson JB, Saksager A, Carri I, Deleuran S, Garcia Alvarez HM, Høie MH, Li Y, Clifford JN, Wan YTR, Moreta LS, Nielsen M. In Silico Tools for Predicting Novel Epitopes. Methods Mol Biol 2024; 2813:245-280. [PMID: 38888783 DOI: 10.1007/978-1-0716-3890-3_17] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/20/2024]
Abstract
Identifying antigens within a pathogen is a critical task to develop effective vaccines and diagnostic methods, as well as understanding the evolution and adaptation to host immune responses. Historically, antigenicity was studied with experiments that evaluate the immune response against selected fragments of pathogens. Using this approach, the scientific community has gathered abundant information regarding which pathogenic fragments are immunogenic. The systematic collection of this data has enabled unraveling many of the fundamental rules underlying the properties defining epitopes and immunogenicity, and has resulted in the creation of a large panel of immunologically relevant predictive (in silico) tools. The development and application of such tools have proven to accelerate the identification of novel epitopes within biomedical applications reducing experimental costs. This chapter introduces some basic concepts about MHC presentation, T cell and B cell epitopes, the experimental efforts to determine those, and focuses on state-of-the-art methods for epitope prediction, highlighting their strengths and limitations, and catering instructions for their rational use.
Collapse
Affiliation(s)
- Carolina Barra
- Section for Bioinformatics, Health Tech, Technical University of Denmark, Lyngby, Denmark.
| | | | - Astrid Saksager
- Section for Bioinformatics, Health Tech, Technical University of Denmark, Lyngby, Denmark
| | - Ibel Carri
- Instituto de Investigaciones Biotecnológicas, Universidad Nacional de San Martín (UNSAM) - Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), San Martín, Argentina
| | - Sebastian Deleuran
- Section for Bioinformatics, Health Tech, Technical University of Denmark, Lyngby, Denmark
| | - Heli M Garcia Alvarez
- Instituto de Investigaciones Biotecnológicas, Universidad Nacional de San Martín (UNSAM) - Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), San Martín, Argentina
| | - Magnus Haraldson Høie
- Section for Bioinformatics, Health Tech, Technical University of Denmark, Lyngby, Denmark
| | - Yuchen Li
- Section for Bioinformatics, Health Tech, Technical University of Denmark, Lyngby, Denmark
| | | | - Yat-Tsai Richie Wan
- Section for Bioinformatics, Health Tech, Technical University of Denmark, Lyngby, Denmark
| | - Lys Sanz Moreta
- Section for Bioinformatics, Health Tech, Technical University of Denmark, Lyngby, Denmark
| | - Morten Nielsen
- Section for Bioinformatics, Health Tech, Technical University of Denmark, Lyngby, Denmark
- Instituto de Investigaciones Biotecnológicas, Universidad Nacional de San Martín (UNSAM) - Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), San Martín, Argentina
| |
Collapse
|
4
|
Kumar N, Bajiya N, Patiyal S, Raghava GPS. Multi-perspectives and challenges in identifying B-cell epitopes. Protein Sci 2023; 32:e4785. [PMID: 37733481 PMCID: PMC10578127 DOI: 10.1002/pro.4785] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2023] [Revised: 09/11/2023] [Accepted: 09/16/2023] [Indexed: 09/23/2023]
Abstract
The identification of B-cell epitopes (BCEs) in antigens is a crucial step in developing recombinant vaccines or immunotherapies for various diseases. Over the past four decades, numerous in silico methods have been developed for predicting BCEs. However, existing reviews have only covered specific aspects, such as the progress in predicting conformational or linear BCEs. Therefore, in this paper, we have undertaken a systematic approach to provide a comprehensive review covering all aspects associated with the identification of BCEs. First, we have covered the experimental techniques developed over the years for identifying linear and conformational epitopes, including the limitations and challenges associated with these techniques. Second, we have briefly described the historical perspectives and resources that maintain experimentally validated information on BCEs. Third, we have extensively reviewed the computational methods developed for predicting conformational BCEs from the structure of the antigen, as well as the methods for predicting conformational epitopes from the sequence. Fourth, we have systematically reviewed the in silico methods developed in the last four decades for predicting linear or continuous BCEs. Finally, we have discussed the overall challenge of identifying continuous or conformational BCEs. In this review, we only listed major computational resources; a complete list with the URL is available from the BCinfo website (https://webs.iiitd.edu.in/raghava/bcinfo/).
Collapse
Affiliation(s)
- Nishant Kumar
- Department of Computational BiologyIndraprastha Institute of Information TechnologyNew DelhiIndia
| | - Nisha Bajiya
- Department of Computational BiologyIndraprastha Institute of Information TechnologyNew DelhiIndia
| | - Sumeet Patiyal
- Department of Computational BiologyIndraprastha Institute of Information TechnologyNew DelhiIndia
| | - Gajendra P. S. Raghava
- Department of Computational BiologyIndraprastha Institute of Information TechnologyNew DelhiIndia
| |
Collapse
|
5
|
Zhang X, Wu J, Baeza J, Gu K, Zheng Y, Chen S, Zhou Z. DeepTAP: An RNN-based method of TAP-binding peptide prediction in the selection of tumor neoantigens. Comput Biol Med 2023; 164:107247. [PMID: 37454505 DOI: 10.1016/j.compbiomed.2023.107247] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2023] [Revised: 05/31/2023] [Accepted: 07/07/2023] [Indexed: 07/18/2023]
Abstract
The transport of peptides from the cytoplasm to the endoplasmic reticulum (ER) by transporters associated with antigen processing (TAP) is a critical step in the intracellular presentation of cytotoxic T lymphocyte (CTL) epitopes. The development and application of computational methods, especially deep learning methods and new neural network strategies that can automatically learn feature representations with limited knowledge, provide an opportunity to develop fast and efficient methods to identify TAP-binding peptides. Herein, this study presents a comprehensive analysis of TAP-binding peptide sequences to derive TAP-binding motifs and preferences for N-terminal and C-terminal amino acids. A novel recurrent neural network (RNN)-based method called DeepTAP, using bidirectional gated recurrent unit (BiGRU), was developed for the accurate prediction of TAP-binding peptides. Our results demonstrated that DeepTAP achieves an optimal balance between prediction precision and false positives, outperforming other baseline models. Furthermore, DeepTAP significantly improves the prediction accuracy of high-confidence neoantigens, especially the top-ranked ones, making it a valuable tool for researchers studying antigen presentation processes and T-cell epitope screening. DeepTAP is freely available at https://github.com/zjupgx/deeptap and https://pgx.zju.edu.cn/deeptap.
Collapse
Affiliation(s)
- Xue Zhang
- National Key Laboratory of Advanced Drug Delivery and Release Systems & Zhejiang Provincial Key Laboratory of Anti-Cancer Drug Research, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058, China
| | - Jingcheng Wu
- National Key Laboratory of Advanced Drug Delivery and Release Systems & Zhejiang Provincial Key Laboratory of Anti-Cancer Drug Research, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058, China
| | - Joseph Baeza
- Biology Program, Iowa State University, Ames, IA, 50011, USA
| | - Katie Gu
- Biology Program, Washington University in St. Louis, St. Louis, MO, 63130, USA
| | - Yichun Zheng
- The Fourth Affiliated Hospital, Zhejiang University School of Medicine, Yiwu, 322000, China.
| | - Shuqing Chen
- National Key Laboratory of Advanced Drug Delivery and Release Systems & Zhejiang Provincial Key Laboratory of Anti-Cancer Drug Research, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058, China.
| | - Zhan Zhou
- National Key Laboratory of Advanced Drug Delivery and Release Systems & Zhejiang Provincial Key Laboratory of Anti-Cancer Drug Research, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058, China; The Fourth Affiliated Hospital, Zhejiang University School of Medicine, Yiwu, 322000, China; Innovation Institute for Artificial Intelligence in Medicine, Zhejiang University, Hangzhou, 310018, China; Alibaba-Zhejiang University Joint Research Center of Future Digital Healthcare, Hangzhou, 310058, China.
| |
Collapse
|
6
|
<i>In silico</i> Research at the Stages of Designing Modern Means for Prevention of Plague (by the Example of Subunit Vaccines). PROBLEMS OF PARTICULARLY DANGEROUS INFECTIONS 2022. [DOI: 10.21055/0370-1069-2022-3-6-13] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/16/2023]
Abstract
The purpose of this review was to analyze the findings of domestic and foreign researchers on the development of modern drugs for the specific prevention of plague and to illustrate the possibilities of using bioinformatics analysis at the design stages to create an effective and safe vaccine. Work on the creation of an effective new-generation plague vaccine is hampered by several factors associated primarily with the presence of mechanisms of evasion from the immune system of the macroorganism, as well as a large number of pathogenicity determinants in the plague agent. Due to the development of approaches that are based on in silico studies, there is a progressive development of vaccine technologies oriented primarily to the use of the most important immunogens of the plague microbe (F1 and V antigen). Studies aimed at improving the antigenic properties of F1 and LcrV, as well as work on bioinformatic search and analysis of additional promising components to be included in the composition of subunit vaccines are considered as topical applications of bioinformatics data analysis in developing the tools for enhancing the effectiveness of protection through vaccination with subunit preparations.
Collapse
|
7
|
Kumar S, Kumar GS, Maitra SS, Malý P, Bharadwaj S, Sharma P, Dwivedi VD. Viral informatics: bioinformatics-based solution for managing viral infections. Brief Bioinform 2022; 23:6659740. [PMID: 35947964 DOI: 10.1093/bib/bbac326] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2022] [Revised: 06/26/2022] [Accepted: 07/18/2022] [Indexed: 11/13/2022] Open
Abstract
Several new viral infections have emerged in the human population and establishing as global pandemics. With advancements in translation research, the scientific community has developed potential therapeutics to eradicate or control certain viral infections, such as smallpox and polio, responsible for billions of disabilities and deaths in the past. Unfortunately, some viral infections, such as dengue virus (DENV) and human immunodeficiency virus-1 (HIV-1), are still prevailing due to a lack of specific therapeutics, while new pathogenic viral strains or variants are emerging because of high genetic recombination or cross-species transmission. Consequently, to combat the emerging viral infections, bioinformatics-based potential strategies have been developed for viral characterization and developing new effective therapeutics for their eradication or management. This review attempts to provide a single platform for the available wide range of bioinformatics-based approaches, including bioinformatics methods for the identification and management of emerging or evolved viral strains, genome analysis concerning the pathogenicity and epidemiological analysis, computational methods for designing the viral therapeutics, and consolidated information in the form of databases against the known pathogenic viruses. This enriched review of the generally applicable viral informatics approaches aims to provide an overview of available resources capable of carrying out the desired task and may be utilized to expand additional strategies to improve the quality of translation viral informatics research.
Collapse
Affiliation(s)
- Sanjay Kumar
- School of Biotechnology, Jawaharlal Nehru University, New Delhi, India.,Center for Bioinformatics, Computational and Systems Biology, Pathfinder Research and Training Foundation, Greater Noida, India
| | - Geethu S Kumar
- Department of Life Science, School of Basic Science and Research, Sharda University, Greater Noida, Uttar Pradesh, India.,Center for Bioinformatics, Computational and Systems Biology, Pathfinder Research and Training Foundation, Greater Noida, India
| | | | - Petr Malý
- Laboratory of Ligand Engineering, Institute of Biotechnology of the Czech Academy of Sciences v.v.i., BIOCEV Research Center, Vestec, Czech Republic
| | - Shiv Bharadwaj
- Laboratory of Ligand Engineering, Institute of Biotechnology of the Czech Academy of Sciences v.v.i., BIOCEV Research Center, Vestec, Czech Republic
| | - Pradeep Sharma
- Department of Biophysics, All India Institute of Medical Sciences, New Delhi, India
| | - Vivek Dhar Dwivedi
- Center for Bioinformatics, Computational and Systems Biology, Pathfinder Research and Training Foundation, Greater Noida, India.,Institute of Advanced Materials, IAAM, 59053 Ulrika, Sweden
| |
Collapse
|
8
|
Wang Y, Tang H, Gao C, Ge M, Li Z, Dong Z, Zhao L. Flexibility-aware graph model for accurate epitope identification. Comput Biol Med 2022; 149:106064. [DOI: 10.1016/j.compbiomed.2022.106064] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2022] [Revised: 08/05/2022] [Accepted: 08/27/2022] [Indexed: 11/25/2022]
|
9
|
La Marca AF, Lopes RDS, Lotufo ADP, Bartholomeu DC, Minussi CR. BepFAMN: A Method for Linear B-Cell Epitope Predictions Based on Fuzzy-ARTMAP Artificial Neural Network. SENSORS 2022; 22:s22114027. [PMID: 35684648 PMCID: PMC9185646 DOI: 10.3390/s22114027] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/04/2022] [Revised: 05/22/2022] [Accepted: 05/24/2022] [Indexed: 12/02/2022]
Abstract
The public health system is extremely dependent on the use of vaccines to immunize the population from a series of infectious and dangerous diseases, preventing the system from collapsing and millions of people dying every year. However, to develop these vaccines and effectively monitor these diseases, it is necessary to use accurate diagnostic methods capable of identifying highly immunogenic regions within a given pathogenic protein. Existing experimental methods are expensive, time-consuming, and require arduous laboratory work, as they require the screening of a large number of potential candidate epitopes, making the methods extremely laborious, especially for application to larger microorganisms. In the last decades, researchers have developed in silico prediction methods, based on machine learning, to identify these markers, to drastically reduce the list of potential candidate epitopes for experimental tests, and, consequently, to reduce the laborious task associated with their mapping. Despite these efforts, the tools and methods still have low accuracy, slow diagnosis, and offline training. Thus, we develop a method to predict B-cell linear epitopes which are based on a Fuzzy-ARTMAP neural network architecture, called BepFAMN (B Epitope Prediction Fuzzy ARTMAP Artificial Neural Network). This was trained using a linear averaging scheme on 15 properties that include an amino acid ratio scale and a set of 14 physicochemical scales. The database used was obtained from the IEDB website, from which the amino acid sequences with the annotations of their positive and negative epitopes were taken. To train and validate the knowledge models, five-fold cross-validation and competition techniques were used. The BepiPred-2.0 database, an independent database, was used for the tests. In our experiment, the validation dataset reached sensitivity = 91.50%, specificity = 91.49%, accuracy = 91.49%, MCC = 0.83, and an area under the curve (AUC) ROC of approximately 0.9289. The result in the testing dataset achieves a significant improvement, with sensitivity = 81.87%, specificity = 74.75%, accuracy = 78.27%, MCC = 0.56, and AOC = 0.7831. These achieved values demonstrate that BepFAMN outperforms all other linear B-cell epitope prediction tools currently used. In addition, the architecture provides mechanisms for online training, which allow the user to find a new B-cell linear epitope, and to improve the model without need to re-train itself with the whole dataset. This fact contributes to a considerable reduction in the number of potential linear epitopes to be experimentally validated, reducing laboratory time and accelerating the development of diagnostic tests, vaccines, and immunotherapeutic approaches.
Collapse
Affiliation(s)
- Anthony F. La Marca
- Electrical Engineering Department, UNESP—São Paulo State University, Av. Brasil 56, Ilha Solteira 15385-000, Brazil; (A.F.L.M.); (A.D.P.L.)
| | - Robson da S. Lopes
- Computer Science Course, UFMT—Mato Grosso Federal University, Av. Valdon Varjão, 6390 Setor Industrial, Barra do Garças 78605-091, Brazil;
| | - Anna Diva P. Lotufo
- Electrical Engineering Department, UNESP—São Paulo State University, Av. Brasil 56, Ilha Solteira 15385-000, Brazil; (A.F.L.M.); (A.D.P.L.)
| | - Daniella C. Bartholomeu
- Parasite Immunology and Genomics Laboratory, Institute of Biological Sciences, Minas Gerais Federal University, Belo Horizonte 31270-901, Brazil;
| | - Carlos R. Minussi
- Electrical Engineering Department, UNESP—São Paulo State University, Av. Brasil 56, Ilha Solteira 15385-000, Brazil; (A.F.L.M.); (A.D.P.L.)
- Correspondence: ; Tel.: +55-1837431225
| |
Collapse
|
10
|
Biological databases and their application. Bioinformatics 2022. [DOI: 10.1016/b978-0-323-89775-4.00021-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022] Open
|
11
|
Dhusia K, Su Z, Wu Y. A structural-based machine learning method to classify binding affinities between TCR and peptide-MHC complexes. Mol Immunol 2021; 139:76-86. [PMID: 34455212 PMCID: PMC10811653 DOI: 10.1016/j.molimm.2021.07.020] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2021] [Revised: 07/13/2021] [Accepted: 07/25/2021] [Indexed: 11/27/2022]
Abstract
The activation of T cells is triggered by the interactions of T cell receptors (TCRs) with their epitopes, which are peptides presented by major histocompatibility complex (MHC) on the surfaces of antigen presenting cells (APC). While each TCR can only recognize a specific subset from a large repertoire of peptide-MHC (pMHC) complexes, it is very often that peptides in this subset share little sequence similarity. This is known as the specificity and cross-reactivity of T cells, respectively. The binding affinities between different types of TCRs and pMHC are the major driving force to shape this specificity and cross-reactivity in T cell recognition. The binding affinities, furthermore, are determined by the sequence and structural properties at the interfaces between TCRs and pMHC. Fortunately, a wealth of data on binding and structures of TCR-pMHC interactions becomes publicly accessible in online resources, which offers us the opportunity to develop a random forest classifier for predicting the binding affinities between TCR and pMHC based on the structure of their complexes. Specifically, the structure and sequence of a given complex were projected onto a high-dimensional feature space as the input of the classifier, which was then trained by a large-scale benchmark dataset. Based on the cross-validation results, we found that our machine learning model can predict if the binding affinity of a given TCR-pMHC complex is stronger or weaker than a predefined threshold with an overall accuracy approximately around 75 %. The significance of our prediction was estimated by statistical analysis. Moreover, more than 60 % of binding affinities in the ATLAS database can be successfully classified into groups within the range of 2 kcal/mol. Additionally, we show that TCR-pMHC complexes with strong binding affinity prefer hydrophobic interactions between amino acids with large aromatic rings instead of electrostatic interactions. Our results therefore provide insights to design engineered TCRs which enhance the specificity for their targeted epitopes. Taken together, this method can serve as a useful addition to a suite of existing approaches which study binding between TCR and pMHC.
Collapse
Affiliation(s)
- Kalyani Dhusia
- Department of Systems and Computational Biology, Albert Einstein College of Medicine, 1300 Morris Park Avenue, Bronx, NY 10461, United States
| | - Zhaoqian Su
- Department of Systems and Computational Biology, Albert Einstein College of Medicine, 1300 Morris Park Avenue, Bronx, NY 10461, United States
| | - Yinghao Wu
- Department of Systems and Computational Biology, Albert Einstein College of Medicine, 1300 Morris Park Avenue, Bronx, NY 10461, United States.
| |
Collapse
|
12
|
Qiao X, Qu L, Guo Y, Hoshino T. Secondary Structure and Conformational Stability of the Antigen Residues Making Contact with Antibodies. J Phys Chem B 2021; 125:11374-11385. [PMID: 34615354 DOI: 10.1021/acs.jpcb.1c05997] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Antibodies are crucial biomolecules that bring high therapeutic efficacy in medicine and accurate molecular detection in diagnosis. Many studies have been devoted to analyzing the antigen-antibody interaction from the importance of understanding the antibody recognition mechanism. However, most of the previous studies examined the characteristic of the antibody for interaction. It is also informative to clarify the significant antigen residues contributing to the binding. To characterize the molecular interaction of antigens, we computationally analyzed 350 antigen-antibody complex structures by molecular mechanics (MM) calculations and molecular dynamics (MD) simulations. Based on the MM calculations, the antigen residues contributing to the binding were extracted from all the 350 complexes. The extracted residues are located at the antigen-antibody interface and are responsible for making contact with the antibody. The appearances of the charged polar residues, Asp, Glu, Arg, and Lys, were noticeably large. In contrast, the populations of the hydrophobic residues, Leu, Val, and Ala, were relatively low. The appearance frequencies of the other amino acid residues were almost close to the abundance of general proteins of eukaryotes. The binding score indicated that the hydrophilic interaction was dominant at the antigen-antibody contact instead of the hydrophobic one. The positively charged residues, Arg and Lys, remarkably contributed to the binding compared to the negatively charged ones, Asp and Glu. Considerable contributions were also observed for the noncharged polar residues, Asn and Gln. The analysis of the secondary structures of the extracted antigen residues suggested that there was no marked difference in recognition by antibodies among helix, sheet, turn, and coil. A long helix of the antigen sometimes made contact with antibody complementarity-determining regions, and a large sheet also frequently covered the antibody heavy and light chains. The turn structure was the most popularly observed at the contact with antibody among 350 complexes. Three typical complexes were picked up for each of the four secondary structures. MD simulations were performed to examine the stability of the interfacial structures of the antigens for these 12 complex models. The alterations of secondary structures were monitored through the simulations. The structural fluctuations of the contact residues were low compared with the other domains of antigen molecules. No drastic conversion was observed for every model during the 100 ns simulation. The motions of the interfacial antigen residues were small compared to the other residues on the protein surface. Therefore, diverse molecular conformations are possible for antibody recognition as long as the target areas are polar, nonflexible, and protruding on the protein surface.
Collapse
Affiliation(s)
- Xinyue Qiao
- Graduate School of Pharmaceutical Sciences, Chiba University, Inohana 1-8-1, Chuo-ku, Chiba 260-8675, Japan
| | - Liang Qu
- Graduate School of Pharmaceutical Sciences, Chiba University, Inohana 1-8-1, Chuo-ku, Chiba 260-8675, Japan
| | - Yan Guo
- Graduate School of Pharmaceutical Sciences, Chiba University, Inohana 1-8-1, Chuo-ku, Chiba 260-8675, Japan
| | - Tyuji Hoshino
- Graduate School of Pharmaceutical Sciences, Chiba University, Inohana 1-8-1, Chuo-ku, Chiba 260-8675, Japan
| |
Collapse
|
13
|
Ramchandani R, Hossenbaccus L, Ellis AK. Immunoregulatory T cell epitope peptides for the treatment of allergic disease. Immunotherapy 2021; 13:1283-1291. [PMID: 34558985 DOI: 10.2217/imt-2021-0133] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open
Abstract
Allergic diseases are type 2 inflammatory reactions with an increasing worldwide prevalence, making the search for new therapeutic options pertinent. Allergen immunotherapy is the only disease-modifying approach for allergic rhinitis, though it can result in systemic reactions. Recently, peptide immunotherapy (PIT), involving T-cell epitope peptides that bind to major histocompatibility complexes, have been developed. It is speculated that they can induce T helper cell type 2 anergy, Treg cell upregulation or immune deviation. Promising results in cat dander, honeybee venom, Japanese cedar pollen, grass pollens, ragweed and house dust mite clinical trials have shown safety, efficacy and tolerability to PIT. Hence, PIT may hold the potential to change the treatment algorithm for allergic rhinitis.
Collapse
Affiliation(s)
- Rashi Ramchandani
- Department of Medicine, Queen's University, Kingston, ON, K7L 3N6, Canada.,Allergy Research Unit, Kingston Health Sciences Center - KGH Site, Kingston, on, K7L 2V7, Canada
| | - Lubnaa Hossenbaccus
- Department of Biomedical & Molecular Sciences, Queen's University, Kingston, ON, K7L 3N6, Canada.,Allergy Research Unit, Kingston Health Sciences Center - KGH Site, Kingston, on, K7L 2V7, Canada
| | - Anne K Ellis
- Department of Medicine, Queen's University, Kingston, ON, K7L 3N6, Canada.,Department of Biomedical & Molecular Sciences, Queen's University, Kingston, ON, K7L 3N6, Canada.,Allergy Research Unit, Kingston Health Sciences Center - KGH Site, Kingston, on, K7L 2V7, Canada
| |
Collapse
|
14
|
Rawal K, Sinha R, Abbasi BA, Chaudhary A, Nath SK, Kumari P, Preeti P, Saraf D, Singh S, Mishra K, Gupta P, Mishra A, Sharma T, Gupta S, Singh P, Sood S, Subramani P, Dubey AK, Strych U, Hotez PJ, Bottazzi ME. Identification of vaccine targets in pathogens and design of a vaccine using computational approaches. Sci Rep 2021; 11:17626. [PMID: 34475453 PMCID: PMC8413327 DOI: 10.1038/s41598-021-96863-x] [Citation(s) in RCA: 37] [Impact Index Per Article: 12.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2020] [Accepted: 08/10/2021] [Indexed: 02/07/2023] Open
Abstract
Antigen identification is an important step in the vaccine development process. Computational approaches including deep learning systems can play an important role in the identification of vaccine targets using genomic and proteomic information. Here, we present a new computational system to discover and analyse novel vaccine targets leading to the design of a multi-epitope subunit vaccine candidate. The system incorporates reverse vaccinology and immuno-informatics tools to screen genomic and proteomic datasets of several pathogens such as Trypanosoma cruzi, Plasmodium falciparum, and Vibrio cholerae to identify potential vaccine candidates (PVC). Further, as a case study, we performed a detailed analysis of the genomic and proteomic dataset of T. cruzi (CL Brenner and Y strain) to shortlist eight proteins as possible vaccine antigen candidates using properties such as secretory/surface-exposed nature, low transmembrane helix (< 2), essentiality, virulence, antigenic, and non-homology with host/gut flora proteins. Subsequently, highly antigenic and immunogenic MHC class I, MHC class II and B cell epitopes were extracted from top-ranking vaccine targets. The designed vaccine construct containing 24 epitopes, 3 adjuvants, and 4 linkers was analysed for its physicochemical properties using different tools, including docking analysis. Immunological simulation studies suggested significant levels of T-helper, T-cytotoxic cells, and IgG1 will be elicited upon administration of such a putative multi-epitope vaccine construct. The vaccine construct is predicted to be soluble, stable, non-allergenic, non-toxic, and to offer cross-protection against related Trypanosoma species and strains. Further, studies are required to validate safety and immunogenicity of the vaccine.
Collapse
Affiliation(s)
- Kamal Rawal
- Centre for Computational Biology and Bioinformatics, Amity Institute of Biotechnology, Amity University Uttar Pradesh, Noida, India.
| | - Robin Sinha
- Centre for Computational Biology and Bioinformatics, Amity Institute of Biotechnology, Amity University Uttar Pradesh, Noida, India
| | - Bilal Ahmed Abbasi
- Centre for Computational Biology and Bioinformatics, Amity Institute of Biotechnology, Amity University Uttar Pradesh, Noida, India
| | - Amit Chaudhary
- Centre for Computational Biology and Bioinformatics, Amity Institute of Biotechnology, Amity University Uttar Pradesh, Noida, India
| | - Swarsat Kaushik Nath
- Centre for Computational Biology and Bioinformatics, Amity Institute of Biotechnology, Amity University Uttar Pradesh, Noida, India
| | - Priya Kumari
- Centre for Computational Biology and Bioinformatics, Amity Institute of Biotechnology, Amity University Uttar Pradesh, Noida, India
| | - P Preeti
- Centre for Computational Biology and Bioinformatics, Amity Institute of Biotechnology, Amity University Uttar Pradesh, Noida, India
| | - Devansh Saraf
- Centre for Computational Biology and Bioinformatics, Amity Institute of Biotechnology, Amity University Uttar Pradesh, Noida, India
| | - Shachee Singh
- Centre for Computational Biology and Bioinformatics, Amity Institute of Biotechnology, Amity University Uttar Pradesh, Noida, India
| | - Kartik Mishra
- Centre for Computational Biology and Bioinformatics, Amity Institute of Biotechnology, Amity University Uttar Pradesh, Noida, India
| | - Pranjay Gupta
- Centre for Computational Biology and Bioinformatics, Amity Institute of Biotechnology, Amity University Uttar Pradesh, Noida, India
| | - Astha Mishra
- Centre for Computational Biology and Bioinformatics, Amity Institute of Biotechnology, Amity University Uttar Pradesh, Noida, India
| | - Trapti Sharma
- Centre for Computational Biology and Bioinformatics, Amity Institute of Biotechnology, Amity University Uttar Pradesh, Noida, India
| | - Srijanee Gupta
- Centre for Computational Biology and Bioinformatics, Amity Institute of Biotechnology, Amity University Uttar Pradesh, Noida, India
| | - Prashant Singh
- Centre for Computational Biology and Bioinformatics, Amity Institute of Biotechnology, Amity University Uttar Pradesh, Noida, India
| | - Shriya Sood
- Centre for Computational Biology and Bioinformatics, Amity Institute of Biotechnology, Amity University Uttar Pradesh, Noida, India
| | - Preeti Subramani
- Centre for Computational Biology and Bioinformatics, Amity Institute of Biotechnology, Amity University Uttar Pradesh, Noida, India
| | - Aman Kumar Dubey
- Centre for Computational Biology and Bioinformatics, Amity Institute of Biotechnology, Amity University Uttar Pradesh, Noida, India
| | - Ulrich Strych
- Texas Children's Hospital Center for Vaccine Development, Departments of Pediatrics and Molecular Virology and Microbiology, National School of Tropical Medicine, Baylor College of Medicine, Houston, TX, USA
| | - Peter J Hotez
- Texas Children's Hospital Center for Vaccine Development, Departments of Pediatrics and Molecular Virology and Microbiology, National School of Tropical Medicine, Baylor College of Medicine, Houston, TX, USA
- Department of Biology, Baylor University, Waco, TX, USA
| | - Maria Elena Bottazzi
- Texas Children's Hospital Center for Vaccine Development, Departments of Pediatrics and Molecular Virology and Microbiology, National School of Tropical Medicine, Baylor College of Medicine, Houston, TX, USA
- Department of Biology, Baylor University, Waco, TX, USA
| |
Collapse
|
15
|
Jiang L, Yu H, Li J, Tang J, Guo Y, Guo F. Predicting MHC class I binder: existing approaches and a novel recurrent neural network solution. Brief Bioinform 2021; 22:6299205. [PMID: 34131696 DOI: 10.1093/bib/bbab216] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2021] [Revised: 05/14/2021] [Accepted: 05/17/2021] [Indexed: 01/04/2023] Open
Abstract
Major histocompatibility complex (MHC) possesses important research value in the treatment of complex human diseases. A plethora of computational tools has been developed to predict MHC class I binders. Here, we comprehensively reviewed 27 up-to-date MHC I binding prediction tools developed over the last decade, thoroughly evaluating feature representation methods, prediction algorithms and model training strategies on a benchmark dataset from Immune Epitope Database. A common limitation was identified during the review that all existing tools can only handle a fixed peptide sequence length. To overcome this limitation, we developed a bilateral and variable long short-term memory (BVLSTM)-based approach, named BVLSTM-MHC. It is the first variable-length MHC class I binding predictor. In comparison to the 10 mainstream prediction tools on an independent validation dataset, BVLSTM-MHC achieved the best performance in six out of eight evaluated metrics. A web server based on the BVLSTM-MHC model was developed to enable accurate and efficient MHC class I binder prediction in human, mouse, macaque and chimpanzee.
Collapse
Affiliation(s)
- Limin Jiang
- Comprehensive cancer center, Department of Internal Medicine, University of New Mexico, Albuquerque, NM, USA
| | - Hui Yu
- Comprehensive cancer center, Department of Internal Medicine, University of New Mexico, Albuquerque, NM, USA
| | - Jiawei Li
- School of Computer Science and Technology, College of Intelligence and Computing, Tianjin University, Tianjin, China
| | - Jijun Tang
- Department of Computer Science, University of South Carolina, SC, USA.,Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China
| | - Yan Guo
- Comprehensive cancer center, Department of Internal Medicine, University of New Mexico, Albuquerque, NM, USA
| | - Fei Guo
- School of Computer Science and Engineering, Central South University, Changsha, China
| |
Collapse
|
16
|
Data curation to improve the pattern recognition performance of B-cell epitope prediction by support vector machine. PURE APPL CHEM 2021. [DOI: 10.1515/pac-2020-1107] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
Abstract
B-cell epitope will be recognized and attached to the surface of receptors in B-lymphocytes to trigger immune response, thus are the vital elements in the field of epitope-based vaccine design, antibody production and therapeutic development. However, the experimental approaches in mapping epitopes are time consuming and costly. Computational prediction could offer an unbiased preliminary selection to reduce the number of epitopes for experimental validation. The deposited B-cell epitopes in the databases are those with experimentally determined positive/negative peptides and some are ambiguous resulted from different experimental methods. Prior to the development of B-cell epitope prediction module, the available dataset need to be handled with care. In this work, we first pre-processed the B-cell epitope dataset prior to B-cell epitopes prediction based on pattern recognition using support vector machine (SVM). By using only the absolute epitopes and non-epitopes, the datasets were classified into five categories of pathogen and worked on the 6-mers peptide sequences. The pre-processing of the datasets have improved the B-cell epitope prediction performance up to 99.1 % accuracy and showed significant improvement in cross validation results. It could be useful when incorporated with physicochemical propensity ranking in the future for the development of B-cell epitope prediction module.
Collapse
|
17
|
Galanis KA, Nastou KC, Papandreou NC, Petichakis GN, Pigis DG, Iconomidou VA. Linear B-Cell Epitope Prediction for In Silico Vaccine Design: A Performance Review of Methods Available via Command-Line Interface. Int J Mol Sci 2021; 22:3210. [PMID: 33809918 PMCID: PMC8004178 DOI: 10.3390/ijms22063210] [Citation(s) in RCA: 49] [Impact Index Per Article: 16.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2021] [Revised: 03/15/2021] [Accepted: 03/19/2021] [Indexed: 12/17/2022] Open
Abstract
Linear B-cell epitope prediction research has received a steadily growing interest ever since the first method was developed in 1981. B-cell epitope identification with the help of an accurate prediction method can lead to an overall faster and cheaper vaccine design process, a crucial necessity in the COVID-19 era. Consequently, several B-cell epitope prediction methods have been developed over the past few decades, but without significant success. In this study, we review the current performance and methodology of some of the most widely used linear B-cell epitope predictors which are available via a command-line interface, namely, BcePred, BepiPred, ABCpred, COBEpro, SVMTriP, LBtope, and LBEEP. Additionally, we attempted to remedy performance issues of the individual methods by developing a consensus classifier, which combines the separate predictions of these methods into a single output, accelerating the epitope-based vaccine design. While the method comparison was performed with some necessary caveats and individual methods might perform much better for specialized datasets, we hope that this update in performance can aid researchers towards the choice of a predictor, for the development of biomedical applications such as designed vaccines, diagnostic kits, immunotherapeutics, immunodiagnostic tests, antibody production, and disease diagnosis and therapy.
Collapse
Affiliation(s)
| | | | | | | | | | - Vassiliki A. Iconomidou
- Section of Cell Biology and Biophysics, Department of Biology, School of Sciences, National and Kapodistrian University of Athens, 15701 Athens, Greece; (K.A.G.); (K.C.N.); (N.C.P.); (G.N.P.); (D.G.P.)
| |
Collapse
|
18
|
Abstract
The assessment of immunogenicity of biopharmaceuticals is a crucial step in the process of their development. Immunogenicity is related to the activation of adaptive immunity. The complexity of the immune system manifests through numerous different mechanisms, which allows the use of different approaches for predicting the immunogenicity of biopharmaceuticals. The direct experimental approaches are sometimes expensive and time consuming, or their results need to be confirmed. In this case, computational methods for immunogenicity prediction appear as an appropriate complement in the process of drug design. In this review, we analyze the use of various In silico methods and approaches for immunogenicity prediction of biomolecules: sequence alignment algorithms, predicting subcellular localization, searching for major histocompatibility complex (MHC) binding motifs, predicting T and B cell epitopes based on machine learning algorithms, molecular docking, and molecular dynamics simulations. Computational tools for antigenicity and allergenicity prediction also are considered.
Collapse
|
19
|
Anand R, Biswal S, Bhatt R, Tiwary BN. Computational perspectives revealed prospective vaccine candidates from five structural proteins of novel SARS corona virus 2019 (SARS-CoV-2). PeerJ 2020; 8:e9855. [PMID: 33062414 PMCID: PMC7531350 DOI: 10.7717/peerj.9855] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2020] [Accepted: 08/11/2020] [Indexed: 12/30/2022] Open
Abstract
Background The present pandemic COVID-19 is caused by SARS-CoV-2, a single-stranded positive-sense RNA virus from the Coronaviridae family. Due to a lack of antiviral drugs, vaccines against the virus are urgently required. Methods In this study, validated computational approaches were used to identify peptide-based epitopes from six structural proteins having antigenic properties. The Net-CTL 1.2 tool was used for the prediction of CD8+ T-cell epitopes, while the robust tools Bepi-Pred 2 and LBtope was employed for the identification of linear B-cell epitopes. Docking studies of the identified epitopes were performed using HADDOCK 2.4 and the structures were visualized by Discovery Studio and LigPlot+. Antigenicity, immunogenicity, conservancy, population coverage and allergenicity of the predicted epitopes were determined by the bioinformatics tools like VaxiJen v2.0 server, the Immune Epitope Database tools and AllerTOP v.2.0, AllergenFP 1.0 and ElliPro. Results The predicted T cell and linear B-cell epitopes were considered as prime vaccine targets in case they passed the requisite parameters like antigenicity, immunogenicity, conservancy, non-allergenicity and broad range of population coverage. Among the predicted CD8+ T cell epitopes, potential vaccine targets from surface glycoprotein were; YQPYRVVVL, PYRVVVLSF, GVYFASTEK, QLTPTWRVY, and those from ORF3a protein were LKKRWQLAL, HVTFFIYNK. Similarly, RFLYIIKLI, LTWICLLQF from membrane protein and three epitopes viz; SPRWYFYYL, TWLTYTGAI, KTFPPTEPK from nucleocapsid phosphoprotein were the superior vaccine targets observed in our study. The negative values of HADDOCK and Z scores obtained for the best cluster indicated the potential of the epitopes as suitable vaccine candidates. Analysis of the 3D and 2D interaction diagrams of best cluster produced by HADDOCK 2.4 displayed the binding interaction of leading T cell epitopes within the MHC-1 peptide binding clefts. On the other hand, among linear B cell epitopes the majority of potential vaccine targets were from nucleocapsid protein, viz; 59−HGKEDLKFPRGQGVPINTNSSPDDQIGYYRRATRRIRGGDGKMKDLS−105, 227−LNQLE SKMSGKGQQQQGQTVTKKSAAEASKKPRQKRTATK−266, 3−DNGPQNQRNAPRITFGGP−20, 29−GERSGARSKQRRPQGL−45. Two other prime vaccine targets, 370−NSASFSTFKCYGVSPTKLNDLCFTNV−395 and 260−AGAAAYYVGYLQPRT−274 were identified in the spike protein. The potential B-cell conformational epitopes were predicted on the basis of a higher protrusion index indicating greater solvent accessibility. These conformational epitopes were of various lengths and belonged to spike, ORF3a, membrane and nucleocapsid proteins. Conclusions Taken together, eleven T cell epitopes, seven B cell linear epitopes and ten B cell conformational epitopes were identified from five structural proteins of SARS-CoV-2 using advanced computational tools. These potential vaccine candidates may provide important timely directives for an effective vaccine against SARS-CoV-2.
Collapse
Affiliation(s)
- Rajesh Anand
- Department of Biotechnology, Guru Ghasidas Vishwavidyalaya, (A Central University), Bilaspur, Chhattisgarh, India
| | - Subham Biswal
- Department of Biotechnology, Guru Ghasidas Vishwavidyalaya, (A Central University), Bilaspur, Chhattisgarh, India
| | - Renu Bhatt
- Department of Biotechnology, Guru Ghasidas Vishwavidyalaya, (A Central University), Bilaspur, Chhattisgarh, India
| | - Bhupendra N Tiwary
- Department of Biotechnology, Guru Ghasidas Vishwavidyalaya, (A Central University), Bilaspur, Chhattisgarh, India
| |
Collapse
|
20
|
Zhou WJ, Qu Z, Song CY, Sun Y, Lai AL, Luo MY, Ying YZ, Meng H, Liang Z, He YJ, Li YH, Liu J. NeoPeptide: an immunoinformatic database of T-cell-defined neoantigens. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2020; 2019:5670755. [PMID: 31819989 PMCID: PMC6901387 DOI: 10.1093/database/baz128] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/18/2018] [Revised: 10/11/2019] [Accepted: 10/15/2019] [Indexed: 12/13/2022]
Abstract
Therapeutic vaccines represent a promising immunotherapeutic modality against cancer. Discovery and validation of antigens is the key to develop effective anti-cancer vaccines. Neoantigens, arising from somatic mutations in individual cancers, are considered as ideal cancer vaccine targets because of their immunogenicity and lack of expression in normal tissues. However, only few databases support convenient access to these neoantigens for use in vaccines. To address this gap, we developed a web-accessible database, called NeoPeptide, which contains most of the important characteristics of neoantigens (such as mutation site, subunit sequence, major histocompatibility complex restriction) derived from published literature and other immunological resources. NeoPeptide also provides links to resources for further characterization of the novel features of these neoantigens. NeoPeptide will be regularly updated with newly identified and published neoantigens. Our work will help researchers in identifying neoantigens in different cancers and hasten the search for appropriate cancer vaccine candidates.
Collapse
Affiliation(s)
- Wei-Jun Zhou
- Department of Hematology, Zhujiang Hospital, Southern Medical University, 253 Industrial Avenue, Haizhu, Guangzhou 510282, China
| | - Zhi Qu
- School of Computer Science and Technology, Harbin Institute of Technology, 92 West Dazhi Street,Nan Gang District, Harbin 150001, China
| | - Chao-Yang Song
- Department of Hematology, Zhujiang Hospital, Southern Medical University, 253 Industrial Avenue, Haizhu, Guangzhou 510282, China
| | - Yang Sun
- The Second Clinical Medical College (Zhujiang Hospital), Southern Medical University, 253 Industrial Avenue, Haizhu, Guangzhou 510282, China
| | - An-Li Lai
- The Second Clinical Medical College (Zhujiang Hospital), Southern Medical University, 253 Industrial Avenue, Haizhu, Guangzhou 510282, China
| | - Ma-Yao Luo
- The Second Clinical Medical College (Zhujiang Hospital), Southern Medical University, 253 Industrial Avenue, Haizhu, Guangzhou 510282, China
| | - Yu-Zhe Ying
- The Second Clinical Medical College (Zhujiang Hospital), Southern Medical University, 253 Industrial Avenue, Haizhu, Guangzhou 510282, China
| | - Hu Meng
- The Second Clinical Medical College (Zhujiang Hospital), Southern Medical University, 253 Industrial Avenue, Haizhu, Guangzhou 510282, China
| | - Zhao Liang
- Department of Hematology, Zhujiang Hospital, Southern Medical University, 253 Industrial Avenue, Haizhu, Guangzhou 510282, China
| | - Yan-Jie He
- Department of Hematology, Zhujiang Hospital, Southern Medical University, 253 Industrial Avenue, Haizhu, Guangzhou 510282, China
| | - Yu-Hua Li
- Department of Hematology, Zhujiang Hospital, Southern Medical University, 253 Industrial Avenue, Haizhu, Guangzhou 510282, China
| | - Jian Liu
- College of Computer Science, NanKai University, No.38 Tongyan Road, Jinnan District, Tianjin 300350, China
| |
Collapse
|
21
|
Graves J, Byerly J, Priego E, Makkapati N, Parish SV, Medellin B, Berrondo M. A Review of Deep Learning Methods for Antibodies. Antibodies (Basel) 2020; 9:E12. [PMID: 32354020 PMCID: PMC7344881 DOI: 10.3390/antib9020012] [Citation(s) in RCA: 26] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2020] [Revised: 04/15/2020] [Accepted: 04/16/2020] [Indexed: 01/09/2023] Open
Abstract
Driven by its successes across domains such as computer vision and natural language processing, deep learning has recently entered the field of biology by aiding in cellular image classification, finding genomic connections, and advancing drug discovery. In drug discovery and protein engineering, a major goal is to design a molecule that will perform a useful function as a therapeutic drug. Typically, the focus has been on small molecules, but new approaches have been developed to apply these same principles of deep learning to biologics, such as antibodies. Here we give a brief background of deep learning as it applies to antibody drug development, and an in-depth explanation of several deep learning algorithms that have been proposed to solve aspects of both protein design in general, and antibody design in particular.
Collapse
Affiliation(s)
| | | | | | | | | | | | - Monica Berrondo
- Macromoltek, Inc, 2500 W William Cannon Dr, Suite 204, Austin, Austin, TX 78745, USA
| |
Collapse
|
22
|
Abstract
With advancements in sequencing technologies, vast amount of experimental data has accumulated. Due to rapid progress in the development of bioinformatics tools and the accumulation of data, immunoinformatics or computational immunology emerged as a special branch of bioinformatics which utilizes bioinformatics approaches for understanding and interpreting immunological data. One extensively studied aspect of applied immunology involves using available databases and tools for prediction of B- and T-cell epitopes. B and T cells comprise two arms of adaptive immunity.This chapter first reviews the methodology we used for computational identification of B- and T-cell epitopes against enterotoxigenic Escherichia coli (ETEC). Then we discuss other databases of epitopes and analysis tools for T-cell and B-cell epitope prediction and vaccine design. The predicted peptides were analyzed for conservation and population coverage. HLA distribution analysis for predicted epitopes identified efficient MHC binders. Epitopes were further tested using computational docking studies to bind in MHC-I molecule cleft. The predicted epitopes were conserved and covered more than 80% of the world population.
Collapse
MESH Headings
- Antigens, Bacterial/chemistry
- Antigens, Bacterial/genetics
- Antigens, Bacterial/immunology
- Computational Biology
- Databases, Protein
- Enterotoxigenic Escherichia coli/genetics
- Enterotoxigenic Escherichia coli/immunology
- Epitope Mapping/methods
- Epitopes, B-Lymphocyte/chemistry
- Epitopes, B-Lymphocyte/genetics
- Epitopes, B-Lymphocyte/immunology
- Epitopes, T-Lymphocyte/chemistry
- Epitopes, T-Lymphocyte/genetics
- Epitopes, T-Lymphocyte/immunology
- Escherichia coli Vaccines/genetics
- Escherichia coli Vaccines/immunology
- Humans
- Models, Molecular
- Molecular Docking Simulation
Collapse
Affiliation(s)
- Jayashree Ramana
- Department of Biotechnology and Bioinformatics, Jaypee University of Information Technology, Waknaghat, HP, India.
| | - Kusum Mehla
- National Bureau of Animal Genetic Resources, Karnal, Haryana, India
| |
Collapse
|
23
|
Quinzo MJ, Lafuente EM, Zuluaga P, Flower DR, Reche PA. Computational assembly of a human Cytomegalovirus vaccine upon experimental epitope legacy. BMC Bioinformatics 2019; 20:476. [PMID: 31823715 PMCID: PMC6905002 DOI: 10.1186/s12859-019-3052-6] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2019] [Accepted: 08/23/2019] [Indexed: 01/05/2023] Open
Abstract
Background Human Cytomegalovirus (HCMV) is a ubiquitous herpesvirus affecting approximately 90% of the world population. HCMV causes disease in immunologically naive and immunosuppressed patients. The prevention, diagnosis and therapy of HCMV infection are thus crucial to public health. The availability of effective prophylactic and therapeutic treatments remain a significant challenge and no vaccine is currently available. Here, we sought to define an epitope-based vaccine against HCMV, eliciting B and T cell responses, from experimentally defined HCMV-specific epitopes. Results We selected 398 and 790 experimentally validated HCMV-specific B and T cell epitopes, respectively, from available epitope resources and apply a knowledge-based approach in combination with immunoinformatic predictions to ensemble a universal vaccine against HCMV. The T cell component consists of 6 CD8 and 6 CD4 T cell epitopes that are conserved among HCMV strains. All CD8 T cell epitopes were reported to induce cytotoxic activity, are derived from early expressed genes and are predicted to provide population protection coverage over 97%. The CD4 T cell epitopes are derived from HCMV structural proteins and provide a population protection coverage over 92%. The B cell component consists of just 3 B cell epitopes from the ectodomain of glycoproteins L and H that are highly flexible and exposed to the solvent. Conclusions We have defined a multiantigenic epitope vaccine ensemble against the HCMV that should elicit T and B cell responses in the entire population. Importantly, although we arrived to this epitope ensemble with the help of computational predictions, the actual epitopes are not predicted but are known to be immunogenic.
Collapse
Affiliation(s)
- Monica J Quinzo
- Faculty of Medicine, University Complutense of Madrid, Pza Ramon y Cajal, s/n, 28040, Madrid, Spain
| | - Esther M Lafuente
- Faculty of Medicine, University Complutense of Madrid, Pza Ramon y Cajal, s/n, 28040, Madrid, Spain
| | - Pilar Zuluaga
- Faculty of Medicine, University Complutense of Madrid, Pza Ramon y Cajal, s/n, 28040, Madrid, Spain
| | - Darren R Flower
- School of Life and Health Sciences, Aston University, Aston Triangle, Birmingham, B4 7ET, UK
| | - Pedro A Reche
- Faculty of Medicine, University Complutense of Madrid, Pza Ramon y Cajal, s/n, 28040, Madrid, Spain.
| |
Collapse
|
24
|
Karapetyan AR, Chaipan C, Winkelbach K, Wimberger S, Jeong JS, Joshi B, Stein RB, Underwood D, Castle JC, van Dijk M, Seibert V. TCR Fingerprinting and Off-Target Peptide Identification. Front Immunol 2019; 10:2501. [PMID: 31695703 PMCID: PMC6817589 DOI: 10.3389/fimmu.2019.02501] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2019] [Accepted: 10/07/2019] [Indexed: 01/06/2023] Open
Abstract
Adoptive T cell therapy using patient T cells redirected to recognize tumor-specific antigens by expressing genetically engineered high-affinity T-cell receptors (TCRs) has therapeutic potential for melanoma and other solid tumors. Clinical trials implementing genetically modified TCRs in melanoma patients have raised concerns regarding off-target toxicities resulting in lethal destruction of healthy tissue, highlighting the urgency of assessing which off-target peptides can be recognized by a TCR. As a model system we used the clinically efficacious NY-ESO-1-specific TCR C259, which recognizes the peptide epitope SLLMWITQC presented by HLA-A*02:01. We investigated which amino acids at each position enable a TCR interaction by sequentially replacing every amino acid position outside of anchor positions 2 and 9 with all 19 possible alternative amino acids, resulting in 134 peptides (133 altered peptides plus epitope peptide). Each peptide was individually evaluated using three different in vitro assays: binding of the NY-ESOc259 TCR to the peptide, peptide-dependent activation of TCR-expressing cells, and killing of peptide-presenting target cells. To represent the TCR recognition kernel, we defined Position Weight Matrices (PWMs) for each assay by assigning normalized measurements to each of the 20 amino acids in each position. To predict potential off-target peptides, we applied a novel algorithm projecting the PWM-defined kernel into the human proteome, scoring NY-ESOc259 TCR recognition of 336,921 predicted human HLA-A*02:01 binding 9-mer peptides. Of the 12 peptides with high predicted score, we confirmed 7 (including NY-ESO-1 antigen SLLMWITQC) strongly activate human primary NY-ESOc259-expressing T cells. These off-target peptides include peptides with up to 7 amino acid changes (of 9 possible), which could not be predicted using the recognition motif as determined by alanine scans. Thus, this replacement scan assay determines the “TCR fingerprint” and, when coupled with the algorithm applied to the database of human 9-mer peptides binding to HLA-A*02:01, enables the identification of potential off-target antigens and the tissues where they are expressed. This platform enables both screening of multiple TCRs to identify the best candidate for clinical development and identification of TCR-specific cross-reactive peptide recognition and constitutes an improved methodology for the identification of potential off-target peptides presented on MHC class I molecules.
Collapse
|
25
|
Bahrami AA, Payandeh Z, Khalili S, Zakeri A, Bandehpour M. Immunoinformatics: In Silico Approaches and Computational Design of a Multi-epitope, Immunogenic Protein. Int Rev Immunol 2019; 38:307-322. [PMID: 31478759 DOI: 10.1080/08830185.2019.1657426] [Citation(s) in RCA: 57] [Impact Index Per Article: 11.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
Abstract
Immunoinformatics is a new critical field with several tools and databases that conduct the eyesight of experimental selection and facilitate analysis of the great amount of immunologic data obtained from experimental researches and helps to design and introducing new hypothesis. Given these visages, immunoinformatics seems to be the way that develop and progress the immunological research. Bioinformatics methods and applications are successfully employed in vaccine informatics to assist different sites of the preclinical, clinical, and post-licensure vaccine enterprises. On the other hand, the progression of molecular biology and immunology caused epitope vaccines have become the focus of research on molecular vaccines. Moreover, reverse vaccinology could improve vaccine production and vaccination protocols by in silico prediction of protein-vaccine candidates from genome sequences. B- and T-cell immune epitopes could be predicted by immunoinformatics algorithms and computational methods to improve the vaccine design, protective immunity analysis, assessment of vaccine safety and efficacy, and immunization modeling. This review aims to discuss the power of computational approaches in vaccine design and their relevance to the development of effective vaccines. Furthermore, the various divisions of this field and available tools in each item are introduced and reviewed.
Collapse
Affiliation(s)
- Armina Alagheband Bahrami
- Department of Biotechnology, School of Advanced Technologies in Medicine, Shahid Beheshti University of Medical Sciences, Tehran, Iran
| | - Zahra Payandeh
- Immunology Research Center, Tabriz University of Medical Sciences, Tabriz, Iran
| | - Saeed Khalili
- Department of Biology Sciences, Shahid Rajaee Teacher Training University, Tehran, Iran
| | - Alireza Zakeri
- Department of Biology Sciences, Shahid Rajaee Teacher Training University, Tehran, Iran
| | - Mojgan Bandehpour
- Department of Biotechnology, School of Advanced Technologies in Medicine, Shahid Beheshti University of Medical Sciences, Tehran, Iran
| |
Collapse
|
26
|
Li Z, Miao Q, Yan F, Meng Y, Zhou P. Machine Learning in Quantitative Protein–peptide Affinity Prediction: Implications for Therapeutic Peptide Design. Curr Drug Metab 2019; 20:170-176. [DOI: 10.2174/1389200219666181012151944] [Citation(s) in RCA: 66] [Impact Index Per Article: 13.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2017] [Revised: 11/07/2017] [Accepted: 08/20/2018] [Indexed: 01/03/2023]
Abstract
Background:Protein–peptide recognition plays an essential role in the orchestration and regulation of cell signaling networks, which is estimated to be responsible for up to 40% of biological interaction events in the human interactome and has recently been recognized as a new and attractive druggable target for drug development and disease intervention.Methods:We present a systematic review on the application of machine learning techniques in the quantitative modeling and prediction of protein–peptide binding affinity, particularly focusing on its implications for therapeutic peptide design. We also briefly introduce the physical quantities used to characterize protein–peptide affinity and attempt to extend the content of generalized machine learning methods.Results:Existing issues and future perspective on the statistical modeling and regression prediction of protein– peptide binding affinity are discussed.Conclusion:There is still a long way to go before establishment of general, reliable and efficient machine leaningbased protein–peptide affinity predictors.
Collapse
Affiliation(s)
- Zhongyan Li
- Center for Informational Biology, School of Life Science and Technology, University of Electronic Science and Technology of China (UESTC), Chengdu 610054, China
| | - Qingqing Miao
- Center for Informational Biology, School of Life Science and Technology, University of Electronic Science and Technology of China (UESTC), Chengdu 610054, China
| | - Fugang Yan
- Center for Informational Biology, School of Life Science and Technology, University of Electronic Science and Technology of China (UESTC), Chengdu 610054, China
| | - Yang Meng
- Center for Informational Biology, School of Life Science and Technology, University of Electronic Science and Technology of China (UESTC), Chengdu 610054, China
| | - Peng Zhou
- Center for Informational Biology, School of Life Science and Technology, University of Electronic Science and Technology of China (UESTC), Chengdu 610054, China
| |
Collapse
|
27
|
Dingman R, Balu-Iyer SV. Immunogenicity of Protein Pharmaceuticals. J Pharm Sci 2019; 108:1637-1654. [PMID: 30599169 PMCID: PMC6720129 DOI: 10.1016/j.xphs.2018.12.014] [Citation(s) in RCA: 72] [Impact Index Per Article: 14.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2018] [Revised: 12/19/2018] [Accepted: 12/20/2018] [Indexed: 02/07/2023]
Abstract
Protein therapeutics have drastically changed the landscape of treatment for many diseases by providing a regimen that is highly specific and lacks many off-target toxicities. The clinical utility of many therapeutic proteins has been undermined by the potential development of unwanted immune responses against the protein, limiting their efficacy and negatively impacting its safety profile. This review attempts to provide an overview of immunogenicity of therapeutic proteins, including immune mechanisms and factors influencing immunogenicity, impact of immunogenicity, preclinical screening methods, and strategies to mitigate immunogenicity.
Collapse
Affiliation(s)
- Robert Dingman
- Department of Pharmaceutical Sciences, University at Buffalo, The State University of New York, Buffalo, New York 14214
| | - Sathy V Balu-Iyer
- Department of Pharmaceutical Sciences, University at Buffalo, The State University of New York, Buffalo, New York 14214.
| |
Collapse
|
28
|
Cretich M, Gori A, D'Annessa I, Chiari M, Colombo G. Peptides for Infectious Diseases: From Probe Design to Diagnostic Microarrays. Antibodies (Basel) 2019; 8:E23. [PMID: 31544829 PMCID: PMC6640701 DOI: 10.3390/antib8010023] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2019] [Revised: 02/28/2019] [Accepted: 03/04/2019] [Indexed: 01/03/2023] Open
Abstract
Peptides and peptidomimetics have attracted revived interest regarding their applications in chemical biology over the last few years. Their chemical versatility, synthetic accessibility and the ease of storage and management compared to full proteins have made peptides particularly interesting in diagnostic applications, where they proved to efficiently recapitulate the molecular recognition properties of larger protein antigens, and were proven to be able to capture antibodies circulating in the plasma and serum of patients previously exposed to bacterial or viral infections. Here, we describe the development, integration and application of strategies for computational prediction and design, advanced chemical synthesis, and diagnostic deployment in multiplexed assays of peptide-based materials which are able to bind antibodies of diagnostic as well as therapeutic interest. By presenting successful applications of such an integrated strategy, we argue that they will have an ever-increasing role in both basic and clinical realms of research, where important advances can be expected in the next few years.
Collapse
Affiliation(s)
- Marina Cretich
- Consiglio Nazionale delle Ricerche, Istituto di Chimica del Riconoscimento Molecolare (ICRM), Via Mario Bianco 9, 20131 Milano, Italy.
| | - Alessandro Gori
- Consiglio Nazionale delle Ricerche, Istituto di Chimica del Riconoscimento Molecolare (ICRM), Via Mario Bianco 9, 20131 Milano, Italy.
| | - Ilda D'Annessa
- Consiglio Nazionale delle Ricerche, Istituto di Chimica del Riconoscimento Molecolare (ICRM), Via Mario Bianco 9, 20131 Milano, Italy.
| | - Marcella Chiari
- Consiglio Nazionale delle Ricerche, Istituto di Chimica del Riconoscimento Molecolare (ICRM), Via Mario Bianco 9, 20131 Milano, Italy.
| | - Giorgio Colombo
- Consiglio Nazionale delle Ricerche, Istituto di Chimica del Riconoscimento Molecolare (ICRM), Via Mario Bianco 9, 20131 Milano, Italy.
- Dipartimento di Chimica, Università di Pavia, V.le Taramelli 12, 27100 Pavia, Italy.
| |
Collapse
|
29
|
Bioinformatics Applications in Advancing Animal Virus Research. RECENT ADVANCES IN ANIMAL VIROLOGY 2019. [PMCID: PMC7121192 DOI: 10.1007/978-981-13-9073-9_23] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]
Abstract
Viruses serve as infectious agents for all living entities. There have been various research groups that focus on understanding the viruses in terms of their host-viral relationships, pathogenesis and immune evasion. However, with the current advances in the field of science, now the research field has widened up at the ‘omics’ level. Apparently, generation of viral sequence data has been increasing. There are numerous bioinformatics tools available that not only aid in analysing such sequence data but also aid in deducing useful information that can be exploited in developing preventive and therapeutic measures. This chapter elaborates on bioinformatics tools that are specifically designed for animal viruses as well as other generic tools that can be exploited to study animal viruses. The chapter further provides information on the tools that can be used to study viral epidemiology, phylogenetic analysis, structural modelling of proteins, epitope recognition and open reading frame (ORF) recognition and tools that enable to analyse host-viral interactions, gene prediction in the viral genome, etc. Various databases that organize information on animal and human viruses have also been described. The chapter will converse on overview of the current advances, online and downloadable tools and databases in the field of bioinformatics that will enable the researchers to study animal viruses at gene level.
Collapse
|
30
|
Computational B-cell epitope identification and production of neutralizing murine antibodies against Atroxlysin-I. Sci Rep 2018; 8:14904. [PMID: 30297733 PMCID: PMC6175905 DOI: 10.1038/s41598-018-33298-x] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2018] [Accepted: 09/03/2018] [Indexed: 11/08/2022] Open
Abstract
Epitope identification is essential for developing effective antibodies that can detect and neutralize bioactive proteins. Computational prediction is a valuable and time-saving alternative for experimental identification. Current computational methods for epitope prediction are underused and undervalued due to their high false positive rate. In this work, we targeted common properties of linear B-cell epitopes identified in an individual protein class (metalloendopeptidases) and introduced an alternative method to reduce the false positive rate and increase accuracy, proposing to restrict predictive models to a single specific protein class. For this purpose, curated epitope sequences from metalloendopeptidases were transformed into frame-shifted Kmers (3 to 15 amino acid residues long). These Kmers were decomposed into a matrix of biochemical attributes and used to train a decision tree classifier. The resulting prediction model showed a lower false positive rate and greater area under the curve when compared to state-of-the-art methods. Our predictions were used for synthesizing peptides mimicking the predicted epitopes for immunization of mice. A predicted linear epitope that was previously undetected by an experimental immunoassay was able to induce neutralizing-antibody production in mice. Therefore, we present an improved prediction alternative and show that computationally identified epitopes can go undetected during experimental mapping.
Collapse
|
31
|
Usmani SS, Kumar R, Bhalla S, Kumar V, Raghava GPS. In Silico Tools and Databases for Designing Peptide-Based Vaccine and Drugs. ADVANCES IN PROTEIN CHEMISTRY AND STRUCTURAL BIOLOGY 2018; 112:221-263. [PMID: 29680238 DOI: 10.1016/bs.apcsb.2018.01.006] [Citation(s) in RCA: 34] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
The prolonged conventional approaches of drug screening and vaccine designing prerequisite patience, vigorous effort, outrageous cost as well as additional manpower. Screening and experimentally validating thousands of molecules for a specific therapeutic property never proved to be an easy task. Similarly, traditional way of vaccination includes administration of either whole or attenuated pathogen, which raises toxicity and safety issues. Emergence of sequencing and recombinant DNA technology led to the epitope-based advanced vaccination concept, i.e., small peptides (epitope) can stimulate specific immune response. Advent of bioinformatics proved to be an adjunct in vaccine and drug designing. Genomic study of pathogens aid to identify and analyze the protective epitope. A number of in silico tools have been developed to design immunotherapy as well as peptide-based drugs in the last two decades. These tools proved to be a catalyst in drug and vaccine designing. This review solicits therapeutic peptide databases as well as in silico tools developed for designing peptide-based vaccine and drugs.
Collapse
Affiliation(s)
- Salman Sadullah Usmani
- Center for Computational Biology, Indraprastha Institute of Information Technology, New Delhi, India; Bioinformatics Centre, CSIR-Institute of Microbial Technology, Chandigarh, India
| | - Rajesh Kumar
- Center for Computational Biology, Indraprastha Institute of Information Technology, New Delhi, India; Bioinformatics Centre, CSIR-Institute of Microbial Technology, Chandigarh, India
| | - Sherry Bhalla
- Center for Computational Biology, Indraprastha Institute of Information Technology, New Delhi, India
| | - Vinod Kumar
- Center for Computational Biology, Indraprastha Institute of Information Technology, New Delhi, India; Bioinformatics Centre, CSIR-Institute of Microbial Technology, Chandigarh, India
| | - Gajendra P S Raghava
- Center for Computational Biology, Indraprastha Institute of Information Technology, New Delhi, India; Bioinformatics Centre, CSIR-Institute of Microbial Technology, Chandigarh, India.
| |
Collapse
|
32
|
Fundamentals and Methods for T- and B-Cell Epitope Prediction. J Immunol Res 2017; 2017:2680160. [PMID: 29445754 PMCID: PMC5763123 DOI: 10.1155/2017/2680160] [Citation(s) in RCA: 284] [Impact Index Per Article: 40.6] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2017] [Revised: 11/22/2017] [Accepted: 11/27/2017] [Indexed: 12/25/2022] Open
Abstract
Adaptive immunity is mediated by T- and B-cells, which are immune cells capable of developing pathogen-specific memory that confers immunological protection. Memory and effector functions of B- and T-cells are predicated on the recognition through specialized receptors of specific targets (antigens) in pathogens. More specifically, B- and T-cells recognize portions within their cognate antigens known as epitopes. There is great interest in identifying epitopes in antigens for a number of practical reasons, including understanding disease etiology, immune monitoring, developing diagnosis assays, and designing epitope-based vaccines. Epitope identification is costly and time-consuming as it requires experimental screening of large arrays of potential epitope candidates. Fortunately, researchers have developed in silico prediction methods that dramatically reduce the burden associated with epitope mapping by decreasing the list of potential epitope candidates for experimental testing. Here, we analyze aspects of antigen recognition by T- and B-cells that are relevant for epitope prediction. Subsequently, we provide a systematic and inclusive review of the most relevant B- and T-cell epitope prediction methods and tools, paying particular attention to their foundations.
Collapse
|
33
|
Abstract
INTRODUCTION The ability to predict epitopes plays an enormous role in vaccine development in terms of our ability to zero in on where to do a more thorough in-vivo analysis of the protein in question. Though for the past decade there have been numerous advancements and improvements in epitope prediction, on average the best benchmark prediction accuracies are still only around 60%. New machine learning algorithms have arisen within the domain of deep learning, text mining, and convolutional networks. This paper presents a novel analytically trained and string kernel using deep neural network, which is tailored for continuous epitope prediction, called: Deep Ridge Regressed Epitope Predictor (DRREP). RESULTS DRREP was tested on long protein sequences from the following datasets: SARS, Pellequer, HIV, AntiJen, and SEQ194. DRREP was compared to numerous state of the art epitope predictors, including the most recently published predictors called LBtope and DMNLBE. Using area under ROC curve (AUC), DRREP achieved a performance improvement over the best performing predictors on SARS (13.7%), HIV (8.9%), Pellequer (1.5%), and SEQ194 (3.1%), with its performance being matched only on the AntiJen dataset, by the LBtope predictor, where both DRREP and LBtope achieved an AUC of 0.702. CONCLUSION DRREP is an analytically trained deep neural network, thus capable of learning in a single step through regression. By combining the features of deep learning, string kernels, and convolutional networks, the system is able to perform residue-by-residue prediction of continues epitopes with higher accuracy than the current state of the art predictors.
Collapse
Affiliation(s)
- Gene Sher
- Department of Computer Science, University of Central Florida, Orlando, FL USA
| | - Degui Zhi
- School of Biomedical Informatics, University of Texas Health Science Center at Houston, Houston, TX USA
| | - Shaojie Zhang
- Department of Computer Science, University of Central Florida, Orlando, FL USA
| |
Collapse
|
34
|
Gourlay L, Peri C, Bolognesi M, Colombo G. Structure and Computation in Immunoreagent Design: From Diagnostics to Vaccines. Trends Biotechnol 2017; 35:1208-1220. [PMID: 28739221 DOI: 10.1016/j.tibtech.2017.06.018] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2017] [Revised: 06/28/2017] [Accepted: 06/30/2017] [Indexed: 11/26/2022]
Abstract
Novel immunological tools for efficient diagnosis and treatment of emerging infections are urgently required. Advances in the diagnostic and vaccine development fields are continuously progressing, with reverse vaccinology and structural vaccinology (SV) methods for antigen identification and structure-based antigen (re)design playing increasingly relevant roles. SV, in particular, is predicted to be the front-runner in the future development of diagnostics and vaccines targeting challenging diseases such as AIDS and cancer. We review state-of-the-art methodologies for structure-based epitope identification and antigen design, with specific applicative examples. We highlight the implications of such methods for the engineering of biomolecules with improved immunological properties, potential diagnostic and/or therapeutic uses, and discuss the perspectives of structure-based rational design for the production of advanced immunoreagents.
Collapse
Affiliation(s)
- Louise Gourlay
- Dipartimento di Bioscienze, Università di Milano, Via Celoria 26, 20133, Milan, Italy
| | - Claudio Peri
- Istituto di Chimica del Riconoscimento Molecolare, Consiglio Nazionale delle Ricerche, Via Mario Bianco, 9, 20131, Milan, Italy
| | - Martino Bolognesi
- Dipartimento di Bioscienze, Università di Milano, Via Celoria 26, 20133, Milan, Italy; Centro di Ricerca Pediatrica Romeo ed Enrica Invernizzi, Università di Milano, Milan, Italy.
| | - Giorgio Colombo
- Istituto di Chimica del Riconoscimento Molecolare, Consiglio Nazionale delle Ricerche, Via Mario Bianco, 9, 20131, Milan, Italy.
| |
Collapse
|
35
|
Vang YS, Xie X. HLA class I binding prediction via convolutional neural networks. Bioinformatics 2017; 33:2658-2665. [DOI: 10.1093/bioinformatics/btx264] [Citation(s) in RCA: 64] [Impact Index Per Article: 9.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2016] [Accepted: 04/18/2017] [Indexed: 01/19/2023] Open
Affiliation(s)
- Yeeleng S Vang
- Department of Computer Science, University of California, Irvine, CA, USA
| | - Xiaohui Xie
- Department of Computer Science, University of California, Irvine, CA, USA
- Institute for Genomics and Bioinformatics, University of California, Irvine, CA, USA
| |
Collapse
|
36
|
Borrman T, Cimons J, Cosiano M, Purcaro M, Pierce BG, Baker BM, Weng Z. ATLAS: A database linking binding affinities with structures for wild-type and mutant TCR-pMHC complexes. Proteins 2017; 85:908-916. [PMID: 28160322 DOI: 10.1002/prot.25260] [Citation(s) in RCA: 39] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2016] [Revised: 01/17/2017] [Accepted: 01/23/2017] [Indexed: 11/07/2022]
Abstract
The ATLAS (Altered TCR Ligand Affinities and Structures) database (https://zlab.umassmed.edu/atlas/web/) is a manually curated repository containing the binding affinities for wild-type and mutant T cell receptors (TCRs) and their antigens, peptides presented by the major histocompatibility complex (pMHC). The database links experimentally measured binding affinities with the corresponding three dimensional (3D) structures for TCR-pMHC complexes. The user can browse and search affinities, structures, and experimental details for TCRs, peptides, and MHCs of interest. We expect this database to facilitate the development of next-generation protein design algorithms targeting TCR-pMHC interactions. ATLAS can be easily parsed using modeling software that builds protein structures for training and testing. As an example, we provide structural models for all mutant TCRs in ATLAS, built using the Rosetta program. Utilizing these structures, we report a correlation of 0.63 between experimentally measured changes in binding energies and our predicted changes. Proteins 2017; 85:908-916. © 2016 Wiley Periodicals, Inc.
Collapse
Affiliation(s)
- Tyler Borrman
- Program in Bioinformatics and Integrative Biology, University of Massachusetts Medical School, Worcester, Massachusetts, 01605
| | - Jennifer Cimons
- Department of Chemistry and Biochemistry and the Harper Cancer Research Institute, University of Notre Dame, Notre Dame, Indiana, 46556
| | - Michael Cosiano
- Department of Chemistry and Biochemistry and the Harper Cancer Research Institute, University of Notre Dame, Notre Dame, Indiana, 46556
| | - Michael Purcaro
- Program in Bioinformatics and Integrative Biology, University of Massachusetts Medical School, Worcester, Massachusetts, 01605
| | - Brian G Pierce
- University of Maryland Institute for Bioscience and Biotechnology Research, Rockville, Maryland, 20850
| | - Brian M Baker
- Department of Chemistry and Biochemistry and the Harper Cancer Research Institute, University of Notre Dame, Notre Dame, Indiana, 46556
| | - Zhiping Weng
- Program in Bioinformatics and Integrative Biology, University of Massachusetts Medical School, Worcester, Massachusetts, 01605
| |
Collapse
|
37
|
Potocnakova L, Bhide M, Pulzova LB. An Introduction to B-Cell Epitope Mapping and In Silico Epitope Prediction. J Immunol Res 2016; 2016:6760830. [PMID: 28127568 PMCID: PMC5227168 DOI: 10.1155/2016/6760830] [Citation(s) in RCA: 198] [Impact Index Per Article: 24.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2016] [Revised: 11/21/2016] [Accepted: 12/13/2016] [Indexed: 01/09/2023] Open
Abstract
Identification of B-cell epitopes is a fundamental step for development of epitope-based vaccines, therapeutic antibodies, and diagnostic tools. Epitope-based antibodies are currently the most promising class of biopharmaceuticals. In the last decade, in-depth in silico analysis and categorization of the experimentally identified epitopes stimulated development of algorithms for epitope prediction. Recently, various in silico tools are employed in attempts to predict B-cell epitopes based on sequence and/or structural data. The main objective of epitope identification is to replace an antigen in the immunization, antibody production, and serodiagnosis. The accurate identification of B-cell epitopes still presents major challenges for immunologists. Advances in B-cell epitope mapping and computational prediction have yielded molecular insights into the process of biorecognition and formation of antigen-antibody complex, which may help to localize B-cell epitopes more precisely. In this paper, we have comprehensively reviewed state-of-the-art experimental methods for B-cell epitope identification, existing databases for epitopes, and novel in silico resources and prediction tools available online. We have also elaborated new trends in the antibody-based epitope prediction. The aim of this review is to assist researchers in identification of B-cell epitopes.
Collapse
Affiliation(s)
- Lenka Potocnakova
- Laboratory of Biomedical Microbiology and Immunology, Department of Microbiology and Immunology, The University of Veterinary Medicine and Pharmacy in Kosice, 041 81 Kosice, Slovakia
| | - Mangesh Bhide
- Laboratory of Biomedical Microbiology and Immunology, Department of Microbiology and Immunology, The University of Veterinary Medicine and Pharmacy in Kosice, 041 81 Kosice, Slovakia
- Institute of Neuroimmunology of Slovak Academy of Sciences, 845 10 Bratislava, Slovakia
| | - Lucia Borszekova Pulzova
- Laboratory of Biomedical Microbiology and Immunology, Department of Microbiology and Immunology, The University of Veterinary Medicine and Pharmacy in Kosice, 041 81 Kosice, Slovakia
| |
Collapse
|
38
|
Jandrlić DR. SVM and SVR-based MHC-binding prediction using a mathematical presentation of peptide sequences. Comput Biol Chem 2016; 65:117-127. [PMID: 27816828 DOI: 10.1016/j.compbiolchem.2016.10.011] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2016] [Revised: 09/16/2016] [Accepted: 10/24/2016] [Indexed: 11/16/2022]
Abstract
At present, there are a number of methods for the prediction of T-cell epitopes and major histocompatibility complex (MHC)-binding peptides. Despite numerous methods for predicting T-cell epitopes, there still exist limitations that affect the reliability of prevailing methods. For this reason, the development of models with high accuracy are crucial. An accurate prediction of the peptides that bind to specific major histocompatibility complex class I and II (MHC-I and MHC-II) molecules is important for an understanding of the functioning of the immune system and the development of peptide-based vaccines. Peptide binding is the most selective step in identifying T-cell epitopes. In this paper, we present a new approach to predicting MHC-binding ligands that takes into account new weighting schemes for position-based amino acid frequencies, BLOSUM and VOGG substitution of amino acids, and the physicochemical and molecular properties of amino acids. We have made models for quantitatively and qualitatively predicting MHC-binding ligands. Our models are based on two machine learning methods support vector machine (SVM) and support vector regression (SVR), where our models have used for feature selection, several different encoding and weighting schemes for peptides. The resulting models showed comparable, and in some cases better, performance than the best existing predictors. The obtained results indicate that the physicochemical and molecular properties of amino acids (AA) contribute significantly to the peptide-binding affinity.
Collapse
Affiliation(s)
- Davorka R Jandrlić
- University of Belgrade, Faculty of Mechanical Engineering, Kraljice Marije 16, Belgrade, Serbia.
| |
Collapse
|
39
|
sNebula, a network-based algorithm to predict binding between human leukocyte antigens and peptides. Sci Rep 2016; 6:32115. [PMID: 27558848 PMCID: PMC4997263 DOI: 10.1038/srep32115] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2016] [Accepted: 08/02/2016] [Indexed: 12/19/2022] Open
Abstract
Understanding the binding between human leukocyte antigens (HLAs) and peptides is important to understand the functioning of the immune system. Since it is time-consuming and costly to measure the binding between large numbers of HLAs and peptides, computational methods including machine learning models and network approaches have been developed to predict HLA-peptide binding. However, there are several limitations for the existing methods. We developed a network-based algorithm called sNebula to address these limitations. We curated qualitative Class I HLA-peptide binding data and demonstrated the prediction performance of sNebula on this dataset using leave-one-out cross-validation and five-fold cross-validations. This algorithm can predict not only peptides of different lengths and different types of HLAs, but also the peptides or HLAs that have no existing binding data. We believe sNebula is an effective method to predict HLA-peptide binding and thus improve our understanding of the immune system.
Collapse
|
40
|
Hackl H, Charoentong P, Finotello F, Trajanoski Z. Computational genomics tools for dissecting tumour–immune cell interactions. Nat Rev Genet 2016; 17:441-58. [DOI: 10.1038/nrg.2016.67] [Citation(s) in RCA: 188] [Impact Index Per Article: 23.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
|
41
|
Wang S, Guo L, Liu D, Liu W, Wu Y. HLAsupE: an integrated database of HLA supertype-specific epitopes to aid in the development of vaccines with broad coverage of the human population. BMC Immunol 2016; 17:17. [PMID: 27307005 PMCID: PMC4910211 DOI: 10.1186/s12865-016-0156-x] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2016] [Accepted: 06/08/2016] [Indexed: 01/24/2023] Open
Abstract
Background Promiscuous T-cell epitopes that can be presented by multiple human leukocyte antigens (HLAs) are prime targets for vaccine and immunotherapy development because they are effective in a high proportion of the human population. Although there are a number of epitope databases currently available online, the epitope data in these databases were annotated using specific MHC restrictions, and none of these databases was specifically designed for retrieving data on promiscuous epitopes. Description HLAsupE is an integrated database of HLA supertype-specific epitopes (promiscuous T-cell epitopes in the context of HLA supertypes). The source data for the T-cell activities and HLA-binding capacities of peptides with a specific HLA restriction were extracted from public epitope databases. After a manual curation, these allele-specific data were integrated into supertype-specific datasets based on the defined supertypes and corresponding alleles. Each supertype-specific peptide in HLAsupE is annotated in terms of its cross-reactivity to HLA molecules within the same supertype. Promiscuous peptides that can be presented by multiple HLA molecules across multiple HLA supertypes were also included in this database. Several web-based tools are provided to access and download the data. Conclusions HLAsupE is the first database of promiscuous T cell epitopes that is organized based on the HLA supertypes. The main advantage of this database is the ability to search for promiscuous T-cell epitopes based on the cross-reactivity to specific alleles or supertypes. HLAsupE will be a valuable resource for the development of epitope-based vaccines and immunotherapies with broad coverage of human population.
Collapse
Affiliation(s)
- Shufeng Wang
- Institute of Immunology PLA, Third Military Medical University, Chongqing, 400038, China
| | - Ling Guo
- Institute of Immunology PLA, Third Military Medical University, Chongqing, 400038, China
| | - Dong Liu
- Institute of Immunology PLA, Third Military Medical University, Chongqing, 400038, China
| | - Wei Liu
- Institute of Immunology PLA, Third Military Medical University, Chongqing, 400038, China.
| | - Yuzhang Wu
- Institute of Immunology PLA, Third Military Medical University, Chongqing, 400038, China.
| |
Collapse
|
42
|
Kozlova E, Viart B, de Avila R, Felicori L, Chavez-Olortegui C. Classification epitopes in groups based on their protein family. BMC Bioinformatics 2015; 16 Suppl 19:S7. [PMID: 26696329 PMCID: PMC4686779 DOI: 10.1186/1471-2105-16-s19-s7] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023] Open
Abstract
Background The humoral immune system response is based on the interaction between antibodies and antigens for the clearance of pathogens and foreign molecules. The interaction between these proteins occurs at specific positions known as antigenic determinants or B-cell epitopes. The experimental identification of epitopes is costly and time consuming. Therefore the use of in silico methods, to help discover new epitopes, is an appealing alternative due the importance of biomedical applications such as vaccine design, disease diagnostic, anti-venoms and immune-therapeutics. However, the performance of predictions is not optimal been around 70% of accuracy. Further research could increase our understanding of the biochemical and structural properties that characterize a B-cell epitope. Results We investigated the possibility of linear epitopes from the same protein family to share common properties. This hypothesis led us to analyze physico-chemical (PCP) and predicted secondary structure (PSS) features of a curated dataset of epitope sequences available in the literature belonging to two different groups of antigens (metalloproteinases and neurotoxins). We discovered statistically significant parameters with data mining techniques which allow us to distinguish neurotoxin from metalloproteinase and these two from random sequences. After a five cross fold validation we found that PCP based models obtained area under the curve values (AUC) and accuracy above 0.9 for regression, decision tree and support vector machine. Conclusions We demonstrated that antigen's family can be inferred from properties within a single group of linear epitopes (metalloproteinases or neurotoxins). Also we discovered the characteristics that represent these two epitope groups including their similarities and differences with random peptides and their respective amino acid sequence. These findings open new perspectives to improve epitope prediction by considering the specific antigen's protein family. We expect that these findings will help to improve current computational mapping methods based on physico-chemical due it's potential application during epitope discovery.
Collapse
|
43
|
Abstract
Immunoinformatics involves the application of computational methods to immunological problems. Prediction of B- and T-cell epitopes has long been the focus of immunoinformatics, given the potential translational implications, and many tools have been developed. With the advent of next-generation sequencing (NGS) methods, an unprecedented wealth of information has become available that requires more-advanced immunoinformatics tools. Based on information from whole-genome sequencing, exome sequencing and RNA sequencing, it is possible to characterize with high accuracy an individual’s human leukocyte antigen (HLA) allotype (i.e., the individual set of HLA alleles of the patient), as well as changes arising in the HLA ligandome (the collection of peptides presented by the HLA) owing to genomic variation. This has allowed new opportunities for translational applications of epitope prediction, such as epitope-based design of prophylactic and therapeutic vaccines, and personalized cancer immunotherapies. Here, we review a wide range of immunoinformatics tools, with a focus on B- and T-cell epitope prediction. We also highlight fundamental differences in the underlying algorithms and discuss the various metrics employed to assess prediction quality, comparing their strengths and weaknesses. Finally, we discuss the new challenges and opportunities presented by high-throughput data-sets for the field of epitope prediction.
Collapse
Affiliation(s)
- Linus Backert
- Applied Bioinformatics, Center of Bioinformatics and Department of Computer Science, University of Tübingen, Sand 14, 72076, Tübingen, Germany.
| | - Oliver Kohlbacher
- Applied Bioinformatics, Center of Bioinformatics and Department of Computer Science, University of Tübingen, Sand 14, 72076, Tübingen, Germany.,Quantitative Biology Center, University of Tübingen, Auf der Morgenstelle 10, 72076, Tübingen, Germany.,Biomolecular Interactions, Max Planck Institute for Developmental Biology, Spemannstrasse 35, 72076, Tübingen, Germany
| |
Collapse
|
44
|
Review on the identification and role of Toxoplasma gondii antigenic epitopes. Parasitol Res 2015; 115:459-68. [PMID: 26581372 DOI: 10.1007/s00436-015-4824-1] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2015] [Accepted: 11/10/2015] [Indexed: 12/12/2022]
Abstract
Toxoplasma gondii is an obligate intracellular protozoan parasite with a broad range of hosts, and it causes severe toxoplasmasis in both humans and animals. It is well known that the progression and severity of a disease depend on the immunological status of the host. Immunological studies on antigens indicate that antigens do not exert their functions through the entire protein molecule, but instead, specific epitopes are responsible for the immune response. Protein antigens not only contain epitope structures used by B, T, cytotoxic T lymphocyte (CTL), and NK cells to mediate immunological responses but can also contain structures that are unfavorable for protective immunity. Therefore, the study of antigenic epitopes from T. gondii has not only enhanced our understanding of the structure and function of antigens, the reactions between antigens and antibodies, and many other aspects of immunology but it also plays a significant role in the development of new diagnostic reagents and vaccines. In this review, we summarized the immune mechanisms induced by antigen epitopes and the latest advances in identifying T. gondii antigen epitopes. Particular attention was paid to the potential clinical usefulness of epitopes in this context. Through a critical analysis of the current state of knowledge, we elucidated the latest data concerning the biological effects of epitopes and the immune results aimed at the development of future epitope-based applications, such as vaccines and diagnostic reagents.
Collapse
|
45
|
Molero-Abraham M, Glutting JP, Flower DR, Lafuente EM, Reche PA. EPIPOX: Immunoinformatic Characterization of the Shared T-Cell Epitome between Variola Virus and Related Pathogenic Orthopoxviruses. J Immunol Res 2015; 2015:738020. [PMID: 26605344 PMCID: PMC4641182 DOI: 10.1155/2015/738020] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2015] [Revised: 09/08/2015] [Accepted: 10/01/2015] [Indexed: 11/26/2022] Open
Abstract
Concerns that variola viruses might be used as bioweapons have renewed the interest in developing new and safer smallpox vaccines. Variola virus genomes are now widely available, allowing computational characterization of the entire T-cell epitome and the use of such information to develop safe and yet effective vaccines. To this end, we identified 124 proteins shared between various species of pathogenic orthopoxviruses including variola minor and major, monkeypox, cowpox, and vaccinia viruses, and we targeted them for T-cell epitope prediction. We recognized 8,106, and 8,483 unique class I and class II MHC-restricted T-cell epitopes that are shared by all mentioned orthopoxviruses. Subsequently, we developed an immunological resource, EPIPOX, upon the predicted T-cell epitome. EPIPOX is freely available online and it has been designed to facilitate reverse vaccinology. Thus, EPIPOX includes key epitope-focused protein annotations: time point expression, presence of leader and transmembrane signals, and known location on outer membrane structures of the infective viruses. These features can be used to select specific T-cell epitopes suitable for experimental validation restricted by single MHC alleles, as combinations thereof, or by MHC supertypes.
Collapse
Affiliation(s)
- Magdalena Molero-Abraham
- School of Medicine, Unit of Immunology, Complutense University of Madrid, Pza. Ramón y Cajal, s/n, 28040 Madrid, Spain
| | - John-Paul Glutting
- School of Medicine, Unit of Immunology, Complutense University of Madrid, Pza. Ramón y Cajal, s/n, 28040 Madrid, Spain
| | - Darren R. Flower
- School of Life and Health Sciences, University of Aston, Aston Triangle, Birmingham B4 7ET, UK
| | - Esther M. Lafuente
- School of Medicine, Unit of Immunology, Complutense University of Madrid, Pza. Ramón y Cajal, s/n, 28040 Madrid, Spain
| | - Pedro A. Reche
- School of Medicine, Unit of Immunology, Complutense University of Madrid, Pza. Ramón y Cajal, s/n, 28040 Madrid, Spain
| |
Collapse
|
46
|
Luo H, Ye H, Ng H, Shi L, Tong W, Mattes W, Mendrick D, Hong H. Understanding and predicting binding between human leukocyte antigens (HLAs) and peptides by network analysis. BMC Bioinformatics 2015; 16 Suppl 13:S9. [PMID: 26424483 PMCID: PMC4597169 DOI: 10.1186/1471-2105-16-s13-s9] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2023] Open
Abstract
BACKGROUND As the major histocompatibility complex (MHC), human leukocyte antigens (HLAs) are one of the most polymorphic genes in humans. Patients carrying certain HLA alleles may develop adverse drug reactions (ADRs) after taking specific drugs. Peptides play an important role in HLA related ADRs as they are the necessary co-binders of HLAs with drugs. Many experimental data have been generated for understanding HLA-peptide binding. However, efficiently utilizing the data for understanding and accurately predicting HLA-peptide binding is challenging. Therefore, we developed a network analysis based method to understand and predict HLA-peptide binding. METHODS Qualitative Class I HLA-peptide binding data were harvested and prepared from four major databases. An HLA-peptide binding network was constructed from this dataset and modules were identified by the fast greedy modularity optimization algorithm. To examine the significance of signals in the yielded models, the modularity was compared with the modularity values generated from 1,000 random networks. The peptides and HLAs in the modules were characterized by similarity analysis. The neighbor-edges based and unbiased leverage algorithm (Nebula) was developed for predicting HLA-peptide binding. Leave-one-out (LOO) validations and two-fold cross-validations were conducted to evaluate the performance of Nebula using the constructed HLA-peptide binding network. RESULTS Nine modules were identified from analyzing the HLA-peptide binding network with a highest modularity compared to all the random networks. Peptide length and functional side chains of amino acids at certain positions of the peptides were different among the modules. HLA sequences were module dependent to some extent. Nebula archived an overall prediction accuracy of 0.816 in the LOO validations and average accuracy of 0.795 in the two-fold cross-validations and outperformed the method reported in the literature. CONCLUSIONS Network analysis is a useful approach for analyzing large and sparse datasets such as the HLA-peptide binding dataset. The modules identified from the network analysis clustered peptides and HLAs with similar sequences and properties of amino acids. Nebula performed well in the predictions of HLA-peptide binding. We demonstrated that network analysis coupled with Nebula is an efficient approach to understand and predict HLA-peptide binding interactions and thus, could further our understanding of ADRs.
Collapse
|
47
|
Abstract
Modem immunology and vaccinology incorporate immunoinformatics techniques to give insights into immune systems and accelerate vaccine design. Databases managing epitope data in a structured form with immune-related annotations including sequences, alleles, source organisms, structures, and diseases could be the most crucial part of immunoinformatics offering data sources for the analysis of immune systems and development of prediction methods. This chapter provides an overview of publicly available databases of T-cell epitopes including general databases, pathogen- and tumor-specific databases, and 3D structure databases.
Collapse
|
48
|
Bunkute E, Cummins C, Crofts FJ, Bunce G, Nabney IT, Flower DR. PIP-DB: the Protein Isoelectric Point database. Bioinformatics 2014; 31:295-6. [PMID: 25252779 DOI: 10.1093/bioinformatics/btu637] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022] Open
Abstract
UNLABELLED A protein's isoelectric point or pI corresponds to the solution pH at which its net surface charge is zero. Since the early days of solution biochemistry, the pI has been recorded and reported, and thus literature reports of pI abound. The Protein Isoelectric Point database (PIP-DB) has collected and collated these data to provide an increasingly comprehensive database for comparison and benchmarking purposes. A web application has been developed to warehouse this database and provide public access to this unique resource. PIP-DB is a web-enabled SQL database with an HTML GUI front-end. PIP-DB is fully searchable across a range of properties. AVAILABILITY AND IMPLEMENTATION The PIP-DB database and documentation are available at http://www.pip-db.org.
Collapse
Affiliation(s)
- Egle Bunkute
- School of Life and Health Sciences and School of Engineering and Applied Science, University of Aston, Aston Triangle, Birmingham B4 7ET, UK
| | - Christopher Cummins
- School of Life and Health Sciences and School of Engineering and Applied Science, University of Aston, Aston Triangle, Birmingham B4 7ET, UK
| | - Fraser J Crofts
- School of Life and Health Sciences and School of Engineering and Applied Science, University of Aston, Aston Triangle, Birmingham B4 7ET, UK
| | - Gareth Bunce
- School of Life and Health Sciences and School of Engineering and Applied Science, University of Aston, Aston Triangle, Birmingham B4 7ET, UK
| | - Ian T Nabney
- School of Life and Health Sciences and School of Engineering and Applied Science, University of Aston, Aston Triangle, Birmingham B4 7ET, UK
| | - Darren R Flower
- School of Life and Health Sciences and School of Engineering and Applied Science, University of Aston, Aston Triangle, Birmingham B4 7ET, UK
| |
Collapse
|
49
|
Khalili S, Jahangiri A, Borna H, Ahmadi Zanoos K, Amani J. Computational vaccinology and epitope vaccine design by immunoinformatics. Acta Microbiol Immunol Hung 2014; 61:285-307. [PMID: 25261943 DOI: 10.1556/amicr.61.2014.3.4] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023]
Abstract
Human immune system includes variety of different cells and molecules correlating with other body systems. These instances complicate the analysis of the system; particularly in postgenomic era by introducing more amount of data, the complexity is increased and necessity of using computational approaches to process and interpret them is more tangible.Immunoinformatics as a subset of bioinformatics is a new approach with variety of tools and databases that facilitate analysis of enormous amount of immunologic data obtained from experimental researches. In addition to directing the insight regarding experiment selections, it helps new thesis design which was not feasible with conventional methods due to the complexity of data. Considering this features immunoinformatics appears to be one of the fields that accelerate the immunological research progression.In this study we discuss advances in genomics and vaccine design and their relevance to the development of effective vaccines furthermore several division of this field and available tools in each item are introduced.
Collapse
Affiliation(s)
- Saeed Khalili
- 1 Tarbiat Modares University Department of Medical Biotechnology Tehran Iran
| | - Abolfazl Jahangiri
- 2 Baqiyatallah University of Medical Sciences Applied Microbiology Research Center Tehran Iran
| | - Hojat Borna
- 3 Baqiyatallah Medical Science University Chemical Injuries Research Center Tehran Iran
| | | | - Jafar Amani
- 2 Baqiyatallah University of Medical Sciences Applied Microbiology Research Center Tehran Iran
| |
Collapse
|
50
|
Kim Y, Sidney J, Buus S, Sette A, Nielsen M, Peters B. Dataset size and composition impact the reliability of performance benchmarks for peptide-MHC binding predictions. BMC Bioinformatics 2014; 15:241. [PMID: 25017736 PMCID: PMC4111843 DOI: 10.1186/1471-2105-15-241] [Citation(s) in RCA: 53] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2014] [Accepted: 07/08/2014] [Indexed: 11/23/2022] Open
Abstract
Background It is important to accurately determine the performance of peptide:MHC binding predictions, as this enables users to compare and choose between different prediction methods and provides estimates of the expected error rate. Two common approaches to determine prediction performance are cross-validation, in which all available data are iteratively split into training and testing data, and the use of blind sets generated separately from the data used to construct the predictive method. In the present study, we have compared cross-validated prediction performances generated on our last benchmark dataset from 2009 with prediction performances generated on data subsequently added to the Immune Epitope Database (IEDB) which served as a blind set. Results We found that cross-validated performances systematically overestimated performance on the blind set. This was found not to be due to the presence of similar peptides in the cross-validation dataset. Rather, we found that small size and low sequence/affinity diversity of either training or blind datasets were associated with large differences in cross-validated vs. blind prediction performances. We use these findings to derive quantitative rules of how large and diverse datasets need to be to provide generalizable performance estimates. Conclusion It has long been known that cross-validated prediction performance estimates often overestimate performance on independently generated blind set data. We here identify and quantify the specific factors contributing to this effect for MHC-I binding predictions. An increasing number of peptides for which MHC binding affinities are measured experimentally have been selected based on binding predictions and thus are less diverse than historic datasets sampling the entire sequence and affinity space, making them more difficult benchmark data sets. This has to be taken into account when comparing performance metrics between different benchmarks, and when deriving error estimates for predictions based on benchmark performance. Electronic supplementary material The online version of this article (doi:10.1186/1471-2105-15-241) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
| | | | | | | | | | - Bjoern Peters
- La Jolla Institute for Allergy & Immunology, 9420 Athena Circle, La Jolla, CA 92037, USA.
| |
Collapse
|