1
|
Abstract
Immunoinformatics is a discipline that applies methods of computer science to study and model the immune system. A fundamental question addressed by immunoinformatics is how to understand the rules of antigen presentation by MHC molecules to T cells, a process that is central to adaptive immune responses to infections and cancer. In the modern era of personalized medicine, the ability to model and predict which antigens can be presented by MHC is key to manipulating the immune system and designing strategies for therapeutic intervention. Since the MHC is both polygenic and extremely polymorphic, each individual possesses a personalized set of MHC molecules with different peptide-binding specificities, and collectively they present a unique individualized peptide imprint of the ongoing protein metabolism. Mapping all MHC allotypes is an enormous undertaking that cannot be achieved without a strong bioinformatics component. Computational tools for the prediction of peptide-MHC binding have thus become essential in most pipelines for T cell epitope discovery and an inescapable component of vaccine and cancer research. Here, we describe the development of several such tools, from pioneering efforts to the current state-of-the-art methods, that have allowed for accurate predictions of peptide binding of all MHC molecules, even including those that have not yet been characterized experimentally.
Collapse
Affiliation(s)
- Morten Nielsen
- Department of Health Technology, Technical University of Denmark, DK-2800 Kongens Lyngby, Denmark
- Instituto de Investigaciones Biotecnológicas, Universidad Nacional de San Martín, CP 1650 San Martin, Buenos Aires, Argentina
| | - Massimo Andreatta
- Instituto de Investigaciones Biotecnológicas, Universidad Nacional de San Martín, CP 1650 San Martin, Buenos Aires, Argentina
| | - Bjoern Peters
- Division of Vaccine Discovery, La Jolla Institute for Immunology, La Jolla, California 92037, USA
- Department of Medicine, University of California, San Diego, La Jolla, California 92093, USA
| | - Søren Buus
- Department of Immunology and Microbiology, Faculty of Health Sciences, University of Copenhagen, DK-2200 Copenhagen, Denmark
| |
Collapse
|
2
|
Abstract
Throughout the body, T cells monitor MHC-bound ligands expressed on the surface of essentially all cell types. MHC ligands that trigger a T cell immune response are referred to as T cell epitopes. Identifying such epitopes enables tracking, phenotyping, and stimulating T cells involved in immune responses in infectious disease, allergy, autoimmunity, transplantation, and cancer. The specific T cell epitopes recognized in an individual are determined by genetic factors such as the MHC molecules the individual expresses, in parallel to the individual's environmental exposure history. The complexity and importance of T cell epitope mapping have motivated the development of computational approaches that predict what T cell epitopes are likely to be recognized in a given individual or in a broader population. Such predictions guide experimental epitope mapping studies and enable computational analysis of the immunogenic potential of a given protein sequence region.
Collapse
Affiliation(s)
- Bjoern Peters
- Division of Vaccine Discovery, La Jolla Institute for Immunology, La Jolla, California 92037, USA; ,
- Department of Medicine, University of California San Diego, La Jolla, California 92093, USA
| | - Morten Nielsen
- Department of Health Technology, Technical University of Denmark, DK-2800 Kgs. Lyngby, Denmark;
- Instituto de Investigaciones Biotecnológicas, Universidad Nacional de San Martín, B1650 Buenos Aires, Argentina
| | - Alessandro Sette
- Division of Vaccine Discovery, La Jolla Institute for Immunology, La Jolla, California 92037, USA; ,
- Department of Medicine, University of California San Diego, La Jolla, California 92093, USA
| |
Collapse
|
3
|
Charlton Hume HK, Vidigal J, Carrondo MJT, Middelberg APJ, Roldão A, Lua LHL. Synthetic biology for bioengineering virus-like particle vaccines. Biotechnol Bioeng 2019; 116:919-935. [PMID: 30597533 PMCID: PMC7161758 DOI: 10.1002/bit.26890] [Citation(s) in RCA: 56] [Impact Index Per Article: 11.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2018] [Revised: 11/08/2018] [Accepted: 11/29/2018] [Indexed: 12/13/2022]
Abstract
Vaccination is the most effective method of disease prevention and control. Many viruses and bacteria that once caused catastrophic pandemics (e.g., smallpox, poliomyelitis, measles, and diphtheria) are either eradicated or effectively controlled through routine vaccination programs. Nonetheless, vaccine manufacturing remains incredibly challenging. Viruses exhibiting high antigenic diversity and high mutation rates cannot be fairly contested using traditional vaccine production methods and complexities surrounding the manufacturing processes, which impose significant limitations. Virus-like particles (VLPs) are recombinantly produced viral structures that exhibit immunoprotective traits of native viruses but are noninfectious. Several VLPs that compositionally match a given natural virus have been developed and licensed as vaccines. Expansively, a plethora of studies now confirms that VLPs can be designed to safely present heterologous antigens from a variety of pathogens unrelated to the chosen carrier VLPs. Owing to this design versatility, VLPs offer technological opportunities to modernize vaccine supply and disease response through rational bioengineering. These opportunities are greatly enhanced with the application of synthetic biology, the redesign and construction of novel biological entities. This review outlines how synthetic biology is currently applied to engineer VLP functions and manufacturing process. Current and developing technologies for the identification of novel target-specific antigens and their usefulness for rational engineering of VLP functions (e.g., presentation of structurally diverse antigens, enhanced antigen immunogenicity, and improved vaccine stability) are described. When applied to manufacturing processes, synthetic biology approaches can also overcome specific challenges in VLP vaccine production. Finally, we address several challenges and benefits associated with the translation of VLP vaccine development into the industry.
Collapse
Affiliation(s)
- Hayley K. Charlton Hume
- The University of Queensland, Australian Institute of Bioengineering and NanotechnologySt LuciaQueenslandAustralia
| | - João Vidigal
- Health & Pharma Division, Animal Cell Technology Unit, Instituto de Biologia Experimental e Tecnológica (iBET)OeirasPortugal
- Health & Pharma Division, Animal Cell Technology Unit, Instituto de Tecnologia Química e Biológica António Xavier, Universidade Nova de Lisboa, Av. da RepúblicaOeirasPortugal
| | - Manuel J. T. Carrondo
- Health & Pharma Division, Animal Cell Technology Unit, Instituto de Biologia Experimental e Tecnológica (iBET)OeirasPortugal
| | - Anton P. J. Middelberg
- Faculty of Engineering, Computer and Mathematical Sciences, The University of AdelaideAdelaideSouth AustraliaAustralia
| | - António Roldão
- Health & Pharma Division, Animal Cell Technology Unit, Instituto de Biologia Experimental e Tecnológica (iBET)OeirasPortugal
- Health & Pharma Division, Animal Cell Technology Unit, Instituto de Tecnologia Química e Biológica António Xavier, Universidade Nova de Lisboa, Av. da RepúblicaOeirasPortugal
| | | |
Collapse
|
4
|
Kar P, Ruiz-Perez L, Arooj M, Mancera RL. Current methods for the prediction of T-cell epitopes. Pept Sci (Hoboken) 2018. [DOI: 10.1002/pep2.24046] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Affiliation(s)
- Prattusha Kar
- School of Pharmacy and Biomedical Sciences; Curtin Health Innovation Research Institute and Curtin Institute for Computation, Curtin University; Perth Western Australia 6845 Australia
| | - Lanie Ruiz-Perez
- School of Pharmacy and Biomedical Sciences; Curtin Health Innovation Research Institute and Curtin Institute for Computation, Curtin University; Perth Western Australia 6845 Australia
| | - Mahreen Arooj
- School of Pharmacy and Biomedical Sciences; Curtin Health Innovation Research Institute and Curtin Institute for Computation, Curtin University; Perth Western Australia 6845 Australia
| | - Ricardo L. Mancera
- School of Pharmacy and Biomedical Sciences; Curtin Health Innovation Research Institute and Curtin Institute for Computation, Curtin University; Perth Western Australia 6845 Australia
| |
Collapse
|
5
|
Kume A, Kawai S, Kato R, Iwata S, Shimizu K, Honda H. Exploring high-affinity binding properties of octamer peptides by principal component analysis of tetramer peptides. J Biosci Bioeng 2016; 123:230-238. [PMID: 27618533 DOI: 10.1016/j.jbiosc.2016.08.005] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2016] [Revised: 07/25/2016] [Accepted: 08/17/2016] [Indexed: 11/24/2022]
Abstract
To investigate the binding properties of a peptide sequence, we conducted principal component analysis (PCA) of the physicochemical features of a tetramer peptide library comprised of 512 peptides, and the variables were reduced to two principal components. We selected IL-2 and IgG as model proteins and the binding affinity to these proteins was assayed using the 512 peptides mentioned above. PCA of binding affinity data showed that 16 and 18 variables were suitable for localizing IL-2 and IgG high-affinity binding peptides, respectively, into a restricted region of the PCA plot. We then investigated whether the binding affinity of octamer peptide libraries could be predicted using the identified region in the tetramer PCA. The results show that octamer high-affinity binding peptides were also concentrated in the tetramer high-affinity binding region of both IL-2 and IgG. The average fluorescence intensity of high-affinity binding peptides was 3.3- and 2.1-fold higher than that of low-affinity binding peptides for IL-2 and IgG, respectively. We conclude that PCA may be used to identify octamer peptides with high- or low-affinity binding properties from data from a tetramer peptide library.
Collapse
Affiliation(s)
- Akiko Kume
- Department of Biotechnology, Graduate School of Engineering, Nagoya University, Furo-cho, Chikusa-ku, Nagoya 464-8603, Japan
| | - Shun Kawai
- Department of Basic Medicinal Sciences, Graduate School of Pharmaceutical Sciences, Nagoya University, Furo-cho, Chikusa-ku, Nagoya 464-8601, Japan
| | - Ryuji Kato
- Department of Basic Medicinal Sciences, Graduate School of Pharmaceutical Sciences, Nagoya University, Furo-cho, Chikusa-ku, Nagoya 464-8601, Japan
| | - Shinmei Iwata
- Department of Biotechnology, Graduate School of Engineering, Nagoya University, Furo-cho, Chikusa-ku, Nagoya 464-8603, Japan
| | - Kazunori Shimizu
- Department of Biotechnology, Graduate School of Engineering, Nagoya University, Furo-cho, Chikusa-ku, Nagoya 464-8603, Japan
| | - Hiroyuki Honda
- Department of Biotechnology, Graduate School of Engineering, Nagoya University, Furo-cho, Chikusa-ku, Nagoya 464-8603, Japan.
| |
Collapse
|
6
|
sNebula, a network-based algorithm to predict binding between human leukocyte antigens and peptides. Sci Rep 2016; 6:32115. [PMID: 27558848 PMCID: PMC4997263 DOI: 10.1038/srep32115] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2016] [Accepted: 08/02/2016] [Indexed: 12/19/2022] Open
Abstract
Understanding the binding between human leukocyte antigens (HLAs) and peptides is important to understand the functioning of the immune system. Since it is time-consuming and costly to measure the binding between large numbers of HLAs and peptides, computational methods including machine learning models and network approaches have been developed to predict HLA-peptide binding. However, there are several limitations for the existing methods. We developed a network-based algorithm called sNebula to address these limitations. We curated qualitative Class I HLA-peptide binding data and demonstrated the prediction performance of sNebula on this dataset using leave-one-out cross-validation and five-fold cross-validations. This algorithm can predict not only peptides of different lengths and different types of HLAs, but also the peptides or HLAs that have no existing binding data. We believe sNebula is an effective method to predict HLA-peptide binding and thus improve our understanding of the immune system.
Collapse
|
7
|
Pan Y, Ke H, Yan Z, Geng Y, Asner N, Palani S, Munirathinam G, Dasari S, Nitiss KC, Bliss S, Patel P, Shen H, Reardon CA, Getz GS, Chen A, Zheng G. The western-type diet induces anti-HMGB1 autoimmunity in Apoe(-/-) mice. Atherosclerosis 2016; 251:31-38. [PMID: 27240253 PMCID: PMC4983250 DOI: 10.1016/j.atherosclerosis.2016.05.027] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/12/2015] [Revised: 05/15/2016] [Accepted: 05/18/2016] [Indexed: 10/21/2022]
Abstract
BACKGROUND AND AIMS Anti-HMGB1 autoimmunity plays a role in systemic lupus erythematosus (SLE). Because SLE increases atherosclerosis, we asked whether the same autoimmunity might play a role in atherogenesis. METHODS We looked for the induction of HMGB1-specific B and T cell responses by a western-type diet (WTD) in the Apoe(-/-) mouse model of atherosclerosis. We also determined whether modifying the responses modulates atherosclerosis. RESULTS In the plasma of male Apoe(-/-) mice fed WTD, the level of anti-HMGB1 antibodies (Abs) was detected at ∼50 μg/ml, which was ∼6 times higher than that in either Apoe(-/-) mice fed a normal chow or Apoe(+/+) mice fed WTD (p ≤ 0.0005). The Abs were directed largely toward a novel, dominant epitope of HMGB1 named HMW4; accordingly, compared with chow-fed mice, WTD-fed Apoe(-/-) mice had more activated HMW4-reactive B and T cells (p = 0.005 and p = 0.01, respectively). Compared with mock-immunized mice, Apoe(-/-) mice immunized with HMW4 along with an immunogenic adjuvant showed proportional increases in anti-HMW4 IgG and IgM Abs, HMW4-reactive B-1 and B-2 cells, and HMW4-reactive Treg and Teff cells, which was associated with ∼30% increase in aortic arch lesions (p ≤ 0.01) by two methods. In contrast, Apoe(-/-) mice immunized with HMW4 using a tolerogenic adjuvant showed preferential increases in anti-HMW4 IgM (over IgG) Abs, HMW4-reactive B-1 (over B-2) cells, and HMW4-specific Treg (over Teff) cells, which was associated with ∼40% decrease in aortic arch lesions (p ≤ 0.03). CONCLUSIONS Anti-HMGB1 autoimmunity may potentially play a role in atherogenesis.
Collapse
Affiliation(s)
- Yue Pan
- Department of Biomedical Sciences, University of Illinois College of Medicine at Rockford, Rockford, IL 61107, USA
| | - Hanzhong Ke
- Department of Biomedical Sciences, University of Illinois College of Medicine at Rockford, Rockford, IL 61107, USA
| | - Zhaoqi Yan
- Department of Biomedical Sciences, University of Illinois College of Medicine at Rockford, Rockford, IL 61107, USA
| | - Yajun Geng
- Department of Biomedical Sciences, University of Illinois College of Medicine at Rockford, Rockford, IL 61107, USA
| | - Nathan Asner
- Department of Biomedical Sciences, University of Illinois College of Medicine at Rockford, Rockford, IL 61107, USA
| | - Sunil Palani
- Department of Biomedical Sciences, University of Illinois College of Medicine at Rockford, Rockford, IL 61107, USA
| | - Gnanasekar Munirathinam
- Department of Biomedical Sciences, University of Illinois College of Medicine at Rockford, Rockford, IL 61107, USA
| | - Subramanyam Dasari
- Department of Biomedical Sciences, University of Illinois College of Medicine at Rockford, Rockford, IL 61107, USA
| | - Karin C Nitiss
- Department of Biomedical Sciences, University of Illinois College of Medicine at Rockford, Rockford, IL 61107, USA
| | - Sarah Bliss
- Department of Biomedical Sciences, University of Illinois College of Medicine at Rockford, Rockford, IL 61107, USA
| | - Priyanka Patel
- Department of Biomedical Sciences, University of Illinois College of Medicine at Rockford, Rockford, IL 61107, USA
| | - Hongming Shen
- Department of Biomedical Sciences, University of Illinois College of Medicine at Rockford, Rockford, IL 61107, USA
| | - Catherine A Reardon
- Department of Pathology (C.A.R., G.S.G.), University of Chicago, Chicago, IL 60637, USA
| | - Godfrey S Getz
- Department of Pathology (C.A.R., G.S.G.), University of Chicago, Chicago, IL 60637, USA
| | - Aoshuang Chen
- Department of Biomedical Sciences, University of Illinois College of Medicine at Rockford, Rockford, IL 61107, USA.
| | - Guoxing Zheng
- Department of Biomedical Sciences, University of Illinois College of Medicine at Rockford, Rockford, IL 61107, USA.
| |
Collapse
|
8
|
MetaMHCpan, A Meta Approach for Pan-Specific MHC Peptide Binding Prediction. Methods Mol Biol 2016. [PMID: 27076335 DOI: 10.1007/978-1-4939-3389-1_49] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register]
Abstract
Recent computational approaches in bioinformatics can achieve high performance, by which they can be a powerful support for performing real biological experiments, making biologists pay more attention to bioinformatics than before. In immunology, predicting peptides which can bind to MHC alleles is an important task, being tackled by many computational approaches. However, this situation causes a serious problem for immunologists to select the appropriate method to be used in bioinformatics. To overcome this problem, we develop an ensemble prediction-based Web server, which we call MetaMHCpan, consisting of two parts: MetaMHCIpan and MetaMHCIIpan, for predicting peptides which can bind MHC-I and MHC-II, respectively. MetaMHCIpan and MetaMHCIIpan use two (MHC2SKpan and LApan) and four (TEPITOPEpan, MHC2SKpan, LApan, and MHC2MIL) existing predictors, respectively. MetaMHCpan is available at http://datamining-iip.fudan.edu.cn/MetaMHCpan/index.php/pages/view/info .
Collapse
|
9
|
Nanda Kumar Y, Jeyakodi G, Gunasekaran K, Jambulingam P. Computational screening and characterization of putative vaccine candidates of Plasmodium vivax. J Biomol Struct Dyn 2015; 34:1736-50. [PMID: 26338678 DOI: 10.1080/07391102.2015.1090344] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023]
Abstract
Plasmodium vivax is the most prevalent species of malaria affecting millions of people annually worldwide and demands effective interventions to develop a successful vaccine. In this milieu, we have dedicated noteworthy efforts to characterize the proteome of P. vivax to give a lead for the epitope-based vaccine development. Membrane proteins of P. vivax were collected from SWISS PROT database and 10 antigenic proteins were identified among them by in silico analysis using multiple servers. T-cell and B-cell epitopes were identified and their immunity was assessed. Their ability to trigger humoral and cell-mediated responses was determined. Three dimensional models were constructed for the antigenic proteins using Modeller, Phyre2, and Modloop tools and their quality was validated using PROCHECK and ProSA-web validation servers. Further, the binding affinity and molecular interactions of these antigenic proteins were characterized by performing protein-protein docking against transmission-blocking anti-malaria antibody Fab2A8 (PDB ID: 3S62) using Z-dock module of Discovery Studio 4.0. The presence of potential B & T-cell epitopes, major histocompatibility complex-binding sites, and their efficient interactions with Fab2A8 antibody suggests the use of predicted antigenic proteins for the construction of multi-epitope peptide vaccine against P. vivax.
Collapse
Affiliation(s)
- Y Nanda Kumar
- a Biomedical Informatics Centre, Vector Control Research Centre , Indian Council of Medical Research , Pondicherry 605006 , India
| | - G Jeyakodi
- a Biomedical Informatics Centre, Vector Control Research Centre , Indian Council of Medical Research , Pondicherry 605006 , India
| | - K Gunasekaran
- a Biomedical Informatics Centre, Vector Control Research Centre , Indian Council of Medical Research , Pondicherry 605006 , India
| | - P Jambulingam
- a Biomedical Informatics Centre, Vector Control Research Centre , Indian Council of Medical Research , Pondicherry 605006 , India
| |
Collapse
|
10
|
Luo H, Ye H, Ng HW, Shi L, Tong W, Mendrick DL, Hong H. Machine Learning Methods for Predicting HLA-Peptide Binding Activity. Bioinform Biol Insights 2015; 9:21-9. [PMID: 26512199 PMCID: PMC4603527 DOI: 10.4137/bbi.s29466] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2015] [Revised: 07/30/2015] [Accepted: 08/02/2015] [Indexed: 11/23/2022] Open
Abstract
As major histocompatibility complexes in humans, the human leukocyte antigens (HLAs) have important functions to present antigen peptides onto T-cell receptors for immunological recognition and responses. Interpreting and predicting HLA–peptide binding are important to study T-cell epitopes, immune reactions, and the mechanisms of adverse drug reactions. We review different types of machine learning methods and tools that have been used for HLA–peptide binding prediction. We also summarize the descriptors based on which the HLA–peptide binding prediction models have been constructed and discuss the limitation and challenges of the current methods. Lastly, we give a future perspective on the HLA–peptide binding prediction method based on network analysis.
Collapse
Affiliation(s)
- Heng Luo
- National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR, USA. ; University of Arkansas at Little Rock/University of Arkansas for Medical Sciences Bioinformatics Graduate Program, Little Rock, AR, USA
| | - Hao Ye
- National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR, USA
| | - Hui Wen Ng
- National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR, USA
| | - Leming Shi
- Center for Pharmacogenomics, School of Pharmacy, Fudan University, Shanghai, China
| | - Weida Tong
- National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR, USA
| | - Donna L Mendrick
- National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR, USA
| | - Huixiao Hong
- National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR, USA
| |
Collapse
|
11
|
Kuksa PP, Min MR, Dugar R, Gerstein M. High-order neural networks and kernel methods for peptide-MHC binding prediction. Bioinformatics 2015. [PMID: 26206306 DOI: 10.1093/bioinformatics/btv371] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022] Open
Abstract
MOTIVATION Effective computational methods for peptide-protein binding prediction can greatly help clinical peptide vaccine search and design. However, previous computational methods fail to capture key nonlinear high-order dependencies between different amino acid positions. As a result, they often produce low-quality rankings of strong binding peptides. To solve this problem, we propose nonlinear high-order machine learning methods including high-order neural networks (HONNs) with possible deep extensions and high-order kernel support vector machines to predict major histocompatibility complex-peptide binding. RESULTS The proposed high-order methods improve quality of binding predictions over other prediction methods. With the proposed methods, a significant gain of up to 25-40% is observed on the benchmark and reference peptide datasets and tasks. In addition, for the first time, our experiments show that pre-training with high-order semi-restricted Boltzmann machines significantly improves the performance of feed-forward HONNs. Moreover, our experiments show that the proposed shallow HONN outperform the popular pre-trained deep neural network on most tasks, which demonstrates the effectiveness of modelling high-order feature interactions for predicting major histocompatibility complex-peptide binding. AVAILABILITY AND IMPLEMENTATION There is no associated distributable software. CONTACT renqiang@nec-labs.com or mark.gerstein@yale.edu SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Pavel P Kuksa
- Institute for Biomedical Informatics, Department of Pathology and Laboratory Medicine, University of Pennsylvania School of Medicine, Philadelphia, PA 19104, USA, Department of Machine Learning, NEC Laboratories America, Princeton, NJ 08540, USA
| | - Martin Renqiang Min
- Department of Machine Learning, NEC Laboratories America, Princeton, NJ 08540, USA
| | - Rishabh Dugar
- Department of Machine Learning, NEC Laboratories America, Princeton, NJ 08540, USA
| | - Mark Gerstein
- Program of Computational Biology and Bioinformatics and Department of Molecular Biophysics and Biochemistry and Department of Computer Science, Yale University, New Haven, CT 06511, USA
| |
Collapse
|
12
|
Xu Y, Luo C, Qian M, Huang X, Zhu S. MHC2MIL: a novel multiple instance learning based method for MHC-II peptide binding prediction by considering peptide flanking region and residue positions. BMC Genomics 2014; 15 Suppl 9:S9. [PMID: 25521198 PMCID: PMC4290625 DOI: 10.1186/1471-2164-15-s9-s9] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022] Open
Abstract
Background Computational prediction of major histocompatibility complex class II (MHC-II) binding peptides can assist researchers in understanding the mechanism of immune systems and developing peptide based vaccines. Although many computational methods have been proposed, the performance of these methods are far from satisfactory. The difficulty of MHC-II peptide binding prediction comes mainly from the large length variation of binding peptides. Methods We develop a novel multiple instance learning based method called MHC2MIL, in order to predict MHC-II binding peptides. We deem each peptide in MHC2MIL as a bag, and some substrings of the peptide as the instances in the bag. Unlike previous multiple instance learning based methods that consider only instances of fixed length 9 (9 amino acids), MHC2MIL is able to deal with instances of both lengths of 9 and 11 (11 amino acids), simultaneously. As such, MHC2MIL incorporates important information in the peptide flanking region. For measuring the distances between different instances, furthermore, MHC2MIL explicitly highlights the amino acids in some important positions. Results Experimental results on a benchmark dataset have shown that, the performance of MHC2MIL is significantly improved by considering the instances of both 9 and 11 amino acids, as well as by emphasizing amino acids at key positions in the instance. The results are consistent with those reported in the literature on MHC-II peptide binding. In addition to five important positions (1, 4, 6, 7 and 9) for HLA(human leukocyte antigen, the name of MHC in Humans) DR peptide binding, we also find that position 2 may play some roles in the binding process. By using 5-fold cross validation on the benchmark dataset, MHC2MIL outperforms two state-of-the-art methods of MHC2SK and NN-align with being statistically significant, on 12 HLA DP and DQ molecules. In addition, it achieves comparable performance with MHC2SK and NN-align on 14 HLA DR molecules. MHC2MIL is freely available at http://datamining-iip.fudan.edu.cn/service/MHC2MIL/index.html.
Collapse
|
13
|
Gupta S, Chavan S, Deobagkar DN, Deobagkar DD. Bio/chemoinformatics in India: an outlook. Brief Bioinform 2014; 16:710-31. [PMID: 25159593 DOI: 10.1093/bib/bbu028] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2014] [Accepted: 07/28/2014] [Indexed: 12/25/2022] Open
Abstract
With the advent of significant establishment and development of Internet facilities and computational infrastructure, an overview on bio/chemoinformatics is presented along with its multidisciplinary facts, promises and challenges. The Government of India has paved the way for more profound research in biological field with the use of computational facilities and schemes/projects to collaborate with scientists from different disciplines. Simultaneously, the growth of available biomedical data has provided fresh insight into the nature of redundant and compensatory data. Today, bioinformatics research in India is characterized by a powerful grid computing systems, great variety of biological questions addressed and the close collaborations between scientists and clinicians, with a full spectrum of focuses ranging from database building and methods development to biological discoveries. In fact, this outlook provides a resourceful platform highlighting the funding agencies, institutes and industries working in this direction, which would certainly be of great help to students seeking their career in bioinformatics. Thus, in short, this review highlights the current bio/chemoinformatics trend, educations, status, diverse applicability and demands for further development.
Collapse
|
14
|
Chen A, Geng Y, Ke H, Constant L, Yan Z, Pan Y, Lee P, Tan I, Williams K, George S, Munirathinam G, Reardon CA, Getz GS, Wang B, Zheng G. Cutting edge: Dexamethasone potentiates the responses of both regulatory T cells and B-1 cells to antigen immunization in the ApoE(-/-) mouse model of atherosclerosis. THE JOURNAL OF IMMUNOLOGY 2014; 193:35-9. [PMID: 24899497 DOI: 10.4049/jimmunol.1302469] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/02/2023]
Abstract
The immunosuppressant dexamethasone was shown to preferentially deplete CD4+ effector T cells while sparing regulatory T cells (Tregs) in vivo. In the current study, we show that it also preferentially depletes B-2 cells while sparing B-1 cells. In the ApoE(-/-) mouse model of atherosclerosis, in which both Tregs and B-1 cells are thought to play an atheroprotective role, we show that HSP60-targeted immunization in the presence of dexamethasone raises Ag-reactive Tregs and B-1 cells concomitantly and reduces the severity of atherosclerosis. These results indicate that dexamethasone is an adjuvant that potentiates both the Treg and B-1 responses to immunogens. This study shows that B-1 cells with a specificity for a disease-relevant Ag can be raised in vivo by immunization.
Collapse
Affiliation(s)
- Aoshuang Chen
- Department of Biomedical Sciences, University of Illinois College of Medicine at Rockford, Rockford, IL 61107;
| | - Yajun Geng
- Department of Biomedical Sciences, University of Illinois College of Medicine at Rockford, Rockford, IL 61107
| | - Hanzhong Ke
- Department of Biomedical Sciences, University of Illinois College of Medicine at Rockford, Rockford, IL 61107
| | - Laura Constant
- Department of Biomedical Sciences, University of Illinois College of Medicine at Rockford, Rockford, IL 61107
| | - Zhaoqi Yan
- Department of Biomedical Sciences, University of Illinois College of Medicine at Rockford, Rockford, IL 61107
| | - Yue Pan
- Department of Biomedical Sciences, University of Illinois College of Medicine at Rockford, Rockford, IL 61107
| | - Patricia Lee
- Department of Biomedical Sciences, University of Illinois College of Medicine at Rockford, Rockford, IL 61107
| | - Isaiah Tan
- Department of Biomedical Sciences, University of Illinois College of Medicine at Rockford, Rockford, IL 61107
| | - Kurt Williams
- Department of Biomedical Sciences, University of Illinois College of Medicine at Rockford, Rockford, IL 61107
| | - Samantha George
- Department of Biomedical Sciences, University of Illinois College of Medicine at Rockford, Rockford, IL 61107
| | - Gnanasekar Munirathinam
- Department of Biomedical Sciences, University of Illinois College of Medicine at Rockford, Rockford, IL 61107
| | | | - Godfrey S Getz
- Department of Pathology, University of Chicago, Chicago, IL 60637; and
| | - Bin Wang
- Key Laboratory of Medical Molecular Virology of Ministry of Health and Ministry of Education, Fudan University, Shanghai 200032, China
| | - Guoxing Zheng
- Department of Biomedical Sciences, University of Illinois College of Medicine at Rockford, Rockford, IL 61107;
| |
Collapse
|
15
|
Guo L, Luo C, Zhu S. MHC2SKpan: a novel kernel based approach for pan-specific MHC class II peptide binding prediction. BMC Genomics 2013; 14 Suppl 5:S11. [PMID: 24564280 PMCID: PMC3852073 DOI: 10.1186/1471-2164-14-s5-s11] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Computational methods for the prediction of Major Histocompatibility Complex (MHC) class II binding peptides play an important role in facilitating the understanding of immune recognition and the process of epitope discovery. To develop an effective computational method, we need to consider two important characteristics of the problem: (1) the length of binding peptides is highly flexible; and (2) MHC molecules are extremely polymorphic and for the vast majority of them there are no sufficient training data. METHODS We develop a novel string kernel MHC2SK (MHC-II String Kernel) method to measure the similarities among peptides with variable lengths. By considering the distinct features of MHC-II peptide binding prediction problem, MHC2SK differs significantly from the recently developed kernel based method, GS (Generic String) kernel, in the way of computing similarities. Furthermore, we extend MHC2SK to MHC2SKpan for pan-specific MHC-II peptide binding prediction by leveraging the binding data of various MHC molecules. RESULTS MHC2SK outperformed GS in allele specific prediction using a benchmark dataset, which demonstrates the effectiveness of MHC2SK. Furthermore, we evaluated the performance of MHC2SKpan using various benckmark data sets from several different perspectives: Leave-one-allele-out (LOO), 5-fold cross validation as well as independent data testing. MHC2SKpan has achieved comparable performance with NetMHCIIpan-2.0 and outperformed NetMHCIIpan-1.0, TEPITOPEpan and MultiRTA, being statistically significant. MHC2SKpan can be freely accessed at http://datamining-iip.fudan.edu.cn/service/MHC2SKpan/index.html.
Collapse
|
16
|
Zhang L, Chen Y, Wong HS, Zhou S, Mamitsuka H, Zhu S. TEPITOPEpan: extending TEPITOPE for peptide binding prediction covering over 700 HLA-DR molecules. PLoS One 2012; 7:e30483. [PMID: 22383964 PMCID: PMC3285624 DOI: 10.1371/journal.pone.0030483] [Citation(s) in RCA: 77] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2011] [Accepted: 12/16/2011] [Indexed: 12/28/2022] Open
Abstract
Motivation Accurate identification of peptides binding to specific Major Histocompatibility Complex Class II (MHC-II) molecules is of great importance for elucidating the underlying mechanism of immune recognition, as well as for developing effective epitope-based vaccines and promising immunotherapies for many severe diseases. Due to extreme polymorphism of MHC-II alleles and the high cost of biochemical experiments, the development of computational methods for accurate prediction of binding peptides of MHC-II molecules, particularly for the ones with few or no experimental data, has become a topic of increasing interest. TEPITOPE is a well-used computational approach because of its good interpretability and relatively high performance. However, TEPITOPE can be applied to only 51 out of over 700 known HLA DR molecules. Method We have developed a new method, called TEPITOPEpan, by extrapolating from the binding specificities of HLA DR molecules characterized by TEPITOPE to those uncharacterized. First, each HLA-DR binding pocket is represented by amino acid residues that have close contact with the corresponding peptide binding core residues. Then the pocket similarity between two HLA-DR molecules is calculated as the sequence similarity of the residues. Finally, for an uncharacterized HLA-DR molecule, the binding specificity of each pocket is computed as a weighted average in pocket binding specificities over HLA-DR molecules characterized by TEPITOPE. Result The performance of TEPITOPEpan has been extensively evaluated using various data sets from different viewpoints: predicting MHC binding peptides, identifying HLA ligands and T-cell epitopes and recognizing binding cores. Among the four state-of-the-art competing pan-specific methods, for predicting binding specificities of unknown HLA-DR molecules, TEPITOPEpan was roughly the second best method next to NETMHCIIpan-2.0. Additionally, TEPITOPEpan achieved the best performance in recognizing binding cores. We further analyzed the motifs detected by TEPITOPEpan, examining the corresponding literature of immunology. Its online server and PSSMs therein are available at http://www.biokdd.fudan.edu.cn/Service/TEPITOPEpan/.
Collapse
Affiliation(s)
- Lianming Zhang
- School of Computer Science and Shanghai Key Lab of Intelligent Information Processing, Fudan University, Shanghai, China
| | - Yiqing Chen
- School of Computer Science and Shanghai Key Lab of Intelligent Information Processing, Fudan University, Shanghai, China
| | - Hau-San Wong
- Department of Computer Science, City University of Hong Kong, Hong Kong, China
| | - Shuigeng Zhou
- School of Computer Science and Shanghai Key Lab of Intelligent Information Processing, Fudan University, Shanghai, China
| | - Hiroshi Mamitsuka
- Bioinformatics Center, Institute for Chemical Research, Kyoto University, Gokasho, Uji, Japan
| | - Shanfeng Zhu
- School of Computer Science and Shanghai Key Lab of Intelligent Information Processing, Fudan University, Shanghai, China
- * E-mail:
| |
Collapse
|
17
|
He J, Yang G, Rao H, Li Z, Ding X, Chen Y. Prediction of human major histocompatibility complex class II binding peptides by continuous kernel discrimination method. Artif Intell Med 2011; 55:107-15. [PMID: 22134095 DOI: 10.1016/j.artmed.2011.10.005] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2011] [Revised: 10/12/2011] [Accepted: 10/21/2011] [Indexed: 11/25/2022]
Abstract
OBJECTIVE Accurate prediction of major histocompatibility complex (MHC) class II binding peptides helps reducing the experimental cost for identifying helper T cell epitopes, which has been a challenging problem partly because of the variable length of the binding peptides. This work is to develop an accurate model for predicting MHC-binding peptides using machine learning methods. METHODS In this work, a machine learning method, continuous kernel discrimination (CKD), was used for predicting MHC class II binders of variable lengths. The composition transition and distribution features were used for encoding peptide sequence and the Metropolis Monte Carlo simulated annealing approach was used for feature selection. RESULTS Feature selection was found to significantly improve the performance of the model. For benchmark dataset Dataset-1, the number of features is reduced from 147 to 24 and the area under the receiver operating characteristic curve (AUC) is improved from 0.8088 to 0.9034, while for benchmark dataset Dataset-2, the number of features is reduced from 147 to 44 and the AUC is improved from 0.7349 to 0.8499. An optimal CKD model was derived from the feature selection and bandwidth optimization using 10-fold cross-validation. Its AUC values are between 0.831 and 0.980 evaluated on benchmark datasets BM-Set1 and are between 0.806 and 0.949 on benchmark datasets BM-Set2 for MHC class II alleles. These results indicate a significantly better performance for our CKD model over other earlier models based on the training and testing of the same datasets. CONCLUSIONS Our study suggested that the CKD method outperforms other machine learning methods proposed earlier in the prediction of MHC class II biding peptides. Moreover, the choice of the cut-off for CKD classifier is crucial for its performance.
Collapse
Affiliation(s)
- Ju He
- College of Chemistry, Sichuan University, Chengdu 610064, People's Republic of China
| | | | | | | | | | | |
Collapse
|
18
|
EL-Manzalawy Y, Dobbs D, Honavar V. Predicting MHC-II binding affinity using multiple instance regression. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2011; 8:1067-1079. [PMID: 20855923 PMCID: PMC3400677 DOI: 10.1109/tcbb.2010.94] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/29/2023]
Abstract
Reliably predicting the ability of antigen peptides to bind to major histocompatibility complex class II (MHC-II) molecules is an essential step in developing new vaccines. Uncovering the amino acid sequence correlates of the binding affinity of MHC-II binding peptides is important for understanding pathogenesis and immune response. The task of predicting MHC-II binding peptides is complicated by the significant variability in their length. Most existing computational methods for predicting MHC-II binding peptides focus on identifying a nine amino acids core region in each binding peptide. We formulate the problems of qualitatively and quantitatively predicting flexible length MHC-II peptides as multiple instance learning and multiple instance regression problems, respectively. Based on this formulation, we introduce MHCMIR, a novel method for predicting MHC-II binding affinity using multiple instance regression. We present results of experiments using several benchmark data sets that show that MHCMIR is competitive with the state-of-the-art methods for predicting MHC-II binding peptides. An online web server that implements the MHCMIR method for MHC-II binding affinity prediction is freely accessible at http://ailab.cs.iastate.edu/mhcmir.
Collapse
Affiliation(s)
- Yasser EL-Manzalawy
- Department of Systems and Computers Engineering, Al-Azhar University, Cairo, Egypt.
| | | | | |
Collapse
|
19
|
Atanasova M, Dimitrov I, Flower DR, Doytchinova I. MHC Class II Binding Prediction by Molecular Docking. Mol Inform 2011; 30:368-75. [PMID: 27466953 DOI: 10.1002/minf.201000132] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2010] [Accepted: 12/01/2010] [Indexed: 01/08/2023]
Abstract
Proteins of the Major Histocompatibility Complex (MHC) bind self and nonself peptide antigens or epitopes within the cell and present them at the cell surface for recognition by T cells. All T-cell epitopes are MHC binders but not all MCH binders are T-cell epitopes. The MHC class II proteins are extremely polymorphic. Polymorphic residues cluster in the peptide-binding region and largely determine the MHC's peptide selectivity. The peptide binding site on MHC class II proteins consist of five binding pockets. Using molecular docking, we have modelled the interactions between peptide and MHC class II proteins from locus DRB1. A combinatorial peptide library was generated by mutation of residues at peptide positions which correspond to binding pockets (so called anchor positions). The binding affinities were assessed using different scoring functions. The normalized scoring functions for each amino acid at each anchor position were used to construct quantitative matrices (QM) for MHC class II binding prediction. Models were validated by external test sets comprising 4540 known binders. Eighty percent of the known binders are identified in the best predicted 15 % of all overlapping peptides, originating from one protein.
Collapse
Affiliation(s)
- M Atanasova
- Faculty of Pharmacy, Medical University of Sofia, 2 Dunav str, 1000 Sofia, Bulgaria phone: +359 2 9236599; fax: +359 2 9879874.
| | - I Dimitrov
- Faculty of Pharmacy, Medical University of Sofia, 2 Dunav str, 1000 Sofia, Bulgaria phone: +359 2 9236599; fax: +359 2 9879874
| | - D R Flower
- Life and Health Sciences, Aston University, Aston Triangle, Birmingham, B4 7ET, UK
| | - I Doytchinova
- Faculty of Pharmacy, Medical University of Sofia, 2 Dunav str, 1000 Sofia, Bulgaria phone: +359 2 9236599; fax: +359 2 9879874
| |
Collapse
|
20
|
Bordner AJ. Towards universal structure-based prediction of class II MHC epitopes for diverse allotypes. PLoS One 2010; 5:e14383. [PMID: 21187956 PMCID: PMC3004863 DOI: 10.1371/journal.pone.0014383] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2010] [Accepted: 11/30/2010] [Indexed: 01/10/2023] Open
Abstract
The binding of peptide fragments of antigens to class II MHC proteins is a crucial step in initiating a helper T cell immune response. The discovery of these peptide epitopes is important for understanding the normal immune response and its misregulation in autoimmunity and allergies and also for vaccine design. In spite of their biomedical importance, the high diversity of class II MHC proteins combined with the large number of possible peptide sequences make comprehensive experimental determination of epitopes for all MHC allotypes infeasible. Computational methods can address this need by predicting epitopes for a particular MHC allotype. We present a structure-based method for predicting class II epitopes that combines molecular mechanics docking of a fully flexible peptide into the MHC binding cleft followed by binding affinity prediction using a machine learning classifier trained on interaction energy components calculated from the docking solution. Although the primary advantage of structure-based prediction methods over the commonly employed sequence-based methods is their applicability to essentially any MHC allotype, this has not yet been convincingly demonstrated. In order to test the transferability of the prediction method to different MHC proteins, we trained the scoring method on binding data for DRB1*0101 and used it to make predictions for multiple MHC allotypes with distinct peptide binding specificities including representatives from the other human class II MHC loci, HLA-DP and HLA-DQ, as well as for two murine allotypes. The results showed that the prediction method was able to achieve significant discrimination between epitope and non-epitope peptides for all MHC allotypes examined, based on AUC values in the range 0.632-0.821. We also discuss how accounting for peptide binding in multiple registers to class II MHC largely explains the systematically worse performance of prediction methods for class II MHC compared with those for class I MHC based on quantitative prediction performance estimates for peptide binding to class II MHC in a fixed register.
Collapse
Affiliation(s)
- Andrew J Bordner
- Department of Molecular Pharmacology and Experimental Therapeutics, Mayo Clinic, Scottsdale, Arizona, United States of America.
| |
Collapse
|
21
|
Nelson PA, Khodadoust M, Prodhomme T, Spencer C, Patarroyo JC, Varrin-Doyer M, Ho JD, Stroud RM, Zamvil SS. Immunodominant T cell determinants of aquaporin-4, the autoantigen associated with neuromyelitis optica. PLoS One 2010; 5:e15050. [PMID: 21151500 PMCID: PMC2994828 DOI: 10.1371/journal.pone.0015050] [Citation(s) in RCA: 38] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2010] [Accepted: 10/14/2010] [Indexed: 01/02/2023] Open
Abstract
Autoantibodies that target the water channel aquaporin-4 (AQP4) in neuromyelitis optica (NMO) are IgG1, a T cell-dependent Ig subclass. However, a role for AQP4-specific T cells in this CNS inflammatory disease is not known. To evaluate their potential role in CNS autoimmunity, we have identified and characterized T cells that respond to AQP4 in C57BL/6 and SJL/J mice, two strains that are commonly studied in models of CNS inflammatory diseases. Mice were immunized with either overlapping peptides or intact hAQP4 protein encompassing the entire 323 amino acid sequence. T cell determinants identified from examination of the AQP4 peptide (p) library were located within AQP4 p21-40, p91-110, p101-120, p166-180, p231-250 and p261-280 in C57BL/6 mice, and within p11-30, p21-40, p101-120, p126-140 and p261-280 in SJL/J mice. AQP4-specific T cells were CD4+ and MHC II-restricted. In recall responses to immunization with intact AQP4, T cells responded primarily to p21-40, indicating this region contains the immunodominant T cell epitope(s) for both strains. AQP4 p21-40-primed T cells secreted both IFN-γ and IL-17. The core immunodominant AQP4 21-40 T cell determinant was mapped to residues 24-35 in C57BL/6 mice and 23-35 in SJL/J mice. Our identification of the AQP4 T cell determinants and characterization of its immunodominant determinant should permit investigators to evaluate the role of AQP4-specific T cells in vivo and to develop AQP4-targeted murine NMO models.
Collapse
Affiliation(s)
- Patricia A. Nelson
- Department of Neurology, University of California San Francisco, San Francisco, California, United States of America
| | - Mojgan Khodadoust
- Department of Neurology, University of California San Francisco, San Francisco, California, United States of America
| | - Thomas Prodhomme
- Department of Neurology, University of California San Francisco, San Francisco, California, United States of America
| | - Collin Spencer
- Department of Neurology, University of California San Francisco, San Francisco, California, United States of America
| | - Juan Carlos Patarroyo
- Department of Neurology, University of California San Francisco, San Francisco, California, United States of America
| | - Michel Varrin-Doyer
- Department of Neurology, University of California San Francisco, San Francisco, California, United States of America
| | - Joseph D. Ho
- Department of Biochemistry, University of California San Francisco, San Francisco, California, United States of America
| | - Robert M. Stroud
- Department of Biochemistry, University of California San Francisco, San Francisco, California, United States of America
| | - Scott S. Zamvil
- Department of Neurology, University of California San Francisco, San Francisco, California, United States of America
- * E-mail:
| |
Collapse
|
22
|
Nielsen M, Justesen S, Lund O, Lundegaard C, Buus S. NetMHCIIpan-2.0 - Improved pan-specific HLA-DR predictions using a novel concurrent alignment and weight optimization training procedure. Immunome Res 2010; 6:9. [PMID: 21073747 PMCID: PMC2994798 DOI: 10.1186/1745-7580-6-9] [Citation(s) in RCA: 116] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2010] [Accepted: 11/13/2010] [Indexed: 01/16/2023] Open
Abstract
Background Binding of peptides to Major Histocompatibility class II (MHC-II) molecules play a central role in governing responses of the adaptive immune system. MHC-II molecules sample peptides from the extracellular space allowing the immune system to detect the presence of foreign microbes from this compartment. Predicting which peptides bind to an MHC-II molecule is therefore of pivotal importance for understanding the immune response and its effect on host-pathogen interactions. The experimental cost associated with characterizing the binding motif of an MHC-II molecule is significant and large efforts have therefore been placed in developing accurate computer methods capable of predicting this binding event. Prediction of peptide binding to MHC-II is complicated by the open binding cleft of the MHC-II molecule, allowing binding of peptides extending out of the binding groove. Moreover, the genes encoding the MHC molecules are immensely diverse leading to a large set of different MHC molecules each potentially binding a unique set of peptides. Characterizing each MHC-II molecule using peptide-screening binding assays is hence not a viable option. Results Here, we present an MHC-II binding prediction algorithm aiming at dealing with these challenges. The method is a pan-specific version of the earlier published allele-specific NN-align algorithm and does not require any pre-alignment of the input data. This allows the method to benefit also from information from alleles covered by limited binding data. The method is evaluated on a large and diverse set of benchmark data, and is shown to significantly out-perform state-of-the-art MHC-II prediction methods. In particular, the method is found to boost the performance for alleles characterized by limited binding data where conventional allele-specific methods tend to achieve poor prediction accuracy. Conclusions The method thus shows great potential for efficient boosting the accuracy of MHC-II binding prediction, as accurate predictions can be obtained for novel alleles at highly reduced experimental costs. Pan-specific binding predictions can be obtained for all alleles with know protein sequence and the method can benefit by including data in the training from alleles even where only few binders are known. The method and benchmark data are available at http://www.cbs.dtu.dk/services/NetMHCIIpan-2.0
Collapse
Affiliation(s)
- Morten Nielsen
- Center A for Biological Sequence Analysis, BioCentrum-DTU, Building 208, Technical University of Denmark, DK-2800 Lyngby, Denmark.
| | | | | | | | | |
Collapse
|
23
|
King CA, Bradley P. Structure-based prediction of protein-peptide specificity in Rosetta. Proteins 2010; 78:3437-49. [PMID: 20954182 DOI: 10.1002/prot.22851] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2010] [Revised: 07/16/2010] [Accepted: 07/28/2010] [Indexed: 01/03/2023]
Abstract
Protein-peptide interactions mediate many of the connections in intracellular signaling networks. A generalized computational framework for atomically precise modeling of protein-peptide specificity may allow for predicting molecular interactions, anticipating the effects of drugs and genetic mutations, and redesigning molecules for new interactions. We have developed an extensible, general algorithm for structure-based prediction of protein-peptide specificity as part of the Rosetta molecular modeling package. The algorithm is not restricted to any one peptide-binding domain family and, at minimum, does not require an experimentally characterized structure of the target protein nor any information about sequence specificity; although known structural data can be incorporated when available to improve performance. We demonstrate substantial success in specificity prediction across a diverse set of peptide-binding proteins, and show how performance is affected when incorporating varying degrees of input structural data. We also illustrate how structure-based approaches can provide atomic-level insight into mechanisms of peptide recognition and can predict the effects of point mutations on peptide specificity. Shortcomings and artifacts of our benchmark predictions are explained and limits on the generality of the method are explored. This work provides a promising foundation upon which further development of completely generalized, de novo prediction of peptide specificity may progress.
Collapse
Affiliation(s)
- Christopher A King
- Computational Biology Program, Fred Hutchinson Cancer Research Center, Seattle, Washington 98109, USA.
| | | |
Collapse
|
24
|
Bordner AJ, Mittelmann HD. MultiRTA: a simple yet reliable method for predicting peptide binding affinities for multiple class II MHC allotypes. BMC Bioinformatics 2010; 11:482. [PMID: 20868497 PMCID: PMC2957400 DOI: 10.1186/1471-2105-11-482] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2010] [Accepted: 09/24/2010] [Indexed: 12/26/2022] Open
Abstract
Background The binding of peptide fragments of antigens to class II MHC is a crucial step in initiating a helper T cell immune response. The identification of such peptide epitopes has potential applications in vaccine design and in better understanding autoimmune diseases and allergies. However, comprehensive experimental determination of peptide-MHC binding affinities is infeasible due to MHC diversity and the large number of possible peptide sequences. Computational methods trained on the limited experimental binding data can address this challenge. We present the MultiRTA method, an extension of our previous single-type RTA prediction method, which allows the prediction of peptide binding affinities for multiple MHC allotypes not used to train the model. Thus predictions can be made for many MHC allotypes for which experimental binding data is unavailable. Results We fit MultiRTA models for both HLA-DR and HLA-DP using large experimental binding data sets. The performance in predicting binding affinities for novel MHC allotypes, not in the training set, was tested in two different ways. First, we performed leave-one-allele-out cross-validation, in which predictions are made for one allotype using a model fit to binding data for the remaining MHC allotypes. Comparison of the HLA-DR results with those of two other prediction methods applied to the same data sets showed that MultiRTA achieved performance comparable to NetMHCIIpan and better than the earlier TEPITOPE method. We also directly tested model transferability by making leave-one-allele-out predictions for additional experimentally characterized sets of overlapping peptide epitopes binding to multiple MHC allotypes. In addition, we determined the applicability of prediction methods like MultiRTA to other MHC allotypes by examining the degree of MHC variation accounted for in the training set. An examination of predictions for the promiscuous binding CLIP peptide revealed variations in binding affinity among alleles as well as potentially distinct binding registers for HLA-DR and HLA-DP. Finally, we analyzed the optimal MultiRTA parameters to discover the most important peptide residues for promiscuous and allele-specific binding to HLA-DR and HLA-DP allotypes. Conclusions The MultiRTA method yields competitive performance but with a significantly simpler and physically interpretable model compared with previous prediction methods. A MultiRTA prediction webserver is available at http://bordnerlab.org/MultiRTA.
Collapse
Affiliation(s)
- Andrew J Bordner
- Mayo Clinic, 13400 East Shea Boulevard, Scottsdale, AZ 85259, USA.
| | | |
Collapse
|
25
|
Zhang W, Liu J, Niu Y. Quantitative prediction of MHC-II binding affinity using particle swarm optimization. Artif Intell Med 2010; 50:127-32. [PMID: 20541921 DOI: 10.1016/j.artmed.2010.05.003] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2009] [Revised: 03/31/2010] [Accepted: 05/12/2010] [Indexed: 01/13/2023]
Abstract
OBJECTIVE Helper T-cell epitopes (Th epitopes) are the basic units which activate helper T-cell's immune response, and they are helpful for understanding the immune mechanism and developing vaccines. Peptide and major histocompatibility complex class II (MHC-II) binding is an important prerequisite event for helper T-cell immune response, and the binding peptides are usually recognized as Th epitopes, therefore we can identify Th epitopes by predicting MHC-II binding peptides. Recently, instead of differentiating the peptides as binder or non-binder, researchers are more interested in predicting binding affinities between MHC-II molecules and peptides. METHODOLOGY Motivated by the collective search strategy of the particle swarm optimization algorithm (PSO), a method was developed to make the direct prediction of peptide binding affinity. In our paper, PSO was utilized to search for the optimal position-specific scoring matrices (PSSM) from the experimentally derived allele-related peptides, and then the prediction models were constructed based on the matrices. Moreover, we evaluated several factors influencing the binding affinity, including peptide length and flanking residue length, and incorporated them into our models. RESULTS The performance of our models was evaluated on three MHC-II alleles from AntiJen database and 14 MHC-II alleles from IEDB database. When compared to the existing popular quantitative methods such as MHCPred, SVRMHC, ARB and SMM-align, our method can give out better performance in terms of correlation coefficient (r) and area under ROC curve (AUC). In addition, the results demonstrated that the performance of models was further improved by incorporating the global length information, achieving average AUC value of 0.7534 and average r value of 0.4707. CONCLUSIONS Quantitative prediction of MHC-II binding affinity can be modeled as an optimization problem. Our PSO based method can find the optimal PSSM, which will then be used for identifying the binding cores and scoring the binding affinities of the peptides. The experiment results show that our method is promising for the prediction of MHC-II binding affinity.
Collapse
Affiliation(s)
- Wen Zhang
- School of Computer Science, Wuhan University, Wuhan 430072, People's Republic of China.
| | | | | |
Collapse
|
26
|
MHC Class II Binding Prediction-A Little Help from a Friend. J Biomed Biotechnol 2010; 2010:705821. [PMID: 20508817 PMCID: PMC2875769 DOI: 10.1155/2010/705821] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2009] [Revised: 01/20/2010] [Accepted: 02/22/2010] [Indexed: 11/18/2022] Open
Abstract
Vaccines are the greatest single instrument of prophylaxis against infectious diseases, with immeasurable benefits to human wellbeing. The accurate and reliable prediction of peptide-MHC binding is fundamental to the robust identification of T-cell epitopes and thus the successful design of peptide- and protein-based vaccines. The prediction of MHC class II peptide binding has hitherto proved recalcitrant and refractory. Here we illustrate the utility of existing computational tools for in silico prediction of peptides binding to class II MHCs. Most of the methods, tested in the present study, detect more than the half of the true binders in the top 5% of all possible nonamers generated from one protein. This number increases in the top 10% and 15% and then does not change significantly. For the top 15% the identified binders approach 86%. In terms of lab work this means 85% less expenditure on materials, labour and time. We show that while existing caveats are well founded, nonetheless use of computational models of class II binding can still offer viable help to the work of the immunologist and vaccinologist.
Collapse
|
27
|
Hu X, Zhou W, Udaka K, Mamitsuka H, Zhu S. MetaMHC: a meta approach to predict peptides binding to MHC molecules. Nucleic Acids Res 2010; 38:W474-9. [PMID: 20483919 PMCID: PMC2896142 DOI: 10.1093/nar/gkq407] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022] Open
Abstract
As antigenic peptides binding to major histocompatibility complex (MHC) molecules is the prerequisite of cellular immune responses, an accurate computational predictor will be of great benefit to biologists and immunologists for understanding the underlying mechanism of immune recognition as well as facilitating the process of epitope mapping and vaccine design. Although various computational approaches have been developed, recent experimental results on benchmark data sets show that the development of improved predictors is needed, especially for MHC Class II peptide binding. To make the most of current methods and achieve a higher predictive performance, we developed a new web server, MetaMHC, to integrate the outputs of leading predictors by several popular ensemble strategies. MetaMHC consists of two components: MetaMHCI and MetaMHCII for MHC Class I peptide and MHC Class II peptide binding predictions, respectively. Experimental results by both cross-validation and using an independent data set show that the ensemble approaches outperform individual predictors, being statistically significant. MetaMHC is freely available at http://www.biokdd.fudan.edu.cn/Service/MetaMHC.html.
Collapse
Affiliation(s)
- Xihao Hu
- School of Computer Science and Shanghai Key Lab of Intelligent Information Processing, Fudan University, Shanghai 200433, China
| | | | | | | | | |
Collapse
|
28
|
Abstract
SUMMARY Major histocompatibility complex class II (MHC-II) molecules sample peptides from the extracellular space, allowing the immune system to detect the presence of foreign microbes from this compartment. To be able to predict the immune response to given pathogens, a number of methods have been developed to predict peptide-MHC binding. However, few methods other than the pioneering TEPITOPE/ProPred method have been developed for MHC-II. Despite recent progress in method development, the predictive performance for MHC-II remains significantly lower than what can be obtained for MHC-I. One reason for this is that the MHC-II molecule is open at both ends allowing binding of peptides extending out of the groove. The binding core of MHC-II-bound peptides is therefore not known a priori and the binding motif is hence not readily discernible. Recent progress has been obtained by including the flanking residues in the predictions. All attempts to make ab initio predictions based on protein structure have failed to reach predictive performances similar to those that can be obtained by data-driven methods. Thousands of different MHC-II alleles exist in humans. Recently developed pan-specific methods have been able to make reasonably accurate predictions for alleles that were not included in the training data. These methods can be used to define supertypes (clusters) of MHC-II alleles where alleles within each supertype have similar binding specificities. Furthermore, the pan-specific methods have been used to make a graphical atlas such as the MHCMotifviewer, which allows for visual comparison of specificities of different alleles.
Collapse
Affiliation(s)
- Morten Nielsen
- Department of Systems Biology, Technical University of Denmark, Centre for Biological Sequence Analysis, Lyngby, Denmark.
| | | | | | | |
Collapse
|
29
|
|
30
|
Bordner AJ, Mittelmann HD. Prediction of the binding affinities of peptides to class II MHC using a regularized thermodynamic model. BMC Bioinformatics 2010; 11:41. [PMID: 20089173 PMCID: PMC2828437 DOI: 10.1186/1471-2105-11-41] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2009] [Accepted: 01/20/2010] [Indexed: 12/25/2022] Open
Abstract
Background The binding of peptide fragments of extracellular peptides to class II MHC is a crucial event in the adaptive immune response. Each MHC allotype generally binds a distinct subset of peptides and the enormous number of possible peptide epitopes prevents their complete experimental characterization. Computational methods can utilize the limited experimental data to predict the binding affinities of peptides to class II MHC. Results We have developed the Regularized Thermodynamic Average, or RTA, method for predicting the affinities of peptides binding to class II MHC. RTA accounts for all possible peptide binding conformations using a thermodynamic average and includes a parameter constraint for regularization to improve accuracy on novel data. RTA was shown to achieve higher accuracy, as measured by AUC, than SMM-align on the same data for all 17 MHC allotypes examined. RTA also gave the highest accuracy on all but three allotypes when compared with results from 9 different prediction methods applied to the same data. In addition, the method correctly predicted the peptide binding register of 17 out of 18 peptide-MHC complexes. Finally, we found that suboptimal peptide binding registers, which are often ignored in other prediction methods, made significant contributions of at least 50% of the total binding energy for approximately 20% of the peptides. Conclusions The RTA method accurately predicts peptide binding affinities to class II MHC and accounts for multiple peptide binding registers while reducing overfitting through regularization. The method has potential applications in vaccine design and in understanding autoimmune disorders. A web server implementing the RTA prediction method is available at http://bordnerlab.org/RTA/.
Collapse
|
31
|
Nielsen M, Lund O. NN-align. An artificial neural network-based alignment algorithm for MHC class II peptide binding prediction. BMC Bioinformatics 2009; 10:296. [PMID: 19765293 PMCID: PMC2753847 DOI: 10.1186/1471-2105-10-296] [Citation(s) in RCA: 380] [Impact Index Per Article: 25.3] [Reference Citation Analysis] [Abstract] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2009] [Accepted: 09/18/2009] [Indexed: 01/03/2023] Open
Abstract
BACKGROUND The major histocompatibility complex (MHC) molecule plays a central role in controlling the adaptive immune response to infections. MHC class I molecules present peptides derived from intracellular proteins to cytotoxic T cells, whereas MHC class II molecules stimulate cellular and humoral immunity through presentation of extracellularly derived peptides to helper T cells. Identification of which peptides will bind a given MHC molecule is thus of great importance for the understanding of host-pathogen interactions, and large efforts have been placed in developing algorithms capable of predicting this binding event. RESULTS Here, we present a novel artificial neural network-based method, NN-align that allows for simultaneous identification of the MHC class II binding core and binding affinity. NN-align is trained using a novel training algorithm that allows for correction of bias in the training data due to redundant binding core representation. Incorporation of information about the residues flanking the peptide-binding core is shown to significantly improve the prediction accuracy. The method is evaluated on a large-scale benchmark consisting of six independent data sets covering 14 human MHC class II alleles, and is demonstrated to outperform other state-of-the-art MHC class II prediction methods. CONCLUSION The NN-align method is competitive with the state-of-the-art MHC class II peptide binding prediction algorithms. The method is publicly available at http://www.cbs.dtu.dk/services/NetMHCII-2.0.
Collapse
Affiliation(s)
- Morten Nielsen
- Center for Biological Sequence Analysis, Department of Systems Biology, Technical University of Denmark, DK-2800 Lyngby, Denmark.
| | | |
Collapse
|
32
|
Rajapakse M, Feng L. Peptide binding to major histocompatibility complex: prediction and characterization using an evolutionary algorithmic approach. IEEE ENGINEERING IN MEDICINE AND BIOLOGY MAGAZINE : THE QUARTERLY MAGAZINE OF THE ENGINEERING IN MEDICINE & BIOLOGY SOCIETY 2009; 28:73-77. [PMID: 19622428 DOI: 10.1109/memb.2009.932922] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/28/2023]
|
33
|
Davies MN, Flower DR. Computational Vaccinology. BIOINFORMATICS FOR IMMUNOMICS 2009. [PMCID: PMC7121138 DOI: 10.1007/978-1-4419-0540-6_1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
|
34
|
Lin HH, Zhang GL, Tongchusak S, Reinherz EL, Brusic V. Evaluation of MHC-II peptide binding prediction servers: applications for vaccine research. BMC Bioinformatics 2008; 9 Suppl 12:S22. [PMID: 19091022 PMCID: PMC2638162 DOI: 10.1186/1471-2105-9-s12-s22] [Citation(s) in RCA: 158] [Impact Index Per Article: 9.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Initiation and regulation of immune responses in humans involves recognition of peptides presented by human leukocyte antigen class II (HLA-II) molecules. These peptides (HLA-II T-cell epitopes) are increasingly important as research targets for the development of vaccines and immunotherapies. HLA-II peptide binding studies involve multiple overlapping peptides spanning individual antigens, as well as complete viral proteomes. Antigen variation in pathogens and tumor antigens, and extensive polymorphism of HLA molecules increase the number of targets for screening studies. Experimental screening methods are expensive and time consuming and reagents are not readily available for many of the HLA class II molecules. Computational prediction methods complement experimental studies, minimize the number of validation experiments, and significantly speed up the epitope mapping process. We collected test data from four independent studies that involved 721 peptide binding assays. Full overlapping studies of four antigens identified binding affinity of 103 peptides to seven common HLA-DR molecules (DRB1*0101, 0301, 0401, 0701, 1101, 1301, and 1501). We used these data to analyze performance of 21 HLA-II binding prediction servers accessible through the WWW. RESULTS Because not all servers have predictors for all tested HLA-II molecules, we assessed a total of 113 predictors. The length of test peptides ranged from 15 to 19 amino acids. We tried three prediction strategies - the best 9-mer within the longer peptide, the average of best three 9-mer predictions, and the average of all 9-mer predictions within the longer peptide. The best strategy was the identification of a single best 9-mer within the longer peptide. Overall, measured by the receiver operating characteristic method (AROC), 17 predictors showed good (AROC > 0.8), 41 showed marginal (AROC > 0.7), and 55 showed poor performance (AROC < 0.7). Good performance predictors included HLA-DRB1*0101 (seven), 1101 (six), 0401 (three), and 0701 (one). The best individual predictor was NETMHCIIPAN, closely followed by PROPRED, IEDB (Consensus), and MULTIPRED (SVM). None of the individual predictors was shown to be suitable for prediction of promiscuous peptides. Current predictive capabilities allow prediction of only 50% of actual T-cell epitopes using practical thresholds. CONCLUSION The available HLA-II servers do not match prediction capabilities of HLA-I predictors. Currently available HLA-II prediction servers offer only a limited prediction accuracy and the development of improved predictors is needed for large-scale studies, such as proteome-wide epitope mapping. The requirements for accuracy of HLA-II binding predictions are stringent because of the substantial effect of false positives.
Collapse
Affiliation(s)
- Hong Huang Lin
- Cancer Vaccine Center, Dana-Farber Cancer Institute, Boston, MA 02215, USA.
| | | | | | | | | |
Collapse
|
35
|
Zaitlen N, Reyes-Gomez M, Heckerman D, Jojic N. Shift-invariant adaptive double threading: learning MHC II-peptide binding. J Comput Biol 2008; 15:927-42. [PMID: 18771399 DOI: 10.1089/cmb.2007.0183] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022] Open
Abstract
The major histocompatibility complex (MHC) plays important roles in the workings of the human immune system. Specificity of MHC binding to peptide fragments from cellular and pathogens' proteins has been found to correlate with disease outcome and pathogen or cancer evolution. In this paper we propose a novel approach to predicting binding configurations and energies for MHC class II molecules, whose epitopes are generally predicted less well than the MHC I epitopes due in part to larger variation in bound peptide length. We treat the relative position of the peptide as a hidden variable, and model the ensemble of different binding configurations, rather than use a separate alignment procedure to narrow it down to one. Thus, our predictor infers a distribution over peptide positions from the MHC II and peptide sequences, and computes the total binding affinity. The training procedure iterates the predictions with re-estimation of the parameters of the binding groove model. For a given relative peptide position, any MHC class I prediction model can be used. Here we choose the physics based model of Jojic et al. (2006). We show that the parameters of the binding model can be learned efficiently from the training data and then used to estimate binding energies for previously untested peptides. Our technique performs on par with previous approaches to MHC II epitope prediction. Furthermore, our model choice allows generalization to new MHC class II alleles, which were not a part of the training set.
Collapse
Affiliation(s)
- Noah Zaitlen
- Bioinformatics Program, University of California, San Diego, La Jolla, California, USA
| | | | | | | |
Collapse
|
36
|
Zhang W, Liu J, Niu YQ, Wang L, Hu X. A Bayesian regression approach to the prediction of MHC-II binding affinity. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2008; 92:1-7. [PMID: 18562039 DOI: 10.1016/j.cmpb.2008.05.002] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/05/2008] [Revised: 05/06/2008] [Accepted: 05/06/2008] [Indexed: 05/26/2023]
Abstract
Peptide-major histocompatibility complex (MHC) binding is an important prerequisite event and has immediate consequences to immune response. Those peptides binding to MHC molecules can activate the T-cell immunity, and they are useful for understanding the immune mechanism and developing vaccines for diseases. Recently, researchers are interested in making prediction about binding affinity instead of differentiating the peptides as binder or non-binder. In this paper, we use sparse Bayesian regression algorithm proposed by Tipping [M.E. Tipping, Sparse Bayesian learning and the relevance vector machine. J. Mach. Learn. Res. (2001)] to derive position-specific scoring matrices from allele-related peptides, and develop the models allowing for the prediction of MHC-II binding affinity. We explore the peptide length and peptide flanking residue length's impact on binding affinity, and incorporate these factors into our models to enhance prediction performance. When applied to the datasets from AntiJen database and IEDB database, our method produces better performances than several popular quantitative methods.
Collapse
Affiliation(s)
- Wen Zhang
- School of Computer Science, Wuhan University, Wuhan 430079, China.
| | | | | | | | | |
Collapse
|
37
|
El-Manzalawy Y, Dobbs D, Honavar V. On evaluating MHC-II binding peptide prediction methods. PLoS One 2008; 3:e3268. [PMID: 18813344 PMCID: PMC2533399 DOI: 10.1371/journal.pone.0003268] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2008] [Accepted: 08/20/2008] [Indexed: 12/26/2022] Open
Abstract
Choice of one method over another for MHC-II binding peptide prediction is typically based on published reports of their estimated performance on standard benchmark datasets. We show that several standard benchmark datasets of unique peptides used in such studies contain a substantial number of peptides that share a high degree of sequence identity with one or more other peptide sequences in the same dataset. Thus, in a standard cross-validation setup, the test set and the training set are likely to contain sequences that share a high degree of sequence identity with each other, leading to overly optimistic estimates of performance. Hence, to more rigorously assess the relative performance of different prediction methods, we explore the use of similarity-reduced datasets. We introduce three similarity-reduced MHC-II benchmark datasets derived from MHCPEP, MHCBN, and IEDB databases. The results of our comparison of the performance of three MHC-II binding peptide prediction methods estimated using datasets of unique peptides with that obtained using their similarity-reduced counterparts shows that the former can be rather optimistic relative to the performance of the same methods on similarity-reduced counterparts of the same datasets. Furthermore, our results demonstrate that conclusions regarding the superiority of one method over another drawn on the basis of performance estimates obtained using commonly used datasets of unique peptides are often contradicted by the observed performance of the methods on the similarity-reduced versions of the same datasets. These results underscore the importance of using similarity-reduced datasets in rigorously comparing the performance of alternative MHC-II peptide prediction methods.
Collapse
Affiliation(s)
- Yasser El-Manzalawy
- Department of Computer Science, Center for Computational Intelligence, Learning, and Discovery, Iowa State University, Ames, Iowa, USA.
| | | | | |
Collapse
|
38
|
El-Manzalawy Y, Dobbs D, Honavar V. Predicting linear B-cell epitopes using string kernels. J Mol Recognit 2008; 21:243-55. [PMID: 18496882 PMCID: PMC2683948 DOI: 10.1002/jmr.893] [Citation(s) in RCA: 507] [Impact Index Per Article: 31.7] [Reference Citation Analysis] [Abstract] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
Abstract
The identification and characterization of B‐cell epitopes play an important role in vaccine design, immunodiagnostic tests, and antibody production. Therefore, computational tools for reliably predicting linear B‐cell epitopes are highly desirable. We evaluated Support Vector Machine (SVM) classifiers trained utilizing five different kernel methods using fivefold cross‐validation on a homology‐reduced data set of 701 linear B‐cell epitopes, extracted from Bcipep database, and 701 non‐epitopes, randomly extracted from SwissProt sequences. Based on the results of our computational experiments, we propose BCPred, a novel method for predicting linear B‐cell epitopes using the subsequence kernel. We show that the predictive performance of BCPred (AUC = 0.758) outperforms 11 SVM‐based classifiers developed and evaluated in our experiments as well as our implementation of AAP (AUC = 0.7), a recently proposed method for predicting linear B‐cell epitopes using amino acid pair antigenicity. Furthermore, we compared BCPred with AAP and ABCPred, a method that uses recurrent neural networks, using two data sets of unique B‐cell epitopes that had been previously used to evaluate ABCPred. Analysis of the data sets used and the results of this comparison show that conclusions about the relative performance of different B‐cell epitope prediction methods drawn on the basis of experiments using data sets of unique B‐cell epitopes are likely to yield overly optimistic estimates of performance of evaluated methods. This argues for the use of carefully homology‐reduced data sets in comparing B‐cell epitope prediction methods to avoid misleading conclusions about how different methods compare to each other. Our homology‐reduced data set and implementations of BCPred as well as the APP method are publicly available through our web‐based server, BCPREDS, at: http://ailab.cs.iastate.edu/bcpreds/. Copyright © 2008 John Wiley & Sons, Ltd.
Collapse
Affiliation(s)
- Yasser El-Manzalawy
- Artificial Intelligence Laboratory, Iowa State University, Ames, IA 50010, USA.
| | | | | |
Collapse
|
39
|
Pfeifer N, Kohlbacher O. Multiple Instance Learning Allows MHC Class II Epitope Predictions Across Alleles. LECTURE NOTES IN COMPUTER SCIENCE 2008. [DOI: 10.1007/978-3-540-87361-7_18] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/06/2023]
|
40
|
Nielsen M, Lundegaard C, Blicher T, Peters B, Sette A, Justesen S, Buus S, Lund O. Quantitative predictions of peptide binding to any HLA-DR molecule of known sequence: NetMHCIIpan. PLoS Comput Biol 2008; 4:e1000107. [PMID: 18604266 PMCID: PMC2430535 DOI: 10.1371/journal.pcbi.1000107] [Citation(s) in RCA: 215] [Impact Index Per Article: 13.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2008] [Accepted: 05/28/2008] [Indexed: 01/05/2023] Open
Abstract
CD4 positive T helper cells control many aspects of specific immunity. These cells are specific for peptides derived from protein antigens and presented by molecules of the extremely polymorphic major histocompatibility complex (MHC) class II system. The identification of peptides that bind to MHC class II molecules is therefore of pivotal importance for rational discovery of immune epitopes. HLA-DR is a prominent example of a human MHC class II. Here, we present a method, NetMHCIIpan, that allows for pan-specific predictions of peptide binding to any HLA-DR molecule of known sequence. The method is derived from a large compilation of quantitative HLA-DR binding events covering 14 of the more than 500 known HLA-DR alleles. Taking both peptide and HLA sequence information into account, the method can generalize and predict peptide binding also for HLA-DR molecules where experimental data is absent. Validation of the method includes identification of endogenously derived HLA class II ligands, cross-validation, leave-one-molecule-out, and binding motif identification for hitherto uncharacterized HLA-DR molecules. The validation shows that the method can successfully predict binding for HLA-DR molecules—even in the absence of specific data for the particular molecule in question. Moreover, when compared to TEPITOPE, currently the only other publicly available prediction method aiming at providing broad HLA-DR allelic coverage, NetMHCIIpan performs equivalently for alleles included in the training of TEPITOPE while outperforming TEPITOPE on novel alleles. We propose that the method can be used to identify those hitherto uncharacterized alleles, which should be addressed experimentally in future updates of the method to cover the polymorphism of HLA-DR most efficiently. We thus conclude that the presented method meets the challenge of keeping up with the MHC polymorphism discovery rate and that it can be used to sample the MHC “space,” enabling a highly efficient iterative process for improving MHC class II binding predictions. CD4 positive T helper cells provide essential help for stimulation of both cellular and humoral immune reactions. T helper cells recognize peptides presented by molecules of the major histocompatibility complex (MHC) class II system. HLA-DR is a prominent example of a human MHC class II locus. The HLA molecules are extremely polymorphic, and more than 500 different HLA-DR protein sequences are known today. Each HLA-DR molecule potentially binds a unique set of antigenic peptides, and experimental characterization of the binding specificity for each molecule would be an immense and highly costly task. Only a very limited set of MHC molecules has been characterized experimentally. We have demonstrated earlier that it is possible to derive accurate predictions for MHC class I proteins by interpolating information from neighboring molecules. It is not straightforward to take a similar approach to derive pan-specific HLA-DR class II predictions because the HLA class II molecules can bind peptides of very different lengths. Here, we nonetheless show that this is indeed possible. We develop an HLA-DR pan-specific method that allows for prediction of binding to any HLA-DR molecule of known sequence—even in the absence of specific data for the particular molecule in question.
Collapse
Affiliation(s)
- Morten Nielsen
- Center for Biological Sequence Analysis, Department of Systems Biology, Technical University of Denmark, Lyngby, Denmark.
| | | | | | | | | | | | | | | |
Collapse
|
41
|
Computationally assisted screening and design of cell-interactive peptides by a cell-based assay using peptide arrays and a fuzzy neural network algorithm. Biotechniques 2008; 44:393-402. [PMID: 18361793 DOI: 10.2144/000112693] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Abstract
We developed a method of effective peptide screening that combines experiments and computational analysis. The method is based on the concept that screening efficiency can be enhanced from even limited data by use of a model derived from computational analysis that serves as a guide to screening and combining the model with subsequent repeated experiments. Here we focus on cell-adhesion peptides as a model application of this peptide-screening strategy. Cell-adhesion peptides were screened by use of a cell-based assay of a peptide array. Starting with the screening data obtained from a limited, random 5-mer library (643 sequences), a rule regarding structural characteristics of cell-adhesion peptides was extracted by fuzzy neural network (FNN) analysis. According to this rule, peptides with unfavored residues in certain positions that led to inefficient binding were eliminated from the random sequences. In the restricted, second random library (273 sequences), the yield of cell-adhesion peptides having an adhesion rate more than 1.5-fold to that of the basal array support was significantly high (31%) compared with the unrestricted random library (20%). In the restricted third library (50 sequences), the yield of cell-adhesion peptides increased to 84%. We conclude that a repeated cycle of experiments screening limited numbers of peptides can be assisted by the rule-extracting feature of FNN.
Collapse
|
42
|
Zhang W, Liu J, Niu Y. Quantitative prediction of MHC-II peptide binding affinity using relevance vector machine. APPL INTELL 2008. [DOI: 10.1007/s10489-008-0121-3] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
|
43
|
Todman SJ, Halling-Brown MD, Davies MN, Flower DR, Kayikci M, Moss DS. Toward the atomistic simulation of T cell epitopes. J Mol Graph Model 2008; 26:957-61. [PMID: 17766153 DOI: 10.1016/j.jmgm.2007.07.005] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2007] [Revised: 07/25/2007] [Accepted: 07/25/2007] [Indexed: 01/01/2023]
Abstract
Epitopes mediated by T cells lie at the heart of the adaptive immune response and form the essential nucleus of anti-tumour peptide or epitope-based vaccines. Antigenic T cell epitopes are mediated by major histocompatibility complex (MHC) molecules, which present them to T cell receptors. Calculating the affinity between a given MHC molecule and an antigenic peptide using experimental approaches is both difficult and time consuming, thus various computational methods have been developed for this purpose. A server has been developed to allow a structural approach to the problem by generating specific MHC:peptide complex structures and providing configuration files to run molecular modelling simulations upon them. A system has been produced which allows the automated construction of MHC:peptide structure files and the corresponding configuration files required to execute a molecular dynamics simulation using NAMD. The system has been made available through a web-based front end and stand-alone scripts. Previous attempts at structural prediction of MHC:peptide affinity have been limited due to the paucity of structures and the computational expense in running large scale molecular dynamics simulations. The MHCsim server (http://igrid-ext.cryst.bbk.ac.uk/MHCsim) allows the user to rapidly generate any desired MHC:peptide complex and will facilitate molecular modelling simulation of MHC complexes on an unprecedented scale.
Collapse
Affiliation(s)
- Sarah J Todman
- Department of Crystallography, University of London, Birkbeck College, Malet Street, London WC1E 7HX, United Kingdom
| | | | | | | | | | | |
Collapse
|
44
|
Kumar KK, Shelokar PS. An SVM method using evolutionary information for the identification of allergenic proteins. Bioinformation 2008; 2:253-6. [PMID: 18317576 PMCID: PMC2258428 DOI: 10.6026/97320630002253] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/28/2007] [Revised: 01/17/2008] [Accepted: 01/19/2008] [Indexed: 12/27/2022] Open
Abstract
This study presents an allergenic protein prediction system that appears to be capable of producing high sensitivity and specificity. The proposed system is based on support vector machine (SVM) using evolutionary information in the form of an amino acid position specific scoring matrix (PSSM). The performance of this system is assessed by a 10-fold cross-validation experiment using a dataset consisting of 693 allergens and 1041 non-allergens obtained from Swiss-Prot and Structural Database of Allergenic Proteins (SDAP). The PSSM method produced an accuracy of 90.1% in comparison to the methods based on SVM using amino acid, dipeptide composition, pseudo (5-tier) amino acid composition that achieved an accuracy of 86.3, 86.5 and 82.1% respectively. The results show that evolutionary information can be useful to build more effective and efficient allergen prediction systems.
Collapse
Affiliation(s)
- Kandaswamy Krishna Kumar
- Insilico Consulting, 402, Citi Centre, 39/2 Erandwane, Karve Road, Pune-411004, Maharashtra, India.
| | | |
Collapse
|
45
|
EL-Manzalawy Y, Dobbs D, Honavar V. Predicting flexible length linear B-cell epitopes. COMPUTATIONAL SYSTEMS BIOINFORMATICS. COMPUTATIONAL SYSTEMS BIOINFORMATICS CONFERENCE 2008; 7:121-132. [PMID: 19642274 PMCID: PMC3400678] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Subscribe] [Scholar Register] [Indexed: 05/28/2023]
Abstract
Identifying B-cell epitopes play an important role in vaccine design, immunodiagnostic tests, and antibody production. Therefore, computational tools for reliably predicting B-cell epitopes are highly desirable. We explore two machine learning approaches for predicting flexible length linear B-cell epitopes. The first approach utilizes four sequence kernels for determining a similarity score between any arbitrary pair of variable length sequences. The second approach utilizes four different methods of mapping a variable length sequence into a fixed length feature vector. Based on our empirical comparisons, we propose FBCPred, a novel method for predicting flexible length linear B-cell epitopes using the subsequence kernel. Our results demonstrate that FBCPred significantly outperforms all other classifiers evaluated in this study. An implementation of FBCPred and the datasets used in this study are publicly available through our linear B-cell epitope prediction server, BCPREDS, at: http://ailab.cs.iastate.edu/bcpreds/.
Collapse
Affiliation(s)
- Yasser EL-Manzalawy
- Artificial Intelligence Laboratory, Iowa State University, Ames, IA 50010, USA
- Department of Computer Science, Iowa State University, Ames, IA 50010, USA
- Center for Computational Intelligence, Learning, and Discovery, Iowa State University, Ames, IA 50010, USA
| | - Drena Dobbs
- Department of Genetics, Development and Cell Biology, Iowa State University, Ames, IA 50010, USA
- Bioinformatics and Computational Biology Graduate Program, Iowa State University, Ames, IA 50010, USA
- Center for Computational Intelligence, Learning, and Discovery, Iowa State University, Ames, IA 50010, USA
| | - Vasant Honavar
- Artificial Intelligence Laboratory, Iowa State University, Ames, IA 50010, USA
- Department of Computer Science, Iowa State University, Ames, IA 50010, USA
- Bioinformatics and Computational Biology Graduate Program, Iowa State University, Ames, IA 50010, USA
- Center for Computational Intelligence, Learning, and Discovery, Iowa State University, Ames, IA 50010, USA
| |
Collapse
|
46
|
A probabilistic meta-predictor for the MHC class II binding peptides. Immunogenetics 2007; 60:25-36. [PMID: 18092156 DOI: 10.1007/s00251-007-0266-y] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2007] [Accepted: 11/21/2007] [Indexed: 12/27/2022]
Abstract
Several computational methods for the prediction of major histocompatibility complex (MHC) class II binding peptides embodying different strengths and weaknesses have been developed. To provide reliable prediction, it is important to design a system that enables the integration of outcomes from various predictors. The construction of a meta-predictor of this type based on a probabilistic approach is introduced in this paper. The design permits the easy incorporation of results obtained from any number of individual predictors. It is demonstrated that this integrated method outperforms six state-of-the-art individual predictors based on computational studies using MHC class II peptides from 13 HLA alleles and three mouse MHC alleles obtained from the Immune Epitope Database and Analysis Resource. It is concluded that this integrative approach provides a clearly enhanced reliability of prediction. Moreover, this computational framework can be directly extended to MHC class I binding predictions.
Collapse
|
47
|
Jacob L, Vert JP. Efficient peptide-MHC-I binding prediction for alleles with few known binders. ACTA ACUST UNITED AC 2007; 24:358-66. [PMID: 18083718 DOI: 10.1093/bioinformatics/btm611] [Citation(s) in RCA: 105] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]
Abstract
MOTIVATION In silico methods for the prediction of antigenic peptides binding to MHC class I molecules play an increasingly important role in the identification of T-cell epitopes. Statistical and machine learning methods in particular are widely used to score candidate binders based on their similarity with known binders and non-binders. The genes coding for the MHC molecules, however, are highly polymorphic, and statistical methods have difficulties building models for alleles with few known binders. In this context, recent work has demonstrated the utility of leveraging information across alleles to improve the performance of the prediction. RESULTS We design a support vector machine algorithm that is able to learn peptide-MHC-I binding models for many alleles simultaneously, by sharing binding information across alleles. The sharing of information is controlled by a user-defined measure of similarity between alleles. We show that this similarity can be defined in terms of supertypes, or more directly by comparing key residues known to play a role in the peptide-MHC binding. We illustrate the potential of this approach on various benchmark experiments where it outperforms other state-of-the-art methods. AVAILABILITY The method is implemented on a web server: http://cbio.ensmp.fr/kiss. All data and codes are freely and publicly available from the authors.
Collapse
Affiliation(s)
- Laurent Jacob
- Centre for Computational Biology, Ecole des Mines de Paris, 35 rue Saint Honoré, 77305 Fontainebleau Cedex, France.
| | | |
Collapse
|
48
|
Rajapakse M, Schmidt B, Feng L, Brusic V. Predicting peptides binding to MHC class II molecules using multi-objective evolutionary algorithms. BMC Bioinformatics 2007; 8:459. [PMID: 18031584 PMCID: PMC2212666 DOI: 10.1186/1471-2105-8-459] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2007] [Accepted: 11/22/2007] [Indexed: 12/30/2022] Open
Abstract
Background Peptides binding to Major Histocompatibility Complex (MHC) class II molecules are crucial for initiation and regulation of immune responses. Predicting peptides that bind to a specific MHC molecule plays an important role in determining potential candidates for vaccines. The binding groove in class II MHC is open at both ends, allowing peptides longer than 9-mer to bind. Finding the consensus motif facilitating the binding of peptides to a MHC class II molecule is difficult because of different lengths of binding peptides and varying location of 9-mer binding core. The level of difficulty increases when the molecule is promiscuous and binds to a large number of low affinity peptides. In this paper, we propose two approaches using multi-objective evolutionary algorithms (MOEA) for predicting peptides binding to MHC class II molecules. One uses the information from both binders and non-binders for self-discovery of motifs. The other, in addition, uses information from experimentally determined motifs for guided-discovery of motifs. Results The proposed methods are intended for finding peptides binding to MHC class II I-Ag7 molecule – a promiscuous binder to a large number of low affinity peptides. Cross-validation results across experiments on two motifs derived for I-Ag7 datasets demonstrate better generalization abilities and accuracies of the present method over earlier approaches. Further, the proposed method was validated and compared on two publicly available benchmark datasets: (1) an ensemble of qualitative HLA-DRB1*0401 peptide data obtained from five different sources, and (2) quantitative peptide data obtained for sixteen different alleles comprising of three mouse alleles and thirteen HLA alleles. The proposed method outperformed earlier methods on most datasets, indicating that it is well suited for finding peptides binding to MHC class II molecules. Conclusion We present two MOEA-based algorithms for finding motifs, one for self-discovery and the other for guided-discovery by experimentally determined motifs, and thereby predicting binding peptides to I-Ag7 molecule. Our experiments show that the proposed MOEA-based algorithms are better than earlier methods in predicting binding sites not only on I-Ag7 but also on most alleles of class II MHC benchmark datasets. This shows that our methods could be applicable to find binding motifs in a wide range of alleles.
Collapse
Affiliation(s)
- Menaka Rajapakse
- Institute for Infocomm Research, 21 Heng Mui Keng Terrace, 119613 Singapore.
| | | | | | | |
Collapse
|
49
|
Holm L, Frech K, Dzhambazov B, Holmdahl R, Kihlberg J, Linusson A. Quantitative Structure−Activity Relationship of Peptides Binding to the Class II Major Histocompatibility Complex Molecule Aq Associated with Autoimmune Arthritis. J Med Chem 2007; 50:2049-59. [PMID: 17425295 DOI: 10.1021/jm061209b] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]
Abstract
Presentation of (glyco)peptides by the class II major histocompatibility complex molecule Aq to T cells plays a central role in collagen-induced arthritis, an animal model for the autoimmune disease rheumatoid arthritis. A peptide library was designed using statistical molecular design in amino acid space in which five positions in the minimal mouse collagen type II binding epitope CII260-267 were varied. A substantially reduced peptide library of 24 peptides with diverse and representative molecular characteristics was selected, synthesized, and evaluated for the binding strength to Aq. A multivariate QSAR model was established by correlating calculated descriptors, compressed to its principle properties, with the binding data using partial least-square regression. The model was successfully validated by an external test set. Interpretation of the model provided a molecular property binding motif for peptides interacting with Aq. The information may be useful in future research directed toward new treatments of rheumatoid arthritis.
Collapse
Affiliation(s)
- Lotta Holm
- Department of Chemistry, Umeå University, SE-901 87 Umeå, Sweden
| | | | | | | | | | | |
Collapse
|