1
|
Zhang L, Liu T. PreAlgPro: Prediction of allergenic proteins with pre-trained protein language model and efficient neutral network. Int J Biol Macromol 2024; 280:135762. [PMID: 39322150 DOI: 10.1016/j.ijbiomac.2024.135762] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2024] [Revised: 09/03/2024] [Accepted: 09/16/2024] [Indexed: 09/27/2024]
Abstract
Allergy is a prevalent phenomenon, involving allergens such as nuts and milk. Avoiding exposure to allergens is the most effective preventive measure against allergic reactions. However, current homology-based methods for identifying allergenic proteins encounter challenges when dealing with non-homologous data. Traditional machine learning approaches rely on manually extracted features, which lack important protein functional characteristics, including evolutionary information. Consequently, there is still considerable room for improvement in existing methods. In this study, we present PreAlgPro, a method for identifying allergenic proteins based on pre-trained protein language models and deep learning techniques. Specifically, we employed the ProtT5 model to extract protein embedding features, replacing the manual feature extraction step. Furthermore, we devised an Attention-CNN neural network architecture to identify potential features that contribute to the classification of allergenic proteins. The performance of our model was evaluated on four independent test sets, and the experimental results demonstrate that PreAlgPro surpasses existing state-of-the-art methods. Additionally, we collected allergenic protein samples to validate the robustness of the model and conducted an analysis of model interpretability.
Collapse
Affiliation(s)
- Lingrong Zhang
- College of Information Technology, Shanghai Ocean University, Shanghai 201306, China
| | - Taigang Liu
- College of Information Technology, Shanghai Ocean University, Shanghai 201306, China.
| |
Collapse
|
2
|
Qureshi A, Connolly JB. Bioinformatic and literature assessment of toxicity and allergenicity of a CRISPR-Cas9 engineered gene drive to control Anopheles gambiae the mosquito vector of human malaria. Malar J 2023; 22:234. [PMID: 37580703 PMCID: PMC10426224 DOI: 10.1186/s12936-023-04665-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2022] [Accepted: 08/07/2023] [Indexed: 08/16/2023] Open
Abstract
BACKGROUND Population suppression gene drive is currently being evaluated, including via environmental risk assessment (ERA), for malaria vector control. One such gene drive involves the dsxFCRISPRh transgene encoding (i) hCas9 endonuclease, (ii) T1 guide RNA (gRNA) targeting the doublesex locus, and (iii) DsRed fluorescent marker protein, in genetically-modified mosquitoes (GMMs). Problem formulation, the first stage of ERA, for environmental releases of dsxFCRISPRh previously identified nine potential harms to the environment or health that could occur, should expressed products of the transgene cause allergenicity or toxicity. METHODS Amino acid sequences of hCas9 and DsRed were interrogated against those of toxins or allergens from NCBI, UniProt, COMPARE and AllergenOnline bioinformatic databases and the gRNA was compared with microRNAs from the miRBase database for potential impacts on gene expression associated with toxicity or allergenicity. PubMed was also searched for any evidence of toxicity or allergenicity of Cas9 or DsRed, or of the donor organisms from which these products were originally derived. RESULTS While Cas9 nuclease activity can be toxic to some cell types in vitro and hCas9 was found to share homology with the prokaryotic toxin VapC, there was no evidence from previous studies of a risk of toxicity to humans and other animals from hCas9. Although hCas9 did contain an 8-mer epitope found in the latex allergen Hev b 9, the full amino acid sequence of hCas9 was not homologous to any known allergens. Combined with a lack of evidence in the literature of Cas9 allergenicity, this indicated negligible risk to humans of allergenicity from hCas9. No matches were found between the gRNA and microRNAs from either Anopheles or humans. Moreover, potential exposure to dsxFCRISPRh transgenic proteins from environmental releases was assessed as negligible. CONCLUSIONS Bioinformatic and literature assessments found no convincing evidence to suggest that transgenic products expressed from dsxFCRISPRh were allergens or toxins, indicating that environmental releases of this population suppression gene drive for malaria vector control should not result in any increased allergenicity or toxicity in humans or animals. These results should also inform evaluations of other GMMs being developed for vector control and in vivo clinical applications of CRISPR-Cas9.
Collapse
Affiliation(s)
- Alima Qureshi
- Department of Life Sciences, Imperial College London, Silwood Park, Sunninghill, Ascot, UK
| | - John B Connolly
- Department of Life Sciences, Imperial College London, Silwood Park, Sunninghill, Ascot, UK.
| |
Collapse
|
3
|
He C, Ye X, Yang Y, Hu L, Si Y, Zhao X, Chen L, Fang Q, Wei Y, Wu F, Ye G. DeepAlgPro: an interpretable deep neural network model for predicting allergenic proteins. Brief Bioinform 2023:bbad246. [PMID: 37385595 DOI: 10.1093/bib/bbad246] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2023] [Revised: 05/08/2023] [Accepted: 06/13/2023] [Indexed: 07/01/2023] Open
Abstract
Allergies have become an emerging public health problem worldwide. The most effective way to prevent allergies is to find the causative allergen at the source and avoid re-exposure. However, most of the current computational methods used to identify allergens were based on homology or conventional machine learning methods, which were inefficient and still had room to be improved for the detection of allergens with low homology. In addition, few methods based on deep learning were reported, although deep learning has been successfully applied to several tasks in protein sequence analysis. In the present work, a deep neural network-based model, called DeepAlgPro, was proposed to identify allergens. We showed its great accuracy and applicability to large-scale forecasts by comparing it to other available tools. Additionally, we used ablation experiments to demonstrate the critical importance of the convolutional module in our model. Moreover, further analyses showed that epitope features contributed to model decision-making, thus improving the model's interpretability. Finally, we found that DeepAlgPro was capable of detecting potential new allergens. Overall, DeepAlgPro can serve as powerful software for identifying allergens.
Collapse
Affiliation(s)
- Chun He
- State Key Laboratory of Rice Biology and Breeding & Ministry of Agricultural and Rural Affairs Key Laboratory of Molecular Biology of Crop Pathogens and Insects, Institute of Insect Sciences, Zhejiang University, Hangzhou, China
| | - Xinhai Ye
- College of Computer Science and Technology, Zhejiang University, Hangzhou, China
- Shanghai Institute for Advanced Study, Zhejiang University, Shanghai, China
| | - Yi Yang
- State Key Laboratory of Rice Biology and Breeding & Ministry of Agricultural and Rural Affairs Key Laboratory of Molecular Biology of Crop Pathogens and Insects, Institute of Insect Sciences, Zhejiang University, Hangzhou, China
| | - Liya Hu
- College of Computer Science and Technology, Zhejiang University, Hangzhou, China
| | - Yuxuan Si
- College of Computer Science and Technology, Zhejiang University, Hangzhou, China
| | - Xianxin Zhao
- State Key Laboratory of Rice Biology and Breeding & Ministry of Agricultural and Rural Affairs Key Laboratory of Molecular Biology of Crop Pathogens and Insects, Institute of Insect Sciences, Zhejiang University, Hangzhou, China
| | - Longfei Chen
- State Key Laboratory of Rice Biology and Breeding & Ministry of Agricultural and Rural Affairs Key Laboratory of Molecular Biology of Crop Pathogens and Insects, Institute of Insect Sciences, Zhejiang University, Hangzhou, China
| | - Qi Fang
- State Key Laboratory of Rice Biology and Breeding & Ministry of Agricultural and Rural Affairs Key Laboratory of Molecular Biology of Crop Pathogens and Insects, Institute of Insect Sciences, Zhejiang University, Hangzhou, China
| | - Ying Wei
- Department of Computer Science, City University of Hong Kong, Hong Kong, China
| | - Fei Wu
- College of Computer Science and Technology, Zhejiang University, Hangzhou, China
- Shanghai Institute for Advanced Study, Zhejiang University, Shanghai, China
| | - Gongyin Ye
- State Key Laboratory of Rice Biology and Breeding & Ministry of Agricultural and Rural Affairs Key Laboratory of Molecular Biology of Crop Pathogens and Insects, Institute of Insect Sciences, Zhejiang University, Hangzhou, China
| |
Collapse
|
4
|
Goto K, Tamehiro N, Yoshida T, Hanada H, Sakuma T, Adachi R, Kondo K, Takeuchi I. Novel Machine Learning Method AllerStat Identifies Statistically Significant Allergen-Specific Patterns in Protein Sequences. J Biol Chem 2023; 299:104733. [PMID: 37086787 DOI: 10.1016/j.jbc.2023.104733] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2022] [Revised: 04/12/2023] [Accepted: 04/18/2023] [Indexed: 04/24/2023] Open
Abstract
Cutting-edge technologies such as genome editing and synthetic biology allow us to produce novel foods and functional proteins. However, their toxicity and allergenicity must be accurately evaluated. It is known that specific amino-acid sequences in proteins make some proteins allergic, but many of these sequences remain uncharacterized. In this study, we introduce a data-driven approach and a machine-learning (ML) method to find undiscovered allergen specific patterns (ASPs) among amino acid sequences. The proposed method enables an exhaustive search for amino-acid subsequences whose frequencies are statistically significantly higher in allergenic proteins. As a proof-of-concept, we created a database containing 21,154 proteins of which the presence or absence of allergic reactions are already known, and applied the proposed method to the database. The detected ASPs in this proof-of-concept study were consistent with known biological findings, and the allergenicity prediction performance using the detected ASPs was higher than extant approaches, indicating this method may be useful in evaluating the utility of synthetic foods and proteins.
Collapse
Affiliation(s)
- Kento Goto
- Department of Computer Science, Nagoya Institute of Technology. Gokiso-cho, Showa-ku, Nagoya, Aichi, 466-8555, Japan
| | - Norimasa Tamehiro
- Division of Biochemistry, National Institute of Health Sciences. 3-25-26 Tonomachi, Kawasaki-ku, Kawasaki, Kanagawa, 210-9501, Japan
| | - Takumi Yoshida
- Department of Computer Science, Nagoya Institute of Technology. Gokiso-cho, Showa-ku, Nagoya, Aichi, 466-8555, Japan
| | - Hiroyuki Hanada
- Center for Advanced Intelligence Project, RIKEN. 1-4-1 Nihonbashi, Chuo-ku, Tokyo, 103-0027, Japan
| | - Takuto Sakuma
- Department of Computer Science, Nagoya Institute of Technology. Gokiso-cho, Showa-ku, Nagoya, Aichi, 466-8555, Japan
| | - Reiko Adachi
- Division of Biochemistry, National Institute of Health Sciences. 3-25-26 Tonomachi, Kawasaki-ku, Kawasaki, Kanagawa, 210-9501, Japan
| | - Kazunari Kondo
- Division of Biochemistry, National Institute of Health Sciences. 3-25-26 Tonomachi, Kawasaki-ku, Kawasaki, Kanagawa, 210-9501, Japan.
| | - Ichiro Takeuchi
- Graduate School of Engineering, Nagoya University.Furo-cho, Chikusa-ku, Nagoya, 464-8603, Japan; Center for Advanced Intelligence Project, RIKEN. 1-4-1 Nihonbashi, Chuo-ku, Tokyo, 103-0027, Japan.
| |
Collapse
|
5
|
Negi H, Saxena H, Singh IK, Singh A. Herbivory-inducible lipid-transfer proteins (LTPs) of Cicer arietinum as potential human allergens. J Biomol Struct Dyn 2023; 41:12863-12879. [PMID: 36703620 DOI: 10.1080/07391102.2023.2169353] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2022] [Accepted: 01/10/2023] [Indexed: 01/28/2023]
Abstract
Lipid-transfer proteins (LTPs) are lipid-binding small proteins, ubiquitously distributed amongst plant kingdom. Apart from their involvement in plant defense, it has also been discovered that they induce allergic reactions in humans. A plethora of LTPs have been identified in vegetables, fruits, pollens, nuts, and latex, among which Pru p 3, a LTP allergen from peach fruit, is extensively studied and exhibits cross-reactivity with potential allergens from different species. In Cicer arietinum, a family of LTPs (CaLTPs) has been identified and their importance in plant defense during Helicoverpa armigera-infestation has been recognized. However, the determination of the allergenicity potential of CaLTPs has not been attempted. In this study, we aim to decipher the allergenicity potential of defense-related CaLTPs. The allergenicity potential prediction, and identification of B-cell epitope binding regions showed that the CaLTPs had conserved domains and B-cell epitopes in the same regions as Prup3 (a marker allergen for LTPs). Using molecular docking and simulations, we observed that the CaLTPs successfully interacted with the Immunoglobin E(IgE)with docking energies ranging from -315.5 to -268.4 and the structures were stabilized within 10 ns of simulation. Through this study, we intend to embellish our present knowledge and understanding of the sensitization and allergenicity potential of CaLTPs.Communicated by Ramaswamy H. Sarma.
Collapse
Affiliation(s)
- Harshita Negi
- Department of Botany, Hans Raj College, University of Delhi, Delhi, India
- Department of Biological Sciences, University of South Carolina, Columbia, SC, USA
| | - Harshita Saxena
- Department of Botany, Hans Raj College, University of Delhi, Delhi, India
- Institute of Plant Breeding, Genetics and Genomics, University of Georgia, Griffin, GA, USA
| | - Indrakant K Singh
- Department of Zoology, Deshbandhu College, University of Delhi, New Delhi, India
| | - Archana Singh
- Department of Botany, Hans Raj College, University of Delhi, Delhi, India
- J C Bose Centre for Plant Genomics, Hansraj College, University of Delhi, Delhi, India
- Delhi School of Climate Change and Sustainability, Institution of Eminence, Maharishi Karnad Bhawan, University of Delhi, Delhi, India
| |
Collapse
|
6
|
Vij S, Thakur R, Kumari L, Suri CR, Rishi P. Potential of a novel flagellin epitope as a broad-spectrum vaccine candidate against enteric fever. Microb Pathog 2023; 174:105936. [PMID: 36494021 DOI: 10.1016/j.micpath.2022.105936] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2022] [Revised: 11/24/2022] [Accepted: 12/05/2022] [Indexed: 12/12/2022]
Abstract
Relentless emergence of antibiotic resistant Salmonella strains, coupled with the drawbacks associated with currently available vaccines against enteric fever, warrants an urgent need to look for new vaccine candidates. Out of the multiple virulence factors harbored by Salmonella, flagella are regarded as one of the most important targets of innate as well as adaptive immune response. Individual Salmonella serotypes alternate between expression of two different antigenic forms encoded by fliC and fljB genes, respectively thereby employing this as a strategy to escape the host immune response. In the present study, using various immunoinformatic approaches, a flagellin epitope, present in both antigenic forms of typhoidal Salmonellae has been targeted. Following B-cell epitope and B-cell derived T-cell epitope prediction and interaction studies with major histocompatibility complexes using molecular docking, a peptide epitope was selected. Further, it was screened for its presence in majority of typhoidal serovars along with other useful attributes, in silico. Thereafter, safety studies were performed with the synthesized peptide. Subsequently, immunization studies were carried out using S. Typhi as well as S. Paratyphi A induced murine peritonitis model. Active immunization with peptide-BSA conjugate resulted in 75% and 80% mice survival following lethal challenge with S. Typhi and S. Paratyphi A respectively, along with a significant IgG antibody titer, thereby highlighting its immunogenic potential. Reduced bacterial burden in vital organs along with improved histoarchitecture and cytokine levels further substantiated the protective efficacy of the proposed candidate. Passive immunization studies with the candidate verified the protective efficacy of the generated antibodies against lethal challenge of bacteria in mice. Given the endemic nature of enteric fever and the antigenic variability observed in Salmonella serotypes, present study highlights the importance of using a vaccine candidate, which, along with generating a strong immune response, also exhibits a broad coverage against both, S. Typhi as well as S. Paratyphi A strains.
Collapse
Affiliation(s)
- Shania Vij
- Department of Microbiology, Basic Medical Sciences, Block I, South Campus, Panjab University, Chandigarh, India
| | - Reena Thakur
- Department of Microbiology, Basic Medical Sciences, Block I, South Campus, Panjab University, Chandigarh, India
| | - Laxmi Kumari
- University Institute of Pharmaceutical Sciences, Panjab University, Chandigarh, India
| | | | - Praveen Rishi
- Department of Microbiology, Basic Medical Sciences, Block I, South Campus, Panjab University, Chandigarh, India.
| |
Collapse
|
7
|
Identification of allergens and allergen hydrolysates by proteomics and metabolomics: A comparative study of natural and enzymolytic bee pollen. Food Res Int 2022; 158:111572. [DOI: 10.1016/j.foodres.2022.111572] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2022] [Revised: 06/15/2022] [Accepted: 06/23/2022] [Indexed: 11/23/2022]
|
8
|
Islam E. Development of epitope-based chimeric protein as a vaccine against Lujo virus by utilizing immunoinformatic tools. Future Virol 2022. [DOI: 10.2217/fvl-2021-0105] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
Aim: Lujo is a modern zoonotic virus that is potentially fatal and spreads by bodily fluids. In this research, immunoinformatic tools are used to build a vaccine. Methodology: The epitopes of cytotoxic T-lymphocytes, helper T-lymphocytes and linear B-lymphocytes were predicted from the most antigenic protein. The designed vaccine's physiochemical properties and 3D structure have been forecasted. Low free energy and strong binding affinity estimated in molecular docking against toll-like receptor 4 (TLR4) and dynamic simulation. Furthermore, in silico cloning in the Escherichia coli K12 host system was performed for high level of expression. Conclusion: Finally, immune simulation was used to determine immune responses to the vaccine that was formulated confirming the developed vaccine as a good candidate against Lujo virus.
Collapse
Affiliation(s)
- Enayetul Islam
- Department of Genetic Engineering & Biotechnology, University of Chittagong, Chittagong, Bangladesh
| |
Collapse
|
9
|
Design of Vaccine Targeting Zika Virus Polyprotein by Immunoinformatics Technique. Int J Pept Res Ther 2022. [DOI: 10.1007/s10989-022-10409-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
|
10
|
Yin S, Tao Y, Jiang Y, Meng L, Zhao L, Xue X, Li Q, Wu L. A Combined Proteomic and Metabolomic Strategy for Allergens Characterization in Natural and Fermented Brassica napus Bee Pollen. Front Nutr 2022; 9:822033. [PMID: 35155540 PMCID: PMC8833084 DOI: 10.3389/fnut.2022.822033] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2021] [Accepted: 01/03/2022] [Indexed: 01/14/2023] Open
Abstract
Bee pollen is consumed for its nutritional and pharmacological benefits, but it also contains hazardous allergens which have not been identified. Here, we identified two potential allergens, glutaredoxin and oleosin-B2, in Brassica napus bee pollen using mass spectrometry-based proteomics analyses, and used bioinformatics to predict their antigenic epitopes. Comparison of fermented (by Saccharomyces cerevisiae) and unfermented bee pollen samples indicated that glutaredoxin and oleosin-B2 contents were significantly decreased following fermentation, while the contents of their major constituent oligopeptides and amino acids were significantly increased based on metabolomics analyses. Immunoblot analysis indicated that the IgE-binding affinity with extracted bee pollen proteins was also significantly decreased after fermentation, suggesting a reduction in the allergenicity of fermented bee pollen. Furthermore, fermentation apparently promoted the biosynthesis of L-valine, L-isoleucine, L-tryptophan, and L-phenylalanine, as well as their precursors or intermediates. Thus, fermentation could potentially alleviate allergenicity, while also positively affecting nutritional properties of B. napus bee pollen. Our findings might provide a scientific foundation for improving the safety of bee pollen products to facilitate its wider application.
Collapse
Affiliation(s)
- Shuting Yin
- Institute of Apicultural Research, Chinese Academy of Agricultural Sciences, Beijing, China
- College of Animal Science, Shanxi Agricultural University, Shanxi, China
| | - Yuxiao Tao
- Institute of Apicultural Research, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Yusuo Jiang
- College of Animal Science, Shanxi Agricultural University, Shanxi, China
| | - Lifeng Meng
- Institute of Apicultural Research, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Liuwei Zhao
- Institute of Apicultural Research, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Xiaofeng Xue
- Institute of Apicultural Research, Chinese Academy of Agricultural Sciences, Beijing, China
| | - Qiangqiang Li
- Institute of Apicultural Research, Chinese Academy of Agricultural Sciences, Beijing, China
- *Correspondence: Qiangqiang Li
| | - Liming Wu
- Institute of Apicultural Research, Chinese Academy of Agricultural Sciences, Beijing, China
- Liming Wu
| |
Collapse
|
11
|
Allergic Diseases: A Comprehensive Review on Risk Factors, Immunological Mechanisms, Link with COVID-19, Potential Treatments, and Role of Allergen Bioinformatics. INTERNATIONAL JOURNAL OF ENVIRONMENTAL RESEARCH AND PUBLIC HEALTH 2021; 18:ijerph182212105. [PMID: 34831860 PMCID: PMC8622387 DOI: 10.3390/ijerph182212105] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/10/2021] [Revised: 11/02/2021] [Accepted: 11/16/2021] [Indexed: 12/24/2022]
Abstract
The prevalence of allergic diseases is regarded as one of the key challenges in health worldwide. Although the precise mechanisms underlying this rapid increase in prevalence are unknown, emerging evidence suggests that genetic and environmental factors play a significant role. The immune system, microbiota, viruses, and bacteria have all been linked to the onset of allergy disorders in recent years. Avoiding allergen exposure is the best treatment option; however, steroids, antihistamines, and other symptom-relieving drugs are also used. Allergen bioinformatics encompasses both computational tools/methods and allergen-related data resources for managing, archiving, and analyzing allergological data. This study highlights allergy-promoting mechanisms, algorithms, and concepts in allergen bioinformatics, as well as major areas for future research in the field of allergology.
Collapse
|
12
|
Dixit NK. Design of Monovalent and Chimeric Tetravalent Dengue Vaccine Using an Immunoinformatics Approach. Int J Pept Res Ther 2021; 27:2607-2624. [PMID: 34602919 PMCID: PMC8475484 DOI: 10.1007/s10989-021-10277-x] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/24/2021] [Indexed: 12/15/2022]
Abstract
An immunoinformatics technique was used to predict a monovalent amide immunogen candidate capable of producing therapeutic antibodies as well as a potent immunogen candidate capable of acting as a universal vaccination against all dengue fever virus serotypes. The capsid protein is an attractive goal for anti-DENV due to its position in the dengue existence cycle. The widely accessible immunological data, advances in antigenic peptide prediction using reverse vaccinology, and the introduction of molecular docking in immunoinformatics have directed vaccine manufacturing. The C-proteins of DENV-1-4 serotypes were known as antigens to assist with logical design. Binding epitopes for TC cells, TH cells, and B cells is predicted from structural dengue virus capsid proteins. Each T cell epitope of C-protein integrated with a B cell as a templet was used as a vaccine and produce antibodies in contrast to serotype of the dengue virus. A chimeric tetravalent vaccine was created by combining four vaccines, each representing four dengue serotypes, to serve as a standard vaccine candidate for all four Sero groups. The LKRARNRVS, RGFRKEIGR, KNGAIKVLR, and KAINVLRGF from dengue 1, dengue 2, dengue 3, and dengue 4 epitopes may be essential immunotherapeutic representatives for controlling outbreaks.
Collapse
Affiliation(s)
- Neeraj Kumar Dixit
- Department of Biotechnology, Saroj Institute of Technology & Management, Lucknow, Utter Pradesh India
| |
Collapse
|
13
|
Benedé S, Lozano-Ojalvo D, Cristobal S, Costa J, D'Auria E, Velickovic TC, Garrido-Arandia M, Karakaya S, Mafra I, Mazzucchelli G, Picariello G, Romero-Sahagun A, Villa C, Roncada P, Molina E. New applications of advanced instrumental techniques for the characterization of food allergenic proteins. Crit Rev Food Sci Nutr 2021; 62:8686-8702. [PMID: 34060381 DOI: 10.1080/10408398.2021.1931806] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
Abstract
Current approaches based on electrophoretic, chromatographic or immunochemical principles have allowed characterizing multiple allergens, mapping their epitopes, studying their mechanisms of action, developing detection and diagnostic methods and therapeutic strategies for the food and pharmaceutical industry. However, some of the common structural features related to the allergenic potential of food proteins remain unknown, or the pathological mechanism of food allergy is not yet fully understood. In addition, it is also necessary to evaluate new allergens from novel protein sources that may pose a new risk for consumers. Technological development has allowed the expansion of advanced technologies for which their whole potential has not been entirely exploited and could provide novel contributions to still unexplored molecular traits underlying both the structure of food allergens and the mechanisms through which they sensitize or elicit adverse responses in human subjects, as well as improving analytical techniques for their detection. This review presents cutting-edge instrumental techniques recently applied when studying structural and functional aspects of proteins, mechanism of action and interaction between biomolecules. We also exemplify their role in the food allergy research and discuss their new possible applications in several areas of the food allergy field.
Collapse
Affiliation(s)
- Sara Benedé
- Instituto de Investigación en Ciencias de la Alimentación (CIAL, CSIC-UAM), Madrid, Spain
| | - Daniel Lozano-Ojalvo
- Precision Immunology Institute, Icahn School of Medicine at Mount Sinai, Jaffe Food Allergy Institute, New York, NY, USA
| | - Susana Cristobal
- Department of Biomedical and Clinical Sciences, Cell Biology, Faculty of Medicine, Linköping University, Linköping, Sweden.,IKERBASQUE, Basque Foundation for Science, Department of Physiology, Faculty of Medicine and Nursing, University of the Basque Country UPV/EHU, Leioa, Spain
| | - Joana Costa
- REQUIMTE-LAQV, Faculdade de Farmácia, Universidade do Porto, Porto, Portugal
| | - Enza D'Auria
- Clinica Pediatrica, Ospedale dei Bambini Vittore Buzzi, Università degli Studi, Milano, Italy
| | - Tanja Cirkovic Velickovic
- Faculty of Chemistry, University of Belgrade, Belgrade, Serbia.,Ghent University Global Campus, Incheon, South Korea.,Faculty of Bioscience Engineering, Ghent University, Ghent, Belgium.,Serbian Academy of Sciences and Arts, Belgrade, Serbia
| | - María Garrido-Arandia
- Centro de Biotecnología y Genómica de Plantas (UPM-INIA), Universidad Politécnica de Madrid, Pozuelo de Alarcón, Madrid, Spain
| | - Sibel Karakaya
- Department of Food Engineering, Ege University, Izmir, Turkey
| | - Isabel Mafra
- REQUIMTE-LAQV, Faculdade de Farmácia, Universidade do Porto, Porto, Portugal
| | - Gabriel Mazzucchelli
- Mass Spectrometry Laboratory, MolSys Research Unit, University of Liege, Liege, Belgium
| | - Gianluca Picariello
- Institute of Food Sciences, National Research Council (CNR), Avellino, Italy
| | - Alejandro Romero-Sahagun
- Centro de Biotecnología y Genómica de Plantas (UPM-INIA), Universidad Politécnica de Madrid, Pozuelo de Alarcón, Madrid, Spain
| | - Caterina Villa
- REQUIMTE-LAQV, Faculdade de Farmácia, Universidade do Porto, Porto, Portugal
| | - Paola Roncada
- Department of Health Sciences, University Magna Graecia, Catanzaro, Italy
| | - Elena Molina
- Instituto de Investigación en Ciencias de la Alimentación (CIAL, CSIC-UAM), Madrid, Spain
| |
Collapse
|
14
|
Khan YD, Alzahrani E, Alghamdi W, Ullah MZ. Sequence-based Identification of Allergen Proteins Developed by Integration of PseAAC and Statistical Moments via 5-Step Rule. Curr Bioinform 2021. [DOI: 10.2174/1574893615999200424085947] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/24/2023]
Abstract
Background:
Allergens are antigens that can stimulate an atopic type I human
hypersensitivity reaction by an immunoglobulin E (IgE) reaction. Some proteins are naturally
allergenic than others. The challenge for toxicologists is to identify properties that allow proteins
to cause allergic sensitization and allergic diseases. The identification of allergen proteins is a very
critical and pivotal task. The experimental identification of protein functions is a hectic, laborious
and costly task; therefore, computer scientists have proposed various methods in the field of
computational biology and bioinformatics using various data science approaches. Objectives:
Herein, we report a novel predictor for the identification of allergen proteins.
Methods:
For feature extraction, statistical moments and various position-based features have been
incorporated into Chou’s pseudo amino acid composition (PseAAC), and are used for training of a
neural network.
Results:
The predictor is validated through 10-fold cross-validation and Jackknife testing, which
gave 99.43% and 99.87% accurate results.
Conclusions:
Thus, the proposed predictor can help in predicting the Allergen proteins in an
efficient and accurate way and can provide baseline data for the discovery of new drugs and
biomarkers.
Collapse
Affiliation(s)
- Yaser Daanial Khan
- Department of Computer Science, School of Systems and Technology, University of Management and Technology, C II Johar Town, Lahore 54770, Pakistan
| | - Ebraheem Alzahrani
- Department of Mathematics, Faculty of Science, King Abdulaziz University, P.O. Box 80203, Jeddah 21589, Saudi Arabia
| | - Wajdi Alghamdi
- Department of Information Technology, Faculty of Computing and Information Technology, King Abdulaziz University, P.O. Box 80221, Jeddah, Saudi Arabia
| | - Malik Zaka Ullah
- Department of Mathematics, Faculty of Science, King Abdulaziz University, P.O. Box 80203, Jeddah 21589, Saudi Arabia
| |
Collapse
|
15
|
Kardani K, Bolhassani A, Namvar A. An overview of in silico vaccine design against different pathogens and cancer. Expert Rev Vaccines 2020; 19:699-726. [PMID: 32648830 DOI: 10.1080/14760584.2020.1794832] [Citation(s) in RCA: 33] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Abstract
INTRODUCTION Due to overcome the hardness of the vaccine design, computational vaccinology is emerging widely. Prediction of T cell and B cell epitopes, antigen processing analysis, antigenicity analysis, population coverage, conservancy analysis, allergenicity assessment, toxicity prediction, and protein-peptide docking are important steps in the process of designing and developing potent vaccines against various viruses and cancers. In order to perform all of the analyses, several bioinformatics tools and online web servers have been developed. Scientists must take the decision to apply more suitable and precise servers for each part based on their accuracy. AREAS COVERED In this review, a wide-range list of different bioinformatics tools and online web servers has been provided. Moreover, some studies were proposed to show the importance of various bioinformatics tools for predicting and developing efficient vaccines against different pathogens including viruses, bacteria, parasites, and fungi as well as cancer. EXPERT OPINION Immunoinformatics is the best way to find potential vaccine candidates against different pathogens. Thus, the selection of the most accurate tools is necessary to predict and develop potent preventive and therapeutic vaccines. To further evaluation of the computational and in silico vaccine design, in vitro/in vivo analyses are required to develop vaccine candidates.
Collapse
Affiliation(s)
- Kimia Kardani
- Department of Pharmaceutical Biotechnology, School of Pharmacy, Shahid Beheshti University of Medical Sciences , Tehran, Iran.,Department of Hepatitis and AIDS, Pasteur Institute of Iran , Tehran, Iran
| | - Azam Bolhassani
- Department of Hepatitis and AIDS, Pasteur Institute of Iran , Tehran, Iran
| | - Ali Namvar
- Iranian Comprehensive Hemophilia Care Center , Tehran, Iran
| |
Collapse
|
16
|
Maurer-Stroh S, Krutz NL, Kern PS, Gunalan V, Nguyen MN, Limviphuvadh V, Eisenhaber F, Gerberick GF. AllerCatPro-prediction of protein allergenicity potential from the protein sequence. Bioinformatics 2020; 35:3020-3027. [PMID: 30657872 PMCID: PMC6736023 DOI: 10.1093/bioinformatics/btz029] [Citation(s) in RCA: 98] [Impact Index Per Article: 19.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2018] [Revised: 12/18/2018] [Accepted: 01/14/2019] [Indexed: 12/22/2022] Open
Abstract
Motivation Due to the risk of inducing an immediate Type I (IgE-mediated) allergic response, proteins intended for use in consumer products must be investigated for their allergenic potential before introduction into the marketplace. The FAO/WHO guidelines for computational assessment of allergenic potential of proteins based on short peptide hits and linear sequence window identity thresholds misclassify many proteins as allergens. Results We developed AllerCatPro which predicts the allergenic potential of proteins based on similarity of their 3D protein structure as well as their amino acid sequence compared with a data set of known protein allergens comprising of 4180 unique allergenic protein sequences derived from the union of the major databases Food Allergy Research and Resource Program, Comprehensive Protein Allergen Resource, WHO/International Union of Immunological Societies, UniProtKB and Allergome. We extended the hexamer hit rule by removing peptides with high probability of random occurrence measured by sequence entropy as well as requiring 3 or more hexamer hits consistent with natural linear epitope patterns in known allergens. This is complemented with a Gluten-like repeat pattern detection. We also switched from a linear sequence window similarity to a B-cell epitope-like 3D surface similarity window which became possible through extensive 3D structure modeling covering the majority (74%) of allergens. In case no structure similarity is found, the decision workflow reverts to the old linear sequence window rule. The overall accuracy of AllerCatPro is 84% compared with other current methods which range from 51 to 73%. Both the FAO/WHO rules and AllerCatPro achieve highest sensitivity but AllerCatPro provides a 37-fold increase in specificity. Availability and implementation https://allercatpro.bii.a-star.edu.sg/ Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Sebastian Maurer-Stroh
- Biomolecular Function Discovery Division, Bioinformatics Institute, Agency for Science, Technology and Research, Singapore.,Department of Biological Sciences, National University of Singapore, Singapore
| | - Nora L Krutz
- The Procter & Gamble Services Company, Strombeek-Bever, Belgium
| | - Petra S Kern
- The Procter & Gamble Services Company, Strombeek-Bever, Belgium
| | - Vithiagaran Gunalan
- Biomolecular Function Discovery Division, Bioinformatics Institute, Agency for Science, Technology and Research, Singapore
| | - Minh N Nguyen
- Biomolecular Function Discovery Division, Bioinformatics Institute, Agency for Science, Technology and Research, Singapore
| | - Vachiranee Limviphuvadh
- Biomolecular Function Discovery Division, Bioinformatics Institute, Agency for Science, Technology and Research, Singapore
| | - Frank Eisenhaber
- Biomolecular Function Discovery Division, Bioinformatics Institute, Agency for Science, Technology and Research, Singapore.,Department of Biological Sciences, National University of Singapore, Singapore
| | | |
Collapse
|
17
|
Bappy SS, Sultana S, Adhikari J, Mahmud S, Khan MA, Kibria KMK, Rahman MM, Shibly AZ. Extensive immunoinformatics study for the prediction of novel peptide-based epitope vaccine with docking confirmation against envelope protein of Chikungunya virus: a computational biology approach. J Biomol Struct Dyn 2020; 39:1139-1154. [PMID: 32037968 DOI: 10.1080/07391102.2020.1726815] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
Chikungunya virus (CHIKV) instigating Chikungunya fever is a global infective menace resulting in high fever, weakened joint-muscle pain, and brain inflammation. Inaccessibility and unavailability of effective drugs have led us to an uncertain arena when it comes to providing proper medical treatment to the affected people. In this study, authentic encroachment has been made concerning the peptide-based epitope vaccine designing against CHIKV. A Proteome-wide search was performed to locate a conserved portion among the accessible viral outer membrane proteins which showcase a remarkable immune response using specific immunoinformatics and docking simulation tools. Primarily, the most probable immunogenic envelope glycoproteins E1 and E2 were identified from the UniProt database depending on their antigenicity scores. Subsequently, we selected two distinctive sequences "SEDVYANTQLVLQRP" and "IMLLYPDHPTLLSYR" in both E1 and E2 glycoproteins respectively. These two sequences identified as the most potent T and B cell epitope-based peptides as they interacted with 6 and 7 HLA-I and 5 HLA-II molecules with an extremely low IC50 score that was verified by molecular docking. Moreover, the sequences possess no allergenicity and are certainly located outside the transmembrane region. In addition, the sequences exhibited 88.46% and 100.00% Conservancy, covering high population coverage of 89.49% to 94.74% and 60.51% to 88.87% respectively in endemic countries. The identified peptide SEDVYANTQLVLQRP and IMLLYPDHPTLLSYR can be utilized next for the development of peptide-based epitope vaccine contrary to CHIKV, so further documentations and experimentations like Antigen testing, Antigen production, Clinical trials are needed to prove the validity of it. Communicated by Ramaswamy H. Sarma.
Collapse
Affiliation(s)
- Syed Shahariar Bappy
- Department of Biotechnology and Genetic Engineering, Mawlana Bhashani Science and Technology University, Tangail, Bangladesh
| | - Sorna Sultana
- Department of Biotechnology and Genetic Engineering, Mawlana Bhashani Science and Technology University, Tangail, Bangladesh
| | - Juthi Adhikari
- Department of Biotechnology and Genetic Engineering, Mawlana Bhashani Science and Technology University, Tangail, Bangladesh
| | - Shafi Mahmud
- Department of Genetic Engineering and Biotechnology, University of Rajshahi, Rajshahi, Bangladesh
| | - Md Arif Khan
- Department of Biotechnology and Genetic Engineering, University of Development Alternative, Dhaka, Bangladesh.,Bio-Bio-1 Research Foundation, Sangskriti Bikash Kendra Bhaban, Dhaka, Bangladesh
| | - K M Kaderi Kibria
- Department of Biotechnology and Genetic Engineering, Mawlana Bhashani Science and Technology University, Tangail, Bangladesh
| | - Md Masuder Rahman
- Department of Biotechnology and Genetic Engineering, Mawlana Bhashani Science and Technology University, Tangail, Bangladesh
| | - Abu Zaffar Shibly
- Department of Biotechnology and Genetic Engineering, Mawlana Bhashani Science and Technology University, Tangail, Bangladesh
| |
Collapse
|
18
|
Rahmatabadi SS, Sadeghian I, Ghasemi Y, Sakhteman A, Hemmati S. Identification and characterization of a sterically robust phenylalanine ammonia-lyase among 481 natural isoforms through association of in silico and in vitro studies. Enzyme Microb Technol 2018; 122:36-54. [PMID: 30638507 DOI: 10.1016/j.enzmictec.2018.12.006] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2018] [Revised: 11/14/2018] [Accepted: 12/09/2018] [Indexed: 10/27/2022]
Abstract
The enzyme phenylalanine ammonia lyase (PAL) is of special importance for the treatment of phenylketonuria patients. The aim of this study was to find a stable recombinant PAL with suitable kinetic properties among all natural PAL producing species using in silico and experimental approaches. To find such a stable PAL among 481 natural isoforms, 48,000 of 3-D models were predicted using the Modeller 9.10 program and evaluated by Ramachandran plot. Correlation analysis between Ramachandran plot and the energy of different thermodynamic components indicated that this plot could be an appropriate tool to predict protein stability. Hence, PAL6 from Lotus japonicus (LjPAL6) was selected as a stable isoform. Molecular dynamic (MD) simulation for 50 ns and docking has been conducted for LjPAL6-phenylalanine complex. The best PAL-phenylalanine frame was selected by re-docking with l-phenylalanine (L-Phe) and root-mean-square deviation (RMSD) value. MD simulation showed that the complex has a good stability, depicted by the low RMSD value, binding free energy and hydrogen bindings. Docking results showed that LjPAL6 has a high affinity toward l-Phe according to the low level of binding free energy. By overexpressing Ljpal6 in E. coli BL21, a total of 33.5 mg/l of protein was obtained, which has been increased to 83.7 mg/l via the optimization of LjPAL6 production using response surface methodology. The optimal pH and temperature were 8.5 and 50 °C, respectively. LjPAL6 showed a specific activity of 42 nkat/mg protein, with Km, Kcat and Kcat/Km values of 0.483 mM, 7 S-1 and 14.5 S-1 mM-1 for l-phe, respectively. In conclusion, finding models with the most reasonable stereo-chemical quality and lowest numbers of steric clashes would result in easier folding. Hence, in silico analyses of bulk data from natural origin will lead one to find an optimal model for in vitro studies and drug design.
Collapse
Affiliation(s)
- Seyyed Soheil Rahmatabadi
- Department of Pharmaceutical Biotechnology, School of Pharmacy, Shiraz University of Medical Sciences, Shiraz, Iran
| | - Issa Sadeghian
- Department of Pharmaceutical Biotechnology, School of Pharmacy, Shiraz University of Medical Sciences, Shiraz, Iran
| | - Younes Ghasemi
- Department of Pharmaceutical Biotechnology, School of Pharmacy, Shiraz University of Medical Sciences, Shiraz, Iran; Pharmaceutical Sciences Research Center, Shiraz University of Medical Sciences, Shiraz, Iran
| | - Amirhossein Sakhteman
- Department of Medicinal Chemistry, School of Pharmacy, Shiraz University of Medical Sciences, Shiraz, Iran.
| | - Shiva Hemmati
- Department of Pharmaceutical Biotechnology, School of Pharmacy, Shiraz University of Medical Sciences, Shiraz, Iran; Pharmaceutical Sciences Research Center, Shiraz University of Medical Sciences, Shiraz, Iran.
| |
Collapse
|
19
|
Abstract
An extensive safety assessment process exists for genetically-engineered (GE) crops. The assessment includes an evaluation of the introduced protein as well as the crop containing the protein with the goal of demonstrating the GE crop is "as-safe-as" non-GE crops in the food supply. One of the evaluations for GE crops is to assess the expressed protein for allergenic potential. Currently, no single factor is recognized as a predictor for protein allergenicity. Therefore, a weight-of-the-evidence approach, which accounts for a variety of factors and approaches for an overall assessment of allergenic potential, is conducted. This assessment includes an evaluation of the history of exposure and safety of the gene(s) source; protein structure (e.g. amino acid sequence identity to human allergens); stability of the protein to pepsin digestion in vitro; heat stability of the protein; glycosylation status; and when appropriate, specific IgE binding studies with sera from relevant clinically allergic subjects. Since GE crops were first commercialized over 20 years ago, there is no proof that the introduced novel protein(s) in any commercialized GE food crop has caused food allergy.
Collapse
|
20
|
Song P, Podevin N, Mirsky H, Anderson J, Delaney B, Mathesius C, Rowe L, Herman RA. Q-X1-P-X2 motif search for potential celiac disease risk has poor selectivity. Regul Toxicol Pharmacol 2018; 99:233-237. [PMID: 30266240 DOI: 10.1016/j.yrtph.2018.09.021] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2018] [Revised: 09/19/2018] [Accepted: 09/21/2018] [Indexed: 11/24/2022]
Abstract
The European Food Safety Authority (EFSA) recently published guidelines for assessment of potential celiac disease risk for newly expressed proteins in genetically modified (GM) crops. This novel step-wise approach prescribes, in part, how to conduct sequence identity searches between a newly expressed protein and known celiac disease peptides including a Q/E-X1-P-X2 amino acid motif. To evaluate the specificity of the recommended sequence identity searches in the context of risk assessment, protein sequences from celiac disease causing crops, as well as from crops not associated with celiac disease, were compared with known HLA-DQ restricted epitopes and searched for the presence of motifs followed by peptide analysis. Searches for the presence of the Q/E-X1-P-X2-motif were found to generate a high proportion of false-positive hits irrelevant to celiac disease risk. Identification of a 9mer exact match between a newly expressed protein and the known celiac disease peptides (recommended by the guideline) along with a supplementary sequence comparisons (suggested by FARRP/AllergenOnline) is considered better suited to more specifically capture the potential risk of a newly expressed protein for celiac disease.
Collapse
Affiliation(s)
- Ping Song
- Corteva Agriscience™, Agriculture Division of DowDuPont™, Indianapolis, IN, USA.
| | - Nancy Podevin
- Corteva Agriscience™, Agriculture Division of DowDuPont™, Johnston, IA, USA
| | - Henry Mirsky
- Corteva Agriscience™, Agriculture Division of DowDuPont™, Johnston, IA, USA
| | - Jennifer Anderson
- Corteva Agriscience™, Agriculture Division of DowDuPont™, Johnston, IA, USA
| | - Bryan Delaney
- Corteva Agriscience™, Agriculture Division of DowDuPont™, Johnston, IA, USA
| | - Carey Mathesius
- Corteva Agriscience™, Agriculture Division of DowDuPont™, Johnston, IA, USA
| | - Laura Rowe
- Corteva Agriscience™, Agriculture Division of DowDuPont™, Johnston, IA, USA
| | - Rod A Herman
- Corteva Agriscience™, Agriculture Division of DowDuPont™, Indianapolis, IN, USA
| |
Collapse
|
21
|
Distinguishing allergens from non-allergenic homologues using Physical-Chemical Property (PCP) motifs. Mol Immunol 2018; 99:1-8. [PMID: 29627609 DOI: 10.1016/j.molimm.2018.03.022] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2018] [Revised: 03/22/2018] [Accepted: 03/27/2018] [Indexed: 02/07/2023]
Abstract
Quantitative guidelines to distinguish allergenic proteins from related, but non-allergenic ones are urgently needed for regulatory agencies, biotech companies and physicians. In a previous study, we found that allergenic proteins populate a relatively small number of protein families, as characterized by the Pfam database. However, these families also contain non-allergenic proteins, meaning that allergenic determinants must lie within more discrete regions of the sequence. Thus, new methods are needed to discriminate allergenic proteins within those families. Physical-Chemical Properties (PCP)-motifs specific for allergens within a Pfam class were determined for 17 highly populated protein domains. A novel scoring method based on PCP-motifs that characterize known allergenic proteins within these families was developed, and validated for those domains. The motif scores distinguished sequences of allergens from a large selection of 80,000 randomly selected non-allergenic sequences. The motif scores for the birch pollen allergen (Bet v 1) family, which also contains related fruit and nut allergens, correlated better than global sequence similarities with clinically observed cross-reactivities among those allergens. Further, we demonstrated that the average scores of allergen specific motifs for allergenic profilins are significantly different from the scores of non-allergenic profilins. Several of the selective motifs coincide with experimentally determined IgE epitopes of allergenic profilins. The motifs also discriminated allergenic pectate lyases, including Jun a 1 from mountain cedar pollen, from similar proteins in the human microbiome, which can be assumed to be non-allergens. The latter lacked key motifs characteristic of the known allergens, some of which correlate with known IgE binding sites.
Collapse
|
22
|
Bragin AO, Sokolov VS, Demenkov PS, Ivanisenko TV, Bragina EY, Matushkin YG, Ivanisenko VA. Prediction of Bacterial and Archaeal Allergenicity with AllPred Program. Mol Biol 2018. [DOI: 10.1134/s0026893317050041] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
23
|
Li J, Wang J, Li J. Improving Allergen Prediction in Main Crops Using a Weighted Integrative Method. Interdiscip Sci 2017; 9:545-549. [PMID: 27734271 DOI: 10.1007/s12539-016-0192-5] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2016] [Revised: 09/29/2016] [Accepted: 10/03/2016] [Indexed: 10/20/2022]
Abstract
As a public health problem, food allergy is frequently caused by food allergy proteins, which trigger a type-I hypersensitivity reaction in the immune system of atopic individuals. The food allergens in our daily lives are mainly from crops including rice, wheat, soybean and maize. However, allergens in these main crops are far from fully uncovered. Although some bioinformatics tools or methods predicting the potential allergenicity of proteins have been proposed, each method has their limitation. In this paper, we built a novel algorithm PREALW, which integrated PREAL, FAO/WHO criteria and motif-based method by a weighted average score, to benefit the advantages of different methods. Our results illustrated PREALW has better performance significantly in the crops' allergen prediction. This integrative allergen prediction algorithm could be useful for critical food safety matters. The PREALW could be accessed at http://lilab.life.sjtu.edu.cn:8080/prealw .
Collapse
Affiliation(s)
- Jing Li
- Department of Bioinformatics and Biostatistics, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, 800 Dongchuan Road, Shanghai, 200240, People's Republic of China
| | - Jing Wang
- Center for Quantitative Sciences, Vanderbilt University School of Medicine, Nashville, TN, 37203, USA
| | - Jing Li
- Department of Bioinformatics and Biostatistics, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, 800 Dongchuan Road, Shanghai, 200240, People's Republic of China.
| |
Collapse
|
24
|
Radauer C. Navigating through the Jungle of Allergens: Features and Applications of Allergen Databases. Int Arch Allergy Immunol 2017; 173:1-11. [PMID: 28456806 DOI: 10.1159/000471806] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022] Open
Abstract
The increasing number of available data on allergenic proteins demanded the establishment of structured, freely accessible allergen databases. In this review article, features and applications of 6 of the most widely used allergen databases are discussed. The WHO/IUIS Allergen Nomenclature Database is the official resource of allergen designations. Allergome is the most comprehensive collection of data on allergens and allergen sources. AllergenOnline is aimed at providing a peer-reviewed database of allergen sequences for prediction of allergenicity of proteins, such as those planned to be inserted into genetically modified crops. The Structural Database of Allergenic Proteins (SDAP) provides a database of allergen sequences, structures, and epitopes linked to bioinformatics tools for sequence analysis and comparison. The Immune Epitope Database (IEDB) is the largest repository of T-cell, B-cell, and major histocompatibility complex protein epitopes including epitopes of allergens. AllFam classifies allergens into families of evolutionarily related proteins using definitions from the Pfam protein family database. These databases contain mostly overlapping data, but also show differences in terms of their targeted users, the criteria for including allergens, data shown for each allergen, and the availability of bioinformatics tools.
Collapse
Affiliation(s)
- Christian Radauer
- Department of Pathophysiology and Allergy Research, Center for Pathophysiology, Infectiology and Immunology, Medical University of Vienna, Vienna, Austria
| |
Collapse
|
25
|
Multilevel ensemble model for prediction of IgA and IgG antibodies. Immunol Lett 2017; 184:51-60. [DOI: 10.1016/j.imlet.2017.01.017] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2016] [Revised: 01/30/2017] [Accepted: 01/30/2017] [Indexed: 01/04/2023]
|
26
|
Abstract
The rapidly increasing number of characterized allergens has created huge demands for advanced information storage, retrieval, and analysis. Bioinformatics and machine learning approaches provide useful tools for the study of allergens and epitopes prediction, which greatly complement traditional laboratory techniques. The specific applications mainly include identification of B- and T-cell epitopes, and assessment of allergenicity and cross-reactivity. In order to facilitate the work of clinical and basic researchers who are not familiar with bioinformatics, we review in this chapter the most important databases, bioinformatic tools, and methods with relevance to the study of allergens.
Collapse
|
27
|
Bogdanov IV, Shenkarev ZO, Finkina EI, Melnikova DN, Rumynskiy EI, Arseniev AS, Ovchinnikova TV. A novel lipid transfer protein from the pea Pisum sativum: isolation, recombinant expression, solution structure, antifungal activity, lipid binding, and allergenic properties. BMC PLANT BIOLOGY 2016; 16:107. [PMID: 27137920 PMCID: PMC4852415 DOI: 10.1186/s12870-016-0792-6] [Citation(s) in RCA: 59] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/12/2015] [Accepted: 04/21/2016] [Indexed: 05/22/2023]
Abstract
BACKGROUND Plant lipid transfer proteins (LTPs) assemble a family of small (7-9 kDa) ubiquitous cationic proteins with an ability to bind and transport lipids as well as participate in various physiological processes including defense against phytopathogens. They also form one of the most clinically relevant classes of plant allergens. Nothing is known to date about correlation between lipid-binding and IgE-binding properties of LTPs. The garden pea Pisum sativum is widely consumed crop and important allergenic specie of the legume family. This work is aimed at isolation of a novel LTP from pea seeds and characterization of its structural, functional, and allergenic properties. RESULTS Three novel lipid transfer proteins, designated as Ps-LTP1-3, were found in the garden pea Pisum sativum, their cDNA sequences were determined, and mRNA expression levels of all the three proteins were measured at different pea organs. Ps-LTP1 was isolated for the first time from the pea seeds, and its complete amino acid sequence was determined. The protein exhibits antifungal activity and is a membrane-active compound that causes a leakage from artificial liposomes. The protein binds various lipids including bioactive jasmonic acid. Spatial structure of the recombinant uniformly (13)C,(15)N-labelled Ps-LTP1 was solved by heteronuclear NMR spectroscopy. In solution the unliganded protein represents the mixture of two conformers (relative populations ~ 85:15) which are interconnected by exchange process with characteristic time ~ 100 ms. Hydrophobic residues of major conformer form a relatively large internal tunnel-like lipid-binding cavity (van der Waals volume comes up to ~1000 Å(3)). The minor conformer probably corresponds to the protein with the partially collapsed internal cavity. CONCLUSIONS For the first time conformational heterogeneity in solution was shown for an unliganded plant lipid transfer protein. Heat denaturation profile and simulated gastrointestinal digestion assay showed that Ps-LTP1 displayed a high thermal and digestive proteolytic resistance proper for food allergens. The reported structural and immunological findings seem to describe Ps-LTP1 as potential cross-reactive allergen in LTP-sensitized patients, mostly Pru p 3(+) ones. Similarly to allergenic LTPs the potential IgE-binding epitope of Ps-LTP1 is located near the proposed entrance into internal cavity and could be involved in lipid-binding.
Collapse
Affiliation(s)
- Ivan V Bogdanov
- M.M.Shemyakin and Yu.A.Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences, Miklukho-Maklaya str., 16/10, 117997, Moscow, Russia
| | - Zakhar O Shenkarev
- M.M.Shemyakin and Yu.A.Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences, Miklukho-Maklaya str., 16/10, 117997, Moscow, Russia
| | - Ekaterina I Finkina
- M.M.Shemyakin and Yu.A.Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences, Miklukho-Maklaya str., 16/10, 117997, Moscow, Russia
| | - Daria N Melnikova
- M.M.Shemyakin and Yu.A.Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences, Miklukho-Maklaya str., 16/10, 117997, Moscow, Russia
| | - Eugene I Rumynskiy
- M.M.Shemyakin and Yu.A.Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences, Miklukho-Maklaya str., 16/10, 117997, Moscow, Russia
| | - Alexander S Arseniev
- M.M.Shemyakin and Yu.A.Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences, Miklukho-Maklaya str., 16/10, 117997, Moscow, Russia
| | - Tatiana V Ovchinnikova
- M.M.Shemyakin and Yu.A.Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences, Miklukho-Maklaya str., 16/10, 117997, Moscow, Russia.
| |
Collapse
|
28
|
Garino C, Coïsson JD, Arlorio M. In silico allergenicity prediction of several lipid transfer proteins. Comput Biol Chem 2015; 60:32-42. [PMID: 26643760 DOI: 10.1016/j.compbiolchem.2015.11.006] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2015] [Revised: 11/04/2015] [Accepted: 11/10/2015] [Indexed: 10/22/2022]
Abstract
Non-specific lipid transfer proteins (nsLTPs) are common allergens and they are particularly widespread within the plant kingdom. They have a highly conserved three-dimensional structure that generate a strong cross-reactivity among the members of this family. In the last years several web tools for the prediction of allergenicity of new molecules based on their homology with known allergens have been released, and guidelines to assess potential allergenicity of proteins through bioinformatics have been established. Even if such tools are only partially reliable yet, they can provide important indications when other kinds of molecular characterization are lacking. The potential allergenicity of 28 amino acid sequences of LTPs homologs, either retrieved from the UniProt database or in silico deduced from the corresponding EST coding sequence, was predicted using 7 publicly available web tools. Moreover, their similarity degree to their closest known LTP allergens was calculated, in order to evaluate their potential cross-reactivity. Finally, all sequences were studied for their identity degree with the peach allergen Pru p 3, considering the regions involved in the formation of its known conformational IgE-binding epitope. Most of the analyzed sequences displayed a high probability to be allergenic according to all the software employed. The analyzed LTPs from bell pepper, cassava, mango, mungbean and soybean showed high homology (>70%) with some known allergenic LTPs, suggesting a potential risk of cross-reactivity for sensitized individuals. Other LTPs, like for example those from canola, cassava, mango, mungbean, papaya or persimmon, displayed a high degree of identity with Pru p 3 within the consensus sequence responsible for the formation, at three-dimensional level, of its major conformational epitope. Since recent studies highlighted how in patients mono-sensitized to peach LTP the levels of IgE seem directly proportional to the chance of developing cross-reactivity to LTPs from non-Rosaceae foods, and these chances increase the more similar the protein is to Pru p 3, these proteins should be taken into special account for future studies aimed at evaluating the risk of cross-allergenicity in highly sensitized individuals.
Collapse
Affiliation(s)
- Cristiano Garino
- Dipartimento di Scienze del Farmaco & Drug and Food Biotechnology (DFB) Center, Università del Piemonte Orientale "A. Avogadro", largo Donegani 2, 28100 Novara, Italy.
| | - Jean Daniel Coïsson
- Dipartimento di Scienze del Farmaco & Drug and Food Biotechnology (DFB) Center, Università del Piemonte Orientale "A. Avogadro", largo Donegani 2, 28100 Novara, Italy.
| | - Marco Arlorio
- Dipartimento di Scienze del Farmaco & Drug and Food Biotechnology (DFB) Center, Università del Piemonte Orientale "A. Avogadro", largo Donegani 2, 28100 Novara, Italy.
| |
Collapse
|
29
|
Song P, Herman R, Kumpatla S. 1:1 FASTA update: Using the power of E-values in FASTA to detect potential allergen cross-reactivity. Toxicol Rep 2015; 2:1145-1148. [PMID: 28962455 PMCID: PMC5598423 DOI: 10.1016/j.toxrep.2015.08.005] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2015] [Revised: 08/11/2015] [Accepted: 08/11/2015] [Indexed: 11/24/2022] Open
Abstract
In the context of regulatory assessment of transgenic proteins for potential allergenicity, a previous investigation demonstrated that a 1:1 FASTA comparison using an E-value of 1.0E-09 as a criterion is superior to the conventional FASTA search (using the whole sequence as a query) for >35% identity over 80 amino acids, but with improved specificity. A further study, using groups of known cross-reactive peanut allergens, indicates the sensitivity of this approach is superior to the conventional FASTA search and equivalent to 80-mer sliding window FASTA search recommended by WHO/FAO. Specifically, the 1:1 FASTA approach eliminated the technical issues resulting from lack of identification of short query sequences with high identity to known allergens, or high identity over short amino acid stretches, and different E-value settings when searching for >35% identity over 80aa. Based on the performance of this simple application of existing bioinformatics tools, and its ease of implementation and interpretation in the context of a regulatory assessment, we advocate that adoption of this 1:1 FASTA approach as a supplement to the FAO/WHO/ CODEX criterion (>35% identity over 80aa) formulated 13 years ago. Adoption of this approach eliminates many biologically irrelevant homology hits generated by the FAO/WHO/CODEX criterion and improves the safety assessment of GM crops.
Collapse
Affiliation(s)
- Ping Song
- Dow AgroSciences LLC, 9330 Zionsville Road, Indianapolis, IN 46268, USA
| | - Rod Herman
- Dow AgroSciences LLC, 9330 Zionsville Road, Indianapolis, IN 46268, USA
| | - Siva Kumpatla
- Dow AgroSciences LLC, 9330 Zionsville Road, Indianapolis, IN 46268, USA
| |
Collapse
|
30
|
Hayes M, Rougé P, Barre A, Herouet-Guicheney C, Roggen EL. In silico tools for exploring potential human allergy to proteins. ACTA ACUST UNITED AC 2015. [DOI: 10.1016/j.ddmod.2016.06.001] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
|
31
|
Purification, characterization and safety assessment of the introduced cold shock protein B in DroughtGard maize. Regul Toxicol Pharmacol 2014; 71:164-73. [PMID: 25545317 DOI: 10.1016/j.yrtph.2014.12.014] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2014] [Revised: 12/17/2014] [Accepted: 12/18/2014] [Indexed: 11/21/2022]
Abstract
DroughtGard maize was developed through constitutive expression of cold shock protein B (CSPB) from Bacillus subtilis to improve performance of maize (Zea mays) under water-limited conditions. B. subtilis commonly occurs in fermented foods and CSPB has a history of safe use. Safety studies were performed to further evaluate safety of CSPB introduced into maize. CSPB was compared to proteins found in current allergen and protein toxin databases and there are no sequence similarities between CSPB and known allergens or toxins. In order to validate the use of Escherichia coli-derived CSPB in other safety studies, physicochemical and functional characterization confirmed that the CSPB produced by DroughtGard possesses comparable molecular weight, immunoreactivity, and functional activity to CSPB produced from E. coli and that neither is glycosylated. CSPB was completely digested with sequential exposure to pepsin and pancreatin for 2 min and 30 s, respectively, suggesting that CSPB will be degraded in the mammalian digestive tract and would not be expected to be allergenic. Mice orally dosed with CSPB at 2160 mg/kg, followed by analysis of body weight gains, food consumption and clinical observations, showed no discernible adverse effects. This comprehensive safety assessment indicated that the CSPB protein from DroughtGard is safe for food and feed consumption.
Collapse
|
32
|
Jeong KY, Son M, Lee JH, Hong CS, Park JW. Allergenic characterization of a novel allergen, homologous to chymotrypsin, from german cockroach. ALLERGY, ASTHMA & IMMUNOLOGY RESEARCH 2014; 7:283-9. [PMID: 25749759 PMCID: PMC4397369 DOI: 10.4168/aair.2015.7.3.283] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/11/2014] [Accepted: 09/30/2014] [Indexed: 11/20/2022]
Abstract
Purpose Cockroach feces are known to be rich in IgE-reactive components. Various protease allergens were identified by proteomic analysis of German cockroach fecal extract in a previous study. In this study, we characterized a novel allergen, a chymotrypsin-like serine protease. Methods A cDNA sequence homologous to chymotrypsin was obtained by analysis of German cockroach expressed sequence tag (EST) clones. The recombinant chymotrypsins from the German cockroach and house dust mite (Der f 6) were expressed in Escherichia coli using the pEXP5NT/TOPO vector system, and their allergenicity was investigated by ELISA. Results The deduced amino acid sequence of German cockroach chymotrypsin showed 32.7 to 43.1% identity with mite group 3 (trypsin) and group 6 (chymotrypsin) allergens. Sera from 8 of 28 German cockroach allergy subjects (28.6%) showed IgE binding to the recombinant protein. IgE binding to the recombinant cockroach chymotrypsin was inhibited by house dust mite chymotrypsin Der f 6, while it minimally inhibited the German cockroach whole body extract. Conclusions A novel allergen homologous to chymotrypsin was identified from the German cockroach and was cross-reactive with Der f 6.
Collapse
Affiliation(s)
- Kyoung Yong Jeong
- Department of Internal Medicine, Institute of Allergy, Yonsei University College of Medicine, Seoul, Korea
| | - Mina Son
- Department of Internal Medicine, Institute of Allergy, Yonsei University College of Medicine, Seoul, Korea
| | - Jae Hyun Lee
- Department of Internal Medicine, Institute of Allergy, Yonsei University College of Medicine, Seoul, Korea
| | - Chein Soo Hong
- Department of Internal Medicine, Institute of Allergy, Yonsei University College of Medicine, Seoul, Korea
| | - Jung Won Park
- Department of Internal Medicine, Institute of Allergy, Yonsei University College of Medicine, Seoul, Korea.
| |
Collapse
|
33
|
Podevin N, du Jardin P. Possible consequences of the overlap between the CaMV 35S promoter regions in plant transformation vectors used and the viral gene VI in transgenic plants. GM CROPS & FOOD 2014; 3:296-300. [DOI: 10.4161/gmcr.21406] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|
34
|
Saravanan V, Lakshmi PTV. Fuzzy Logic for Personalized Healthcare and Diagnostics: FuzzyApp—A Fuzzy Logic Based Allergen-Protein Predictor. OMICS-A JOURNAL OF INTEGRATIVE BIOLOGY 2014; 18:570-81. [DOI: 10.1089/omi.2014.0021] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Affiliation(s)
- Vijayakumar Saravanan
- Centre for Bioinformatics, School of Life Sciences, Pondicherry University, Pondicherry, India
| | - PTV Lakshmi
- Centre for Bioinformatics, School of Life Sciences, Pondicherry University, Pondicherry, India
| |
Collapse
|
35
|
Dimitrov I, Bangov I, Flower DR, Doytchinova I. AllerTOP v.2--a server for in silico prediction of allergens. J Mol Model 2014; 20:2278. [PMID: 24878803 DOI: 10.1007/s00894-014-2278-5] [Citation(s) in RCA: 693] [Impact Index Per Article: 63.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2013] [Accepted: 04/25/2014] [Indexed: 11/29/2022]
Abstract
Allergy is an overreaction by the immune system to a previously encountered, ordinarily harmless substance--typically proteins--resulting in skin rash, swelling of mucous membranes, sneezing or wheezing, or other abnormal conditions. The use of modified proteins is increasingly widespread: their presence in food, commercial products, such as washing powder, and medical therapeutics and diagnostics, makes predicting and identifying potential allergens a crucial societal issue. The prediction of allergens has been explored widely using bioinformatics, with many tools being developed in the last decade; many of these are freely available online. Here, we report a set of novel models for allergen prediction utilizing amino acid E-descriptors, auto- and cross-covariance transformation, and several machine learning methods for classification, including logistic regression (LR), decision tree (DT), naïve Bayes (NB), random forest (RF), multilayer perceptron (MLP) and k nearest neighbours (kNN). The best performing method was kNN with 85.3% accuracy at 5-fold cross-validation. The resulting model has been implemented in a revised version of the AllerTOP server (http://www.ddg-pharmfac.net/AllerTOP).
Collapse
Affiliation(s)
- Ivan Dimitrov
- Faculty of Pharmacy, Medical University of Sofia, 2 Dunav st., 1000, Sofia, Bulgaria
| | | | | | | |
Collapse
|
36
|
Reduction of the number of major representative allergens: from clinical testing to 3-dimensional structures. Mediators Inflamm 2014; 2014:291618. [PMID: 24778467 PMCID: PMC3980986 DOI: 10.1155/2014/291618] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2013] [Accepted: 02/07/2014] [Indexed: 12/02/2022] Open
Abstract
Vast amounts of allergen sequence data have been accumulated, thus complicating the identification of specific allergenic proteins when performing diagnostic allergy tests and immunotherapy. This study aims to rank the importance/potency of the allergens so as to logically reduce the number of allergens and/or allergenic sources. Meta-analysis of 62 allergenic sources used for intradermal testing on 3,335 allergic patients demonstrated that in southern China, mite, sesame, spiny amaranth, Pseudomonas aeruginosa, and house dust account for 88.0% to 100% of the observed positive reactions to the 62 types of allergenic sources tested. The Kolmogorov-Smironov Test results of the website-obtained allergen data and allergen family featured peptides suggested that allergen research in laboratories worldwide has been conducted in parallel on many of the same species. The major allergens were reduced to 21 representative allergens, which were further divided into seven structural classes, each of which contains similar structural components. This study therefore has condensed numerous allergenic sources and major allergens into fewer major representative ones, thus allowing for the use of a smaller number of allergens when conducting comprehensive allergen testing and immunotherapy treatments.
Collapse
|
37
|
Crameri R, Garbani M, Rhyner C, Huitema C. Fungi: the neglected allergenic sources. Allergy 2014; 69:176-85. [PMID: 24286281 DOI: 10.1111/all.12325] [Citation(s) in RCA: 121] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/14/2013] [Indexed: 12/15/2022]
Abstract
Allergic diseases are considered the epidemics of the twentieth century estimated to affect more than 30% of the population in industrialized countries with a still increasing incidence. During the past two decades, the application of molecular biology allowed cloning, production and characterization of hundreds of recombinant allergens. In turn, knowledge about molecular, chemical and biologically relevant allergens contributed to increase our understanding of the mechanisms underlying IgE-mediated type I hypersensitivity reactions. It has been largely demonstrated that fungi are potent sources of allergenic molecules covering a vast variety of molecular structures including enzymes, toxins, cell wall components and phylogenetically highly conserved cross-reactive proteins. Despite the large knowledge accumulated and the compelling evidence for an involvement of fungal allergens in the pathophysiology of allergic diseases, fungi as a prominent source of allergens are still largely neglected in basic research as well as in clinical practice. This review aims to highlight the impact of fungal allergens with focus on asthma and atopic dermatitis.
Collapse
Affiliation(s)
- R. Crameri
- Swiss Institute of Allergy and Asthma Research (SIAF); University of Zürich; Davos Switzerland
| | - M. Garbani
- Swiss Institute of Allergy and Asthma Research (SIAF); University of Zürich; Davos Switzerland
| | - C. Rhyner
- Swiss Institute of Allergy and Asthma Research (SIAF); University of Zürich; Davos Switzerland
| | - C. Huitema
- Swiss Institute of Allergy and Asthma Research (SIAF); University of Zürich; Davos Switzerland
| |
Collapse
|
38
|
Goodman RE, Hefle SL. Gaining perspective on the allergenicity assessment of genetically modified food crops. Expert Rev Clin Immunol 2014; 1:561-78. [DOI: 10.1586/1744666x.1.4.561] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
|
39
|
Brusic V, Petrovsky N. Immunoinformatics and its relevance to understanding human immune disease. Expert Rev Clin Immunol 2014; 1:145-57. [DOI: 10.1586/1744666x.1.1.145] [Citation(s) in RCA: 41] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/26/2023]
|
40
|
Marzban G, Herndl A, Maghuly F, Katinger H, Laimer M. Mapping of fruit allergens by 2D electrophoresis and immunodetection. Expert Rev Proteomics 2014; 5:61-75. [DOI: 10.1586/14789450.5.1.61] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
|
41
|
Wang J, Yang L, Zhao X, Li J, Zhang D. Characterization and phylogenetic analysis of allergenic Tryp_alpha_amyl protein family in plants. JOURNAL OF AGRICULTURAL AND FOOD CHEMISTRY 2014; 62:270-278. [PMID: 24328177 DOI: 10.1021/jf402463w] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/03/2023]
Abstract
Most known allergenic proteins in rice ( Oryza sativa ) seed belong to the Tryp_alpha_amyl family (PF00234), but the sequence characterization and the evolution of the allergenic Tryp_alpha_amyl family members in plants have not been fully investigated. In this study, two specific motifs were found besides the common alpha-amylase inhibitors (AAI) domain from the allergenic Tryp_alpha_amyl family members in rice seeds (trRSAs). To understand the evolution and functional importance of the Tryp_alpha_amy1 family and the specific motifs for the allergenic one, a BLAST search identified 75 homologous proteins of trRSAs (trHAs) from 22 plant species including main crops such as rice, maize ( Zea mays ), wheat ( Triticum aestivum ), and sorghum ( Sorghum bicolor ) from all available sequences in the public databases. Statistical analysis showed that the allergenicity of trHAs is closely associated with these two motifs with high number of cysteine residues (p value = 0.00026), and the trHAs with and without the two motifs were clustered into separate clades, respectively. Furthermore, significant difference was observed on the secondary and tertiary structures of allergenic and nonallergenic trHAs. In addition, expression analysis showed that trHA-encoding genes of purple false brome ( Brachypodium distachyon ), barrel medic ( Medicago truncatula ), rice, and sorghum are dominantly expressed in seeds. This work provides insight into the understanding of the properties of allergens in the Tryp_alpha_amyl family and is helpful for allergy therapy.
Collapse
Affiliation(s)
- Jing Wang
- National Center for Molecular Characterization of Genetically Modified Organisms, State Key Laboratory of Hybrid Rice, School of Life Science and Biotechnology, Shanghai Jiao Tong University , 800 Dongchuan Road, Shanghai 200240, People's Republic of China
| | | | | | | | | |
Collapse
|
42
|
Dang HX, Lawrence CB. Allerdictor: fast allergen prediction using text classification techniques. ACTA ACUST UNITED AC 2014; 30:1120-1128. [PMID: 24403538 DOI: 10.1093/bioinformatics/btu004] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2013] [Accepted: 12/30/2013] [Indexed: 11/14/2022]
Abstract
MOTIVATION Accurately identifying and eliminating allergens from biotechnology-derived products are important for human health. From a biomedical research perspective, it is also important to identify allergens in sequenced genomes. Many allergen prediction tools have been developed during the past years. Although these tools have achieved certain levels of specificity, when applied to large-scale allergen discovery (e.g. at a whole-genome scale), they still yield many false positives and thus low precision (even at low recall) due to the extreme skewness of the data (allergens are rare). Moreover, the most accurate tools are relatively slow because they use protein sequence alignment to build feature vectors for allergen classifiers. Additionally, only web server implementations of the current allergen prediction tools are publicly available and are without the capability of large batch submission. These weaknesses make large-scale allergen discovery ineffective and inefficient in the public domain. RESULTS We developed Allerdictor, a fast and accurate sequence-based allergen prediction tool that models protein sequences as text documents and uses support vector machine in text classification for allergen prediction. Test results on multiple highly skewed datasets demonstrated that Allerdictor predicted allergens with high precision over high recall at fast speed. For example, Allerdictor only took ∼6 min on a single core PC to scan a whole Swiss-Prot database of ∼540 000 sequences and identified <1% of them as allergens. AVAILABILITY AND IMPLEMENTATION Allerdictor is implemented in Python and available as standalone and web server versions at http://allerdictor.vbi.vt.edu CONTACT: lawrence@vbi.vt.edu Supplementary information: Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Ha X Dang
- Virginia Bioinformatics Institute and Department of Biological Sciences, Virginia Tech, Blacksburg, VA 24061, USA
| | - Christopher B Lawrence
- Virginia Bioinformatics Institute and Department of Biological Sciences, Virginia Tech, Blacksburg, VA 24061, USA Virginia Bioinformatics Institute and Department of Biological Sciences, Virginia Tech, Blacksburg, VA 24061, USA
| |
Collapse
|
43
|
Abstract
Currently, the prediction of new allergens is becoming important due to use of genetically modified (GM) foods and biopharmaceuticals. In this chapter, we describe how to use four popular allergenic prediction servers: (1) Structural Database of Allergenic Proteins (SDAP), (2) Allermatch, (3) Evaller 2, and (4) AlgPred. The first two prediction servers are based on traditional approaches, whereas Evaller 2 and AlgPred use sophisticated machine learning techniques.
Collapse
Affiliation(s)
- Gaurab Sircar
- Division of Plant Biology, Bose Institute, 93/1 Acharya Prafulla Chandra Road, Kolkata, 700009, India
| | | | | | | |
Collapse
|
44
|
Abstract
A large volume of data relevant to immunology research has accumulated due to sequencing of genomes of the human and other model organisms. At the same time, huge amounts of clinical and epidemiologic data are being deposited in various scientific literature and clinical records. This accumulation of the information is like a goldmine for researchers looking for mechanisms of immune function and disease pathogenesis. Thus the need to handle this rapidly growing immunological resource has given rise to the field known as immunoinformatics. Immunoinformatics, otherwise known as computational immunology, is the interface between computer science and experimental immunology. It represents the use of computational methods and resources for the understanding of immunological information. It not only helps in dealing with huge amount of data but also plays a great role in defining new hypotheses related to immune responses. This chapter reviews classical immunology, different databases, and prediction tool. Further, it briefly describes applications of immunoinformatics in reverse vaccinology, immune system modeling, and cancer diagnosis and therapy. It also explores the idea of integrating immunoinformatics with systems biology for the development of personalized medicine. All these efforts save time and cost to a great extent.
Collapse
Affiliation(s)
- Namrata Tomar
- Machine Intelligence Unit, Indian Statistical Institute, 203 B.T. Road, Kolkata, 700108, India,
| | | |
Collapse
|
45
|
PREAL: prediction of allergenic protein by maximum Relevance Minimum Redundancy (mRMR) feature selection. BMC SYSTEMS BIOLOGY 2013; 7 Suppl 5:S9. [PMID: 24565053 PMCID: PMC4029432 DOI: 10.1186/1752-0509-7-s5-s9] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/07/2023]
Abstract
BACKGROUND Assessment of potential allergenicity of protein is necessary whenever transgenic proteins are introduced into the food chain. Bioinformatics approaches in allergen prediction have evolved appreciably in recent years to increase sophistication and performance. However, what are the critical features for protein's allergenicity have been not fully investigated yet. RESULTS We presented a more comprehensive model in 128 features space for allergenic proteins prediction by integrating various properties of proteins, such as biochemical and physicochemical properties, sequential features and subcellular locations. The overall accuracy in the cross-validation reached 93.42% to 100% with our new method. Maximum Relevance Minimum Redundancy (mRMR) method and Incremental Feature Selection (IFS) procedure were applied to obtain which features are essential for allergenicity. Results of the performance comparisons showed the superior of our method to the existing methods used widely. More importantly, it was observed that the features of subcellular locations and amino acid composition played major roles in determining the allergenicity of proteins, particularly extracellular/cell surface and vacuole of the subcellular locations for wheat and soybean. To facilitate the allergen prediction, we implemented our computational method in a web application, which can be available at http://gmobl.sjtu.edu.cn/PREAL/index.php. CONCLUSIONS Our new approach could improve the accuracy of allergen prediction. And the findings may provide novel insights for the mechanism of allergies.
Collapse
|
46
|
Chen Z, He Y, Shi B, Yang D. Human serum albumin from recombinant DNA technology: Challenges and strategies. Biochim Biophys Acta Gen Subj 2013; 1830:5515-25. [DOI: 10.1016/j.bbagen.2013.04.037] [Citation(s) in RCA: 57] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2013] [Revised: 04/24/2013] [Accepted: 04/29/2013] [Indexed: 12/22/2022]
|
47
|
Dimitrov I, Naneva L, Doytchinova I, Bangov I. AllergenFP: allergenicity prediction by descriptor fingerprints. Bioinformatics 2013; 30:846-51. [PMID: 24167156 DOI: 10.1093/bioinformatics/btt619] [Citation(s) in RCA: 450] [Impact Index Per Article: 37.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
MOTIVATION Allergenicity, like antigenicity and immunogenicity, is a property encoded linearly and non-linearly, and therefore the alignment-based approaches are not able to identify this property unambiguously. A novel alignment-free descriptor-based fingerprint approach is presented here and applied to identify allergens and non-allergens. The approach was implemented into a four step algorithm. Initially, the protein sequences are described by amino acid principal properties as hydrophobicity, size, relative abundance, helix and β-strand forming propensities. Then, the generated strings of different length are converted into vectors with equal length by auto- and cross-covariance (ACC). The vectors were transformed into binary fingerprints and compared in terms of Tanimoto coefficient. RESULTS The approach was applied to a set of 2427 known allergens and 2427 non-allergens and identified correctly 88% of them with Matthews correlation coefficient of 0.759. The descriptor fingerprint approach presented here is universal. It could be applied for any classification problem in computational biology. The set of E-descriptors is able to capture the main structural and physicochemical properties of amino acids building the proteins. The ACC transformation overcomes the main problem in the alignment-based comparative studies arising from the different length of the aligned protein sequences. The conversion of protein ACC values into binary descriptor fingerprints allows similarity search and classification. AVAILABILITY AND IMPLEMENTATION The algorithm described in the present study was implemented in a specially designed Web site, named AllergenFP (FP stands for FingerPrint). AllergenFP is written in Python, with GIU in HTML. It is freely accessible at http://ddg-pharmfac.net/Allergen FP. CONTACT idoytchinova@pharmfac.net or ivanbangov@shu-bg.net.
Collapse
Affiliation(s)
- Ivan Dimitrov
- Medical University of Sofia, Faculty of Pharmacy, 2 Dunav st., 1000 Sofia and Konstantin Preslavski Shumen University, Faculty of Natural Sciences, 115 Universitetska st., 9712 Shumen, Bulgaria
| | | | | | | |
Collapse
|
48
|
Dimitrov I, Flower DR, Doytchinova I. AllerTOP--a server for in silico prediction of allergens. BMC Bioinformatics 2013; 14 Suppl 6:S4. [PMID: 23735058 PMCID: PMC3633022 DOI: 10.1186/1471-2105-14-s6-s4] [Citation(s) in RCA: 270] [Impact Index Per Article: 22.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
Abstract
Background Allergy is a form of hypersensitivity to normally innocuous substances, such as dust, pollen, foods or drugs. Allergens are small antigens that commonly provoke an IgE antibody response. There are two types of bioinformatics-based allergen prediction. The first approach follows FAO/WHO Codex alimentarius guidelines and searches for sequence similarity. The second approach is based on identifying conserved allergenicity-related linear motifs. Both approaches assume that allergenicity is a linearly coded property. In the present study, we applied ACC pre-processing to sets of known allergens, developing alignment-independent models for allergen recognition based on the main chemical properties of amino acid sequences. Results A set of 684 food, 1,156 inhalant and 555 toxin allergens was collected from several databases. A set of non-allergens from the same species were selected to mirror the allergen set. The amino acids in the protein sequences were described by three z-descriptors (z1, z2 and z3) and by auto- and cross-covariance (ACC) transformation were converted into uniform vectors. Each protein was presented as a vector of 45 variables. Five machine learning methods for classification were applied in the study to derive models for allergen prediction. The methods were: discriminant analysis by partial least squares (DA-PLS), logistic regression (LR), decision tree (DT), naïve Bayes (NB) and k nearest neighbours (kNN). The best performing model was derived by kNN at k = 3. It was optimized, cross-validated and implemented in a server named AllerTOP, freely accessible at http://www.pharmfac.net/allertop. AllerTOP also predicts the most probable route of exposure. In comparison to other servers for allergen prediction, AllerTOP outperforms them with 94% sensitivity. Conclusions AllerTOP is the first alignment-free server for in silico prediction of allergens based on the main physicochemical properties of proteins. Significantly, as well allergenicity AllerTOP is able to predict the route of allergen exposure: food, inhalant or toxin.
Collapse
Affiliation(s)
- Ivan Dimitrov
- Faculty of Pharmacy, Medical University of Sofia, 2 Dunav st,, Sofia, Bulgaria
| | | | | |
Collapse
|
49
|
Wang J, Yu Y, Zhao Y, Zhang D, Li J. Evaluation and integration of existing methods for computational prediction of allergens. BMC Bioinformatics 2013; 14 Suppl 4:S1. [PMID: 23514097 PMCID: PMC3599076 DOI: 10.1186/1471-2105-14-s4-s1] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022] Open
Abstract
Background Allergy involves a series of complex reactions and factors that contribute to the development of the disease and triggering of the symptoms, including rhinitis, asthma, atopic eczema, skin sensitivity, even acute and fatal anaphylactic shock. Prediction and evaluation of the potential allergenicity is of importance for safety evaluation of foods and other environment factors. Although several computational approaches for assessing the potential allergenicity of proteins have been developed, their performance and relative merits and shortcomings have not been compared systematically. Results To evaluate and improve the existing methods for allergen prediction, we collected an up-to-date definitive dataset consisting of 989 known allergens and massive putative non-allergens. The three most widely used allergen computational prediction approaches including sequence-, motif- and SVM-based (Support Vector Machine) methods were systematically compared using the defined parameters and we found that SVM-based method outperformed the other two methods with higher accuracy and specificity. The sequence-based method with the criteria defined by FAO/WHO (FAO: Food and Agriculture Organization of the United Nations; WHO: World Health Organization) has higher sensitivity of over 98%, but having a low specificity. The advantage of motif-based method is the ability to visualize the key motif within the allergen. Notably, the performances of the sequence-based method defined by FAO/WHO and motif eliciting strategy could be improved by the optimization of parameters. To facilitate the allergen prediction, we integrated these three methods in a web-based application proAP, which provides the global search of the known allergens and a powerful tool for allergen predication. Flexible parameter setting and batch prediction were also implemented. The proAP can be accessed at http://gmobl.sjtu.edu.cn/proAP/main.html. Conclusions This study comprehensively evaluated sequence-, motif- and SVM-based computational prediction approaches for allergens and optimized their parameters to obtain better performance. These findings may provide helpful guidance for the researchers in allergen-prediction. Furthermore, we integrated these methods into a web application proAP, greatly facilitating users to do customizable allergen search and prediction.
Collapse
Affiliation(s)
- Jing Wang
- Bor Luh Food Safety Center, National Center for Molecular Characterization of Genetically Modified Organisms, State Key Laboratory of Hybrid Rice, School of Life Science and Biotechnology, Shanghai Jiao Tong University, China
| | | | | | | | | |
Collapse
|
50
|
Safety assessment of dehydration-responsive element-binding (DREB) 4 protein expressed in E. coli. Food Chem Toxicol 2012; 50:4077-84. [DOI: 10.1016/j.fct.2012.06.026] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2012] [Revised: 06/15/2012] [Accepted: 06/15/2012] [Indexed: 11/22/2022]
|