1
|
Tejera-Nevado P, Serrano E, González-Herrero A, Bermejo R, Rodríguez-González A. Unlocking the power of AI models: exploring protein folding prediction through comparative analysis. J Integr Bioinform 2024; 0:jib-2023-0041. [PMID: 38797876 DOI: 10.1515/jib-2023-0041] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2023] [Accepted: 04/10/2024] [Indexed: 05/29/2024] Open
Abstract
Protein structure determination has made progress with the aid of deep learning models, enabling the prediction of protein folding from protein sequences. However, obtaining accurate predictions becomes essential in certain cases where the protein structure remains undescribed. This is particularly challenging when dealing with rare, diverse structures and complex sample preparation. Different metrics assess prediction reliability and offer insights into result strength, providing a comprehensive understanding of protein structure by combining different models. In a previous study, two proteins named ARM58 and ARM56 were investigated. These proteins contain four domains of unknown function and are present in Leishmania spp. ARM refers to an antimony resistance marker. The study's main objective is to assess the accuracy of the model's predictions, thereby providing insights into the complexities and supporting metrics underlying these findings. The analysis also extends to the comparison of predictions obtained from other species and organisms. Notably, one of these proteins shares an ortholog with Trypanosoma cruzi and Trypanosoma brucei, leading further significance to our analysis. This attempt underscored the importance of evaluating the diverse outputs from deep learning models, facilitating comparisons across different organisms and proteins. This becomes particularly pertinent in cases where no previous structural information is available.
Collapse
Affiliation(s)
- Paloma Tejera-Nevado
- ETS Ingenieros Informáticos, 16771 Universidad Politécnica de Madrid , Madrid, Spain
- Centro de Tecnología Biomédica, 16771 Universidad Politécnica de Madrid , Pozuelo de Alarcón, Madrid, Spain
| | - Emilio Serrano
- ETS Ingenieros Informáticos, 16771 Universidad Politécnica de Madrid , Madrid, Spain
| | - Ana González-Herrero
- 54446 Margarita Salas Center for Biological Research (CIB-CSIC), Spanish National Research Council , Madrid, Spain
| | - Rodrigo Bermejo
- 54446 Margarita Salas Center for Biological Research (CIB-CSIC), Spanish National Research Council , Madrid, Spain
| | - Alejandro Rodríguez-González
- ETS Ingenieros Informáticos, 16771 Universidad Politécnica de Madrid , Madrid, Spain
- Centro de Tecnología Biomédica, 16771 Universidad Politécnica de Madrid , Pozuelo de Alarcón, Madrid, Spain
| |
Collapse
|
2
|
Salauddin M, Kayesh MEH, Ahammed MS, Saha S, Hossain MG. Development of membrane protein-based vaccine against lumpy skin disease virus (LSDV) using immunoinformatic tools. Vet Med Sci 2024; 10:e1438. [PMID: 38555573 PMCID: PMC10981917 DOI: 10.1002/vms3.1438] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2023] [Revised: 02/09/2024] [Accepted: 03/10/2024] [Indexed: 04/02/2024] Open
Abstract
INTRODUCTION Lumpy skin disease, an economically significant bovine illness, is now found in previously unheard-of geographic regions. Vaccination is one of the most important ways to stop its further spread. AIM Therefore, in this study, we applied advanced immunoinformatics approaches to design and develop an effective lumpy skin disease virus (LSDV) vaccine. METHODS The membrane glycoprotein was selected for prediction of the different B- and T-cell epitopes by using the immune epitope database. The selected B- and T-cell epitopes were combined with the appropriate linkers and adjuvant resulted in a vaccine chimera construct. Bioinformatics tools were used to predict, refine and validate the 2D, 3D structures and for molecular docking with toll-like receptor 4 using different servers. The constructed vaccine candidate was further processed on the basis of antigenicity, allergenicity, solubility, different physiochemical properties and molecular docking scores. RESULTS The in silico immune simulation induced significant response for immune cells. In silico cloning and codon optimization were performed to express the vaccine candidate in Escherichia coli. This study highlights a good signal for the design of a peptide-based LSDV vaccine. CONCLUSION Thus, the present findings may indicate that the engineered multi-epitope vaccine is structurally stable and can induce a strong immune response, which should help in developing an effective vaccine towards controlling LSDV infection.
Collapse
Affiliation(s)
- Md. Salauddin
- Department of Microbiology and Public HealthKhulna Agricultural UniversityKhulnaBangladesh
| | | | - Md. Suruj Ahammed
- Department of ChemistryBangladesh University of Engineering and TechnologyDhakaBangladesh
| | - Sukumar Saha
- Department of Microbiology and HygieneBangladesh Agricultural UniversityMymensinghBangladesh
| | - Md. Golzar Hossain
- Department of Microbiology and HygieneBangladesh Agricultural UniversityMymensinghBangladesh
| |
Collapse
|
3
|
Liu C, Zhang L, Zhang J, Wang M, You S, Su R, Qi W. Rational design of antibodies and development of a novel method for (1-3)-β-D glucan detection as an alternative to Limulus amebocyte lysate assay. Front Cell Infect Microbiol 2024; 14:1322264. [PMID: 38328671 PMCID: PMC10847287 DOI: 10.3389/fcimb.2024.1322264] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2023] [Accepted: 01/02/2024] [Indexed: 02/09/2024] Open
Abstract
With advances in medicine, increasing medical interventions have increased the risk of invasive fungal disease development. (1-3)-β-D glucan (BDG) is a common fungal biomarker in serological tests. However, the scarcity of Limulus resources for BDG detection poses a challenge. This study addresses the need for an alternative to Limulus amebocyte lysate by using BDG mutant antibody for chemiluminescence detection. The wild-type BDG antibody was obtained by immunizing rabbits. An optimal V52HI/N34L Y mutant antibody, which has increased 3.7-fold of the testing efficiency compared to the wild-type antibody, was first achieved by mutating "hot-spot" residues that contribute to strong non-covalent bonds, as determined by alanine scanning and molecular dynamics simulation. The mutant was then applied to develop the magnetic particle chemiluminescence method. 574 clinical samples were tested using the developed method, with a cutoff value of 95 pg/mL set by Limulus amebocyte lysate. The receiver operating characteristic curve demonstrated an area under the curve of 0.905 (95% CI: 0.880-0.929). Chemiluminescence detected an antigen concentration of 89.98 pg/mL, exhibiting a sensitivity of 83.33% and specificity of 89.76%. In conclusion, the results showed a good agreement with Limulus amebocyte lysate and demonstrated the feasibility of using BDG mutant antibodies for invasive fungal disease diagnosis. The new method based on chemiluminescence for detecting BDG could shorten the sample-to-result time to approximately 30 min, rescue Limulus from being endangered and is resource efficient in terms of equipment and the non-use of a skilled technician.
Collapse
Affiliation(s)
- Chunlong Liu
- Chemical Engineering Research Center, School of Chemical Engineering and Technology, Tianjin University, Tianjin, China
- R&D Department, Dynamiker Biotechnology (Tianjin) Co., Ltd, Tianjin, China
| | - Lin Zhang
- Chemical Engineering Research Center, School of Chemical Engineering and Technology, Tianjin University, Tianjin, China
| | - Jiaxing Zhang
- Chemical Engineering Research Center, School of Chemical Engineering and Technology, Tianjin University, Tianjin, China
| | - Mengfan Wang
- Chemical Engineering Research Center, School of Chemical Engineering and Technology, Tianjin University, Tianjin, China
| | - Shengping You
- Chemical Engineering Research Center, School of Chemical Engineering and Technology, Tianjin University, Tianjin, China
- Tianjin Key Laboratory of Membrane Science and Desalination Technology, Tianjin University, Tianjin, China
| | - Rongxin Su
- Chemical Engineering Research Center, School of Chemical Engineering and Technology, Tianjin University, Tianjin, China
- Tianjin Key Laboratory of Membrane Science and Desalination Technology, Tianjin University, Tianjin, China
- State Key Laboratory of Chemical Engineering, Tianjin University, Tianjin, China
- Collaborative Innovation Center of Chemical Science and Engineering (Tianjin), Tianjin, China
| | - Wei Qi
- Chemical Engineering Research Center, School of Chemical Engineering and Technology, Tianjin University, Tianjin, China
- Tianjin Key Laboratory of Membrane Science and Desalination Technology, Tianjin University, Tianjin, China
- State Key Laboratory of Chemical Engineering, Tianjin University, Tianjin, China
- Collaborative Innovation Center of Chemical Science and Engineering (Tianjin), Tianjin, China
| |
Collapse
|
4
|
Peng CX, Liang F, Xia YH, Zhao KL, Hou MH, Zhang GJ. Recent Advances and Challenges in Protein Structure Prediction. J Chem Inf Model 2024; 64:76-95. [PMID: 38109487 DOI: 10.1021/acs.jcim.3c01324] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2023]
Abstract
Artificial intelligence has made significant advances in the field of protein structure prediction in recent years. In particular, DeepMind's end-to-end model, AlphaFold2, has demonstrated the capability to predict three-dimensional structures of numerous unknown proteins with accuracy levels comparable to those of experimental methods. This breakthrough has opened up new possibilities for understanding protein structure and function as well as accelerating drug discovery and other applications in the field of biology and medicine. Despite the remarkable achievements of artificial intelligence in the field, there are still some challenges and limitations. In this Review, we discuss the recent progress and some of the challenges in protein structure prediction. These challenges include predicting multidomain protein structures, protein complex structures, multiple conformational states of proteins, and protein folding pathways. Furthermore, we highlight directions in which further improvements can be conducted.
Collapse
Affiliation(s)
- Chun-Xiang Peng
- College of Information Engineering, Zhejiang University of Technology, Hangzhou 310023, China
| | - Fang Liang
- College of Information Engineering, Zhejiang University of Technology, Hangzhou 310023, China
| | - Yu-Hao Xia
- College of Information Engineering, Zhejiang University of Technology, Hangzhou 310023, China
| | - Kai-Long Zhao
- College of Information Engineering, Zhejiang University of Technology, Hangzhou 310023, China
| | - Ming-Hua Hou
- College of Information Engineering, Zhejiang University of Technology, Hangzhou 310023, China
| | - Gui-Jun Zhang
- College of Information Engineering, Zhejiang University of Technology, Hangzhou 310023, China
| |
Collapse
|
5
|
Peng Z, Wang W, Wei H, Li X, Yang J. Improved protein structure prediction with trRosettaX2, AlphaFold2, and optimized MSAs in CASP15. Proteins 2023; 91:1704-1711. [PMID: 37565699 DOI: 10.1002/prot.26570] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2023] [Revised: 07/17/2023] [Accepted: 07/31/2023] [Indexed: 08/12/2023]
Abstract
We present the monomer and multimer structure prediction results of our methods in CASP15. We first designed an elaborate pipeline that leverages complementary sequence databases and advanced database searching algorithms to generate high-quality multiple sequence alignments (MSAs). Top MSAs were then selected for the subsequent step of structure prediction. We utilized trRosettaX2 and AlphaFold2 for monomer structure prediction (group name Yang-Server), and AlphaFold-Multimer for multimer structure prediction (group name Yang-Multimer). Yang-Server and Yang-Multimer are ranked at the top and the fourth, respectively, for monomer and multimer structure prediction. For 94 monomers, the average TM-score of the predicted structure models by Yang-Server is 0.876, compared to 0.798 by the default AlphaFold2 (i.e., the group NBIS-AF2-standard). For 42 multimers, the average DockQ score of the predicted structure models by Yang-Multimer is 0.464, compared to 0.389 by the default AlphaFold-Multimer (i.e., the group NBIS-AF2-multimer). Detailed analysis of the results shows that several factors contribute to the improvement, including improved MSAs, iterated modeling for large targets, interplay between monomer and multimer structure prediction for intertwined structures, etc. However, the structure predictions for orphan proteins and multimers remain challenging, and breakthroughs in this area are anticipated in the future.
Collapse
Affiliation(s)
- Zhenling Peng
- MOE Frontiers Science Center for Nonlinear Expectations, Research Center for Mathematics and Interdisciplinary Sciences, Shandong University, Qingdao, China
| | - Wenkai Wang
- School of Mathematical Sciences, Nankai University, Tianjin, China
| | - Hong Wei
- School of Mathematical Sciences, Nankai University, Tianjin, China
| | - Xiaoge Li
- MOE Frontiers Science Center for Nonlinear Expectations, Research Center for Mathematics and Interdisciplinary Sciences, Shandong University, Qingdao, China
| | - Jianyi Yang
- MOE Frontiers Science Center for Nonlinear Expectations, Research Center for Mathematics and Interdisciplinary Sciences, Shandong University, Qingdao, China
| |
Collapse
|
6
|
De Salis SKF, Chen JZ, Skarratt KK, Fuller SJ, Balle T. Deep learning structural insights into heterotrimeric alternatively spliced P2X7 receptors. Purinergic Signal 2023:10.1007/s11302-023-09978-3. [PMID: 38032425 DOI: 10.1007/s11302-023-09978-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2023] [Accepted: 10/31/2023] [Indexed: 12/01/2023] Open
Abstract
P2X7 receptors (P2X7Rs) are membrane-bound ATP-gated ion channels that are composed of three subunits. Different subunit structures may be expressed due to alternative splicing of the P2RX7 gene, altering the receptor's function when combined with the wild-type P2X7A subunits. In this study, the application of the deep-learning method, AlphaFold2-Multimer (AF2M), for the generation of trimeric P2X7Rs was validated by comparing an AF2M-generated rat wild-type P2X7A receptor with a structure determined by cryogenic electron microscopy (cryo-EM) (Protein Data Bank Identification: 6U9V). The results suggested AF2M could firstly, accurately predict the structures of P2X7Rs and secondly, accurately identify the highest quality model through the ranking system. Subsequently, AF2M was used to generate models of heterotrimeric alternatively spliced P2X7Rs consisting of one or two wild-type P2X7A subunits in combination with one or two P2X7B, P2X7E, P2X7J, and P2X7L splice variant subunits. The top-ranking models were deemed valid based on AF2M's confidence measures, stability in molecular dynamics simulations, and consistent flexibility of the conserved regions between the models. The structure of the heterotrimeric receptors, which were missing key residues in the ATP binding sites and carboxyl terminal domains (CTDs) compared to the wild-type receptor, help to explain their observed functions. Overall, the models produced in this study (available as supplementary material) unlock the possibility of structure-based studies into the heterotrimeric P2X7Rs.
Collapse
Affiliation(s)
- Sophie K F De Salis
- Brain and Mind Centre, The University of Sydney, Camperdown, NSW, 2050, Australia
- Sydney Pharmacy School, The University of Sydney, Camperdown, NSW, 2050, Australia
| | - Jake Zheng Chen
- Brain and Mind Centre, The University of Sydney, Camperdown, NSW, 2050, Australia
- Sydney Pharmacy School, The University of Sydney, Camperdown, NSW, 2050, Australia
| | - Kristen K Skarratt
- The University of Sydney, Nepean Clinical School, Kingswood, NSW, 2747, Australia
| | - Stephen J Fuller
- The University of Sydney, Nepean Clinical School, Kingswood, NSW, 2747, Australia
| | - Thomas Balle
- Brain and Mind Centre, The University of Sydney, Camperdown, NSW, 2050, Australia.
- Sydney Pharmacy School, The University of Sydney, Camperdown, NSW, 2050, Australia.
| |
Collapse
|
7
|
Wang W, Feng C, Han R, Wang Z, Ye L, Du Z, Wei H, Zhang F, Peng Z, Yang J. trRosettaRNA: automated prediction of RNA 3D structure with transformer network. Nat Commun 2023; 14:7266. [PMID: 37945552 PMCID: PMC10636060 DOI: 10.1038/s41467-023-42528-4] [Citation(s) in RCA: 12] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2023] [Accepted: 10/13/2023] [Indexed: 11/12/2023] Open
Abstract
RNA 3D structure prediction is a long-standing challenge. Inspired by the recent breakthrough in protein structure prediction, we developed trRosettaRNA, an automated deep learning-based approach to RNA 3D structure prediction. The trRosettaRNA pipeline comprises two major steps: 1D and 2D geometries prediction by a transformer network; and 3D structure folding by energy minimization. Benchmark tests suggest that trRosettaRNA outperforms traditional automated methods. In the blind tests of the 15th Critical Assessment of Structure Prediction (CASP15) and the RNA-Puzzles experiments, the automated trRosettaRNA predictions for the natural RNAs are competitive with the top human predictions. trRosettaRNA also outperforms other deep learning-based methods in CASP15 when measured by the Z-score of the Root-Mean-Square Deviation. Nevertheless, it remains challenging to predict accurate structures for synthetic RNAs with an automated approach. We hope this work could be a good start toward solving the hard problem of RNA structure prediction with deep learning.
Collapse
Affiliation(s)
- Wenkai Wang
- School of Mathematical Sciences, Nankai University, Tianjin, 300071, China
| | - Chenjie Feng
- MOE Frontiers Science Center for Nonlinear Expectations, Research Center for Mathematics and Interdisciplinary Sciences, Shandong University, Qingdao, 266237, China
- School of Science, Ningxia Medical University, Yinchuan, 750004, China
| | - Renmin Han
- MOE Frontiers Science Center for Nonlinear Expectations, Research Center for Mathematics and Interdisciplinary Sciences, Shandong University, Qingdao, 266237, China
| | - Ziyi Wang
- MOE Frontiers Science Center for Nonlinear Expectations, Research Center for Mathematics and Interdisciplinary Sciences, Shandong University, Qingdao, 266237, China
| | - Lisha Ye
- School of Mathematical Sciences, Nankai University, Tianjin, 300071, China
| | - Zongyang Du
- School of Mathematical Sciences, Nankai University, Tianjin, 300071, China
| | - Hong Wei
- School of Mathematical Sciences, Nankai University, Tianjin, 300071, China
| | - Fa Zhang
- School of Medical Technology, Beijing Institute of Technology, Beijing, 100081, China.
| | - Zhenling Peng
- MOE Frontiers Science Center for Nonlinear Expectations, Research Center for Mathematics and Interdisciplinary Sciences, Shandong University, Qingdao, 266237, China.
| | - Jianyi Yang
- MOE Frontiers Science Center for Nonlinear Expectations, Research Center for Mathematics and Interdisciplinary Sciences, Shandong University, Qingdao, 266237, China.
| |
Collapse
|
8
|
Li D, Zaraei SO, Sbenati RM, Ravi A, Wen Y, Zeng L, Wang J, El-Gamal MI, Xu H. Synthesis and Biological Activity of Sulfamate-Adamantane Derivatives as Glucosinolate Sulfatase Inhibitors. JOURNAL OF AGRICULTURAL AND FOOD CHEMISTRY 2023; 71:15476-15484. [PMID: 37818663 DOI: 10.1021/acs.jafc.3c04879] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/12/2023]
Abstract
The glucosinolate-myrosinase system, exclusively found in the Brassicaceae family, is a main defense strategy against insect resistance. The efficient detoxification activity of glucosinolate sulfatases (GSSs) has successfully supported the feeding of Plutella xylostella on cruciferous plants. With the activity of GSSs hampered in P. xylostella, the toxic isothiocyanates produced from glucosinolates severely impair larval growth and adult reproduction. Therefore, inhibitors of GSSs have been suggested as an alternative approach to controlling P. xylostella. Herein, we synthesized eight adamantyl-possessing sulfamate derivatives as novel inhibitors of GSSs. Adam-20-S exhibited the most potent GSS inhibitory activity, with an IC50 value of 9.04 mg/L. The suppression of GSSs by Adam-20-S impaired glucosinolate metabolism to produce more toxic isothiocyanates in P. xylostella. Consequently, the growth and development of P. xylostella were significantly hindered when feeding on the host plant. Our study may help facilitate the development of a comprehensive pest management strategy that combines insect detoxification enzyme inhibitors with plant chemical defenses.
Collapse
Affiliation(s)
- Dehong Li
- National Key Laboratory of Green Pesticide, Key Laboratory of Natural Pesticide and Chemical Biology, Ministry of Education, College of Plant Protection, South China Agricultural University, Guangzhou, Guangdong 510642, People's Republic of China
| | - Seyed-Omar Zaraei
- Research Institute for Medical and Health Sciences, University of Sharjah, Sharjah 27272, United Arab Emirates
| | - Rawan M Sbenati
- Research Institute for Medical and Health Sciences, University of Sharjah, Sharjah 27272, United Arab Emirates
| | - Anil Ravi
- Research Institute for Medical and Health Sciences, University of Sharjah, Sharjah 27272, United Arab Emirates
| | - Yingjie Wen
- National Key Laboratory of Green Pesticide, Key Laboratory of Natural Pesticide and Chemical Biology, Ministry of Education, College of Plant Protection, South China Agricultural University, Guangzhou, Guangdong 510642, People's Republic of China
| | - Lingda Zeng
- National Key Laboratory of Green Pesticide, Key Laboratory of Natural Pesticide and Chemical Biology, Ministry of Education, College of Plant Protection, South China Agricultural University, Guangzhou, Guangdong 510642, People's Republic of China
| | - Jiali Wang
- National Key Laboratory of Green Pesticide, Key Laboratory of Natural Pesticide and Chemical Biology, Ministry of Education, College of Plant Protection, South China Agricultural University, Guangzhou, Guangdong 510642, People's Republic of China
| | - Mohammed I El-Gamal
- Research Institute for Medical and Health Sciences, University of Sharjah, Sharjah 27272, United Arab Emirates
- Department of Medicinal Chemistry, College of Pharmacy, University of Sharjah, Sharjah 27272, United Arab Emirates
- Department of Medicinal Chemistry, Faculty of Pharmacy, Mansoura University, Mansoura 35516, Egypt
| | - Hanhong Xu
- National Key Laboratory of Green Pesticide, Key Laboratory of Natural Pesticide and Chemical Biology, Ministry of Education, College of Plant Protection, South China Agricultural University, Guangzhou, Guangdong 510642, People's Republic of China
| |
Collapse
|
9
|
Zhu HT, Xia YH, Zhang GJ. E2EDA: Protein Domain Assembly Based on End-to-End Deep Learning. J Chem Inf Model 2023; 63:6451-6461. [PMID: 37788318 DOI: 10.1021/acs.jcim.3c01387] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/05/2023]
Abstract
With the development of deep learning, almost all single-domain proteins can be predicted at experimental resolution. However, the structure prediction of multi-domain proteins remains a challenge. Achieving end-to-end protein domain assembly and further improving the accuracy of the full-chain modeling by accurately predicting inter-domain orientation while improving the assembly efficiency will provide significant insights into structure-based drug discovery. In this work, we propose an End-to-End Domain Assembly method based on deep learning, named E2EDA. We first develop RMNet, an EfficientNetV2-based deep learning model that fuses multiple features using an attention mechanism to predict inter-domain rigid motion. Then, the predicted rigid motions are transformed into inter-domain spatial transformations to directly assemble the full-chain model. Finally, the scoring strategy RMscore is designed to select the best model from multiple assembled models. The experimental results show that the average TM-score of the model assembled by E2EDA on the benchmark set (282) is 0.827, which is better than those of other domain assembly methods SADA (0.792) and DEMO (0.730). Meanwhile, on our constructed multi-domain data set from AlphaFold DB, the model reassembled by E2EDA is 7.0% higher in TM-score compared to the full-chain model predicted by AlphaFold2, indicating that E2EDA can capture more accurate inter-domain orientations to improve the quality of the model predicted by AlphaFold2. Furthermore, compared to SADA and AlphaFold2, E2EDA reduced the average runtime on the benchmark by 64.7% and 19.2%, respectively, indicating that E2EDA can significantly improve assembly efficiency through an end-to-end approach. The online server is available at http://zhanglab-bioinf.com/E2EDA.
Collapse
Affiliation(s)
- Hai-Tao Zhu
- College of Information Engineering, Zhejiang University of Technology, Hangzhou, 310023, China
| | - Yu-Hao Xia
- College of Information Engineering, Zhejiang University of Technology, Hangzhou, 310023, China
| | - Gui-Jun Zhang
- College of Information Engineering, Zhejiang University of Technology, Hangzhou, 310023, China
| |
Collapse
|
10
|
Shao J, Zhang Q, Yan K, Liu B. PreHom-PCLM: protein remote homology detection by combing motifs and protein cubic language model. Brief Bioinform 2023; 24:bbad347. [PMID: 37833837 DOI: 10.1093/bib/bbad347] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2023] [Revised: 08/14/2023] [Accepted: 09/14/2023] [Indexed: 10/15/2023] Open
Abstract
Protein remote homology detection is essential for structure prediction, function prediction, disease mechanism understanding, etc. The remote homology relationship depends on multiple protein properties, such as structural information and local sequence patterns. Previous studies have shown the challenges for predicting remote homology relationship by protein features at sequence level (e.g. position-specific score matrix). Protein motifs have been used in structure and function analysis due to their unique sequence patterns and implied structural information. Therefore, designing a usable architecture to fuse multiple protein properties based on motifs is urgently needed to improve protein remote homology detection performance. To make full use of the characteristics of motifs, we employed the language model called the protein cubic language model (PCLM). It combines multiple properties by constructing a motif-based neural network. Based on the PCLM, we proposed a predictor called PreHom-PCLM by extracting and fusing multiple motif features for protein remote homology detection. PreHom-PCLM outperforms the other state-of-the-art methods on the test set and independent test set. Experimental results further prove the effectiveness of multiple features fused by PreHom-PCLM for remote homology detection. Furthermore, the protein features derived from the PreHom-PCLM show strong discriminative power for proteins from different structural classes in the high-dimensional space. Availability and Implementation: http://bliulab.net/PreHom-PCLM.
Collapse
Affiliation(s)
- Jiangyi Shao
- School of Computer Science and Technology, Beijing Institute of Technology, Beijing 100081, China
| | - Qi Zhang
- School of Computer Science and Technology, Beijing Institute of Technology, Beijing 100081, China
| | - Ke Yan
- School of Computer Science and Technology, Beijing Institute of Technology, Beijing 100081, China
- Advanced Research Institute of Multidisciplinary Science, Beijing Institute of Technology, Beijing 100081, China
| | - Bin Liu
- School of Computer Science and Technology, Beijing Institute of Technology, Beijing 100081, China
- Advanced Research Institute of Multidisciplinary Science, Beijing Institute of Technology, Beijing 100081, China
| |
Collapse
|
11
|
Guo D, Liu P, Liu Q, Zheng L, Liu S, Shen C, Liu L, Fan S, Li N, Dong J, Wang T. Legume-specific SnRK1 promotes malate supply to bacteroids for symbiotic nitrogen fixation. MOLECULAR PLANT 2023; 16:1396-1412. [PMID: 37598296 DOI: 10.1016/j.molp.2023.08.009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/11/2022] [Revised: 01/12/2023] [Accepted: 08/16/2023] [Indexed: 08/21/2023]
Abstract
Nodulation is an energy-expensive behavior driven by legumes by providing carbon sources to bacteroids and obtaining nitrogen sources in return. The energy sensor sucrose nonfermenting 1-related protein kinase 1 (SnRK1) is the hub of energy regulation in eukaryotes. However, the molecular mechanism by which SnRK1 coordinates the allocation of energy and substances during symbiotic nitrogen fixation (SNF) remains unknown. In this study, we identified the novel legume-specific SnRK1α4, a member of the SnRK1 family that positively regulates SNF. Phenotypic analysis showed that nodule size and nitrogenase activity increased in SnRK1α4-overexpressing plants and decreased significantly in snrk1α4 mutants. We demonstrated that a key upstream kinase involved in nodulation, Does Not Make Infection 2 (DMI2), can phosphorylate SnRK1α4 at Thr175 to cause its activation. Further evidence clarified that SnRK1α4 phosphorylates the malate dehydrogenases MDH1/2 to promote malate production in the cytoplasm, supplying carbon sources to bacteroids. Therefore, our findings reveal an essential role of the DMI2-SnRK1α4-MDH pathway in supplying carbon sources to bacteroids for SNF and provide a new module for constructing cereal crops with SNF.
Collapse
Affiliation(s)
- Da Guo
- College of Biological Sciences, China Agricultural University, Beijing 100193, China
| | - Peng Liu
- College of Biological Sciences, China Agricultural University, Beijing 100193, China
| | - Qianwen Liu
- College of Biological Sciences, China Agricultural University, Beijing 100193, China
| | - Lihua Zheng
- College of Biological Sciences, China Agricultural University, Beijing 100193, China
| | - Sikai Liu
- College of Biological Sciences, China Agricultural University, Beijing 100193, China
| | - Chen Shen
- College of Biological Sciences, China Agricultural University, Beijing 100193, China
| | - Li Liu
- College of Biological Sciences, China Agricultural University, Beijing 100193, China
| | - Shasha Fan
- College of Biological Sciences, China Agricultural University, Beijing 100193, China
| | - Nan Li
- College of Biological Sciences, China Agricultural University, Beijing 100193, China
| | - Jiangli Dong
- College of Biological Sciences, China Agricultural University, Beijing 100193, China.
| | - Tao Wang
- College of Biological Sciences, China Agricultural University, Beijing 100193, China.
| |
Collapse
|
12
|
Mani H, Chang CC, Hsu HJ, Yang CH, Yen JH, Liou JW. Comparison, Analysis, and Molecular Dynamics Simulations of Structures of a Viral Protein Modeled Using Various Computational Tools. Bioengineering (Basel) 2023; 10:1004. [PMID: 37760106 PMCID: PMC10525864 DOI: 10.3390/bioengineering10091004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2023] [Revised: 08/16/2023] [Accepted: 08/22/2023] [Indexed: 09/29/2023] Open
Abstract
The structural analysis of proteins is a major domain of biomedical research. Such analysis requires resolved three-dimensional structures of proteins. Advancements in computer technology have led to progress in biomedical research. In silico prediction and modeling approaches have facilitated the construction of protein structures, with or without structural templates. In this study, we used three neural network-based de novo modeling approaches-AlphaFold2 (AF2), Robetta-RoseTTAFold (Robetta), and transform-restrained Rosetta (trRosetta)-and two template-based tools-the Molecular Operating Environment (MOE) and iterative threading assembly refinement (I-TASSER)-to construct the structure of a viral capsid protein, hepatitis C virus core protein (HCVcp), whose structure have not been fully resolved by laboratory techniques. Templates with sufficient sequence identity for the homology modeling of complete HCVcp are currently unavailable. Therefore, we performed domain-based homology modeling for MOE simulations. The templates for each domain were obtained through sequence-based searches on NCBI and the Protein Data Bank. Then, the modeled domains were assembled to construct the complete structure of HCVcp. The full-length structure and two truncated forms modeled using various computational tools were compared. Molecular dynamics (MD) simulations were performed to refine the structures. The root mean square deviation of backbone atoms, root mean square fluctuation of Cα atoms, and radius of gyration were calculated to monitor structural changes and convergence in the simulations. The model quality was evaluated through ERRAT and phi-psi plot analysis. In terms of the initial prediction for protein modeling, Robetta and trRosetta outperformed AF2. Regarding template-based tools, MOE outperformed I-TASSER. MD simulations resulted in compactly folded protein structures, which were of good quality and theoretically accurate. Thus, the predicted structures of certain proteins must be refined to obtain reliable structural models. MD simulation is a promising tool for this purpose.
Collapse
Affiliation(s)
- Hemalatha Mani
- Institute of Medical Sciences, Tzu Chi University, Hualien 97004, Taiwan
| | - Chun-Chun Chang
- Department of Laboratory Medicine, Hualien Tzu Chi Hospital, Buddhist Tzu Chi Medical Foundation, Hualien 97004, Taiwan
- Department of Laboratory Medicine and Biotechnology, Tzu Chi University, Hualien 97004, Taiwan
| | - Hao-Jen Hsu
- Department of Biomedical Sciences and Engineering, Tzu Chi University, Hualien 97004, Taiwan
| | - Chin-Hao Yang
- Department of Biochemistry, School of Medicine, Tzu Chi University, Hualien 97004, Taiwan
| | - Jui-Hung Yen
- Department of Molecular Biology and Human Genetics, Tzu Chi University, Hualien 97004, Taiwan
| | - Je-Wen Liou
- Institute of Medical Sciences, Tzu Chi University, Hualien 97004, Taiwan
- Department of Laboratory Medicine and Biotechnology, Tzu Chi University, Hualien 97004, Taiwan
- Department of Biochemistry, School of Medicine, Tzu Chi University, Hualien 97004, Taiwan
| |
Collapse
|
13
|
Lin P, Yan Y, Tao H, Huang SY. Deep transfer learning for inter-chain contact predictions of transmembrane protein complexes. Nat Commun 2023; 14:4935. [PMID: 37582780 PMCID: PMC10427616 DOI: 10.1038/s41467-023-40426-3] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2023] [Accepted: 07/21/2023] [Indexed: 08/17/2023] Open
Abstract
Membrane proteins are encoded by approximately a quarter of human genes. Inter-chain residue-residue contact information is important for structure prediction of membrane protein complexes and valuable for understanding their molecular mechanism. Although many deep learning methods have been proposed to predict the intra-protein contacts or helix-helix interactions in membrane proteins, it is still challenging to accurately predict their inter-chain contacts due to the limited number of transmembrane proteins. Addressing the challenge, here we develop a deep transfer learning method for predicting inter-chain contacts of transmembrane protein complexes, named DeepTMP, by taking advantage of the knowledge pre-trained from a large data set of non-transmembrane proteins. DeepTMP utilizes a geometric triangle-aware module to capture the correct inter-chain interaction from the coevolution information generated by protein language models. DeepTMP is extensively evaluated on a test set of 52 self-associated transmembrane protein complexes, and compared with state-of-the-art methods including DeepHomo2.0, CDPred, GLINTER, DeepHomo, and DNCON2_Inter. It is shown that DeepTMP considerably improves the precision of inter-chain contact prediction and outperforms the existing approaches in both accuracy and robustness.
Collapse
Affiliation(s)
- Peicong Lin
- School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei, 430074, China
| | - Yumeng Yan
- School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei, 430074, China
| | - Huanyu Tao
- School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei, 430074, China
| | - Sheng-You Huang
- School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei, 430074, China.
| |
Collapse
|
14
|
Tavoulari S, Sichrovsky M, Kunji ERS. Fifty years of the mitochondrial pyruvate carrier: New insights into its structure, function, and inhibition. Acta Physiol (Oxf) 2023; 238:e14016. [PMID: 37366179 PMCID: PMC10909473 DOI: 10.1111/apha.14016] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2023] [Revised: 06/12/2023] [Accepted: 06/14/2023] [Indexed: 06/28/2023]
Abstract
The mitochondrial pyruvate carrier (MPC) resides in the mitochondrial inner membrane, where it links cytosolic and mitochondrial metabolism by transporting pyruvate produced in glycolysis into the mitochondrial matrix. Due to its central metabolic role, it has been proposed as a potential drug target for diabetes, non-alcoholic fatty liver disease, neurodegeneration, and cancers relying on mitochondrial metabolism. Little is known about the structure and mechanism of MPC, as the proteins involved were only identified a decade ago and technical difficulties concerning their purification and stability have hindered progress in functional and structural analyses. The functional unit of MPC is a hetero-dimer comprising two small homologous membrane proteins, MPC1/MPC2 in humans, with the alternative complex MPC1L/MPC2 forming in the testis, but MPC proteins are found throughout the tree of life. The predicted topology of each protomer consists of an amphipathic helix followed by three transmembrane helices. An increasing number of inhibitors are being identified, expanding MPC pharmacology and providing insights into the inhibitory mechanism. Here, we provide critical insights on the composition, structure, and function of the complex and we summarize the different classes of small molecule inhibitors and their potential in therapeutics.
Collapse
Affiliation(s)
- Sotiria Tavoulari
- Medical Research Council Mitochondrial Biology UnitUniversity of CambridgeCambridgeUK
| | - Maximilian Sichrovsky
- Medical Research Council Mitochondrial Biology UnitUniversity of CambridgeCambridgeUK
| | - Edmund R. S. Kunji
- Medical Research Council Mitochondrial Biology UnitUniversity of CambridgeCambridgeUK
| |
Collapse
|
15
|
Rivera M, Burgos‐Bravo F, Engelberger F, Asor R, Lagos‐Espinoza MIA, Figueroa M, Kukura P, Ramírez‐Sarmiento CA, Baez M, Smith SB, Wilson CAM. Effect of temperature and nucleotide on the binding of BiP chaperone to a protein substrate. Protein Sci 2023; 32:e4706. [PMID: 37323096 PMCID: PMC10303699 DOI: 10.1002/pro.4706] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2022] [Revised: 06/01/2023] [Accepted: 06/12/2023] [Indexed: 06/17/2023]
Abstract
BiP (immunoglobulin heavy-chain binding protein) is a Hsp70 monomeric ATPase motor that plays broad and crucial roles in maintaining proteostasis inside the cell. Structurally, BiP is formed by two domains, a nucleotide-binding domain (NBD) with ATPase activity connected by a flexible hydrophobic linker to the substrate-binding domain. While the ATPase and substrate binding activities of BiP are allosterically coupled, the latter is also dependent on nucleotide binding. Recent structural studies have provided new insights into BiP's allostery; however, the influence of temperature on the coupling between substrate and nucleotide binding to BiP remains unexplored. Here, we study BiP's binding to its substrate at the single molecule level using thermo-regulated optical tweezers which allows us to mechanically unfold the client protein and explore the effect of temperature and different nucleotides on BiP binding. Our results confirm that the affinity of BiP for its protein substrate relies on nucleotide binding, by mainly regulating the binding kinetics between BiP and its substrate. Interestingly, our findings also showed that the apparent affinity of BiP for its protein substrate in the presence of nucleotides remains invariable over a wide range of temperatures, suggesting that BiP may interact with its client proteins with similar affinities even when the temperature is not optimal. Thus, BiP could play a role as a "thermal buffer" in proteostasis.
Collapse
Affiliation(s)
- Maira Rivera
- Institute for Biological and Medical Engineering, Schools of Engineering, Medicine and Biological SciencesPontificia Universidad Católica de ChileSantiagoChile
- ANID–Millennium Science Initiative Program–Millennium Institute for Integrative Biology (iBio)SantiagoChile
- Departamento de Bioquímica y Biología Molecular, Facultad de Ciencias Químicas y FarmacéuticasUniversidad de ChileSantiagoChile
| | - Francesca Burgos‐Bravo
- Departamento de Bioquímica y Biología Molecular, Facultad de Ciencias Químicas y FarmacéuticasUniversidad de ChileSantiagoChile
- Institute for Quantitative BiosciencesUniversity of CaliforniaBerkeleyCaliforniaUSA
| | - Felipe Engelberger
- Institute for Biological and Medical Engineering, Schools of Engineering, Medicine and Biological SciencesPontificia Universidad Católica de ChileSantiagoChile
- ANID–Millennium Science Initiative Program–Millennium Institute for Integrative Biology (iBio)SantiagoChile
| | - Roi Asor
- Physical and Theoretical Chemistry Laboratory, Department of ChemistryUniversity of OxfordOxfordUK
- The Kavli Institute for Nanoscience DiscoveryOxfordUK
| | - Miguel I. A. Lagos‐Espinoza
- Departamento de Bioquímica y Biología Molecular, Facultad de Ciencias Químicas y FarmacéuticasUniversidad de ChileSantiagoChile
| | - Maximiliano Figueroa
- Departamento de Bioquímica y Biología Molecular, Facultad de Ciencias BiológicasUniversidad de ConcepciónConcepciónChile
| | - Philipp Kukura
- Physical and Theoretical Chemistry Laboratory, Department of ChemistryUniversity of OxfordOxfordUK
- The Kavli Institute for Nanoscience DiscoveryOxfordUK
| | - César A. Ramírez‐Sarmiento
- Institute for Biological and Medical Engineering, Schools of Engineering, Medicine and Biological SciencesPontificia Universidad Católica de ChileSantiagoChile
- ANID–Millennium Science Initiative Program–Millennium Institute for Integrative Biology (iBio)SantiagoChile
| | - Mauricio Baez
- Departamento de Bioquímica y Biología Molecular, Facultad de Ciencias Químicas y FarmacéuticasUniversidad de ChileSantiagoChile
| | | | - Christian A. M. Wilson
- Departamento de Bioquímica y Biología Molecular, Facultad de Ciencias Químicas y FarmacéuticasUniversidad de ChileSantiagoChile
| |
Collapse
|
16
|
Spiers AJ, Dorfmueller HC, Jerdan R, McGregor J, Nicoll A, Steel K, Cameron S. Bioinformatics characterization of BcsA-like orphan proteins suggest they form a novel family of pseudomonad cyclic-β-glucan synthases. PLoS One 2023; 18:e0286540. [PMID: 37267309 PMCID: PMC10237404 DOI: 10.1371/journal.pone.0286540] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2022] [Accepted: 05/18/2023] [Indexed: 06/04/2023] Open
Abstract
Bacteria produce a variety of polysaccharides with functional roles in cell surface coating, surface and host interactions, and biofilms. We have identified an 'Orphan' bacterial cellulose synthase catalytic subunit (BcsA)-like protein found in four model pseudomonads, P. aeruginosa PA01, P. fluorescens SBW25, P. putida KT2440 and P. syringae pv. tomato DC3000. Pairwise alignments indicated that the Orphan and BcsA proteins shared less than 41% sequence identity suggesting they may not have the same structural folds or function. We identified 112 Orphans among soil and plant-associated pseudomonads as well as in phytopathogenic and human opportunistic pathogenic strains. The wide distribution of these highly conserved proteins suggest they form a novel family of synthases producing a different polysaccharide. In silico analysis, including sequence comparisons, secondary structure and topology predictions, and protein structural modelling, revealed a two-domain transmembrane ovoid-like structure for the Orphan protein with a periplasmic glycosyl hydrolase family GH17 domain linked via a transmembrane region to a cytoplasmic glycosyltransferase family GT2 domain. We suggest the GT2 domain synthesises β-(1,3)-glucan that is transferred to the GH17 domain where it is cleaved and cyclised to produce cyclic-β-(1,3)-glucan (CβG). Our structural models are consistent with enzymatic characterisation and recent molecular simulations of the PaPA01 and PpKT2440 GH17 domains. It also provides a functional explanation linking PaPAK and PaPA14 Orphan (also known as NdvB) transposon mutants with CβG production and biofilm-associated antibiotic resistance. Importantly, cyclic glucans are also involved in osmoregulation, plant infection and induced systemic suppression, and our findings suggest this novel family of CβG synthases may provide similar range of adaptive responses for pseudomonads.
Collapse
Affiliation(s)
- Andrew J. Spiers
- School of Applied Sciences, Abertay University, Dundee, United Kingdom
| | - Helge C. Dorfmueller
- Division of Molecular Microbiology, School of Life Sciences, University of Dundee, Dundee, United Kingdom
| | - Robyn Jerdan
- School of Applied Sciences, Abertay University, Dundee, United Kingdom
| | - Jessica McGregor
- Nuffield Research Placement Students, School of Applied Sciences, Abertay University, Dundee, United Kingdom
| | - Abbie Nicoll
- Nuffield Research Placement Students, School of Applied Sciences, Abertay University, Dundee, United Kingdom
| | - Kenzie Steel
- Nuffield Research Placement Students, School of Applied Sciences, Abertay University, Dundee, United Kingdom
| | - Scott Cameron
- School of Applied Sciences, Abertay University, Dundee, United Kingdom
| |
Collapse
|
17
|
Kahveci-Türköz S, Bläsius K, Wozniak J, Rinkens C, Seifert A, Kasparek P, Ohm H, Oltzen S, Nieszporek M, Schwarz N, Babendreyer A, Preisinger C, Sedlacek R, Ludwig A, Düsterhöft S. A structural model of the iRhom-ADAM17 sheddase complex reveals functional insights into its trafficking and activity. Cell Mol Life Sci 2023; 80:135. [PMID: 37119365 PMCID: PMC10148629 DOI: 10.1007/s00018-023-04783-y] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2023] [Revised: 04/16/2023] [Accepted: 04/17/2023] [Indexed: 05/01/2023]
Abstract
Several membrane-anchored signal mediators such as cytokines (e.g. TNFα) and growth factors are proteolytically shed from the cell surface by the metalloproteinase ADAM17, which, thus, has an essential role in inflammatory and developmental processes. The membrane proteins iRhom1 and iRhom2 are instrumental for the transport of ADAM17 to the cell surface and its regulation. However, the structure-function determinants of the iRhom-ADAM17 complex are poorly understood. We used AI-based modelling to gain insights into the structure-function relationship of this complex. We identified different regions in the iRhom homology domain (IRHD) that are differentially responsible for iRhom functions. We have supported the validity of the predicted structure-function determinants with several in vitro, ex vivo and in vivo approaches and demonstrated the regulatory role of the IRHD for iRhom-ADAM17 complex cohesion and forward trafficking. Overall, we provide mechanistic insights into the iRhom-ADAM17-mediated shedding event, which is at the centre of several important cytokine and growth factor pathways.
Collapse
Affiliation(s)
- Selcan Kahveci-Türköz
- Institute of Molecular Pharmacology, Medical Faculty, RWTH Aachen University, Wendlingweg 2, 52074, Aachen, Germany
| | - Katharina Bläsius
- Institute of Molecular Pharmacology, Medical Faculty, RWTH Aachen University, Wendlingweg 2, 52074, Aachen, Germany
| | - Justyna Wozniak
- Institute of Molecular Pharmacology, Medical Faculty, RWTH Aachen University, Wendlingweg 2, 52074, Aachen, Germany
| | - Cindy Rinkens
- Institute of Molecular Pharmacology, Medical Faculty, RWTH Aachen University, Wendlingweg 2, 52074, Aachen, Germany
| | - Anke Seifert
- Institute of Molecular Pharmacology, Medical Faculty, RWTH Aachen University, Wendlingweg 2, 52074, Aachen, Germany
| | - Petr Kasparek
- Czech Centre for Phenogenomics, Institute of Molecular Genetics of the Czech Academy of Sciences, Prague, Czech Republic
| | - Henrike Ohm
- Institute of Molecular Pharmacology, Medical Faculty, RWTH Aachen University, Wendlingweg 2, 52074, Aachen, Germany
| | - Shixin Oltzen
- Institute of Molecular Pharmacology, Medical Faculty, RWTH Aachen University, Wendlingweg 2, 52074, Aachen, Germany
| | - Martin Nieszporek
- Institute of Molecular Pharmacology, Medical Faculty, RWTH Aachen University, Wendlingweg 2, 52074, Aachen, Germany
| | - Nicole Schwarz
- Institute of Molecular and Cellular Anatomy, Medical Faculty, RWTH Aachen University, Aachen, Germany
| | - Aaron Babendreyer
- Institute of Molecular Pharmacology, Medical Faculty, RWTH Aachen University, Wendlingweg 2, 52074, Aachen, Germany
| | | | - Radislav Sedlacek
- Czech Centre for Phenogenomics, Institute of Molecular Genetics of the Czech Academy of Sciences, Prague, Czech Republic
| | - Andreas Ludwig
- Institute of Molecular Pharmacology, Medical Faculty, RWTH Aachen University, Wendlingweg 2, 52074, Aachen, Germany
| | - Stefan Düsterhöft
- Institute of Molecular Pharmacology, Medical Faculty, RWTH Aachen University, Wendlingweg 2, 52074, Aachen, Germany.
| |
Collapse
|
18
|
Dall'Armellina F, Stagi M, Swan LE. In silico modeling human VPS13 proteins associated with donor and target membranes suggests lipid transfer mechanisms. Proteins 2023; 91:439-455. [PMID: 36404287 PMCID: PMC10953354 DOI: 10.1002/prot.26446] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2022] [Revised: 10/14/2022] [Accepted: 11/03/2022] [Indexed: 11/22/2022]
Abstract
The VPS13 protein family constitutes a novel class of bridge-like lipid transferases. Autosomal recessive inheritance of mutations in VPS13 genes is associated with the development of neurodegenerative diseases in humans. Bioinformatic approaches previously recognized the domain architecture of these proteins. In this study, we model the first ever full-length structures of the four human homologs VPS13A, VPS13B, VPS13C, and VPS13D in association with model membranes, to investigate their lipid transfer ability and potential structural association with membrane leaflets. We analyze the evolutionary conservation and physicochemical properties of these proteins, focusing on conserved C-terminal amphipathic helices that disturb organelle surfaces and that, adjoined, resemble a traditional Venetian gondola. The gondola domains share significant structural homology with lipid droplet surface-binding proteins. We introduce in silico protein-membrane models displaying the mode of association of VPS13A, VPS13B, VPS13C, and VPS13D to donor and target membranes, and present potential models of action for protein-mediated lipid transfer.
Collapse
Affiliation(s)
- Filippo Dall'Armellina
- Department of Biochemistry and Systems BiologyInstitute of Systems, Molecular and Integrative Biology, University of LiverpoolLiverpoolUK
| | - Massimiliano Stagi
- Department of Biochemistry and Systems BiologyInstitute of Systems, Molecular and Integrative Biology, University of LiverpoolLiverpoolUK
| | - Laura E. Swan
- Department of Biochemistry and Systems BiologyInstitute of Systems, Molecular and Integrative Biology, University of LiverpoolLiverpoolUK
| |
Collapse
|
19
|
Jiang Y, Wang R, Feng J, Jin J, Liang S, Li Z, Yu Y, Ma A, Su R, Zou Q, Ma Q, Wei L. Explainable Deep Hypergraph Learning Modeling the Peptide Secondary Structure Prediction. ADVANCED SCIENCE (WEINHEIM, BADEN-WURTTEMBERG, GERMANY) 2023; 10:e2206151. [PMID: 36794291 PMCID: PMC10104664 DOI: 10.1002/advs.202206151] [Citation(s) in RCA: 13] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/24/2022] [Revised: 01/20/2023] [Indexed: 06/18/2023]
Abstract
Accurately predicting peptide secondary structures remains a challenging task due to the lack of discriminative information in short peptides. In this study, PHAT is proposed, a deep hypergraph learning framework for the prediction of peptide secondary structures and the exploration of downstream tasks. The framework includes a novel interpretable deep hypergraph multi-head attention network that uses residue-based reasoning for structure prediction. The algorithm can incorporate sequential semantic information from large-scale biological corpus and structural semantic information from multi-scale structural segmentation, leading to better accuracy and interpretability even with extremely short peptides. The interpretable models are able to highlight the reasoning of structural feature representations and the classification of secondary substructures. The importance of secondary structures in peptide tertiary structure reconstruction and downstream functional analysis is further demonstrated, highlighting the versatility of our models. To facilitate the use of the model, an online server is established which is accessible via http://inner.wei-group.net/PHAT/. The work is expected to assist in the design of functional peptides and contribute to the advancement of structural biology research.
Collapse
Affiliation(s)
- Yi Jiang
- School of SoftwareShandong UniversityJinanShandong250101China
- Joint SDU‐NTU Centre for Artificial Intelligence Research (C‐FAIR)Shandong UniversityJinanShandong250101China
| | - Ruheng Wang
- School of SoftwareShandong UniversityJinanShandong250101China
- Joint SDU‐NTU Centre for Artificial Intelligence Research (C‐FAIR)Shandong UniversityJinanShandong250101China
| | - Jiuxin Feng
- School of SoftwareShandong UniversityJinanShandong250101China
- Joint SDU‐NTU Centre for Artificial Intelligence Research (C‐FAIR)Shandong UniversityJinanShandong250101China
| | - Junru Jin
- School of SoftwareShandong UniversityJinanShandong250101China
- Joint SDU‐NTU Centre for Artificial Intelligence Research (C‐FAIR)Shandong UniversityJinanShandong250101China
| | - Sirui Liang
- School of SoftwareShandong UniversityJinanShandong250101China
- Joint SDU‐NTU Centre for Artificial Intelligence Research (C‐FAIR)Shandong UniversityJinanShandong250101China
| | - Zhongshen Li
- School of SoftwareShandong UniversityJinanShandong250101China
- Joint SDU‐NTU Centre for Artificial Intelligence Research (C‐FAIR)Shandong UniversityJinanShandong250101China
| | - Yingying Yu
- School of SoftwareShandong UniversityJinanShandong250101China
- Joint SDU‐NTU Centre for Artificial Intelligence Research (C‐FAIR)Shandong UniversityJinanShandong250101China
| | - Anjun Ma
- Department of Biomedical InformaticsCollege of MedicineThe Ohio State UniversityColumbusOH43210USA
| | - Ran Su
- College of Intelligence and ComputingTianjin UniversityTianjin300350China
| | - Quan Zou
- Institute of Fundamental and Frontier SciencesUniversity of Electronic Science and Technology of ChinaChengduSichuan610054China
| | - Qin Ma
- Department of Biomedical InformaticsCollege of MedicineThe Ohio State UniversityColumbusOH43210USA
| | - Leyi Wei
- School of SoftwareShandong UniversityJinanShandong250101China
- Joint SDU‐NTU Centre for Artificial Intelligence Research (C‐FAIR)Shandong UniversityJinanShandong250101China
| |
Collapse
|
20
|
Real-to-bin conversion for protein residue distances. Comput Biol Chem 2023; 104:107834. [PMID: 36863243 DOI: 10.1016/j.compbiolchem.2023.107834] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2022] [Revised: 02/07/2023] [Accepted: 02/16/2023] [Indexed: 02/26/2023]
Abstract
Protein Structure Prediction (PSP) has achieved significant progress lately. Prediction of inter-residue distances by machine learning and their exploitation during the conformational search is largely among the critical factors behind the progress. Real values than bin probabilities could more naturally represent inter-residue distances, while the latter, via spline curves more naturally helps obtain differentiable objective functions than the former. Consequently, PSP methods that exploit predicted binned distances perform better than those that exploit predicted real-valued distances. To leverage the advantage of bin probabilities in getting differentiable objective functions, in this work, we propose techniques to convert real-valued distances into distance bin probabilities. Using standard benchmark proteins, we then show that our real-to-bin converted distances help PSP methods obtain three-dimensional structures with 4%-16% better root mean squared deviation (RMSD), template modeling score (TM-Score), and global distance test (GDT) values than existing similar PSP methods. Our proposed PSP method is named real to bin (R2B) inter-residue distance predictor, and its code is available from https://gitlab.com/mahnewton/r2b.
Collapse
|
21
|
Wu F, Jing X, Luo X, Xu J. Improving protein structure prediction using templates and sequence embedding. Bioinformatics 2023; 39:6820926. [PMID: 36355462 PMCID: PMC9805584 DOI: 10.1093/bioinformatics/btac723] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2022] [Revised: 10/17/2022] [Accepted: 11/09/2022] [Indexed: 11/12/2022] Open
Abstract
MOTIVATION Protein structure prediction has been greatly improved by deep learning, but the contribution of different information is yet to be fully understood. This article studies the impacts of two kinds of information for structure prediction: template and multiple sequence alignment (MSA) embedding. Templates have been used by some methods before, such as AlphaFold2, RoseTTAFold and RaptorX. AlphaFold2 and RosetTTAFold only used templates detected by HHsearch, which may not perform very well on some targets. In addition, sequence embedding generated by pre-trained protein language models has not been fully explored for structure prediction. In this article, we study the impact of templates (including the number of templates, the template quality and how the templates are generated) on protein structure prediction accuracy, especially when the templates are detected by methods other than HHsearch. We also study the impact of sequence embedding (generated by MSATransformer and ESM-1b) on structure prediction. RESULTS We have implemented a deep learning method for protein structure prediction that may take templates and MSA embedding as extra inputs. We study the contribution of templates and MSA embedding to structure prediction accuracy. Our experimental results show that templates can improve structure prediction on 71 of 110 CASP13 (13th Critical Assessment of Structure Prediction) targets and 47 of 91 CASP14 targets, and templates are particularly useful for targets with similar templates. MSA embedding can improve structure prediction on 63 of 91 CASP14 (14th Critical Assessment of Structure Prediction) targets and 87 of 183 CAMEO targets and is particularly useful for proteins with shallow MSAs. When both templates and MSA embedding are used, our method can predict correct folds (TMscore > 0.5) for 16 of 23 CASP14 FM targets and 14 of 18 Continuous Automated Model Evaluation (CAMEO) targets, outperforming RoseTTAFold by 5% and 7%, respectively. AVAILABILITY AND IMPLEMENTATION Available at https://github.com/xluo233/RaptorXFold. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
| | | | - Xiao Luo
- Toyota Technological Institute at Chicago, Chicago, IL 60637, USA
| | - Jinbo Xu
- To whom correspondence should be addressed.
| |
Collapse
|
22
|
Mohammadi Y, Nezafat N, Negahdaripour M, Eskandari S, Zamani M. In silico design and evaluation of a novel mRNA vaccine against BK virus: a reverse vaccinology approach. Immunol Res 2022; 71:422-441. [PMID: 36580228 PMCID: PMC9797904 DOI: 10.1007/s12026-022-09351-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2022] [Accepted: 12/02/2022] [Indexed: 12/30/2022]
Abstract
Human polyomavirus type 1, or BK virus (BKV), is a ubiquitous pathogen belonging to the polyomaviridae family mostly known for causing BKV-associated nephropathy (BKVN) and allograft rejection in kidney transplant recipients (KTRs) following the immunosuppression regimens recommended in these patients. Reduction of the immunosuppression level and anti-viral agents are the usual approaches for BKV clearance, which have not met a desired outcome yet. There are also debating matters such as the effect of this pathogen on emerging various comorbidities and the related malignancies in the human population. In this study, a reverse vaccinology approach was implemented to design a mRNA vaccine against BKV by identifying the most antigenic proteins of this pathogen. Potential immunogenic T and B lymphocyte epitopes were predicted through various immunoinformatic tools. The final epitopes were selected according to antigenicity, toxicity, allergenicity, and cytokine inducibility scores. According to the obtained results, the designed vaccine was antigenic, neutral at the physiological pH, non-toxic, and non-allergenic with a world population coverage of 93.77%. Since the mRNA codon optimization ensures the efficient expression of the vaccine in a host cell, evaluation of different parameters showed our designed mRNA vaccine has a stable structure. Moreover, it had strong interactions with toll-like receptor 4 (TLR4) according to the molecular dynamic simulation studies. The in silico immune simulation analyses revealed an overall increase in the immune responses following repeated exposure to the designed vaccine. Based on our findings, the vaccine candidate is ready to be tested as a promising novel mRNA therapeutic vaccine against BKV.
Collapse
Affiliation(s)
- Yasaman Mohammadi
- Department of Pharmaceutical Biotechnology, School of Pharmacy, Shiraz University of Medical Sciences, Shiraz, Iran
- Pharmaceutical Sciences Research Center, Shiraz University of Medical Science, Shiraz, Iran
| | - Navid Nezafat
- Department of Pharmaceutical Biotechnology, School of Pharmacy, Shiraz University of Medical Sciences, Shiraz, Iran.
- Pharmaceutical Sciences Research Center, Shiraz University of Medical Science, Shiraz, Iran.
| | - Manica Negahdaripour
- Department of Pharmaceutical Biotechnology, School of Pharmacy, Shiraz University of Medical Sciences, Shiraz, Iran.
- Pharmaceutical Sciences Research Center, Shiraz University of Medical Science, Shiraz, Iran.
| | - Sedigheh Eskandari
- Pharmaceutical Sciences Research Center, Shiraz University of Medical Science, Shiraz, Iran
| | - Mozhdeh Zamani
- Autophagy Research Center, Shiraz University of Medical Science, Shiraz, Iran
| |
Collapse
|
23
|
Miao H, Zhe Y, Xiang X, Cao Y, Han N, Wu Q, Huang Z. Enhanced Extracellular Expression of a Ca 2+- and Mg 2+-Dependent Hyperthermostable Protease EA1 in Bacillus subtilis via Systematic Screening of Optimal Signal Peptides. JOURNAL OF AGRICULTURAL AND FOOD CHEMISTRY 2022; 70:15830-15839. [PMID: 36480738 DOI: 10.1021/acs.jafc.2c06741] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/17/2023]
Abstract
Proteases have been widely applied in various industries, including tanning, silk, feed, medicine, food, and environmental protection. Herein, the protease EA1 (GenBank accession no. U25630.1) was successfully expressed in Bacillus subtilis and demonstrated to function as a Ca2+- and Mg2+-dependent hyperthermostable neutral protease. At 80 °C, its half-life (t1/2) in the presence of 10 mM Mg2+ and Ca2+ was 50.4-fold longer than that in their absence (7.4 min), which can be explained by structural analysis. Compared with the currently available commercial proteases, protease EA1 has obvious advantages in heat resistance. The largest peptide library was used to enhance the extracellular expression of protease EA1 via constructing and screening 244 signal peptides (SPs). Eleven SPs with high yields of protease EA1 were identified from 5000 clones using a high-throughput assay. Specifically, the enzyme activity of protease produced by the strain (217.6 U/mL) containing the SP XynD was 5.2-fold higher than that of the strain with the initial SP. In brief, the protease is a potential candidate for future use in the high-temperature industry.
Collapse
Affiliation(s)
- Huabiao Miao
- Engineering Research Center of Sustainable Development and Utilization of Biomass Energy, Ministry of Education, Kunming 650500, China
- School of Life Science, Yunnan Normal University, Kunming 650500, China
| | - Yuanyuan Zhe
- School of Life Science, Yunnan Normal University, Kunming 650500, China
| | - Xia Xiang
- School of Life Science, Yunnan Normal University, Kunming 650500, China
| | - Yan Cao
- School of Life Science, Yunnan Normal University, Kunming 650500, China
| | - Nanyu Han
- Engineering Research Center of Sustainable Development and Utilization of Biomass Energy, Ministry of Education, Kunming 650500, China
- School of Life Science, Yunnan Normal University, Kunming 650500, China
| | - Qian Wu
- Engineering Research Center of Sustainable Development and Utilization of Biomass Energy, Ministry of Education, Kunming 650500, China
- School of Life Science, Yunnan Normal University, Kunming 650500, China
| | - Zunxi Huang
- Engineering Research Center of Sustainable Development and Utilization of Biomass Energy, Ministry of Education, Kunming 650500, China
- School of Life Science, Yunnan Normal University, Kunming 650500, China
| |
Collapse
|
24
|
Mufassirin MMM, Newton MAH, Sattar A. Artificial intelligence for template-free protein structure prediction: a comprehensive review. Artif Intell Rev 2022. [DOI: 10.1007/s10462-022-10350-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
|
25
|
Newton MH, Zaman R, Mataeimoghadam F, Rahman J, Sattar A. Constraint Guided Beta-Sheet Refinement for Protein Structure Prediction. Comput Biol Chem 2022; 101:107773. [DOI: 10.1016/j.compbiolchem.2022.107773] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2022] [Revised: 09/15/2022] [Accepted: 09/16/2022] [Indexed: 11/16/2022]
|
26
|
Hou Q, Waury K, Gogishvili D, Feenstra KA. Ten quick tips for sequence-based prediction of protein properties using machine learning. PLoS Comput Biol 2022; 18:e1010669. [PMID: 36454728 PMCID: PMC9714715 DOI: 10.1371/journal.pcbi.1010669] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/03/2022] Open
Abstract
The ubiquitous availability of genome sequencing data explains the popularity of machine learning-based methods for the prediction of protein properties from their amino acid sequences. Over the years, while revising our own work, reading submitted manuscripts as well as published papers, we have noticed several recurring issues, which make some reported findings hard to understand and replicate. We suspect this may be due to biologists being unfamiliar with machine learning methodology, or conversely, machine learning experts may miss some of the knowledge needed to correctly apply their methods to proteins. Here, we aim to bridge this gap for developers of such methods. The most striking issues are linked to a lack of clarity: how were annotations of interest obtained; which benchmark metrics were used; how are positives and negatives defined. Others relate to a lack of rigor: If you sneak in structural information, your method is not sequence-based; if you compare your own model to "state-of-the-art," take the best methods; if you want to conclude that some method is better than another, obtain a significance estimate to support this claim. These, and other issues, we will cover in detail. These points may have seemed obvious to the authors during writing; however, they are not always clear-cut to the readers. We also expect many of these tips to hold for other machine learning-based applications in biology. Therefore, many computational biologists who develop methods in this particular subject will benefit from a concise overview of what to avoid and what to do instead.
Collapse
Affiliation(s)
- Qingzhen Hou
- Department of Biostatistics, School of Public Health, Cheeloo College of Medicine, Shandong University, Shandong, P. R. China
- National Institute of Health Data Science of China, Shandong University, Shandong, P. R. China
| | - Katharina Waury
- Department of Computer Science, Bioinformatics Group, Vrije Universiteit Amsterdam, Amsterdam, the Netherlands
| | - Dea Gogishvili
- Department of Computer Science, Bioinformatics Group, Vrije Universiteit Amsterdam, Amsterdam, the Netherlands
| | - K. Anton Feenstra
- Department of Computer Science, Bioinformatics Group, Vrije Universiteit Amsterdam, Amsterdam, the Netherlands
| |
Collapse
|
27
|
Wang W, Peng Z, Yang J. Single-sequence protein structure prediction using supervised transformer protein language models. NATURE COMPUTATIONAL SCIENCE 2022; 2:804-814. [PMID: 38177395 DOI: 10.1038/s43588-022-00373-3] [Citation(s) in RCA: 26] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/03/2022] [Accepted: 11/06/2022] [Indexed: 01/06/2024]
Abstract
Significant progress has been made in protein structure prediction in recent years. However, it remains challenging for AlphaFold2 and other deep learning-based methods to predict protein structure with single-sequence input. Here we introduce trRosettaX-Single, an automated algorithm for single-sequence protein structure prediction. It incorporates the sequence embedding from a supervised transformer protein language model into a multi-scale network enhanced by knowledge distillation to predict inter-residue two-dimensional geometry, which is then used to reconstruct three-dimensional structures via energy minimization. Benchmark tests show that trRosettaX-Single outperforms AlphaFold2 and RoseTTAFold on orphan proteins and works well on human-designed proteins (with an average template modeling score (TM-score) of 0.79). An experimental test shows that the full trRosettaX-Single pipeline is two times faster than AlphaFold2, using much fewer computing resources (<10%). On 2,000 designed proteins from network hallucination, trRosettaX-Single generates structure models with high confidence. As a demonstration, trRosettaX-Single is applied to missense mutation analysis. These data suggest that trRosettaX-Single may find potential applications in protein design and related studies.
Collapse
Affiliation(s)
- Wenkai Wang
- School of Mathematical Sciences, Nankai University, Tianjin, China
| | - Zhenling Peng
- Ministry of Education Frontiers Science Center for Nonlinear Expectations, Research Center for Mathematics and Interdisciplinary Sciences, Shandong University, Qingdao, China
| | - Jianyi Yang
- Ministry of Education Frontiers Science Center for Nonlinear Expectations, Research Center for Mathematics and Interdisciplinary Sciences, Shandong University, Qingdao, China.
| |
Collapse
|
28
|
Protein structure prediction in the deep learning era. Curr Opin Struct Biol 2022; 77:102495. [PMID: 36371845 DOI: 10.1016/j.sbi.2022.102495] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2022] [Revised: 10/03/2022] [Accepted: 10/04/2022] [Indexed: 11/11/2022]
Abstract
Significant advances have been achieved in protein structure prediction, especially with the recent development of the AlphaFold2 and the RoseTTAFold systems. This article reviews the progress in deep learning-based protein structure prediction methods in the past two years. First, we divide the representative methods into two categories: the two-step approach and the end-to-end approach. Then, we show that the two-step approach is possible to achieve similar accuracy to the state-of-the-art end-to-end approach AlphaFold2. Compared to the end-to-end approach, the two-step approach requires fewer computing resources. We conclude that it is valuable to keep developing both approaches. Finally, a few outstanding challenges in function-orientated protein structure prediction are pointed out for future development.
Collapse
|
29
|
Ishwarlall TZ, Adeleke VT, Maharaj L, Okpeku M, Adeniyi AA, Adeleke MA. Identification of potential candidate vaccines against Mycobacterium ulcerans based on the major facilitator superfamily transporter protein. Front Immunol 2022; 13:1023558. [PMID: 36426350 PMCID: PMC9679648 DOI: 10.3389/fimmu.2022.1023558] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2022] [Accepted: 10/19/2022] [Indexed: 11/09/2023] Open
Abstract
Buruli ulcer is a neglected tropical disease that is characterized by non-fatal lesion development. The causative agent is Mycobacterium ulcerans (M. ulcerans). There are no known vectors or transmission methods, preventing the development of control methods. There are effective diagnostic techniques and treatment routines; however, several socioeconomic factors may limit patients' abilities to receive these treatments. The Bacillus Calmette-Guérin vaccine developed against tuberculosis has shown limited efficacy, and no conventionally designed vaccines have passed clinical trials. This study aimed to generate a multi-epitope vaccine against M. ulcerans from the major facilitator superfamily transporter protein using an immunoinformatics approach. Twelve M. ulcerans genome assemblies were analyzed, resulting in the identification of 11 CD8+ and 7 CD4+ T-cell epitopes and 2 B-cell epitopes. These conserved epitopes were computationally predicted to be antigenic, immunogenic, non-allergenic, and non-toxic. The CD4+ T-cell epitopes were capable of inducing interferon-gamma and interleukin-4. They successfully bound to their respective human leukocyte antigens alleles in in silico docking studies. The expected global population coverage of the T-cell epitopes and their restricted human leukocyte antigens alleles was 99.90%. The population coverage of endemic regions ranged from 99.99% (Papua New Guinea) to 21.81% (Liberia). Two vaccine constructs were generated using the Toll-like receptors 2 and 4 agonists, LprG and RpfE, respectively. Both constructs were antigenic, non-allergenic, non-toxic, thermostable, basic, and hydrophilic. The DNA sequences of the vaccine constructs underwent optimization and were successfully in-silico cloned with the pET-28a(+) plasmid. The vaccine constructs were successfully docked to their respective toll-like receptors. Molecular dynamics simulations were carried out to analyze the binding interactions within the complex. The generated binding energies indicate the stability of both complexes. The constructs generated in this study display severable favorable properties, with construct one displaying a greater range of favorable properties. However, further analysis and laboratory validation are required.
Collapse
Affiliation(s)
- Tamara Z. Ishwarlall
- Discipline of Genetics, School of Life Sciences, University of KwaZulu-Natal, Durban, South Africa
| | - Victoria T. Adeleke
- Department of Chemical Engineering, Mangosuthu University of Technology, Durban, South Africa
| | - Leah Maharaj
- Discipline of Genetics, School of Life Sciences, University of KwaZulu-Natal, Durban, South Africa
| | - Moses Okpeku
- Discipline of Genetics, School of Life Sciences, University of KwaZulu-Natal, Durban, South Africa
| | - Adebayo A. Adeniyi
- Department of Chemistry, Faculty of Natural and Agricultural Sciences, University of the Free State, Bloemfontein, South Africa
- Department of Industrial Chemistry, Federal University Oye Ekiti, Oye-Ekiti, Ekiti State, Nigeria
| | - Matthew A. Adeleke
- Discipline of Genetics, School of Life Sciences, University of KwaZulu-Natal, Durban, South Africa
| |
Collapse
|
30
|
Chen L, Zhang X, Guo X, Peng W, Zhu Y, Wang Z, Yu X, Shi H, Li Y, Zhang L, Wang L, Wang P, Cheng G. Neighboring mutation-mediated enhancement of dengue virus infectivity and spread. EMBO Rep 2022; 23:e55671. [PMID: 36197120 PMCID: PMC9638853 DOI: 10.15252/embr.202255671] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2022] [Revised: 09/06/2022] [Accepted: 09/07/2022] [Indexed: 10/07/2023] Open
Abstract
Frequent turnover of dengue virus (DENV) clades is one of the major forces driving DENV persistence and prevalence. In this study, we assess the fitness advantage of nine stable substitutions within the envelope (E) protein of DENV serotypes. Two tandem neighboring substitutions, threonine to lysine at the 226th (T226K) and glycine to glutamic acid at the 228th (G228E) residues in the DENV2 Asian I genotype, enhance virus infectivity in either mosquitoes or mammalian hosts, thereby promoting clades turnover and dengue epidemics. Mechanistic studies indicate that the substitution-mediated polarity changes in these two residues increase the binding affinity of E for host C-type lectins. Accordingly, we predict that a G228E substitution could potentially result in a forthcoming epidemic of the DENV2 Cosmopolitan genotype. Investigations into the substitutions associated with DENV fitness in hosts may offer mechanistic insights into dengue prevalence, thus providing a warning of potential epidemics in the future.
Collapse
Affiliation(s)
- Lu Chen
- Tsinghua‐Peking Joint Center for Life Sciences, School of MedicineTsinghua UniversityBeijingChina
| | - Xianwen Zhang
- Institute of Infectious DiseasesShenzhen Bay LaboratoryShenzhenChina
| | - Xuan Guo
- Tsinghua‐Peking Joint Center for Life Sciences, School of MedicineTsinghua UniversityBeijingChina
| | - Wenyu Peng
- Tsinghua‐Peking Joint Center for Life Sciences, School of MedicineTsinghua UniversityBeijingChina
| | - Yibin Zhu
- Tsinghua‐Peking Joint Center for Life Sciences, School of MedicineTsinghua UniversityBeijingChina
| | - Zhaoyang Wang
- Tsinghua‐Peking Joint Center for Life Sciences, School of MedicineTsinghua UniversityBeijingChina
| | - Xi Yu
- Tsinghua‐Peking Joint Center for Life Sciences, School of MedicineTsinghua UniversityBeijingChina
| | - Huicheng Shi
- Tsinghua‐Peking Joint Center for Life Sciences, School of MedicineTsinghua UniversityBeijingChina
| | - Yuhan Li
- Tsinghua‐Peking Joint Center for Life Sciences, School of MedicineTsinghua UniversityBeijingChina
| | - Liming Zhang
- Tsinghua‐Peking Joint Center for Life Sciences, School of MedicineTsinghua UniversityBeijingChina
| | - Lei Wang
- Institute of Infectious DiseasesShenzhen Bay LaboratoryShenzhenChina
| | - Penghua Wang
- Department of Immunology, School of Medicinethe University of Connecticut Health CenterFarmingtonCTUSA
| | - Gong Cheng
- Tsinghua‐Peking Joint Center for Life Sciences, School of MedicineTsinghua UniversityBeijingChina
- Institute of Infectious DiseasesShenzhen Bay LaboratoryShenzhenChina
| |
Collapse
|
31
|
Li D, Wen Y, Ou Z, Yu Y, Zhao C, Lin F. Inhibitor of Glucosinolate Sulfatases as a Potential Friendly Insecticide to Control Plutella xylostella. JOURNAL OF AGRICULTURAL AND FOOD CHEMISTRY 2022; 70:13528-13537. [PMID: 36251030 DOI: 10.1021/acs.jafc.2c04542] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/16/2023]
Abstract
The glucosinolate-myrosinase system is a two-component defense system characteristic of cruciferous plants. To evade the glucosinolate-myrosinase system, the crucifer specialist insect, Plutella xylostella, promptly desulfates the glucosinolates into harmless compounds by glucosinolate sulfatases (GSSs) in the gut. In this study, we identified an effective inhibitor of GSSs by virtual screening, molecular docking analysis, and in vitro enzyme inhibition assay. The combined effect of the GSS inhibitor with the plant glucosinolate-myrosinase system was assessed by the bioassay of P. xylostella. We show that irosustat is a GSS inhibitor and the inhibition of GSSs impairs the ability of P. xylostella to detoxify the glucosinolate-myrosinase system, leading to the systematic accumulation of toxic isothiocyanates in larvae, thereby severely affecting feeding, growth, survival, and reproduction of P. xylostella. While fed on the Arabidopsis mutants deficient in myrosinase or glucosinolates, irosustat had no significant negative effect on P. xylostella. These findings reveal that the GSS inhibitor is a novel friendly insecticide to control P. xylostella utilizing the plant glucosinolate-myrosinase system and promote the development of insecticide-plant chemical defense combination strategies.
Collapse
Affiliation(s)
- Dehong Li
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-bioresources and Key Laboratory of Natural Pesticide and Chemical Biology, Ministry of Education, South China Agricultural University, Guangzhou, Guangdong 510642, China
| | - Yingjie Wen
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-bioresources and Key Laboratory of Natural Pesticide and Chemical Biology, Ministry of Education, South China Agricultural University, Guangzhou, Guangdong 510642, China
| | - Ziyue Ou
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-bioresources and Key Laboratory of Natural Pesticide and Chemical Biology, Ministry of Education, South China Agricultural University, Guangzhou, Guangdong 510642, China
| | - Ye Yu
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-bioresources and Key Laboratory of Natural Pesticide and Chemical Biology, Ministry of Education, South China Agricultural University, Guangzhou, Guangdong 510642, China
| | - Chen Zhao
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-bioresources and Key Laboratory of Natural Pesticide and Chemical Biology, Ministry of Education, South China Agricultural University, Guangzhou, Guangdong 510642, China
| | - Fei Lin
- State Key Laboratory for Conservation and Utilization of Subtropical Agro-bioresources and Key Laboratory of Natural Pesticide and Chemical Biology, Ministry of Education, South China Agricultural University, Guangzhou, Guangdong 510642, China
| |
Collapse
|
32
|
Peng CX, Zhou XG, Xia YH, Liu J, Hou MH, Zhang GJ. Structural analogue-based protein structure domain assembly assisted by deep learning. Bioinformatics 2022; 38:4513-4521. [PMID: 35962986 DOI: 10.1093/bioinformatics/btac553] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2022] [Revised: 07/27/2022] [Accepted: 08/08/2022] [Indexed: 12/24/2022] Open
Abstract
MOTIVATION With the breakthrough of AlphaFold2, the protein structure prediction problem has made remarkable progress through deep learning end-to-end techniques, in which correct folds could be built for nearly all single-domain proteins. However, the full-chain modelling appears to be lower on average accuracy than that for the constituent domains and requires higher demand on computing hardware, indicating the performance of full-chain modelling still needs to be improved. In this study, we investigate whether the predicted accuracy of the full-chain model can be further improved by domain assembly assisted by deep learning. RESULTS In this article, we developed a structural analogue-based protein structure domain assembly method assisted by deep learning, named SADA. In SADA, a multi-domain protein structure database was constructed for the full-chain analogue detection using individual domain models. Starting from the initial model constructed from the analogue, the domain assembly simulation was performed to generate the full-chain model through a two-stage differential evolution algorithm guided by the energy function with an inter-residue distance potential predicted by deep learning. SADA was compared with the state-of-the-art domain assembly methods on 356 benchmark proteins, and the average TM-score of SADA models is 8.1% and 27.0% higher than that of DEMO and AIDA, respectively. We also assembled 293 human multi-domain proteins, where the average TM-score of the full-chain model after the assembly by SADA is 1.1% higher than that of the model by AlphaFold2. To conclude, we find that the domains often interact in the similar way in the quaternary orientations if the domains have similar tertiary structures. Furthermore, homologous templates and structural analogues are complementary for multi-domain protein full-chain modelling. AVAILABILITY AND IMPLEMENTATION http://zhanglab-bioinf.com/SADA. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Chun-Xiang Peng
- College of Information Engineering, Zhejiang University of Technology, Hangzhou 310023, China
| | - Xiao-Gen Zhou
- College of Information Engineering, Zhejiang University of Technology, Hangzhou 310023, China
| | - Yu-Hao Xia
- College of Information Engineering, Zhejiang University of Technology, Hangzhou 310023, China
| | - Jun Liu
- College of Information Engineering, Zhejiang University of Technology, Hangzhou 310023, China
| | - Ming-Hua Hou
- College of Information Engineering, Zhejiang University of Technology, Hangzhou 310023, China
| | - Gui-Jun Zhang
- College of Information Engineering, Zhejiang University of Technology, Hangzhou 310023, China
| |
Collapse
|
33
|
Capel H, Weiler R, Dijkstra M, Vleugels R, Bloem P, Feenstra KA. ProteinGLUE multi-task benchmark suite for self-supervised protein modeling. Sci Rep 2022; 12:16047. [PMID: 36163232 PMCID: PMC9512797 DOI: 10.1038/s41598-022-19608-4] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2021] [Accepted: 08/31/2022] [Indexed: 11/09/2022] Open
Abstract
Self-supervised language modeling is a rapidly developing approach for the analysis of protein sequence data. However, work in this area is heterogeneous and diverse, making comparison of models and methods difficult. Moreover, models are often evaluated only on one or two downstream tasks, making it unclear whether the models capture generally useful properties. We introduce the ProteinGLUE benchmark for the evaluation of protein representations: a set of seven per-amino-acid tasks for evaluating learned protein representations. We also offer reference code, and we provide two baseline models with hyperparameters specifically trained for these benchmarks. Pre-training was done on two tasks, masked symbol prediction and next sentence prediction. We show that pre-training yields higher performance on a variety of downstream tasks such as secondary structure and protein interaction interface prediction, compared to no pre-training. However, the larger base model does not outperform the smaller medium model. We expect the ProteinGLUE benchmark dataset introduced here, together with the two baseline pre-trained models and their performance evaluations, to be of great value to the field of protein sequence-based property prediction. Availability: code and datasets from https://github.com/ibivu/protein-glue .
Collapse
Affiliation(s)
- Henriette Capel
- Informatics Institute, Vrije Universiteit, 1081 HV, Amsterdam, The Netherlands
| | - Robin Weiler
- Informatics Institute, Vrije Universiteit, 1081 HV, Amsterdam, The Netherlands
| | - Maurits Dijkstra
- Informatics Institute, Vrije Universiteit, 1081 HV, Amsterdam, The Netherlands
| | - Reinier Vleugels
- Informatics Institute, Vrije Universiteit, 1081 HV, Amsterdam, The Netherlands
| | - Peter Bloem
- Informatics Institute, Vrije Universiteit, 1081 HV, Amsterdam, The Netherlands
| | - K Anton Feenstra
- Informatics Institute, Vrije Universiteit, 1081 HV, Amsterdam, The Netherlands.
| |
Collapse
|
34
|
Mehta S, Goel M, Priyakumar UD. MO-MEMES: A method for accelerating virtual screening using multi-objective Bayesian optimization. Front Med (Lausanne) 2022; 9:916481. [PMID: 36213671 PMCID: PMC9537730 DOI: 10.3389/fmed.2022.916481] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2022] [Accepted: 08/29/2022] [Indexed: 11/13/2022] Open
Abstract
The pursuit of potential inhibitors for novel targets has become a very important problem especially over the last 2 years with the world in the midst of the COVID-19 pandemic. This entails performing high throughput screening exercises on drug libraries to identify potential “hits”. These hits are identified using analysis of their physical properties like binding affinity to the target receptor, octanol-water partition coefficient (LogP) and more. However, drug libraries can be extremely large and it is infeasible to calculate and analyze the physical properties for each of those molecules within acceptable time and moreover, each molecule must possess a multitude of properties apart from just the binding affinity. To address this problem, in this study, we propose an extension to the Machine learning framework for Enhanced MolEcular Screening (MEMES) framework for multi-objective Bayesian optimization. This approach is capable of identifying over 90% of the most desirable molecules with respect to all required properties while explicitly calculating the values of each of those properties on only 6% of the entire drug library. This framework would provide an immense boost in identifying potential hits that possess all properties required for a drug molecules.
Collapse
|
35
|
Liu Z, Yu DJ. cpxDeepMSA: A Deep Cascade Algorithm for Constructing Multiple Sequence Alignments of Protein–Protein Interactions. Int J Mol Sci 2022; 23:ijms23158459. [PMID: 35955594 PMCID: PMC9369210 DOI: 10.3390/ijms23158459] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2022] [Revised: 07/18/2022] [Accepted: 07/28/2022] [Indexed: 12/10/2022] Open
Abstract
Protein–protein interactions (PPIs) are fundamental to many biological processes. The coevolution-based prediction of interacting residues has made great strides in protein complexes that are known to interact. A multiple sequence alignment (MSA) is the basis of coevolution analysis. MSAs have recently made significant progress in the protein monomer sequence analysis. However, no standard or efficient pipelines are available for the sensitive protein complex MSA (cpxMSA) collection. How to generate cpxMSA is one of the most challenging problems of sequence coevolution analysis. Although several methods have been developed to address this problem, no standalone program exists. Furthermore, the number of built-in properties is limited; hence, it is often difficult for users to analyze sequence coevolution according to their desired cpxMSA. In this article, we developed a novel cpxMSA approach (cpxDeepMSA. We used different protein monomer databases and incorporated the three strategies (genomic distance, phylogeny information, and STRING interaction network) used to join the monomer MSA results of protein complexes, which can prevent using a single method fail to the joint two-monomer MSA causing the cpxMSA construction failure. We anticipate that the cpxDeepMSA algorithm will become a useful high-throughput tool in protein complex structure predictions, inter-protein residue-residue contacts, and the biological sequence coevolution analysis.
Collapse
|
36
|
Rahman J, Newton MAH, Hasan MAM, Sattar A. A stacked meta-ensemble for protein inter-residue distance prediction. Comput Biol Med 2022; 148:105824. [PMID: 35863250 DOI: 10.1016/j.compbiomed.2022.105824] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2022] [Revised: 06/21/2022] [Accepted: 07/03/2022] [Indexed: 11/25/2022]
Abstract
Predicted inter-residue distances are a key behind recent success in high quality protein structure prediction (PSP). However, prediction of both short and long distance values together is challenging. Consequently, predicted short distances are mostly used by existing PSP methods. In this paper, we use a stacked meta-ensemble method to combine deep learning models trained for different ranges of real-valued distances. On five benchmark sets of proteins, our proposed inter-residue distance prediction method improves mean Local Distance Different Test (LDDT) scores at least by 5% over existing such methods. Moreover, using a real-valued distance based conformational search algorithm, we also show that predicted long distances help obtain significantly better protein conformations than when only predicted short distances are used. Our method is named meta-ensemble for distance prediction (MDP) and its program is available from https://gitlab.com/mahnewton/mdp.
Collapse
Affiliation(s)
- Julia Rahman
- School of Information and Communication Technology, Griffith University, Queensland, Australia.
| | - M A Hakim Newton
- Institute of Integrated and Intelligent Systems, Griffith University, Queensland, Australia; School of Information and Physical Sciences, The University of Newcastle, New South Wales, Australia.
| | | | - Abdul Sattar
- School of Information and Communication Technology, Griffith University, Queensland, Australia; Institute of Integrated and Intelligent Systems, Griffith University, Queensland, Australia
| |
Collapse
|
37
|
Huo W, Busch LM, Hernandez-Bird J, Hamami E, Marshall CW, Geisinger E, Cooper VS, van Opijnen T, Rosch JW, Isberg RR. Immunosuppression broadens evolutionary pathways to drug resistance and treatment failure during Acinetobacter baumannii pneumonia in mice. Nat Microbiol 2022; 7:796-809. [PMID: 35618774 PMCID: PMC9159950 DOI: 10.1038/s41564-022-01126-8] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2021] [Accepted: 04/20/2022] [Indexed: 01/02/2023]
Abstract
Acinetobacter baumannii is increasingly refractory to antibiotic treatment in healthcare settings. As is true of most human pathogens, the genetic path to antimicrobial resistance (AMR) and the role that the immune system plays in modulating AMR during disease are poorly understood. Here we reproduced several routes to fluoroquinolone resistance, performing evolution experiments using sequential lung infections in mice that are replete with or depleted of neutrophils, providing two key insights into the evolution of drug resistance. First, neutropenic hosts acted as reservoirs for the accumulation of drug resistance during drug treatment. Selection for variants with altered drug sensitivity profiles arose readily in the absence of neutrophils, while immunocompetent animals restricted the appearance of these variants. Secondly, antibiotic treatment failure in the immunocompromised host was shown to occur without clinically defined resistance, an unexpected result that provides a model for how antibiotic failure occurs clinically in the absence of AMR. The genetic mechanism underlying both these results is initiated by mutations activating the drug egress pump regulator AdeL, which drives persistence in the presence of antibiotic. Therefore, antibiotic persistence mutations present a two-pronged risk during disease, causing drug treatment failure in the immunocompromised host while simultaneously increasing the emergence of high-level AMR.
Collapse
Affiliation(s)
- Wenwen Huo
- Department of Molecular Biology and Microbiology, Tufts University School of Medicine, Boston, MA, USA
| | - Lindsay M Busch
- Department of Molecular Biology and Microbiology, Tufts University School of Medicine, Boston, MA, USA
- Division of Infectious Diseases, Emory University School of Medicine, Atlanta, GA, USA
| | - Juan Hernandez-Bird
- Department of Molecular Biology and Microbiology, Tufts University School of Medicine, Boston, MA, USA
| | - Efrat Hamami
- Department of Molecular Biology and Microbiology, Tufts University School of Medicine, Boston, MA, USA
| | - Christopher W Marshall
- Department of Microbiology and Molecular Genetics and Center for Evolutionary Biology and Medicine, University of Pittsburgh School of Medicine, Pittsburgh, PA, USA
- Department of Biological Sciences, Marquette University, Milwaukee, WI, USA
| | | | - Vaughn S Cooper
- Department of Microbiology and Molecular Genetics and Center for Evolutionary Biology and Medicine, University of Pittsburgh School of Medicine, Pittsburgh, PA, USA
| | | | - Jason W Rosch
- Department of Infectious Diseases, St Jude Children's Research Hospital, Memphis, TN, USA
| | - Ralph R Isberg
- Department of Molecular Biology and Microbiology, Tufts University School of Medicine, Boston, MA, USA.
| |
Collapse
|
38
|
Zhang H, Huang Y, Bei Z, Ju Z, Meng J, Hao M, Zhang J, Zhang H, Xi W. Inter-Residue Distance Prediction From Duet Deep Learning Models. Front Genet 2022; 13:887491. [PMID: 35651930 PMCID: PMC9148999 DOI: 10.3389/fgene.2022.887491] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2022] [Accepted: 03/30/2022] [Indexed: 12/04/2022] Open
Abstract
Residue distance prediction from the sequence is critical for many biological applications such as protein structure reconstruction, protein–protein interaction prediction, and protein design. However, prediction of fine-grained distances between residues with long sequence separations still remains challenging. In this study, we propose DuetDis, a method based on duet feature sets and deep residual network with squeeze-and-excitation (SE), for protein inter-residue distance prediction. DuetDis embraces the ability to learn and fuse features directly or indirectly extracted from the whole-genome/metagenomic databases and, therefore, minimize the information loss through ensembling models trained on different feature sets. We evaluate DuetDis and 11 widely used peer methods on a large-scale test set (610 proteins chains). The experimental results suggest that 1) prediction results from different feature sets show obvious differences; 2) ensembling different feature sets can improve the prediction performance; 3) high-quality multiple sequence alignment (MSA) used for both training and testing can greatly improve the prediction performance; and 4) DuetDis is more accurate than peer methods for the overall prediction, more reliable in terms of model prediction score, and more robust against shallow multiple sequence alignment (MSA).
Collapse
Affiliation(s)
- Huiling Zhang
- Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Ying Huang
- Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Zhendong Bei
- Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Zhen Ju
- Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Jintao Meng
- Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Min Hao
- College of Electronic and Information Engineering, Southwest University, Chongqing, China
| | - Jingjing Zhang
- Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Haiping Zhang
- University of Chinese Academy of Sciences, Beijing, China
| | - Wenhui Xi
- Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, China
- University of Chinese Academy of Sciences, Beijing, China
- *Correspondence: Wenhui Xi,
| |
Collapse
|
39
|
Newton MAH, Rahman J, Zaman R, Sattar A. Enhancing Protein Contact Map Prediction Accuracy via Ensembles of Inter-Residue Distance Predictors. Comput Biol Chem 2022; 99:107700. [DOI: 10.1016/j.compbiolchem.2022.107700] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2022] [Revised: 05/19/2022] [Accepted: 05/19/2022] [Indexed: 11/03/2022]
|
40
|
Yu J, Li X, Huang J, Yu M, Wu Z, Cao S. Molecular dynamics simulation of α‐gliadin in ethanol/aqueous organic solvents. Int J Food Sci Technol 2022. [DOI: 10.1111/ijfs.15754] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Affiliation(s)
- Jie‐Ting Yu
- School of Food Science and Engineering Foshan University Foshan528000China
- Guangdong Key Laboratory of Food Intelligent Manufacturing Foshan University Foshan528000China
| | - Xin‐Yao Li
- School of Food Science and Engineering Foshan University Foshan528000China
- Guangdong Key Laboratory of Food Intelligent Manufacturing Foshan University Foshan528000China
| | - Jia‐Hui Huang
- School of Food College South China Agricultural University Guangzhou510642China
| | - Ming‐Yi Yu
- School of Food Science and Engineering Foshan University Foshan528000China
| | - Zi‐Yi Wu
- School of Food College South China Agricultural University Guangzhou510642China
| | - Shi‐Lin Cao
- School of Food Science and Engineering Foshan University Foshan528000China
- Guangdong Key Laboratory of Food Intelligent Manufacturing Foshan University Foshan528000China
| |
Collapse
|
41
|
Nisar M, Paracha RZ, Gul A, Arshad I, Ejaz S, Murad D, Khan S, Mustansar Z. Interaction Analysis of Adenovirus L5 Protein With Pancreatic Cancer Cell Surface Receptor to Analyze Its Affinity for Oncolytic Virus Therapy. Front Oncol 2022; 12:832277. [PMID: 35359382 PMCID: PMC8960272 DOI: 10.3389/fonc.2022.832277] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2021] [Accepted: 02/07/2022] [Indexed: 11/13/2022] Open
Abstract
This study seeks to investigate the interaction profile of the L5 protein of oncolytic adenovirus with the overexpressed surface receptors of pancreatic cancer. This is an important area of research because pancreatic cancer is one of the most fatal malignancies with a very low patient survival rate. Multiple therapies to date to improve the survival rate are reported; however, they show a comparatively low success rate. Among them, oncolytic virus therapy is a type of immunotherapy that is currently under deliberation by researchers for multiple cancer types in various clinical trials. Talimogene laherparepvec (T-VEC) is the first oncolytic virus approved by the US Food and Drug Administration (FDA) for melanoma. The oncolytic virus not only kills cancer cells but also activates the anticancer immune response. Therefore, it is preferred over others to deal with aggressive pancreatic cancer. The efficacy of therapy primarily depends on how effectively the oncolytic virus enters and infects the cancer cell. Cell surface receptors and their interactions with virus coat proteins are a crucial step for oncolytic virus entry and a pivotal determinant. The L5 proteins of the virus coat are the first to interact with host cell surface receptors. Therefore, the objective of this study is to analyze the interaction profile of the L5 protein of oncolytic adenovirus with overexpressed surface receptors of pancreatic cancer. The L5 proteins of three adenovirus serotypes HAdV2, HAdV5, and HAdV3 were utilized in this study. Overexpressed pancreatic cancer receptors include SLC2A1, MET, IL1RAP, NPR3, GABRP, SLC6A6, and TMPRSS4. The protein structures of viral and cancer cell protein were docked using the High Ambiguity Driven protein–protein DOCKing (HADDOCK) server. The binding affinity and interaction profile of viral proteins against all the receptors were analyzed. Results suggest that the HAdV3 L5 protein shows better interaction as compared to HAdV2 and HAdV5 by elucidating high binding affinity with 4 receptors (NPR3, GABRP, SLC6A6, and TMPRSS4). The current study proposed that HAdV5 or HAdV2 virus pseudotyped with the L5 protein of HAdV3 can be able to effectively infect pancreatic cancer cells. Moreover, the current study surmises that the affinity maturation of HAdV3 L5 can enhance virus attachment with all the receptors of cancer cells.
Collapse
Affiliation(s)
- Maryum Nisar
- Research Center for Modelling and Simulation (RCMS), National University of Sciences and Technology (NUST), Islamabad, Pakistan
| | - Rehan Zafar Paracha
- Research Center for Modelling and Simulation (RCMS), National University of Sciences and Technology (NUST), Islamabad, Pakistan
- *Correspondence: Rehan Zafar Paracha,
| | - Alvina Gul
- Atta-ur-Rahman School of Applied Biosciences (ASAB), National University of Sciences and Technology (NUST), Islamabad, Pakistan
| | - Iqra Arshad
- Research Center for Modelling and Simulation (RCMS), National University of Sciences and Technology (NUST), Islamabad, Pakistan
| | - Saima Ejaz
- Research Center for Modelling and Simulation (RCMS), National University of Sciences and Technology (NUST), Islamabad, Pakistan
| | - Didar Murad
- Research Center for Modelling and Simulation (RCMS), National University of Sciences and Technology (NUST), Islamabad, Pakistan
| | - Shahzeb Khan
- Department of Pharmacy, University of Malakand, Chakdara, Pakistan
- Discipline of Pharmaceutical Sciences, School of Health Sciences, University of KwaZulu-Natal, Durban, South Africa
| | - Zartasha Mustansar
- Research Center for Modelling and Simulation (RCMS), National University of Sciences and Technology (NUST), Islamabad, Pakistan
| |
Collapse
|
42
|
Stringer B, de Ferrante H, Abeln S, Heringa J, Feenstra KA, Haydarlou R. PIPENN: protein interface prediction from sequence with an ensemble of neural nets. Bioinformatics 2022; 38:2111-2118. [PMID: 35150231 PMCID: PMC9004643 DOI: 10.1093/bioinformatics/btac071] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2021] [Revised: 01/16/2022] [Accepted: 02/04/2022] [Indexed: 02/03/2023] Open
Abstract
MOTIVATION The interactions between proteins and other molecules are essential to many biological and cellular processes. Experimental identification of interface residues is a time-consuming, costly and challenging task, while protein sequence data are ubiquitous. Consequently, many computational and machine learning approaches have been developed over the years to predict such interface residues from sequence. However, the effectiveness of different Deep Learning (DL) architectures and learning strategies for protein-protein, protein-nucleotide and protein-small molecule interface prediction has not yet been investigated in great detail. Therefore, we here explore the prediction of protein interface residues using six DL architectures and various learning strategies with sequence-derived input features. RESULTS We constructed a large dataset dubbed BioDL, comprising protein-protein interactions from the PDB, and DNA/RNA and small molecule interactions from the BioLip database. We also constructed six DL architectures, and evaluated them on the BioDL benchmarks. This shows that no single architecture performs best on all instances. An ensemble architecture, which combines all six architectures, does consistently achieve peak prediction accuracy. We confirmed these results on the published benchmark set by Zhang and Kurgan (ZK448), and on our own existing curated homo- and heteromeric protein interaction dataset. Our PIPENN sequence-based ensemble predictor outperforms current state-of-the-art sequence-based protein interface predictors on ZK448 on all interaction types, achieving an AUC-ROC of 0.718 for protein-protein, 0.823 for protein-nucleotide and 0.842 for protein-small molecule. AVAILABILITY AND IMPLEMENTATION Source code and datasets are available at https://github.com/ibivu/pipenn/. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
| | - Hans de Ferrante
- Department of Computer Science, IBIVU—Center for Integrative Bioinformatics, Vrije Universiteit, 1081HV Amsterdam, The Netherlands
| | - Sanne Abeln
- Department of Computer Science, IBIVU—Center for Integrative Bioinformatics, Vrije Universiteit, 1081HV Amsterdam, The Netherlands
| | - Jaap Heringa
- Department of Computer Science, IBIVU—Center for Integrative Bioinformatics, Vrije Universiteit, 1081HV Amsterdam, The Netherlands
| | - K Anton Feenstra
- Department of Computer Science, IBIVU—Center for Integrative Bioinformatics, Vrije Universiteit, 1081HV Amsterdam, The Netherlands
| | | |
Collapse
|
43
|
Perni S. The Builders of the Junction: Roles of Junctophilin1 and Junctophilin2 in the Assembly of the Sarcoplasmic Reticulum–Plasma Membrane Junctions in Striated Muscle. Biomolecules 2022; 12:biom12010109. [PMID: 35053257 PMCID: PMC8774113 DOI: 10.3390/biom12010109] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2021] [Revised: 01/06/2022] [Accepted: 01/07/2022] [Indexed: 02/06/2023] Open
Abstract
Contraction of striated muscle is triggered by a massive release of calcium from the sarcoplasmic reticulum (SR) into the cytoplasm. This intracellular calcium release is initiated by membrane depolarization, which is sensed by voltage-gated calcium channels CaV1.1 (in skeletal muscle) and CaV1.2 (in cardiac muscle) in the plasma membrane (PM), which in turn activate the calcium-releasing channel ryanodine receptor (RyR) embedded in the SR membrane. This cross-communication between channels in the PM and in the SR happens at specialized regions, the SR-PM junctions, where these two compartments come in close proximity. Junctophilin1 and Junctophilin2 are responsible for the formation and stabilization of SR-PM junctions in striated muscle and actively participate in the recruitment of the two essential players in intracellular calcium release, CaV and RyR. This short review focuses on the roles of junctophilins1 and 2 in the formation and organization of SR-PM junctions in skeletal and cardiac muscle and on the functional consequences of the absence or malfunction of these proteins in striated muscle in light of recently published data and recent advancements in protein structure prediction.
Collapse
Affiliation(s)
- Stefano Perni
- Department of Physiology and Biophysics, Anschutz Medical Campus, University of Colorado, Aurora, CO 80045, USA
| |
Collapse
|
44
|
Fanelli A, Sullivan ML. Tools for protein structure prediction and for molecular docking applied to enzyme active site analysis: A case study using a BAHD hydroxycinnamoyltransferase. Methods Enzymol 2022; 683:41-79. [PMID: 37087195 DOI: 10.1016/bs.mie.2022.10.004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/03/2022]
Abstract
Elucidating the structure of an enzyme and how substrates bind to the active site is an important step for understanding its reaction mechanism and function. Nevertheless, the methods available to obtain three-dimensional structures of proteins, such as x-ray crystallography and NMR, can be expensive and time-consuming. Considering this, an alternative is using structural bioinformatic tools to predict the tertiary structure of a protein from its primary sequence, followed by molecular docking of one or more substrates into the enzyme structure model. In the past few years, significant advances have been made in these computational tools, which can give useful information about the active site and enzyme-substrate interactions before the structure can be resolved using physical methods. Here, using common bean (Phaseolus vulgaris) hydroxycinnamoyl-coenzyme A:tetrahydroxyhexanedioic acid hydroxycinnamoyltransferase (HHHT) as an example, we describe methods and workflows for protein structure prediction and molecular docking that can be performed on a personal computer using only open-source tools.
Collapse
Affiliation(s)
- Amanda Fanelli
- US Dairy Forage Research Center, USDA Agricultural Research Service, Madison, WI, United States.
| | - Michael L Sullivan
- US Dairy Forage Research Center, USDA Agricultural Research Service, Madison, WI, United States
| |
Collapse
|