1
|
Pratyush P, Kc DB. Advances in Prediction of Posttranslational Modification Sites Known to Localize in Protein Supersecondary Structures. Methods Mol Biol 2025; 2870:117-151. [PMID: 39543034 DOI: 10.1007/978-1-0716-4213-9_8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2024]
Abstract
Posttranslational modifications (PTMs) play a crucial role in modulating the structure, function, localization, and interactions of proteins, with many PTMs being localized within supersecondary structures, such as helical pairs. These modifications can significantly influence the conformation and stability of these structures. For instance, phosphorylation introduces negative charges that alter electrostatic interactions, while acetylation or methylation of lysine residues affects the stability and interactions of alpha helices or beta strands. Given the pivotal role of supersecondary structures in the overall protein architecture, their modulation by PTMs is essential for protein functionality. This chapter explores the latest advancements in predicting sites for the five PTMs (phosphorylation, acetylation, glycosylation, methylation, and ubiquitination) known to be localized within supersecondary structures. The chapter highlights the recent advances in the prediction of these PTM sites, including the use of global contextualized embeddings from protein language models, integration of structural information, utilization of reliable positive and negative sites, and application of contrastive learning. These methodologies and emerging trends offer a roadmap for novel innovations in addressing PTM prediction challenges, particularly those linked to supersecondary structures.
Collapse
Affiliation(s)
- Pawel Pratyush
- Computer Science Department, Michigan Technological University, Houghton, MI, USA
- Computer Science Department, Rochester Institute of Technology, Henrietta, NY, USA
| | - Dukka B Kc
- Computer Science Department, Michigan Technological University, Houghton, MI, USA.
- Computer Science Department, Rochester Institute of Technology, Henrietta, NY, USA.
| |
Collapse
|
2
|
Gutierrez CS, Kassim AA, Gutierrez BD, Raines RT. Sitetack: a deep learning model that improves PTM prediction by using known PTMs. Bioinformatics 2024; 40:btae602. [PMID: 39388212 PMCID: PMC11552626 DOI: 10.1093/bioinformatics/btae602] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2024] [Revised: 10/05/2024] [Accepted: 10/08/2024] [Indexed: 10/15/2024] Open
Abstract
MOTIVATION Post-translational modifications (PTMs) increase the diversity of the proteome and are vital to organismal life and therapeutic strategies. Deep learning has been used to predict PTM locations. Still, limitations in datasets and their analyses compromise success. RESULTS We evaluated the use of known PTM sites in prediction via sequence-based deep learning algorithms. For each PTM, known locations of that PTM were encoded as a separate amino acid before sequences were encoded via word embedding and passed into a convolutional neural network that predicts the probability of that PTM at a given site. Without labeling known PTMs, our models are on par with others. With labeling, however, we improved significantly upon extant models. Moreover, knowing PTM locations can increase the predictability of a different PTM. Our findings highlight the importance of PTMs for the installation of additional PTMs. We anticipate that including known PTM locations will enhance the performance of other proteomic machine learning algorithms. AVAILABILITY AND IMPLEMENTATION Sitetack is available as a web tool at https://sitetack.net; the source code, representative datasets, instructions for local use, and select models are available at https://github.com/clair-gutierrez/sitetack.
Collapse
Affiliation(s)
- Clair S Gutierrez
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, MA 02139, United States
- Broad Institute of MIT and Harvard, Cambridge, MA 02143, United States
| | - Alia A Kassim
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, MA 02139, United States
| | | | - Ronald T Raines
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, MA 02139, United States
- Broad Institute of MIT and Harvard, Cambridge, MA 02143, United States
- Koch Institute for Integrated Cancer Research at MIT, Cambridge, MA 02139, United States
| |
Collapse
|
3
|
Desai M, Sun B. Positions of cysteine residues reveal local clusters and hidden relationships to Sequons and Transmembrane domains in Human proteins. Sci Rep 2024; 14:25886. [PMID: 39468182 PMCID: PMC11519667 DOI: 10.1038/s41598-024-77056-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2024] [Accepted: 10/18/2024] [Indexed: 10/30/2024] Open
Abstract
Membrane proteins often possess critical structural features, such as transmembrane domains (TMs), N-glycosylation, and disulfide bonds (SS bonds), which are essential to their structure and function. Here, we extend the study of the motifs carrying N-glycosylation, i.e. the sequons, and the Cys residues supporting the SS bonds, to the whole human proteome with a particular focus on the Cys positions in human proteins with respect to those of sequons and TMs. As the least abundant amino acid residue in protein sequences, the positions of Cys residues in proteins are not random but rather selected through evolution. We discovered that the frequency of Cys residues in proteins is length dependent, and the frequency of CC gaps formed between adjacent Cys residues can be used as a classifier to distinguish proteins with special structures and functions, such as keratin-associated proteins (KAPs), extracellular proteins with EGF-like domains, and nuclear proteins with zinc finger C2H2 domains. Most importantly, by comparing the positions of Cys residues to those of sequons and TMs, we discovered that these structural features can form dense clusters in highly repeated and mutually exclusive modalities in protein sequences. The evolutionary advantages of such complementarity among the three structural features are discussed, particularly in light of structural dynamics in proteins that are lacking from computational predictions. The discoveries made here highlight the sequence-structure-function axis in biological organisms that can be utilized in future protein engineering toward synthetic biology.
Collapse
Affiliation(s)
- Manthan Desai
- Department of Molecular Biology and Biochemistry, Simon Fraser University, Burnaby, BC, Canada
- Department of Computing Science, Simon Fraser University, Burnaby, BC, Canada
| | - Bingyun Sun
- Department of Chemistry, Simon Fraser University, Burnaby, BC, Canada.
- Simon Fraser University, Burnaby, BC, V5A 1S6, Canada.
| |
Collapse
|
4
|
Yom A, Chiang A, Lewis NE. Boltzmann Model Predicts Glycan Structures from Lectin Binding. Anal Chem 2024; 96:8332-8341. [PMID: 38720429 PMCID: PMC11162346 DOI: 10.1021/acs.analchem.3c04992] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/20/2024]
Abstract
Glycans are complex oligosaccharides that are involved in many diseases and biological processes. Unfortunately, current methods for determining glycan composition and structure (glycan sequencing) are laborious and require a high level of expertise. Here, we assess the feasibility of sequencing glycans based on their lectin binding fingerprints. By training a Boltzmann model on lectin binding data, we predict the approximate structures of 88 ± 7% of N-glycans and 87 ± 13% of O-glycans in our test set. We show that our model generalizes well to the pharmaceutically relevant case of Chinese hamster ovary (CHO) cell glycans. We also analyze the motif specificity of a wide array of lectins and identify the most and least predictive lectins and glycan features. These results could help streamline glycoprotein research and be of use to anyone using lectins for glycobiology.
Collapse
Affiliation(s)
- Aria Yom
- Department of Physics, University of California, San Diego, California 92093, United States
| | - Austin Chiang
- Department of Pediatrics, University of California, San Diego, California 92093, United States
- Immunology Center of Georgia, Augusta University, Augusta, Georgia 30912, United States
- Department of Medicine, Augusta University, Augusta, Georgia 30912, United States
| | - Nathan E Lewis
- Department of Pediatrics, University of California, San Diego, California 92093, United States
- Department of Bioengineering, University of California, San Diego, California 92093, United States
| |
Collapse
|
5
|
Kellman BP, Mariethoz J, Zhang Y, Shaul S, Alteri M, Sandoval D, Jeffris M, Armingol E, Bao B, Lisacek F, Bojar D, Lewis NE. Decoding glycosylation potential from protein structure across human glycoproteins with a multi-view recurrent neural network. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.05.15.594334. [PMID: 38798633 PMCID: PMC11118808 DOI: 10.1101/2024.05.15.594334] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/29/2024]
Abstract
Glycosylation is described as a non-templated biosynthesis. Yet, the template-free premise is antithetical to the observation that different N-glycans are consistently placed at specific sites. It has been proposed that glycosite-proximal protein structures could constrain glycosylation and explain the observed microheterogeneity. Using site-specific glycosylation data, we trained a hybrid neural network to parse glycosites (recurrent neural network) and match them to feasible N-glycosylation events (graph neural network). From glycosite-flanking sequences, the algorithm predicts most human N-glycosylation events documented in the GlyConnect database and proposed structures corresponding to observed monosaccharide composition of the glycans at these sites. The algorithm also recapitulated glycosylation in Enhanced Aromatic Sequons, SARS-CoV-2 spike, and IgG3 variants, thus demonstrating the ability of the algorithm to predict both glycan structure and abundance. Thus, protein structure constrains glycosylation, and the neural network enables predictive in silico glycosylation of uncharacterized or novel protein sequences and genetic variants.
Collapse
Affiliation(s)
- Benjamin P. Kellman
- Department of Pediatrics, University of California, San Diego, La Jolla, CA 92093, USA
- Department of Bioengineering, University of California, San Diego, La Jolla, CA 92093, USA
- Bioinformatics and Systems Biology Graduate Program, University of California, San Diego, La Jolla, CA 92093, USA
- Augment Biologics, La Jolla, CA 92092
- Ragon Institute of MGH, MIT, and Harvard, Cambridge, MA, USA
| | - Julien Mariethoz
- Proteome Informatics Group, Swiss Institute of Bioinformatics, CH-1227 Geneva, Switzerland
| | - Yujie Zhang
- Department of Pediatrics, University of California, San Diego, La Jolla, CA 92093, USA
| | - Sigal Shaul
- Department of Pediatrics, University of California, San Diego, La Jolla, CA 92093, USA
- Department of Bioengineering, University of California, San Diego, La Jolla, CA 92093, USA
| | - Mia Alteri
- Department of Pediatrics, University of California, San Diego, La Jolla, CA 92093, USA
- Department of Bioengineering, University of California, San Diego, La Jolla, CA 92093, USA
| | - Daniel Sandoval
- Department of Cellular and Molecular Medicine, University of California, San Diego, La Jolla, CA 92093, USA
| | - Mia Jeffris
- Department of Pediatrics, University of California, San Diego, La Jolla, CA 92093, USA
- Department of Bioengineering, University of California, San Diego, La Jolla, CA 92093, USA
| | - Erick Armingol
- Department of Pediatrics, University of California, San Diego, La Jolla, CA 92093, USA
- Department of Bioengineering, University of California, San Diego, La Jolla, CA 92093, USA
- Bioinformatics and Systems Biology Graduate Program, University of California, San Diego, La Jolla, CA 92093, USA
| | - Bokan Bao
- Department of Pediatrics, University of California, San Diego, La Jolla, CA 92093, USA
- Bioinformatics and Systems Biology Graduate Program, University of California, San Diego, La Jolla, CA 92093, USA
| | - Frederique Lisacek
- Proteome Informatics Group, Swiss Institute of Bioinformatics, CH-1227 Geneva, Switzerland
- Computer Science Department & Section of Biology, University of Geneva, route de Drize 7, CH-1227, Geneva, Switzerland
| | - Daniel Bojar
- Wallenberg Centre for Molecular and Translational Medicine, University of Gothenburg, Gothenburg 41390, Sweden
- Department of Chemistry and Molecular Biology, University of Gothenburg, Gothenburg 41390, Sweden
| | - Nathan E. Lewis
- Department of Pediatrics, University of California, San Diego, La Jolla, CA 92093, USA
- Department of Bioengineering, University of California, San Diego, La Jolla, CA 92093, USA
- Bioinformatics and Systems Biology Graduate Program, University of California, San Diego, La Jolla, CA 92093, USA
- Ragon Institute of MGH, MIT, and Harvard, Cambridge, MA, USA
| |
Collapse
|
6
|
Prabhu H, Bhosale H, Sane A, Dhadwal R, Ramakrishnan V, Valadi J. Protein feature engineering framework for AMPylation site prediction. Sci Rep 2024; 14:8695. [PMID: 38622194 PMCID: PMC11369087 DOI: 10.1038/s41598-024-58450-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2023] [Accepted: 03/29/2024] [Indexed: 04/17/2024] Open
Abstract
AMPylation is a biologically significant yet understudied post-translational modification where an adenosine monophosphate (AMP) group is added to Tyrosine and Threonine residues primarily. While recent work has illuminated the prevalence and functional impacts of AMPylation, experimental identification of AMPylation sites remains challenging. Computational prediction techniques provide a faster alternative approach. The predictive performance of machine learning models is highly dependent on the features used to represent the raw amino acid sequences. In this work, we introduce a novel feature extraction pipeline to encode the key properties relevant to AMPylation site prediction. We utilize a recently published dataset of curated AMPylation sites to develop our feature generation framework. We demonstrate the utility of our extracted features by training various machine learning classifiers, on various numerical representations of the raw sequences extracted with the help of our framework. Tenfold cross-validation is used to evaluate the model's capability to distinguish between AMPylated and non-AMPylated sites. The top-performing set of features extracted achieved MCC score of 0.58, Accuracy of 0.8, AUC-ROC of 0.85 and F1 score of 0.73. Further, we elucidate the behaviour of the model on the set of features consisting of monogram and bigram counts for various representations using SHapley Additive exPlanations.
Collapse
Affiliation(s)
- Hardik Prabhu
- Computing and Data Sciences, FLAME University, Pune, 412115, India
- Robert Bosch Centre for Cyber Physical Systems, Indian Institute of Science, Bengaluru, 560012, India
| | | | - Aamod Sane
- Computing and Data Sciences, FLAME University, Pune, 412115, India
| | - Renu Dhadwal
- Computing and Data Sciences, FLAME University, Pune, 412115, India
| | - Vigneshwar Ramakrishnan
- Bioinformatics Center, School of Chemical and Biotechnology, SASTRA Deemed to be University, Thanjavur, 613401, India
| | - Jayaraman Valadi
- Computing and Data Sciences, FLAME University, Pune, 412115, India.
| |
Collapse
|
7
|
Desai M, Chowdhury SR, Sun B. A quest for cytosolic sequons and their functions. Sci Rep 2024; 14:7736. [PMID: 38565583 PMCID: PMC10987669 DOI: 10.1038/s41598-024-57334-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2023] [Accepted: 03/18/2024] [Indexed: 04/04/2024] Open
Abstract
Evolution shapes protein sequences for their functions. Here, we studied the moonlighting functions of the N-linked sequon NXS/T, where X is not P, in human nucleocytosolic proteins. By comparing membrane and secreted proteins in which sequons are well known for N-glycosylation, we discovered that cyto-sequons can participate in nucleic acid binding, particularly in zinc finger proteins. Our global studies further discovered that sequon occurrence is largely proportional to protein length. The contribution of sequons to protein functions, including both N-glycosylation and nucleic acid binding, can be regulated through their density as well as the biased usage between NXS and NXT. In proteins where other PTMs or structural features are rich, such as phosphorylation, transmembrane ɑ-helices, and disulfide bridges, sequon occurrence is scarce. The information acquired here should help understand the relationship between protein sequence and function and assist future protein design and engineering.
Collapse
Affiliation(s)
- Manthan Desai
- Department of Molecular Biology and Biochemistry, Simon Fraser University, Burnaby, BC, Canada
- Department of Computing Science, Simon Fraser University, Burnaby, BC, Canada
| | | | - Bingyun Sun
- Department of Molecular Biology and Biochemistry, Simon Fraser University, Burnaby, BC, Canada.
- Department of Chemistry, Simon Fraser University, Burnaby, BC, V5A 1S6, Canada.
| |
Collapse
|
8
|
Ertelt M, Mulligan VK, Maguire JB, Lyskov S, Moretti R, Schiffner T, Meiler J, Schoeder CT. Combining machine learning with structure-based protein design to predict and engineer post-translational modifications of proteins. PLoS Comput Biol 2024; 20:e1011939. [PMID: 38484014 PMCID: PMC10965067 DOI: 10.1371/journal.pcbi.1011939] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2023] [Revised: 03/26/2024] [Accepted: 02/20/2024] [Indexed: 03/27/2024] Open
Abstract
Post-translational modifications (PTMs) of proteins play a vital role in their function and stability. These modifications influence protein folding, signaling, protein-protein interactions, enzyme activity, binding affinity, aggregation, degradation, and much more. To date, over 400 types of PTMs have been described, representing chemical diversity well beyond the genetically encoded amino acids. Such modifications pose a challenge to the successful design of proteins, but also represent a major opportunity to diversify the protein engineering toolbox. To this end, we first trained artificial neural networks (ANNs) to predict eighteen of the most abundant PTMs, including protein glycosylation, phosphorylation, methylation, and deamidation. In a second step, these models were implemented inside the computational protein modeling suite Rosetta, which allows flexible combination with existing protocols to model the modified sites and understand their impact on protein stability as well as function. Lastly, we developed a new design protocol that either maximizes or minimizes the predicted probability of a particular site being modified. We find that this combination of ANN prediction and structure-based design can enable the modification of existing, as well as the introduction of novel, PTMs. The potential applications of our work include, but are not limited to, glycan masking of epitopes, strengthening protein-protein interactions through phosphorylation, as well as protecting proteins from deamidation liabilities. These applications are especially important for the design of new protein therapeutics where PTMs can drastically change the therapeutic properties of a protein. Our work adds novel tools to Rosetta's protein engineering toolbox that allow for the rational design of PTMs.
Collapse
Affiliation(s)
- Moritz Ertelt
- Institute for Drug Discovery, Leipzig University Medical Faculty, Leipzig, Germany
- Center for Scalable Data Analytics and Artificial Intelligence ScaDS.AI, Dresden/Leipzig, Germany
| | - Vikram Khipple Mulligan
- Center for Computational Biology, Flatiron Institute, New York, New York, United States of America
| | - Jack B. Maguire
- Program in Bioinformatics and Computational Biology, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States of America
| | - Sergey Lyskov
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, Maryland, United States of America
| | - Rocco Moretti
- Department of Chemistry, Vanderbilt University, Nashville, Tennessee, United States of America
- Center for Structural Biology, Vanderbilt University, Nashville, Tennessee, United States of America
| | - Torben Schiffner
- Institute for Drug Discovery, Leipzig University Medical Faculty, Leipzig, Germany
| | - Jens Meiler
- Institute for Drug Discovery, Leipzig University Medical Faculty, Leipzig, Germany
- Center for Scalable Data Analytics and Artificial Intelligence ScaDS.AI, Dresden/Leipzig, Germany
- Department of Chemistry, Vanderbilt University, Nashville, Tennessee, United States of America
- Center for Structural Biology, Vanderbilt University, Nashville, Tennessee, United States of America
| | - Clara T. Schoeder
- Institute for Drug Discovery, Leipzig University Medical Faculty, Leipzig, Germany
- Center for Scalable Data Analytics and Artificial Intelligence ScaDS.AI, Dresden/Leipzig, Germany
| |
Collapse
|
9
|
Makkonen K, Jännäri M, Crisóstomo L, Kuusi M, Patyra K, Melnyk V, Linnossuo V, Ojala J, Ravi R, Löf C, Mäkelä JA, Miettinen P, Laakso S, Ojaniemi M, Jääskeläinen J, Laakso M, Bossowski F, Sawicka B, Stożek K, Bossowski A, Kleinau G, Scheerer P, FinnGen F, Reeve MP, Kero J. Mechanisms of thyrotropin receptor-mediated phenotype variability deciphered by gene mutations and M453T-knockin model. JCI Insight 2024; 9:e167092. [PMID: 38194289 PMCID: PMC11143923 DOI: 10.1172/jci.insight.167092] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2022] [Accepted: 01/05/2024] [Indexed: 01/10/2024] Open
Abstract
The clinical spectrum of thyrotropin receptor-mediated (TSHR-mediated) diseases varies from loss-of-function mutations causing congenital hypothyroidism to constitutively active mutations (CAMs) leading to nonautoimmune hyperthyroidism (NAH). Variation at the TSHR locus has also been associated with altered lipid and bone metabolism and autoimmune thyroid diseases. However, the extrathyroidal roles of TSHR and the mechanisms underlying phenotypic variability among TSHR-mediated diseases remain unclear. Here we identified and characterized TSHR variants and factors involved in phenotypic variability in different patient cohorts, the FinnGen database, and a mouse model. TSHR CAMs were found in all 16 patients with NAH, with 1 CAM in an unexpected location in the extracellular leucine-rich repeat domain (p.S237N) and another in the transmembrane domain (p.I640V) in 2 families with distinct hyperthyroid phenotypes. In addition, screening of the FinnGen database revealed rare functional variants as well as distinct common noncoding TSHR SNPs significantly associated with thyroid phenotypes, but there was no other significant association between TSHR variants and more than 2,000 nonthyroid disease endpoints. Finally, our TSHR M453T-knockin model revealed that the phenotype was dependent on the mutation's signaling properties and was ameliorated by increased iodine intake. In summary, our data show that TSHR-mediated disease risk can be modified by variants at the TSHR locus both inside and outside the coding region as well as by altered TSHR-signaling and dietary iodine, supporting the need for personalized treatment strategies.
Collapse
Affiliation(s)
- Kristiina Makkonen
- Department of Clinical Sciences, Faculty of Medicine, and
- Integrative Physiology and Pharmacology, Institute of Biomedicine, University of Turku, Turku, Finland
| | - Meeri Jännäri
- Department of Clinical Sciences, Faculty of Medicine, and
- Integrative Physiology and Pharmacology, Institute of Biomedicine, University of Turku, Turku, Finland
| | - Luís Crisóstomo
- Integrative Physiology and Pharmacology, Institute of Biomedicine, University of Turku, Turku, Finland
| | - Matilda Kuusi
- Integrative Physiology and Pharmacology, Institute of Biomedicine, University of Turku, Turku, Finland
| | - Konrad Patyra
- Department of Clinical Sciences, Faculty of Medicine, and
- Integrative Physiology and Pharmacology, Institute of Biomedicine, University of Turku, Turku, Finland
| | | | - Veli Linnossuo
- Department of Clinical Sciences, Faculty of Medicine, and
| | - Johanna Ojala
- Department of Clinical Sciences, Faculty of Medicine, and
| | - Rowmika Ravi
- Department of Clinical Sciences, Faculty of Medicine, and
| | - Christoffer Löf
- Integrative Physiology and Pharmacology, Institute of Biomedicine, University of Turku, Turku, Finland
| | - Juho-Antti Mäkelä
- Integrative Physiology and Pharmacology, Institute of Biomedicine, University of Turku, Turku, Finland
| | - Päivi Miettinen
- New Children’s Hospital, Helsinki University Hospital, Helsinki, Finland
| | - Saila Laakso
- New Children’s Hospital, Helsinki University Hospital, Helsinki, Finland
| | - Marja Ojaniemi
- Department of Pediatrics and Adolescence, PEDEGO Research Unit and Medical Research Center, University and University Hospital of Oulu, Oulu, Finland
| | | | - Markku Laakso
- Institute of Clinical Medicine, Internal Medicine, University of Eastern Finland, Kuopio, Finland
| | - Filip Bossowski
- Department of Pediatrics, Endocrinology, Diabetes with a Cardiology Unit, Medical University in Białystok, Bialystok, Poland
| | - Beata Sawicka
- Department of Pediatrics, Endocrinology, Diabetes with a Cardiology Unit, Medical University in Białystok, Bialystok, Poland
| | - Karolina Stożek
- Department of Pediatrics, Endocrinology, Diabetes with a Cardiology Unit, Medical University in Białystok, Bialystok, Poland
| | - Artur Bossowski
- Department of Pediatrics, Endocrinology, Diabetes with a Cardiology Unit, Medical University in Białystok, Bialystok, Poland
| | - Gunnar Kleinau
- Charité - Universitätsmedizin Berlin, Corporate Member of Freie Universität Berlin, and
- Humboldt - Universität zu Berlin, Institute of Medical Physics, Biophysics, Group Structural Biology of Cellular Signaling, Berlin, Germany
| | - Patrick Scheerer
- Charité - Universitätsmedizin Berlin, Corporate Member of Freie Universität Berlin, and
- Humboldt - Universität zu Berlin, Institute of Medical Physics, Biophysics, Group Structural Biology of Cellular Signaling, Berlin, Germany
| | - FinnGen FinnGen
- Institute for Molecular Medicine Finland, HiLIFE, University of Helsinki, Helsinki, Finland
- FinnGen is detailed in Supplemental Acknowledgments
| | - Mary Pat Reeve
- Institute for Molecular Medicine Finland, HiLIFE, University of Helsinki, Helsinki, Finland
| | - Jukka Kero
- Department of Clinical Sciences, Faculty of Medicine, and
- Department of Pediatrics and Adolescent Medicine, Turku University Hospital, Turku, Finland
| |
Collapse
|
10
|
Li Y, Liu H, Liang T, Han W, Bo Z, Qiu T, Li J, Xu M, Wang W, Yang S, Gui C. Importance of N-Glycosylation for the Expression and Function of Human Organic Anion Transporting Polypeptide 2B1. ACS Pharmacol Transl Sci 2023; 6:1347-1356. [PMID: 37854627 PMCID: PMC10580385 DOI: 10.1021/acsptsci.3c00076] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2023] [Indexed: 10/20/2023]
Abstract
Human organic anion transporting polypeptide 2B1 (OATP2B1) is a membrane transporter widely expressed in organs crucial for drug absorption and disposition such as the intestine, liver, and kidney. Evidence indicates that OATP2B1 is a glycoprotein. However, the sites of glycosylation and their contribution to the function and expression of OATP2B1 are largely unknown. In this study, by site-directed mutagenesis, we determined that two of four potential N-glycosylation sites in OATP2B1, N176 and N538, are indeed glycosylated. Functional studies revealed that the transport activities of mutants N176Q and N538Q were greatly reduced as compared to that of wild-type OATP2B1. However, the reduced activity was not due to the impairment of transport function per se but due to the decreased surface expression as the Km and normalized Vmax values of N176Q and N538Q were comparable to those of OATP2B1. Quantitative polymerase chain reaction (PCR) revealed that N176Q and N538Q mutations did not affect the expression of OATP2B1 at a transcriptional level. Immunofluorescence analysis showed that deglycosylated OATP2B1 was largely retained in the endoplasmic reticulum, which may activate the endoplasmic reticulum-associated degradation pathway, and the ubiquitin-proteasome system played a major role in the degradation of OATP2B1. Taken together, OATP2B1 is N-glycosylated, and N-glycosylation is essential for the surface expression of OATP2B1 but not critical for the transport function of OATP2B1 per se.
Collapse
Affiliation(s)
| | | | | | - Wanjun Han
- College of Pharmaceutical
Sciences, Soochow University, 199 Renai Road, Suzhou Industrial
Park, Suzhou, Jiangsu 215123, People’s
Republic of China
| | - Zheyue Bo
- College of Pharmaceutical
Sciences, Soochow University, 199 Renai Road, Suzhou Industrial
Park, Suzhou, Jiangsu 215123, People’s
Republic of China
| | - Tian Qiu
- College of Pharmaceutical
Sciences, Soochow University, 199 Renai Road, Suzhou Industrial
Park, Suzhou, Jiangsu 215123, People’s
Republic of China
| | - Jiawei Li
- College of Pharmaceutical
Sciences, Soochow University, 199 Renai Road, Suzhou Industrial
Park, Suzhou, Jiangsu 215123, People’s
Republic of China
| | - Mingming Xu
- College of Pharmaceutical
Sciences, Soochow University, 199 Renai Road, Suzhou Industrial
Park, Suzhou, Jiangsu 215123, People’s
Republic of China
| | - Weipeng Wang
- College of Pharmaceutical
Sciences, Soochow University, 199 Renai Road, Suzhou Industrial
Park, Suzhou, Jiangsu 215123, People’s
Republic of China
| | - Shuang Yang
- College of Pharmaceutical
Sciences, Soochow University, 199 Renai Road, Suzhou Industrial
Park, Suzhou, Jiangsu 215123, People’s
Republic of China
| | - Chunshan Gui
- College of Pharmaceutical
Sciences, Soochow University, 199 Renai Road, Suzhou Industrial
Park, Suzhou, Jiangsu 215123, People’s
Republic of China
| |
Collapse
|
11
|
García-Arnáez I, Romero-Gavilán F, Cerqueira A, Azkargorta M, Elortza F, Suay J, Goñi I, Gurruchaga M. Proteomics as a tool to study the osteoimmunomodulatory role of metallic ions in a sol-gel coating. J Mater Chem B 2023; 11:8194-8205. [PMID: 37552201 DOI: 10.1039/d3tb01204b] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/09/2023]
Abstract
The success of bone implants depends on the osteoimmunomodulatory (OIM) activity of the biomaterials in the interactions with the periimplantary tissues. Many in vitro tests have been conducted to evaluate the osteoimmunology effects of biomaterials. However, results of these tests have often been inconclusive. This study examines the properties of newly developed sol-gel coatings doped with two metal ions associated with bone regeneration, Ca and Zn. The study uses both proteomic methods and traditional in vitro assays. The results demonstrate that proteomics is an effective tool to scrutinize the OIM properties of the materials. Moreover, sol-gel coatings offer excellent base materials to evaluate the effects of metal ions on these properties. The obtained data highlight the highly tunable nature of sol-gel materials; studying the materials with different doping levels supplies valuable information on the interactions between the immune and bone-forming processes.
Collapse
Affiliation(s)
- Iñaki García-Arnáez
- Departament of Polymers and Advanced Materials: Physics, Chemistry and Technology, Universidad del País Vasco, Po Manuel de Lardizábal, 3, 20018 San Sebastián, Spain.
| | - Francisco Romero-Gavilán
- Department of Industrial Systems Engineering and Design, Universitat Jaume I, Av. Vicent Sos Baynat s/n, 12071 Castellón de la Plana, Spain
| | - Andreia Cerqueira
- Department of Industrial Systems Engineering and Design, Universitat Jaume I, Av. Vicent Sos Baynat s/n, 12071 Castellón de la Plana, Spain
| | - Mikel Azkargorta
- Proteomics Platform, CIC bioGUNE, Basque Research and Technology Alliance (BRTA), CIBERehd, ProteoRed-ISCIII, Bizkaia Science and Technology Park, 48160 Derio, Spain
| | - Félix Elortza
- Proteomics Platform, CIC bioGUNE, Basque Research and Technology Alliance (BRTA), CIBERehd, ProteoRed-ISCIII, Bizkaia Science and Technology Park, 48160 Derio, Spain
| | - Julio Suay
- Department of Industrial Systems Engineering and Design, Universitat Jaume I, Av. Vicent Sos Baynat s/n, 12071 Castellón de la Plana, Spain
| | - Isabel Goñi
- Departament of Polymers and Advanced Materials: Physics, Chemistry and Technology, Universidad del País Vasco, Po Manuel de Lardizábal, 3, 20018 San Sebastián, Spain.
| | - Mariló Gurruchaga
- Departament of Polymers and Advanced Materials: Physics, Chemistry and Technology, Universidad del País Vasco, Po Manuel de Lardizábal, 3, 20018 San Sebastián, Spain.
| |
Collapse
|
12
|
Li S, Dohlman HG. Evolutionary conservation of sequence motifs at sites of protein modification. J Biol Chem 2023; 299:104617. [PMID: 36933807 PMCID: PMC10139944 DOI: 10.1016/j.jbc.2023.104617] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2023] [Revised: 02/20/2023] [Accepted: 03/11/2023] [Indexed: 03/18/2023] Open
Abstract
Gene duplications are common in biology and are likely to be an important source of functional diversification and specialization. The yeast Saccharomyces cerevisiae underwent a whole-genome duplication event early in evolution, and a substantial number of duplicated genes have been retained. We identified more than 3500 instances where only one of two paralogous proteins undergoes posttranslational modification despite having retained the same amino acid residue in both. We also developed a web-based search algorithm (CoSMoS.c.) that scores conservation of amino acid sequences based on 1011 wild and domesticated yeast isolates and used it to compare differentially modified pairs of paralogous proteins. We found that the most common modifications-phosphorylation, ubiquitylation, and acylation but not N-glycosylation-occur in regions of high sequence conservation. Such conservation is evident even for ubiquitylation and succinylation, where there is no established 'consensus site' for modification. Differences in phosphorylation were not associated with predicted secondary structure or solvent accessibility but did mirror known differences in kinase-substrate interactions. Thus, differences in posttranslational modification likely result from differences in adjoining amino acids and their interactions with modifying enzymes. By integrating data from large-scale proteomics and genomics analysis, in a system with such substantial genetic diversity, we obtained a more comprehensive understanding of the functional basis for genetic redundancies that have persisted for 100 million years.
Collapse
Affiliation(s)
- Shuang Li
- Department of Pharmacology, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
| | - Henrik G Dohlman
- Department of Pharmacology, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA.
| |
Collapse
|
13
|
Mahameed M, Wang P, Xue S, Fussenegger M. Engineering receptors in the secretory pathway for orthogonal signalling control. Nat Commun 2022; 13:7350. [PMID: 36446786 PMCID: PMC9708828 DOI: 10.1038/s41467-022-35161-0] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2022] [Accepted: 11/21/2022] [Indexed: 12/02/2022] Open
Abstract
Synthetic receptors targeted to the secretory pathway often fail to exhibit the expected activity due to post-translational modifications (PTMs) and/or improper folding. Here, we engineered synthetic receptors that reside in the cytoplasm, inside the endoplasmic reticulum (ER), or on the plasma membrane through orientation adjustment of the receptor parts and by elimination of dysfunctional PTMs sites. The cytoplasmic receptors consist of split-TEVp domains that reconstitute an active protease through chemically-induced dimerization (CID) that is triggered by rapamycin, abscisic acid, or gibberellin. Inside the ER, however, some of these receptors were non-functional, but their activity was restored by mutagenesis of cysteine and asparagine, residues that are typically associated with PTMs. Finally, we engineered orthogonal chemically activated cell-surface receptors (OCARs) consisting of the Notch1 transmembrane domain fused to cytoplasmic tTA and extracellular CID domains. Mutagenesis of cysteine residues in CID domains afforded functional OCARs which enabled fine-tuning of orthogonal signalling in mammalian cells.
Collapse
Affiliation(s)
- Mohamed Mahameed
- Department of Biosystems Science and Engineering, ETH Zurich, Mattenstrasse 26, CH-4058, Basel, Switzerland
| | - Pengli Wang
- Department of Biosystems Science and Engineering, ETH Zurich, Mattenstrasse 26, CH-4058, Basel, Switzerland
| | - Shuai Xue
- Department of Biosystems Science and Engineering, ETH Zurich, Mattenstrasse 26, CH-4058, Basel, Switzerland
| | - Martin Fussenegger
- Department of Biosystems Science and Engineering, ETH Zurich, Mattenstrasse 26, CH-4058, Basel, Switzerland.
- Faculty of Life Science, University of Basel, Mattenstrasse 26, CH-4058, Basel, Switzerland.
| |
Collapse
|
14
|
Irfan M, Iqbal T, Hashmi S, Ghani U, Bhatti A. Insilico prediction and functional analysis of nonsynonymous SNPs in human CTLA4 gene. Sci Rep 2022; 12:20441. [PMID: 36443461 PMCID: PMC9705290 DOI: 10.1038/s41598-022-24699-0] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2022] [Accepted: 11/18/2022] [Indexed: 11/29/2022] Open
Abstract
The CTLA4 receptor is an immune checkpoint involved in the downregulation of T cells. Polymorphisms in this gene have been found to be associated with different diseases like rheumatoid arthritis, autosomal dominant immune dysregulation syndrome, juvenile idiopathic arthritis and autoimmune Addison's disease. Therefore, the identification of polymorphisms that have an effect on the structure and function of CTLA4 gene is important. Here we identified the most damaging missense or non-synonymous SNPs (nsSNPs) that might be crucial for the structure and function of CTLA4 using different bioinformatics tools. These in silico tools included SIFT, PROVEAN, PhD-SNP, PolyPhen-2 followed by MutPred2, I-Mutant 2.0 and ConSurf. The protein structures were predicted using Phyre2 and I-TASSER, while the gene-gene interactions were predicted by GeneMANIA and STRING. Our study identified three damaging missense SNPs rs1553657429, rs1559591863 and rs778534474 in coding region of CTLA4 gene. Among these SNPs the rs1553657429 showed a loss of potential phosphorylation site and was found to be highly conserved. The prediction of gene-gene interaction showed the interaction of CTlA4 with other genes and its importance in different pathways. This investigation of damaging nsSNPs can be considered in future while studying CTLA4 related diseases and can be of great importance in precision medicine.
Collapse
Affiliation(s)
- Muhammad Irfan
- grid.412117.00000 0001 2234 2376Healthcare Biotechnology, National University of Science and Technology, Islamabad H-12, 44000 Pakistan
| | - Talha Iqbal
- grid.412117.00000 0001 2234 2376Healthcare Biotechnology, National University of Science and Technology, Islamabad H-12, 44000 Pakistan
| | - Sakina Hashmi
- grid.412117.00000 0001 2234 2376Healthcare Biotechnology, National University of Science and Technology, Islamabad H-12, 44000 Pakistan
| | - Uzma Ghani
- grid.412117.00000 0001 2234 2376Healthcare Biotechnology, National University of Science and Technology, Islamabad H-12, 44000 Pakistan
| | - Attya Bhatti
- grid.412117.00000 0001 2234 2376Healthcare Biotechnology, National University of Science and Technology, Islamabad H-12, 44000 Pakistan
| |
Collapse
|
15
|
Weigle AT, Feng J, Shukla D. Thirty years of molecular dynamics simulations on posttranslational modifications of proteins. Phys Chem Chem Phys 2022; 24:26371-26397. [PMID: 36285789 PMCID: PMC9704509 DOI: 10.1039/d2cp02883b] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/06/2023]
Abstract
Posttranslational modifications (PTMs) are an integral component to how cells respond to perturbation. While experimental advances have enabled improved PTM identification capabilities, the same throughput for characterizing how structural changes caused by PTMs equate to altered physiological function has not been maintained. In this Perspective, we cover the history of computational modeling and molecular dynamics simulations which have characterized the structural implications of PTMs. We distinguish results from different molecular dynamics studies based upon the timescales simulated and analysis approaches used for PTM characterization. Lastly, we offer insights into how opportunities for modern research efforts on in silico PTM characterization may proceed given current state-of-the-art computing capabilities and methodological advancements.
Collapse
Affiliation(s)
- Austin T Weigle
- Department of Chemistry, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, USA
| | - Jiangyan Feng
- Department of Chemical and Biomolecular Engineering, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, USA
| | - Diwakar Shukla
- Department of Chemical and Biomolecular Engineering, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, USA
- Center for Biophysics and Quantitative Biology, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, USA
- Department of Bioengineering, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, USA
- Department of Plant Biology, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, USA.
| |
Collapse
|
16
|
Abstract
Artificial intelligence (AI) methods have been and are now being increasingly integrated in prediction software implemented in bioinformatics and its glycoscience branch known as glycoinformatics. AI techniques have evolved in the past decades, and their applications in glycoscience are not yet widespread. This limited use is partly explained by the peculiarities of glyco-data that are notoriously hard to produce and analyze. Nonetheless, as time goes, the accumulation of glycomics, glycoproteomics, and glycan-binding data has reached a point where even the most recent deep learning methods can provide predictors with good performance. We discuss the historical development of the application of various AI methods in the broader field of glycoinformatics. A particular focus is placed on shining a light on challenges in glyco-data handling, contextualized by lessons learnt from related disciplines. Ending on the discussion of state-of-the-art deep learning approaches in glycoinformatics, we also envision the future of glycoinformatics, including development that need to occur in order to truly unleash the capabilities of glycoscience in the systems biology era.
Collapse
Affiliation(s)
- Daniel Bojar
- Department
of Chemistry and Molecular Biology, University
of Gothenburg, Gothenburg 41390, Sweden
- Wallenberg
Centre for Molecular and Translational Medicine, University of Gothenburg, Gothenburg 41390, Sweden
| | - Frederique Lisacek
- Proteome
Informatics Group, Swiss Institute of Bioinformatics, CH-1227 Geneva, Switzerland
- Computer
Science Department & Section of Biology, University of Geneva, route de Drize 7, CH-1227, Geneva, Switzerland
| |
Collapse
|
17
|
Akmal MA, Hassan MA, Muhammad S, Khurshid KS, Mohamed A. An analytical study on the identification of N-linked glycosylation sites using machine learning model. PeerJ Comput Sci 2022; 8:e1069. [PMID: 36262138 PMCID: PMC9575850 DOI: 10.7717/peerj-cs.1069] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2022] [Accepted: 07/25/2022] [Indexed: 06/16/2023]
Abstract
N-linked is the most common type of glycosylation which plays a significant role in identifying various diseases such as type I diabetes and cancer and helps in drug development. Most of the proteins cannot perform their biological and psychological functionalities without undergoing such modification. Therefore, it is essential to identify such sites by computational techniques because of experimental limitations. This study aims to analyze and synthesize the progress to discover N-linked places using machine learning methods. It also explores the performance of currently available tools to predict such sites. Almost seventy research articles published in recognized journals of the N-linked glycosylation field have shortlisted after the rigorous filtering process. The findings of the studies have been reported based on multiple aspects: publication channel, feature set construction method, training algorithm, and performance evaluation. Moreover, a literature survey has developed a taxonomy of N-linked sequence identification. Our study focuses on the performance evaluation criteria, and the importance of N-linked glycosylation motivates us to discover resources that use computational methods instead of the experimental method due to its limitations.
Collapse
Affiliation(s)
- Muhammad Aizaz Akmal
- Department of Computer Science, University of Engineering and Technology, KSK, Lahore, Punjab, Pakistan
| | - Muhammad Awais Hassan
- Department of Computer Science, University of Engineering and Technology, Lahore, Punjab, Pakistan
| | - Shoaib Muhammad
- Department of Computer Science, University of Engineering and Technology, Lahore, Punjab, Pakistan
| | - Khaldoon S. Khurshid
- Department of Computer Science, University of Engineering and Technology, Lahore, Punjab, Pakistan
| | | |
Collapse
|
18
|
Biophysical differences in IgG1 Fc-based therapeutics relate to their cellular handling, interaction with FcRn and plasma half-life. Commun Biol 2022; 5:832. [PMID: 35982144 PMCID: PMC9388496 DOI: 10.1038/s42003-022-03787-x] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2022] [Accepted: 08/01/2022] [Indexed: 01/07/2023] Open
Abstract
Antibody-based therapeutics (ABTs) are used to treat a range of diseases. Most ABTs are either full-length IgG1 antibodies or fusions between for instance antigen (Ag)-binding receptor domains and the IgG1 Fc fragment. Interestingly, their plasma half-life varies considerably, which may relate to how they engage the neonatal Fc receptor (FcRn). As such, there is a need for an in-depth understanding of how different features of ABTs affect FcRn-binding and transport behavior. Here, we report on how FcRn-engagement of the IgG1 Fc fragment compare to clinically relevant IgGs and receptor domain Fc fusions, binding to VEGF or TNF-α. The results reveal FcRn-dependent intracellular accumulation of the Fc, which is in line with shorter plasma half-life than that of full-length IgG1 in human FcRn-expressing mice. Receptor domain fusion to the Fc increases its half-life, but not to the extent of IgG1. This is mirrored by a reduced cellular recycling capacity of the Fc-fusions. In addition, binding of cognate Ag to ABTs show that complexes of similar size undergo cellular transport at different rates, which could be explained by the biophysical properties of each ABT. Thus, the study provides knowledge that should guide tailoring of ABTs regarding optimal cellular sorting and plasma half-life. Analysis of clinically approved antibody-based therapeutics reveals different structural designs, such as full-length IgG1 or Fc-fusions, entail distinct biophysical properties that affect FcRn binding, intracellular transport and plasma half-life.
Collapse
|
19
|
Moreira RS, Filho VB, Calomeno NA, Wagner G, Miletti LC. EpiBuilder: A Tool for Assembling, Searching, and Classifying B-Cell Epitopes. Bioinform Biol Insights 2022; 16:11779322221095221. [PMID: 35571557 PMCID: PMC9102138 DOI: 10.1177/11779322221095221] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2022] [Accepted: 03/29/2022] [Indexed: 11/16/2022] Open
Abstract
Epitopes are portions of a protein that are recognized by antibodies. These small amino acid sequences represent a significant breakthrough in a branch of bioinformatics called immunoinformatics. Various software are available for linear B-cell epitope (BCE) prediction such as ABCPred, SVMTrip, EpiDope, and EpitopeVec; a well-known BCE predictor is BepiPred-2.0. However, despite the prediction, there are several essential steps, such as epitope assembly, evaluation, and searching for epitopes in other proteomes. Here, we present EpiBuilder (https://epibuilder.sourceforge.io), a user friendly software that assists in epitope assembly, classifying and searching using input results of BepiPred-2.0. EpiBuilder generates several output results from these data and supports a proteome-wide processing approach. In addition, this software provides the following features: Chou & Fasman beta-turn prediction, Emini surface accessibility prediction, Karplus and Schulz flexibility prediction, Kolaskar and Tongaonkar antigenicity, Parker hydrophilicity prediction, N-glycosylation domains, and hydropathy. These information generate a unique topology for each epitope, visually demonstrating its characteristics. The software can search the entire epitope sequence in various FASTA files, and it allows to use BLASTP to identify epitopes that eventually have sequence variations. As an EpiBuilder application, we developed a epitope dataset from the protozoan Trypanosoma brucei gambiense, the gram-positive bacterium Clostridioides difficile, and severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2).
Collapse
Affiliation(s)
- Renato Simões Moreira
- Laboratório de Hemoparasitas e Vetores, Departamento de Produção Animal e Alimentos, Centro de Ciências Agroveterinárias (CAV), Universidade do Estado de Santa Catarina (UDESC), Lages, Brazil
- Instituto Federal de Santa Catarina (IFSC), Lages, Brazil
| | - Vilmar Benetti Filho
- Laboratório de Bioinformática, Universidade Federal de Santa Catarina, Florianópolis, Brazil
| | - Nathália Anderson Calomeno
- Laboratório de Hemoparasitas e Vetores, Departamento de Produção Animal e Alimentos, Centro de Ciências Agroveterinárias (CAV), Universidade do Estado de Santa Catarina (UDESC), Lages, Brazil
| | - Glauber Wagner
- Laboratório de Bioinformática, Universidade Federal de Santa Catarina, Florianópolis, Brazil
| | - Luiz Claudio Miletti
- Laboratório de Hemoparasitas e Vetores, Departamento de Produção Animal e Alimentos, Centro de Ciências Agroveterinárias (CAV), Universidade do Estado de Santa Catarina (UDESC), Lages, Brazil
| |
Collapse
|
20
|
Venkatraman V, Colligan TH, Lesica GT, Olson DR, Gaiser J, Copeland CJ, Wheeler TJ, Roy A. Drugsniffer: An Open Source Workflow for Virtually Screening Billions of Molecules for Binding Affinity to Protein Targets. Front Pharmacol 2022; 13:874746. [PMID: 35559261 PMCID: PMC9086895 DOI: 10.3389/fphar.2022.874746] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2022] [Accepted: 04/04/2022] [Indexed: 11/13/2022] Open
Abstract
The SARS-CoV2 pandemic has highlighted the importance of efficient and effective methods for identification of therapeutic drugs, and in particular has laid bare the need for methods that allow exploration of the full diversity of synthesizable small molecules. While classical high-throughput screening methods may consider up to millions of molecules, virtual screening methods hold the promise of enabling appraisal of billions of candidate molecules, thus expanding the search space while concurrently reducing costs and speeding discovery. Here, we describe a new screening pipeline, called drugsniffer, that is capable of rapidly exploring drug candidates from a library of billions of molecules, and is designed to support distributed computation on cluster and cloud resources. As an example of performance, our pipeline required ∼40,000 total compute hours to screen for potential drugs targeting three SARS-CoV2 proteins among a library of ∼3.7 billion candidate molecules.
Collapse
Affiliation(s)
- Vishwesh Venkatraman
- Department of Chemistry, Norwegian University of Science and Technology, Trondheim, Norway
| | - Thomas H. Colligan
- Department of Computer Science, University of Montana, Missoula, MT, United States
| | - George T. Lesica
- Department of Computer Science, University of Montana, Missoula, MT, United States
| | - Daniel R. Olson
- Department of Computer Science, University of Montana, Missoula, MT, United States
| | - Jeremiah Gaiser
- Department of Computer Science, University of Montana, Missoula, MT, United States
| | - Conner J. Copeland
- Department of Computer Science, University of Montana, Missoula, MT, United States
| | - Travis J. Wheeler
- Department of Computer Science, University of Montana, Missoula, MT, United States
| | - Amitava Roy
- Department of Computer Science, University of Montana, Missoula, MT, United States
- Rocky Mountain Laboratories, Bioinformatics and Computational Biosciences Branch, Office of Cyber Infrastructure and Computational Biology, National Institute of Allergy and Infectious Diseases, National Institutes of Health, Hamilton, MT, United States
| |
Collapse
|
21
|
Welzel G, Schuster S. Connexins evolved after early chordates lost innexin diversity. eLife 2022; 11:74422. [PMID: 35042580 PMCID: PMC8769644 DOI: 10.7554/elife.74422] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2021] [Accepted: 01/05/2022] [Indexed: 11/26/2022] Open
Abstract
Gap junction channels are formed by two unrelated protein families. Non-chordates use the primordial innexins, while chordates use connexins that superseded the gap junction function of innexins. Chordates retained innexin-homologs, but N-glycosylation prevents them from forming gap junctions. It is puzzling why chordates seem to exclusively use the new gap junction protein and why no chordates should exist that use non-glycosylated innexins to form gap junctions. Here, we identified glycosylation sites of 2388 innexins from 174 non-chordate and 276 chordate species. Among all chordates, we found not a single innexin without glycosylation sites. Surprisingly, the glycosylation motif is also widespread among non-chordate innexins indicating that glycosylated innexins are not a novelty of chordates. In addition, we discovered a loss of innexin diversity during early chordate evolution. Most importantly, lancelets, which lack connexins, exclusively possess only one highly conserved innexin with one glycosylation site. A bottleneck effect might thus explain why connexins have become the only protein used to form chordate gap junctions.
Collapse
Affiliation(s)
- Georg Welzel
- Department of Animal Physiology, University of Bayreuth
| | | |
Collapse
|
22
|
Taherzadeh G, Campbell M, Zhou Y. Computational Prediction of N- and O-Linked Glycosylation Sites for Human and Mouse Proteins. Methods Mol Biol 2022; 2499:177-186. [PMID: 35696081 DOI: 10.1007/978-1-0716-2317-6_9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
Protein glycosylation is one of the most complex posttranslational modifications (PTM) that play a fundamental role in protein function. Identification and annotation of these sites using experimental approaches are challenging and time consuming. Hence, there is a demand to build fast and efficient computational methods to address this problem. Here, we present the SPRINT-Gly framework containing the largest dataset and a prediction model of glycosylation sites for a given protein sequence. In this framework, we construct a large dataset containing N- and O-linked glycosylation sites of human and mouse proteins, collected from different sources. We then introduce the SPRINT-Gly method to predict putative N- and O-linked sites. SPRINT-Gly is a machine learning-based approach consisting of a number of trained predictive models for glycosylation sites in both human and mouse proteins, separately. The method is built by incorporating sequence-based, predicted structural, and physicochemical information of the neighboring residues of each N- and O-linked glycosylation site and by training deep learning neural network and support vector machine as classifiers. SPRINT-Gly outperformed other existing methods by achieving 18% and 50% higher Matthew's correlation coefficient for N- and O-linked glycosylation site prediction, respectively. SPRINT-Gly is publicly available as an online and stand-alone predictor at https://sparks-lab.org/server/sprint-gly/ .
Collapse
Affiliation(s)
- Ghazaleh Taherzadeh
- Department of Mathematics and Computer Science, Wilkes University, Wilkes-Barre, PA, USA.
| | - Matthew Campbell
- Institute for Glycomics, Griffith University, Southport, QLD, Australia
| | - Yaoqi Zhou
- Institute for Systems and Physical Biology, Shenzhen Bay Laboratory, Shenzhen, China
| |
Collapse
|
23
|
Sobitan A, Mahase V, Rhoades R, Williams D, Liu D, Xie Y, Li L, Tang Q, Teng S. Computational Saturation Mutagenesis of SARS-CoV-1 Spike Glycoprotein: Stability, Binding Affinity, and Comparison With SARS-CoV-2. Front Mol Biosci 2021; 8:784303. [PMID: 34957216 PMCID: PMC8696472 DOI: 10.3389/fmolb.2021.784303] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2021] [Accepted: 11/18/2021] [Indexed: 12/24/2022] Open
Abstract
Severe Acute respiratory syndrome coronavirus (SARS-CoV-1) attaches to the host cell surface to initiate the interaction between the receptor-binding domain (RBD) of its spike glycoprotein (S) and the human Angiotensin-converting enzyme (hACE2) receptor. SARS-CoV-1 mutates frequently because of its RNA genome, which challenges the antiviral development. Here, we per-formed computational saturation mutagenesis of the S protein of SARS-CoV-1 to identify the residues crucial for its functions. We used the structure-based energy calculations to analyze the effects of the missense mutations on the SARS-CoV-1 S stability and the binding affinity with hACE2. The sequence and structure alignment showed similarities between the S proteins of SARS-CoV-1 and SARS-CoV-2. Interestingly, we found that target mutations of S protein amino acids generate similar effects on their stabilities between SARS-CoV-1 and SARS-CoV-2. For example, G839W of SARS-CoV-1 corresponds to G857W of SARS-CoV-2, which decrease the stability of their S glycoproteins. The viral mutation analysis of the two different SARS-CoV-1 isolates showed that mutations, T487S and L472P, weakened the S-hACE2 binding of the 2003–2004 SARS-CoV-1 isolate. In addition, the mutations of L472P and F360S destabilized the 2003–2004 viral isolate. We further predicted that many mutations on N-linked glycosylation sites would increase the stability of the S glycoprotein. Our results can be of therapeutic importance in the design of antivirals or vaccines against SARS-CoV-1 and SARS-CoV-2.
Collapse
Affiliation(s)
- Adebiyi Sobitan
- Department of Biology, Howard University, Washington, DC, United States
| | - Vidhyanand Mahase
- Department of Biology, Howard University, Washington, DC, United States
| | - Raina Rhoades
- Department of Biology, Howard University, Washington, DC, United States
| | - Dejaun Williams
- Department of Biology, Howard University, Washington, DC, United States
| | - Dongxiao Liu
- Howard University College of Medicine, Washington, DC, United States
| | - Yixin Xie
- Computational Science Program, University of Texas at El Paso, El Paso, TX, United States
| | - Lin Li
- Computational Science Program, University of Texas at El Paso, El Paso, TX, United States.,Physics Department, University of Texas at El Paso, El Paso, TX, United States
| | - Qiyi Tang
- Howard University College of Medicine, Washington, DC, United States
| | - Shaolei Teng
- Department of Biology, Howard University, Washington, DC, United States
| |
Collapse
|
24
|
Pakhrin SC, Aoki-Kinoshita KF, Caragea D, KC DB. DeepNGlyPred: A Deep Neural Network-Based Approach for Human N-Linked Glycosylation Site Prediction. Molecules 2021; 26:molecules26237314. [PMID: 34885895 PMCID: PMC8658957 DOI: 10.3390/molecules26237314] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2021] [Revised: 11/22/2021] [Accepted: 11/26/2021] [Indexed: 12/21/2022] Open
Abstract
Protein N-linked glycosylation is a post-translational modification that plays an important role in a myriad of biological processes. Computational prediction approaches serve as complementary methods for the characterization of glycosylation sites. Most of the existing predictors for N-linked glycosylation utilize the information that the glycosylation site occurs at the N-X-[S/T] sequon, where X is any amino acid except proline. Not all N-X-[S/T] sequons are glycosylated, thus the N-X-[S/T] sequon is a necessary but not sufficient determinant for protein glycosylation. In that regard, computational prediction of N-linked glycosylation sites confined to N-X-[S/T] sequons is an important problem. Here, we report DeepNGlyPred a deep learning-based approach that encodes the positive and negative sequences in the human proteome dataset (extracted from N-GlycositeAtlas) using sequence-based features (gapped-dipeptide), predicted structural features, and evolutionary information. DeepNGlyPred produces SN, SP, MCC, and ACC of 88.62%, 73.92%, 0.60, and 79.41%, respectively on N-GlyDE independent test set, which is better than the compared approaches. These results demonstrate that DeepNGlyPred is a robust computational technique to predict N-Linked glycosylation sites confined to N-X-[S/T] sequon. DeepNGlyPred will be a useful resource for the glycobiology community.
Collapse
Affiliation(s)
- Subash C. Pakhrin
- School of Computing, Wichita State University, 1845 Fairmount St., Wichita, KS 67260, USA;
| | | | - Doina Caragea
- Department of Computer Science, Kansas State University, Manhattan, KS 66506, USA;
| | - Dukka B. KC
- Department of Computer Science, Michigan Technological University, Houghton, MI 49931, USA
- Correspondence: ; Tel.: +1-906-487-1657
| |
Collapse
|
25
|
Islam MS, Hasan MS, Hasan MN, Prodhan SH, Islam T, Ghosh A. Genome-wide identification, evolution, and transcript profiling of Aldehyde dehydrogenase superfamily in potato during development stages and stress conditions. Sci Rep 2021; 11:18284. [PMID: 34521910 PMCID: PMC8440639 DOI: 10.1038/s41598-021-97691-9] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2020] [Accepted: 08/09/2021] [Indexed: 02/08/2023] Open
Abstract
The Aldehyde dehydrogenase (ALDH) superfamily comprises a group of enzymes involved in the scavenging of toxic aldehyde molecules by converting them into their corresponding non-toxic carboxylic acids. A genome-wide study in potato identified a total of 22 ALDH genes grouped into ten families that are presented unevenly throughout all the 12 chromosomes. Based on the evolutionary analysis of ALDH proteins from different plant species, ALDH2 and ALDH3 were found to be the most abundant families in the plant, while ALDH18 was found to be the most distantly related one. Gene expression analysis revealed that the expression of StALDH genes is highly tissue-specific and divergent in various abiotic, biotic, and hormonal treatments. Structural modelling and functional analysis of selected StALDH members revealed conservancy in their secondary structures and cofactor binding sites. Taken together, our findings provide comprehensive information on the ALDH gene family in potato that will help in developing a framework for further functional studies.
Collapse
Affiliation(s)
- Md Sifatul Islam
- Department of Biochemistry and Molecular Biology, Shahjalal University of Science and Technology, Sylhet, 3114, Bangladesh
| | - Md Soyib Hasan
- Department of Biochemistry and Molecular Biology, Shahjalal University of Science and Technology, Sylhet, 3114, Bangladesh
| | - Md Nazmul Hasan
- Department of Genetic Engineering and Biotechnology, Shahjalal University of Science and Technology, Sylhet, 3114, Bangladesh
| | - Shamsul H Prodhan
- Department of Genetic Engineering and Biotechnology, Shahjalal University of Science and Technology, Sylhet, 3114, Bangladesh
| | - Tahmina Islam
- Department of Botany, University of Dhaka, Dhaka, 3114, Bangladesh
| | - Ajit Ghosh
- Department of Biochemistry and Molecular Biology, Shahjalal University of Science and Technology, Sylhet, 3114, Bangladesh.
| |
Collapse
|
26
|
Ouranidis A, Choli-Papadopoulou T, Papachristou ET, Papi R, Kostomitsopoulos N. Biopharmaceutics 4.0, Advanced Pre-Clinical Development of mRNA-Encoded Monoclonal Antibodies to Immunosuppressed Murine Models. Vaccines (Basel) 2021; 9:890. [PMID: 34452015 PMCID: PMC8402437 DOI: 10.3390/vaccines9080890] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2021] [Revised: 07/30/2021] [Accepted: 07/30/2021] [Indexed: 12/18/2022] Open
Abstract
Administration of mRNA against SARS-CoV-2 has demonstrated sufficient efficacy, tolerability and clinical potential to disrupt the vaccination field. A multiple-arm, cohort randomized, mixed blind, placebo-controlled study was designed to investigate the in vivo expression of mRNA antibodies to immunosuppressed murine models to conduct efficacy, safety and bioavailability evaluation. Enabling 4.0 tools we reduced animal sacrifice, while interventions were designed compliant to HARRP and SPIRIT engagement: (a) Randomization, blinding; (b) pharmaceutical grade formulation, monitoring; (c) biochemical and histological analysis; and (d) theoretic, statistical analysis. Risk assessment molded the study orientations, according to the ARRIVE guidelines. The primary target of this protocol is the validation of the research hypothesis that autologous translation of Trastuzumab by in vitro transcribed mRNA-encoded antibodies to immunosuppressed animal models, is non-inferior to classical treatments. The secondary target is the comparative pharmacokinetic assessment of the novel scheme, between immunodeficient and healthy subjects. Herein, the debut clinical protocol, investigating the pharmacokinetic/pharmacodynamic impact of mRNA vaccination to immunodeficient organisms. Our design, contributes novel methodology to guide the preclinical development of RNA antibody modalities by resolving efficacy, tolerability and dose regime adjustment for special populations that are incapable of humoral defense.
Collapse
Affiliation(s)
- Andreas Ouranidis
- Department of Pharmaceutical Technology, School of Pharmacy, Aristotle University of Thessaloniki, 54124 Thessaloniki, Greece
- Department of Chemical Engineering, Polytechnic School, Aristotle University of Thessaloniki, 54124 Thessaloniki, Greece
| | - Theodora Choli-Papadopoulou
- Department of Chemistry, Aristotle University of Thessaloniki, 54124 Thessaloniki, Greece; (T.C.-P.); (E.T.P.); (R.P.)
| | - Eleni T. Papachristou
- Department of Chemistry, Aristotle University of Thessaloniki, 54124 Thessaloniki, Greece; (T.C.-P.); (E.T.P.); (R.P.)
| | - Rigini Papi
- Department of Chemistry, Aristotle University of Thessaloniki, 54124 Thessaloniki, Greece; (T.C.-P.); (E.T.P.); (R.P.)
| | - Nikolaos Kostomitsopoulos
- Center of Clinical, Experimental Surgery and Translational Research, Biomedical Research Foundation of the Academy of Athens, 11527 Athens, Greece;
| |
Collapse
|
27
|
Islam MKB, Rahman J, Hasan MAM, Ahmad S. predForm-Site: Formylation site prediction by incorporating multiple features and resolving data imbalance. Comput Biol Chem 2021; 94:107553. [PMID: 34384997 DOI: 10.1016/j.compbiolchem.2021.107553] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2020] [Revised: 06/22/2021] [Accepted: 07/28/2021] [Indexed: 10/20/2022]
Abstract
Formylation is one of the newly discovered post-translational modifications in lysine residue which is responsible for different kinds of diseases. In this work, a novel predictor, named predForm-Site, has been developed to predict formylation sites with higher accuracy. We have integrated multiple sequence features for developing a more informative representation of formylation sites. Moreover, decision function of the underlying classifier have been optimized on skewed formylation dataset during prediction model training for prediction quality improvement. On the dataset used by LFPred and Formator predictor, predForm-Site achieved 99.5% sensitivity, 99.8% specificity and 99.8% overall accuracy with AUC of 0.999 in the jackknife test. In the independent test, it has also achieved more than 97% sensitivity and 99% specificity. Similarly, in benchmarking with recent method CKSAAP_FormSite, the proposed predictor significantly outperformed in all the measures, particularly sensitivity by around 20%, specificity by nearly 30% and overall accuracy by more than 22%. These experimental results show that the proposed predForm-Site can be used as a complementary tool for the fast exploration of formylation sites. For convenience of the scientific community, predForm-Site has been deployed as an online tool, accessible at http://103.99.176.239:8080/predForm-Site.
Collapse
Affiliation(s)
- Md Khaled Ben Islam
- Institute for Integrated and Intelligent Systems, Griffith University, Brisbane, Australia; Department of Computer Science & Engineering, Pabna University of Science and Technology, Pabna, Bangladesh.
| | - Julia Rahman
- Institute for Integrated and Intelligent Systems, Griffith University, Brisbane, Australia; Department of Computer Science & Engineering, Rajshahi University of Engineering and Technology, Rajshahi, Bangladesh.
| | - Md Al Mehedi Hasan
- Department of Computer Science & Engineering, Rajshahi University of Engineering and Technology, Rajshahi, Bangladesh
| | - Shamim Ahmad
- Department of Computer Science & Engineering, Rajshahi University, Rajshahi, Bangladesh
| |
Collapse
|
28
|
Attwood MM, Schiöth HB. Characterization of Five Transmembrane Proteins: With Focus on the Tweety, Sideroflexin, and YIP1 Domain Families. Front Cell Dev Biol 2021; 9:708754. [PMID: 34350187 PMCID: PMC8327215 DOI: 10.3389/fcell.2021.708754] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2021] [Accepted: 06/28/2021] [Indexed: 11/25/2022] Open
Abstract
Transmembrane proteins are involved in many essential cell processes such as signal transduction, transport, and protein trafficking, and hence many are implicated in different disease pathways. Further, as the structure and function of proteins are correlated, investigating a group of proteins with the same tertiary structure, i.e., the same number of transmembrane regions, may give understanding about their functional roles and potential as therapeutic targets. This analysis investigates the previously unstudied group of proteins with five transmembrane-spanning regions (5TM). More than half of the 58 proteins identified with the 5TM architecture belong to 12 families with two or more members. Interestingly, more than half the proteins in the dataset function in localization activities through movement or tethering of cell components and more than one-third are involved in transport activities, particularly in the mitochondria. Surprisingly, no receptor activity was identified within this dataset in large contrast with other TM groups. The three major 5TM families, which comprise nearly 30% of the dataset, include the tweety family, the sideroflexin family and the Yip1 domain (YIPF) family. We also analyzed the evolutionary origin of these three families. The YIPF family appears to be the most ancient with presence in bacteria and archaea, while the tweety and sideroflexin families are first found in eukaryotes. We found no evidence of common decent for these three families. About 30% of the 5TM proteins have prominent expression in the brain, liver, or testis. Importantly, 60% of these proteins are identified as cancer prognostic markers, where they are associated with clinical outcomes of various tumor types. Nearly 10% of the 5TMs are still not fully characterized and further investigation of their functional activities and expression is warranted. This study provides the first comprehensive analysis of proteins with the 5TM architecture, providing details of their unique characteristics.
Collapse
Affiliation(s)
- Misty M Attwood
- Functional Pharmacology, Department of Neuroscience, Uppsala University, Uppsala, Sweden
| | - Helgi B Schiöth
- Functional Pharmacology, Department of Neuroscience, Uppsala University, Uppsala, Sweden.,Institute for Translational Medicine and Biotechnology, Sechenov First Moscow State Medical University, Moscow, Russia
| |
Collapse
|
29
|
Genome sequence of the cardiopulmonary canid nematode Angiostrongylus vasorum reveals species-specific genes with potential involvement in coagulopathy. Genomics 2021; 113:2695-2701. [PMID: 34118383 DOI: 10.1016/j.ygeno.2021.06.010] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2020] [Revised: 05/21/2021] [Accepted: 06/07/2021] [Indexed: 11/22/2022]
Abstract
Angiostrongylus vasorum is an emerging parasitic nematode of canids and causes respiratory distress, bleeding, and other signs in dogs. Despite its clinical importance, the molecular toolbox allowing the study of the parasite is incomplete. To address this gap, we have sequenced its nuclear genome using Oxford nanopore sequencing, polished with Illumina reads. The size of the final genome is 280 Mb comprising 468 contigs, with an N50 value of 1.68 Mb and a BUSCO score of 93.5%. Ninety-three percent of 13,766 predicted genes were assigned to putative functions. Three folate carriers were found exclusively in A. vasorum, with potential involvement in host coagulopathy. A screen for previously identified vaccine candidates, the aminopeptidase H11 and the somatic protein rHc23, revealed homologs in A. vasorum. The genome sequence will provide a foundation for the development of new tools against canine angiostrongylosis, supporting the identification of potential drug and vaccine targets.
Collapse
|
30
|
Krishnan S, Krishnan GP. N-Glycosylation Network Construction and Analysis to Modify Glycans on the Spike (S) Glycoprotein of SARS-CoV-2. FRONTIERS IN BIOINFORMATICS 2021; 1:667012. [PMID: 36303733 PMCID: PMC9581045 DOI: 10.3389/fbinf.2021.667012] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2021] [Accepted: 05/11/2021] [Indexed: 12/26/2022] Open
Abstract
Background: The N-glycan structure and composition of the spike (S) protein of SARS-CoV-2 are pertinent to vaccine development and efficacy. Methods: We reconstructed the glycosylation network based on previously published mass spectrometry data using GNAT, a glycosylation network analysis tool. Our compilation of the network tool had 26 glycosyltransferase and glucosidase enzymes and could infer the pathway of glycosylation machinery based on glycans in the virus spike protein. Once the glycan biosynthesis pathway was generated, we simulated the effect of blocking specific enzymes—swainsonine or deoxynojirimycin for blocking mannosidase-II and indolizidine for blocking alpha-1,6-fucosyltransferase—to see how they would affect the biosynthesis network and the glycans that were synthesized. Results: The N-glycan biosynthesis network of SARS-CoV-2 spike protein shows an elaborate enzymatic pathway with several intermediate glycans, along with the ones identified by mass spectrometric studies. Of the 26 enzymes, the following were involved—Man-Ia, MGAT1, MGAT2, MGAT4, MGAT5, B3GalT, B4GalT, Man-II, SiaT, ST3GalI, ST3GalVI, and FucT8. Blocking specific enzymes resulted in a substantially modified glycan profile of SARS-CoV-2. Conclusion: Variations in the final N-glycan profile of the virus, given its site-specific microheterogeneity, are factors in the host response to the infection, vaccines, and antibodies. Heterogeneity in the N-glycan profile of the spike (S) protein and its potential effect on vaccine efficacy or adverse reactions to the vaccines remain unexplored. Here, we provide all the resources we generated—the glycans in the glycoCT xml format and the biosynthesis network for future work.
Collapse
|
31
|
Wang B, Wu G, Li K, Ling J, Zhao Y, Liu F. A Glycoside Hydrolase Family 99-Like Domain-Containing Protein Modifies Outer Membrane Proteins to Maintain Xanthomonas Pathogenicity and Viability in Stressful Environments. PHYTOPATHOLOGY 2021; 111:929-939. [PMID: 33174820 DOI: 10.1094/phyto-08-20-0327-r] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
Protein glycosylation is an essential process that plays an important role in proteome stability, protein structure, and protein function modulation in eukaryotes. However, in bacteria, especially plant pathogenic bacteria, similar studies are lacking. Here, we investigated the relationship between protein glycosylation and pathogenicity by using Xanthomonas oryzae pv. oryzae, the causal agent of bacterial leaf blight in rice, as a well-defined example. In our previous work, we identified a virulence-related hypothetical protein, PXO_03177, but how this protein regulates X. oryzae pv. oryzae virulence has remained unclear. BLAST analysis showed that most homologous proteins of PXO_03177 are glycoside hydrolase family 99-like domain-containing proteins. In the current study, we found that the outer membrane integrity of ΔPXO_03177 appeared to be disrupted. Extracting the outer membrane proteins (OMPs) and performing comparative proteomics and sodium dodecyl sulphate-polyacrylamide gel electrophoresis gel staining analyses revealed that PXO_03177 deletion altered the protein levels of 13 OMPs. Western blot analyses showed that the protein level and glycosylation modification of PXO_02523, a related OmpA family-like protein, was changed in the ΔPXO_03177 mutant background strain. Additionally, the interaction between PXO_03177 and PXO_02523 was confirmed by coimmunoprecipitation. Both PXO_03177 and PXO_02523 play important roles in regulating pathogen virulence and viability in stressful environments. This work provides the first evidence that protein glycosylation is necessary for the virulence of plant pathogenic bacteria.
Collapse
Affiliation(s)
- Bo Wang
- Institute of Plant Protection, Jiangsu Academy of Agricultural Sciences, Nanjing 210014, P.R. China
| | - Guichun Wu
- Institute of Plant Protection, Jiangsu Academy of Agricultural Sciences, Nanjing 210014, P.R. China
| | - Kaihuai Li
- Institute of Plant Protection, Jiangsu Academy of Agricultural Sciences, Nanjing 210014, P.R. China
- Department of Plant Pathology, College of Plant Protection, Nanjing Agricultural University, Nanjing 210095, China
| | - Jun Ling
- Institute of Plant Protection, Jiangsu Academy of Agricultural Sciences, Nanjing 210014, P.R. China
| | - Yancun Zhao
- Institute of Plant Protection, Jiangsu Academy of Agricultural Sciences, Nanjing 210014, P.R. China
| | - Fengquan Liu
- Institute of Plant Protection, Jiangsu Academy of Agricultural Sciences, Nanjing 210014, P.R. China
| |
Collapse
|
32
|
Londono-Avendano MA, Peláez-Moreno M, López Medina E, Moreno Turriago MS, Parra Patiño B. Transmission of Respiratory Syncytial Virus genotypes in Cali, Colombia. Influenza Other Respir Viruses 2021; 15:521-528. [PMID: 33830644 PMCID: PMC8189202 DOI: 10.1111/irv.12833] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2020] [Revised: 12/10/2020] [Accepted: 12/15/2020] [Indexed: 01/13/2023] Open
Abstract
Background Colombia's climatological variety, added to pathogen diversity, creates local niches for infectious diseases. In Bogotá, respiratory syncytial virus causes 30%‐52% of the cases of respiratory infections. In coastal or inter‐Andean cities with higher temperature and longer dry seasons, frequency of this virus is 7%‐13%. By 2017, increased hospitalizations due to airway infections occurred in regions whose weather is differently influenced by “El Niño Southern Oscillation” than in Bogotá, although microbial diversity might have also been involved. Methods For Cali, an inter‐Andean city with warm tropical weather, records of respiratory syncytial virus from 2014 to 2018, in children two years old or younger, were analyzed, and genotypes transmitted during 2016‐2017 were identified based on partial sequences of glycoprotein G. Results Most cases of respiratory syncytial virus in Cali occur in the first semesters, with peaks expressed around March‐April, without a clear association with pluviosity. Unlike the biannual rotating pattern of Bogotá, co‐circulation of types A and B was detected. As years pass, transmission seasons are becoming longer and frequencies of the virus augment. The viral genotypes identified follow international trends with dominance of Ontario and Buenos Aires clades. Similar to other isolates in these clades, viruses from Cali exhibit glycosylation variability that may account for their fitness. Conclusions The pattern of respiratory syncytial virus transmission in Cali differs from that in Bogotá. Its epidemiology is shifting and will remain so with the advent of novel respiratory diseases. This may impact the introduction of vaccination schemes for these or other respiratory viruses.
Collapse
Affiliation(s)
- Maria Aurora Londono-Avendano
- Virus and Emerging Diseases -VIREM, Department of Microbiology, College of Health, Universidad del Valle, Cali, Colombia
| | - Melissa Peláez-Moreno
- Virus and Emerging Diseases -VIREM, Department of Microbiology, College of Health, Universidad del Valle, Cali, Colombia.,Currently at Public Health Secretariat, Caquetá, Colombia
| | - Eduardo López Medina
- Pediatric Infectology Study Center (CEIP).,Department of Pediatrics, College of Health, Universidad del Valle, Cali, Colombia.,Clínica Imbanaco, Cali, Colombia
| | - Mabel Soraya Moreno Turriago
- Municipal Public Health Secretariat, Santiago de Cali, Cali, Colombia.,Currently at epidemiology, COOMEVA EPS, Cali, Colombia
| | - Beatriz Parra Patiño
- Virus and Emerging Diseases -VIREM, Department of Microbiology, College of Health, Universidad del Valle, Cali, Colombia
| |
Collapse
|
33
|
Kim T, Xie Y, Li Q, Artegoitia VM, Lebrilla CB, Keim NL, Adams SH, Krishnan S. Diet affects glycosylation of serum proteins in women at risk for cardiometabolic disease. Eur J Nutr 2021; 60:3727-3741. [PMID: 33770218 PMCID: PMC8437848 DOI: 10.1007/s00394-021-02539-7] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2020] [Accepted: 03/11/2021] [Indexed: 02/06/2023]
Abstract
Background Glycoproteomics deals with glycoproteins that are formed by post-translational modification when sugars (like fucose and sialic acid) are attached to protein. Glycosylation of proteins influences function, but whether glycosylation is altered by diet is unknown. Objective To evaluate the effect of consuming a diet based on the Dietary Guidelines for Americans on circulating glycoproteins that have previously been associated with cardiometabolic diseases. Design Forty-four women, with one or more metabolic syndrome characteristics, completed an 8-week randomized controlled feeding intervention (n = 22) consuming a diet based on the Dietary Guidelines for Americans (DGA 2010); the remaining consumed a ‘typical American diet’ (TAD, n = 22). Fasting serum samples were obtained at week0 (baseline) and week8 (post-intervention); 17 serum proteins were chosen for targeted analyses. Protein standards and serum samples were analyzed in a UHPLC-MS protocol to determine peptide concentration and their glycan (fucosylation or sialylation) profiles. Data at baseline were used in correlational analyses; change in proteins and glycans following intervention were used in non-parametric analyses. Results At baseline, women with more metabolic syndrome characteristics had more fucosylation (total di-fucosylated proteins: p = 0.045) compared to women with a lesser number of metabolic syndrome characteristics. Dietary refined grain intake was associated with increased total fucosylation (ρ = − 0.530, p < 0.001) and reduced total sialylation (ρ = 0.311, p = 0.042). After the 8-week intervention, there was higher sialylation following the DGA diet (Total di-sialylated protein p = 0.018, poly-sialylated orosomucoid p = 0.012) compared to the TAD diet. Conclusions Based on this study, glycosylation of proteins is likely affected by dietary patterns; higher sialylation was associated with a healthier diet pattern. Altered glycosylation is associated with several diseases, particularly cancer and type 2 diabetes, and this study raises the possibility that diet may influence disease state by altering glycosylation. Clinical trial registration NCT02298725 at clinicaltrials.gov; https://clinicaltrials.gov/ct2/show/NCT02298725. Supplementary Information The online version contains supplementary material available at 10.1007/s00394-021-02539-7.
Collapse
Affiliation(s)
- Tyler Kim
- Global Disease Biology, University of California Davis, Davis, USA
| | - Yixuan Xie
- Department of Chemistry, University of California Davis, Davis, USA
| | - Qiongyu Li
- Department of Chemistry, University of California Davis, Davis, USA
| | - Virginia M Artegoitia
- Obesity and Metabolism Research Unit, USDA-WHNRC, 430 W. Health Sciences Drive, Davis, CA, 95616, USA
| | | | - Nancy L Keim
- Obesity and Metabolism Research Unit, USDA-WHNRC, 430 W. Health Sciences Drive, Davis, CA, 95616, USA.,Department of Nutrition, University of California Davis, Davis, USA
| | - Sean H Adams
- Department of Surgery, University of California Davis School of Medicine, Sacramento, USA.,Center for Alimentary and Metabolic Science, University of California Davis School of Medicine, Sacramento, USA
| | - Sridevi Krishnan
- Obesity and Metabolism Research Unit, USDA-WHNRC, 430 W. Health Sciences Drive, Davis, CA, 95616, USA. .,Department of Nutrition, University of California Davis, Davis, USA.
| |
Collapse
|
34
|
Weber JJ, Kashipathy MM, Battaile KP, Go E, Desaire H, Kanost MR, Lovell S, Gorman MJ. Structural insight into the novel iron-coordination and domain interactions of transferrin-1 from a model insect, Manduca sexta. Protein Sci 2021; 30:408-422. [PMID: 33197096 PMCID: PMC7784759 DOI: 10.1002/pro.3999] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2020] [Revised: 11/11/2020] [Accepted: 11/13/2020] [Indexed: 11/07/2022]
Abstract
Transferrins function in iron sequestration and iron transport by binding iron tightly and reversibly. Vertebrate transferrins coordinate iron through interactions with two tyrosines, an aspartate, a histidine, and a carbonate anion, and conformational changes that occur upon iron binding and release have been described. Much less is known about the structure and functions of insect transferrin-1 (Tsf1), which is present in hemolymph and influences iron homeostasis mostly by unknown mechanisms. Amino acid sequence and biochemical analyses have suggested that iron coordination by Tsf1 differs from that of the vertebrate transferrins. Here we report the first crystal structure (2.05 Å resolution) of an insect transferrin. Manduca sexta (MsTsf1) in the holo form exhibits a bilobal fold similar to that of vertebrate transferrins, but its carboxyl-lobe adopts a novel orientation and contacts with the amino-lobe. The structure revealed coordination of a single Fe3+ ion in the amino-lobe through Tyr90, Tyr204, and two carbonate anions. One carbonate anion is buried near the ferric ion and is coordinated by four residues, whereas the other carbonate anion is solvent exposed and coordinated by Asn121. Notably, these residues are highly conserved in Tsf1 orthologs. Docking analysis suggested that the solvent exposed carbonate position is capable of binding alternative anions. These findings provide a structural basis for understanding Tsf1 function in iron sequestration and transport in insects as well as insight into the similarities and differences in iron homeostasis between insects and humans.
Collapse
Affiliation(s)
- Jacob J. Weber
- Department of Biochemistry and Molecular BiophysicsKansas State UniversityManhattanKansasUSA
| | - Maithri M. Kashipathy
- Protein Structure Laboratory, Del Shankel Structural Biology CenterUniversity of KansasLawrenceKansasUSA
| | | | - Eden Go
- Department of ChemistryUniversity of KansasLawrenceKansasUSA
| | - Heather Desaire
- Department of ChemistryUniversity of KansasLawrenceKansasUSA
| | - Michael R. Kanost
- Department of Biochemistry and Molecular BiophysicsKansas State UniversityManhattanKansasUSA
| | - Scott Lovell
- Protein Structure Laboratory, Del Shankel Structural Biology CenterUniversity of KansasLawrenceKansasUSA
| | - Maureen J. Gorman
- Department of Biochemistry and Molecular BiophysicsKansas State UniversityManhattanKansasUSA
| |
Collapse
|
35
|
Nguyen TTD, Le NQK, Tran TA, Pham DM, Ou YY. Incorporating a transfer learning technique with amino acid embeddings to efficiently predict N-linked glycosylation sites in ion channels. Comput Biol Med 2021; 130:104212. [PMID: 33454535 DOI: 10.1016/j.compbiomed.2021.104212] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2020] [Revised: 12/21/2020] [Accepted: 01/04/2021] [Indexed: 11/27/2022]
Abstract
Glycosylation is a dynamic enzymatic process that attaches glycan to proteins or other organic molecules such as lipoproteins. Research has shown that such a process in ion channel proteins plays a fundamental role in modulating ion channel functions. This study used a computational method to predict N-linked glycosylation sites, the most common type, in ion channel proteins. From segments of ion channel proteins centered around N-linked glycosylation sites, the amino acid embedding vectors of each residue were concatenated to create features for prediction. We experimented with two different models for converting amino acids to their corresponding embeddings: one was fed with ion channel sequences and the other with a large dataset composed of more than one million protein sequences. The latter model stemmed from the idea of transfer learning technique and emerged as a more efficient feature extractor. Our best model was obtained from this transfer learning approach and a hyperparameter tuning process with a random search on 5-fold cross-validation data. It achieved an accuracy, specificity, sensitivity, and Matthews correlation coefficient of 93.4%, 92.8%, 98.6%, and 0.726, respectively. Corresponding scores on an independent test were 92.9%, 92.2%, 99%, and 0.717. These results outperform the position-specific scoring matrix features that are predominantly employed in post-translational modification site predictions. Furthermore, compared to N-GlyDE, GlycoEP, SPRINT-Gly, the most recent N-linked glycosylation site predictors, our model yields higher scores on the above 4 metrics, thus further demonstrating the efficiency of our approach.
Collapse
Affiliation(s)
| | - Nguyen-Quoc-Khanh Le
- Professional Master Program in Artificial Intelligence in Medicine, College of Medicine, Taipei Medical University, Taipei, 106, Taiwan; Research Center for Artificial Intelligence in Medicine, Taipei Medical University, Taipei, 106, Taiwan
| | | | - Dinh-Minh Pham
- Institute of Biotechnology, Vietnam Academy of Science and Technology, Hanoi, Viet Nam
| | - Yu-Yen Ou
- Department of Computer Science and Engineering, Yuan Ze University, Chung-Li, 32003, Taiwan.
| |
Collapse
|
36
|
Ao C, Yu L, Zou Q. Prediction of bio-sequence modifications and the associations with diseases. Brief Funct Genomics 2020; 20:1-18. [PMID: 33313647 DOI: 10.1093/bfgp/elaa023] [Citation(s) in RCA: 52] [Impact Index Per Article: 10.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2020] [Revised: 11/09/2020] [Accepted: 11/10/2020] [Indexed: 12/22/2022] Open
Abstract
Modifications of protein, RNA and DNA play an important role in many biological processes and are related to some diseases. Therefore, accurate identification and comprehensive understanding of protein, RNA and DNA modification sites can promote research on disease treatment and prevention. With the development of sequencing technology, the number of known sequences has continued to increase. In the past decade, many computational tools that can be used to predict protein, RNA and DNA modification sites have been developed. In this review, we comprehensively summarized the modification site predictors for three different biological sequences and the association with diseases. The relevant web server is accessible at http://lab.malab.cn/∼acy/PTM_data/ some sample data on protein, RNA and DNA modification can be downloaded from that website.
Collapse
|
37
|
Huang X, Zhang C, Pearce R, Omenn GS, Zhang Y. Identifying the Zoonotic Origin of SARS-CoV-2 by Modeling the Binding Affinity between the Spike Receptor-Binding Domain and Host ACE2. J Proteome Res 2020; 19:4844-4856. [PMID: 33175551 PMCID: PMC7770890 DOI: 10.1021/acs.jproteome.0c00717] [Citation(s) in RCA: 21] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2020] [Indexed: 12/14/2022]
Abstract
Despite considerable research progress on SARS-CoV-2, the direct zoonotic origin (intermediate host) of the virus remains ambiguous. The most definitive approach to identify the intermediate host would be the detection of SARS-CoV-2-like coronaviruses in wild animals. However, due to the high number of animal species, it is not feasible to screen all the species in the laboratory. Given that binding to ACE2 proteins is the first step for the coronaviruses to invade host cells, we propose a computational pipeline to identify potential intermediate hosts of SARS-CoV-2 by modeling the binding affinity between the Spike receptor-binding domain (RBD) and host ACE2. Using this pipeline, we systematically examined 285 ACE2 variants from mammals, birds, fish, reptiles, and amphibians, and found that the binding energies calculated for the modeled Spike-RBD/ACE2 complex structures correlated closely with the effectiveness of animal infection as determined by multiple experimental data sets. Built on the optimized binding affinity cutoff, we suggest a set of 96 mammals, including 48 experimentally investigated ones, which are permissive to SARS-CoV-2, with candidates from primates, rodents, and carnivores at the highest risk of infection. Overall, this work not only suggests a limited range of potential intermediate SARS-CoV-2 hosts for further experimental investigation, but also, more importantly, it proposes a new structure-based approach to general zoonotic origin and susceptibility analyses that are critical for human infectious disease control and wildlife protection.
Collapse
Affiliation(s)
- Xiaoqiang Huang
- Department of Computational Medicine and Bioinformatics, University of Michigan, 100 Washtenaw Avenue, Ann Arbor, MI 48109, USA
| | - Chengxin Zhang
- Department of Computational Medicine and Bioinformatics, University of Michigan, 100 Washtenaw Avenue, Ann Arbor, MI 48109, USA
| | - Robin Pearce
- Department of Computational Medicine and Bioinformatics, University of Michigan, 100 Washtenaw Avenue, Ann Arbor, MI 48109, USA
| | - Gilbert S. Omenn
- Department of Computational Medicine and Bioinformatics, University of Michigan, 100 Washtenaw Avenue, Ann Arbor, MI 48109, USA
| | - Yang Zhang
- Department of Computational Medicine and Bioinformatics, University of Michigan, 100 Washtenaw Avenue, Ann Arbor, MI 48109, USA
- Department of Biological Chemistry, University of Michigan, 100 Washtenaw Avenue, Ann Arbor, MI 48109, USA
| |
Collapse
|
38
|
Wang X, Zhuo C, Xiao X, Wang X, Docampo-Palacios M, Chen F, Dixon RA. Substrate Specificity of LACCASE8 Facilitates Polymerization of Caffeyl Alcohol for C-Lignin Biosynthesis in the Seed Coat of Cleome hassleriana. THE PLANT CELL 2020; 32:3825-3845. [PMID: 33037146 PMCID: PMC7721330 DOI: 10.1105/tpc.20.00598] [Citation(s) in RCA: 27] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/28/2020] [Revised: 09/24/2020] [Accepted: 10/06/2020] [Indexed: 05/02/2023]
Abstract
Catechyl lignin (C-lignin) is a linear homopolymer of caffeyl alcohol found in the seed coats of diverse plant species. Its properties make it a natural source of carbon fibers and high-value chemicals, but the mechanism of in planta polymerization of caffeyl alcohol remains unclear. In the ornamental plant Cleome hassleriana, lignin biosynthesis in the seed coat switches from guaiacyl lignin to C-lignin at ∼12 d after pollination. Here we found that the transcript profile of the laccase gene ChLAC8 parallels the accumulation of C-lignin during seed coat development. Recombinant ChLAC8 oxidizes caffeyl and sinapyl alcohols, generating their corresponding dimers or trimers in vitro, but cannot oxidize coniferyl alcohol. We propose a basis for this substrate preference based on molecular modeling/docking experiments. Suppression of ChLAC8 expression led to significantly reduced C-lignin content in the seed coats of transgenic Cleome plants. Feeding of 13C-caffeyl alcohol to the Arabidopsis (Arabidopsis thaliana) caffeic acid o-methyltransferase mutant resulted in no incorporation of 13C into C-lignin, but expressing ChLAC8 in this genetic background led to appearance of C-lignin with >40% label incorporation. These results indicate that ChLAC8 is required for C-lignin polymerization and determines lignin composition when caffeyl alcohol is available.
Collapse
Affiliation(s)
- Xin Wang
- BioDiscovery Institute and Department of Biological Sciences, University of North Texas, Denton, Texas 76203
- Key Laboratory of Biology and Genetic Improvement of Oil Crops, Ministry of Agriculture, Oil Crops Research Institute of the Chinese Academy of Agricultural Sciences, Wuhan 430062, China
| | - Chunliu Zhuo
- BioDiscovery Institute and Department of Biological Sciences, University of North Texas, Denton, Texas 76203
- Center for Bioenergy Innovation, Oak Ridge National Laboratory, Oak Ridge, Tennessee 37831
| | - Xirong Xiao
- BioDiscovery Institute and Department of Biological Sciences, University of North Texas, Denton, Texas 76203
- Center for Bioenergy Innovation, Oak Ridge National Laboratory, Oak Ridge, Tennessee 37831
| | - Xiaoqiang Wang
- BioDiscovery Institute and Department of Biological Sciences, University of North Texas, Denton, Texas 76203
| | - Maite Docampo-Palacios
- BioDiscovery Institute and Department of Biological Sciences, University of North Texas, Denton, Texas 76203
- Center for Bioenergy Innovation, Oak Ridge National Laboratory, Oak Ridge, Tennessee 37831
| | - Fang Chen
- BioDiscovery Institute and Department of Biological Sciences, University of North Texas, Denton, Texas 76203
- Center for Bioenergy Innovation, Oak Ridge National Laboratory, Oak Ridge, Tennessee 37831
| | - Richard A Dixon
- BioDiscovery Institute and Department of Biological Sciences, University of North Texas, Denton, Texas 76203
- Center for Bioenergy Innovation, Oak Ridge National Laboratory, Oak Ridge, Tennessee 37831
| |
Collapse
|
39
|
Cowley JA. The genomes of Mourilyan virus and Wēnzhōu shrimp virus 1 of prawns comprise 4 RNA segments. Virus Res 2020; 292:198225. [PMID: 33181202 DOI: 10.1016/j.virusres.2020.198225] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2020] [Revised: 11/02/2020] [Accepted: 11/03/2020] [Indexed: 12/16/2022]
Abstract
Reported here is the complete genome sequence of Mourilyan virus (MoV) that infects giant tiger (Penaeus monodon) and kuruma prawns (P. japonicas) in Australia. Its genome was determined using various PCR strategies based on the sequences of 3 randomly-amplified cDNA clones to its L and M RNA segments discovered in a library generated to determine the genome sequence of gill-associated ronivirus. The sequences of PCR products and clones obtained showed the MoV genome to comprise 4 ssRNA segments (L, M, S1 and S2), as confirmed by Northern blotting using RNA from naïve and MoV-infected prawns, and by Illumina sequence analysis of semi-purified MoV. BLASTn searches identified the MoV L, M and S1 RNA segments to be homologous to Wēnzhōu shrimp virus 1 (WzSV1) segments discovered recently in a P. monodon RNA-Seq library (SRR1745808). Mapping this read library to the MoV S2 RNA segment identified WzSV1 to also possess an equivalent segment. BLASTp searches identified the putative non-structural protein (NSs2; 393-394 aa) encoded in their S2 RNA segments to have no homologs in GenBank. Possibly due to NSs2 being encoded in a discrete RNA segment rather than in ambisense relative to the N protein as in the S RNA segments of other phenuiviruses, each of 6 MoV S1 RNA segment clones sequenced possessed a variable-length (≤ 645 nt) imperfect GA-repeat extending from the N protein stop codon to the more variable ∼90 nt segment terminal sequence. Read mapping of RNA-Seq library SRR1745808 showed the WzSV1 S1 RNA segment to possess a similar GA-repeat. However, paired-read variations hindered definitive assembly of a consensus sequence. All 4 MoV and WzSV1 RNA segments terminated with a 10 nt inverted repeat sequence (5'-ACACAAAGAC.) identical to the RNA segment termini of uukuviruses. Phylogenetic analyses of MoV/WzSV1 RNA-dependant RNA polymerase (L RNA), G1G2 precursor glycoprotein (M RNA) and nucleocapsid (N) protein (S1 RNA) sequences generally clustered them with as yet unassigned crustacean/diptera bunya-like viruses on branches positioned closely to others containing tick-transmitted phenuiviruses. As genome sequences of most phenuiviruses discovered recently have originated from meta-transcriptomics studies, the data presented here showing the MoV and WzSV1 genomes to comprise more than 3 RNA segments, like the plant tenuiviruses, suggests a need to investigate the genomes of these unassigned viruses more closely.
Collapse
Affiliation(s)
- Jeff A Cowley
- Livestock & Aquaculture, CSIRO Agriculture & Food, Queensland Bioscience Precinct, 306 Carmody Road, St. Lucia, QLD, 4067, Australia.
| |
Collapse
|
40
|
Invertases in Phytophthora infestans Localize to Haustoria and Are Programmed for Infection-Specific Expression. mBio 2020; 11:mBio.01251-20. [PMID: 33051363 PMCID: PMC7554665 DOI: 10.1128/mbio.01251-20] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/21/2023] Open
Abstract
The oomycete Phytophthora infestans, the causal agent of potato and tomato blight, expresses two extracellular invertases. Unlike typical fungal invertases, the P. infestans genes are not sucrose induced or glucose repressed but instead appear to be under developmental control. Transcript levels of both genes were very low in mycelia harvested from artificial medium but high in preinfection stages (sporangia, zoospores, and germinated cysts), high during biotrophic growth in leaves and tubers, and low during necrotrophy. Genome-wide analyses of metabolic enzymes and effectors indicated that this expression profile was fairly unusual, matched only by a few other enzymes, such as carbonic anhydrases and a few RXLR effectors. Genes for other metabolic enzymes were typically downregulated in the preinfection stages. Overall metabolic gene expression during the necrotrophic stage of infection clustered with artificial medium, while the biotrophic phase formed a separate cluster. Confocal microscopy of transformants expressing green fluorescent protein (GFP) fusions indicated that invertase protein resided primarily in haustoria during infection. This localization was not attributable to haustorium-specific promoter activity. Instead, the N-terminal regions of proteins containing signal peptides were sufficient to deliver proteins to haustoria. Invertase expression during leaf infection was linked to a decline in apoplastic sucrose, consistent with a role of the enzymes in plant pathogenesis. This was also suggested by the discovery that invertase genes occur across multiple orders of oomycetes but not in most animal pathogens or a mycoparasite.IMPORTANCE Oomycetes cause hundreds of diseases in economically and environmentally significant plants. How these microbes acquire host nutrients is not well understood. Many oomycetes insert specialized hyphae called haustoria into plant cells, but unlike their fungal counterparts, a role in nutrition has remained unproven. The discovery that Phytophthora invertases localize to haustoria provides the first strong evidence that these structures participate in feeding. Since regions of proteins containing signal peptides targeted proteins to the haustorium-plant interface, haustoria appear to be the primary machinery for secreting proteins during biotrophic pathogenesis. Although oomycete invertases were acquired laterally from fungi, their expression patterns have adapted to the Phytophthora lifestyle by abandoning substrate-level regulation in favor of developmental control, allowing the enzymes to be produced in anticipation of plant colonization. This study highlights how a widely distributed hydrolytic enzyme has evolved new behaviors in oomycetes.
Collapse
|
41
|
Huang X, Zhang C, Pearce R, Omenn GS, Zhang Y. Identifying zoonotic origin of SARS-CoV-2 by modeling the binding affinity between Spike receptor-binding domain and host ACE2. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2020:2020.09.11.293449. [PMID: 32935105 PMCID: PMC7491519 DOI: 10.1101/2020.09.11.293449] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
Despite considerable research progress on SARS-CoV-2, the direct zoonotic origin (intermediate host) of the virus remains ambiguous. The most definitive approach to identify the intermediate host would be the detection of SARS-CoV-2-like coronaviruses in wild animals. However, due to the high number of animal species, it is not feasible to screen all the species in the laboratory. Given that the recognition of the binding ACE2 proteins is the first step for the coronaviruses to invade host cells, we proposed a computational pipeline to identify potential intermediate hosts of SARS-CoV-2 by modeling the binding affinity between the Spike receptor-binding domain (RBD) and host ACE2. Using this pipeline, we systematically examined 285 ACE2 variants from mammals, birds, fish, reptiles, and amphibians, and found that the binding energies calculated on the modeled Spike-RBD/ACE2 complex structures correlate closely with the effectiveness of animal infections as determined by multiple experimental datasets. Built on the optimized binding affinity cutoff, we suggested a set of 96 mammals, including 48 experimentally investigated ones, which are permissive to SARS-CoV-2, with candidates from primates, rodents, and carnivores at the highest risk of infection. Overall, this work not only suggested a limited range of potential intermediate SARS-CoV-2 hosts for further experimental investigation; but more importantly, it proposed a new structure-based approach to general zoonotic origin and susceptibility analyses that are critical for human infectious disease control and wildlife protection.
Collapse
Affiliation(s)
- Xiaoqiang Huang
- Department of Computational Medicine and Bioinformatics, University of Michigan, 100 Washtenaw Avenue, Ann Arbor, MI 48109, USA
| | - Chengxin Zhang
- Department of Computational Medicine and Bioinformatics, University of Michigan, 100 Washtenaw Avenue, Ann Arbor, MI 48109, USA
| | - Robin Pearce
- Department of Computational Medicine and Bioinformatics, University of Michigan, 100 Washtenaw Avenue, Ann Arbor, MI 48109, USA
| | - Gilbert S. Omenn
- Department of Computational Medicine and Bioinformatics, University of Michigan, 100 Washtenaw Avenue, Ann Arbor, MI 48109, USA
| | - Yang Zhang
- Department of Computational Medicine and Bioinformatics, University of Michigan, 100 Washtenaw Avenue, Ann Arbor, MI 48109, USA
- Department of Biological Chemistry, University of Michigan, 100 Washtenaw Avenue, Ann Arbor, MI 48109, USA
| |
Collapse
|
42
|
Khodadad N, Seyedian SS, Moattari A, Biparva Haghighi S, Pirmoradi R, Abbasi S, Makvandi M. In silico functional and structural characterization of hepatitis B virus PreS/S-gene in Iranian patients infected with chronic hepatitis B virus genotype D. Heliyon 2020; 6:e04332. [PMID: 32695898 PMCID: PMC7365991 DOI: 10.1016/j.heliyon.2020.e04332] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2019] [Revised: 01/23/2020] [Accepted: 06/24/2020] [Indexed: 12/20/2022] Open
Abstract
Objective Chronic hepatitis B (CHB) virus infection is the most prevalent chronic liver disease and has become a serious threat to human health. In this study, we attempted to specify and predict several properties including physicochemical, mutation sites, B-cell epitopes, phosphorylation sites, N-link, O-link glycosylation sites, and protein structures of S protein isolated from Ahvaz. Materials and methods Initially, hepatitis B virus DNA (HBV DNA) was extracted from five sera samples of untreated chronic hepatitis B patients. The full-length HBV genomes were amplified and then cloned in pTZ57 R/T vector. The full sequences of HBV were registered in the GenBank with accessions numbers (MK355500), (MK355501) and (MK693107-9). PROTSCALE, Expasy's ProtParam, immuneepitope, ABCpred, BcePred, Bepipred, Algpred, VaxiJen, SCRATCH, DiANNA, plus a number of online analytical processing tools were used to analyse and predict the preS/S gene of genotype D sequences. The present study is the first analytical research on samples obtained from Ahvaz. Results We found major hydrophilic region (MHR) mutations at "a" determining region that included K122R, N131T, F134Y, P142L, and T126N mutations. Moreover, Ahvaz sequences revealed four sites (4, 112, 166, and 309) in the preS/S gene for N-glycosylation that could possibly be a potential target for anti-HBV therapy. Conclusion In the present study, mutations were identified at positions T113S and N131T within the MHR region of S protein; these mutations can potentially decrease the effect of hepatitis B vaccination in vaccine recipients.
Collapse
Affiliation(s)
- Nastaran Khodadad
- Infectious and Tropical Disease Research Center, Health Research Institute, and Department of Virology, Ahvaz Jundishapur University of Medical Sciences, Ahvaz, Iran
| | - Seyed Saeed Seyedian
- Alimentary Tract Research Center, Ahvaz Jundishapur University of Medical Sciences, Ahvaz, Iran
| | - Afagh Moattari
- Department of Bacteriology and Virology, Shiraz University of Medical Sciences, Shiraz, Iran
| | - Somayeh Biparva Haghighi
- Department of General Courses, School of Medicine, Ahvaz Jundishapur University of Medical Sciences, Ahvaz, Iran
| | - Roya Pirmoradi
- Infectious and Tropical Disease Research Center, Health Research Institute, and Department of Virology, Ahvaz Jundishapur University of Medical Sciences, Ahvaz, Iran
| | | | - Manoochehr Makvandi
- Infectious and Tropical Disease Research Center, Health Research Institute, and Department of Virology, Ahvaz Jundishapur University of Medical Sciences, Ahvaz, Iran
| |
Collapse
|
43
|
Interleukin 34 Serves as a Novel Molecular Adjuvant against Nocardia Seriolae Infection in Largemouth Bass ( Micropterus Salmoides). Vaccines (Basel) 2020; 8:vaccines8020151. [PMID: 32231137 PMCID: PMC7349345 DOI: 10.3390/vaccines8020151] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2020] [Revised: 03/26/2020] [Accepted: 03/26/2020] [Indexed: 02/06/2023] Open
Abstract
DNA vaccines have been widely employed in controlling viral and bacterial infections in mammals and teleost fish. Co-injection of molecular adjuvants, including chemokines, cytokines, and immune co-stimulatory molecules, is one of the potential strategies used to improve DNA vaccine efficacy. In mammals and teleost fish, interleukin-34 (IL-34) had been described as a multifunctional cytokine and its immunological role had been confirmed; however, the adjuvant capacity of IL-34 remains to be elucidated. In this study, IL-34 was identified in largemouth bass. A recombinant plasmid of IL-34 (pcIL-34) was constructed and co-administered with a DNA vaccine encoding hypoxic response protein 1 (Hrp1; pcHrp1) to evaluate the adjuvant capacity of pcIL-34 against Nocardia seriolae infection. Our results indicated that pcIL-34 co-injected with pcHrp1 not only triggered innate immunity and a specific antibody response, but also enhanced the mRNA expression level of immune-related genes encoding for cytokines, chemokines, and humoral and cell-mediated immunity. Moreover, pcIL-34 enhanced the protection of pcHrp1 against N. seriolae challenge and conferred the relative percent survival of 82.14%. Collectively, IL-34 is a promising adjuvant in a DNA vaccine against nocardiosis in fish.
Collapse
|
44
|
Structural, glycosylation and antigenic variation between 2019 novel coronavirus (2019-nCoV) and SARS coronavirus (SARS-CoV). Virusdisease 2020; 31:13-21. [PMID: 32206694 PMCID: PMC7085496 DOI: 10.1007/s13337-020-00571-5] [Citation(s) in RCA: 146] [Impact Index Per Article: 29.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2020] [Accepted: 02/21/2020] [Indexed: 12/24/2022] Open
Abstract
The emergence of 2019 novel coronavirus (2019-nCoV) is of global concern and might have emerged from RNA recombination among existing coronaviruses. CoV spike (S) protein which is crucial for receptor binding, membrane fusion via conformational changes, internalization of the virus, host tissue tropism and comprises crucial targets for vaccine development, remain largely uncharacterized. Therefore, the present study has been planned to determine the sequence variation, structural and antigenic divergence of S glycoprotein which may be helpful for the management of 2019-nCoV infection. The sequences of spike glycoprotein of 2019-nCoV and SARS coronavirus (SARS-CoV) were used for the comparison. The sequence variations were determined using EMBOSS Needle pairwise sequence alignment tools. The variation in glycosylation sites was predicted by NetNGlyc 1.0 and validated by N-GlyDE server. Antigenicity was predicted by NetCTL 1.2 and validated by IEDB Analysis Resource server. The structural divergence was determined by using SuperPose Version 1.0 based on cryo-EM structure of the SARS coronavirus spike glycoprotein. Our data suggests that 2019-nCoV is newly spilled coronavirus into humans in China is closely related to SARS-CoV, which has only 12.8% of difference with SARS-CoV in S protein and has 83.9% similarity in minimal receptor-binding domain with SARS-CoV. Addition of a novel glycosylation sites were observed in 2019-nCoV. In addition, antigenic analysis proposes that great antigenic differences exist between both the viral strains, but some of the epitopes were found to be similar between both the S proteins. In spite of the variation in S protein amino acid composition, we found no significant difference in their structures. Collectively, for the first time our results exhibit the emergence of human 2019-nCoV is closely related to predecessor SARS-CoV and provide the evidence that 2019-nCoV uses various novel glycosylation sites as SARS-CoV and may have a potential to become pandemic owing its antigenic discrepancy. Further, demonstration of novel Cytotoxic T lymphocyte epitopes may impart opportunities for the development of peptide based vaccine for the prevention of 2019-nCoV.
Collapse
|