1
|
D'Incà R, Mattioli R, Tomasella M, Tavazza R, Macone A, Incocciati A, Martignago D, Polticelli F, Fraudentali I, Cona A, Angelini R, Tavazza M, Nardini A, Tavladoraki P. A Solanum lycopersicum polyamine oxidase contributes to the control of plant growth, xylem differentiation, and drought stress tolerance. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2024. [PMID: 38761363 DOI: 10.1111/tpj.16809] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/02/2023] [Revised: 04/26/2024] [Accepted: 05/03/2024] [Indexed: 05/20/2024]
Abstract
Polyamines are involved in several plant physiological processes. In Arabidopsis thaliana, five FAD-dependent polyamine oxidases (AtPAO1 to AtPAO5) contribute to polyamine homeostasis. AtPAO5 catalyzes the back-conversion of thermospermine (T-Spm) to spermidine and plays a role in plant development, xylem differentiation, and abiotic stress tolerance. In the present study, to verify whether T-Spm metabolism can be exploited as a new route to improve stress tolerance in crops and to investigate the underlying mechanisms, tomato (Solanum lycopersicum) AtPAO5 homologs were identified (SlPAO2, SlPAO3, and SlPAO4) and CRISPR/Cas9-mediated loss-of-function slpao3 mutants were obtained. Morphological, molecular, and physiological analyses showed that slpao3 mutants display increased T-Spm levels and exhibit changes in growth parameters, number and size of xylem elements, and expression levels of auxin- and gibberellin-related genes compared to wild-type plants. The slpao3 mutants are also characterized by improved tolerance to drought stress, which can be attributed to a diminished xylem hydraulic conductivity that limits water loss, as well as to a reduced vulnerability to embolism. Altogether, this study evidences conservation, though with some significant variations, of the T-Spm-mediated regulatory mechanisms controlling plant growth and differentiation across different plant species and highlights the T-Spm role in improving stress tolerance while not constraining growth.
Collapse
Affiliation(s)
- Riccardo D'Incà
- Department of Science, University Roma Tre, 00146, Rome, Italy
| | | | - Martina Tomasella
- Dipartimento di Scienze della Vita, Università di Trieste, Trieste, Italy
| | - Raffaela Tavazza
- Italian National Agency for New Technologies, Energy and Sustainable Economic Development (ENEA), BIOAG-BIOTEC C.R. Casaccia, Rome, Italy
| | - Alberto Macone
- Department of Biochemical Sciences 'A. Rossi Fanelli', Sapienza University of Rome, Rome, Italy
| | - Alessio Incocciati
- Department of Biochemical Sciences 'A. Rossi Fanelli', Sapienza University of Rome, Rome, Italy
| | | | - Fabio Polticelli
- Department of Science, University Roma Tre, 00146, Rome, Italy
- National Institute of Nuclear Physics, Roma Tre Section, 00146, Rome, Italy
| | | | - Alessandra Cona
- Department of Science, University Roma Tre, 00146, Rome, Italy
- Istituto Nazionale Biostrutture e Biosistemi (INBB), Rome, Italy
| | - Riccardo Angelini
- Department of Science, University Roma Tre, 00146, Rome, Italy
- Istituto Nazionale Biostrutture e Biosistemi (INBB), Rome, Italy
- NBFC, National Biodiversity Future Center, Palermo, Italy
| | - Mario Tavazza
- Italian National Agency for New Technologies, Energy and Sustainable Economic Development (ENEA), BIOAG-BIOTEC C.R. Casaccia, Rome, Italy
| | - Andrea Nardini
- Dipartimento di Scienze della Vita, Università di Trieste, Trieste, Italy
| | - Paraskevi Tavladoraki
- Department of Science, University Roma Tre, 00146, Rome, Italy
- Istituto Nazionale Biostrutture e Biosistemi (INBB), Rome, Italy
| |
Collapse
|
2
|
Li H, Qian X, Mohanram H, Han X, Qi H, Zou G, Yuan F, Miserez A, Liu T, Yang Q, Gao H, Yu J. Self-assembly of peptide nanocapsules by a solvent concentration gradient. NATURE NANOTECHNOLOGY 2024:10.1038/s41565-024-01654-w. [PMID: 38671050 DOI: 10.1038/s41565-024-01654-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/19/2023] [Accepted: 03/12/2024] [Indexed: 04/28/2024]
Abstract
Biological systems can create materials with intricate structures and specialized functions. In comparison, precise control of structures in human-made materials has been challenging. Here we report on insect cuticle peptides that spontaneously form nanocapsules through a single-step solvent exchange process, where the concentration gradient resulting from the mixing of water and acetone drives the localization and self-assembly of the peptides into hollow nanocapsules. The underlying driving force is found to be the intrinsic affinity of the peptides for a particular solvent concentration, while the diffusion of water and acetone creates a gradient interface that triggers peptide localization and self-assembly. This gradient-mediated self-assembly offers a transformative pathway towards simple generation of drug delivery systems based on peptide nanocapsules.
Collapse
Affiliation(s)
- Haopeng Li
- School of Materials Science and Engineering, Nanyang Technological University, Singapore, Singapore
- MOE Key Laboratory of Bio-Intelligent Manufacturing, School of Bioengineering, Dalian University of Technology, Dalian, China
| | - Xuliang Qian
- School of Mechanical and Aerospace Engineering, Nanyang Technological University, Singapore, Singapore
| | - Harini Mohanram
- School of Biological Sciences, Division of Structural and Computational Biology, Nanyang Technological University, Singapore, Singapore
| | - Xiao Han
- School of Materials Science and Engineering, Nanyang Technological University, Singapore, Singapore
| | - Huitang Qi
- MOE Key Laboratory of Bio-Intelligent Manufacturing, School of Bioengineering, Dalian University of Technology, Dalian, China
| | - Guijin Zou
- School of Mechanical and Aerospace Engineering, Nanyang Technological University, Singapore, Singapore
- Institute of High Performance Computing, A*STAR, Singapore, Singapore
| | - Fenghou Yuan
- MOE Key Laboratory of Bio-Intelligent Manufacturing, School of Bioengineering, Dalian University of Technology, Dalian, China
| | - Ali Miserez
- School of Materials Science and Engineering, Nanyang Technological University, Singapore, Singapore
- Biological and Biomimetic Material Laboratory (BBML), Center for Sustainable Materials (SusMat), School of Materials Science and Engineering, Nanyang Technological University, Singapore, Singapore
| | - Tian Liu
- MOE Key Laboratory of Bio-Intelligent Manufacturing, School of Bioengineering, Dalian University of Technology, Dalian, China.
| | - Qing Yang
- State Key Laboratory for Biology of Plant Diseases and Insect Pests, Institute of Plant Protection, Chinese Academy of Agricultural Sciences, Beijing, China.
- Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen, China.
| | - Huajian Gao
- School of Mechanical and Aerospace Engineering, Nanyang Technological University, Singapore, Singapore.
- Institute of High Performance Computing, A*STAR, Singapore, Singapore.
- Mechano-X Institute, Applied Mechanics Laboratory, Department of Engineering Mechanics, Tsinghua University, Beijing, China.
| | - Jing Yu
- School of Materials Science and Engineering, Nanyang Technological University, Singapore, Singapore.
- Institute for Digital Molecular Analytics and Science, Nanyang Technological University, Singapore, Singapore.
| |
Collapse
|
3
|
Machulin AV, Deryusheva EI, Galzitskaya OV. Variation in base composition, structure-function relationships, and origins of structural repetition in bacterial rpsA gene. Biosystems 2024; 238:105196. [PMID: 38537772 DOI: 10.1016/j.biosystems.2024.105196] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2023] [Revised: 03/22/2024] [Accepted: 03/22/2024] [Indexed: 04/12/2024]
Abstract
Protein domain repeats are known to arise due to tandem duplications of internal genes. However, the understanding of the underlying mechanisms of this process is incomplete. The goal of this work was to investigate the mechanism of occurrence of repeat expansion based on studying the sequences of 1324 rpsA genes of bacterial S1 ribosomal proteins containing different numbers of S1 structural domains. The rpsA gene encodes ribosomal S1 protein, which is essential for cell viability as it interacts with both mRNA and proteins. Gene ontology (GO) analysis of S1 domains in ribosomal S1 proteins revealed that bacterial protein sequences in S1 mainly have 3 types of molecular functions: RNA binding activity, nucleic acid activity, and ribosome structural component. Our results show that the maximum value of rpsA gene identity for full-length proteins was found for S1 proteins containing six structural domains (58%). Analysis of consensus sequences showed that parts of the rpsA gene encoding separate S1 domains have no a strictly repetitive structure between groups containing different numbers of S1 domains. At the same time, gene regions encoding some conserved residues that form the RNA-binding site remain conserved. The detected phylogenetic similarity suggests that the proposed fold of the rpsA translation initiation region of Escherichia coli has functional value and is important for translational control of rpsA gene expression in other bacterial phyla, but not only in gamma Proteobacteria.
Collapse
Affiliation(s)
- Andrey V Machulin
- Skryabin Institute of Biochemistry and Physiology of Microorganisms, Russian Academy of Sciences, Federal Research Center "Pushchino Scientific Center for Biological Research of the Russian Academy of Sciences", 142290, Pushchino, Moscow Region, Russia
| | - Evgeniya I Deryusheva
- Institute for Biological Instrumentation, Federal Research Center "Pushchino Scientific Center for Biological Research of the Russian Academy of Sciences", 142290, Pushchino, Moscow Region, Russia
| | - Oxana V Galzitskaya
- Institute of Protein Research, Russian Academy of Sciences, 142290, Pushchino, Moscow Region, Russia; Institute of Theoretical and Experimental Biophysics, Russian Academy of Sciences, 142290, Pushchino, Moscow Region, Russia.
| |
Collapse
|
4
|
Mac Donagh J, Marchesini A, Spiga A, Fallico MJ, Arrías PN, Monzon AM, Vagiona AC, Gonçalves-Kulik M, Mier P, Andrade-Navarro MA. Structured Tandem Repeats in Protein Interactions. Int J Mol Sci 2024; 25:2994. [PMID: 38474241 DOI: 10.3390/ijms25052994] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2024] [Revised: 02/28/2024] [Accepted: 03/01/2024] [Indexed: 03/14/2024] Open
Abstract
Tandem repeats (TRs) in protein sequences are consecutive, highly similar sequence motifs. Some types of TRs fold into structural units that pack together in ensembles, forming either an (open) elongated domain or a (closed) propeller, where the last unit of the ensemble packs against the first one. Here, we examine TR proteins (TRPs) to see how their sequence, structure, and evolutionary properties favor them for a function as mediators of protein interactions. Our observations suggest that TRPs bind other proteins using large, structured surfaces like globular domains; in particular, open-structured TR ensembles are favored by flexible termini and the possibility to tightly coil against their targets. While, intuitively, open ensembles of TRs seem prone to evolve due to their potential to accommodate insertions and deletions of units, these evolutionary events are unexpectedly rare, suggesting that they are advantageous for the emergence of the ancestral sequence but are early fixed. We hypothesize that their flexibility makes it easier for further proteins to adapt to interact with them, which would explain their large number of protein interactions. We provide insight into the properties of open TR ensembles, which make them scaffolds for alternative protein complexes to organize genes, RNA and proteins.
Collapse
Affiliation(s)
- Juan Mac Donagh
- Science and Technology Department, National University of Quilmes, Bernal B1876, Argentina
- National Scientific and Technical Research Council (CONICET), Buenos Aires C1033AAJ, Argentina
| | - Abril Marchesini
- National Scientific and Technical Research Council (CONICET), Buenos Aires C1033AAJ, Argentina
- Biotechnology and Molecular Biology Institute (IBBM, UNLP-CONICET), Faculty of Exact Sciences, University of La Plata, La Plata 1900, Argentina
| | - Agostina Spiga
- Science and Technology Department, National University of Quilmes, Bernal B1876, Argentina
- National Scientific and Technical Research Council (CONICET), Buenos Aires C1033AAJ, Argentina
| | - Maximiliano José Fallico
- Laboratory of Bioactive Compound Research and Development, Faculty of Exact Sciences, University of La Plata, La Plata 1900, Argentina
| | - Paula Nazarena Arrías
- Department of Biomedical Sciences, University of Padova, Via U. Bassi 58/b, 35121 Padova, Italy
| | - Alexander Miguel Monzon
- Department of Information Engineering, University of Padova, Via Giovanni Gradenigo 6/B, 35131 Padova, Italy
| | - Aimilia-Christina Vagiona
- Institute of Organismic and Molecular Evolution, Faculty of Biology, Johannes Gutenberg University, Hans-Dieter-Hüsch-Weg 15, 55128 Mainz, Germany
| | - Mariane Gonçalves-Kulik
- Institute of Organismic and Molecular Evolution, Faculty of Biology, Johannes Gutenberg University, Hans-Dieter-Hüsch-Weg 15, 55128 Mainz, Germany
| | - Pablo Mier
- Institute of Organismic and Molecular Evolution, Faculty of Biology, Johannes Gutenberg University, Hans-Dieter-Hüsch-Weg 15, 55128 Mainz, Germany
| | - Miguel A Andrade-Navarro
- Institute of Organismic and Molecular Evolution, Faculty of Biology, Johannes Gutenberg University, Hans-Dieter-Hüsch-Weg 15, 55128 Mainz, Germany
| |
Collapse
|
5
|
Bowhay CR, Hanington PC. Animal granulins: In the GRN scheme of things. DEVELOPMENTAL AND COMPARATIVE IMMUNOLOGY 2024; 152:105115. [PMID: 38101714 DOI: 10.1016/j.dci.2023.105115] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/12/2023] [Revised: 12/06/2023] [Accepted: 12/10/2023] [Indexed: 12/17/2023]
Abstract
Granulins are conserved in nearly all metazoans, with an intriguing loss in insects. These pleiotropic peptides are involved in numerous physiological and pathological processes yet have been overwhelmingly examined in mammalian systems. While work in other animal models has been informative, a richer understanding of the proteins should be obtained by integrating knowledge from all available contexts. The main bodies of work described here include 1) the structure-function relationships of progranulin and its cleavage products, 2) the role of expanded granulin gene families and different isoforms in fish immunology, 3) the release of granulin peptides to promote host angiogenesis by parasitic worms, 4) a diversity of molluscan uses for granulins, including immune activation in intermediate hosts to trematodes, 5) knowledge gained on lysosomal functions from C. elegans and the stress-related activities of granulins. We provide an overview of functional reports across the Metazoa to inform much-needed future research.
Collapse
Affiliation(s)
- Christina R Bowhay
- Department of Biological Sciences, University of Alberta, Edmonton, AB, T6G 2R3, Canada
| | - Patrick C Hanington
- School of Public Health, University of Alberta, Edmonton, AB, T6G 2R3, Canada.
| |
Collapse
|
6
|
Ham H, Park DS. New Insights and Approach Toward the Genetic Diversity and Strain Typing of Erwinia pyrifoliae Based on rsxC, an Electron Transport Gene. PLANT DISEASE 2024; 108:296-301. [PMID: 37669173 DOI: 10.1094/pdis-03-23-0475-sc] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/07/2023]
Abstract
Erwinia pyrifoliae, a causal agent of black shoot blight in apple and pear trees, is a plant pathogenic bacterium first reported in South Korea. The symptoms of black shoot blight are very similar to those of the fire blight disease in apple and pear trees caused by E. amylovora, as E. pyrifoliae has a genetically very close relationship with E. amylovora. Recently, there have been reports that E. pyrifoliae causes disease in European strawberries, resulting in severe fruit loss that aroused great concern about its spread, distribution, and host range. Therefore, it is essential to establish a trustworthy approach to understanding the distribution patterns of E. pyrifoliae based on the genetic background to strengthen the barrier of potential spreading risks, although advanced methods have been provided to accurately detect E. pyrifoliae and E. amylovora. Consequently, this study discovered a noble and noteworthy gene, rsxC, capable of providing the pathogen genotype by comparing E. pyrifoliae genomic sequences in the international representative genome archive. Different numbers of 40-unit amino acid repeats in this gene among the strains induced intraspecific traits in RsxC. By comparing their repeat pattern, E. pyrifoliae isolates were divided into two main groups, branching into several clades via sequence alignment of 35 E. pyrifoliae isolates from various apple orchards from 2020 to 2021 in South Korea. The newly discovered quadraginta amino acid repeat within this gene would be a valuable genetic touchstone for determining the genotype and distribution pattern of E. pyrifoliae strains, ultimately leading to exploring their evolution. The function of amino acid repeats and the biological significance of strains need to be elucidated further.
Collapse
Affiliation(s)
- Hyeonheui Ham
- Crop Protection Division, National Institute of Agricultural Sciences, Wanju-gun 55365, Republic of Korea
| | - Dong Suk Park
- Crop Protection Division, National Institute of Agricultural Sciences, Wanju-gun 55365, Republic of Korea
| |
Collapse
|
7
|
Ventura C, Banerjee A, Zacharopoulou M, Itzhaki LS, Bahar I. Tandem-repeat proteins conformational mechanics are optimized to facilitate functional interactions and complexations. Curr Opin Struct Biol 2024; 84:102744. [PMID: 38134536 DOI: 10.1016/j.sbi.2023.102744] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2023] [Revised: 10/30/2023] [Accepted: 11/24/2023] [Indexed: 12/24/2023]
Abstract
The architectures of tandem-repeat proteins are distinct from those of globular proteins. Individual modules, each comprising small structural motifs of 20-40 residues, are arrayed in a quasi one-dimensional fashion to form striking, elongated, horseshoe-like, and superhelical architectures, stabilized solely by short-range interaction. The spring-like shapes of repeat arrays point to elastic modes of action, and these proteins function as adapter molecules or 'hubs,' propagating signals within multi-subunit assemblies in diverse biological contexts. This flexibility is apparent in the dramatic variability observed in the structures of tandem-repeat proteins in different complexes. Here, using computational analysis, we demonstrate the striking ability of just one or a few global motions to recapitulate these structures. These findings show how the mechanics of repeat arrays are robustly enabled by their unique architecture. Thus, the repeating architecture has been optimized by evolution to favor functional modes of motions. The global motions enabling functional transitions can be fully visualized at http://bahargroup.org/tr_web.
Collapse
Affiliation(s)
- Carlos Ventura
- Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, NY, 11794, USA; Department of Chemistry, Stony Brook University, Stony Brook, NY, 11794, USA
| | - Anupam Banerjee
- Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, NY, 11794, USA
| | - Maria Zacharopoulou
- Department of Pharmacology, University of Cambridge, Cambridge, CB2 1PD, UK. https://twitter.com/maria_zach_
| | - Laura S Itzhaki
- Department of Pharmacology, University of Cambridge, Cambridge, CB2 1PD, UK.
| | - Ivet Bahar
- Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, NY, 11794, USA; Department of Biochemistry and Cell Biology, Stony Brook University, Stony Brook, NY, 11794, USA.
| |
Collapse
|
8
|
Bezerra-Brandao M, Tunque Cahui RR, Hirsh L. Daisy: An integrated repeat protein curation service. J Struct Biol 2023; 215:108033. [PMID: 37797915 DOI: 10.1016/j.jsb.2023.108033] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2023] [Revised: 09/20/2023] [Accepted: 10/02/2023] [Indexed: 10/07/2023]
Abstract
Tandem repeats in proteins identification, classification and curation is a complex process that requires manual processing from experts, processing power and time. There are recent and relevant advances applying machine learning for protein structure prediction and repeat classification that are useful for this process. However, no service contemplates required databases and software to supplement researching on repeat proteins. In this publication we present Daisy, an integrated repeat protein curation web service. This service can process Protein Data Bank (PDB) and the AlphaFold Database entries for tandem repeats identification. In addition, it uses an algorithm to search a sequence against a library of Pfam hidden Markov model (HMM). Repeat classifications are associated with the identified families through RepeatsDB. This prediction is considered for enhancing the ReUPred algorithm execution and hastening the repeat units identification process. The service can also operate every associated PDB and AlphaFold structure with a UniProt proteome registry. Availability: The Daisy web service is freely accessible at daisy.bioinformatica.org.
Collapse
Affiliation(s)
| | | | - Layla Hirsh
- Department of Engineering, Pontifical Catholic University of Peru, Lima 32, Peru.
| |
Collapse
|
9
|
Bethel NP, Borst AJ, Parmeggiani F, Bick MJ, Brunette TJ, Nguyen H, Kang A, Bera AK, Carter L, Miranda MC, Kibler RD, Lamb M, Li X, Sankaran B, Baker D. Precisely patterned nanofibres made from extendable protein multiplexes. Nat Chem 2023; 15:1664-1671. [PMID: 37667012 PMCID: PMC10695826 DOI: 10.1038/s41557-023-01314-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2023] [Accepted: 08/04/2023] [Indexed: 09/06/2023]
Abstract
Molecular systems with coincident cyclic and superhelical symmetry axes have considerable advantages for materials design as they can be readily lengthened or shortened by changing the length of the constituent monomers. Among proteins, alpha-helical coiled coils have such symmetric, extendable architectures, but are limited by the relatively fixed geometry and flexibility of the helical protomers. Here we describe a systematic approach to generating modular and rigid repeat protein oligomers with coincident C2 to C8 and superhelical symmetry axes that can be readily extended by repeat propagation. From these building blocks, we demonstrate that a wide range of unbounded fibres can be systematically designed by introducing hydrophilic surface patches that force staggering of the monomers; the geometry of such fibres can be precisely tuned by varying the number of repeat units in the monomer and the placement of the hydrophilic patches.
Collapse
Affiliation(s)
- Neville P Bethel
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- Howard Hughes Medical Institute, University of Washington, Seattle, WA, USA
| | - Andrew J Borst
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Fabio Parmeggiani
- School of Chemistry, University of Bristol, Bristol, UK
- School of Biochemistry, University of Bristol, Bristol, UK
- Bristol Biodesign Institute, University of Bristol, Bristol, UK
| | - Matthew J Bick
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - T J Brunette
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Hannah Nguyen
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Alex Kang
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Asim K Bera
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Lauren Carter
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Marcos C Miranda
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Ryan D Kibler
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Mila Lamb
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Xinting Li
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Banumathi Sankaran
- Berkeley Center for Structural Biology, Molecular Biophysics and Integrated Bioimaging, Lawrence Berkeley Laboratory, Berkeley, CA, USA
| | - David Baker
- Department of Biochemistry, University of Washington, Seattle, WA, USA.
- Institute for Protein Design, University of Washington, Seattle, WA, USA.
- Howard Hughes Medical Institute, University of Washington, Seattle, WA, USA.
| |
Collapse
|
10
|
Monzon AM, Arrías PN, Elofsson A, Mier P, Andrade-Navarro MA, Bevilacqua M, Clementel D, Bateman A, Hirsh L, Fornasari MS, Parisi G, Piovesan D, Kajava AV, Tosatto SCE. A STRP-ed definition of Structured Tandem Repeats in Proteins. J Struct Biol 2023; 215:108023. [PMID: 37652396 DOI: 10.1016/j.jsb.2023.108023] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2023] [Revised: 07/31/2023] [Accepted: 08/28/2023] [Indexed: 09/02/2023]
Abstract
Tandem Repeat Proteins (TRPs) are a class of proteins with repetitive amino acid sequences that have been studied extensively for over two decades. Different features at the level of sequence, structure, function and evolution have been attributed to them by various authors. And yet many of its salient features appear only when looking at specific subclasses of protein tandem repeats. Here, we attempt to rationalize the existing knowledge on Tandem Repeat Proteins (TRPs) by pointing out several dichotomies. The emerging picture is more nuanced than generally assumed and allows us to draw some boundaries of what is not a "proper" TRP. We conclude with an operational definition of a specific subset, which we have denominated STRPs (Structural Tandem Repeat Proteins), which separates a subclass of tandem repeats with distinctive features from several other less well-defined types of repeats. We believe that this definition will help researchers in the field to better characterize the biological meaning of this large yet largely understudied group of proteins.
Collapse
Affiliation(s)
- Alexander Miguel Monzon
- Dept. of Information Engineering, University of Padova, via Giovanni Gradenigo 6/B, 35131 Padova, Italy
| | - Paula Nazarena Arrías
- Dept. of Biomedical Sciences, University of Padova, via U. Bassi 58/b, 35121 Padova, Italy
| | - Arne Elofsson
- Dept. of Biochemistry and Biophysics and Science for Life Laboratory, Stockholm University, Tomtebodavägen 23, 171 21 Solna, Sweden
| | - Pablo Mier
- Institute of Organismic and Molecular Evolution, Faculty of Biology, Johannes Gutenberg University of Mainz, Hanns-Dieter-Hüsch-Weg 15, 55128 Mainz, Germany
| | - Miguel A Andrade-Navarro
- Institute of Organismic and Molecular Evolution, Faculty of Biology, Johannes Gutenberg University of Mainz, Hanns-Dieter-Hüsch-Weg 15, 55128 Mainz, Germany
| | - Martina Bevilacqua
- Dept. of Biomedical Sciences, University of Padova, via U. Bassi 58/b, 35121 Padova, Italy
| | - Damiano Clementel
- Dept. of Biomedical Sciences, University of Padova, via U. Bassi 58/b, 35121 Padova, Italy
| | - Alex Bateman
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| | - Layla Hirsh
- Dept. of Engineering, Faculty of Science and Engineering, Pontifical Catholic University of Peru, Av. Universitaria 1801 San Miguel, Lima 32, Lima, Peru
| | - Maria Silvina Fornasari
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, CONICET, Bernal, Buenos Aires, Argentina
| | - Gustavo Parisi
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, CONICET, Bernal, Buenos Aires, Argentina
| | - Damiano Piovesan
- Dept. of Biomedical Sciences, University of Padova, via U. Bassi 58/b, 35121 Padova, Italy
| | - Andrey V Kajava
- Centre de Recherche en Biologie cellulaire de Montpellier (CRBM), UMR 5237 CNRS, Université Montpellier, 1919 Route de Mende, Cedex 5, 34293 Montpellier, France
| | - Silvio C E Tosatto
- Dept. of Biomedical Sciences, University of Padova, via U. Bassi 58/b, 35121 Padova, Italy.
| |
Collapse
|
11
|
Ham H, Park DS. Novel approach toward the understanding of genetic diversity based on the two types of amino acid repeats in Erwinia amylovora. Sci Rep 2023; 13:17876. [PMID: 37857695 PMCID: PMC10587187 DOI: 10.1038/s41598-023-44558-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2023] [Accepted: 10/10/2023] [Indexed: 10/21/2023] Open
Abstract
Erwinia amylovora is a notorious plant pathogenic bacterium of global concern that has devastated the apple and pear production industry worldwide. Nevertheless, the approaches available currently to understand the genetic diversity of E. amylovora remain unsatisfactory because of the lack of a trustworthy index and data covering the globally occurring E. amylovora strains; thus, their origin and distribution pattern remains ambiguous. Therefore, there is a growing need for robust approaches for obtaining this information via the comparison of the genomic structure of Amygdaloideae-infecting strains to understand their genetic diversity and distribution. Here, the whole-genome sequences of 245 E. amylovora strains available from the NCBI database were compared to identify intraspecific genes for use as an improved index for the simple classification of E. amylovora strains regarding their distribution. Finally, we discovered two kinds of strain-typing protein-encoding genes, i.e., the SAM-dependent methyltransferase and electron transport complex subunit RsxC. Interestingly, both of these proteins carried an amino acid repeat in these strains: SAM-dependent methyltransferase comprised a single-amino-acid repeat (asparagine), whereas RsxC carried a 40-amino-acid repeat, which was differentially distributed among the strains. These noteworthy findings and approaches may enable the exploration of the genetic diversity of E. amylovora from a global perspective.
Collapse
Affiliation(s)
- Hyeonheui Ham
- Crop Protection Division, Department of Agro-Food Safety and Crop Protection, National Institute of Agricultural Sciences, Rural Development Administration, Wanju, 55365, Republic of Korea
| | - Dong Suk Park
- Crop Protection Division, Department of Agro-Food Safety and Crop Protection, National Institute of Agricultural Sciences, Rural Development Administration, Wanju, 55365, Republic of Korea.
| |
Collapse
|
12
|
Sidhanta SPD, Sowdhamini R, Srinivasan N. Comparative analysis of permanent and transient domain-domain interactions in multi-domain proteins. Proteins 2023. [PMID: 37828826 DOI: 10.1002/prot.26581] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2023] [Revised: 08/09/2023] [Accepted: 08/11/2023] [Indexed: 10/14/2023]
Abstract
Protein domains are structural, functional, and evolutionary units. These domains bring out the diversity of functionality by means of interactions with other co-existing domains and provide stability. Hence, it is important to study intra-protein inter-domain interactions from the perspective of types of interactions. Domains within a chain could interact over short timeframes or permanently, rather like protein-protein interactions (PPIs). However, no systematic study has been carried out between two classes, namely permanent and transient domain-domain interactions. In this work, we studied 263 two-domain proteins, belonging to either of these classes and their interfaces on the basis of several factors, such as interface area and details of interactions (number, strength, and types of interactions). We also characterized them based on residue conservation at the interface, correlation of residue motions across domains, its involvement in repeat formation, and their involvement in particular molecular processes. Finally, we could analyze the interactions arising from domains in two-domain monomeric proteins, and we observed significant differences between these two classes of domain interactions and a few similarities. This study will help to obtain a better understanding of structure-function and folding principles of multi-domain proteins.
Collapse
Affiliation(s)
| | - Ramanathan Sowdhamini
- Molecular Biophysics Unit, Indian Institute of Science, Bangalore, India
- Computational Approaches to Protein Science, National Centre for Biological Sciences, Bangalore, India
- Computational Biology, Institute of Bioinformatics and Applied Biotechnology, Bangalore, India
| | | |
Collapse
|
13
|
Deryusheva EI, Machulin AV, Galzitskaya OV. Diversity and features of proteins with structural repeats. Biophys Rev 2023; 15:1159-1169. [PMID: 37974986 PMCID: PMC10643770 DOI: 10.1007/s12551-023-01130-0] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2023] [Accepted: 08/28/2023] [Indexed: 11/19/2023] Open
Abstract
The review provides information on proteins with structural repeats, including their classification, characteristics, functions, and relevance in disease development. It explores methods for identifying structural repeats and specialized databases. The review also highlights the potential use of repeat proteins as drug design scaffolds and discusses their evolutionary mechanisms.
Collapse
Affiliation(s)
- Evgeniya I. Deryusheva
- Institute for Biological Instrumentation, Federal Research Center “Pushchino Scientific Center for Biological Research of the Russian Academy of Sciences”, Pushchino, Russia
| | - Andrey V. Machulin
- Skryabin Institute of Biochemistry and Physiology of Microorganisms, Federal Research Center “Pushchino Scientific Center for Biological Research of the Russian Academy of Sciences”, Pushchino, Russia
| | - Oxana V. Galzitskaya
- Institute of Protein Research of the Russian Academy of Sciences, Pushchino, Russia
- Institute of Theoretical and Experimental Biophysics of the Russian Academy of Sciences, Pushchino, Russia
| |
Collapse
|
14
|
Ormazábal A, Carletti MS, Saldaño TE, Gonzalez Buitron M, Marchetti J, Palopoli N, Bateman A. Expanding the repertoire of human tandem repeat RNA-binding proteins. PLoS One 2023; 18:e0290890. [PMID: 37729217 PMCID: PMC10511089 DOI: 10.1371/journal.pone.0290890] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2023] [Accepted: 08/15/2023] [Indexed: 09/22/2023] Open
Abstract
Protein regions consisting of arrays of tandem repeats are known to bind other molecular partners, including nucleic acid molecules. Although the interactions between repeat proteins and DNA are already widely explored, studies characterising tandem repeat RNA-binding proteins are lacking. We performed a large-scale analysis of human proteins devoted to expanding the knowledge about tandem repeat proteins experimentally reported as RNA-binding molecules. This work is timely because of the release of a full set of accurate structural models for the human proteome amenable to repeat detection using structural methods. The main goal of our analysis was to build a comprehensive set of human RNA-binding proteins that contain repeats at the sequence or structure level. Our results showed that the combination of sequence and structural methods finds significantly more tandem repeat proteins than either method alone. We identified 219 tandem repeat proteins that bind RNA molecules and characterised the overlap between repeat regions and RNA-binding regions as a first step towards assessing their functional relationship. We observed differences in the characteristics of repeat regions predicted by sequence-based or structure-based methods in terms of their sequence composition, their functions and their protein domains.
Collapse
Affiliation(s)
- Agustín Ormazábal
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, United Kingdom
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Bernal, Buenos Aires, Argentina
- Consejo Nacional de Investigaciones Científicas y Técnicas, Godoy Cruz, Buenos Aires, Argentina
| | - Matías Sebastián Carletti
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Bernal, Buenos Aires, Argentina
- Consejo Nacional de Investigaciones Científicas y Técnicas, Godoy Cruz, Buenos Aires, Argentina
| | - Tadeo Enrique Saldaño
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Bernal, Buenos Aires, Argentina
- Consejo Nacional de Investigaciones Científicas y Técnicas, Godoy Cruz, Buenos Aires, Argentina
- Departamento de Ciencias Básicas, Facultad de Agronomía, Universidad Nacional del Centro de la Provincia de Buenos Aires, Azul, Buenos Aires, Argentina
| | - Martín Gonzalez Buitron
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Bernal, Buenos Aires, Argentina
- Consejo Nacional de Investigaciones Científicas y Técnicas, Godoy Cruz, Buenos Aires, Argentina
| | - Julia Marchetti
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Bernal, Buenos Aires, Argentina
| | - Nicolas Palopoli
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Bernal, Buenos Aires, Argentina
- Consejo Nacional de Investigaciones Científicas y Técnicas, Godoy Cruz, Buenos Aires, Argentina
| | - Alex Bateman
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, United Kingdom
| |
Collapse
|
15
|
Szatkownik A, Zea DJ, Richard H, Laine E. Building alternative splicing and evolution-aware sequence-structure maps for protein repeats. J Struct Biol 2023; 215:107997. [PMID: 37453591 DOI: 10.1016/j.jsb.2023.107997] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2023] [Revised: 06/15/2023] [Accepted: 07/05/2023] [Indexed: 07/18/2023]
Abstract
Alternative splicing of repeats in proteins provides a mechanism for rewiring and fine-tuning protein interaction networks. In this work, we developed a robust and versatile method, ASPRING, to identify alternatively spliced protein repeats from gene annotations. ASPRING leverages evolutionary meaningful alternative splicing-aware hierarchical graphs to provide maps between protein repeats sequences and 3D structures. We re-think the definition of repeats by explicitly accounting for transcript diversity across several genes/species. Using a stringent sequence-based similarity criterion, we detected over 5,000 evolutionary conserved repeats by screening virtually all human protein-coding genes and their orthologs across a dozen species. Through a joint analysis of their sequences and structures, we extracted specificity-determining sequence signatures and assessed their implication in experimentally resolved and modelled protein interactions. Our findings demonstrate the widespread alternative usage of protein repeats in modulating protein interactions and open avenues for targeting repeat-mediated interactions.
Collapse
Affiliation(s)
- Antoine Szatkownik
- Sorbonne Université, CNRS, IBPS, Laboratoire de Biologie Computationnelle et Quantitative (LCQB), 75005 Paris, France; Bioinformatics Unit, Genome Competence Center (MF1), Robert Koch Institute, 13353 Berlin, Germany
| | - Diego Javier Zea
- Université Paris-Saclay, CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC), 91198 Gif-sur-Yvette, France
| | - Hugues Richard
- Sorbonne Université, CNRS, IBPS, Laboratoire de Biologie Computationnelle et Quantitative (LCQB), 75005 Paris, France; Bioinformatics Unit, Genome Competence Center (MF1), Robert Koch Institute, 13353 Berlin, Germany.
| | - Elodie Laine
- Sorbonne Université, CNRS, IBPS, Laboratoire de Biologie Computationnelle et Quantitative (LCQB), 75005 Paris, France.
| |
Collapse
|
16
|
Barth ZK, Dunham DT, Seed KD. Nuclease genes occupy boundaries of genetic exchange between bacteriophages. NAR Genom Bioinform 2023; 5:lqad076. [PMID: 37636022 PMCID: PMC10448857 DOI: 10.1093/nargab/lqad076] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2023] [Revised: 07/13/2023] [Accepted: 08/16/2023] [Indexed: 08/29/2023] Open
Abstract
Homing endonuclease genes (HEGs) are ubiquitous selfish elements that generate targeted double-stranded DNA breaks, facilitating the recombination of the HEG DNA sequence into the break site and contributing to the evolutionary dynamics of HEG-encoding genomes. Bacteriophages (phages) are well-documented to carry HEGs, with the paramount characterization of HEGs being focused on those encoded by coliphage T4. Recently, it has been observed that the highly sampled vibriophage, ICP1, is similarly enriched with HEGs distinct from T4's. Here, we examined the HEGs encoded by ICP1 and diverse phages, proposing HEG-driven mechanisms that contribute to phage evolution. Relative to ICP1 and T4, we found a variable distribution of HEGs across phages, with HEGs frequently encoded proximal to or within essential genes. We identified large regions (> 10kb) of high nucleotide identity flanked by HEGs, deemed HEG islands, which we hypothesize to be mobilized by the activity of flanking HEGs. Finally, we found examples of domain swapping between phage-encoded HEGs and genes encoded by other phages and phage satellites. We anticipate that HEGs have a larger impact on the evolutionary trajectory of phages than previously appreciated and that future work investigating the role of HEGs in phage evolution will continue to highlight these observations.
Collapse
Affiliation(s)
- Zachary K Barth
- Department of Plant and Microbial Biology, University of California, Berkeley. 271 Koshland Hall, Berkeley, CA 94720, USA
| | - Drew T Dunham
- Department of Plant and Microbial Biology, University of California, Berkeley. 271 Koshland Hall, Berkeley, CA 94720, USA
| | - Kimberley D Seed
- Department of Plant and Microbial Biology, University of California, Berkeley. 271 Koshland Hall, Berkeley, CA 94720, USA
| |
Collapse
|
17
|
Ferrelli ML, Pidre ML, García-Domínguez R, Alberca LN, Del Saz-Navarro DM, Santana-Molina C, Devos DP. Prokaryotic membrane coat - like proteins: An update. J Struct Biol 2023; 215:107987. [PMID: 37343709 DOI: 10.1016/j.jsb.2023.107987] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2023] [Revised: 06/09/2023] [Accepted: 06/16/2023] [Indexed: 06/23/2023]
Abstract
Membrane coat proteins are essential players in the eukaryotic endomembrane traffic system. Previous work identified proteins with the membrane-coat architecture in prokaryotes, specifically in the Planctomycetes, Verrucomicrobia and Chlamydiae (PVC) superphylum, bacteria that display the most developed prokaryotic endomembrane system. Hence, the membrane coat-like (MCL) proteins are predicted to play a central role in this system but their actual function is still unknown. In this work we strengthened previous structure predictions for these prokaryotic MCL proteins. We also detected new putative MCL proteins in the Planctomycete Gemmata obscuriglobus. Structural analysis of these revealed the presence of additional domains apart from the β-propeller and α-solenoid combination, characteristic of the membrane-coat architecture. Functions associated with these domains include some related to carbohydrate or membrane/lipid binding. Using homology-based methods, we found MCL proteins in other bacterial phyla, but the most abundant hits are still restricted to Planctomycetes and Verrucomicrobia. Detailed inspection of neighbouring genes of MCL in G. obscuriglobus supports the idea that the function of these proteins is related to membrane manipulation. No significant hits were found in Archaea, including Asgard archaea. More than 10 years after their original detection, PVC bacteria are still uniquely linked to eukaryotes through the structure of the MCL proteins sustaining their endomembrane system.
Collapse
Affiliation(s)
- M Leticia Ferrelli
- Centro Andaluz de Biología del Desarrollo (CABD), Consejo Superior de Investigaciones Científicas (CSIC), Campus Universidad Pablo de Olavide (UPO), 41013 Seville, Spain
| | - Matías L Pidre
- Centro Andaluz de Biología del Desarrollo (CABD), Consejo Superior de Investigaciones Científicas (CSIC), Campus Universidad Pablo de Olavide (UPO), 41013 Seville, Spain
| | - Ruben García-Domínguez
- Centro Andaluz de Biología del Desarrollo (CABD), Consejo Superior de Investigaciones Científicas (CSIC), Campus Universidad Pablo de Olavide (UPO), 41013 Seville, Spain
| | - Lucas N Alberca
- Centro Andaluz de Biología del Desarrollo (CABD), Consejo Superior de Investigaciones Científicas (CSIC), Campus Universidad Pablo de Olavide (UPO), 41013 Seville, Spain
| | - DMaría Del Saz-Navarro
- Centro Andaluz de Biología del Desarrollo (CABD), Consejo Superior de Investigaciones Científicas (CSIC), Campus Universidad Pablo de Olavide (UPO), 41013 Seville, Spain
| | - Carlos Santana-Molina
- Centro Andaluz de Biología del Desarrollo (CABD), Consejo Superior de Investigaciones Científicas (CSIC), Campus Universidad Pablo de Olavide (UPO), 41013 Seville, Spain
| | - Damien P Devos
- Centro Andaluz de Biología del Desarrollo (CABD), Consejo Superior de Investigaciones Científicas (CSIC), Campus Universidad Pablo de Olavide (UPO), 41013 Seville, Spain.
| |
Collapse
|
18
|
Arrías PN, Monzon AM, Clementel D, Mozaffari S, Piovesan D, Kajava AV, Tosatto SCE. The repetitive structure of DNA clamps: An overlooked protein tandem repeat. J Struct Biol 2023; 215:108001. [PMID: 37467824 DOI: 10.1016/j.jsb.2023.108001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2023] [Revised: 07/12/2023] [Accepted: 07/16/2023] [Indexed: 07/21/2023]
Abstract
Structured tandem repeats proteins (STRPs) are a specific kind of tandem repeat proteins characterized by a modular and repetitive three-dimensional structure arrangement. The majority of STRPs adopt solenoid structures, but with the increasing availability of experimental structures and high-quality predicted structural models, more STRP folds can be characterized. Here, we describe "Box repeats", an overlooked STRP fold present in the DNA sliding clamp processivity factors, which has eluded classification although structural data has been available since the late 1990s. Each Box repeat is a β⍺βββ module of about 60 residues, which forms a class V "beads-on-a-string" type STRP. The number of repeats present in processivity factors is organism dependent. Monomers of PCNA proteins in both Archaea and Eukarya have 4 repeats, while the monomers of bacterial beta-sliding clamps have 6 repeats. This new repeat fold has been added to the RepeatsDB database, which now provides structural annotation for 66 Box repeat proteins belonging to different organisms, including viruses.
Collapse
Affiliation(s)
- Paula Nazarena Arrías
- Department of Biomedical Sciences, University of Padova, via U. Bassi 58/b, 35121 Padova, Italy
| | - Alexander Miguel Monzon
- Department of Information Engineering, University of Padova, via Giovanni Gradenigo 6/B, 35131 Padova, Italy
| | - Damiano Clementel
- Department of Biomedical Sciences, University of Padova, via U. Bassi 58/b, 35121 Padova, Italy
| | - Soroush Mozaffari
- Department of Biomedical Sciences, University of Padova, via U. Bassi 58/b, 35121 Padova, Italy
| | - Damiano Piovesan
- Department of Biomedical Sciences, University of Padova, via U. Bassi 58/b, 35121 Padova, Italy
| | - Andrey V Kajava
- Centre de Recherche en Biologie cellulaire de Montpellier (CRBM), UMR 5237 CNRS, Université Montpellier, 1919 Route de Mende, Cedex 5, 34293 Montpellier, France
| | - Silvio C E Tosatto
- Department of Biomedical Sciences, University of Padova, via U. Bassi 58/b, 35121 Padova, Italy.
| |
Collapse
|
19
|
Miller J, Urvoas A, Gigant B, Ouldali M, Arteni A, Mesneau A, Valerio-Lepiniec M, Artzner F, Dujardin E, Minard P. Engineering of brick and staple components for ordered assembly of synthetic repeat proteins. J Struct Biol 2023; 215:108012. [PMID: 37567372 DOI: 10.1016/j.jsb.2023.108012] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2023] [Revised: 07/27/2023] [Accepted: 08/07/2023] [Indexed: 08/13/2023]
Abstract
Synthetic ɑRep repeat proteins are engineered as Brick and Staple protein pairs that together self-assemble into helical filaments. In most cases, the filaments spontaneously form supercrystals. Here, we describe an expanded series of ɑRep Bricks designed to stabilize the interaction between consecutive Bricks, to control the length of the assembled multimers, or to alter the spatial distribution of the Staple on the filaments. The effects of these Brick modifications on the assembly, on the final filament structure and on the crystal symmetry are analyzed by biochemical methods, electron microscopy and small angle X-ray scattering. We further extend the concept of Brick/Staple protein origami by designing a new type of "Janus"-like Brick protein that is equally assembled by orthogonal staples binding its inner or outer surfaces and thus ending inside or outside the filaments. The relative roles of longitudinal and lateral associations in the assembly process are discussed. This set of results demonstrates important proofs-of-principle for engineering these remarkably versatile proteins toward nanometer-to-micron scale constructions.
Collapse
Affiliation(s)
- Jessalyn Miller
- Emory University Department of Chemistry, 1515 Dickey Drive, Atlanta, GA 30322, USA(1); Institute for Integrative Biology of the Cell (I2BC), CEA, CNRS, Université Paris-Saclay, F-91198 Gif-sur-Yvette CEDEX, France
| | - Agathe Urvoas
- Institute for Integrative Biology of the Cell (I2BC), CEA, CNRS, Université Paris-Saclay, F-91198 Gif-sur-Yvette CEDEX, France
| | - Benoit Gigant
- Institute for Integrative Biology of the Cell (I2BC), CEA, CNRS, Université Paris-Saclay, F-91198 Gif-sur-Yvette CEDEX, France
| | - Malika Ouldali
- Institute for Integrative Biology of the Cell (I2BC), CEA, CNRS, Université Paris-Saclay, F-91198 Gif-sur-Yvette CEDEX, France
| | - Ana Arteni
- Institute for Integrative Biology of the Cell (I2BC), CEA, CNRS, Université Paris-Saclay, F-91198 Gif-sur-Yvette CEDEX, France
| | - Agnes Mesneau
- Institute for Integrative Biology of the Cell (I2BC), CEA, CNRS, Université Paris-Saclay, F-91198 Gif-sur-Yvette CEDEX, France
| | - Marie Valerio-Lepiniec
- Institute for Integrative Biology of the Cell (I2BC), CEA, CNRS, Université Paris-Saclay, F-91198 Gif-sur-Yvette CEDEX, France
| | - Franck Artzner
- Institut de Physique de Rennes (IPR), CNRS, UMR 6251, Université de Rennes 1, F-35042 Rennes, France
| | - Erik Dujardin
- Laboratoire Interdisciplinaire Carnot de Bourgogne, CNRS UMR 6303, Université de Bourgogne Franche-Comté, 21000 Dijon, France.
| | - Philippe Minard
- Institute for Integrative Biology of the Cell (I2BC), CEA, CNRS, Université Paris-Saclay, F-91198 Gif-sur-Yvette CEDEX, France.
| |
Collapse
|
20
|
Mesdaghi S, Price RM, Madine J, Rigden DJ. Deep Learning-based structure modelling illuminates structure and function in uncharted regions of β-solenoid fold space. J Struct Biol 2023; 215:108010. [PMID: 37544372 DOI: 10.1016/j.jsb.2023.108010] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2023] [Revised: 07/19/2023] [Accepted: 08/03/2023] [Indexed: 08/08/2023]
Abstract
Repeat proteins are common in all domains of life and exhibit a wide range of functions. One class of repeat protein contains solenoid folds where the repeating unit consists of β-strands separated by tight turns. β-solenoids have distinguishing structural features such as handedness, twist, oligomerisation state, coil shape and size which give rise to their diversity. Characterised β-solenoid repeat proteins are known to form regions in bacterial and viral virulence factors, antifreeze proteins and functional amyloids. For many of these proteins, the experimental structure has not been solved, as they are difficult to crystallise or model. Here we use various deep learning-based structure-modelling methods to discover novel predicted β-solenoids, perform structural database searches to mine further structural neighbours and relate their predicted structure to possible functions. We find both eukaryotic and prokaryotic adhesins, confirming a known functional linkage between adhesin function and the β-solenoid fold. We further identify exceptionally long, flat β-solenoid folds as possible structures of mucin tandem repeat regions and unprecedentedly small β-solenoid structures. Additionally, we characterise a novel β-solenoid coil shape, the FapC Greek key β-solenoid as well as plausible complexes between it and other proteins involved in Pseudomonas functional amyloid fibres.
Collapse
Affiliation(s)
- Shahram Mesdaghi
- The University of Liverpool, Institute of Systems, Molecular & Integrative Biology, Biosciences Building, Crown Street, Liverpool L69 7ZB, United Kingdom; Computational Biology Facility, MerseyBio, University of Liverpool, Crown Street, Liverpool L69 7ZB, United Kingdom
| | - Rebecca M Price
- The University of Liverpool, Institute of Systems, Molecular & Integrative Biology, Biosciences Building, Crown Street, Liverpool L69 7ZB, United Kingdom
| | - Jillian Madine
- The University of Liverpool, Institute of Systems, Molecular & Integrative Biology, Biosciences Building, Crown Street, Liverpool L69 7ZB, United Kingdom.
| | - Daniel J Rigden
- The University of Liverpool, Institute of Systems, Molecular & Integrative Biology, Biosciences Building, Crown Street, Liverpool L69 7ZB, United Kingdom.
| |
Collapse
|
21
|
Manasra S, Kajava AV. Why does the first protein repeat often become the only one? J Struct Biol 2023; 215:108014. [PMID: 37567371 DOI: 10.1016/j.jsb.2023.108014] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2023] [Revised: 08/06/2023] [Accepted: 08/09/2023] [Indexed: 08/13/2023]
Abstract
Proteins with two similar motifs in tandem are one of the most common cases of tandem repeat proteins. The question arises: why is the first emerged repeat frequently fixed in the process of evolution, despite the ample opportunities to continue its multiplication at the DNA level? To answer this question, we systematically analyzed the structure and function of these proteins. Our analysis showed that, in the vast majority of cases, the structural repetitive units have a two-fold (C2) internal symmetry. These closed structures provide an internal structural limitation for the subsequent growth of the repeat number. Frequently, the units "swap" their secondary structure elements with each other. Moreover, the duplicated domains, in contrast to other tandem repeat proteins, form binding sites for small molecules around the axis of C2 symmetry. Thus, the closure of the C2 structures and the emergence of new functional sites around the axis of C2 symmetry provide plausible explanations for why a repeat, once appeared, becomes fixed in the evolutionary process. We have placed these structures within the general structural classification of tandem repeat proteins, classifying them as either Class IV or V depending on the size of the repetitive unit.
Collapse
Affiliation(s)
- Simona Manasra
- Institute of Bioengineering, ITMO University, Kronverksky Pr. 49, 197101 Saint Petersburg, Russia
| | - Andrey V Kajava
- Centre de Recherche en Biologie cellulaire de Montpellier (CRBM), UMR 5237 CNRS, Université Montpellier, 1919 Route de Mende, Cedex 5, 34293 Montpellier, France.
| |
Collapse
|
22
|
Mier P, Andrade-Navarro MA. Evolutionary Study of Protein Short Tandem Repeats in Protein Families. Biomolecules 2023; 13:1116. [PMID: 37509152 PMCID: PMC10377733 DOI: 10.3390/biom13071116] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2023] [Revised: 07/06/2023] [Accepted: 07/12/2023] [Indexed: 07/30/2023] Open
Abstract
Tandem repeats in proteins are patterns of residues repeated directly adjacent to each other. The evolution of these repeats can be assessed by using groups of homologous sequences, which can help pointing to events of unit duplication or deletion. High pressure in a protein family for variation of a given type of repeat might point to their function. Here, we propose the analysis of protein families to calculate protein short tandem repeats (pSTRs) in each protein sequence and assess their variability within the family in terms of number of units. To facilitate this analysis, we developed the pSTR tool, a method to analyze the evolution of protein short tandem repeats in a given protein family by pairwise comparisons between evolutionarily related protein sequences. We evaluated pSTR unit number variation in protein families of 12 complete metazoan proteomes. We hypothesize that families with more dynamic ensembles of repeats could reflect particular roles of these repeats in processes that require more adaptability.
Collapse
Affiliation(s)
- Pablo Mier
- Faculty of Biology, Institute of Organismic and Molecular Evolution, Johannes Gutenberg University Mainz, 55128 Mainz, Germany
| | - Miguel A Andrade-Navarro
- Faculty of Biology, Institute of Organismic and Molecular Evolution, Johannes Gutenberg University Mainz, 55128 Mainz, Germany
| |
Collapse
|
23
|
Kaynak BT, Dahmani ZL, Doruker P, Banerjee A, Yang SH, Gordon R, Itzhaki LS, Bahar I. Cooperative mechanics of PR65 scaffold underlies the allosteric regulation of the phosphatase PP2A. Structure 2023; 31:607-618.e3. [PMID: 36948205 PMCID: PMC10164121 DOI: 10.1016/j.str.2023.02.012] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2022] [Revised: 01/25/2023] [Accepted: 02/23/2023] [Indexed: 03/24/2023]
Abstract
PR65, a horseshoe-shaped scaffold composed of 15 HEAT (observed in Huntingtin, elongation factor 3, protein phosphatase 2A, and the yeast kinase TOR1) repeats, forms, together with catalytic and regulatory subunits, the heterotrimeric protein phosphatase PP2A. We examined the role of PR65 in enabling PP2A enzymatic activity with computations at various levels of complexity, including hybrid approaches that combine full-atomic and elastic network models. Our study points to the high flexibility of this scaffold allowing for end-to-end distance fluctuations of 40-50 Å between compact and extended conformations. Notably, the intrinsic dynamics of PR65 facilitates complexation with the catalytic subunit and is retained in the PP2A complex enabling PR65 to engage the two domains of the catalytic subunit and provide the mechanical framework for enzymatic activity, with support from the regulatory subunit. In particular, the intra-repeat coils at the C-terminal arm play an important role in allosterically mediating the collective dynamics of PP2A, pointing to target sites for modulating PR65 function.
Collapse
Affiliation(s)
- Burak T Kaynak
- Department of Computational and Systems Biology, School of Medicine, University of Pittsburgh, Pittsburgh, PA 15213, USA; Computational Neurobiology Laboratory, Salk Institute for Biological Studies, La Jolla, CA 92037, USA
| | - Zakaria L Dahmani
- Department of Computational and Systems Biology, School of Medicine, University of Pittsburgh, Pittsburgh, PA 15213, USA
| | - Pemra Doruker
- Department of Computational and Systems Biology, School of Medicine, University of Pittsburgh, Pittsburgh, PA 15213, USA
| | - Anupam Banerjee
- Department of Computational and Systems Biology, School of Medicine, University of Pittsburgh, Pittsburgh, PA 15213, USA; Laufer Center for Physical and Quantitative Biology, and Department of Biochemistry and Cell Biology, School of Medicine, Stony Brook University, Stony Brook, NY 11794, USA
| | - Shang-Hua Yang
- Department of Electrical Engineering, National Tsing Hua University, Hsinchu 30013, Taiwan
| | - Reuven Gordon
- Department of Electrical and Computer Engineering, University of Victoria, Victoria, BC V8P 5C2, Canada
| | - Laura S Itzhaki
- Department of Pharmacology, University of Cambridge, Tennis Court Road, Cambridge CB2 1PD, UK
| | - Ivet Bahar
- Laufer Center for Physical and Quantitative Biology, and Department of Biochemistry and Cell Biology, School of Medicine, Stony Brook University, Stony Brook, NY 11794, USA.
| |
Collapse
|
24
|
Barth ZK, Dunham DT, Seed KD. Nuclease genes occupy boundaries of genetic exchange between bacteriophages. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.03.23.533998. [PMID: 36993569 PMCID: PMC10055350 DOI: 10.1101/2023.03.23.533998] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 06/19/2023]
Abstract
Homing endonuclease genes (HEGs) are ubiquitous selfish elements that generate targeted double-stranded DNA breaks, facilitating the recombination of the HEG DNA sequence into the break site and contributing to the evolutionary dynamics of HEG-encoding genomes. Bacteriophages (phages) are well-documented to carry HEGs, with the paramount characterization of HEGs being focused on those encoded by coliphage T4. Recently, it has been observed that the highly sampled vibriophage, ICP1, is similarly enriched with HEGs distinct from T4’s. Here, we examined the HEGs encoded by ICP1 and diverse phages, proposing HEG-driven mechanisms that contribute to phage evolution. Relative to ICP1 and T4, we found a variable distribution of HEGs across phages, with HEGs frequently encoded proximal to or within essential genes. We identified large regions (> 10kb) of high nucleotide identity flanked by HEGs, deemed HEG islands, which we hypothesize to be mobilized by the activity of flanking HEGs. Finally, we found examples of domain swapping between phage-encoded HEGs and genes encoded by other phages and phage satellites. We anticipate that HEGs have a larger impact on the evolutionary trajectory of phages than previously appreciated and that future work investigating the role of HEGs in phage evolution will continue to highlight these observations.
Collapse
Affiliation(s)
- Zachary K Barth
- Department of Plant and Microbial Biology, University of California, Berkeley. 271 Koshland Hall, Berkeley, CA 94720, USA
| | - Drew T Dunham
- Department of Plant and Microbial Biology, University of California, Berkeley. 271 Koshland Hall, Berkeley, CA 94720, USA
| | - Kimberley D Seed
- Department of Plant and Microbial Biology, University of California, Berkeley. 271 Koshland Hall, Berkeley, CA 94720, USA
| |
Collapse
|
25
|
Wright SE, Todd PK. Native functions of short tandem repeats. eLife 2023; 12:e84043. [PMID: 36940239 PMCID: PMC10027321 DOI: 10.7554/elife.84043] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2022] [Accepted: 03/08/2023] [Indexed: 03/21/2023] Open
Abstract
Over a third of the human genome is comprised of repetitive sequences, including more than a million short tandem repeats (STRs). While studies of the pathologic consequences of repeat expansions that cause syndromic human diseases are extensive, the potential native functions of STRs are often ignored. Here, we summarize a growing body of research into the normal biological functions for repetitive elements across the genome, with a particular focus on the roles of STRs in regulating gene expression. We propose reconceptualizing the pathogenic consequences of repeat expansions as aberrancies in normal gene regulation. From this altered viewpoint, we predict that future work will reveal broader roles for STRs in neuronal function and as risk alleles for more common human neurological diseases.
Collapse
Affiliation(s)
- Shannon E Wright
- Department of Neurology, University of Michigan–Ann ArborAnn ArborUnited States
- Neuroscience Graduate Program, University of Michigan–Ann ArborAnn ArborUnited States
- Department of Neuroscience, Picower InstituteCambridgeUnited States
| | - Peter K Todd
- Department of Neurology, University of Michigan–Ann ArborAnn ArborUnited States
- VA Ann Arbor Healthcare SystemAnn ArborUnited States
| |
Collapse
|
26
|
Woolfson DN. Understanding a protein fold: the physics, chemistry, and biology of α-helical coiled coils. J Biol Chem 2023; 299:104579. [PMID: 36871758 PMCID: PMC10124910 DOI: 10.1016/j.jbc.2023.104579] [Citation(s) in RCA: 20] [Impact Index Per Article: 20.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2023] [Revised: 02/25/2023] [Accepted: 02/27/2023] [Indexed: 03/07/2023] Open
Abstract
Protein science is being transformed by powerful computational methods for structure prediction and design: AlphaFold2 can predict many natural protein structures from sequence, and other AI methods are enabling the de novo design of new structures. This raises a question: how much do we understand the underlying sequence-to-structure/function relationships being captured by these methods? This perspective presents our current understanding of one class of protein assembly, the α-helical coiled coils. At first sight, these are straightforward: sequence repeats of hydrophobic (h) and polar (p) residues, (hpphppp)n, direct the folding and assembly of amphipathic α helices into bundles. However, many different bundles are possible: they can have two or more helices (different oligomers); the helices can have parallel, antiparallel or mixed arrangements (different topologies); and the helical sequences can be the same (homomers) or different (heteromers). Thus, sequence-to-structure relationships must be present within the hpphppp repeats to distinguish these states. I discuss the current understanding of this problem at three levels: First, physics gives a parametric framework to generate the many possible coiled-coil backbone structures. Second, chemistry provides a means to explore and deliver sequence-to-structure relationships. Third, biology shows how coiled coils are adapted and functionalized in nature, inspiring applications of coiled coils in synthetic biology. I argue that the chemistry is largely understood; the physics is partly solved, though the considerable challenge of predicting even relative stabilities of different coiled-coil states remains; but there is much more to explore in the biology and synthetic biology of coiled coils.
Collapse
Affiliation(s)
- Derek N Woolfson
- School of Chemistry, University of Bristol, Bristol, United Kingdom; School of Biochemistry, University of Bristol, Medical Sciences Building, University Walk, Bristol, United Kingdom; BrisEngBio, School of Chemistry, University of Bristol, Bristol, United Kingdom; Max Planck-Bristol Centre for Minimal Biology, University of Bristol, Bristol, United Kingdom.
| |
Collapse
|
27
|
Yu H, Kalutantirige FC, Yao L, Schroeder CM, Chen Q, Moore JS. Self-Assembly of Repetitive Segment and Random Segment Polymer Architectures. ACS Macro Lett 2022; 11:1366-1372. [PMID: 36413761 DOI: 10.1021/acsmacrolett.2c00495] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
Abstract
Recent advances in chemical synthesis have created new methodologies for synthesizing sequence-controlled synthetic polymers, but rational design of monomer sequence for desired properties remains challenging. In this work, we synthesize periodic polymers with repetitive segments using a sequence-controlled ring-opening metathesis polymerization (ROMP) method, which draws inspiration from proteins containing repetitive sequence motifs. The repetitive segment architecture is shown to dramatically affect the self-assembly behavior of these materials. Our results show that polymers with identical repetitive sequences assemble into uniform spherical nanoparticles after thermal annealing, whereas copolymers with random placement of segments with different sequences exhibit disordered assemblies without a well-defined morphology. Overall, these results bring a new understanding to the role of periodic repetitive sequences in polymer assembly.
Collapse
Affiliation(s)
- Hao Yu
- Department of Chemical and Biomolecular Engineering, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States
| | - Falon C Kalutantirige
- Department of Chemistry, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States
| | - Lehan Yao
- Department of Materials Science and Engineering, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States
| | - Charles M Schroeder
- Department of Chemical and Biomolecular Engineering, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States.,Department of Chemistry, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States.,Department of Materials Science and Engineering, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States.,Beckman Institute for Advanced Science and Technology, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States
| | - Qian Chen
- Department of Chemical and Biomolecular Engineering, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States.,Department of Chemistry, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States.,Department of Materials Science and Engineering, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States.,Beckman Institute for Advanced Science and Technology, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States
| | - Jeffrey S Moore
- Department of Chemistry, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States.,Department of Materials Science and Engineering, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States.,Beckman Institute for Advanced Science and Technology, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States
| |
Collapse
|
28
|
Osmanli Z, Falgarone T, Samadova T, Aldrian G, Leclercq J, Shahmuradov I, Kajava AV. The Difference in Structural States between Canonical Proteins and Their Isoforms Established by Proteome-Wide Bioinformatics Analysis. Biomolecules 2022; 12:1610. [PMID: 36358962 PMCID: PMC9687161 DOI: 10.3390/biom12111610] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2022] [Revised: 10/14/2022] [Accepted: 10/27/2022] [Indexed: 09/02/2023] Open
Abstract
Alternative splicing is an important means of generating the protein diversity necessary for cellular functions. Hence, there is a growing interest in assessing the structural and functional impact of alternative protein isoforms. Typically, experimental studies are used to determine the structures of the canonical proteins ignoring the other isoforms. Therefore, there is still a large gap between abundant sequence information and meager structural data on these isoforms. During the last decade, significant progress has been achieved in the development of bioinformatics tools for structural and functional annotations of proteins. Moreover, the appearance of the AlphaFold program opened up the possibility to model a large number of high-confidence structures of the isoforms. In this study, using state-of-the-art tools, we performed in silico analysis of 58 eukaryotic proteomes. The evaluated structural states included structured domains, intrinsically disordered regions, aggregation-prone regions, and tandem repeats. Among other things, we found that the isoforms have fewer signal peptides, transmembrane regions, or tandem repeat regions in comparison with their canonical counterparts. This could change protein function and/or cellular localization. The AlphaFold modeling demonstrated that frequently isoforms, having differences with the canonical sequences, still can fold in similar structures though with significant structural rearrangements which can lead to changes of their functions. Based on the modeling, we suggested classification of the structural differences between canonical proteins and isoforms. Altogether, we can conclude that a majority of isoforms, similarly to the canonical proteins are under selective pressure for the functional roles.
Collapse
Affiliation(s)
- Zarifa Osmanli
- CRBM, Université de Montpellier, CNRS, 1919 Route de Mende, CEDEX 5, 34293 Montpellier, France
- Institute of Biophysics, ANAS, Baku AZ1141, Azerbaijan
| | - Theo Falgarone
- CRBM, Université de Montpellier, CNRS, 1919 Route de Mende, CEDEX 5, 34293 Montpellier, France
| | | | - Gudrun Aldrian
- CRBM, Université de Montpellier, CNRS, 1919 Route de Mende, CEDEX 5, 34293 Montpellier, France
| | - Jeremy Leclercq
- CRBM, Université de Montpellier, CNRS, 1919 Route de Mende, CEDEX 5, 34293 Montpellier, France
| | | | - Andrey V. Kajava
- CRBM, Université de Montpellier, CNRS, 1919 Route de Mende, CEDEX 5, 34293 Montpellier, France
| |
Collapse
|
29
|
Kastano K, Mier P, Dosztányi Z, Promponas VJ, Andrade-Navarro MA. Functional Tuning of Intrinsically Disordered Regions in Human Proteins by Composition Bias. Biomolecules 2022; 12:biom12101486. [PMID: 36291695 PMCID: PMC9599065 DOI: 10.3390/biom12101486] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2022] [Revised: 09/30/2022] [Accepted: 10/11/2022] [Indexed: 11/16/2022] Open
Abstract
Intrinsically disordered regions (IDRs) in protein sequences are flexible, have low structural constraints and as a result have faster rates of evolution. This lack of evolutionary conservation greatly limits the use of sequence homology for the classification and functional assessment of IDRs, as opposed to globular domains. The study of IDRs requires other properties for their classification and functional prediction. While composition bias is not a necessary property of IDRs, compositionally biased regions (CBRs) have been noted as frequent part of IDRs. We hypothesized that to characterize IDRs, it could be helpful to study their overlap with particular types of CBRs. Here, we evaluate this overlap in the human proteome. A total of 2/3 of residues in IDRs overlap CBRs. Considering CBRs enriched in one type of amino acid, we can distinguish CBRs that tend to be fully included within long IDRs (R, H, N, D, P, G), from those that partially overlap shorter IDRs (S, E, K, T), and others that tend to overlap IDR terminals (Q, A). CBRs overlap more often IDRs in nuclear proteins and in proteins involved in liquid-liquid phase separation (LLPS). Study of protein interaction networks reveals the enrichment of CBRs in IDRs by tandem repetition of short linear motifs (rich in S or P), and the existence of E-rich polar regions that could support specific protein interactions with non-specific interactions. Our results open ways to pin down the function of IDRs from their partial compositional biases.
Collapse
Affiliation(s)
- Kristina Kastano
- Institute of Organismic and Molecular Evolution, Faculty of Biology, Johannes Gutenberg University, Biozentrum I, Hans-Dieter-Hüsch-Weg 15, 55128 Mainz, Germany
| | - Pablo Mier
- Institute of Organismic and Molecular Evolution, Faculty of Biology, Johannes Gutenberg University, Biozentrum I, Hans-Dieter-Hüsch-Weg 15, 55128 Mainz, Germany
| | - Zsuzsanna Dosztányi
- Department of Biochemistry, ELTE Eötvös Loránd University, Pázmány Péter stny 1/c, H-1117 Budapest, Hungary
| | - Vasilis J. Promponas
- Bioinformatics Research Laboratory, Department of Biological Sciences, University of Cyprus, 1678 Nicosia, Cyprus
| | - Miguel A. Andrade-Navarro
- Institute of Organismic and Molecular Evolution, Faculty of Biology, Johannes Gutenberg University, Biozentrum I, Hans-Dieter-Hüsch-Weg 15, 55128 Mainz, Germany
- Correspondence:
| |
Collapse
|
30
|
Digging into the 3D Structure Predictions of AlphaFold2 with Low Confidence: Disorder and Beyond. Biomolecules 2022; 12:biom12101467. [PMID: 36291675 PMCID: PMC9599455 DOI: 10.3390/biom12101467] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2022] [Revised: 10/04/2022] [Accepted: 10/05/2022] [Indexed: 01/12/2023] Open
Abstract
AlphaFold2 (AF2) has created a breakthrough in biology by providing three-dimensional structure models for whole-proteome sequences, with unprecedented levels of accuracy. In addition, the AF2 pLDDT score, related to the model confidence, has been shown to provide a good measure of residue-wise disorder. Here, we combined AF2 predictions with pyHCA, a tool we previously developed to identify foldable segments and estimate their order/disorder ratio, from a single protein sequence. We focused our analysis on the AF2 predictions available for 21 reference proteomes (AFDB v1), in particular on their long foldable segments (>30 amino acids) that exhibit characteristics of soluble domains, as estimated by pyHCA. Among these segments, we provided a global analysis of those with very low pLDDT values along their entire length and compared their characteristics to those of segments with very high pLDDT values. We highlighted cases containing conditional order, as well as cases that could form well-folded structures but escape the AF2 prediction due to a shallow multiple sequence alignment and/or undocumented structure or fold. AF2 and pyHCA can therefore be advantageously combined to unravel cryptic structural features in whole proteomes and to refine predictions for different flavors of disorder.
Collapse
|
31
|
Mier P, Elena-Real CA, Cortés J, Bernadó P, Andrade-Navarro MA. The sequence context in poly-alanine regions: structure, function and conservation. Bioinformatics 2022; 38:4851-4858. [PMID: 36106994 PMCID: PMC9620824 DOI: 10.1093/bioinformatics/btac610] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2022] [Revised: 07/07/2022] [Accepted: 09/05/2022] [Indexed: 11/24/2022] Open
Abstract
Motivation Poly-alanine (polyA) regions are protein stretches mostly composed of alanines. Despite their abundance in eukaryotic proteomes and their association to nine inherited human diseases, the structural and functional roles exerted by polyA stretches remain poorly understood. In this work we study how the amino acid context in which polyA regions are settled in proteins influences their structure and function. Results We identified glycine and proline as the most abundant amino acids within polyA and in the flanking regions of polyA tracts, in human proteins as well as in 17 additional eukaryotic species. Our analyses indicate that the non-structuring nature of these two amino acids influences the α-helical conformations predicted for polyA, suggesting a relevant role in reducing the inherent aggregation propensity of long polyA. Then, we show how polyA position in protein N-termini relates with their function as transit peptides. PolyA placed just after the initial methionine is often predicted as part of mitochondrial transit peptides, whereas when placed in downstream positions, polyA are part of signal peptides. A few examples from known structures suggest that short polyA can emerge by alanine substitutions in α-helices; but evolution by insertion is observed for longer polyA. Our results showcase the importance of studying the sequence context of homorepeats as a mechanism to shape their structure–function relationships. Availability and implementation The datasets used and/or analyzed during the current study are available from the corresponding author onreasonable request. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Pablo Mier
- Faculty of Biology, Institute of Organismic and Molecular Evolution, Johannes Gutenberg University Mainz , 55128 Mainz, Germany
| | - Carlos A Elena-Real
- Centre de Biologie Structurale (CBS), Université de Montpellier, INSERM, CNRS , 34090 Montpellier, France
| | - Juan Cortés
- LAAS-CNRS, Université de Toulouse, CNRS , Toulouse, France
| | - Pau Bernadó
- Centre de Biologie Structurale (CBS), Université de Montpellier, INSERM, CNRS , 34090 Montpellier, France
| | - Miguel A Andrade-Navarro
- Faculty of Biology, Institute of Organismic and Molecular Evolution, Johannes Gutenberg University Mainz , 55128 Mainz, Germany
| |
Collapse
|
32
|
Merski M, Macedo-Ribeiro S, Wieczorek RM, Górna MW. The Repeating, Modular Architecture of the HtrA Proteases. Biomolecules 2022; 12:biom12060793. [PMID: 35740918 PMCID: PMC9221053 DOI: 10.3390/biom12060793] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2022] [Revised: 06/02/2022] [Accepted: 06/04/2022] [Indexed: 02/04/2023] Open
Abstract
A conserved, 26-residue sequence [AA(X2)[A/G][G/L](X2)GDV[I/L](X2)[V/L]NGE(X1)V(X6)] and corresponding structure repeating module were identified within the HtrA protease family using a non-redundant set (N = 20) of publicly available structures. While the repeats themselves were far from sequence perfect, they had notable conservation to a statistically significant level. Three or more repetitions were identified within each protein despite being statistically expected to randomly occur only once per 1031 residues. This sequence repeat was associated with a six stranded antiparallel β-barrel module, two of which are present in the core of the structures of the PA clan of serine proteases, while a modified version of this module could be identified in the PDZ-like domains. Automated structural alignment methods had difficulties in superimposing these β-barrels, but the use of a target human HtrA2 structure showed that these modules had an average RMSD across the set of structures of less than 2 Å (mean and median). Our findings support Dayhoff’s hypothesis that complex proteins arose through duplication of simpler peptide motifs and domains.
Collapse
Affiliation(s)
- Matthew Merski
- Structural Biology Group, Biological and Chemical Research Centre, Faculty of Chemistry, University of Warsaw, Żwirki i Wigury 101, 02-089 Warsaw, Poland
- Correspondence: (M.M.); (M.W.G.); Tel.: +48-225-526-642 (M.M.)
| | - Sandra Macedo-Ribeiro
- Instituto de Investigação e Inovação em Saúde and Instituto de Biologia Molecular e Celular (IBMC), Universidade do Porto, 4200-135 Porto, Portugal;
| | - Rafal M. Wieczorek
- Faculty of Chemistry, University of Warsaw, Pasteura 1, 02-093 Warsaw, Poland;
| | - Maria W. Górna
- Structural Biology Group, Biological and Chemical Research Centre, Faculty of Chemistry, University of Warsaw, Żwirki i Wigury 101, 02-089 Warsaw, Poland
- Correspondence: (M.M.); (M.W.G.); Tel.: +48-225-526-642 (M.M.)
| |
Collapse
|
33
|
Synakewicz M, Eapen RS, Perez-Riba A, Rowling PJE, Bauer D, Weißl A, Fischer G, Hyvönen M, Rief M, Itzhaki LS, Stigler J. Unraveling the Mechanics of a Repeat-Protein Nanospring: From Folding of Individual Repeats to Fluctuations of the Superhelix. ACS NANO 2022. [PMID: 35258937 DOI: 10.1101/2021.03.27.437344] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/11/2023]
Abstract
Tandem-repeat proteins comprise small secondary structure motifs that stack to form one-dimensional arrays with distinctive mechanical properties that are proposed to direct their cellular functions. Here, we use single-molecule optical tweezers to study the folding of consensus-designed tetratricopeptide repeats (CTPRs), superhelical arrays of short helix-turn-helix motifs. We find that CTPRs display a spring-like mechanical response in which individual repeats undergo rapid equilibrium fluctuations between partially folded and unfolded conformations. We rationalize the force response using Ising models and dissect the folding pathway of CTPRs under mechanical load, revealing how the repeat arrays form from the center toward both termini simultaneously. Most strikingly, we also directly observe the protein's superhelical tertiary structure in the force signal. Using protein engineering, crystallography, and single-molecule experiments, we show that the superhelical geometry can be altered by carefully placed amino acid substitutions, and we examine how these sequence changes affect intrinsic repeat stability and inter-repeat coupling. Our findings provide the means to dissect and modulate repeat-protein stability and dynamics, which will be essential for researchers to understand the function of natural repeat proteins and to exploit artificial repeats proteins in nanotechnology and biomedical applications.
Collapse
Affiliation(s)
- Marie Synakewicz
- Department of Pharmacology, University of Cambridge, Tennis Court Road, Cambridge CB2 1PD, United Kingdom†
| | - Rohan S Eapen
- Department of Pharmacology, University of Cambridge, Tennis Court Road, Cambridge CB2 1PD, United Kingdom†
| | - Albert Perez-Riba
- Department of Pharmacology, University of Cambridge, Tennis Court Road, Cambridge CB2 1PD, United Kingdom†
| | - Pamela J E Rowling
- Department of Pharmacology, University of Cambridge, Tennis Court Road, Cambridge CB2 1PD, United Kingdom†
| | - Daniela Bauer
- Physik-Department, Technische Universität München, James-Franck-Straße 1, 85748 Garching, Germany
| | - Andreas Weißl
- Physik-Department, Technische Universität München, James-Franck-Straße 1, 85748 Garching, Germany
| | - Gerhard Fischer
- Department of Biochemistry, University of Cambridge, Tennis Court Road, Cambridge, CB2 1GA, United Kingdom
| | - Marko Hyvönen
- Department of Biochemistry, University of Cambridge, Tennis Court Road, Cambridge, CB2 1GA, United Kingdom
| | - Matthias Rief
- Physik-Department, Technische Universität München, James-Franck-Straße 1, 85748 Garching, Germany
| | - Laura S Itzhaki
- Department of Pharmacology, University of Cambridge, Tennis Court Road, Cambridge CB2 1PD, United Kingdom†
| | - Johannes Stigler
- Gene Center Munich, Ludwig-Maximilians-Universität München, Feodor-Lynen-Straße 25, 81377 München, Germany
| |
Collapse
|
34
|
Synakewicz M, Eapen RS, Perez-Riba A, Rowling PJE, Bauer D, Weißl A, Fischer G, Hyvönen M, Rief M, Itzhaki LS, Stigler J. Unraveling the Mechanics of a Repeat-Protein Nanospring: From Folding of Individual Repeats to Fluctuations of the Superhelix. ACS NANO 2022; 16:3895-3905. [PMID: 35258937 PMCID: PMC8944806 DOI: 10.1021/acsnano.1c09162] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/16/2021] [Accepted: 01/27/2022] [Indexed: 06/14/2023]
Abstract
Tandem-repeat proteins comprise small secondary structure motifs that stack to form one-dimensional arrays with distinctive mechanical properties that are proposed to direct their cellular functions. Here, we use single-molecule optical tweezers to study the folding of consensus-designed tetratricopeptide repeats (CTPRs), superhelical arrays of short helix-turn-helix motifs. We find that CTPRs display a spring-like mechanical response in which individual repeats undergo rapid equilibrium fluctuations between partially folded and unfolded conformations. We rationalize the force response using Ising models and dissect the folding pathway of CTPRs under mechanical load, revealing how the repeat arrays form from the center toward both termini simultaneously. Most strikingly, we also directly observe the protein's superhelical tertiary structure in the force signal. Using protein engineering, crystallography, and single-molecule experiments, we show that the superhelical geometry can be altered by carefully placed amino acid substitutions, and we examine how these sequence changes affect intrinsic repeat stability and inter-repeat coupling. Our findings provide the means to dissect and modulate repeat-protein stability and dynamics, which will be essential for researchers to understand the function of natural repeat proteins and to exploit artificial repeats proteins in nanotechnology and biomedical applications.
Collapse
Affiliation(s)
- Marie Synakewicz
- Department
of Pharmacology, University of Cambridge, Tennis Court Road, Cambridge CB2 1PD, United Kingdom
| | - Rohan S. Eapen
- Department
of Pharmacology, University of Cambridge, Tennis Court Road, Cambridge CB2 1PD, United Kingdom
| | - Albert Perez-Riba
- Department
of Pharmacology, University of Cambridge, Tennis Court Road, Cambridge CB2 1PD, United Kingdom
| | - Pamela J. E. Rowling
- Department
of Pharmacology, University of Cambridge, Tennis Court Road, Cambridge CB2 1PD, United Kingdom
| | - Daniela Bauer
- Physik-Department, Technische Universität München, James-Franck-Straße 1, 85748 Garching, Germany
| | - Andreas Weißl
- Physik-Department, Technische Universität München, James-Franck-Straße 1, 85748 Garching, Germany
| | - Gerhard Fischer
- Department
of Biochemistry, University of Cambridge, Tennis Court Road, Cambridge, CB2 1GA, United Kingdom
| | - Marko Hyvönen
- Department
of Biochemistry, University of Cambridge, Tennis Court Road, Cambridge, CB2 1GA, United Kingdom
| | - Matthias Rief
- Physik-Department, Technische Universität München, James-Franck-Straße 1, 85748 Garching, Germany
| | - Laura S. Itzhaki
- Department
of Pharmacology, University of Cambridge, Tennis Court Road, Cambridge CB2 1PD, United Kingdom
| | - Johannes Stigler
- Gene
Center Munich, Ludwig-Maximilians-Universität
München, Feodor-Lynen-Straße 25, 81377 München, Germany
| |
Collapse
|
35
|
Miller JG, Hughes SA, Modlin C, Conticello VP. Structures of synthetic helical filaments and tubes based on peptide and peptido-mimetic polymers. Q Rev Biophys 2022; 55:1-103. [PMID: 35307042 DOI: 10.1017/s0033583522000014] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
AbstractSynthetic peptide and peptido-mimetic filaments and tubes represent a diverse class of nanomaterials with a broad range of potential applications, such as drug delivery, vaccine development, synthetic catalyst design, encapsulation, and energy transduction. The structures of these filaments comprise supramolecular polymers based on helical arrangements of subunits that can be derived from self-assembly of monomers based on diverse structural motifs. In recent years, structural analyses of these materials at near-atomic resolution (NAR) have yielded critical insights into the relationship between sequence, local conformation, and higher-order structure and morphology. This structural information offers the opportunity for development of new tools to facilitate the predictable and reproduciblede novodesign of synthetic helical filaments. However, these studies have also revealed several significant impediments to the latter process – most notably, the common occurrence of structural polymorphism due to the lability of helical symmetry in structural space. This article summarizes the current state of knowledge on the structures of designed peptide and peptido-mimetic filamentous assemblies, with a focus on structures that have been solved to NAR for which reliable atomic models are available.
Collapse
Affiliation(s)
- Jessalyn G Miller
- Department of Chemistry, Emory University, 1515 Dickey Drive, Atlanta, GA30322
| | - Spencer A Hughes
- Department of Chemistry, Emory University, 1515 Dickey Drive, Atlanta, GA30322
| | - Charles Modlin
- Department of Chemistry, Emory University, 1515 Dickey Drive, Atlanta, GA30322
| | | |
Collapse
|
36
|
Pražnikar J, Attygalle NT. Quantitative analysis of visual codewords of a protein distance matrix. PLoS One 2022; 17:e0263566. [PMID: 35120181 PMCID: PMC8815937 DOI: 10.1371/journal.pone.0263566] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2021] [Accepted: 01/24/2022] [Indexed: 12/02/2022] Open
Abstract
3D protein structures can be analyzed using a distance matrix calculated as the pairwise distance between all Cα atoms in the protein model. Although researchers have efficiently used distance matrices to classify proteins and find homologous proteins, much less work has been done on quantitative analysis of distance matrix features. Therefore, the distance matrix was analyzed as gray scale image using KAZE feature extractor algorithm with Bag of Visual Words model. In this study, each protein was represented as a histogram of visual codewords. The analysis showed that a very small number of codewords (~1%) have a high relative frequency (> 0.25) and that the majority of codewords have a relative frequency around 0.05. We have also shown that there is a relationship between the frequency of codewords and the position of the features in a distance matrix. The codewords that are more frequent are located closer to the main diagonal. Less frequent codewords, on the other hand, are located in the corners of the distance matrix, far from the main diagonal. Moreover, the analysis showed a correlation between the number of unique codewords and the 3D repeats in the protein structure. The solenoid and tandem repeats proteins have a significantly lower number of unique codewords than the globular proteins. Finally, the codeword histograms and Support Vector Machine (SVM) classifier were used to classify solenoid and globular proteins. The result showed that the SVM classifier fed with codeword histograms correctly classified 352 out of 354 proteins.
Collapse
Affiliation(s)
- Jure Pražnikar
- Faculty of Mathematics, Natural Sciences and Information Technologies, University of Primorska, Koper, Slovenia
- Department of Biochemistry, Molecular and Structural Biology, Institute Jožef Stefan, Ljubljana, Slovenia
- * E-mail:
| | - Nuwan Tharanga Attygalle
- Faculty of Mathematics, Natural Sciences and Information Technologies, University of Primorska, Koper, Slovenia
| |
Collapse
|
37
|
Becerra A, Muñoz-Velasco I, Aguilar-Cámara A, Cottom-Salas W, Cruz-González A, Vázquez-Salazar A, Hernández-Morales R, Jácome R, Campillo-Balderas JA, Lazcano A. Two short low complexity regions (LCRs) are hallmark sequences of the Delta SARS-CoV-2 variant spike protein. Sci Rep 2022; 12:936. [PMID: 35042962 PMCID: PMC8766472 DOI: 10.1038/s41598-022-04976-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2021] [Accepted: 01/04/2022] [Indexed: 11/24/2022] Open
Abstract
Low complexity regions (LCRs) are protein sequences formed by a set of compositionally biased residues. LCRs are extremely abundant in cellular proteins and have also been reported in viruses, where they may partake in evasion of the host immune system. Analyses of 28,231 SARS-CoV-2 whole proteomes and of 261,051 spike protein sequences revealed the presence of four extremely conserved LCRs in the spike protein of several SARS-CoV-2 variants. With the exception of Iota, where it is absent, the Spike LCR-1 is present in the signal peptide of 80.57% of the Delta variant sequences, and in other variants of concern and interest. The Spike LCR-2 is highly prevalent (79.87%) in Iota. Two distinctive LCRs are present in the Delta spike protein. The Delta Spike LCR-3 is present in 99.19% of the analyzed sequences, and the Delta Spike LCR-4 in 98.3% of the same set of proteins. These two LCRs are located in the furin cleavage site and HR1 domain, respectively, and may be considered hallmark traits of the Delta variant. The presence of the medically-important point mutations P681R and D950N in these LCRs, combined with the ubiquity of these regions in the highly contagious Delta variant opens the possibility that they may play a role in its rapid spread.
Collapse
Affiliation(s)
- Arturo Becerra
- Facultad de Ciencias, Universidad Nacional Autónoma de México, 04510, Mexico City, Mexico
| | - Israel Muñoz-Velasco
- Facultad de Ciencias, Universidad Nacional Autónoma de México, 04510, Mexico City, Mexico
| | | | - Wolfgang Cottom-Salas
- Facultad de Ciencias, Universidad Nacional Autónoma de México, 04510, Mexico City, Mexico
- Escuela Nacional Preparatoria, Plantel 8 Miguel E. Schulz, Universidad Nacional Autónoma de México, 01600, Mexico City, Mexico
| | - Adrián Cruz-González
- Facultad de Ciencias, Universidad Nacional Autónoma de México, 04510, Mexico City, Mexico
| | - Alberto Vázquez-Salazar
- Department of Chemical and Biomolecular Engineering, University of California, Los Angeles, CA, 90095, USA
| | | | - Rodrigo Jácome
- Facultad de Ciencias, Universidad Nacional Autónoma de México, 04510, Mexico City, Mexico
| | | | - Antonio Lazcano
- Facultad de Ciencias, Universidad Nacional Autónoma de México, 04510, Mexico City, Mexico.
- El Colegio Nacional, 06470, Mexico City, Mexico.
| |
Collapse
|
38
|
Sequence and Structure-Based Analyses of Human Ankyrin Repeats. MOLECULES (BASEL, SWITZERLAND) 2022; 27:molecules27020423. [PMID: 35056738 PMCID: PMC8781854 DOI: 10.3390/molecules27020423] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/30/2021] [Revised: 12/27/2021] [Accepted: 01/04/2022] [Indexed: 11/24/2022]
Abstract
Ankyrin is one of the most abundant protein repeat families found across all forms of life. It is found in a variety of multi-domain and single domain proteins in humans with diverse number of repeating units. They are observed to occur in several functionally diverse proteins, such as transcriptional initiators, cell cycle regulators, cytoskeletal organizers, ion transporters, signal transducers, developmental regulators, and toxins, and, consequently, defects in ankyrin repeat proteins have been associated with a number of human diseases. In this study, we have classified the human ankyrin proteins into clusters based on the sequence similarity in their ankyrin repeat domains. We analyzed the amino acid compositional bias and consensus ankyrin motif sequence of the clusters to understand the diversity of the human ankyrin proteins. We carried out network-based structural analysis of human ankyrin proteins across different clusters and showed the association of conserved residues with topologically important residues identified by network centrality measures. The analysis of conserved and structurally important residues helps in understanding their role in structural stability and function of these proteins. In this paper, we also discuss the significance of these conserved residues in disease association across the human ankyrin protein clusters.
Collapse
|
39
|
Qi Y, Zhou D, Kessler JL, Qiu R, Yu SM, Li G, Qin Z, Li Y. Terminal repeats impact collagen triple-helix stability through hydrogen bonding. Chem Sci 2022; 13:12567-12576. [PMID: 36382282 PMCID: PMC9629113 DOI: 10.1039/d2sc03666e] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2022] [Accepted: 10/10/2022] [Indexed: 11/22/2022] Open
Abstract
Nearly 30% of human proteins have tandem repeating sequences. Structural understanding of the terminal repeats is well-established for many repeat proteins with the common α-helix and β-sheet foldings. By contrast, the sequence–structure interplay of the terminal repeats of the collagen triple-helix remains to be fully explored. As the most abundant human repeat protein and the most prevalent structural component of the extracellular matrix, collagen features a hallmark triple-helix formed by three supercoiled polypeptide chains of long repeating sequences of the Gly–X–Y triplets. Here, with CD characterization of 28 collagen-mimetic peptides (CMPs) featuring various terminal motifs, as well as DSC measurements, crystal structure analysis, and computational simulations, we show that CMPs only differing in terminal repeat may have distinct end structures and stabilities. We reveal that the cross-chain hydrogen bonding mediated by the terminal repeat is key to maintaining the triple-helix's end structure, and that disruption of it with a single amide to carboxylate substitution can lead to destabilization as drastic as 19 °C. We further demonstrate that the terminal repeat also impacts how strong the CMP strands form hybrid triple-helices with unfolded natural collagen chains in tissue. Our findings provide a spatial profile of hydrogen bonding within the CMP triple-helix, marking a critical guideline for future crystallographic or NMR studies of collagen, and algorithms for predicting triple-helix stability, as well as peptide-based collagen assemblies and materials. This study will also inspire new understanding of the sequence–structure relationship of many other complex structural proteins with repeating sequences. Collagen mimetic peptides (CMPs) only differing in terminal repeat have distinct stabilities and end structures due to a spatial hydrogen bonding profile that is useful for future crystallography, algorithm prediction, and materials of collagen.![]()
Collapse
Affiliation(s)
- Yingying Qi
- Guangdong Provincial Key Laboratory of Biomedical Imaging and Guangdong Provincial Engineering Research Center of Molecular Imaging, The Fifth Affiliated Hospital, Sun Yat-sen University, Zhuhai 519000, China
- Cardiac Surgery and Structural Heart Disease Unit of Cardiovascular Center, The Fifth Affiliated Hospital, Sun Yat-sen University, Zhuhai 519000, China
- Department of Radiology, The Fifth Affiliated Hospital, Sun Yat-sen University, Zhuhai 519000, China
| | - Daoning Zhou
- Guangdong Provincial Key Laboratory of Biomedical Imaging and Guangdong Provincial Engineering Research Center of Molecular Imaging, The Fifth Affiliated Hospital, Sun Yat-sen University, Zhuhai 519000, China
| | - Julian L. Kessler
- Department of Biomedical Engineering, University of Utah, Salt Lake City, Utah 84112, USA
| | - Rongmao Qiu
- Guangdong Provincial Key Laboratory of Biomedical Imaging and Guangdong Provincial Engineering Research Center of Molecular Imaging, The Fifth Affiliated Hospital, Sun Yat-sen University, Zhuhai 519000, China
| | - S. Michael Yu
- Department of Biomedical Engineering, University of Utah, Salt Lake City, Utah 84112, USA
| | - Gang Li
- Cardiac Surgery and Structural Heart Disease Unit of Cardiovascular Center, The Fifth Affiliated Hospital, Sun Yat-sen University, Zhuhai 519000, China
| | - Zhao Qin
- Department of Civil & Environmental Engineering, College of Engineering & Computer Science, Syracuse University, Syracuse, New York 13244, USA
| | - Yang Li
- Guangdong Provincial Key Laboratory of Biomedical Imaging and Guangdong Provincial Engineering Research Center of Molecular Imaging, The Fifth Affiliated Hospital, Sun Yat-sen University, Zhuhai 519000, China
| |
Collapse
|
40
|
Chakrabarty B, Parekh N. DbStRiPs: Database of structural repeats in proteins. Protein Sci 2022; 31:23-36. [PMID: 33641184 PMCID: PMC8740836 DOI: 10.1002/pro.4052] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2020] [Revised: 02/11/2021] [Accepted: 02/15/2021] [Indexed: 01/03/2023]
Abstract
Recent interest in repeat proteins has arisen due to stable structural folds, high evolutionary conservation and repertoire of functions provided by these proteins. However, repeat proteins are poorly characterized because of high sequence variation between repeating units and structure-based identification and classification of repeats is desirable. Using a robust network-based pipeline, manual curation and Kajava's structure-based classification schema, we have developed a database of tandem structural repeats, Database of Structural Repeats in Proteins (DbStRiPs). A unique feature of this database is that available knowledge on sequence repeat families is incorporated by mapping Pfam classification scheme onto structural classification. Integration of sequence and structure-based classifications help in identifying different functional groups within the same structural subclass, leading to refinement in the annotation of repeat proteins. Analysis of complete Protein Data Bank revealed 16,472 repeat annotations in 15,141 protein chains, one previously uncharacterized novel protein repeat family (PRF), named left-handed beta helix, and 33 protein repeat clusters (PRCs). Based on their unique structural motif, ~79% of these repeat proteins are classified in one of the 14 PRFs or 33 PRCs, and the remaining are grouped as unclassified repeat proteins. Each repeat protein is provided with a detailed annotation in DbStRiPs that includes start and end boundaries of repeating units, copy number, secondary and tertiary structure view, repeat class/subclass, disease association, MSA of repeating units and cross-references to various protein pattern databases, human protein atlas and interaction resources. DbStRiPs provides easy search and download options to high-quality annotations of structural repeat proteins (URL: http://bioinf.iiit.ac.in/dbstrips/).
Collapse
Affiliation(s)
- Broto Chakrabarty
- Centre for Computational Natural Sciences and Bioinformatics, International Institute of Information TechnologyHyderabadIndia
| | - Nita Parekh
- Centre for Computational Natural Sciences and Bioinformatics, International Institute of Information TechnologyHyderabadIndia
| |
Collapse
|
41
|
Abstract
Rhodopsins are light-activated proteins displaying an enormous versatility of function as cation/anion pumps or sensing environmental stimuli and are widely distributed across all domains of life. Even with wide sequence divergence and uncertain evolutionary linkages between microbial (type 1) and animal (type 2) rhodopsins, the membrane orientation of the core structural scaffold of both was presumed universal. This was recently amended through the discovery of heliorhodopsins (HeRs; type 3), that, in contrast to known rhodopsins, display an inverted membrane topology and yet retain similarities in sequence, structure, and the light-activated response. While no ion-pumping activity has been demonstrated for HeRs and multiple crystal structures are available, fundamental questions regarding their cellular and ecological function or even their taxonomic distribution remain unresolved. Here, we investigated HeR function and distribution using genomic/metagenomic data with protein domain fusions, contextual genomic information, and gene coexpression analysis with strand-specific metatranscriptomics. We bring to resolution the debated monoderm/diderm occurrence patterns and show that HeRs are restricted to monoderms. Moreover, we provide compelling evidence that HeRs are a novel type of sensory rhodopsins linked to histidine kinases and other two-component system genes across phyla. In addition, we also describe two novel putative signal-transducing domains fused to some HeRs. We posit that HeRs likely function as generalized light-dependent switches involved in the mitigation of light-induced oxidative stress and metabolic circuitry regulation. Their role as sensory rhodopsins is corroborated by their photocycle dynamics and their presence/function in monoderms is likely connected to the higher sensitivity of these organisms to light-induced damage. IMPORTANCE Heliorhodopsins are enigmatic, novel rhodopsins with a membrane orientation that is opposite to all known rhodopsins. However, their cellular and ecological functions are unknown, and even their taxonomic distribution remains a subject of debate. We provide evidence that HeRs are a novel type of sensory rhodopsins linked to histidine kinases and other two-component system genes across phyla boundaries. In support of this, we also identify two novel putative signal transducing domains in HeRs that are fused with them. We also observe linkages of HeRs to genes involved in mitigation of light-induced oxidative stress and increased carbon and nitrogen metabolism. Finally, we synthesize these findings into a framework that connects HeRs with the cellular response to light in monoderms, activating light-induced oxidative stress defenses along with carbon/nitrogen metabolic circuitries. These findings are consistent with the evolutionary, taxonomic, structural, and genomic data available so far.
Collapse
|
42
|
Hallinan JP, Doyle LA, Shen BW, Gewe MM, Takushi B, Kennedy MA, Friend D, Roberts JM, Bradley P, Stoddard BL. Design of functionalised circular tandem repeat proteins with longer repeat topologies and enhanced subunit contact surfaces. Commun Biol 2021; 4:1240. [PMID: 34716407 PMCID: PMC8556268 DOI: 10.1038/s42003-021-02766-y] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2021] [Accepted: 10/07/2021] [Indexed: 01/16/2023] Open
Abstract
Circular tandem repeat proteins (‘cTRPs’) are de novo designed protein scaffolds (in this and prior studies, based on antiparallel two-helix bundles) that contain repeated protein sequences and structural motifs and form closed circular structures. They can display significant stability and solubility, a wide range of sizes, and are useful as protein display particles for biotechnology applications. However, cTRPs also demonstrate inefficient self-assembly from smaller subunits. In this study, we describe a new generation of cTRPs, with longer repeats and increased interaction surfaces, which enhanced the self-assembly of two significantly different sizes of homotrimeric constructs. Finally, we demonstrated functionalization of these constructs with (1) a hexameric array of peptide-binding SH2 domains, and (2) a trimeric array of anti-SARS CoV-2 VHH domains. The latter proved capable of sub-nanomolar binding affinities towards the viral receptor binding domain and potent viral neutralization function. Jazmine Hallinan et al. report the development of a new generation of circular tandem repeat proteins with enhanced self-assembly. Functionalisation of these constructs with SARS CoV-2 VHH domains resulted in sub-nanomolar binding affinity to the viral receptor binding domain.
Collapse
Affiliation(s)
- Jazmine P Hallinan
- Division of Basic Sciences, Fred Hutchinson Cancer Research Center, 1100 Fairview Ave. North, Seattle, WA, 98109, USA
| | - Lindsey A Doyle
- Division of Basic Sciences, Fred Hutchinson Cancer Research Center, 1100 Fairview Ave. North, Seattle, WA, 98109, USA
| | - Betty W Shen
- Division of Basic Sciences, Fred Hutchinson Cancer Research Center, 1100 Fairview Ave. North, Seattle, WA, 98109, USA
| | - Mesfin M Gewe
- Lumen Bioscience Inc., 1441 North 34th Street, Seattle, WA, 98103, USA
| | - Brittany Takushi
- Division of Basic Sciences, Fred Hutchinson Cancer Research Center, 1100 Fairview Ave. North, Seattle, WA, 98109, USA
| | - Madison A Kennedy
- Division of Basic Sciences, Fred Hutchinson Cancer Research Center, 1100 Fairview Ave. North, Seattle, WA, 98109, USA
| | - Della Friend
- Division of Basic Sciences, Fred Hutchinson Cancer Research Center, 1100 Fairview Ave. North, Seattle, WA, 98109, USA
| | - James M Roberts
- Lumen Bioscience Inc., 1441 North 34th Street, Seattle, WA, 98103, USA
| | - Philip Bradley
- Division of Basic Sciences, Fred Hutchinson Cancer Research Center, 1100 Fairview Ave. North, Seattle, WA, 98109, USA. .,Division of Public Health Sciences and Program in Computational Biology, Fred Hutchinson Cancer Research Center, 1100 Fairview Ave. N., Seattle, WA, 98009, USA.
| | - Barry L Stoddard
- Division of Basic Sciences, Fred Hutchinson Cancer Research Center, 1100 Fairview Ave. North, Seattle, WA, 98109, USA.
| |
Collapse
|
43
|
Deryusheva EI, Machulin AV, Galzitskaya OV. Structural, Functional, and Evolutionary Characteristics of Proteins with Repeats. Mol Biol 2021. [DOI: 10.1134/s0026893321040038] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
|
44
|
Sawada T, Fujita M. Orderly Entangled Nanostructures of Metal–Peptide Strands. BULLETIN OF THE CHEMICAL SOCIETY OF JAPAN 2021. [DOI: 10.1246/bcsj.20210218] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Affiliation(s)
- Tomohisa Sawada
- Department of Applied Chemistry, School of Engineering, The University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo 113-8656, Japan
- Precursory Research for Embryonic Science and Technology (PRESTO), Japan Science and Technology Agency (JST), 4-1-8 Honcho, Kawaguchi, Saitama 332-0012, Japan
| | - Makoto Fujita
- Department of Applied Chemistry, School of Engineering, The University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo 113-8656, Japan
- Division of Advanced Molecular Science, Institute for Molecular Science, National Institutes of Natural Sciences, 5-1 Higashiyama, Myodaiji-cho, Okazaki, Aichi 444-8787, Japan
| |
Collapse
|
45
|
Gottin C, Dievart A, Summo M, Droc G, Périn C, Ranwez V, Chantret N. A new comprehensive annotation of leucine-rich repeat-containing receptors in rice. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2021; 108:492-508. [PMID: 34382706 PMCID: PMC9292849 DOI: 10.1111/tpj.15456] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/11/2021] [Revised: 07/23/2021] [Accepted: 07/30/2021] [Indexed: 06/13/2023]
Abstract
Oryza sativa (rice) plays an essential food security role for more than half of the world's population. Obtaining crops with high levels of disease resistance is a major challenge for breeders, especially today, given the urgent need for agriculture to be more sustainable. Plant resistance genes are mainly encoded by three large leucine-rich repeat (LRR)-containing receptor (LRR-CR) families: the LRR-receptor-like kinase (LRR-RLK), LRR-receptor-like protein (LRR-RLP) and nucleotide-binding LRR receptor (NLR). Using lrrprofiler, a pipeline that we developed to annotate and classify these proteins, we compared three publicly available annotations of the rice Nipponbare reference genome. The extended discrepancies that we observed for LRR-CR gene models led us to perform an in-depth manual curation of their annotations while paying special attention to nonsense mutations. We then transferred this manually curated annotation to Kitaake, a cultivar that is closely related to Nipponbare, using an optimized strategy. Here, we discuss the breakthrough achieved by manual curation when comparing genomes and, in addition to 'functional' and 'structural' annotations, we propose that the community adopts this approach, which we call 'comprehensive' annotation. The resulting data are crucial for further studies on the natural variability and evolution of LRR-CR genes in order to promote their use in breeding future resilient varieties.
Collapse
Affiliation(s)
- Céline Gottin
- UMR AGAP InstitutUniv MontpellierCIRAD, INRAEInstitut AgroF‐34398MontpellierFrance
- CIRADUMR AGAP InstitutF‐34398MontpellierFrance
| | - Anne Dievart
- UMR AGAP InstitutUniv MontpellierCIRAD, INRAEInstitut AgroF‐34398MontpellierFrance
- CIRADUMR AGAP InstitutF‐34398MontpellierFrance
| | - Marilyne Summo
- UMR AGAP InstitutUniv MontpellierCIRAD, INRAEInstitut AgroF‐34398MontpellierFrance
- CIRADUMR AGAP InstitutF‐34398MontpellierFrance
| | - Gaëtan Droc
- UMR AGAP InstitutUniv MontpellierCIRAD, INRAEInstitut AgroF‐34398MontpellierFrance
- CIRADUMR AGAP InstitutF‐34398MontpellierFrance
| | - Christophe Périn
- UMR AGAP InstitutUniv MontpellierCIRAD, INRAEInstitut AgroF‐34398MontpellierFrance
- CIRADUMR AGAP InstitutF‐34398MontpellierFrance
| | - Vincent Ranwez
- UMR AGAP InstitutUniv MontpellierCIRAD, INRAEInstitut AgroF‐34398MontpellierFrance
| | - Nathalie Chantret
- UMR AGAP InstitutUniv MontpellierCIRAD, INRAEInstitut AgroF‐34398MontpellierFrance
| |
Collapse
|
46
|
Mier P, Paladin L, Tamana S, Petrosian S, Hajdu-Soltész B, Urbanek A, Gruca A, Plewczynski D, Grynberg M, Bernadó P, Gáspári Z, Ouzounis CA, Promponas VJ, Kajava AV, Hancock JM, Tosatto SCE, Dosztanyi Z, Andrade-Navarro MA. Disentangling the complexity of low complexity proteins. Brief Bioinform 2021; 21:458-472. [PMID: 30698641 PMCID: PMC7299295 DOI: 10.1093/bib/bbz007] [Citation(s) in RCA: 51] [Impact Index Per Article: 17.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2018] [Revised: 12/19/2018] [Accepted: 01/07/2019] [Indexed: 12/31/2022] Open
Abstract
There are multiple definitions for low complexity regions (LCRs) in protein sequences, with all of them broadly considering LCRs as regions with fewer amino acid types compared to an average composition. Following this view, LCRs can also be defined as regions showing composition bias. In this critical review, we focus on the definition of sequence complexity of LCRs and their connection with structure. We present statistics and methodological approaches that measure low complexity (LC) and related sequence properties. Composition bias is often associated with LC and disorder, but repeats, while compositionally biased, might also induce ordered structures. We illustrate this dichotomy, and more generally the overlaps between different properties related to LCRs, using examples. We argue that statistical measures alone cannot capture all structural aspects of LCRs and recommend the combined usage of a variety of predictive tools and measurements. While the methodologies available to study LCRs are already very advanced, we foresee that a more comprehensive annotation of sequences in the databases will enable the improvement of predictions and a better understanding of the evolution and the connection between structure and function of LCRs. This will require the use of standards for the generation and exchange of data describing all aspects of LCRs. Short abstract There are multiple definitions for low complexity regions (LCRs) in protein sequences. In this critical review, we focus on the definition of sequence complexity of LCRs and their connection with structure. We present statistics and methodological approaches that measure low complexity (LC) and related sequence properties. Composition bias is often associated with LC and disorder, but repeats, while compositionally biased, might also induce ordered structures. We illustrate this dichotomy, plus overlaps between different properties related to LCRs, using examples.
Collapse
Affiliation(s)
- Pablo Mier
- Institute of Organismic and Molecular Evolution, Johannes Gutenberg University of Mainz, Mainz, Germany
| | - Lisanna Paladin
- Department of Biomedical Science, University of Padova, Padova, Italy
| | - Stella Tamana
- Bioinformatics Research Laboratory, Department of Biological Sciences, University of Cyprus, Nicosia, Cyprus
| | - Sophia Petrosian
- Biological Computation and Process Laboratory, Chemical Process & Energy Resources Institute, Centre for Research & Technology Hellas, Thessalonica, Greece
| | - Borbála Hajdu-Soltész
- MTA-ELTE Lendület Bioinformatics Research Group, Department of Biochemistry, Eötvös Loránd University, Budapest, Hungary
| | - Annika Urbanek
- Centre de Biochimie Structurale, INSERM, CNRS, Université de Montpellier, Montpellier, France
| | - Aleksandra Gruca
- Institute of Informatics, Silesian University of Technology, Gliwice, Poland
| | - Dariusz Plewczynski
- Center of New Technologies, University of Warsaw, Warsaw, Poland.,Faculty of Mathematics and Information Science, Warsaw University of Technology, Warsaw, Poland
| | | | - Pau Bernadó
- Centre de Biochimie Structurale, INSERM, CNRS, Université de Montpellier, Montpellier, France
| | - Zoltán Gáspári
- Faculty of Information Technology and Bionics, Pázmány Péter Catholic University, Budapest, Hungary
| | - Christos A Ouzounis
- Biological Computation and Process Laboratory, Chemical Process & Energy Resources Institute, Centre for Research & Technology Hellas, Thessalonica, Greece
| | - Vasilis J Promponas
- Bioinformatics Research Laboratory, Department of Biological Sciences, University of Cyprus, Nicosia, Cyprus
| | - Andrey V Kajava
- Centre de Recherche en Biologie Cellulaire de Montpellier, CNRS-UMR, Institut de Biologie Computationnelle, Universite de Montpellier, Montpellier, France.,Institute of Bioengineering, University ITMO, St. Petersburg, Russia
| | - John M Hancock
- Earlham Institute, Norwich, UK.,ELIXIR Hub, Welcome Genome Campus, Hinxton, UK
| | - Silvio C E Tosatto
- Department of Biomedical Science, University of Padova, Padova, Italy.,CNR Institute of Neuroscience, Padova, Italy
| | - Zsuzsanna Dosztanyi
- MTA-ELTE Lendület Bioinformatics Research Group, Department of Biochemistry, Eötvös Loránd University, Budapest, Hungary
| | - Miguel A Andrade-Navarro
- Institute of Organismic and Molecular Evolution, Johannes Gutenberg University of Mainz, Mainz, Germany
| |
Collapse
|
47
|
Comparative Genomic Analyses of Flavobacterium psychrophilum Isolates Reveals New Putative Genetic Determinants of Virulence Traits. Microorganisms 2021; 9:microorganisms9081658. [PMID: 34442736 PMCID: PMC8400371 DOI: 10.3390/microorganisms9081658] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2021] [Revised: 07/23/2021] [Accepted: 07/29/2021] [Indexed: 11/29/2022] Open
Abstract
The fish pathogen Flavobacterium psychrophilum is currently one of the main pathogenic bacteria hampering the productivity of salmonid farming worldwide. Although putative virulence determinants have been identified, the genetic basis for variation in virulence of F. psychrophilum is not fully understood. In this study, we analyzed whole-genome sequences of a collection of 25 F. psychrophilum isolates from Baltic Sea countries and compared genomic information with a previous determination of their virulence in juvenile rainbow trout. The results revealed a conserved population of F. psychrophilum that were consistently present across the Baltic Sea countries, with no clear association between genomic repertoire, phylogenomic, or gene distribution and virulence traits. However, analysis of the entire genome of four F. psychrophilum isolates by hybrid assembly provided an unprecedented resolution for discriminating even highly related isolates. The results showed that isolates with different virulence phenotypes harbored genetic variances on a number of consecutive leucine-rich repeat (LRR) proteins, repetitive motifs in gliding motility-associated protein, and the insertion of transposable elements into intergenic and genic regions. Thus, these findings provide novel insights into the genetic variation of these elements and their putative role in the modulation of F. psychrophilum virulence.
Collapse
|
48
|
The Right-Handed Parallel β-Helix Topology of Erwinia chrysanthemi Pectin Methylesterase Is Intimately Associated with Both Sequential Folding and Resistance to High Pressure. Biomolecules 2021; 11:biom11081083. [PMID: 34439750 PMCID: PMC8392785 DOI: 10.3390/biom11081083] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2021] [Revised: 07/01/2021] [Accepted: 07/03/2021] [Indexed: 11/30/2022] Open
Abstract
The complex topologies of large multi-domain globular proteins make the study of their folding and assembly particularly demanding. It is often characterized by complex kinetics and undesired side reactions, such as aggregation. The structural simplicity of tandem-repeat proteins, which are characterized by the repetition of a basic structural motif and are stabilized exclusively by sequentially localized contacts, has provided opportunities for dissecting their folding landscapes. In this study, we focus on the Erwinia chrysanthemi pectin methylesterase (342 residues), an all-β pectinolytic enzyme with a right-handed parallel β-helix structure. Chemicals and pressure were chosen as denaturants and a variety of optical techniques were used in conjunction with stopped-flow equipment to investigate the folding mechanism of the enzyme at 25 °C. Under equilibrium conditions, both chemical- and pressure-induced unfolding show two-state transitions, with average conformational stability (ΔG° = 35 ± 5 kJ·mol−1) but exceptionally high resistance to pressure (Pm = 800 ± 7 MPa). Stopped-flow kinetic experiments revealed a very rapid (τ < 1 ms) hydrophobic collapse accompanied by the formation of an extended secondary structure but did not reveal stable tertiary contacts. This is followed by three distinct cooperative phases and the significant population of two intermediate species. The kinetics followed by intrinsic fluorescence shows a lag phase, strongly indicating that these intermediates are productive species on a sequential folding pathway, for which we propose a plausible model. These combined data demonstrate that even a large repeat protein can fold in a highly cooperative manner.
Collapse
|
49
|
Woolfson DN. A Brief History of De Novo Protein Design: Minimal, Rational, and Computational. J Mol Biol 2021; 433:167160. [PMID: 34298061 DOI: 10.1016/j.jmb.2021.167160] [Citation(s) in RCA: 64] [Impact Index Per Article: 21.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2021] [Revised: 07/07/2021] [Accepted: 07/12/2021] [Indexed: 12/26/2022]
Abstract
Protein design has come of age, but how will it mature? In the 1980s and the 1990s, the primary motivation for de novo protein design was to test our understanding of the informational aspect of the protein-folding problem; i.e., how does protein sequence determine protein structure and function? This necessitated minimal and rational design approaches whereby the placement of each residue in a design was reasoned using chemical principles and/or biochemical knowledge. At that time, though with some notable exceptions, the use of computers to aid design was not widespread. Over the past two decades, the tables have turned and computational protein design is firmly established. Here, I illustrate this progress through a timeline of de novo protein structures that have been solved to atomic resolution and deposited in the Protein Data Bank. From this, it is clear that the impact of rational and computational design has been considerable: More-complex and more-sophisticated designs are being targeted with many being resolved to atomic resolution. Furthermore, our ability to generate and manipulate synthetic proteins has advanced to a point where they are providing realistic alternatives to natural protein functions for applications both in vitro and in cells. Also, and increasingly, computational protein design is becoming accessible to non-specialists. This all begs the questions: Is there still a place for minimal and rational design approaches? And, what challenges lie ahead for the burgeoning field of de novo protein design as a whole?
Collapse
Affiliation(s)
- Derek N Woolfson
- School of Chemistry, University of Bristol, Cantock's Close, Bristol BS8 1TS, UK; School of Biochemistry, University of Bristol, Biomedical Sciences Building, University Walk, Bristol BS8 1TD, UK; Bristol BioDesign Institute, University of Bristol, Life Sciences Building, Tyndall Avenue, Bristol BS8 1TQ, UK.
| |
Collapse
|
50
|
Izert MA, Szybowska PE, Górna MW, Merski M. The Effect of Mutations in the TPR and Ankyrin Families of Alpha Solenoid Repeat Proteins. FRONTIERS IN BIOINFORMATICS 2021; 1:696368. [PMID: 36303725 PMCID: PMC9581033 DOI: 10.3389/fbinf.2021.696368] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2021] [Accepted: 06/22/2021] [Indexed: 11/20/2022] Open
Abstract
Protein repeats are short, highly similar peptide motifs that occur several times within a single protein, for example the TPR and Ankyrin repeats. Understanding the role of mutation in these proteins is complicated by the competing facts that 1) the repeats are much more restricted to a set sequence than non-repeat proteins, so mutations should be harmful much more often because there are more residues that are heavily restricted due to the need of the sequence to repeat and 2) the symmetry of the repeats in allows the distribution of functional contributions over a number of residues so that sometimes no specific site is singularly responsible for function (unlike enzymatic active site catalytic residues). To address this issue, we review the effects of mutations in a number of natural repeat proteins from the tetratricopeptide and Ankyrin repeat families. We find that mutations are context dependent. Some mutations are indeed highly disruptive to the function of the protein repeats while mutations in identical positions in other repeats in the same protein have little to no effect on structure or function.
Collapse
Affiliation(s)
| | | | | | - Matthew Merski
- *Correspondence: Maria Wiktoria Górna, ; Matthew Merski,
| |
Collapse
|