1
Machulin AV, Deryusheva EI, Galzitskaya OV. Variation in base composition, structure-function relationships, and origins of structural repetition in bacterial rpsA gene. Biosystems 2024; 238:105196. [PMID: 38537772 DOI: 10.1016/j.biosystems.2024.105196] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/03/2023] [Revised: 03/22/2024] [Accepted: 03/22/2024] [Indexed: 04/12/2024]
Abstract
Protein domain repeats are known to arise due to tandem duplications of internal genes. However, the understanding of the underlying mechanisms of this process is incomplete. The goal of this work was to investigate the mechanism of repeat expansion by studying the sequences of 1324 rpsA genes of bacterial S1 ribosomal proteins containing different numbers of S1 structural domains. The rpsA gene encodes ribosomal S1 protein, which is essential for cell viability as it interacts with both mRNA and proteins. Gene ontology (GO) analysis of S1 domains in ribosomal S1 proteins revealed that bacterial protein sequences in S1 mainly have 3 types of molecular functions: RNA binding activity, nucleic acid activity, and ribosome structural component. Our results show that the maximum value of rpsA gene identity for full-length proteins was found for S1 proteins containing six structural domains (58%). Analysis of consensus sequences showed that parts of the rpsA gene encoding separate S1 domains do not have a strictly repetitive structure between groups containing different numbers of S1 domains. At the same time, gene regions encoding some conserved residues that form the RNA-binding site remain conserved. The detected phylogenetic similarity suggests that the proposed fold of the rpsA translation initiation region of Escherichia coli has functional value and is important for translational control of rpsA gene expression in other bacterial phyla, not only in Gammaproteobacteria.
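The abstract reports a gene identity value (58%) without stating how identity was computed. As a minimal sketch of one common convention, the Python function below computes percent identity between two already-aligned sequences, counting only columns where neither sequence has a gap. The function name `percent_identity`, the gap symbol, and the choice of denominator are assumptions made for illustration and are not taken from the paper.

```python
def percent_identity(seq_a: str, seq_b: str, gap: str = "-") -> float:
    """Percent identity of two pre-aligned sequences of equal length.

    Assumption: columns where either sequence has a gap are skipped,
    so the denominator is the number of gap-free aligned columns.
    Other conventions (e.g. dividing by alignment length) give
    different values for the same alignment.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for a, b in zip(seq_a, seq_b):
        if a == gap or b == gap:
            continue  # skip gapped columns
        compared += 1
        if a == b:
            matches += 1
    return 100.0 * matches / compared if compared else 0.0


if __name__ == "__main__":
    # Toy aligned fragments, not real rpsA sequences.
    print(percent_identity("ATGC", "ATGA"))
```

Because the result depends on the denominator convention, identity values from different studies are comparable only when the same convention is used.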
Affiliation(s)
- Andrey V Machulin
  - Skryabin Institute of Biochemistry and Physiology of Microorganisms, Russian Academy of Sciences, Federal Research Center "Pushchino Scientific Center for Biological Research of the Russian Academy of Sciences", 142290, Pushchino, Moscow Region, Russia
- Evgeniya I Deryusheva
  - Institute for Biological Instrumentation, Federal Research Center "Pushchino Scientific Center for Biological Research of the Russian Academy of Sciences", 142290, Pushchino, Moscow Region, Russia
- Oxana V Galzitskaya
  - Institute of Protein Research, Russian Academy of Sciences, 142290, Pushchino, Moscow Region, Russia
  - Institute of Theoretical and Experimental Biophysics, Russian Academy of Sciences, 142290, Pushchino, Moscow Region, Russia
2
Mac Donagh J, Marchesini A, Spiga A, Fallico MJ, Arrías PN, Monzon AM, Vagiona AC, Gonçalves-Kulik M, Mier P, Andrade-Navarro MA. Structured Tandem Repeats in Protein Interactions. Int J Mol Sci 2024; 25:2994. [PMID: 38474241 DOI: 10.3390/ijms25052994] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/09/2024] [Revised: 02/28/2024] [Accepted: 03/01/2024] [Indexed: 03/14/2024] Open
Abstract
Tandem repeats (TRs) in protein sequences are consecutive, highly similar sequence motifs. Some types of TRs fold into structural units that pack together in ensembles, forming either an (open) elongated domain or a (closed) propeller, where the last unit of the ensemble packs against the first one. Here, we examine TR proteins (TRPs) to see how their sequence, structure, and evolutionary properties favor them for a function as mediators of protein interactions. Our observations suggest that TRPs bind other proteins using large, structured surfaces like globular domains; in particular, open-structured TR ensembles are favored by flexible termini and the possibility to tightly coil against their targets. While, intuitively, open ensembles of TRs seem prone to evolve due to their potential to accommodate insertions and deletions of units, these evolutionary events are unexpectedly rare, suggesting that they are advantageous for the emergence of the ancestral sequence but are early fixed. We hypothesize that their flexibility makes it easier for further proteins to adapt to interact with them, which would explain their large number of protein interactions. We provide insight into the properties of open TR ensembles, which make them scaffolds for alternative protein complexes to organize genes, RNA and proteins.
Affiliation(s)
- Juan Mac Donagh
  - Science and Technology Department, National University of Quilmes, Bernal B1876, Argentina
  - National Scientific and Technical Research Council (CONICET), Buenos Aires C1033AAJ, Argentina
- Abril Marchesini
  - National Scientific and Technical Research Council (CONICET), Buenos Aires C1033AAJ, Argentina
  - Biotechnology and Molecular Biology Institute (IBBM, UNLP-CONICET), Faculty of Exact Sciences, University of La Plata, La Plata 1900, Argentina
- Agostina Spiga
  - Science and Technology Department, National University of Quilmes, Bernal B1876, Argentina
  - National Scientific and Technical Research Council (CONICET), Buenos Aires C1033AAJ, Argentina
- Maximiliano José Fallico
  - Laboratory of Bioactive Compound Research and Development, Faculty of Exact Sciences, University of La Plata, La Plata 1900, Argentina
- Paula Nazarena Arrías
  - Department of Biomedical Sciences, University of Padova, Via U. Bassi 58/b, 35121 Padova, Italy
- Alexander Miguel Monzon
  - Department of Information Engineering, University of Padova, Via Giovanni Gradenigo 6/B, 35131 Padova, Italy
- Aimilia-Christina Vagiona
  - Institute of Organismic and Molecular Evolution, Faculty of Biology, Johannes Gutenberg University, Hanns-Dieter-Hüsch-Weg 15, 55128 Mainz, Germany
- Mariane Gonçalves-Kulik
  - Institute of Organismic and Molecular Evolution, Faculty of Biology, Johannes Gutenberg University, Hanns-Dieter-Hüsch-Weg 15, 55128 Mainz, Germany
- Pablo Mier
  - Institute of Organismic and Molecular Evolution, Faculty of Biology, Johannes Gutenberg University, Hanns-Dieter-Hüsch-Weg 15, 55128 Mainz, Germany
- Miguel A Andrade-Navarro
  - Institute of Organismic and Molecular Evolution, Faculty of Biology, Johannes Gutenberg University, Hanns-Dieter-Hüsch-Weg 15, 55128 Mainz, Germany
3
Monzon AM, Arrías PN, Elofsson A, Mier P, Andrade-Navarro MA, Bevilacqua M, Clementel D, Bateman A, Hirsh L, Fornasari MS, Parisi G, Piovesan D, Kajava AV, Tosatto SCE. A STRP-ed definition of Structured Tandem Repeats in Proteins. J Struct Biol 2023; 215:108023. [PMID: 37652396 DOI: 10.1016/j.jsb.2023.108023] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/29/2023] [Revised: 07/31/2023] [Accepted: 08/28/2023] [Indexed: 09/02/2023]
Abstract
Tandem Repeat Proteins (TRPs) are a class of proteins with repetitive amino acid sequences that have been studied extensively for over two decades. Different features at the level of sequence, structure, function and evolution have been attributed to them by various authors. Yet many of their salient features appear only when looking at specific subclasses of protein tandem repeats. Here, we attempt to rationalize the existing knowledge on TRPs by pointing out several dichotomies. The emerging picture is more nuanced than generally assumed and allows us to draw some boundaries of what is not a "proper" TRP. We conclude with an operational definition of a specific subset, which we have denominated STRPs (Structural Tandem Repeat Proteins), which separates a subclass of tandem repeats with distinctive features from several other less well-defined types of repeats. We believe that this definition will help researchers in the field to better characterize the biological meaning of this large yet largely understudied group of proteins.
Affiliation(s)
- Alexander Miguel Monzon
  - Dept. of Information Engineering, University of Padova, via Giovanni Gradenigo 6/B, 35131 Padova, Italy
- Paula Nazarena Arrías
  - Dept. of Biomedical Sciences, University of Padova, via U. Bassi 58/b, 35121 Padova, Italy
- Arne Elofsson
  - Dept. of Biochemistry and Biophysics and Science for Life Laboratory, Stockholm University, Tomtebodavägen 23, 171 21 Solna, Sweden
- Pablo Mier
  - Institute of Organismic and Molecular Evolution, Faculty of Biology, Johannes Gutenberg University of Mainz, Hanns-Dieter-Hüsch-Weg 15, 55128 Mainz, Germany
- Miguel A Andrade-Navarro
  - Institute of Organismic and Molecular Evolution, Faculty of Biology, Johannes Gutenberg University of Mainz, Hanns-Dieter-Hüsch-Weg 15, 55128 Mainz, Germany
- Martina Bevilacqua
  - Dept. of Biomedical Sciences, University of Padova, via U. Bassi 58/b, 35121 Padova, Italy
- Damiano Clementel
  - Dept. of Biomedical Sciences, University of Padova, via U. Bassi 58/b, 35121 Padova, Italy
- Alex Bateman
  - European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
- Layla Hirsh
  - Dept. of Engineering, Faculty of Science and Engineering, Pontifical Catholic University of Peru, Av. Universitaria 1801 San Miguel, Lima 32, Lima, Peru
- Maria Silvina Fornasari
  - Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, CONICET, Bernal, Buenos Aires, Argentina
- Gustavo Parisi
  - Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, CONICET, Bernal, Buenos Aires, Argentina
- Damiano Piovesan
  - Dept. of Biomedical Sciences, University of Padova, via U. Bassi 58/b, 35121 Padova, Italy
- Andrey V Kajava
  - Centre de Recherche en Biologie cellulaire de Montpellier (CRBM), UMR 5237 CNRS, Université Montpellier, 1919 Route de Mende, Cedex 5, 34293 Montpellier, France
- Silvio C E Tosatto
  - Dept. of Biomedical Sciences, University of Padova, via U. Bassi 58/b, 35121 Padova, Italy
4
Deryusheva EI, Machulin AV, Galzitskaya OV. Diversity and features of proteins with structural repeats. Biophys Rev 2023; 15:1159-1169. [PMID: 37974986 PMCID: PMC10643770 DOI: 10.1007/s12551-023-01130-0] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Received: 07/13/2023] [Accepted: 08/28/2023] [Indexed: 11/19/2023] Open
Abstract
The review provides information on proteins with structural repeats, including their classification, characteristics, functions, and relevance in disease development. It explores methods for identifying structural repeats and specialized databases. The review also highlights the potential use of repeat proteins as drug design scaffolds and discusses their evolutionary mechanisms.
Affiliation(s)
- Evgeniya I. Deryusheva
  - Institute for Biological Instrumentation, Federal Research Center “Pushchino Scientific Center for Biological Research of the Russian Academy of Sciences”, Pushchino, Russia
- Andrey V. Machulin
  - Skryabin Institute of Biochemistry and Physiology of Microorganisms, Federal Research Center “Pushchino Scientific Center for Biological Research of the Russian Academy of Sciences”, Pushchino, Russia
- Oxana V. Galzitskaya
  - Institute of Protein Research of the Russian Academy of Sciences, Pushchino, Russia
  - Institute of Theoretical and Experimental Biophysics of the Russian Academy of Sciences, Pushchino, Russia
5
Szatkownik A, Zea DJ, Richard H, Laine E. Building alternative splicing and evolution-aware sequence-structure maps for protein repeats. J Struct Biol 2023; 215:107997. [PMID: 37453591 DOI: 10.1016/j.jsb.2023.107997] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/29/2023] [Revised: 06/15/2023] [Accepted: 07/05/2023] [Indexed: 07/18/2023]
Abstract
Alternative splicing of repeats in proteins provides a mechanism for rewiring and fine-tuning protein interaction networks. In this work, we developed a robust and versatile method, ASPRING, to identify alternatively spliced protein repeats from gene annotations. ASPRING leverages evolutionarily meaningful alternative splicing-aware hierarchical graphs to provide maps between protein repeat sequences and 3D structures. We re-think the definition of repeats by explicitly accounting for transcript diversity across several genes/species. Using a stringent sequence-based similarity criterion, we detected over 5,000 evolutionarily conserved repeats by screening virtually all human protein-coding genes and their orthologs across a dozen species. Through a joint analysis of their sequences and structures, we extracted specificity-determining sequence signatures and assessed their implication in experimentally resolved and modelled protein interactions. Our findings demonstrate the widespread alternative usage of protein repeats in modulating protein interactions and open avenues for targeting repeat-mediated interactions.
Affiliation(s)
- Antoine Szatkownik
  - Sorbonne Université, CNRS, IBPS, Laboratoire de Biologie Computationnelle et Quantitative (LCQB), 75005 Paris, France
  - Bioinformatics Unit, Genome Competence Center (MF1), Robert Koch Institute, 13353 Berlin, Germany
- Diego Javier Zea
  - Université Paris-Saclay, CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC), 91198 Gif-sur-Yvette, France
- Hugues Richard
  - Sorbonne Université, CNRS, IBPS, Laboratoire de Biologie Computationnelle et Quantitative (LCQB), 75005 Paris, France
  - Bioinformatics Unit, Genome Competence Center (MF1), Robert Koch Institute, 13353 Berlin, Germany
- Elodie Laine
  - Sorbonne Université, CNRS, IBPS, Laboratoire de Biologie Computationnelle et Quantitative (LCQB), 75005 Paris, France
6
Li P, Li W, Zhou X, Situ J, Xie L, Xi P, Yang B, Kong G, Jiang Z. Peronophythora litchii RXLR effector P. litchii avirulence homolog 202 destabilizes a host ethylene biosynthesis enzyme. Plant Physiol 2023; 193:756-774. [PMID: 37232407 DOI: 10.1093/plphys/kiad311] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Received: 01/23/2023] [Accepted: 02/24/2023] [Indexed: 05/27/2023]
Abstract
Oomycete pathogens can secrete hundreds of effectors into plant cells to interfere with the plant immune system during infection. Here, we identified an Arg-X-Leu-Arg (RXLR) effector protein from the most destructive pathogen of litchi (Litchi chinensis Sonn.), Peronophythora litchii, and named it P. litchii avirulence homolog 202 (PlAvh202). PlAvh202 could suppress cell death triggered by infestin 1 or avirulence protein 3a/resistance protein 3a in Nicotiana benthamiana and was essential for P. litchii virulence. In addition, PlAvh202 suppressed plant immune responses and promoted the susceptibility of N. benthamiana to Phytophthora capsici. Further research revealed that PlAvh202 could suppress ethylene (ET) production by targeting and destabilizing plant S-adenosyl-L-methionine synthetase (SAMS), a key enzyme in the ET biosynthesis pathway, in a 26S proteasome-dependent manner without affecting its expression. Transient expression of LcSAMS3 induced ET production and enhanced plant resistance, whereas inhibition of ET biosynthesis promoted P. litchii infection, supporting that litchi SAMS (LcSAMS) and ET positively regulate litchi immunity toward P. litchii. Overall, these findings highlight that SAMS can be targeted by the oomycete RXLR effector to manipulate ET-mediated plant immunity.
Affiliation(s)
- Peng Li
  - Guangdong Key Laboratory of Microbial Signals and Disease Control/Department of Plant Pathology, College of Plant Protection, South China Agricultural University, Guangzhou 510642, China
- Wen Li
  - Guangdong Key Laboratory of Microbial Signals and Disease Control/Department of Plant Pathology, College of Plant Protection, South China Agricultural University, Guangzhou 510642, China
- Xiaofan Zhou
  - Integrative Microbiology Research Centre, South China Agricultural University, Guangzhou 510642, China
- Junjian Situ
  - Guangdong Key Laboratory of Microbial Signals and Disease Control/Department of Plant Pathology, College of Plant Protection, South China Agricultural University, Guangzhou 510642, China
- Lizhu Xie
  - Guangdong Key Laboratory of Microbial Signals and Disease Control/Department of Plant Pathology, College of Plant Protection, South China Agricultural University, Guangzhou 510642, China
- Pinggen Xi
  - Guangdong Key Laboratory of Microbial Signals and Disease Control/Department of Plant Pathology, College of Plant Protection, South China Agricultural University, Guangzhou 510642, China
- Bo Yang
  - College of Grassland Science/Department of Plant Pathology, College of Plant Protection, Nanjing Agricultural University, Nanjing 210095, China
- Guanghui Kong
  - Guangdong Key Laboratory of Microbial Signals and Disease Control/Department of Plant Pathology, College of Plant Protection, South China Agricultural University, Guangzhou 510642, China
- Zide Jiang
  - Guangdong Key Laboratory of Microbial Signals and Disease Control/Department of Plant Pathology, College of Plant Protection, South China Agricultural University, Guangzhou 510642, China
7
Mohri M, Moghadam A, Burketova L, Ryšánek P. Genome-wide identification of the opsin protein in Leptosphaeria maculans and comparison with other fungi (pathogens of Brassica napus). Front Microbiol 2023; 14:1193892. [PMID: 37692395 PMCID: PMC10485269 DOI: 10.3389/fmicb.2023.1193892] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/25/2023] [Accepted: 06/28/2023] [Indexed: 09/12/2023] Open
Abstract
The largest family of transmembrane receptors are G-protein-coupled receptors (GPCRs). These receptors respond to perceived environmental signals and infect their host plants. Family A of the GPCR includes opsin. However, there is little known about the roles of GPCRs in phytopathogenic fungi. We studied opsin in Leptosphaeria maculans, an important pathogen of oilseed rape (Brassica napus) that causes blackleg disease, and compared it with six other fungal pathogens of oilseed rape. A phylogenetic tree analysis of 31 isoforms of the opsin protein showed six major groups and six subgroups. All three opsin isoforms of L. maculans are grouped in the same clade in the phylogenetic tree. Physicochemical analysis revealed that all studied opsin proteins are stable and hydrophobic. Subcellular localization revealed that most isoforms were localized in the endoplasmic reticulum membrane except for several isoforms in Verticillium species, which were localized in the mitochondrial membrane. Most isoforms comprise two conserved domains. One conserved motif was observed across all isoforms, consisting of the BACTERIAL_OPSIN_1 domain, which has been hypothesized to have an identical sensory function. Most studied isoforms showed seven transmembrane helices, except for one isoform of V. longisporum and four isoforms of Fusarium oxysporum. Tertiary structure prediction displayed a conformational change in four isoforms of F. oxysporum, which presumably leads to differences in binding to other proteins and sensing signals, thereby resulting in various pathogenicity strategies. Protein-protein interaction and binding site analyses showed varying numbers of ligands and pockets across all isoforms, ranging from 0 to 13 ligands and from 4 to 10 pockets.
Based on the phylogenetic analysis in this study and the considerable physicochemical and structural differences among the opsin proteins of all studied fungi, we hypothesize that this protein acts differently in the pathogenicity, growth, sporulation, and mating of these fungi.
Affiliation(s)
- Marzieh Mohri
  - Department of Plant Protection, Faculty of Agrobiology, Food, and Natural Resources, Czech University of Life Sciences, Prague, Czechia
- Ali Moghadam
  - Institute of Biotechnology, Shiraz University, Shiraz, Iran
- Lenka Burketova
  - Institute of Experimental Botany, Czech Academy of Sciences, Prague, Czechia
- Pavel Ryšánek
  - Department of Plant Protection, Faculty of Agrobiology, Food, and Natural Resources, Czech University of Life Sciences, Prague, Czechia
8
Choudhary P, Anyango S, Berrisford J, Tolchard J, Varadi M, Velankar S. Unified access to up-to-date residue-level annotations from UniProtKB and other biological databases for PDB data. Sci Data 2023; 10:204. [PMID: 37045837 PMCID: PMC10097656 DOI: 10.1038/s41597-023-02101-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/06/2022] [Accepted: 03/23/2023] [Indexed: 04/14/2023] Open
Abstract
More than 61,000 proteins have up-to-date correspondence between their amino acid sequence (UniProtKB) and their 3D structures (PDB), enabled by the Structure Integration with Function, Taxonomy and Sequences (SIFTS) resource. SIFTS incorporates residue-level annotations from many other biological resources. SIFTS data are available in formats such as XML, CSV and TSV and are also accessible via the PDBe REST API, but have always been maintained separately from the structure data (PDBx/mmCIF file) in the PDB archive. Here, we extended the wwPDB PDBx/mmCIF data dictionary with additional categories to accommodate SIFTS data and added the UniProtKB, Pfam, SCOP2, and CATH residue-level annotations directly into the PDBx/mmCIF files from the PDB archive. With the integrated UniProtKB annotations, these files now provide consistent numbering of residues in different PDB entries, allowing easy comparison of structure models. The extended dictionary yields a more consistent, standardised metadata description without altering the core PDB information. This development enables up-to-date cross-reference information at the residue level, resulting in better data interoperability and supporting improved data analysis and visualisation.
Grants
- BB/V004247/1, PI:Sameer Velankar RCUK | Biotechnology and Biological Sciences Research Council (BBSRC)
- DBI-2019297, PI: S.K. Burley National Science Foundation (NSF)
- DBI-2019297, PI: S.K. Burley NSF | National Science Board (NSB)
Affiliation(s)
- Preeti Choudhary
  - Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
- Stephen Anyango
  - Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
- John Berrisford
  - Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
  - AstraZeneca, Biomedical Campus, 1 Francis Crick Ave, Trumpington, Cambridge, CB2 0AA, UK
- James Tolchard
  - Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
  - Claude Bernard University, Villeurbanne, Lyon, 69100, France
- Mihaly Varadi
  - Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
- Sameer Velankar
  - Protein Data Bank in Europe, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
9
Gualandi N, Fracarossi D, Riommi D, Sollitto M, Greco S, Mardirossian M, Pacor S, Hori T, Pallavicini A, Gerdol M. Unveiling the Impact of Gene Presence/Absence Variation in Driving Inter-Individual Sequence Diversity within the CRP-I Gene Family in Mytilus spp. Genes (Basel) 2023; 14:787. [PMID: 37107545 PMCID: PMC10138031 DOI: 10.3390/genes14040787] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Received: 02/15/2023] [Revised: 03/14/2023] [Accepted: 03/22/2023] [Indexed: 03/29/2023] Open
Abstract
Mussels (Mytilus spp.) tolerate infections much better than other species living in the same marine coastal environment thanks to a highly efficient innate immune system, which exploits a remarkable diversification of effector molecules involved in mucosal and humoral responses. Among these, antimicrobial peptides (AMPs) are subjected to massive gene presence/absence variation (PAV), endowing each individual with a potentially unique repertoire of defense molecules. The unavailability of a chromosome-scale assembly has so far prevented a comprehensive evaluation of the genomic arrangement of AMP-encoding loci, preventing an accurate ascertainment of the orthology/paralogy relationships among sequence variants. Here, we characterized the CRP-I gene cluster in the blue mussel Mytilus edulis, which includes about 50 paralogous genes and pseudogenes, mostly packed in a small genomic region within chromosome 5. We further reported the occurrence of widespread PAV within this family in the Mytilus species complex and provided evidence that CRP-I peptides likely adopt a knottin fold. We functionally characterized the synthetic peptide sCRP-I H1, assessing the presence of biological activities consistent with other knottins, revealing that mussel CRP-I peptides are unlikely to act as antimicrobial agents or protease inhibitors, even though they may be used as defense molecules against infections from eukaryotic parasites.
Affiliation(s)
- Nicolò Gualandi
  - Area of Neuroscience, International School for Advanced Studies, 34136 Trieste, Italy
  - Department of Life Sciences, University of Trieste, 34127 Trieste, Italy
- Davide Fracarossi
  - Department of Life Sciences, University of Trieste, 34127 Trieste, Italy
- Damiano Riommi
  - Department of Life Sciences, University of Trieste, 34127 Trieste, Italy
- Marco Sollitto
  - Department of Life Sciences, University of Trieste, 34127 Trieste, Italy
  - Faculty of Mathematics, Natural Sciences and Information Technologies, University of Primorska, 6000 Koper, Slovenia
- Samuele Greco
  - Department of Life Sciences, University of Trieste, 34127 Trieste, Italy
- Mario Mardirossian
  - Department of Life Sciences, University of Trieste, 34127 Trieste, Italy
- Sabrina Pacor
  - Department of Life Sciences, University of Trieste, 34127 Trieste, Italy
- Tiago Hori
  - Atlantic Aqua Farms Ltd., Vernon Bridge, PE C0A 2E0, Canada
- Alberto Pallavicini
  - Department of Life Sciences, University of Trieste, 34127 Trieste, Italy
  - Anton Dohrn Zoological Station, 80121 Naples, Italy
- Marco Gerdol
  - Department of Life Sciences, University of Trieste, 34127 Trieste, Italy
10
Abstract
Mechanisms of emergence and divergence of protein folds pose central questions in biological sciences. Incremental mutation and stepwise adaptation explain relationships between topologically similar protein folds. However, the universe of folds is diverse and riotous, suggesting more potent and creative forces are at play. Sequence and structure similarity are observed between distinct folds, indicating that proteins with distinct folds may share common ancestry. We found evidence of common ancestry between three distinct β-barrel folds: Src kinase family homology (SH3), oligonucleotide/oligosaccharide-binding (OB), and cradle loop barrel (CLB). The data suggest a mechanism of fold evolution that interconverts SH3, OB, and CLB. This mechanism, which we call creative destruction, can be generalized to explain many examples of fold evolution including circular permutation. In creative destruction, an open reading frame duplicates or otherwise merges with another to produce a fused polypeptide. A merger forces two ancestral domains into a new sequence and spatial context. The fused polypeptide can explore folding landscapes that are inaccessible to either of the independent ancestral domains. However, the folding landscapes of the fused polypeptide are not fully independent of those of the ancestral domains. Creative destruction is thus partially conservative; a daughter fold inherits some motifs from ancestral folds. After merger and refolding, adaptive processes such as mutation and loss of extraneous segments optimize the new daughter fold. This model has application in disease states characterized by genetic instability. Fused proteins observed in cancer cells are likely to experience remodeled folding landscapes and realize altered folds, conferring new or altered functions.
11
Rapid molecular diversification and homogenization of clustered major ampullate silk genes in Argiope garden spiders. PLoS Genet 2022; 18:e1010537. [PMID: 36508456 PMCID: PMC9779670 DOI: 10.1371/journal.pgen.1010537] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Received: 07/14/2022] [Revised: 12/22/2022] [Accepted: 11/18/2022] [Indexed: 12/14/2022] Open
Abstract
The evolutionary diversification of orb-web weaving spiders is closely tied to the mechanical performance of dragline silk. This proteinaceous fiber provides the primary structural framework of orb web architecture, and its extraordinary toughness allows these structures to absorb the high energy of aerial prey impact. The dominant model of dragline silk molecular structure involves the combined function of two highly repetitive, spider-specific silk genes (spidroins): MaSp1 and MaSp2. Recent genomic studies, however, have suggested this framework is overly simplistic, and our understanding of how MaSp genes evolve is limited. Here we present a comprehensive analysis of MaSp structural and evolutionary diversity across species of Argiope (garden spiders). This genomic analysis reveals the largest catalog of MaSp genes found in any spider, driven largely by an expansion of MaSp2 genes. The rapid diversification of Argiope MaSp genes, located primarily in a single genomic cluster, is associated with profound changes in silk gene structure. MaSp2 genes, in particular, have evolved complex hierarchically organized repeat units (ensemble repeats) delineated by novel introns that exhibit remarkable evolutionary dynamics. These repetitive introns have arisen independently within the genus, are highly homogenized within a gene, but diverge rapidly between genes. In some cases, these iterated introns are organized in an alternating structure in which every other intron is nearly identical in sequence. We hypothesize that this intron structure has evolved to facilitate homogenization of the coding sequence. We also find evidence of intergenic gene conversion and identify a more diverse array of stereotypical amino acid repeats than previously recognized.
Overall, the extreme diversification found among MaSp genes requires changes in the structure-function model of dragline silk performance that focuses on the differential use and interaction among various MaSp paralogs as well as the impact of ensemble repeat structure and different amino acid motifs on mechanical behavior.
12
Cui X, Xue Y, McCormack C, Garces A, Rachman TW, Yi Y, Stolzer M, Durand D. Simulating domain architecture evolution. Bioinformatics 2022; 38:i134-i142. [PMID: 35758772 PMCID: PMC9236583 DOI: 10.1093/bioinformatics/btac242] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/16/2022] Open
Abstract
Motivation Simulation is an essential technique for generating biomolecular data with a ‘known’ history for use in validating phylogenetic inference and other evolutionary methods. On longer time scales, simulation supports investigations of equilibrium behavior and provides a formal framework for testing competing evolutionary hypotheses. Twenty years of molecular evolution research have produced a rich repertoire of simulation methods. However, current models do not capture the stringent constraints acting on the domain insertions, duplications, and deletions by which multidomain architectures evolve. Although these processes have the potential to generate any combination of domains, only a tiny fraction of possible domain combinations are observed in nature. Modeling these stringent constraints on domain order and co-occurrence is a fundamental challenge in domain architecture simulation that does not arise with sequence and gene family simulation. Results Here, we introduce a stochastic model of domain architecture evolution to simulate evolutionary trajectories that reflect the constraints on domain order and co-occurrence observed in nature. This framework is implemented in a novel domain architecture simulator, DomArchov, using the Metropolis–Hastings algorithm with data-driven transition probabilities. The use of a data-driven event module enables quick and easy redeployment of the simulator for use in different taxonomic and protein function contexts. Using empirical evaluation with metazoan datasets, we demonstrate that domain architectures simulated by DomArchov recapitulate properties of genuine domain architectures that reflect the constraints on domain order and adjacency seen in nature. This work expands the realm of evolutionary processes that are amenable to simulation. Availability and implementation DomArchov is written in Python 3 and is available at http://www.cs.cmu.edu/~durand/DomArchov. 
The data underlying this article are available via the same link. Supplementary information Supplementary data are available at Bioinformatics online.
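The Metropolis–Hastings idea named in this abstract can be illustrated with a toy sampler over domain architectures. The sketch below is not DomArchov and does not use its model or data: the domain vocabulary, the adjacency weights, the maximum length, and all function names are hypothetical, chosen only to show how insertion and deletion proposals can be accepted or rejected against a target that favors some domain adjacencies over others.

```python
import math
import random

# Hypothetical domain vocabulary and adjacency weights (illustration only).
DOMAINS = ["SH3", "SH2", "Kinase"]
PAIR_WEIGHT = {("SH3", "SH2"): 5.0, ("SH2", "Kinase"): 5.0}
DEFAULT_WEIGHT = 0.2  # weight for any ordered pair not listed above
MAX_LEN = 6           # hypothetical cap on architecture length


def log_score(arch):
    """Unnormalised log target: sum of log weights over adjacent ordered pairs."""
    return sum(
        math.log(PAIR_WEIGHT.get(pair, DEFAULT_WEIGHT))
        for pair in zip(arch, arch[1:])
    )


def mh_step(arch, rng):
    """One Metropolis-Hastings step with insertion/deletion proposals."""
    n = len(arch)
    if rng.random() < 0.5:
        # Propose inserting a uniformly chosen domain at a uniformly chosen slot.
        if n >= MAX_LEN:
            return arch  # invalid proposal: stay in place
        pos = rng.randrange(n + 1)
        proposal = arch[:pos] + (rng.choice(DOMAINS),) + arch[pos:]
        # Reverse move (deletion) does not pick a domain, so q(y->x)/q(x->y) = |DOMAINS|.
        log_hastings = math.log(len(DOMAINS))
    else:
        # Propose deleting a uniformly chosen position.
        if n <= 1:
            return arch  # keep at least one domain
        pos = rng.randrange(n)
        proposal = arch[:pos] + arch[pos + 1:]
        log_hastings = -math.log(len(DOMAINS))
    log_alpha = log_score(proposal) - log_score(arch) + log_hastings
    if rng.random() < math.exp(min(0.0, log_alpha)):
        return proposal
    return arch


def simulate(start, steps, seed=0):
    """Return the list of architectures visited, including the starting one."""
    rng = random.Random(seed)
    arch = tuple(start)
    trajectory = [arch]
    for _ in range(steps):
        arch = mh_step(arch, rng)
        trajectory.append(arch)
    return trajectory
```

Calling `simulate(["SH3"], 1000)` returns a trajectory of tuples. In a data-driven setting the hand-written `PAIR_WEIGHT` table would presumably be replaced by quantities estimated from observed architectures, which is outside the scope of this sketch.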
Affiliation(s)
- Xiaoyue Cui
  - Computational Biology, Carnegie Mellon University, Pittsburgh, PA 15213, USA
- Yifan Xue
  - Computational Biology, Carnegie Mellon University, Pittsburgh, PA 15213, USA; Department of Biological Sciences, Carnegie Mellon University, Pittsburgh, PA 15213, USA
- Collin McCormack
  - Computational Biology, Carnegie Mellon University, Pittsburgh, PA 15213, USA; Department of Biological Sciences, Carnegie Mellon University, Pittsburgh, PA 15213, USA
- Alejandro Garces
  - Department of Biological Sciences, Carnegie Mellon University, Pittsburgh, PA 15213, USA
- Thomas W Rachman
  - Computational Biology, Carnegie Mellon University, Pittsburgh, PA 15213, USA
- Yang Yi
  - Computational Biology, Carnegie Mellon University, Pittsburgh, PA 15213, USA; Department of Biological Sciences, Carnegie Mellon University, Pittsburgh, PA 15213, USA
- Maureen Stolzer
  - Department of Biological Sciences, Carnegie Mellon University, Pittsburgh, PA 15213, USA
- Dannie Durand
  - Department of Biological Sciences, Carnegie Mellon University, Pittsburgh, PA 15213, USA
13
Rivera AM, Swanson WJ. The Importance of Gene Duplication and Domain Repeat Expansion for the Function and Evolution of Fertilization Proteins. Front Cell Dev Biol 2022; 10:827454. [PMID: 35155436 PMCID: PMC8830517 DOI: 10.3389/fcell.2022.827454] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Received: 12/02/2021] [Accepted: 01/12/2022] [Indexed: 11/13/2022] Open
Abstract
The process of gene duplication followed by gene loss or evolution of new functions has been studied extensively, yet the role gene duplication plays in the function and evolution of fertilization proteins is underappreciated. Gene duplication is observed in many fertilization protein families including Izumo, DCST, ZP, and the TFP superfamily. Molecules mediating fertilization are part of larger gene families expressed in a variety of tissues, but gene duplication followed by structural modifications has often facilitated their cooption into a fertilization function. Repeat expansions of functional domains within a gene also provide opportunities for the evolution of novel fertilization proteins. ZP proteins with domain repeat expansions are linked to species-specificity in fertilization, and TFP proteins that experienced domain duplications were coopted into a novel sperm function. This review outlines the importance of gene duplications and repeat domain expansions in the evolution of fertilization proteins.
Affiliation(s)
- Alberto M. Rivera
  - Department of Genome Sciences, University of Washington, Seattle, WA, United States
14
Levine TP. Sequence Analysis and Structural Predictions of Lipid Transfer Bridges in the Repeating Beta Groove (RBG) Superfamily Reveal Past and Present Domain Variations Affecting Form, Function and Interactions of VPS13, ATG2, SHIP164, Hobbit and Tweek. Contact (Thousand Oaks) 2022; 5:251525642211343. [PMID: 36571082 PMCID: PMC7613979 DOI: 10.1177/25152564221134328] [Citation(s) in RCA: 16] [Impact Index Per Article: 8.0] [Indexed: 11/23/2022]
Abstract
Lipid transfer between organelles requires proteins that shield the hydrophobic portions of lipids as they cross the cytoplasm. In the last decade a new structural form of lipid transfer protein (LTP) has been found: long hydrophobic grooves made of beta-sheet that bridge between organelles at membrane contact sites. Eukaryotes have five families of bridge-like LTPs: VPS13, ATG2, SHIP164, Hobbit and Tweek. These are unified into a single superfamily through their bridges being composed of just one domain, called the repeating beta groove (RBG) domain, which builds into rod-shaped multimers with a hydrophobic-lined groove and hydrophilic exterior. Here, sequences and predicted structures of the RBG superfamily were analyzed in depth. Phylogenetics showed that the last eukaryotic common ancestor contained all five RBG proteins, with duplicated VPS13s. The current set of long RBG proteins appears to have arisen in even earlier ancestors from shorter forms with 4 RBG domains. The extreme ends of most RBG proteins have amphipathic helices that might be an adaptation for direct or indirect bilayer interaction, although this has yet to be tested. The one exception to this is the C-terminus of SHIP164, which instead has a coiled-coil. Finally, the exterior surfaces of the RBG bridges are shown to have conserved residues along most of their length, indicating sites for partner interactions, almost all of which are unknown. These findings can inform future cell biological and biochemical experiments.
15
Deryusheva EI, Machulin AV, Galzitskaya OV. Structural, Functional, and Evolutionary Characteristics of Proteins with Repeats. Mol Biol 2021. [DOI: 10.1134/s0026893321040038] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/22/2022]
16
Conserved Structure and Evolution of DPF Domain of PHF10-The Specific Subunit of PBAF Chromatin Remodeling Complex. Int J Mol Sci 2021; 22:11134. [PMID: 34681795 PMCID: PMC8538644 DOI: 10.3390/ijms222011134] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Received: 09/14/2021] [Revised: 10/08/2021] [Accepted: 10/11/2021] [Indexed: 11/17/2022] Open
Abstract
Transcription activation factors and multisubunit coactivator complexes get recruited at specific chromatin sites via protein domains that recognize histone modifications. Single PHDs (plant homeodomains) interact with differentially modified H3 histone tails. Double PHD finger (DPF) domains possess a unique structure different from PHD and are found in six proteins: histone acetyltransferases MOZ and MORF; chromatin remodeling complex BAF (DPF1–3); and chromatin remodeling complex PBAF (PHF10). Among them, PHF10 stands out due to the DPF sequence, structure, and functions. PHF10 is ubiquitously expressed in developing and adult organisms as four isoforms differing in structure (the presence or absence of DPF) and transcription regulation functions. Despite the importance of the DPF domain of PHF10 for transcription activation, its structure remains undetermined. We performed homology modeling of the human PHF10 DPF domain and determined common and distinct features in structure and histone modifications recognition capabilities, which can affect PBAF complex chromatin recruitment. We also traced the evolution of DPF1–3 and PHF10 genes from unicellular to vertebrate organisms. The data reviewed suggest that the DPF domain of PHF10 plays an important role in SWI/SNF-dependent chromatin remodeling during transcription activation.
17
In Silico Analysis of Fatty Acid Desaturases Structures in Camelina sativa, and Functional Evaluation of Csafad7 and Csafad8 on Seed Oil Formation and Seed Morphology. Int J Mol Sci 2021; 22:10857. [PMID: 34639198 PMCID: PMC8532002 DOI: 10.3390/ijms221910857] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Received: 09/14/2021] [Revised: 10/01/2021] [Accepted: 10/05/2021] [Indexed: 12/19/2022] Open
Abstract
Fatty acid desaturases convert a single bond between two carbon atoms in a fatty acid chain into a double bond, producing an unsaturated bond between the two carbons. They are classified into soluble and membrane-bound desaturases, according to their structure, subcellular location, and function. The orthologous genes in Camelina sativa were identified and analyzed, and a total of 62 desaturase genes were identified. It was revealed that they had the common fatty acid desaturase domain, which has evolved separately, and that proteins of the same family shared the same ancestry. A mix of conserved, gained, or lost intron structure was evident. In addition, conserved histidine motifs were found in each family, and transmembrane domains were found exclusively in the membrane-bound desaturases. The expression profile analysis of C. sativa desaturases revealed an increase in young leaves, seeds, and flowers. The C. sativa ω3-fatty acid desaturases CsaFAD7 and CsaFAD8 were cloned, and subcellular localization analysis showed their location in the chloroplast. They were transferred into Arabidopsis thaliana to obtain transgenic lines. It was revealed that the ω3-fatty acid desaturase could increase the C18:3 level at the expense of C18:2, but decreases in oil content and seed weight, and wrinkled phenotypes, were observed in transgenic CsaFAD7 lines, while no significant change was observed in transgenic CsaFAD8 lines in comparison to the wild-type. These findings give insight into the characteristics of desaturase genes and could provide an excellent basis for further investigation for C. sativa improvement. Overexpression of ω3-fatty acid desaturases in seeds could be useful in genetic engineering strategies aimed at modifying the fatty acid composition of seed oil.
18
Aluru C, Singh M. Improved inference of tandem domain duplications. Bioinformatics 2021; 37:i133-i141. [PMID: 34252920 PMCID: PMC8275333 DOI: 10.1093/bioinformatics/btab329] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Accepted: 05/03/2021] [Indexed: 11/13/2022] Open
Abstract
MOTIVATION Protein domain duplications are a major contributor to the functional diversification of protein families. These duplications can occur one at a time through single domain duplications, or as tandem duplications where several consecutive domains are duplicated together as part of a single evolutionary event. Existing methods for inferring domain-level evolutionary events are based on reconciling domain trees with gene trees. While some formulations consider multiple domain duplications, they do not explicitly model tandem duplications; this leads to inaccurate inference of which domains duplicated together over the course of evolution. RESULTS Here, we introduce a reconciliation-based framework that considers the relative positions of domains within extant sequences. We use this information to uncover tandem domain duplications within the evolutionary history of these genes. We devise an integer linear programming approach that solves our problem exactly, and a heuristic approach that works well in practice. We perform extensive simulation studies to demonstrate that our approaches can accurately uncover single and tandem domain duplications, and additionally test our approach on a well-studied orthogroup where lineage-specific domain expansions exhibit varying and complex domain duplication patterns. AVAILABILITY AND IMPLEMENTATION Code is available on github at https://github.com/Singh-Lab/TandemDuplications. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Affiliation(s)
- Chaitanya Aluru
  - Department of Computer Science and Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ 08540, USA
- Mona Singh
  - Department of Computer Science and Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ 08540, USA
19
Calatayud S, Garcia-Risco M, Capdevila M, Cañestro C, Palacios Ò, Albalat R. Modular Evolution and Population Variability of Oikopleura dioica Metallothioneins. Front Cell Dev Biol 2021; 9:702688. [PMID: 34277643 PMCID: PMC8283569 DOI: 10.3389/fcell.2021.702688] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Received: 04/29/2021] [Accepted: 06/09/2021] [Indexed: 01/29/2023] Open
Abstract
The chordate Oikopleura dioica is probably the fastest-evolving metazoan reported so far, and thereby a suitable system in which to explore the limits of evolutionary processes. For this reason, and in order to gain new insights on the evolution of protein modularity, we have investigated the organization, function and evolution of multi-modular metallothionein (MT) proteins in O. dioica. MTs are a heterogeneous group of modular proteins defined by their cysteine (C)-rich domains, which confer the capacity of coordinating different transition metal ions. O. dioica has two MTs, a bi-modular OdiMT1 consisting of two domains (t-12C and 12C), and a multi-modular OdiMT2 with six t-12C/12C repeats. By means of mass spectrometry and spectroscopy of metal-protein complexes, we have shown that the 12C domain is able to autonomously bind four divalent metal ions, although the t-12C/12C pair (as it is found in OdiMT1) is the optimized unit for divalent metal binding. We have also shown a direct relationship between the number of t-12C/12C repeats and the metal-binding capacity of the MTs, which implies a stepwise mode of functional and structural evolution for OdiMT2. Finally, after analyzing four different O. dioica populations distributed worldwide, we have detected several OdiMT2 variants with changes in their number of t-12C/12C domain repeats. This finding reveals that the number of repeats fluctuates between current O. dioica populations, which provides a new perspective on the evolution of domain repeat proteins.
Affiliation(s)
- Sara Calatayud
  - Departament de Genètica, Microbiologia i Estadística, Facultat de Biologia, Institut de Recerca de la Biodiversitat (IRBio), Universitat de Barcelona, Barcelona, Spain
- Mario Garcia-Risco
  - Departament de Química, Facultat de Ciències, Universitat Autònoma de Barcelona, Cerdanyola del Vallès, Spain
- Mercè Capdevila
  - Departament de Química, Facultat de Ciències, Universitat Autònoma de Barcelona, Cerdanyola del Vallès, Spain
- Cristian Cañestro
  - Departament de Genètica, Microbiologia i Estadística, Facultat de Biologia, Institut de Recerca de la Biodiversitat (IRBio), Universitat de Barcelona, Barcelona, Spain
- Òscar Palacios
  - Departament de Química, Facultat de Ciències, Universitat Autònoma de Barcelona, Cerdanyola del Vallès, Spain
- Ricard Albalat
  - Departament de Genètica, Microbiologia i Estadística, Facultat de Biologia, Institut de Recerca de la Biodiversitat (IRBio), Universitat de Barcelona, Barcelona, Spain
20
Rani J, Chauhan C, Das De T, Kumari S, Sharma P, Tevatiya S, Patel K, Mishra AK, Pandey KC, Singh N, Dixit R. Hemocyte RNA-Seq analysis of Indian malarial vectors Anopheles stephensi and Anopheles culicifacies: From similarities to differences. Gene 2021; 798:145810. [PMID: 34224830 DOI: 10.1016/j.gene.2021.145810] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Received: 06/24/2020] [Revised: 06/26/2021] [Accepted: 06/30/2021] [Indexed: 02/05/2023]
Abstract
Anopheles stephensi and Anopheles culicifacies are dominant malarial vectors in urban and rural India, respectively. The two species differ significantly in their behavioral adaptation and immunity, but the genetic basis of these variations is still poorly understood. Here, we uncovered the genetic differences of immune blood cells that influence several immune-physiological responses. We generated, analyzed and compared the hemocyte RNA-Seq database of both mosquitoes. A total of 5,837,223,769 assembled bases collapsed into 7,595 and 3,791 transcripts, originating from hemocytes of laboratory-reared 3-4-day-old naïve (sugar-fed) mosquitoes, Anopheles stephensi and Anopheles culicifacies respectively. Comparative GO annotation analysis revealed that both mosquito hemocytes encode similar proteins. Furthermore, while An. stephensi hemocytes showed a higher percentage of immune transcripts encoding APHAG (Autophagy), IMD (Immune deficiency pathway), PRDX (Peroxiredoxin), SCR (Scavenger receptor), IAP (Inhibitor of apoptosis), GALE (galactoside binding lectins), BGBPs (1,3 beta D glucan binding proteins), CASPs (caspases) and SRRP (Small RNA regulatory pathway), An. culicifacies hemocytes yielded a relatively higher percentage of transcripts encoding CLIP (Clip domain serine protease), FREP (Fibrinogen related proteins), PPO (Prophenol oxidase), SRPN (Serpines), ML (Myeloid differentiation 2-related lipid recognition protein), Toll path and TEP (Thioester protein) family proteins. However, a detailed comparative Interproscan analysis showed An. stephensi mosquito hemocytes encode proteins with increased repeat numbers as compared to An. culicifacies. 
Notably, we observed an abundance of transcripts showing significant variability of encoded proteins with repeats such as LRR (Leucine rich repeat), WD40 (W-D dipeptide), Ankyrin, Annexin, Tetratricopeptide and Mitochondrial substrate carrier repeat-containing family proteins, which may have a direct influence on species-specific immune-physiological responses. Summarily, our deep sequencing analysis unraveled that An. stephensi evolved with an expansion of repeat sequences in hemocyte proteins as compared to An. culicifacies, possibly providing an advantage for better adaptation to diverse environments.
Affiliation(s)
- Jyoti Rani
  - Laboratory of Host-Parasite Interaction Studies, ICMR-National Institute of Malaria Research, Dwarka, New Delhi 110077, India; Department of Bio and Nanotechnology, Guru Jambheshwar University of Science and Technology, Hisar, Haryana, India
- Charu Chauhan
  - Laboratory of Host-Parasite Interaction Studies, ICMR-National Institute of Malaria Research, Dwarka, New Delhi 110077, India
- Tanwee Das De
  - Laboratory of Host-Parasite Interaction Studies, ICMR-National Institute of Malaria Research, Dwarka, New Delhi 110077, India
- Seena Kumari
  - Laboratory of Host-Parasite Interaction Studies, ICMR-National Institute of Malaria Research, Dwarka, New Delhi 110077, India
- Punita Sharma
  - Laboratory of Host-Parasite Interaction Studies, ICMR-National Institute of Malaria Research, Dwarka, New Delhi 110077, India
- Sanjay Tevatiya
  - Laboratory of Host-Parasite Interaction Studies, ICMR-National Institute of Malaria Research, Dwarka, New Delhi 110077, India
- Karan Patel
  - DNA Xperts Private Limited, Sector 63, Noida, Uttar Pradesh 20130, India
- Ashwani K Mishra
  - DNA Xperts Private Limited, Sector 63, Noida, Uttar Pradesh 20130, India
- Kailash C Pandey
  - Laboratory of Host-Parasite Interaction Studies, ICMR-National Institute of Malaria Research, Dwarka, New Delhi 110077, India
- Namita Singh
  - Department of Bio and Nanotechnology, Guru Jambheshwar University of Science and Technology, Hisar, Haryana, India
- Rajnikant Dixit
  - Laboratory of Host-Parasite Interaction Studies, ICMR-National Institute of Malaria Research, Dwarka, New Delhi 110077, India
21
Homopeptide and homocodon levels across fungi are coupled to GC/AT-bias and intrinsic disorder, with unique behaviours for some amino acids. Sci Rep 2021; 11:10025. [PMID: 33976321 PMCID: PMC8113271 DOI: 10.1038/s41598-021-89650-1] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Received: 11/30/2020] [Accepted: 04/22/2021] [Indexed: 11/09/2022] Open
Abstract
Homopeptides (runs of one amino-acid type) are evolutionarily important since they are prone to expand/contract during DNA replication, recombination and repair. To gain insight into the genomic/proteomic traits driving their variation, we analyzed how homopeptides and homocodons (which are pure codon repeats) vary across 405 Dikarya, and probed their linkage to genome GC/AT bias and other factors. We find that amino-acid homopeptide frequencies vary diversely between clades, with the AT-rich Saccharomycotina trending distinctly. As organisms evolve, homocodon and homopeptide numbers are majorly coupled to GC/AT-bias, exhibiting a bi-furcated correlation with degree of AT- or GC-bias. Mid-GC/AT genomes tend to have markedly fewer simply because they are mid-GC/AT. Despite these trends, homopeptides tend to be GC-biased relative to other parts of coding sequences, even in AT-rich organisms, indicating they absorb AT bias less or are inherently more GC-rich. The most frequent and most variable homopeptide amino acids favour intrinsic disorder, and there are an opposing correlation and anti-correlation versus homopeptide levels for intrinsic disorder and structured-domain content respectively. Specific homopeptides show unique behaviours that we suggest are linked to inherent slippage probabilities during DNA replication and recombination, such as poly-glutamine, which is an evolutionarily very variable homopeptide with a codon repertoire unbiased for GC/AT, and poly-lysine whose homocodons are overwhelmingly made from the codon AAG.
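The two definitions in this abstract (a homopeptide as a run of one amino-acid type, a homocodon as a pure codon repeat) can be made concrete with a small scanner. This is a minimal sketch, not the authors' pipeline: the function names and the minimum run length of 5 are assumptions, since the abstract does not state a threshold.

```python
import re


def find_homopeptides(protein, min_len=5):
    """Return (residue, start, length) for each run of a single amino acid.

    min_len is a hypothetical threshold; the abstract does not specify one.
    """
    pattern = re.compile(r"(.)\1{%d,}" % (min_len - 1))
    return [(m.group(1), m.start(), len(m.group(0))) for m in pattern.finditer(protein)]


def find_homocodons(cds, min_repeats=5):
    """Return (codon, nucleotide_start, repeats) for in-frame runs of one codon."""
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    codons = [cds[i:i + 3] for i in range(0, usable, 3)]
    runs = []
    i = 0
    while i < len(codons):
        j = i
        while j < len(codons) and codons[j] == codons[i]:
            j += 1
        if j - i >= min_repeats:
            runs.append((codons[i], 3 * i, j - i))
        i = j
    return runs
```

A poly-lysine tract encoded by mixed AAG and AAA codons would be reported by `find_homopeptides` on the translated sequence but not by `find_homocodons` on the coding sequence, which is the distinction the abstract draws between the two measures.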
22
Byerly CD, Patterson LL, McBride JW. Ehrlichia TRP effectors: moonlighting, mimicry and infection. Pathog Dis 2021; 79:6261440. [PMID: 33974702 PMCID: PMC8112483 DOI: 10.1093/femspd/ftab026] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Received: 02/23/2021] [Accepted: 04/29/2021] [Indexed: 12/24/2022] Open
Abstract
Intracellular bacteria have evolved various strategies to evade host defense mechanisms. Remarkably, the obligately intracellular bacterium, Ehrlichia chaffeensis, hijacks host cell processes of the mononuclear phagocyte to evade host defenses through mechanisms executed in part by tandem repeat protein (TRP) effectors secreted by the type 1 secretion system. In the past decade, TRP120 has emerged as a model moonlighting effector, acting as a ligand mimetic, nucleomodulin and ubiquitin ligase. These defined functions illuminate the diverse roles TRP120 plays in exploiting and manipulating host cell processes, including cytoskeletal organization, vesicle trafficking, cell signaling, transcriptional regulation, post-translational modifications, autophagy and apoptosis. This review will focus on TRP effectors and their expanding roles in infection and provide perspective on Ehrlichia chaffeensis as an invaluable model organism for understanding infection strategies of obligately intracellular bacteria.
Affiliation(s)
- Caitlan D Byerly
  - Departments of Pathology, University of Texas Medical Branch, Galveston, TX 77555, USA
- LaNisha L Patterson
  - Departments of Pathology, University of Texas Medical Branch, Galveston, TX 77555, USA
- Jere W McBride
  - Departments of Pathology, University of Texas Medical Branch, Galveston, TX 77555, USA; Microbiology and Immunology, University of Texas Medical Branch, Galveston, TX 77555, USA; Center for Biodefense and Emerging Infectious Diseases, University of Texas Medical Branch, Galveston, TX 77555, USA; Sealy Institute for Vaccine Sciences, University of Texas Medical Branch, Galveston, TX 77555, USA; Institute for Human Infections and Immunity, University of Texas Medical Branch, Galveston, TX 77555, USA
23
Accurate contact-based modelling of repeat proteins predicts the structure of new repeats protein families. PLoS Comput Biol 2021; 17:e1008798. [PMID: 33857128 PMCID: PMC8078820 DOI: 10.1371/journal.pcbi.1008798] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Received: 08/03/2020] [Revised: 04/27/2021] [Accepted: 02/15/2021] [Indexed: 12/18/2022] Open
Abstract
Repeat proteins are abundant in eukaryotic proteomes. They are involved in many eukaryotic specific functions, including signalling. For many of these proteins, the structure is not known, as they are difficult to crystallise. Today, using direct coupling analysis and deep learning it is often possible to predict a protein’s structure. However, the unique sequence features present in repeat proteins have made it challenging to use direct coupling analysis for predicting contacts. Here, we show that deep learning-based methods (trRosetta, DeepMetaPsicov (DMP) and PconsC4) overcome this problem and can predict intra- and inter-unit contacts in repeat proteins. In a benchmark dataset of 815 repeat proteins, about 90% can be correctly modelled. Further, among 48 PFAM families lacking a protein structure, we produce models of forty-one families with estimated high accuracy. Repeat proteins are widespread among organisms and particularly abundant in eukaryotic proteomes. Their primary sequence presents repetition in the amino acid sequences that gives rise to structures with repeated folds/domains. Although the repeated units often can be recognised from the sequence alone, structural information is often missing. Here, we used contact prediction for predicting the structure of repeat proteins directly from their primary sequences. We benchmark the methods on a dataset comprising all the known repeated structures. We evaluate the contact predictions and the obtained models for different classes of repeat proteins. Further, we develop and benchmark a quality assessment (QA) method specific for repeat proteins. Finally, we used the prediction pipeline for all PFAM repeat families without resolved structures and found that forty-one of them could be modelled with high accuracy.
24
Deryusheva E, Machulin A, Matyunin M, Galzitskaya O. Sequence and evolutionary analysis of bacterial ribosomal S1 proteins. Proteins 2021; 89:1111-1124. [PMID: 33843105 DOI: 10.1002/prot.26084] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Received: 11/18/2020] [Revised: 03/17/2021] [Accepted: 04/07/2021] [Indexed: 12/21/2022]
Abstract
The multi-domain bacterial S1 protein is the largest and most functionally important ribosomal protein of the 30S subunit, which interacts with both mRNA and proteins. The family of ribosomal S1 proteins differs in the classical sense from a protein with tandem repeats and has a "bead-on-string" organization, where each repeat is folded into a globular domain. Based on our recent data, the study of evolutionary relationships for the bacterial phyla will provide evidence for one of the proposed theories of the evolutionary development of proteins with structural repeats: from multiple repeats of assemblies to single repeats, or vice versa. In this comparative analysis of 1333 S1 sequences that were identified in 24 different phyla, we demonstrate how such phyla can form independently/dependently during evolution. To the best of our knowledge, this work is the first study of the evolutionary history of bacterial ribosomal S1 proteins. The collected and structured data can be useful to computational biologists as a resource for determining percent identity, amino acid composition and logo motifs, as well as dN/dS ratio in bacterial S1 proteins. The obtained research data indicate that bacterial ribosomal S1 proteins evolved from multiple assemblies to a single repeat. The presented data are integrated into the server, which can be accessed at http://oka.protres.ru:4200.
Affiliation(s)
- Evgeniya Deryusheva
  - Institute for Biological Instrumentation, Federal Research Center "Pushchino Scientific Center for Biological Research of the Russian Academy of Sciences", Pushchino, Russian Federation
- Andrey Machulin
  - Skryabin Institute of Biochemistry and Physiology of Microorganisms, Russian Academy of Sciences, Federal Research Center "Pushchino Scientific Center for Biological Research of the Russian Academy of Sciences", Pushchino, Russian Federation
- Maxim Matyunin
  - Institute of Protein Research, Russian Academy of Sciences, Pushchino, Russian Federation
- Oxana Galzitskaya
  - Institute of Protein Research, Russian Academy of Sciences, Pushchino, Russian Federation; Institute of Theoretical and Experimental Biophysics, Russian Academy of Sciences, Pushchino, Russian Federation
25
Sadat MA, Ullah MW, Bashar KK, Hossen QMM, Tareq MZ, Islam MS. Genome-wide identification of F-box proteins in Macrophomina phaseolina and comparison with other fungus. J Genet Eng Biotechnol 2021; 19:46. [PMID: 33761027 PMCID: PMC7991009 DOI: 10.1186/s43141-021-00143-0] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Received: 12/10/2020] [Accepted: 03/11/2021] [Indexed: 01/01/2023]
Abstract
Background: In fungi, as in other eukaryotes, protein turnover is an important cellular process for controlling various cellular functions. The ubiquitin-proteasome pathway degrades selected intracellular proteins, and F-box proteins are among the important components controlling protein degradation. F-box proteins are well studied in different model plants; however, their functions in fungi are not yet clear. This study aimed to identify the genes involved in protein degradation for disease development in the fungus Macrophomina phaseolina. Results: In silico studies were done to understand the distribution of F-box proteins in pathogenic fungi, including M. phaseolina. Genome-wide analysis indicates that M. phaseolina contains thirty-one F-box proteins throughout its chromosomes. In addition, 17, 37, 16, and 21 F-box proteins were identified in Puccinia graminis, Colletotrichum graminicola, Ustilago maydis, and Phytophthora infestans, respectively. Analyses revealed that the selected fungal genomes contain several additional functional domains along with the F-box domain. Sequence alignment showed amino acid substitutions in several F-box proteins; however, gene duplication was not found among these proteins. Phylogenetic analysis revealed that F-box proteins having similar functional domains were highly diverse from each other, suggesting the possibility of various functions. Analysis also found that MPH_00568 and MPH_05531 were closely related to the rice blast fungus F-box proteins MGG_00768 and MGG_13065, respectively, which may play an important role in blast disease development. Conclusion: This genome-wide analysis of F-box proteins will be useful for characterization of candidate F-box proteins to understand the molecular mechanisms leading to disease development of M. phaseolina in host plants.
Supplementary Information The online version contains supplementary material available at 10.1186/s43141-021-00143-0.
Affiliation(s)
- Md Abu Sadat
- Basic and Applied Research on Jute Project, Bangladesh Jute Research Institute, Manik Mia Avenue, Dhaka, 1207, Bangladesh.
| | - Md Wali Ullah
- Basic and Applied Research on Jute Project, Bangladesh Jute Research Institute, Manik Mia Avenue, Dhaka, 1207, Bangladesh
| | - Kazi Khayrul Bashar
- Basic and Applied Research on Jute Project, Bangladesh Jute Research Institute, Manik Mia Avenue, Dhaka, 1207, Bangladesh
| | - Quazi Md Mosaddeque Hossen
- Basic and Applied Research on Jute Project, Bangladesh Jute Research Institute, Manik Mia Avenue, Dhaka, 1207, Bangladesh
| | - Md Zablul Tareq
- Basic and Applied Research on Jute Project, Bangladesh Jute Research Institute, Manik Mia Avenue, Dhaka, 1207, Bangladesh
| | - Md Shahidul Islam
- Basic and Applied Research on Jute Project, Bangladesh Jute Research Institute, Manik Mia Avenue, Dhaka, 1207, Bangladesh
|
26
|
Aupič J, Strmšek Ž, Lapenta F, Pahovnik D, Pisanski T, Drobnak I, Ljubetič A, Jerala R. Designed folding pathway of modular coiled-coil-based proteins. Nat Commun 2021; 12:940. [PMID: 33574262 PMCID: PMC7878764 DOI: 10.1038/s41467-021-21185-5] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Received: 07/23/2020] [Accepted: 01/13/2021] [Indexed: 12/02/2022] Open
Abstract
Natural proteins are characterised by a complex folding pathway defined uniquely for each fold. Designed coiled-coil protein origami (CCPO) cages are distinct from natural compact proteins, since their fold is prescribed by discrete long-range interactions between orthogonal pairwise-interacting coiled-coil (CC) modules within a single polypeptide chain. Here, we demonstrate that CCPO proteins fold in a stepwise sequential pathway. Molecular dynamics simulations and stopped-flow Förster resonance energy transfer (FRET) measurements reveal that CCPO folding is dominated by the effective intra-chain distance between CC modules in the primary sequence and subsequent folding intermediates, allowing identical CC modules to be employed for multiple cage edges and thus relaxing CCPO cage design requirements. The number of orthogonal modules required for constructing a CCPO tetrahedron can be reduced from six to as few as three different CC modules. The stepwise modular nature of the folding pathway offers insights into the folding of tandem repeat proteins and can be exploited for the design of modular protein structures based on a given set of orthogonal modules.
Affiliation(s)
- Jana Aupič
- Department of Synthetic Biology and Immunology, National Institute of Chemistry, Ljubljana, Slovenia
| | - Žiga Strmšek
- Department of Synthetic Biology and Immunology, National Institute of Chemistry, Ljubljana, Slovenia
- Interdisciplinary Doctoral Programme in Biomedicine, University of Ljubljana, Ljubljana, Slovenia
| | - Fabio Lapenta
- Department of Synthetic Biology and Immunology, National Institute of Chemistry, Ljubljana, Slovenia
- EN-FIST Centre of Excellence, Ljubljana, Slovenia
| | - David Pahovnik
- Department of Polymer Chemistry and Technology, National Institute of Chemistry, Ljubljana, Slovenia
| | - Tomaž Pisanski
- FAMNIT, University of Primorska, Koper, Slovenia
- Institute of Mathematics, Physics and Mechanics, Ljubljana, Slovenia
| | - Igor Drobnak
- Department of Synthetic Biology and Immunology, National Institute of Chemistry, Ljubljana, Slovenia
| | - Ajasja Ljubetič
- Department of Synthetic Biology and Immunology, National Institute of Chemistry, Ljubljana, Slovenia
| | - Roman Jerala
- Department of Synthetic Biology and Immunology, National Institute of Chemistry, Ljubljana, Slovenia.
- EN-FIST Centre of Excellence, Ljubljana, Slovenia.
|
27
|
Ratu STN, Hirata A, Kalaw CO, Yasuda M, Tabuchi M, Okazaki S. Multiple Domains in the Rhizobial Type III Effector Bel2-5 Determine Symbiotic Efficiency With Soybean. Frontiers in Plant Science 2021; 12:689064. [PMID: 34163515 PMCID: PMC8215712 DOI: 10.3389/fpls.2021.689064] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Received: 03/31/2021] [Accepted: 05/10/2021] [Indexed: 05/06/2023]
Abstract
Bradyrhizobium elkanii utilizes the type III effector Bel2-5 for nodulation in host plants in the absence of Nod factors (NFs). In soybean plants carrying the Rj4 allele, however, Bel2-5 causes restriction of nodulation by triggering immune responses. Bel2-5 shows similarity with XopD of the phytopathogen Xanthomonas campestris pv. vesicatoria and possesses two internal repeat sequences, two ethylene (ET)-responsive element-binding factor-associated amphiphilic repression (EAR) motifs, a nuclear localization signal (NLS), and a ubiquitin-like protease (ULP) domain, which are all conserved in XopD except for the repeat domains. By mutational analysis, we revealed that most of the putative domains/motifs in Bel2-5 were essential for both NF-independent nodulation and nodulation restriction in Rj4 soybean. The expression of soybean symbiosis- and defense-related genes was also significantly altered by inoculation with the bel2-5 domain/motif mutants compared with the expression upon inoculation with wild-type B. elkanii, which was mostly consistent with the phenotypic changes of nodulation in host plants. Notably, the functionality of Bel2-5 was mostly correlated with the growth inhibition effect of Bel2-5 expressed in yeast cells. The nodulation phenotypes of the domain-swapped mutants of Bel2-5 and XopD indicated that both the C-terminal ULP domain and upstream region are required for the Bel2-5-dependent nodulation phenotypes. These results suggest that Bel2-5 interacts with and modifies host targets via these multiple domains to execute both NF-independent symbiosis and nodulation restriction in Rj4 soybean.
Affiliation(s)
- Safirah Tasa Nerves Ratu
- United Graduate School of Agricultural Science, Tokyo University of Agriculture and Technology, Fuchu, Japan
| | - Atsushi Hirata
- Department of Applied Biological Science, Faculty of Agriculture, Kagawa University, Kagawa, Japan
| | - Christian Oliver Kalaw
- Graduate School of Agriculture, Tokyo University of Agriculture and Technology, Fuchu, Japan
| | - Michiko Yasuda
- Graduate School of Agriculture, Tokyo University of Agriculture and Technology, Fuchu, Japan
| | - Mitsuaki Tabuchi
- Department of Applied Biological Science, Faculty of Agriculture, Kagawa University, Kagawa, Japan
| | - Shin Okazaki
- United Graduate School of Agricultural Science, Tokyo University of Agriculture and Technology, Fuchu, Japan
- Graduate School of Agriculture, Tokyo University of Agriculture and Technology, Fuchu, Japan
- *Correspondence: Shin Okazaki
|
28
|
Vlachakis D, Papageorgiou L, Papadaki A, Georga M, Kossida S, Eliopoulos E. An updated evolutionary study of the Notch family reveals a new ancient origin and novel invariable motifs as potential pharmacological targets. PeerJ 2020; 8:e10334. [PMID: 33194454 PMCID: PMC7649014 DOI: 10.7717/peerj.10334] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Received: 06/05/2020] [Accepted: 10/19/2020] [Indexed: 01/02/2023] Open
Abstract
Notch family proteins play a key role in a variety of developmental processes by controlling cell fate decisions and operating in a great number of biological processes in several organ systems, such as hematopoiesis, somatogenesis, vasculogenesis, neurogenesis and homeostasis. The Notch signaling pathway is crucial for the majority of developmental programs and regulates multiple pathogenic processes. Activation of Notch family receptors has been largely related to its multiple effects in sustaining oncogenesis. The Notch signaling pathway constitutes an ancient and conserved mechanism for cell-to-cell communication. Much of what is known about the function of Notch family proteins comes from studies done in Caenorhabditis elegans and Drosophila melanogaster. Although human Notch homologs have also been identified, the molecular mechanisms that modulate the Notch signaling pathway remain substantially unknown. In this study, an updated evolutionary analysis of the Notch family members among 603 different organisms of all kingdoms, from bacteria to humans, was performed in order to discover key regions that have been conserved throughout evolution and play a major role in the Notch signaling pathway. The major goal of this study is the presentation of a novel updated phylogenetic tree for the Notch family as a reliable phylogeny "map", in order to correlate information on the closely related members and identify new possible pharmacological targets that can be used in pathogenic cases, including cancer.
Affiliation(s)
- Dimitrios Vlachakis
- Laboratory of Genetics, Department of Biotechnology, School of Applied Biology and Biotechnology, Agricultural University of Athens, Athens, Greece
- University Research Institute of Maternal and Child Health & Precision Medicine, and UNESCO Chair on Adolescent Health Care, “Aghia Sophia” Children’s Hospital, National and Kapodistrian University of Athens, Athens, Greece
- Division of Endocrinology and Metabolism, Center of Clinical, Experimental Surgery and Translational Research, Biomedical Research Foundation of the Academy of Athens, Athens, Greece
| | - Louis Papageorgiou
- Department of Informatics and Telecommunications, National and Kapodistrian University of Athens, Athens, Greece
| | - Ariadne Papadaki
- Laboratory of Genetics, Department of Biotechnology, School of Applied Biology and Biotechnology, Agricultural University of Athens, Athens, Greece
| | - Maria Georga
- Laboratory of Genetics, Department of Biotechnology, School of Applied Biology and Biotechnology, Agricultural University of Athens, Athens, Greece
| | - Sofia Kossida
- IMGT, The International ImMunoGeneTics Information System, Université de Montpellier, Laboratoire d’ImmunoGénétique Moléculaire and Institut de Génétique Humaine, University of Montpellier, Montpellier, France
| | - Elias Eliopoulos
- Laboratory of Genetics, Department of Biotechnology, School of Applied Biology and Biotechnology, Agricultural University of Athens, Athens, Greece
|
29
|
Seong K, Seo E, Witek K, Li M, Staskawicz B. Evolution of NLR resistance genes with noncanonical N-terminal domains in wild tomato species. The New Phytologist 2020; 227:1530-1543. [PMID: 32344448 DOI: 10.1111/nph.16628] [Citation(s) in RCA: 43] [Impact Index Per Article: 10.8] [Received: 12/10/2019] [Accepted: 04/11/2020] [Indexed: 06/11/2023]
Abstract
Nucleotide-binding and leucine-rich repeat immune receptors (NLRs) provide resistance against diverse pathogens. To create comparative NLR resources, we conducted resistance gene enrichment sequencing (RenSeq) with single-molecule real-time sequencing of PacBio for 18 accessions in Solanaceae, including 15 accessions of five wild tomato species. We investigated the evolution of a class of NLRs, CNLs with extended N-terminal sequences previously named Solanaceae Domain. Through comparative genomic analysis, we revealed that the extended CNLs (exCNLs) anciently emerged in the most recent common ancestor between Asterids and Amaranthaceae, far predating the Solanaceae family. In tomatoes, the exCNLs display exceptional modes of evolution in a clade-specific manner. In the clade G3, exCNLs have substantially elongated their N-termini through tandem duplications of exon segments. In the clade G1, exCNLs have evolved through recent proliferation and sequence diversification. In the clade G6, an ancestral exCNL has lost its N-terminal domains in the course of evolution. Our study provides high-quality NLR gene models for close relatives of domesticated tomatoes that can serve as a useful resource for breeding and molecular engineering for disease resistance. Our findings regarding the exCNLs offer unique backgrounds and insights for future functional studies of the NLRs.
Affiliation(s)
- Kyungyong Seong
- Department of Plant and Microbial Biology, University of California, Berkeley, CA, 94720, USA
- Innovative Genomics Institute, University of California, Berkeley, CA, 94704, USA
| | - Eunyoung Seo
- Department of Plant and Microbial Biology, University of California, Berkeley, CA, 94720, USA
- Innovative Genomics Institute, University of California, Berkeley, CA, 94704, USA
| | - Kamil Witek
- The Sainsbury Laboratory, University of East Anglia, Norwich Research Park, Norwich, NR4 7UH, UK
| | - Meng Li
- Department of Plant and Microbial Biology, University of California, Berkeley, CA, 94720, USA
- Innovative Genomics Institute, University of California, Berkeley, CA, 94704, USA
| | - Brian Staskawicz
- Department of Plant and Microbial Biology, University of California, Berkeley, CA, 94720, USA
- Innovative Genomics Institute, University of California, Berkeley, CA, 94704, USA
|
30
|
Choi Y, Jeong S, Choi JM, Ndong C, Griswold KE, Bailey-Kellogg C, Kim HS. Computer-guided binding mode identification and affinity improvement of an LRR protein binder without structure determination. PLoS Comput Biol 2020; 16:e1008150. [PMID: 32866140 PMCID: PMC7485979 DOI: 10.1371/journal.pcbi.1008150] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Received: 03/18/2020] [Revised: 09/11/2020] [Accepted: 07/14/2020] [Indexed: 12/24/2022] Open
Abstract
Precise binding mode identification and subsequent affinity improvement without structure determination remain a challenge in the development of therapeutic proteins. However, relevant experimental techniques are generally quite costly, and purely computational methods have been unreliable. Here, we show that integrated computational and experimental epitope localization followed by full-atom energy minimization can yield an accurate complex model structure which ultimately enables effective affinity improvement and redesign of binding specificity. As proof-of-concept, we used a leucine-rich repeat (LRR) protein binder, called a repebody (Rb), that specifically recognizes human IgG1 (hIgG1). We performed computationally-guided identification of the Rb:hIgG1 binding mode and leveraged the resulting model to reengineer the Rb so as to significantly increase its binding affinity for hIgG1 as well as redesign its specificity toward multiple IgGs from other species. Experimental structure determination verified that our Rb:hIgG1 model closely matched the co-crystal structure. Using a benchmark of other LRR protein complexes, we further demonstrated that the present approach may be broadly applicable to proteins undergoing relatively small conformational changes upon target binding.
Affiliation(s)
- Yoonjoo Choi
- Department of Biological Sciences, Korea Advanced Institute of Science and Technology, Daejeon, Korea
| | - Sukyo Jeong
- Department of Biological Sciences, Korea Advanced Institute of Science and Technology, Daejeon, Korea
| | - Jung-Min Choi
- Department of Biological Sciences, Korea Advanced Institute of Science and Technology, Daejeon, Korea
| | - Christian Ndong
- Thayer School of Engineering, Dartmouth College, Hanover, New Hampshire, United States of America
| | - Karl E. Griswold
- Thayer School of Engineering, Dartmouth College, Hanover, New Hampshire, United States of America
- Norris Cotton Cancer Center at Dartmouth, Lebanon, New Hampshire, United States of America
- Department of Biological Sciences, Dartmouth College, Hanover, New Hampshire, United States of America
| | - Chris Bailey-Kellogg
- Department of Computer Science, Dartmouth College, Hanover, New Hampshire, United States of America
| | - Hak-Sung Kim
- Department of Biological Sciences, Korea Advanced Institute of Science and Technology, Daejeon, Korea
|
31
|
Yadav A, Fernández-Baca D, Cannon SB. Family-Specific Gains and Losses of Protein Domains in the Legume and Grass Plant Families. Evol Bioinform Online 2020; 16:1176934320939943. [PMID: 32694909 PMCID: PMC7350399 DOI: 10.1177/1176934320939943] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/09/2020] [Accepted: 06/15/2020] [Indexed: 11/27/2022] Open
Abstract
Protein domains can be regarded as sections of protein sequences capable of folding independently and performing specific functions. In addition to amino-acid level changes, protein sequences can also evolve through domain shuffling events such as domain insertion, deletion, or duplication. The evolution of protein domains can be studied by tracking domain changes in a selected set of species with known phylogenetic relationships. Here, we conduct such an analysis by defining domains as “features” or “descriptors,” and considering the species (target + outgroup) as instances or data-points in a data matrix. We then look for features (domains) that are significantly different between the target species and the outgroup species. We study the domain changes in 2 large, distinct groups of plant species: legumes (Fabaceae) and grasses (Poaceae), with respect to selected outgroup species. We evaluate 4 types of domain feature matrices: domain content, domain duplication, domain abundance, and domain versatility. The 4 types of domain feature matrices attempt to capture different aspects of domain changes through which the protein sequences may evolve, that is, via gain or loss of domains, increase or decrease in the copy number of domains along the sequences, expansion or contraction of domains, or through changes in the number of adjacent domain partners. All the feature matrices were analyzed using feature selection techniques and statistical tests to select protein domains that have significantly different feature values in legumes and grasses. We report the biological functions of the top selected domains from the analysis of all the feature matrices. In addition, we also perform domain-centric gene ontology (dcGO) enrichment analysis on all selected domains from all 4 feature matrices to study the gene ontology terms associated with the significantly evolving domains in legumes and grasses.
Domain content analysis revealed a striking loss of protein domains from the Fanconi anemia (FA) pathway, the pathway responsible for the repair of interstrand DNA crosslinks. The abundance analysis of domains found in legumes revealed an increase in glutathione synthase enzyme, an antioxidant required for nitrogen fixation, and a decrease in xanthine oxidizing enzymes, a phenomenon confirmed by previous studies. In grasses, the abundance analysis showed increases in domains related to gene silencing, which could be due to polyploidy or to an enhanced response to viral infection. We provide a docker container that can be used to perform this analysis workflow on any user-defined sets of species, available at https://cloud.docker.com/u/akshayayadav/repository/docker/akshayayadav/protein-domain-evolution-project.
Affiliation(s)
- Akshay Yadav
- Bioinformatics and Computational Biology Graduate Program, Iowa State University, Ames, IA, USA
| | | | - Steven B Cannon
- Corn Insects and Crop Genetics Research Unit, USDA-Agricultural Research Service, Ames, IA, USA
|
32
|
Galpern EA, Freiberger MI, Ferreiro DU. Large Ankyrin repeat proteins are formed with similar and energetically favorable units. PLoS One 2020; 15:e0233865. [PMID: 32579546 PMCID: PMC7314423 DOI: 10.1371/journal.pone.0233865] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Received: 02/04/2020] [Accepted: 05/13/2020] [Indexed: 11/19/2022] Open
Abstract
Ankyrin-containing proteins are one of the most abundant repeat protein families present in all extant organisms. They are made of tandem copies of similar amino acid stretches that fold into elongated architectures. Here, we built and curated a dataset of 200 thousand proteins that contain 1.2 million Ankyrin regions and characterized the abundance, structure and energetics of the repetitive regions in natural proteins. We found that there is a continuous, roughly exponential variety of array lengths with an exceptional frequency at 24 repeats. We found that individual repeats are seldom interrupted by long insertions and accept few deletions, in line with the known tertiary structures. We found that longer arrays are made up of repeats that are more similar to each other than those in shorter arrays, and display more favourable folding energy, hinting at their evolutionary origin. The array distributions show that there is a physical upper limit to the size of an array of repeats of about 120 copies, consistent with the limit found in nature. The identity patterns within the arrays suggest that they may have originated by sequential copies of more than one Ankyrin unit.
Affiliation(s)
- Ezequiel A. Galpern
- Protein Physiology Lab, Departamento de Química Biológica, Instituto de Química Biológica de la Facultad de Ciencias Exactas y Naturales (IQUIBICEN-CONICE), Universidad de Buenos Aires, Buenos Aires, Argentina
| | - María I. Freiberger
- Protein Physiology Lab, Departamento de Química Biológica, Instituto de Química Biológica de la Facultad de Ciencias Exactas y Naturales (IQUIBICEN-CONICE), Universidad de Buenos Aires, Buenos Aires, Argentina
| | - Diego U. Ferreiro
- Protein Physiology Lab, Departamento de Química Biológica, Instituto de Química Biológica de la Facultad de Ciencias Exactas y Naturales (IQUIBICEN-CONICE), Universidad de Buenos Aires, Buenos Aires, Argentina
|
33
|
Malembic-Maher S, Desqué D, Khalil D, Salar P, Bergey B, Danet JL, Duret S, Dubrana-Ourabah MP, Beven L, Ember I, Acs Z, Della Bartola M, Materazzi A, Filippin L, Krnjajic S, Krstić O, Toševski I, Lang F, Jarausch B, Kölber M, Jović J, Angelini E, Arricau-Bouvery N, Maixner M, Foissac X. When a Palearctic bacterium meets a Nearctic insect vector: Genetic and ecological insights into the emergence of the grapevine Flavescence dorée epidemics in Europe. PLoS Pathog 2020; 16:e1007967. [PMID: 32210479 PMCID: PMC7135369 DOI: 10.1371/journal.ppat.1007967] [Citation(s) in RCA: 29] [Impact Index Per Article: 7.3] [Received: 07/03/2019] [Revised: 04/06/2020] [Accepted: 02/18/2020] [Indexed: 11/28/2022] Open
Abstract
Flavescence dorée (FD) is a European quarantine grapevine disease transmitted by the Deltocephalinae leafhopper Scaphoideus titanus. Whereas this vector had been introduced from North America, the possible European origin of FD phytoplasma needed to be challenged and correlated with ecological and genetic drivers of FD emergence. For that purpose, a survey of genetic diversity of these phytoplasmas in grapevines, S. titanus, black alders, alder leafhoppers and clematis was conducted in five European countries. Out of 132 map genotypes, only 11 were associated with FD outbreaks, three were detected in clematis, whereas 127 were detected in alder trees, alder leafhoppers or in grapevines outside of FD outbreaks. Most of the alder trees were found infected, including 8% with FD genotypes M6, M38 and M50, also present in alders neighboring FD-free vineyards and vineyard-free areas. The Macropsinae Oncopsis alni could transmit genotypes unable to achieve transmission by S. titanus, while the Deltocephalinae Allygus spp. and Orientus ishidae transmitted M38 and M50 that proved to be compatible with S. titanus. Variability of vmpA and vmpB adhesin-like genes clearly discriminated 3 genetic clusters. Cluster Vmp-I grouped genotypes only transmitted by O. alni, while clusters Vmp-II and -III grouped genotypes transmitted by Deltocephalinae leafhoppers. Interestingly, adhesin repeated domains evolved independently in cluster Vmp-I, whereas those in clusters Vmp-II and -III showed recent duplications. Latex beads coated with various ratios of VmpA of clusters II and I showed that cluster II VmpA promoted enhanced adhesion to the Deltocephalinae Euscelidius variegatus epithelial cells and that these beads were better retained in both E. variegatus and S. titanus midguts. Our data demonstrate that most FD phytoplasmas are endemic to European alders. Their emergence as grapevine epidemic pathogens appeared restricted to some genetic variants pre-existing in alders, whose compatibility with S. titanus correlates with different vmp gene sequences and VmpA binding properties.
Affiliation(s)
| | | | - Dima Khalil
- INRAE, Univ. Bordeaux, UMR BFP, Villenave d’Ornon, France
| | - Pascal Salar
- INRAE, Univ. Bordeaux, UMR BFP, Villenave d’Ornon, France
| | - Bernard Bergey
- INRAE, Univ. Bordeaux, UMR BFP, Villenave d’Ornon, France
| | - Jean-Luc Danet
- INRAE, Univ. Bordeaux, UMR BFP, Villenave d’Ornon, France
| | - Sybille Duret
- INRAE, Univ. Bordeaux, UMR BFP, Villenave d’Ornon, France
| | | | - Laure Beven
- INRAE, Univ. Bordeaux, UMR BFP, Villenave d’Ornon, France
| | | | - Zoltan Acs
- Genlogs Biodiagnosztika Ltd, Budapest, Hungary
| | | | - Alberto Materazzi
- Department of Agriculture, Food and Environment, University of Pisa, Pisa, Italy
| | | | - Slobodan Krnjajic
- Department of Plant Pests, Institute of Plant Protection and Environment, Zemun, Serbia
| | - Oliver Krstić
- Department of Plant Pests, Institute of Plant Protection and Environment, Zemun, Serbia
| | - Ivo Toševski
- Department of Plant Pests, Institute of Plant Protection and Environment, Zemun, Serbia
- CABI, Delémont, Switzerland
| | - Friederike Lang
- JKI, Institute for Plant Protection in Fruit Crops and Viticulture, Siebeldingen, Germany
| | - Barbara Jarausch
- JKI, Institute for Plant Protection in Fruit Crops and Viticulture, Siebeldingen, Germany
| | | | - Jelena Jović
- Department of Plant Pests, Institute of Plant Protection and Environment, Zemun, Serbia
| | | | | | - Michael Maixner
- JKI, Institute for Plant Protection in Fruit Crops and Viticulture, Siebeldingen, Germany
| | - Xavier Foissac
- INRAE, Univ. Bordeaux, UMR BFP, Villenave d’Ornon, France
|
34
|
Situ J, Jiang L, Fan X, Yang W, Li W, Xi P, Deng Y, Kong G, Jiang Z. An RXLR effector PlAvh142 from Peronophythora litchii triggers plant cell death and contributes to virulence. Molecular Plant Pathology 2020; 21:415-428. [PMID: 31912634 PMCID: PMC7036370 DOI: 10.1111/mpp.12905] [Citation(s) in RCA: 33] [Impact Index Per Article: 8.3] [Received: 07/17/2019] [Revised: 12/05/2019] [Accepted: 12/06/2019] [Indexed: 05/09/2023]
Abstract
Litchi downy blight, caused by the phytopathogenic oomycete Peronophythora litchii, results in tremendous economic loss in litchi production every year. To successfully colonize the host cell, Phytophthora species secrete hundreds of RXLR effectors that interfere with plant immunity and facilitate the infection process. Previous work has already predicted 245 candidate RXLR effector-encoding genes in P. litchii, 212 of which have been cloned and tested for plant cell death-inducing activity in this study. We found that three such RXLR effectors could trigger plant cell death through transient expression in Nicotiana benthamiana. Further experiments demonstrated that PlAvh142 could induce cell death and immune responses in several plants. We also found that PlAvh142 localized in both the cytoplasm and nucleus of plant cells. The cytoplasmic localization was critical for its cell death-inducing activity. Moreover, deletion of either of the two internal repeats in PlAvh142 abolished the cell death-inducing activity. Virus-induced gene silencing assays showed that cell death triggered by PlAvh142 was dependent on the plant transduction components RAR1 (required for Mla12 resistance), SGT1 (suppressor of the G2 allele of skp1) and HSP90 (heat shock protein 90). Finally, knockout of PlAvh142 resulted in significantly attenuated P. litchii virulence on litchi plants, whereas the PlAvh142-overexpressing mutants were more aggressive. These data indicate that PlAvh142 can be recognized in the plant cytoplasm and is an important virulence RXLR effector of P. litchii.
Affiliation(s)
- Junjian Situ
- Department of Plant Pathology/Guangdong Province Key Laboratory of Microbial Signals and Disease Control, South China Agricultural University, Guangzhou, China
| | - Liqun Jiang
- Department of Plant Pathology/Guangdong Province Key Laboratory of Microbial Signals and Disease Control, South China Agricultural University, Guangzhou, China
- Guangdong Province Key Laboratory of New Technology in Rice Breeding/Rice Research Institute, Guangdong Academy of Agricultural Sciences, Guangzhou, China
| | - Xiaoning Fan
- Department of Plant Pathology/Guangdong Province Key Laboratory of Microbial Signals and Disease Control, South China Agricultural University, Guangzhou, China
| | - Wensheng Yang
- Department of Plant Pathology/Guangdong Province Key Laboratory of Microbial Signals and Disease Control, South China Agricultural University, Guangzhou, China
| | - Wen Li
- Department of Plant Pathology/Guangdong Province Key Laboratory of Microbial Signals and Disease Control, South China Agricultural University, Guangzhou, China
| | - Pinggen Xi
- Department of Plant Pathology/Guangdong Province Key Laboratory of Microbial Signals and Disease Control, South China Agricultural University, Guangzhou, China
| | - Yizhen Deng
- Department of Plant Pathology/Guangdong Province Key Laboratory of Microbial Signals and Disease Control, South China Agricultural University, Guangzhou, China
| | - Guanghui Kong
- Department of Plant Pathology/Guangdong Province Key Laboratory of Microbial Signals and Disease Control, South China Agricultural University, Guangzhou, China
| | - Zide Jiang
- Department of Plant Pathology/Guangdong Province Key Laboratory of Microbial Signals and Disease Control, South China Agricultural University, Guangzhou, China
| |
|
35
|
de Jonge PA, von Meijenfeldt FAB, van Rooijen LE, Brouns SJJ, Dutilh BE. Evolution of BACON Domain Tandem Repeats in crAssphage and Novel Gut Bacteriophage Lineages. Viruses 2019; 11:v11121085. [PMID: 31766550 PMCID: PMC6949934 DOI: 10.3390/v11121085] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Received: 10/31/2019] [Revised: 11/17/2019] [Accepted: 11/19/2019] [Indexed: 12/12/2022]
Abstract
The human gut contains an expanse of largely unstudied bacteriophages. Among the most common are crAss-like phages, which were predicted to infect Bacteroidetes hosts. CrAssphage, the first crAss-like phage to be discovered, contains a protein encoding a Bacteroides-associated carbohydrate-binding often N-terminal (BACON) domain tandem repeat. Because protein domain tandem repeats are often hotspots of evolution, BACON domains may provide insight into the evolution of crAss-like phages. Here, we studied the biodiversity and evolution of BACON domains in bacteriophages by analysing over 2 million viral contigs. We found a high biodiversity of BACON in seven gut phage lineages, including five known crAss-like phage lineages and two novel gut phage lineages that are distantly related to crAss-like phages. In three BACON-containing phage lineages, we found that BACON domain tandem repeats were associated with phage tail proteins, suggestive of a possible role of these repeats in host binding. In contrast, individual BACON domains that did not occur in tandem were not found in the proximity of tail proteins. In two lineages, tail-associated BACON domain tandem repeats evolved largely through horizontal transfer of separate domains. In the third lineage that includes the prototypical crAssphage, the tandem repeats arose from several sequential domain duplications, resulting in a characteristic tandem array that is distinct from bacterial BACON domains. We conclude that phage tail-associated BACON domain tandem repeats have evolved in at least two independent cases in gut bacteriophages, including in the widespread gut phage crAssphage.
Affiliation(s)
- Patrick A. de Jonge
- Theoretical Biology and Bioinformatics, Science4Life, Utrecht University, 3584 CH Utrecht, The Netherlands
- Department of Bionanoscience, Kavli Institute of Nanoscience, Delft University of Technology, 2629 HZ Delft, The Netherlands
| | - F. A. Bastiaan von Meijenfeldt
- Theoretical Biology and Bioinformatics, Science4Life, Utrecht University, 3584 CH Utrecht, The Netherlands
| | - Laura E. van Rooijen
- Theoretical Biology and Bioinformatics, Science4Life, Utrecht University, 3584 CH Utrecht, The Netherlands
| | - Stan J. J. Brouns
- Department of Bionanoscience, Kavli Institute of Nanoscience, Delft University of Technology, 2629 HZ Delft, The Netherlands
| | - Bas E. Dutilh
- Theoretical Biology and Bioinformatics, Science4Life, Utrecht University, 3584 CH Utrecht, The Netherlands
- Centre for Molecular and Biomolecular Informatics, Radboud Institute for Molecular Life Sciences, Radboud University Medical Centre, 6525 GA Nijmegen, The Netherlands
| |
|
36
|
Gao B, Wang J, Huang J, Huang X, Sha W, Qin L. The dynamic region of the peptidoglycan synthase gene, Rv0050, induces the growth rate and morphologic heterogeneity in Mycobacteria. Infect Genet Evol 2019; 72:86-92. [DOI: 10.1016/j.meegid.2018.12.012] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Received: 09/10/2018] [Revised: 11/30/2018] [Accepted: 12/07/2018] [Indexed: 12/16/2022]
|
37
|
Basile W, Salvatore M, Bassot C, Elofsson A. Why do eukaryotic proteins contain more intrinsically disordered regions? PLoS Comput Biol 2019; 15:e1007186. [PMID: 31329574 PMCID: PMC6675126 DOI: 10.1371/journal.pcbi.1007186] [Citation(s) in RCA: 56] [Impact Index Per Article: 11.2] [Received: 12/28/2018] [Revised: 08/01/2019] [Accepted: 06/14/2019] [Indexed: 12/12/2022]
Abstract
Intrinsic disorder is more abundant in eukaryotic than prokaryotic proteins. Methods predicting intrinsic disorder are based on the amino acid sequence of a protein. Therefore, there must exist an underlying difference in the sequences between eukaryotic and prokaryotic proteins causing the (predicted) difference in intrinsic disorder. By comparing proteins from complete eukaryotic and prokaryotic proteomes, we show that the difference in intrinsic disorder emerges from the linker regions connecting Pfam domains. Eukaryotic proteins have more extended linker regions, and in addition, the eukaryotic linkers are significantly more disordered, 38% vs. 12-16% disordered residues. Next, we examined the underlying reason for the increase in disorder in eukaryotic linkers, and we found that the changes in abundance of only three amino acids cause the increase. Eukaryotic proteins contain 8.6% serine, while prokaryotic proteins have 6.5%; eukaryotic proteins also contain 5.4% proline and 5.3% isoleucine compared with 4.0% proline and ≈ 7.5% isoleucine in the prokaryotes. All three of these differences contribute to the increased disorder in eukaryotic proteins. It is tempting to speculate that the increase in serine frequencies in eukaryotes is related to regulation by kinases, but direct evidence for this is lacking. The differences are observed in all phyla, protein families, structural regions and type of protein but are most pronounced in disordered and linker regions. The observation that differences in the abundance of three amino acids cause the difference in disorder between eukaryotic and prokaryotic proteins raises the question: Are amino acid frequencies different in eukaryotic linkers because the linkers are more disordered or do the differences cause the increased disorder?
Intrinsic disorder is essential for various functions in eukaryotic cells and is a signature of eukaryotic proteins. Here, we try to understand the origin of the difference in disorder between eukaryotic and prokaryotic proteins. We show that eukaryotic proteins contain more extended linker regions and that these linker regions are significantly more disordered. Further, we show, for the first time, that the difference in disorder originates from a systematic difference in amino acid frequencies between eukaryotic and prokaryotic proteins. Three amino acids contribute to the difference in disorder; serine and proline are more abundant in eukaryotic linkers, while isoleucine is less frequent. These shifts in frequencies are observed in all phyla, protein families, structural regions and type of protein but are most pronounced in disordered and linker regions. It is tempting to speculate that the increase in serine frequencies in eukaryotes is related to regulation by kinases, but direct evidence for this is lacking. In any case, the widespread nature of the shifts in abundance indicates that the differences are ancient and caused by some not yet fully understood selective difference acting on eukaryotic and prokaryotic proteins.
Affiliation(s)
- Walter Basile
- Science for Life Laboratory, Stockholm University, Solna, Sweden
- Department of Biochemistry and Biophysics, Stockholm University, Stockholm, Sweden
| | - Marco Salvatore
- Science for Life Laboratory, Stockholm University, Solna, Sweden
- Department of Biochemistry and Biophysics, Stockholm University, Stockholm, Sweden
| | - Claudio Bassot
- Science for Life Laboratory, Stockholm University, Solna, Sweden
- Department of Biochemistry and Biophysics, Stockholm University, Stockholm, Sweden
| | - Arne Elofsson
- Science for Life Laboratory, Stockholm University, Solna, Sweden
- Department of Biochemistry and Biophysics, Stockholm University, Stockholm, Sweden
- Swedish e-Science Research Center (SeRC), Stockholm, Sweden
| |
|
38
|
Walden M, Tian L, Ross RL, Sykora UM, Byrne DP, Hesketh EL, Masandi SK, Cassel J, George R, Ault JR, El Oualid F, Pawłowski K, Salvino JM, Eyers PA, Ranson NA, Del Galdo F, Greenberg RA, Zeqiraj E. Metabolic control of BRISC-SHMT2 assembly regulates immune signalling. Nature 2019; 570:194-199. [PMID: 31142841 PMCID: PMC6914362 DOI: 10.1038/s41586-019-1232-1] [Citation(s) in RCA: 45] [Impact Index Per Article: 9.0] [Received: 07/03/2018] [Accepted: 04/29/2019] [Indexed: 02/04/2023]
Abstract
Serine hydroxymethyltransferase 2 (SHMT2) regulates one-carbon transfer reactions that are essential for amino acid and nucleotide metabolism, and uses pyridoxal-5'-phosphate (PLP) as a cofactor. Apo SHMT2 exists as a dimer with unknown functions, whereas PLP binding stabilizes the active tetrameric state. SHMT2 also promotes inflammatory cytokine signalling by interacting with the deubiquitylating BRCC36 isopeptidase complex (BRISC), although it is unclear whether this function relates to metabolism. Here we present the cryo-electron microscopy structure of the human BRISC-SHMT2 complex at a resolution of 3.8 Å. BRISC is a U-shaped dimer of four subunits, and SHMT2 sterically blocks the BRCC36 active site and inhibits deubiquitylase activity. Only the inactive SHMT2 dimer, and not the active PLP-bound tetramer, binds and inhibits BRISC. Mutations in BRISC that disrupt SHMT2 binding impair type I interferon signalling in response to inflammatory stimuli. Intracellular levels of PLP regulate the interaction between BRISC and SHMT2, as well as inflammatory cytokine responses. These data reveal a mechanism in which metabolites regulate deubiquitylase activity and inflammatory signalling.
Affiliation(s)
- Miriam Walden
- Astbury Centre for Structural Molecular Biology, School of Molecular and Cellular Biology, Faculty of Biological Sciences, University of Leeds, Leeds, UK
| | - Lei Tian
- Department of Cancer Biology, Basser Center for BRCA, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Rebecca L Ross
- Leeds Institute of Rheumatic and Musculoskeletal Medicine and NIHR Biomedical Research Centre, University of Leeds, Leeds, UK
| | - Upasana M Sykora
- Astbury Centre for Structural Molecular Biology, School of Molecular and Cellular Biology, Faculty of Biological Sciences, University of Leeds, Leeds, UK
| | - Dominic P Byrne
- Department of Biochemistry, Institute of Integrative Biology, University of Liverpool, Liverpool, UK
| | - Emma L Hesketh
- Astbury Centre for Structural Molecular Biology, School of Molecular and Cellular Biology, Faculty of Biological Sciences, University of Leeds, Leeds, UK
| | - Safi K Masandi
- Astbury Centre for Structural Molecular Biology, School of Molecular and Cellular Biology, Faculty of Biological Sciences, University of Leeds, Leeds, UK
| | - Joel Cassel
- The Wistar Cancer Center for Molecular Screening, The Wistar Institute, Philadelphia, PA, USA
| | - Rachel George
- Astbury Centre for Structural Molecular Biology, School of Molecular and Cellular Biology, Faculty of Biological Sciences, University of Leeds, Leeds, UK
| | - James R Ault
- Astbury Centre for Structural Molecular Biology, School of Molecular and Cellular Biology, Faculty of Biological Sciences, University of Leeds, Leeds, UK
| | | | - Krzysztof Pawłowski
- Warsaw University of Life Sciences, Warsaw, Poland
- Department of Translational Medicine, Clinical Sciences, Lund University, Lund, Sweden
| | - Joseph M Salvino
- The Wistar Cancer Center for Molecular Screening, The Wistar Institute, Philadelphia, PA, USA
| | - Patrick A Eyers
- Department of Biochemistry, Institute of Integrative Biology, University of Liverpool, Liverpool, UK
| | - Neil A Ranson
- Astbury Centre for Structural Molecular Biology, School of Molecular and Cellular Biology, Faculty of Biological Sciences, University of Leeds, Leeds, UK
| | - Francesco Del Galdo
- Leeds Institute of Rheumatic and Musculoskeletal Medicine and NIHR Biomedical Research Centre, University of Leeds, Leeds, UK
| | - Roger A Greenberg
- Department of Cancer Biology, Basser Center for BRCA, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA.
| | - Elton Zeqiraj
- Astbury Centre for Structural Molecular Biology, School of Molecular and Cellular Biology, Faculty of Biological Sciences, University of Leeds, Leeds, UK.
| |
|
39
|
Machulin A, Deryusheva E, Lobanov M, Galzitskaya O. Repeats in S1 Proteins: Flexibility and Tendency for Intrinsic Disorder. Int J Mol Sci 2019; 20:ijms20102377. [PMID: 31091666 PMCID: PMC6566611 DOI: 10.3390/ijms20102377] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Received: 04/09/2019] [Revised: 05/06/2019] [Accepted: 05/10/2019] [Indexed: 11/16/2022]
Abstract
An important feature of ribosomal S1 proteins is multiple copies of structural domains in bacteria, the number of which changes in a strictly limited range from one to six. For S1 proteins, little is known about the contribution of flexible regions to protein domain function. We exhaustively studied the tendency for intrinsic disorder and flexibility within and between structural domains for all available UniProt S1 sequences. Using charge–hydrophobicity plot cumulative distribution function (CH-CDF) analysis, we classified 53% of S1 proteins as ordered proteins; the remaining proteins were related to the molten globule state. S1 proteins are characterized by an equal ratio of regions connecting the secondary structure within and between structural domains, which indicates a similar organization of separate S1 domains and multi-domain S1 proteins. According to the FoldUnfold and IsUnstruct programs, in the multi-domain proteins, relatively short flexible or disordered regions are predominant. The lowest percentage of flexibility is in the central parts of multi-domain proteins. Our results suggest that the ratio of flexibility in the separate domains is related to their roles in the activity and functionality of S1: a more stable and compact central part in the multi-domain proteins is vital for RNA interaction, whereas the terminal domains are important for other functions.
Affiliation(s)
- Andrey Machulin
- Skryabin Institute of Biochemistry and Physiology of Microorganisms, Russian Academy of Sciences, Federal Research Center "Pushchino Scientific Center for Biological Research of the Russian Academy of Sciences", 142290 Pushchino, Russia.
| | - Evgenia Deryusheva
- Institute for Biological Instrumentation, Federal Research Center "Pushchino Scientific Center for Biological Research of the Russian Academy of Sciences", 142290 Pushchino, Russia.
| | - Mikhail Lobanov
- Institute of Protein Research, Russian Academy of Sciences, 142290 Pushchino, Russia.
| | - Oxana Galzitskaya
- Institute of Protein Research, Russian Academy of Sciences, 142290 Pushchino, Russia.
| |
|
40
|
A Graph-Based Approach for Detecting Sequence Homology in Highly Diverged Repeat Protein Families. Methods Mol Biol 2019. [PMID: 30298401 DOI: 10.1007/978-1-4939-8736-8_13] [Citation(s) in RCA: 0] [Impact Index Per Article: 0]
Abstract
Reconstructing evolutionary relationships in repeat proteins is notoriously difficult due to the high degree of sequence divergence that typically occurs between duplicated repeats. This is complicated further by the fact that proteins with a large number of similar repeats are more likely to produce significant local sequence alignments than proteins with fewer copies of the repeat motif. Furthermore, biologically correct sequence alignments are sometimes impossible to achieve in cases where insertion or translocation events disrupt the order of repeats in one of the sequences being aligned. Combined, these attributes make traditional phylogenetic methods for studying protein families unreliable for repeat proteins, due to the dependence of such methods on accurate sequence alignment. We present here a practical solution to this problem, making use of graph clustering combined with the open-source software package HH-suite, which enables highly sensitive detection of sequence relationships. Carrying out multiple rounds of homology searches via alignment of profile hidden Markov models, large sets of related proteins are generated. By representing the relationships between proteins in these sets as graphs, subsequent clustering with the Markov cluster algorithm enables robust detection of repeat protein subfamilies.
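The Markov cluster step named in this abstract can be illustrated with a minimal sketch. This is a generic pure-Python version of the expansion and inflation loop, not the chapter's pipeline: the function names, the inflation value of 2.0, the fixed iteration count, and the toy similarity matrix are all assumptions made for illustration, and HH-suite itself is not invoked.

```python
def normalize_columns(m):
    """Scale every column so that it sums to 1 (column-stochastic matrix)."""
    n = len(m)
    col_sums = [sum(m[i][j] for i in range(n)) for j in range(n)]
    return [[m[i][j] / col_sums[j] for j in range(n)] for i in range(n)]


def matmul(a, b):
    """Plain square-matrix product, used for the expansion step."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def mcl(similarity, inflation=2.0, iterations=50):
    """Cluster nodes of a weighted graph given as a square similarity matrix.

    Hypothetical helper: returns a sorted list of clusters, each a sorted
    list of node indices.
    """
    n = len(similarity)
    # Self-loops keep every column sum positive and stabilise the iteration.
    m = [[float(similarity[i][j]) + (1.0 if i == j else 0.0)
          for j in range(n)] for i in range(n)]
    m = normalize_columns(m)
    for _ in range(iterations):
        m = matmul(m, m)                                   # expansion
        m = [[x ** inflation for x in row] for row in m]   # inflation
        m = normalize_columns(m)
    # Rows that retain non-negligible mass list the members of one cluster.
    clusters = set()
    for i in range(n):
        members = frozenset(j for j in range(n) if m[i][j] > 1e-6)
        if members:
            clusters.add(members)
    return sorted(sorted(c) for c in clusters)


if __name__ == "__main__":
    # Toy graph (assumed data): proteins 0-2 hit each other, as do 3-5.
    toy = [
        [0, 1, 1, 0, 0, 0],
        [1, 0, 1, 0, 0, 0],
        [1, 1, 0, 0, 0, 0],
        [0, 0, 0, 0, 1, 1],
        [0, 0, 0, 1, 0, 1],
        [0, 0, 0, 1, 1, 0],
    ]
    print(mcl(toy))  # [[0, 1, 2], [3, 4, 5]]
```

In practice the edge weights would come from homology search scores rather than a hand-written matrix, and a dedicated MCL implementation would likely be preferable for large graphs.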
|
41
|
A Recurrent Motif: Diversity and Evolution of ShKT Domain Containing Proteins in the Vampire Snail Cumia reticulata. Toxins (Basel) 2019; 11:toxins11020106. [PMID: 30759797 PMCID: PMC6409789 DOI: 10.3390/toxins11020106] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Received: 01/14/2019] [Revised: 02/04/2019] [Accepted: 02/07/2019] [Indexed: 11/17/2022]
Abstract
Proteins of the ShK superfamily are characterized by a small conserved domain (ShKT), first discovered in small venom peptides produced by sea anemones, and acting as specific inhibitors of voltage-dependent and calcium-activated K+ channels. The ShK superfamily includes both small toxic peptides and larger multifunctional proteins with various functions. ShK toxins are often important components of animal venoms, where they perform different biological functions including neurotoxic and immunosuppressive effects. Given their high specificity and effectiveness, they are currently regarded as promising pharmacological lead compounds for the treatment of autoimmune diseases. Here, we report on the molecular analysis of ShKT domain containing proteins produced by the Mediterranean vampire snail Cumia reticulata, an ectoparasitic gastropod that feeds on benthic fishes. The high specificity of expression of most ShK transcripts in salivary glands identifies them as relevant components of C. reticulata venom. These ShK proteins display various structural architectures, being produced either as single-domain secretory peptides, or as larger proteins combining the ShKT with M12 or CAP domains. Both ShKT-containing genes and their internal ShKT domains undergo frequent duplication events in C. reticulata, ensuring a high level of variability that is likely to play a role in increasing the range of their potential molecular targets.
|
42
|
Abstract
The range of barrel-shaped proteins found in the outer membrane of certain bacteria evolved through multiple pathways.
Affiliation(s)
- Vikas Nanda
- Center for Advanced Biotechnology and Medicine, Rutgers, The State University of New Jersey, Piscataway, United States
| |
|
43
|
Schmielau L, Dvorak M, Niederwanger M, Dobieszewski N, Pedrini-Martha V, Ladurner P, Pedregal JRG, Maréchal JD, Dallinger R. Differential response to Cadmium exposure by expression of a two and a three-domain metallothionein isoform in the land winkle Pomatias elegans: Valuating the marine heritage of a land snail. Sci Total Environ 2019; 648:561-571. [PMID: 30121534 DOI: 10.1016/j.scitotenv.2018.07.426] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Received: 03/28/2018] [Revised: 07/16/2018] [Accepted: 07/30/2018] [Indexed: 06/08/2023]
Abstract
Through evolution, marine snails have adapted several times independently to terrestrial life. A prime example of such transitions is the adaptation to terrestrial conditions in members of the gastropod clade of Littorinoidea (Caenogastropoda). Some species of this lineage, like the periwinkle (Littorina littorea), live in intertidal habitats, where they are intermittently exposed to semi-terrestrial conditions. Pomatias elegans is a close relative of Littorina littorea that has successfully colonized terrestrial habitats. Evolutionary transitions from marine to terrestrial conditions have often been fostered in marine ancestors by acquisition of physiological pre-adaptations to terrestrial life. Such pre-adaptations are based, among others, on the optimization of a wide repertoire of stress resistance mechanisms, such as the expression of metal-inactivating metallothioneins (MTs). The objective of our study was to explore the Cd handling strategy in the terrestrial snail Pomatias elegans in comparison to that observed previously in Littorina littorea. After Cd exposure, the metal is accumulated mainly in the midgut gland of Pomatias elegans, in a similar way as in its marine relative. Upon Cd exposure, Pomatias elegans expresses Cd-specific MTs, as also described from Littorina littorea. In contrast to the latter species, however, the detoxification of Cd in Pomatias elegans is mediated by two different MT isoforms, one two-domain and one three-domain MT. Although the MT proteins of both species are homologous and clearly originate from one common ancestor, the three-domain MT isoform of Pomatias elegans has evolved independently from the three-domain MT of its marine counterpart, probably by addition of a third domain to the pre-existing two-domain MT. Obviously, the occurrence of homologous MT structures in both species is a hereditary character, whereas the differentiation into two distinct MT isoforms with different upregulation capacities in Pomatias elegans is an adaptive feature that probably emerged upon transition to life on land.
Affiliation(s)
- Lara Schmielau
- Department of Zoology and Center of Molecular Biosciences Innsbruck, University of Innsbruck, Technikerstraße 25, 6020 Innsbruck, Austria
| | - Martin Dvorak
- Department of Zoology and Center of Molecular Biosciences Innsbruck, University of Innsbruck, Technikerstraße 25, 6020 Innsbruck, Austria
| | - Michael Niederwanger
- Department of Zoology and Center of Molecular Biosciences Innsbruck, University of Innsbruck, Technikerstraße 25, 6020 Innsbruck, Austria
| | - Nicole Dobieszewski
- Department of Zoology and Center of Molecular Biosciences Innsbruck, University of Innsbruck, Technikerstraße 25, 6020 Innsbruck, Austria
| | - Veronika Pedrini-Martha
- Department of Zoology and Center of Molecular Biosciences Innsbruck, University of Innsbruck, Technikerstraße 25, 6020 Innsbruck, Austria
| | - Peter Ladurner
- Department of Zoology and Center of Molecular Biosciences Innsbruck, University of Innsbruck, Technikerstraße 25, 6020 Innsbruck, Austria
| | | | - Jean-Didier Maréchal
- Insilichem, Departament de Química, Universitat Autònoma de Barcelona, 08193 Bellaterra, Barcelona, Spain
| | - Reinhard Dallinger
- Department of Zoology and Center of Molecular Biosciences Innsbruck, University of Innsbruck, Technikerstraße 25, 6020 Innsbruck, Austria.
| |
|
44
|
Abstract
This chapter reviews current research on how protein domain architectures evolve. We begin by summarizing work on the phylogenetic distribution of proteins, as this will directly impact which domain architectures can be formed in different species. Studies relating domain family size to occurrence have shown that they generally follow power law distributions, both within genomes and larger evolutionary groups. These findings were subsequently extended to multi-domain architectures. Genome evolution models that have been suggested to explain the shape of these distributions are reviewed, as well as evidence for selective pressure to expand certain domain families more than others. Each domain has an intrinsic combinatorial propensity, and the effects of this have been studied using measures of domain versatility or promiscuity. Next, we study the principles of protein domain architecture evolution and how these have been inferred from distributions of extant domain arrangements. Following this, we review inferences of ancestral domain architecture and the conclusions concerning domain architecture evolution mechanisms that can be drawn from these. Finally, we examine whether all known cases of a given domain architecture can be assumed to have a single common origin (monophyly) or have evolved convergently (polyphyly). We end with a discussion of some available tools for computational analysis or exploitation of protein domain architectures and their evolution.
|
45
|
Uliano-Silva M, Dondero F, Dan Otto T, Costa I, Lima NCB, Americo JA, Mazzoni CJ, Prosdocimi F, Rebelo MDF. A hybrid-hierarchical genome assembly strategy to sequence the invasive golden mussel, Limnoperna fortunei. Gigascience 2018; 7:4750781. [PMID: 29267857 PMCID: PMC5836269 DOI: 10.1093/gigascience/gix128] [Citation(s) in RCA: 34] [Impact Index Per Article: 5.7] [Received: 07/13/2017] [Accepted: 12/11/2017] [Indexed: 11/13/2022]
Abstract
Background: For more than 25 years, the golden mussel, Limnoperna fortunei, has aggressively invaded South American freshwaters, having travelled more than 5000 km upstream across 5 countries. Along the way, the golden mussel has outcompeted native species and economically harmed aquaculture, hydroelectric power plants, and ship transit. We have sequenced the complete genome of the golden mussel to understand the molecular basis of its invasiveness and search for ways to control it. Findings: We assembled the 1.6-Gb genome into 20 548 scaffolds with an N50 length of 312 Kb using a hybrid and hierarchical assembly strategy from short and long DNA reads and transcriptomes. A total of 60 717 coding genes were inferred from a customized transcriptome-trained AUGUSTUS run. We also compared predicted protein sets with those of complete molluscan genomes, revealing an exacerbation of protein-binding domains in L. fortunei. Conclusions: We built one of the best bivalve genome assemblies available with a cost-effective approach using Illumina paired-end, mate-paired, and PacBio long reads. We expect that the continuous and careful annotation of L. fortunei's genome will contribute to the investigation of bivalve genetics, evolution, and invasiveness, as well as to the development of biotechnological tools for aquatic pest control.
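The N50 statistic quoted in this abstract is the length L such that scaffolds of length at least L together cover at least half of the total assembly. The sketch below shows that calculation on made-up lengths; the function name `n50` and the example values are assumptions for illustration and are not taken from the golden mussel assembly.

```python
def n50(lengths):
    """Return the N50 of a collection of scaffold or contig lengths."""
    if not lengths:
        raise ValueError("need at least one length")
    ordered = sorted(lengths, reverse=True)   # longest scaffolds first
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        # The first length at which the running sum reaches half of the
        # assembly size is the N50.
        if running >= half_total:
            return length


if __name__ == "__main__":
    # Toy lengths (assumed data): total 54, so half is 27; 10 + 9 + 8 = 27.
    print(n50([2, 3, 4, 5, 6, 7, 8, 9, 10]))  # 8
```

A larger N50 for the same total size means the assembly is carried by fewer, longer scaffolds, which is why the value is commonly reported next to the scaffold count.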
Affiliation(s)
- Marcela Uliano-Silva
- Carlos Chagas Filho Biophysics Institute (IBCCF), Universidade Federal do Rio de Janeiro, Rio de Janeiro, Brazil; Department of Evolutionary Genetics, Leibniz Institute for Zoo and Wildlife Research, Berlin, Germany; Berlin Center for Genomics in Biodiversity Research, Berlin, Germany
| | - Francesco Dondero
- Department of Science and Technological Innovation (DiSIT), Università del Piemonte Orientale Amedeo Avogadro, Vercelli-Novara-Alessandria, Italy
| | - Thomas Dan Otto
- Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton CB10 1SA, UK; Centre of Immunobiology, Institute of Infection, Immunity & Inflammation, College of Medical, Veterinary and Life Sciences, University of Glasgow, Glasgow, UK
| | - Igor Costa
- Leopoldo de Meis Biomedical Biochemistry Institute (IBqM), Universidade Federal do Rio de Janeiro, Rio de Janeiro, Brazil
| | - Nicholas Costa Barroso Lima
- Leopoldo de Meis Biomedical Biochemistry Institute (IBqM), Universidade Federal do Rio de Janeiro, Rio de Janeiro, Brazil; Bioinformatics Laboratory (LabInfo) of the National Laboratory for Scientific Computing, Petrópolis, Rio de Janeiro, Brazil
| | - Juliana Alves Americo
- Carlos Chagas Filho Biophysics Institute (IBCCF), Universidade Federal do Rio de Janeiro, Rio de Janeiro, Brazil
| | - Camila Junqueira Mazzoni
- Department of Evolutionary Genetics, Leibniz Institute for Zoo and Wildlife Research, Berlin, Germany; Berlin Center for Genomics in Biodiversity Research, Berlin, Germany
| | - Francisco Prosdocimi
- Leopoldo de Meis Biomedical Biochemistry Institute (IBqM), Universidade Federal do Rio de Janeiro, Rio de Janeiro, Brazil
| | - Mauro de Freitas Rebelo
- Carlos Chagas Filho Biophysics Institute (IBCCF), Universidade Federal do Rio de Janeiro, Rio de Janeiro, Brazil
| |
|
46
|
Tillu VA, Lim YW, Kovtun O, Mureev S, Ferguson C, Bastiani M, McMahon KA, Lo HP, Hall TE, Alexandrov K, Collins BM, Parton RG. A variable undecad repeat domain in cavin1 regulates caveola formation and stability. EMBO Rep 2018; 19:e45775. [PMID: 30021837 PMCID: PMC6123655 DOI: 10.15252/embr.201845775] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Received: 01/10/2018] [Revised: 06/05/2018] [Accepted: 06/14/2018] [Indexed: 11/09/2022]
Abstract
Caveolae are plasma membrane invaginations involved in transport, signalling and mechanical membrane sensing in metazoans. Their formation depends upon multiple interactions between membrane-embedded caveolins, lipids and cytosolic cavin proteins. Of the four cavin family members, only cavin1 is strictly required for caveola formation. Here, we demonstrate that an eleven-residue (undecad) repeat sequence (UC1) exclusive to cavin1 is essential for caveolar localization and promotes membrane remodelling through binding to phosphatidylserine. In the notochord of mechanically stimulated zebrafish embryos, the UC1 domain is required for caveolar stability and resistance to membrane stress. The number of undecad repeats in the cavin1 UC1 domain varies throughout evolution, and we find that an increased number also correlates with increased caveolar stability. Lastly, we show that the cavin1 UC1 domain induces dramatic remodelling of the plasma membrane when grafted into cavin2, suggesting an important role in membrane sculpting. Overall, our work defines a novel conserved cavin1 modular domain that controls caveolar assembly and stability.
Affiliation(s)
- Vikas A Tillu
- Institute for Molecular Bioscience, The University of Queensland, St. Lucia, Qld, Australia
- Ye-Wheen Lim
- Institute for Molecular Bioscience, The University of Queensland, St. Lucia, Qld, Australia
- Oleksiy Kovtun
- Institute for Molecular Bioscience, The University of Queensland, St. Lucia, Qld, Australia
- Sergey Mureev
- Institute for Molecular Bioscience, The University of Queensland, St. Lucia, Qld, Australia
- Charles Ferguson
- Institute for Molecular Bioscience, The University of Queensland, St. Lucia, Qld, Australia
- Michele Bastiani
- Institute for Molecular Bioscience, The University of Queensland, St. Lucia, Qld, Australia
- Kerrie-Ann McMahon
- Institute for Molecular Bioscience, The University of Queensland, St. Lucia, Qld, Australia
- Harriet P Lo
- Institute for Molecular Bioscience, The University of Queensland, St. Lucia, Qld, Australia
- Thomas E Hall
- Institute for Molecular Bioscience, The University of Queensland, St. Lucia, Qld, Australia
- Kirill Alexandrov
- Institute for Molecular Bioscience, The University of Queensland, St. Lucia, Qld, Australia
- Brett M Collins
- Institute for Molecular Bioscience, The University of Queensland, St. Lucia, Qld, Australia
- Robert G Parton
- Institute for Molecular Bioscience, The University of Queensland, St. Lucia, Qld, Australia
- Centre for Microscopy and Microanalysis, The University of Queensland, St. Lucia, Qld, Australia
47
Raboanatahiry N, Wang B, Yu L, Li M. Functional and Structural Diversity of Acyl-coA Binding Proteins in Oil Crops. Front Genet 2018; 9:182. [PMID: 29872448 PMCID: PMC5972291 DOI: 10.3389/fgene.2018.00182] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Received: 03/11/2018] [Accepted: 05/01/2018] [Indexed: 12/16/2022]
Abstract
This review discusses the structural and functional diversity of acyl-CoA binding proteins (ACBP). ACBP are important proteins that can transport newly synthesized fatty acids, activated as acyl-CoA, from the plastid to the endoplasmic reticulum, where oil is formed as triacylglycerol. ACBP have been detected in various animal and plant species, which indicates their biological importance. Indeed, several studies have demonstrated the involvement of ACBP in important processes such as lipid metabolism, regulation of enzyme and gene expression, and responses to plant stresses. Here, findings on ACBP from 11 well-known oil crops are reviewed to characterize this diversity, ACBP structures are compared, and the links between structure and function, tissue expression and subcellular location are examined. Gaps in the reports for some species are noted, which may encourage new or deeper studies. Paralogous ACBP shared similar characteristics, whereas orthologous ACBP had different functions despite high amino acid sequence identity. In conclusion, orthologous proteins do not necessarily display the same function, even in closely related species.
Affiliation(s)
- Nadia Raboanatahiry
- Department of Biotechnology, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, China
- Hubei Key Laboratory of Economic Forest Germplasm Improvement and Resources Comprehensive Utilization, Hubei Collaborative Innovation Center for the Characteristic Resources Exploitation of Dabie Mountains, Huanggang Normal University, Huanggang, China
- Baoshan Wang
- College of Life Science, Shandong Normal University, Jinan, China
- Longjiang Yu
- Department of Biotechnology, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, China
- Maoteng Li
- Department of Biotechnology, College of Life Science and Technology, Huazhong University of Science and Technology, Wuhan, China
- Hubei Key Laboratory of Economic Forest Germplasm Improvement and Resources Comprehensive Utilization, Hubei Collaborative Innovation Center for the Characteristic Resources Exploitation of Dabie Mountains, Huanggang Normal University, Huanggang, China
48
Dellafiora L, Dall'Asta C, Galaverna G. Toxicodynamics of Mycotoxins in the Framework of Food Risk Assessment-An In Silico Perspective. Toxins (Basel) 2018; 10:E52. [PMID: 29360783 PMCID: PMC5848153 DOI: 10.3390/toxins10020052] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.7] [Received: 11/26/2017] [Revised: 01/16/2018] [Accepted: 01/20/2018] [Indexed: 12/11/2022]
Abstract
Mycotoxins severely threaten the health of humans and animals. For this reason, many countries have enforced regulations and recommendations to reduce dietary exposure. However, even though regulatory actions must be based on solid scientific knowledge, many aspects of the toxicological activity of mycotoxins are still poorly understood. In particular, deepening knowledge of the initial molecular events that trigger the toxic stimulus may help to better understand the mechanisms of action of mycotoxins. This work presents the use of in silico approaches to study mycotoxin toxicodynamics and discusses how they may help widen the knowledge base. Particular emphasis is placed on methods that account for the molecular initiating events of toxic action. In more detail, the key concepts and challenges of mycotoxin toxicology are introduced. Then, topical case studies are presented and some possible practical applications of studying mycotoxin toxicodynamics are discussed.
Affiliation(s)
- Luca Dellafiora
- Department of Food and Drug, University of Parma, 43124 Parma, Italy.
- Chiara Dall'Asta
- Department of Food and Drug, University of Parma, 43124 Parma, Italy.
- Gianni Galaverna
- Department of Food and Drug, University of Parma, 43124 Parma, Italy.
49
Shapiro JA. Living Organisms Author Their Read-Write Genomes in Evolution. Biology 2017; 6:E42. [PMID: 29211049 PMCID: PMC5745447 DOI: 10.3390/biology6040042] [Citation(s) in RCA: 31] [Impact Index Per Article: 4.4] [Received: 08/23/2017] [Revised: 11/17/2017] [Accepted: 11/28/2017] [Indexed: 12/18/2022]
Abstract
Evolutionary variations generating phenotypic adaptations and novel taxa resulted from complex cellular activities altering genome content and expression: (i) Symbiogenetic cell mergers producing the mitochondrion-bearing ancestor of eukaryotes and chloroplast-bearing ancestors of photosynthetic eukaryotes; (ii) interspecific hybridizations and genome doublings generating new species and adaptive radiations of higher plants and animals; and, (iii) interspecific horizontal DNA transfer encoding virtually all of the cellular functions between organisms and their viruses in all domains of life. Consequently, assuming that evolutionary processes occur in isolated genomes of individual species has become an unrealistic abstraction. Adaptive variations also involved natural genetic engineering of mobile DNA elements to rewire regulatory networks. In the most highly evolved organisms, biological complexity scales with "non-coding" DNA content more closely than with protein-coding capacity. Coincidentally, we have learned how so-called "non-coding" RNAs that are rich in repetitive mobile DNA sequences are key regulators of complex phenotypes. Both biotic and abiotic ecological challenges serve as triggers for episodes of elevated genome change. The intersections of cell activities, biosphere interactions, horizontal DNA transfers, and non-random Read-Write genome modifications by natural genetic engineering provide a rich molecular and biological foundation for understanding how ecological disruptions can stimulate productive, often abrupt, evolutionary transformations.
Affiliation(s)
- James A Shapiro
- Department of Biochemistry and Molecular Biology, University of Chicago GCIS W123B, 979 E. 57th Street, Chicago, IL 60637, USA.
50
Islam Z, Nagampalli RSK, Fatima MT, Ashraf GM. New paradigm in ankyrin repeats: Beyond protein-protein interaction module. Int J Biol Macromol 2017; 109:1164-1173. [PMID: 29157912 DOI: 10.1016/j.ijbiomac.2017.11.101] [Citation(s) in RCA: 25] [Impact Index Per Article: 3.6] [Received: 10/08/2017] [Revised: 11/13/2017] [Accepted: 11/16/2017] [Indexed: 01/06/2023]
Abstract
Classically, ankyrin repeat (ANK) proteins are built from tandems of two or more repeats and form curved solenoid structures that are associated with protein-protein interactions. The repeats are a short, widespread structural motif of around 33 amino acids, arranged in tandem, with a canonical helix-loop-helix fold, and are found individually or in combination with other domains. The multiplicity of structural patterns enables them to form assemblies of diverse sizes, as required to confer multiple binding and structural roles on proteins. Three-dimensional structures of these repeats determined to date reveal a degree of structural variability that translates into the considerable functional versatility of this protein superfamily. Recent work on ANK has provided novel structural information, especially on protein-lipid, protein-sugar and protein-protein interactions. Self-assembly of these repeats was also shown to prevent the associated protein from forming filaments. In this review, we summarize the latest findings and how the new structural information has increased our understanding of the structural determinants of ANK proteins. We discuss the latest findings on how these proteins participate in various interactions to diversify the roles of ANK in numerous biological processes, and explore the emerging and evolving field of designer ankyrins and its framework for protein engineering, with emphasis on biotechnological applications.
Affiliation(s)
- Zeyaul Islam
- Laboratório Nacional de Biociências, Centro Nacional de Pesquisa em Energia e Materiais, Campinas, SP, 13083-100, Brazil.
- Munazza Tamkeen Fatima
- Department of Biochemistry and Tissue Biology, Institute of Biology, State University of Campinas (UNICAMP), Campinas, SP, 13083-862, Brazil
- Ghulam Md Ashraf
- King Fahd Medical Research Center, King Abdulaziz University, P.O. Box 80216, Jeddah, 21589, Saudi Arabia.