1
|
Krokengen OC, Raasakka A, Kursula P. The intrinsically disordered protein glue of the myelin major dense line: Linking AlphaFold2 predictions to experimental data. Biochem Biophys Rep 2023; 34:101474. [PMID: 37153862 PMCID: PMC10160357 DOI: 10.1016/j.bbrep.2023.101474] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2022] [Revised: 03/31/2023] [Accepted: 04/19/2023] [Indexed: 05/10/2023] Open
Abstract
Numerous human proteins are classified as intrinsically disordered proteins (IDPs). Due to their physicochemical properties, high-resolution structural information about IDPs is generally lacking. On the other hand, IDPs are known to adopt local ordered structures upon interactions with e.g. other proteins or lipid membrane surfaces. While recent developments in protein structure prediction have been revolutionary, their impact on IDP research at high resolution remains limited. We took a specific example of two myelin-specific IDPs, the myelin basic protein (MBP) and the cytoplasmic domain of myelin protein zero (P0ct). Both of these IDPs are crucial for normal nervous system development and function, and while they are disordered in solution, upon membrane binding, they partially fold into helices, being embedded into the lipid membrane. We carried out AlphaFold2 predictions of both proteins and analysed the models in light of experimental data related to protein structure and molecular interactions. We observe that the predicted models have helical segments that closely correspond to the membrane-binding sites on both proteins. We furthermore analyse the fits of the models to synchrotron-based X-ray scattering and circular dichroism data from the same IDPs. The models are likely to represent the membrane-bound state of both MBP and P0ct, rather than the conformation in solution. Artificial intelligence-based models of IDPs appear to provide information on the ligand-bound state of these proteins, instead of the conformers dominating free in solution. We further discuss the implications of the predictions for mammalian nervous system myelination and their relevance to understanding disease aspects of these IDPs.
Collapse
Affiliation(s)
| | - Arne Raasakka
- Department of Biomedicine, University of Bergen, Norway
| | - Petri Kursula
- Department of Biomedicine, University of Bergen, Norway
- Faculty of Biochemistry and Molecular Medicine & Biocenter Oulu, Oulu, Finland
- Corresponding author. Department of Biomedicine, University of Bergen, Norway.
| |
Collapse
|
2
|
Bruley A, Bitard-Feildel T, Callebaut I, Duprat E. A sequence-based foldability score combined with AlphaFold2 predictions to disentangle the protein order/disorder continuum. Proteins 2023; 91:466-484. [PMID: 36306150 DOI: 10.1002/prot.26441] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2022] [Revised: 10/14/2022] [Accepted: 10/18/2022] [Indexed: 11/11/2022]
Abstract
Order and disorder govern protein functions, but there is a great diversity in disorder, from regions that are-and stay-fully disordered to conditional order. This diversity is still difficult to decipher even though it is encoded in the amino acid sequences. Here, we developed an analytic Python package, named pyHCA, to estimate the foldability of a protein segment from the only information of its amino acid sequence and based on a measure of its density in regular secondary structures associated with hydrophobic clusters, as defined by the hydrophobic cluster analysis (HCA) approach. The tool was designed by optimizing the separation between foldable segments from databases of disorder (DisProt) and order (SCOPe [soluble domains] and OPM [transmembrane domains]). It allows to specify the ratio between order, embodied by regular secondary structures (either participating in the hydrophobic core of well-folded 3D structures or conditionally formed in intrinsically disordered regions) and disorder. We illustrated the relevance of pyHCA with several examples and applied it to the sequences of the proteomes of 21 species ranging from prokaryotes and archaea to unicellular and multicellular eukaryotes, for which structure models are provided in the AlphaFold protein structure database. Cases of low-confidence scores related to disorder were distinguished from those of sequences that we identified as foldable but are still excluded from accurate modeling by AlphaFold2 due to a lack of sequence homologs or to compositional biases. Overall, our approach is complementary to AlphaFold2, providing guides to map structural innovations through evolutionary processes, at proteome and gene scales.
Collapse
Affiliation(s)
- Apolline Bruley
- Sorbonne Université, Muséum National d'Histoire Naturelle, UMR CNRS 7590, Institut de Minéralogie, de Physique des Matériaux et de Cosmochimie, IMPMC, Paris, France
| | - Tristan Bitard-Feildel
- Sorbonne Université, Muséum National d'Histoire Naturelle, UMR CNRS 7590, Institut de Minéralogie, de Physique des Matériaux et de Cosmochimie, IMPMC, Paris, France
| | - Isabelle Callebaut
- Sorbonne Université, Muséum National d'Histoire Naturelle, UMR CNRS 7590, Institut de Minéralogie, de Physique des Matériaux et de Cosmochimie, IMPMC, Paris, France
| | - Elodie Duprat
- Sorbonne Université, Muséum National d'Histoire Naturelle, UMR CNRS 7590, Institut de Minéralogie, de Physique des Matériaux et de Cosmochimie, IMPMC, Paris, France
| |
Collapse
|
3
|
Digging into the 3D Structure Predictions of AlphaFold2 with Low Confidence: Disorder and Beyond. Biomolecules 2022; 12:biom12101467. [PMID: 36291675 PMCID: PMC9599455 DOI: 10.3390/biom12101467] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2022] [Revised: 10/04/2022] [Accepted: 10/05/2022] [Indexed: 01/12/2023] Open
Abstract
AlphaFold2 (AF2) has created a breakthrough in biology by providing three-dimensional structure models for whole-proteome sequences, with unprecedented levels of accuracy. In addition, the AF2 pLDDT score, related to the model confidence, has been shown to provide a good measure of residue-wise disorder. Here, we combined AF2 predictions with pyHCA, a tool we previously developed to identify foldable segments and estimate their order/disorder ratio, from a single protein sequence. We focused our analysis on the AF2 predictions available for 21 reference proteomes (AFDB v1), in particular on their long foldable segments (>30 amino acids) that exhibit characteristics of soluble domains, as estimated by pyHCA. Among these segments, we provided a global analysis of those with very low pLDDT values along their entire length and compared their characteristics to those of segments with very high pLDDT values. We highlighted cases containing conditional order, as well as cases that could form well-folded structures but escape the AF2 prediction due to a shallow multiple sequence alignment and/or undocumented structure or fold. AF2 and pyHCA can therefore be advantageously combined to unravel cryptic structural features in whole proteomes and to refine predictions for different flavors of disorder.
Collapse
|
4
|
Tamburrini KC, Pesce G, Nilsson J, Gondelaud F, Kajava AV, Berrin JG, Longhi S. Predicting Protein Conformational Disorder and Disordered Binding Sites. Methods Mol Biol 2022; 2449:95-147. [PMID: 35507260 DOI: 10.1007/978-1-0716-2095-3_4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
In the last two decades it has become increasingly evident that a large number of proteins adopt either a fully or a partially disordered conformation. Intrinsically disordered proteins are ubiquitous proteins that fulfill essential biological functions while lacking a stable 3D structure. Their conformational heterogeneity is encoded by the amino acid sequence, thereby allowing intrinsically disordered proteins or regions to be recognized based on their sequence properties. The identification of disordered regions facilitates the functional annotation of proteins and is instrumental for delineating boundaries of protein domains amenable to crystallization. This chapter focuses on the methods currently employed for predicting protein disorder and identifying intrinsically disordered binding sites.
Collapse
Affiliation(s)
- Ketty C Tamburrini
- Aix Marseille Univ, CNRS, Architecture et Fonction des Macromolécules Biologiques, AFMB, UMR 7257, Marseille, France
- INRAE, Aix Marseille Univ, Biodiversité et Biotechnologie Fongiques (BBF), UMR 1163, Marseille, France
| | - Giulia Pesce
- Aix Marseille Univ, CNRS, Architecture et Fonction des Macromolécules Biologiques, AFMB, UMR 7257, Marseille, France
| | - Juliet Nilsson
- Aix Marseille Univ, CNRS, Architecture et Fonction des Macromolécules Biologiques, AFMB, UMR 7257, Marseille, France
| | - Frank Gondelaud
- Aix Marseille Univ, CNRS, Architecture et Fonction des Macromolécules Biologiques, AFMB, UMR 7257, Marseille, France
| | - Andrey V Kajava
- Centre de Recherche en Biologie cellulaire de Montpellier, UMR 5237, CNRS, Université Montpellier, Montpellier, France
| | - Jean-Guy Berrin
- INRAE, Aix Marseille Univ, Biodiversité et Biotechnologie Fongiques (BBF), UMR 1163, Marseille, France
| | - Sonia Longhi
- Aix Marseille Univ, CNRS, Architecture et Fonction des Macromolécules Biologiques, AFMB, UMR 7257, Marseille, France.
| |
Collapse
|
5
|
Li J, Hou C, Ma X, Guo S, Zhang H, Shi L, Liao C, Zheng B, Ye L, Yang L, He X. Entropy-Enthalpy Compensations Fold Proteins in Precise Ways. Int J Mol Sci 2021; 22:9653. [PMID: 34502559 PMCID: PMC8431812 DOI: 10.3390/ijms22179653] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2021] [Revised: 08/27/2021] [Accepted: 08/30/2021] [Indexed: 12/16/2022] Open
Abstract
Exploring the protein-folding problem has been a longstanding challenge in molecular biology and biophysics. Intramolecular hydrogen (H)-bonds play an extremely important role in stabilizing protein structures. To form these intramolecular H-bonds, nascent unfolded polypeptide chains need to escape from hydrogen bonding with surrounding polar water molecules under the solution conditions that require entropy-enthalpy compensations, according to the Gibbs free energy equation and the change in enthalpy. Here, by analyzing the spatial layout of the side-chains of amino acid residues in experimentally determined protein structures, we reveal a protein-folding mechanism based on the entropy-enthalpy compensations that initially driven by laterally hydrophobic collapse among the side-chains of adjacent residues in the sequences of unfolded protein chains. This hydrophobic collapse promotes the formation of the H-bonds within the polypeptide backbone structures through the entropy-enthalpy compensation mechanism, enabling secondary structures and tertiary structures to fold reproducibly following explicit physical folding codes and forces. The temperature dependence of protein folding is thus attributed to the environment dependence of the conformational Gibbs free energy equation. The folding codes and forces in the amino acid sequence that dictate the formation of β-strands and α-helices can be deciphered with great accuracy through evaluation of the hydrophobic interactions among neighboring side-chains of an unfolded polypeptide from a β-strand-like thermodynamic metastable state. The folding of protein quaternary structures is found to be guided by the entropy-enthalpy compensations in between the docking sites of protein subunits according to the Gibbs free energy equation that is verified by bioinformatics analyses of a dozen structures of dimers. Protein folding is therefore guided by multistage entropy-enthalpy compensations of the system of polypeptide chains and water molecules under the solution conditions.
Collapse
Affiliation(s)
- Jiacheng Li
- National Key Laboratory of Science and Technology on Advanced Composites in Special Environments, Center for Composite Materials and Structures, Harbin Institute of Technology, Harbin 150080, China; (J.L.); (X.M.); (S.G.); (H.Z.); (L.S.)
| | - Chengyu Hou
- School of Electronics and Information Engineering, Harbin Institute of Technology, Harbin 150080, China; (C.H.); (C.L.)
| | - Xiaoliang Ma
- National Key Laboratory of Science and Technology on Advanced Composites in Special Environments, Center for Composite Materials and Structures, Harbin Institute of Technology, Harbin 150080, China; (J.L.); (X.M.); (S.G.); (H.Z.); (L.S.)
| | - Shuai Guo
- National Key Laboratory of Science and Technology on Advanced Composites in Special Environments, Center for Composite Materials and Structures, Harbin Institute of Technology, Harbin 150080, China; (J.L.); (X.M.); (S.G.); (H.Z.); (L.S.)
| | - Hongchi Zhang
- National Key Laboratory of Science and Technology on Advanced Composites in Special Environments, Center for Composite Materials and Structures, Harbin Institute of Technology, Harbin 150080, China; (J.L.); (X.M.); (S.G.); (H.Z.); (L.S.)
| | - Liping Shi
- National Key Laboratory of Science and Technology on Advanced Composites in Special Environments, Center for Composite Materials and Structures, Harbin Institute of Technology, Harbin 150080, China; (J.L.); (X.M.); (S.G.); (H.Z.); (L.S.)
| | - Chenchen Liao
- School of Electronics and Information Engineering, Harbin Institute of Technology, Harbin 150080, China; (C.H.); (C.L.)
| | - Bing Zheng
- Key Laboratory of Functional Inorganic Material Chemistry (Ministry of Education), School of Chemistry and Materials Science, Heilongjiang University, Harbin 150001, China;
| | - Lin Ye
- School of Aerospace, Mechanical and Mechatronic Engineering, The University of Sydney, Sydney, NSW 2006, Australia;
| | - Lin Yang
- National Key Laboratory of Science and Technology on Advanced Composites in Special Environments, Center for Composite Materials and Structures, Harbin Institute of Technology, Harbin 150080, China; (J.L.); (X.M.); (S.G.); (H.Z.); (L.S.)
- School of Aerospace, Mechanical and Mechatronic Engineering, The University of Sydney, Sydney, NSW 2006, Australia;
| | - Xiaodong He
- National Key Laboratory of Science and Technology on Advanced Composites in Special Environments, Center for Composite Materials and Structures, Harbin Institute of Technology, Harbin 150080, China; (J.L.); (X.M.); (S.G.); (H.Z.); (L.S.)
- Shenzhen STRONG Advanced Materials Research Institute Co., Ltd., Shenzhen 518035, China
| |
Collapse
|
6
|
Characterisation and comparative analysis of hydrophobin isolated from Pleurotus floridanus (PfH). Protein Expr Purif 2021; 182:105834. [PMID: 33516827 DOI: 10.1016/j.pep.2021.105834] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2020] [Revised: 01/19/2021] [Accepted: 01/25/2021] [Indexed: 10/22/2022]
Abstract
Hydrophobins are a class of small cysteine rich surface active proteins produced exclusively by filamentous fungi. It forms a nano layer in the cell-water interface, thereby protecting the emerging fungal hyphae from surrounding water. Even though hydrophobins have similar functions in fungi, they share less sequence similarity. In the current study, we made a comparative study of the hydrophobin produced by the mushroom Pleurotus floridanus (PfH). Mushroom P. floridanus was cultured in PD broth. The hydrophobin was purified by foam fractionation and characterized in terms of molecular weight, solubility and glycosylation. In the RP-HPLC analysis, the hydrophobin eluted at a retention time of 45.56 min. The molecular weight of the PfH was found to be 13.52 kDa by MALDI-TOF MS and the LC-MS/MS showed no similar sequence in MASCOT database. The hydrophobin gene of P. floridanus was amplified using custom-designed primers and the BLAST analysis showed 80% sequence similarity with the Vmh2-1 gene of Pleurotus ostreatus. The sequence was translated into protein using ExPASy, secondary and tertiary structure predictions were carried out using Jpred4 and Phyre2. The tertiary structure showed 91.5% similarity with the HYD1 hydrophobin of Schizophyllum commune. A comparative study of PfH with Vmh2-1 and HYD1 was performed using bioinformatics tools. Hydrophobic cluster analysis revealed that three of these proteins have uniformity in terms of amphiphilic and non-amphiphilic α-helices, whereas PfH has a unique proline clustering. Physicochemical analysis by ProtParam revealed that PfH shares similar properties with HYD1 and Vmh2-1, which can be correlated with its function.
Collapse
|
7
|
Lamiable A, Bitard-Feildel T, Rebehmed J, Quintus F, Schoentgen F, Mornon JP, Callebaut I. A topology-based investigation of protein interaction sites using Hydrophobic Cluster Analysis. Biochimie 2019; 167:68-80. [PMID: 31525399 DOI: 10.1016/j.biochi.2019.09.009] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2019] [Accepted: 09/11/2019] [Indexed: 01/20/2023]
Abstract
Hydrophobic clusters, as defined by Hydrophobic Cluster Analysis (HCA), are conditioned binary patterns, made of hydrophobic and non-hydrophobic positions, whose limits fit well those of regular secondary structures. They were proved to be useful for predicting secondary structures in proteins from the only information of a single amino acid sequence and have permitted to assess, in a comprehensive way, the leading role of binary patterns in secondary structure preference towards a particular state. Here, we considered the available experimental 3D structures of protein globular domains to enlarge our previously reported hydrophobic cluster database (HCDB), almost doubling the number of hydrophobic cluster species (each species being defined by a unique binary pattern) that represent the most frequent structural bricks encountered within protein globular domains. We then used this updated HCDB to show that the hydrophobic amino acids of discordant clusters, i.e. those less abundant clusters for which the observed secondary structure is in disagreement with the binary pattern preference of the species to which they belong, are more exposed to solvent and are more involved in protein interfaces than the hydrophobic amino acids of concordant clusters. As amino acid composition differs between concordant/discordant clusters, considering binary patterns may be used to gain novel insights into key features of protein globular domain cores and surfaces. It can also provide useful information on possible conformational plasticity, including disorder to order transitions.
Collapse
Affiliation(s)
- Alexis Lamiable
- Sorbonne Université, Muséum National d'Histoire Naturelle, UMR CNRS 7590, Institut de Minéralogie, de Physique des Matériaux et de Cosmochimie, IMPMC, 75005, Paris, France
| | - Tristan Bitard-Feildel
- Sorbonne Université, Muséum National d'Histoire Naturelle, UMR CNRS 7590, Institut de Minéralogie, de Physique des Matériaux et de Cosmochimie, IMPMC, 75005, Paris, France
| | - Joseph Rebehmed
- Sorbonne Université, Muséum National d'Histoire Naturelle, UMR CNRS 7590, Institut de Minéralogie, de Physique des Matériaux et de Cosmochimie, IMPMC, 75005, Paris, France; Lebanese American University, Department of Computer Science and Mathematics, Beirut, Lebanon
| | - Flavien Quintus
- Sorbonne Université, Muséum National d'Histoire Naturelle, UMR CNRS 7590, Institut de Minéralogie, de Physique des Matériaux et de Cosmochimie, IMPMC, 75005, Paris, France
| | - Françoise Schoentgen
- Sorbonne Université, Muséum National d'Histoire Naturelle, UMR CNRS 7590, Institut de Minéralogie, de Physique des Matériaux et de Cosmochimie, IMPMC, 75005, Paris, France
| | - Jean-Paul Mornon
- Sorbonne Université, Muséum National d'Histoire Naturelle, UMR CNRS 7590, Institut de Minéralogie, de Physique des Matériaux et de Cosmochimie, IMPMC, 75005, Paris, France
| | - Isabelle Callebaut
- Sorbonne Université, Muséum National d'Histoire Naturelle, UMR CNRS 7590, Institut de Minéralogie, de Physique des Matériaux et de Cosmochimie, IMPMC, 75005, Paris, France.
| |
Collapse
|
8
|
Faure G, Jézéquel K, Roisné-Hamelin F, Bitard-Feildel T, Lamiable A, Marcand S, Callebaut I. Discovery and Evolution of New Domains in Yeast Heterochromatin Factor Sir4 and Its Partner Esc1. Genome Biol Evol 2019; 11:572-585. [PMID: 30668669 PMCID: PMC6394760 DOI: 10.1093/gbe/evz010] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 01/20/2019] [Indexed: 12/22/2022] Open
Abstract
Sir4 is a core component of heterochromatin found in yeasts of the Saccharomycetaceae family, whose general hallmark is to harbor a three-loci mating-type system with two silent loci. However, a large part of the Sir4 amino acid sequences has remained unexplored, belonging to the dark proteome. Here, we analyzed the phylogenetic profile of yet undescribed foldable regions present in Sir4 as well as in Esc1, an Sir4-interacting perinuclear anchoring protein. Within Sir4, we identified a new conserved motif (TOC) adjacent to the N-terminal KU-binding motif. We also found that the Esc1-interacting region of Sir4 is a Dbf4-related H-BRCT domain, only present in species possessing the HO endonuclease and in Kluveryomyces lactis. In addition, we found new motifs within Esc1 including a motif (Esc1-F) that is unique to species where Sir4 possesses an H-BRCT domain. Mutagenesis of conserved amino acids of the Sir4 H-BRCT domain, known to play a critical role in the Dbf4 function, shows that the function of this domain is separable from the essential role of Sir4 in transcriptional silencing and the protection from HO-induced cutting in Saccharomyces cerevisiae. In the more distant methylotrophic clade of yeasts, which often harbor a two-loci mating-type system with one silent locus, we also found a yet undescribed H-BRCT domain in a distinct protein, the ISWI2 chromatin-remodeling factor subunit Itc1. This study provides new insights on yeast heterochromatin evolution and emphasizes the interest of using sensitive methods of sequence analysis for identifying hitherto ignored functional regions within the dark proteome.
Collapse
Affiliation(s)
- Guilhem Faure
- Sorbonne Université, Muséum National d'Histoire Naturelle, UMR CNRS 7590, IRD, Institut de Minéralogie, de Physique des Matériaux et de Cosmochimie, IMPMC, Paris, France.,National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD
| | - Kévin Jézéquel
- Institut de Biologie François Jacob, IRCM/SIGRR/LTR, INSERM U1274, Université Paris-Saclay, CEA Paris-Saclay, Paris, France.,National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD
| | - Florian Roisné-Hamelin
- Institut de Biologie François Jacob, IRCM/SIGRR/LTR, INSERM U1274, Université Paris-Saclay, CEA Paris-Saclay, Paris, France.,National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD
| | - Tristan Bitard-Feildel
- Sorbonne Université, Muséum National d'Histoire Naturelle, UMR CNRS 7590, IRD, Institut de Minéralogie, de Physique des Matériaux et de Cosmochimie, IMPMC, Paris, France
| | - Alexis Lamiable
- Sorbonne Université, Muséum National d'Histoire Naturelle, UMR CNRS 7590, IRD, Institut de Minéralogie, de Physique des Matériaux et de Cosmochimie, IMPMC, Paris, France
| | - Stéphane Marcand
- Institut de Biologie François Jacob, IRCM/SIGRR/LTR, INSERM U1274, Université Paris-Saclay, CEA Paris-Saclay, Paris, France.,Sorbonne Université, UMR CNRS 7238, IBPS, Laboratoire de Biologie Computationnelle et Quantitative (LCQB), Paris, France
| | - Isabelle Callebaut
- Sorbonne Université, Muséum National d'Histoire Naturelle, UMR CNRS 7590, IRD, Institut de Minéralogie, de Physique des Matériaux et de Cosmochimie, IMPMC, Paris, France.,Sorbonne Université, UMR CNRS 7238, IBPS, Laboratoire de Biologie Computationnelle et Quantitative (LCQB), Paris, France
| |
Collapse
|
9
|
Bitard‐Feildel T, Lamiable A, Mornon J, Callebaut I. Order in Disorder as Observed by the "Hydrophobic Cluster Analysis" of Protein Sequences. Proteomics 2018; 18:e1800054. [PMID: 30299594 PMCID: PMC7168002 DOI: 10.1002/pmic.201800054] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2018] [Revised: 08/29/2018] [Indexed: 12/17/2022]
Abstract
Hydrophobic cluster analysis (HCA) is an original approach for protein sequence analysis, which provides access to the foldable repertoire of the protein universe, including yet unannotated protein segments ("dark proteome"). Foldable segments correspond to ordered regions, as well as to intrinsically disordered regions (IDRs) undergoing disorder to order transitions. In this review, how HCA can be used to give insight into this last category of foldable segments is illustrated, with examples matching known 3D structures. After reviewing the HCA principles, examples of short foldable segments are given, which often contain short linear motifs, typically matching hydrophobic clusters. These segments become ordered upon contact with partners, with secondary structure preferences generally corresponding to those observed in the 3D structures within the complexes. Such small foldable segments are sometimes larger than the segments of known 3D structures, including flanking hydrophobic clusters that may be critical for interaction specificity or regulation, as well as intervening sequences allowing fuzziness. Cases of larger conditionally disordered domains are also presented, with lower density in hydrophobic clusters than well-folded globular domains or with exposed hydrophobic patches, which are stabilized by interaction with partners.
Collapse
Affiliation(s)
- Tristan Bitard‐Feildel
- Institut de Minéralogie, de Physique des Matériaux et de Cosmochimie (IMPMC)Institut de recherche pour le développement (IRD)UMR CNRS 7590Muséum National d'Histoire NaturelleSorbonne Université75005ParisFrance
- Laboratoire de Biologie Computationnelle et Quantitative (LCQB)Institute of Biology Paris‐Seine (IBPS)Centre national de la recherche scientifique (CNRS)Sorbonne Université75005ParisFrance
| | - Alexis Lamiable
- Institut de Minéralogie, de Physique des Matériaux et de Cosmochimie (IMPMC)Institut de recherche pour le développement (IRD)UMR CNRS 7590Muséum National d'Histoire NaturelleSorbonne Université75005ParisFrance
| | - Jean‐Paul Mornon
- Institut de Minéralogie, de Physique des Matériaux et de Cosmochimie (IMPMC)Institut de recherche pour le développement (IRD)UMR CNRS 7590Muséum National d'Histoire NaturelleSorbonne Université75005ParisFrance
| | - Isabelle Callebaut
- Institut de Minéralogie, de Physique des Matériaux et de Cosmochimie (IMPMC)Institut de recherche pour le développement (IRD)UMR CNRS 7590Muséum National d'Histoire NaturelleSorbonne Université75005ParisFrance
| |
Collapse
|
10
|
Hoffmann B, Elbahnsi A, Lehn P, Décout JL, Pietrucci F, Mornon JP, Callebaut I. Combining theoretical and experimental data to decipher CFTR 3D structures and functions. Cell Mol Life Sci 2018; 75:3829-3855. [PMID: 29779042 PMCID: PMC11105360 DOI: 10.1007/s00018-018-2835-7] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2017] [Revised: 05/04/2018] [Accepted: 05/07/2018] [Indexed: 12/15/2022]
Abstract
Cryo-electron microscopy (cryo-EM) has recently provided invaluable experimental data about the full-length cystic fibrosis transmembrane conductance regulator (CFTR) 3D structure. However, this experimental information deals with inactive states of the channel, either in an apo, quiescent conformation, in which nucleotide-binding domains (NBDs) are widely separated or in an ATP-bound, yet closed conformation. Here, we show that 3D structure models of the open and closed forms of the channel, now further supported by metadynamics simulations and by comparison with the cryo-EM data, could be used to gain some insights into critical features of the conformational transition toward active CFTR forms. These critical elements lie within membrane-spanning domains but also within NBD1 and the N-terminal extension, in which conformational plasticity is predicted to occur to help the interaction with filamin, one of the CFTR cellular partners.
Collapse
Affiliation(s)
- Brice Hoffmann
- Sorbonne Université, Muséum National d'Histoire Naturelle, UMR CNRS 7590, IRD, Institut de Minéralogie, de Physique des Matériaux et de Cosmochimie, IMPMC, 75005, Paris, France
- Iktos, Paris, France
| | - Ahmad Elbahnsi
- Sorbonne Université, Muséum National d'Histoire Naturelle, UMR CNRS 7590, IRD, Institut de Minéralogie, de Physique des Matériaux et de Cosmochimie, IMPMC, 75005, Paris, France
| | - Pierre Lehn
- INSERM U1078, SFR ScInBioS, Université de Bretagne Occidentale, Brest, France
| | | | - Fabio Pietrucci
- Sorbonne Université, Muséum National d'Histoire Naturelle, UMR CNRS 7590, IRD, Institut de Minéralogie, de Physique des Matériaux et de Cosmochimie, IMPMC, 75005, Paris, France
| | - Jean-Paul Mornon
- Sorbonne Université, Muséum National d'Histoire Naturelle, UMR CNRS 7590, IRD, Institut de Minéralogie, de Physique des Matériaux et de Cosmochimie, IMPMC, 75005, Paris, France.
| | - Isabelle Callebaut
- Sorbonne Université, Muséum National d'Histoire Naturelle, UMR CNRS 7590, IRD, Institut de Minéralogie, de Physique des Matériaux et de Cosmochimie, IMPMC, 75005, Paris, France
| |
Collapse
|
11
|
Klasberg S, Bitard-Feildel T, Callebaut I, Bornberg-Bauer E. Origins and structural properties of novel and de novo protein domains during insect evolution. FEBS J 2018; 285:2605-2625. [PMID: 29802682 DOI: 10.1111/febs.14504] [Citation(s) in RCA: 25] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2017] [Revised: 04/12/2018] [Accepted: 05/11/2018] [Indexed: 12/11/2022]
Abstract
Over long time scales, protein evolution is characterized by modular rearrangements of protein domains. Such rearrangements are mainly caused by gene duplication, fusion and terminal losses. To better understand domain emergence mechanisms we investigated 32 insect genomes covering a speciation gradient ranging from ~ 2 to ~ 390 mya. We use established domain models and foldable domains delineated by hydrophobic cluster analysis (HCA), which does not require homologous sequences, to also identify domains which have likely arisen de novo, that is, from previously noncoding DNA. Our results indicate that most novel domains emerge terminally as they originate from ORF extensions while fewer arise in middle arrangements, resulting from exonization of intronic or intergenic regions. Many novel domains rapidly migrate between terminal or middle positions and single- and multidomain arrangements. Young domains, such as most HCA-defined domains, are under strong selection pressure as they show signals of purifying selection. De novo domains, linked to ancient domains or defined by HCA, have higher degrees of intrinsic disorder and disorder-to-order transition upon binding than ancient domains. However, the corresponding DNA sequences of the novel domains of de novo origins could only rarely be found in sister genomes. We conclude that novel domains are often recruited by other proteins and undergo important structural modifications shortly after their emergence, but evolve too fast to be characterized by cross-species comparisons alone.
Collapse
Affiliation(s)
- Steffen Klasberg
- Institute for Evolution and Biodiversity, Westfalian Wilhelms University Muenster, Germany
| | - Tristan Bitard-Feildel
- Sorbonne Université, CNRS, IBPS, Laboratoire de Biologie Computationnelle et Quantitative (LCQB), Paris, France
| | - Isabelle Callebaut
- Sorbonne Université, Muséum National d'Histoire Naturelle, UMR CNRS 7590, IRD, Institut de Minéralogie, de Physique des Matériaux et de Cosmochimie, IMPMC, Paris, France
| | - Erich Bornberg-Bauer
- Institute for Evolution and Biodiversity, Westfalian Wilhelms University Muenster, Germany
| |
Collapse
|
12
|
Carlow CE, Faultless JT, Lee C, Siddiqua M, Edge A, Nassuth A. Nuclear localization and transactivation by Vitis CBF transcription factors are regulated by combinations of conserved amino acid domains. PLANT PHYSIOLOGY AND BIOCHEMISTRY : PPB 2017; 118:306-319. [PMID: 28675818 DOI: 10.1016/j.plaphy.2017.06.027] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/01/2017] [Revised: 06/20/2017] [Accepted: 06/22/2017] [Indexed: 05/07/2023]
Abstract
The highly conserved CBF pathway is crucial in the regulation of plant responses to low temperatures. Extensive analysis of Arabidopsis CBF proteins revealed that their functions rely on several conserved amino acid domains although the exact function of each domain is disputed. The question was what functions similar domains have in CBFs from other, overwintering woody plants such as Vitis, which likely have a more involved regulation than the model plant Arabidopsis. A total of seven CBF genes were cloned and sequenced from V. riparia and the less frost tolerant V. vinifera. The deduced species-specific amino acid sequences differ in only a few amino acids, mostly in non-conserved regions. Amino acid sequence comparison and phylogenetic analysis showed two distinct groups of Vitis CBFs. One group contains CBF1, CBF2, CBF3 and CBF8 and the other group contains CBF4, CBF5 and CBF6. Transient transactivation assays showed that all Vitis CBFs except CBF5 activate via a CRT or DRE promoter element, whereby Vitis CBF3 and 4 prefer a CRT element. The hydrophobic domains in the C-terminal end of VrCBF6 were shown to be important for how well it activates. The putative nuclear localization domain of Vitis CBF1 was shown to be sufficient for nuclear localization, in contrast to previous reports for AtCBF1, and also important for transactivation. The latter highlights the value of careful analysis of domain functions instead of reliance on computer predictions and published data for other related proteins.
Collapse
Affiliation(s)
- Chevonne E Carlow
- Department of Molecular and Cellular Biology, University of Guelph, 50 Stone Road East, Guelph, Ontario N1G 2W1, Canada
| | - J Trent Faultless
- Department of Molecular and Cellular Biology, University of Guelph, 50 Stone Road East, Guelph, Ontario N1G 2W1, Canada
| | - Christine Lee
- Department of Molecular and Cellular Biology, University of Guelph, 50 Stone Road East, Guelph, Ontario N1G 2W1, Canada
| | - Mahbuba Siddiqua
- Department of Molecular and Cellular Biology, University of Guelph, 50 Stone Road East, Guelph, Ontario N1G 2W1, Canada
| | - Alison Edge
- Department of Molecular and Cellular Biology, University of Guelph, 50 Stone Road East, Guelph, Ontario N1G 2W1, Canada
| | - Annette Nassuth
- Department of Molecular and Cellular Biology, University of Guelph, 50 Stone Road East, Guelph, Ontario N1G 2W1, Canada.
| |
Collapse
|
13
|
Exploring the dark foldable proteome by considering hydrophobic amino acids topology. Sci Rep 2017; 7:41425. [PMID: 28134276 PMCID: PMC5278394 DOI: 10.1038/srep41425] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2016] [Accepted: 12/19/2016] [Indexed: 12/18/2022] Open
Abstract
The protein universe corresponds to the set of all proteins found in all organisms. A way to explore it is by taking into account the domain content of the proteins. However, some part of sequences and many entire sequences remain un-annotated despite a converging number of domain families. The un-annotated part of the protein universe is referred to as the dark proteome and remains poorly characterized. In this study, we quantify the amount of foldable domains within the dark proteome by using the hydrophobic cluster analysis methodology. These un-annotated foldable domains were grouped using a combination of remote homology searches and domain annotations, leading to define different levels of darkness. The dark foldable domains were analyzed to understand what make them different from domains stored in databases and thus difficult to annotate. The un-annotated domains of the dark proteome universe display specific features relative to database domains: shorter length, non-canonical content and particular topology in hydrophobic residues, higher propensity for disorder, and a higher energy. These features make them hard to relate to known families. Based on these observations, we emphasize that domain annotation methodologies can still be improved to fully apprehend and decipher the molecular evolution of the protein universe.
Collapse
|
14
|
Lieutaud P, Ferron F, Uversky AV, Kurgan L, Uversky VN, Longhi S. How disordered is my protein and what is its disorder for? A guide through the "dark side" of the protein universe. INTRINSICALLY DISORDERED PROTEINS 2016; 4:e1259708. [PMID: 28232901 DOI: 10.1080/21690707.2016.1259708] [Citation(s) in RCA: 80] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/22/2016] [Revised: 11/03/2016] [Accepted: 11/04/2016] [Indexed: 12/18/2022]
Abstract
In the last 2 decades it has become increasingly evident that a large number of proteins are either fully or partially disordered. Intrinsically disordered proteins lack a stable 3D structure, are ubiquitous and fulfill essential biological functions. Their conformational heterogeneity is encoded in their amino acid sequences, thereby allowing intrinsically disordered proteins or regions to be recognized based on properties of these sequences. The identification of disordered regions facilitates the functional annotation of proteins and is instrumental for delineating boundaries of protein domains amenable to structural determination with X-ray crystallization. This article discusses a comprehensive selection of databases and methods currently employed to disseminate experimental and putative annotations of disorder, predict disorder and identify regions involved in induced folding. It also provides a set of detailed instructions that should be followed to perform computational analysis of disorder.
Collapse
Affiliation(s)
- Philippe Lieutaud
- Aix-Marseille Université, AFMB UMR, Marseille, France; CNRS, AFMB UMR, Marseille, France
| | - François Ferron
- Aix-Marseille Université, AFMB UMR, Marseille, France; CNRS, AFMB UMR, Marseille, France
| | - Alexey V Uversky
- Center for Data Analytics and Biomedical Informatics, Department of Computer and Information Sciences, College of Science and Technology, Temple University , Philadelphia, PA, USA
| | - Lukasz Kurgan
- Department of Computer Science, Virginia Commonwealth University , Richmond, VA, USA
| | - Vladimir N Uversky
- Department of Molecular Medicine and USF Health Byrd Alzheimer Research Institute, Morsani College of Medicine, University of South Florida, Tampa, FL, USA; Laboratory of Structural Dynamics, Stability and Folding of Proteins, Institute of Cytology, Russian Academy of Sciences, St. Petersburg, Russia
| | - Sonia Longhi
- Aix-Marseille Université, AFMB UMR, Marseille, France; CNRS, AFMB UMR, Marseille, France
| |
Collapse
|
15
|
Abstract
Proteins are the workhorses of the cell and, over billions of years, they have evolved an amazing plethora of extremely diverse and versatile structures with equally diverse functions. Evolutionary emergence of new proteins and transitions between existing ones are believed to be rare or even impossible. However, recent advances in comparative genomics have repeatedly called some 10%-30% of all genes without any detectable similarity to existing proteins. Even after careful scrutiny, some of those orphan genes contain protein coding reading frames with detectable transcription and translation. Thus some proteins seem to have emerged from previously non-coding 'dark genomic matter'. These 'de novo' proteins tend to be disordered, fast evolving, weakly expressed but also rapidly assuming novel and physiologically important functions. Here we review mechanisms by which 'de novo' proteins might be created, under which circumstances they may become fixed and why they are elusive. We propose a 'grow slow and moult' model in which first a reading frame is extended, coding for an initially disordered and non-globular appendage which, over time, becomes more structured and may also become associated with other proteins.
Collapse
|
16
|
Forsova OS, Zakharov VV. High-order oligomers of intrinsically disordered brain proteins BASP1 and GAP-43 preserve the structural disorder. FEBS J 2016; 283:1550-69. [PMID: 26918762 DOI: 10.1111/febs.13692] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2015] [Revised: 01/26/2016] [Accepted: 02/23/2016] [Indexed: 11/30/2022]
Abstract
Brain acid-soluble protein-1 (BASP1) and growth-associated protein-43 (GAP-43) are presynaptic membrane proteins participating in axon guidance, neuroregeneration and synaptic plasticity. They are presumed to sequester phosphatidylinositol-4,5-bisphosphate (PIP2 ) in lipid rafts. Previously we have shown that the proteins form heterogeneously sized oligomers in the presence of anionic phospholipids or SDS at submicellar concentration. BASP1 and GAP-43 are intrinsically disordered proteins (IDPs). In light of this, we investigated the structure of their oligomers. Using partial cross-linking of the oligomers with glutaraldehyde, the aggregation numbers of BASP1 and GAP-43 were estimated as 10-14 and 6-7 monomer subunits, respectively. The cross-linking pattern indicated that the subunits are circularly arranged. The circular dichroism (CD) spectra of the monomers were characteristic of coil-like IDPs showing unordered structure with a high population of polyproline-II conformation. The oligomerization was accompanied by a minor CD spectral change attributable to formation of a small amount of α-helix. The number of residues in the α-helical conformation was estimated as 13 in BASP1 and 18 in GAP-43. However, the overall structure of the oligomers remained disordered, indicating a high degree of 'fuzziness'. This was confirmed by measuring the hydrodynamic dimensions of the oligomers using polyacrylamide gradient gel electrophoresis and size-exclusion chromatography, and by assaying their sensitivity to proteolytic digestion. There is evidence that the observed α-helical folding occurs within the basic effector domains, which are presumably tethered together via anionic molecules of SDS or PIP2 . We conclude that BASP1 and GAP-43 oligomers preserve a mostly disordered structure, which may be of great importance for their function in PIP2 signaling pathway.
Collapse
Affiliation(s)
- Oksana S Forsova
- Molecular and Radiation Biophysics Division, B. P. Konstantinov Petersburg Nuclear Physics Institute, National Research Centre 'Kurchatov Institute', Gatchina, Russia.,Laboratory of Natural Polymers, Institute of Macromolecular Compounds, Russian Academy of Sciences, St Petersburg, Russia
| | - Vladislav V Zakharov
- Molecular and Radiation Biophysics Division, B. P. Konstantinov Petersburg Nuclear Physics Institute, National Research Centre 'Kurchatov Institute', Gatchina, Russia.,Laboratory of Natural Polymers, Institute of Macromolecular Compounds, Russian Academy of Sciences, St Petersburg, Russia.,Department of Biophysics, Institute of Physics, Nanotechnology and Telecommunications, Peter the Great St Petersburg Polytechnic University, Russia
| |
Collapse
|
17
|
Rebehmed J, Quintus F, Mornon JP, Callebaut I. The respective roles of polar/nonpolar binary patterns and amino acid composition in protein regular secondary structures explored exhaustively using hydrophobic cluster analysis. Proteins 2016; 84:624-38. [DOI: 10.1002/prot.25012] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2015] [Revised: 02/01/2016] [Accepted: 02/03/2016] [Indexed: 11/06/2022]
Affiliation(s)
- Joseph Rebehmed
- CNRS UMR7590; Sorbonne Universités, Université Pierre Et Marie Curie-Paris6 - MNHN - IRD - IUC; Paris France
| | - Flavien Quintus
- CNRS UMR7590; Sorbonne Universités, Université Pierre Et Marie Curie-Paris6 - MNHN - IRD - IUC; Paris France
| | - Jean-Paul Mornon
- CNRS UMR7590; Sorbonne Universités, Université Pierre Et Marie Curie-Paris6 - MNHN - IRD - IUC; Paris France
| | - Isabelle Callebaut
- CNRS UMR7590; Sorbonne Universités, Université Pierre Et Marie Curie-Paris6 - MNHN - IRD - IUC; Paris France
| |
Collapse
|
18
|
Abstract
In the last two decades, it has become increasingly evident that a large number of proteins are either fully or partially disordered. Intrinsically disordered proteins are ubiquitous proteins that fulfill essential biological functions while lacking a stable 3D structure. Their conformational heterogeneity is encoded at the amino acid sequence level, thereby allowing intrinsically disordered proteins or regions to be recognized based on their sequence properties. The identification of disordered regions facilitates the functional annotation of proteins and is instrumental for delineating boundaries of protein domains amenable to crystallization. This chapter focuses on the methods currently employed for predicting disorder and identifying regions involved in induced folding.
Collapse
Affiliation(s)
- Philippe Lieutaud
- AFMB UMR 7257, Aix-Marseille Université, 163, avenue de Luminy, Case 932, 13288, Marseille Cedex 09, France
- AFMB UMR 7257, CNRS, 163, avenue de Luminy, Case 932, 13288, Marseille Cedex 09, France
| | - François Ferron
- AFMB UMR 7257, Aix-Marseille Université, 163, avenue de Luminy, Case 932, 13288, Marseille Cedex 09, France
- AFMB UMR 7257, CNRS, 163, avenue de Luminy, Case 932, 13288, Marseille Cedex 09, France
| | - Sonia Longhi
- AFMB UMR 7257, Aix-Marseille Université, 163, avenue de Luminy, Case 932, 13288, Marseille Cedex 09, France.
- AFMB UMR 7257, CNRS, 163, avenue de Luminy, Case 932, 13288, Marseille Cedex 09, France.
| |
Collapse
|
19
|
Salinas Castellanos LC, Chomilier J, Hernández-Torres J. Recombination of chl-fus gene (Plastid Origin) downstream of hop: a locus of chromosomal instability. BMC Genomics 2015; 16:573. [PMID: 26238241 PMCID: PMC4522979 DOI: 10.1186/s12864-015-1780-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2014] [Accepted: 07/14/2015] [Indexed: 11/26/2022] Open
Abstract
Background The co-chaperone Hop [heat shock protein (HSP) organizing protein] has been shown to act as an adaptor for protein folding and maturation, in concert with Hsp70 and Hsp90. The hop gene is of eukaryotic origin. Likewise, the chloroplast elongation factor G (cEF-G) catalyzes the translocation step in chloroplast protein synthesis. The chl-fus gene, which encodes the cEF-G protein, is of plastid origin. Both proteins, Hop and cEF-G, derived from domain duplications. It was demonstrated that the nuclear chl-fus gene locates in opposite orientation to a hop gene in Glycine max. We explored 53 available plant genomes from Chlorophyta to higher plants, to determine whether the chl-fus gene was transferred directly downstream of the primordial hop in the proto-eukaryote host cell. Since both genes came from exon/module duplication events, we wanted to explore the involvement of introns in the early origin and the ensuing evolutionary changes in gene structure. Results We reconstructed the evolutionary history of the two convergent plant genes, on the basis of their gene structure, microsynteny and microcolinearity, from 53 plant nuclear genomes. Despite a high degree (72 %) of microcolinearity among vascular plants, our results demonstrate that their adjacency was a product of chromosomal rearrangements. Based on predicted exon − intron structures, we inferred the molecular events giving rise to the current form of genes. Therefore, we propose a simple model of exon/module shuffling by intronic recombinations in which phase-0 introns were essential for domain duplication, and a phase-1 intron for transit peptide recruiting. Finally, we demonstrate a natural susceptibility of the intergenic region to recombine or delete, seriously threatening the integrity of the chl-fus gene for the future. Conclusions Our results are consistent with the interpretation that the chl-fus gene was transferred from the chloroplast to a chromosome different from that of hop, in the primitive photosynthetic eukaryote, and much later before the appearance of angiosperms, it was recombined downstream of hop. Exon/module shuffling mediated by symmetric intron phases (i.e., phase-0 introns) was essential for gene evolution. The intergenic region is prone to recombine, risking the integrity of both genes. Electronic supplementary material The online version of this article (doi:10.1186/s12864-015-1780-1) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
| | - Jacques Chomilier
- IMPMC, UPMC, CNRS UMR 7590, MNHN, IRD, Paris, France and RPBS, Paris, France.
| | - Jorge Hernández-Torres
- Laboratorio de Biología Molecular, Escuela de Biología, Universidad Industrial de Santander, Apartado Aéreo 678, Bucaramanga, Colombia.
| |
Collapse
|
20
|
Mutation of a Single Envelope N-Linked Glycosylation Site Enhances the Pathogenicity of Bovine Leukemia Virus. J Virol 2015; 89:8945-56. [PMID: 26085161 DOI: 10.1128/jvi.00261-15] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2015] [Accepted: 06/09/2015] [Indexed: 12/29/2022] Open
Abstract
UNLABELLED Viruses have coevolved with their host to ensure efficient replication and transmission without inducing excessive pathogenicity that would indirectly impair their persistence. This is exemplified by the bovine leukemia virus (BLV) system in which lymphoproliferative disorders develop in ruminants after latency periods of several years. In principle, the equilibrium reached between the virus and its host could be disrupted by emergence of more pathogenic strains. Intriguingly but fortunately, such a hyperpathogenic BLV strain was never observed in the field or designed in vitro. In this study, we sought to understand the role of envelope N-linked glycosylation with the hypothesis that this posttranslational modification could either favor BLV infection by allowing viral entry or allow immune escape by using glycans as a shield. Using reverse genetics of an infectious molecular provirus, we identified a N-linked envelope glycosylation site (N230) that limits viral replication and pathogenicity. Indeed, mutation N230E unexpectedly leads to enhanced fusogenicity and protein stability. IMPORTANCE Infection by retroviruses requires the interaction of the viral envelope protein (SU) with a membrane-associated receptor allowing fusion and release of the viral genomic RNA into the cell. We show that N-linked glycosylation of the bovine leukemia virus (BLV) SU protein is, as expected, essential for cell infection in vitro. Consistently, mutation of all glycosylation sites of a BLV provirus destroys infectivity in vivo. However, single mutations do not significantly modify replication in vivo. Instead, a particular mutation at SU codon 230 increases replication and accelerates pathogenesis. This unexpected observation has important consequences in terms of disease control and managing.
Collapse
|
21
|
Characteristics and Applicability of Phytase of the Yeast Pichia anomala in Synthesizing Haloperoxidase. Appl Biochem Biotechnol 2015; 176:1351-69. [PMID: 25957272 DOI: 10.1007/s12010-015-1650-y] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2015] [Accepted: 04/21/2015] [Indexed: 10/23/2022]
Abstract
The phytase of the yeast Pichia anomala is a histidine acid phosphatase based on signature sequences and catalytic amino acids identified by site-directed mutagenesis. Among modulators, N-bromosuccinimide and butanedione inhibit phytase, while Ca(2+) and Ni(2+) stimulate slightly. Vanadate exhibits competitive inhibition of phytase, making it bifunctional to act as haloperoxidase. Molecular docking supports vanadate to share its binding site with phytate. The T 1/2, activation energy (E a ), temperature quotient (Q 10), activation energy of thermal inactivation (Ed), and enthalpy (ΔH d (0) ) of the enzyme are 4.0 min (80 °C), 27.72 kJ mol(-1), 2.1, 410.62 kJ mol(-1), and ∼407.8 kJ mol(-1) (65-80 °C), respectively. The free energy of the process (ΔG d (o) ) increases from 49.56 to 71.58 kJ mol(-1) with rise in temperature, while entropy of inactivation (ΔS d (0) ) remains constant at ∼1.36 kJ mol(-1) K(-1). The supplementation of whole wheat dough with rPPHY resulted in 72.5 % reduction in phytic acid content of bread. These characteristics confirm that the phytase has adequate thermostability for its applicability as a food and feed additive.
Collapse
|
22
|
Mornon JP, Hoffmann B, Jonic S, Lehn P, Callebaut I. Full-open and closed CFTR channels, with lateral tunnels from the cytoplasm and an alternative position of the F508 region, as revealed by molecular dynamics. Cell Mol Life Sci 2015; 72:1377-403. [PMID: 25287046 PMCID: PMC11113974 DOI: 10.1007/s00018-014-1749-2] [Citation(s) in RCA: 62] [Impact Index Per Article: 6.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2014] [Revised: 09/28/2014] [Accepted: 09/29/2014] [Indexed: 12/17/2022]
Abstract
In absence of experimental 3D structures, several homology models, based on ABC exporter 3D structures, have provided significant insights into the molecular mechanisms underlying the function of the cystic fibrosis transmembrane conductance regulator (CFTR) protein, a chloride channel whose defects are associated with cystic fibrosis (CF). Until now, these models, however, did not furnished much insights into the continuous way that ions could follow from the cytosol to the extracellular milieu in the open form of the channel. Here, we have built a refined model of CFTR, based on the outward-facing Sav1866 experimental 3D structure and integrating the evolutionary and structural information available today. Molecular dynamics simulations revealed significant conformational changes, resulting in a full-open channel, accessible from the cytosol through lateral tunnels displayed in the long intracellular loops (ICLs). At the same time, the region of nucleotide-binding domain 1 in contact with one of the ICLs and carrying amino acid F508, the deletion of which is the most common CF-causing mutation, was found to adopt an alternative but stable position. Then, in a second step, this first stable full-open conformation evolved toward another stable state, in which only a limited displacement of the upper part of the transmembrane helices leads to a closure of the channel, in a conformation very close to that adopted by the Atm1 ABC exporter, in an inward-facing conformation. These models, supported by experimental data, provide significant new insights into the CFTR structure-function relationships and into the possible impact of CF-causing mutations.
Collapse
Affiliation(s)
- Jean-Paul Mornon
- IMPMC, Sorbonne Universités, UPMC Univ Paris 06, UMR CNRS 7590, Museum National d’Histoire Naturelle, IRD UMR 206, IUC, Case 115, 4 Place Jussieu, 75005 Paris Cedex 05, France
| | - Brice Hoffmann
- IMPMC, Sorbonne Universités, UPMC Univ Paris 06, UMR CNRS 7590, Museum National d’Histoire Naturelle, IRD UMR 206, IUC, Case 115, 4 Place Jussieu, 75005 Paris Cedex 05, France
| | - Slavica Jonic
- IMPMC, Sorbonne Universités, UPMC Univ Paris 06, UMR CNRS 7590, Museum National d’Histoire Naturelle, IRD UMR 206, IUC, Case 115, 4 Place Jussieu, 75005 Paris Cedex 05, France
| | - Pierre Lehn
- INSERM U1078, SFR ScInBioS, Université de Bretagne Occidentale, Brest, France
| | - Isabelle Callebaut
- IMPMC, Sorbonne Universités, UPMC Univ Paris 06, UMR CNRS 7590, Museum National d’Histoire Naturelle, IRD UMR 206, IUC, Case 115, 4 Place Jussieu, 75005 Paris Cedex 05, France
| |
Collapse
|
23
|
Bitard-Feildel T, Heberlein M, Bornberg-Bauer E, Callebaut I. Detection of orphan domains in Drosophila using "hydrophobic cluster analysis". Biochimie 2015; 119:244-53. [PMID: 25736992 DOI: 10.1016/j.biochi.2015.02.019] [Citation(s) in RCA: 29] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2014] [Accepted: 02/20/2015] [Indexed: 11/30/2022]
Abstract
INTRODUCTION Comparative genomics has become an important strategy in life science research. While many genes, and the proteins they code for, can be well characterized by assigning orthologs, a significant amount of proteins or domains remain obscure "orphans". Some orphans are overlooked by current computational methods because they rapidly diverged, others emerged relatively recently (de novo). Recent research has demonstrated the importance of orphans, and of de novo proteins and domains for development of new phenotypic traits and adaptation. New approaches for detecting novel domains are thus of paramount importance. RESULTS The hydrophobic cluster analysis (HCA) method delineates globular-like domains from the information of a protein sequence and thereby allows bypassing some of the established methods limitations based on conserved sequence similarity. In this study, HCA is tested for orphan domain detection on 12 Drosophila genomes. After their detection, the oprhan domains are classified into two categories, depending on their presence/absence in distantly related species. The two categories show significantly different physico-chemical properties when compared to previously characterized domains from the Pfam database. The newly detected domains have a higher degree of intrinsic disorder and a particular hydrophobic cluster composition. The older the domains are, the more similar their hydrophobic cluster content is to the cluster content of Pfam domains. The results suggest that, over time, newly created domains acquire a canonical set of hydrophobic clusters but conserve some features of intrinsically disordered regions. CONCLUSION Our results agree with previous findings on orphan domains and suggest that the physico-chemical properties of domains change over evolutionary long time scale. The presented HCA-based method is able to detect domains with unusual properties without relying on prior knowledge, such as the availability of homologs. Therefore, the method has large potential for complementing existing strategies to annotate genomes, and for better understanding how molecular features emerge.
Collapse
Affiliation(s)
- Tristan Bitard-Feildel
- Institute for Evolution and Biodiversity, Westfalian Wilhelms University Muenster, Huefferstrasse 1, D-48149, Germany
| | - Magdalena Heberlein
- Institute for Evolution and Biodiversity, Westfalian Wilhelms University Muenster, Huefferstrasse 1, D-48149, Germany
| | - Erich Bornberg-Bauer
- Institute for Evolution and Biodiversity, Westfalian Wilhelms University Muenster, Huefferstrasse 1, D-48149, Germany.
| | - Isabelle Callebaut
- IMPMC, Sorbonne Universités - UMR CNRS 7590, UPMC Univ Paris 06, Muséum National d'Histoire Naturelle, IRD UMR 206, IUC 4 Place Jussieu, F-75005 Paris, France.
| |
Collapse
|
24
|
A Novel Protein, Ubiquitous in Marine Phytoplankton, Concentrates Iron at the Cell Surface and Facilitates Uptake. Curr Biol 2015; 25:364-371. [DOI: 10.1016/j.cub.2014.12.004] [Citation(s) in RCA: 71] [Impact Index Per Article: 7.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2014] [Revised: 10/02/2014] [Accepted: 12/02/2014] [Indexed: 12/11/2022]
|
25
|
Arabidopsis thaliana Tic110, involved in chloroplast protein translocation, contains at least fourteen highly divergent heat-like repeated motifs. Biologia (Bratisl) 2013. [DOI: 10.2478/s11756-013-0310-3] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
|
26
|
Large-scale production and structural and biophysical characterizations of the human hepatitis B virus polymerase. J Virol 2013; 88:2584-99. [PMID: 24352439 DOI: 10.1128/jvi.02575-13] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023] Open
Abstract
UNLABELLED Hepatitis B virus (HBV) is a major human pathogen that causes serious liver disease and 600,000 deaths annually. Approved therapies for treating chronic HBV infections usually target the multifunctional viral polymerase (hPOL). Unfortunately, these therapies--broad-spectrum antivirals--are not general cures, have side effects, and cause viral resistance. While hPOL remains an attractive therapeutic target, it is notoriously difficult to express and purify in a soluble form at yields appropriate for structural studies. Thus, no empirical structural data exist for hPOL, and this impedes medicinal chemistry and rational lead discovery efforts targeting HBV. Here, we present an efficient strategy to overexpress recombinant hPOL domains in Escherichia coli, purifying them at high yield and solving their known aggregation tendencies. This allowed us to perform the first structural and biophysical characterizations of hPOL domains. Apo-hPOL domains adopt mainly α-helical structures with small amounts of β-sheet structures. Our recombinant material exhibited metal-dependent, reverse transcriptase activity in vitro, with metal binding modulating the hPOL structure. Calcomine orange 2RS, a small molecule that inhibits duck HBV POL activity, also inhibited the in vitro priming activity of recombinant hPOL. Our work paves the way for structural and biophysical characterizations of hPOL and should facilitate high-throughput lead discovery for HBV. IMPORTANCE The viral polymerase from human hepatitis B virus (hPOL) is a well-validated therapeutic target. However, recombinant hPOL has a well-deserved reputation for being extremely difficult to express in a soluble, active form in yields appropriate to the structural studies that usually play an important role in drug discovery programs. This has hindered the development of much-needed new antivirals for HBV. However, we have solved this problem and report here procedures for expressing recombinant hPOL domains in Escherichia coli and also methods for purifying them in soluble forms that have activity in vitro. We also present the first structural and biophysical characterizations of hPOL. Our work paves the way for new insights into hPOL structure and function, which should assist the discovery of novel antivirals for HBV.
Collapse
|
27
|
Faure G, Callebaut I. Comprehensive repertoire of foldable regions within whole genomes. PLoS Comput Biol 2013; 9:e1003280. [PMID: 24204229 PMCID: PMC3812050 DOI: 10.1371/journal.pcbi.1003280] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2013] [Accepted: 08/15/2013] [Indexed: 11/30/2022] Open
Abstract
In order to get a comprehensive repertoire of foldable domains within whole proteomes, including orphan domains, we developed a novel procedure, called SEG-HCA. From only the information of a single amino acid sequence, SEG-HCA automatically delineates segments possessing high densities in hydrophobic clusters, as defined by Hydrophobic Cluster Analysis (HCA). These hydrophobic clusters mainly correspond to regular secondary structures, which together form structured or foldable regions. Genome-wide analyses revealed that SEG-HCA is opposite of disorder predictors, both addressing distinct structural states. Interestingly, there is however an overlap between the two predictions, including small segments of disordered sequences, which undergo coupled folding and binding. SEG-HCA thus gives access to these specific domains, which are generally poorly represented in domain databases. Comparison of the whole set of SEG-HCA predictions with the Conserved Domain Database (CDD) also highlighted a wide proportion of predicted large (length >50 amino acids) segments, which are CDD orphan. These orphan sequences may either correspond to highly divergent members of already known families or belong to new families of domains. Their comprehensive description thus opens new avenues to investigate new functional and/or structural features, which remained so far uncovered. Altogether, the data described here provide new insights into the protein architecture and organization throughout the three kingdoms of life. Spontaneous or induced folding into a specific 3D structure is a key property of proteins to perform their biological functions. Folded 3D structures of proteins perform specific functions, including interactions with other proteins. Intrinsically disordered regions also mediate interaction, gaining structure only when bound to a target protein. In both cases, hydrophobicity generally plays a major role in the protein segment “foldability”. Here, we developed an original procedure to identify foldable segments from only the information of a single amino acid sequence and to explore protein structures at a proteomic scale. Our approach goes beyond the simple consideration of mean hydrophobicity, by including the secondary structure information through the use of a two-dimensional transposition of the sequence. The developed procedure, combined with disorder predictors, may facilitate the specific identification of small segments that undergo coupled folding and binding. Combined with the analysis of specific domain databases, it also highlights orphan foldable segments, which remain yet uncharacterized.
Collapse
Affiliation(s)
- Guilhem Faure
- CNRS, UPMC Univ Paris 6, IMPMC, UMR7590 - IUC, Paris, France
| | | |
Collapse
|
28
|
Faure G, Callebaut I. Identification of hidden relationships from the coupling of hydrophobic cluster analysis and domain architecture information. ACTA ACUST UNITED AC 2013; 29:1726-33. [PMID: 23677940 DOI: 10.1093/bioinformatics/btt271] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022]
Abstract
MOTIVATION Describing domain architecture is a critical step in the functional characterization of proteins. However, some orphan domains do not match any profile stored in dedicated domain databases and are thereby difficult to analyze. RESULTS We present here an original novel approach, called TREMOLO-HCA, for the analysis of orphan domain sequences and inspired from our experience in the use of Hydrophobic Cluster Analysis (HCA). Hidden relationships between protein sequences can be more easily identified from the PSI-BLAST results, using information on domain architecture, HCA plots and the conservation degree of amino acids that may participate in the protein core. This can lead to reveal remote relationships with known families of domains, as illustrated here with the identification of a hidden Tudor tandem in the human BAHCC1 protein and a hidden ET domain in the Saccharomyces cerevisiae Taf14p and human AF9 proteins. The results obtained in such a way are consistent with those provided by HHPRED, based on pairwise comparisons of HHMs. Our approach can, however, be applied even in absence of domain profiles or known 3D structures for the identification of novel families of domains. It can also be used in a reverse way for refining domain profiles, by starting from known protein domain families and identifying highly divergent members, hitherto considered as orphan. AVAILABILITY We provide a possible integration of this approach in an open TREMOLO-HCA package, which is fully implemented in python v2.7 and is available on request. Instructions are available at http://www.impmc.upmc.fr/∼callebau/tremolohca.html. CONTACT isabelle.callebaut@impmc.upmc.fr SUPPLEMENTARY INFORMATION Supplementary Data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Guilhem Faure
- IMPMC, UMR7590, CNRS, Université Pierre et Marie Curie-Paris6, Paris Cedex 05, France
| | | |
Collapse
|
29
|
Voxeur A, André A, Breton C, Lerouge P. Identification of putative rhamnogalacturonan-II specific glycosyltransferases in Arabidopsis using a combination of bioinformatics approaches. PLoS One 2012; 7:e51129. [PMID: 23272088 PMCID: PMC3522684 DOI: 10.1371/journal.pone.0051129] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2012] [Accepted: 10/31/2012] [Indexed: 01/24/2023] Open
Abstract
Rhamnogalacturonan-II (RG-II) is a complex plant cell wall polysaccharide that is composed of an α(1,4)-linked homogalacturonan backbone substituted with four side chains. It exists in the cell wall in the form of a dimer that is cross-linked by a borate di-ester. Despite its highly complex structure, RG-II is evolutionarily conserved in the plant kingdom suggesting that this polymer has fundamental functions in the primary wall organisation. In this study, we have set up a bioinformatics strategy aimed at identifying putative glycosyltransferases (GTs) involved in RG-II biosynthesis. This strategy is based on the selection of candidate genes encoding type II membrane proteins that are tightly coexpressed in both rice and Arabidopsis with previously characterised genes encoding enzymes involved in the synthesis of RG-II and exhibiting an up-regulation upon isoxaben treatment. This study results in the final selection of 26 putative Arabidopsis GTs, including 10 sequences already classified in the CAZy database. Among these CAZy sequences, the screening protocol allowed the selection of α-galacturonosyltransferases involved in the synthesis of α4-GalA oligogalacturonides present in both homogalacturonans and RG-II, and two sialyltransferase-like sequences previously proposed to be involved in the transfer of Kdo and/or Dha on the pectic backbone of RG-II. In addition, 16 non-CAZy GT sequences were retrieved in the present study. Four of them exhibited a GT-A fold. The remaining sequences harbored a GT-B like fold and a fucosyltransferase signature. Based on homologies with glycosyltransferases of known functions, putative roles in the RG-II biosynthesis are proposed for some GT candidates.
Collapse
Affiliation(s)
- Aline Voxeur
- Laboratoire Glyco-MEV, EA 4358, Institut de Recherche et d'Innovation Biotechnologique, University of Rouen, Mont-Saint-Aignan, France
| | - Aurélie André
- Laboratoire Glyco-MEV, EA 4358, Institut de Recherche et d'Innovation Biotechnologique, University of Rouen, Mont-Saint-Aignan, France
| | | | - Patrice Lerouge
- Laboratoire Glyco-MEV, EA 4358, Institut de Recherche et d'Innovation Biotechnologique, University of Rouen, Mont-Saint-Aignan, France
| |
Collapse
|
30
|
Callebaut I, Mornon JP. The PWAPA cassette: Intimate association of a PHD-like finger and a winged-helix domain in proteins included in histone-modifying complexes. Biochimie 2012; 94:2006-12. [DOI: 10.1016/j.biochi.2012.05.025] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2012] [Accepted: 05/14/2012] [Indexed: 12/11/2022]
|
31
|
Op de Beeck K, Van Camp G, Thys S, Cools N, Callebaut I, Vrijens K, Van Nassauw L, Van Tendeloo VFI, Timmermans JP, Van Laer L. The DFNA5 gene, responsible for hearing loss and involved in cancer, encodes a novel apoptosis-inducing protein. Eur J Hum Genet 2011; 19:965-73. [PMID: 21522185 DOI: 10.1038/ejhg.2011.63] [Citation(s) in RCA: 95] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022] Open
Abstract
DFNA5 was first identified as a gene causing autosomal dominant hearing loss (HL). Different mutations have been found, all exerting a highly specific gain-of-function effect, in which skipping of exon 8 causes the HL. Later reports revealed the involvement of the gene in different types of cancer. Epigenetic silencing of DFNA5 in a large percentage of gastric, colorectal and breast tumors and p53-dependent transcriptional activity have been reported, concluding that DFNA5 acts as a tumor suppressor gene in different frequent types of cancer. Despite these data, the molecular function of DFNA5 has not been investigated properly. Previous transfection studies with mutant DFNA5 in yeast and in mammalian cells showed a toxic effect of the mutant protein, which was not seen after transfection of the wild-type protein. Here, we demonstrate that DFNA5 is composed of two domains, separated by a hinge region. The first region induces apoptosis when transfected in HEK293T cells, the second region masks and probably regulates this apoptosis inducing capability. Moreover, the involvement of DFNA5 in apoptosis-related pathways in a physiological setting was demonstrated through gene expression microarray analysis using Dfna5 knockout mice. In view of its important role in carcinogenesis, this finding is expected to lead to new insights on the role of apoptosis in many types of cancer. In addition, it provides a new line of evidence supporting an important role for apoptosis in monogenic and complex forms of HL.
Collapse
Affiliation(s)
- Ken Op de Beeck
- Center of Medical Genetics, Department of Biomedical Sciences, University of Antwerp, Universiteitsplein 1, Wilrijk, Belgium
| | | | | | | | | | | | | | | | | | | |
Collapse
|
32
|
Kapusta A, Matsuda A, Marmignon A, Ku M, Silve A, Meyer E, Forney JD, Malinsky S, Bétermier M. Highly precise and developmentally programmed genome assembly in Paramecium requires ligase IV-dependent end joining. PLoS Genet 2011; 7:e1002049. [PMID: 21533177 PMCID: PMC3077386 DOI: 10.1371/journal.pgen.1002049] [Citation(s) in RCA: 51] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2010] [Accepted: 02/25/2011] [Indexed: 01/09/2023] Open
Abstract
During the sexual cycle of the ciliate Paramecium, assembly of the somatic genome includes the precise excision of tens of thousands of short, non-coding germline sequences (Internal Eliminated Sequences or IESs), each one flanked by two TA dinucleotides. It has been reported previously that these genome rearrangements are initiated by the introduction of developmentally programmed DNA double-strand breaks (DSBs), which depend on the domesticated transposase PiggyMac. These DSBs all exhibit a characteristic geometry, with 4-base 5′ overhangs centered on the conserved TA, and may readily align and undergo ligation with minimal processing. However, the molecular steps and actors involved in the final and precise assembly of somatic genes have remained unknown. We demonstrate here that Ligase IV and Xrcc4p, core components of the non-homologous end-joining pathway (NHEJ), are required both for the repair of IES excision sites and for the circularization of excised IESs. The transcription of LIG4 and XRCC4 is induced early during the sexual cycle and a Lig4p-GFP fusion protein accumulates in the developing somatic nucleus by the time IES excision takes place. RNAi–mediated silencing of either gene results in the persistence of free broken DNA ends, apparently protected against extensive resection. At the nucleotide level, controlled removal of the 5′-terminal nucleotide occurs normally in LIG4-silenced cells, while nucleotide addition to the 3′ ends of the breaks is blocked, together with the final joining step, indicative of a coupling between NHEJ polymerase and ligase activities. Taken together, our data indicate that IES excision is a “cut-and-close” mechanism, which involves the introduction of initiating double-strand cleavages at both ends of each IES, followed by DSB repair via highly precise end joining. This work broadens our current view on how the cellular NHEJ pathway has cooperated with domesticated transposases for the emergence of new mechanisms involved in genome dynamics. Double-strand breaks (DSBs) are among the most deleterious lesions that may occur on DNA. Some physiological processes, however, involve the introduction of DSBs and their subsequent repair. In the ciliate Paramecium, programmed DSBs initiate the extensive genome rearrangements that take place at each sexual cycle, during the development of the somatic nucleus. In particular, short intervening germline sequences (one every 1–2 kb along the genome) are spliced out from coding and non-coding regions. In this study, we present evidence that this process is a two-step mechanism and involves DNA cleavage at both ends of each excised sequence, followed by DSB repair. We demonstrate that cellular end-joining proteins, Ligase IV and its partner, Xrcc4p, are essential for the closure of broken excision sites, which has to be precise at the nucleotide level to allow the assembly of functional genes. This precision stands in sharp contrast to the notion that end joining is an error-prone DSB repair pathway. Therefore, Paramecium provides an excellent model for analysis of an intrinsically precise end joining pathway that has been recruited for genome-wide DSB repair.
Collapse
Affiliation(s)
- Aurélie Kapusta
- CNRS UPR3404, Centre de Génétique Moléculaire, Gif-sur-Yvette, France
- Université Paris 11, Département de Biologie, Orsay, France
- CNRS FRC3115, Centre de Recherches de Gif–sur-Yvette, Gif-sur-Yvette, France
| | - Atsushi Matsuda
- Department of Biochemistry, Purdue University, West Lafayette, Indiana, United States of America
| | - Antoine Marmignon
- CNRS UPR3404, Centre de Génétique Moléculaire, Gif-sur-Yvette, France
- Université Paris 11, Département de Biologie, Orsay, France
- CNRS FRC3115, Centre de Recherches de Gif–sur-Yvette, Gif-sur-Yvette, France
| | - Michael Ku
- Department of Biochemistry, Purdue University, West Lafayette, Indiana, United States of America
| | - Aude Silve
- CNRS UPR3404, Centre de Génétique Moléculaire, Gif-sur-Yvette, France
| | - Eric Meyer
- Institut de Biologie de l'Ecole Normale Supérieure, CNRS UMR8197, INSERM U1024, Paris, France
| | - James D. Forney
- Department of Biochemistry, Purdue University, West Lafayette, Indiana, United States of America
| | - Sophie Malinsky
- Institut de Biologie de l'Ecole Normale Supérieure, CNRS UMR8197, INSERM U1024, Paris, France
- Université Paris Diderot – Paris 7, UFR des Sciences du Vivant, Paris, France
| | - Mireille Bétermier
- CNRS UPR3404, Centre de Génétique Moléculaire, Gif-sur-Yvette, France
- Université Paris 11, Département de Biologie, Orsay, France
- CNRS FRC3115, Centre de Recherches de Gif–sur-Yvette, Gif-sur-Yvette, France
- * E-mail:
| |
Collapse
|
33
|
Hansen SF, Bettler E, Rinnan A, Engelsen SB, Breton C. Exploring genomes for glycosyltransferases. MOLECULAR BIOSYSTEMS 2010; 6:1773-81. [PMID: 20556308 DOI: 10.1039/c000238k] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
Glycosyltransferases are one of the largest and most diverse enzyme groups in Nature. They catalyse the synthesis of glycosidic linkages by the transfer of a sugar residue from a donor to an acceptor substrate. These enzymes have been classified into families on the basis of amino acid sequence similarity that are kept updated in the Carbohydrate Active enZyme database (CAZy, ). The repertoire of glycosyltransferases in genomes is believed to determine the diversity of cellular glycan structures, and current estimates suggest that for most genomes about 1% of the coding regions are glycosyltransferases. However, plants tend to have far more glycosyltransferase genes than any other organism sequenced to date, and this can be explained by the highly complex polysaccharide network that form the cell wall and also by the numerous glycosylated secondary metabolites. In recent years, various bioinformatics strategies have been used to search bacterial and plant genomes for new glycosyltransferase genes. These are based on the use of remote homology detection methods that act at the 1D, 2D, and 3D level. The combined use of methods such as profile Hidden Markov Model (HMM) and fold recognition appears to be appropriate for this class of enzyme. Chemometric tools are also particularly well suited for obtaining an overview of multivariate data and revealing hidden latent information when dealing with large and highly complex datasets.
Collapse
|
34
|
Callebaut I, Mornon JP. LOTUS, a new domain associated with small RNA pathways in the germline. Bioinformatics 2010; 26:1140-4. [DOI: 10.1093/bioinformatics/btq122] [Citation(s) in RCA: 57] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023] Open
|
35
|
Finding of residues crucial for supersecondary structure formation. Proc Natl Acad Sci U S A 2009; 106:18996-9000. [PMID: 19855006 DOI: 10.1073/pnas.0909714106] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
This work evaluates the hypothesis that proteins with an identical supersecondary structure (SSS) share a unique set of residues--SSS-determining residues--even though they may belong to different protein families and have very low sequence similarities. This hypothesis was tested on two groups of sandwich-like proteins (SPs). Proteins in each group have an identical SSS, but their sequence similarity is below the "twilight zone." To find the SSS-determining residues specific to each group, a unique structure-based algorithm of multiple sequences alignment was developed. The units of alignment are individual strands and loops rather than whole sequences. The algorithm is based on the alignment of residues that form hydrogen bonds between corresponding strands. Structure-based alignment revealed that 30-35% of the positions in the sequences in each group of proteins are "conserved positions" occupied either by hydrophobic-only or hydrophilic-only residues. Moreover, each group of SPs is characterized by a unique set of SSS-determining residues found at the conserved positions. The set of SSS-determining residues has very high sensitivity and specificity for identifying proteins with a corresponding SSS: It is an "amino acid tag" that brands a sequence as having a particular SSS. Thus, the sets of SSS-determining residues can be used to classify proteins and to predict the SSS of a query amino acid sequence.
Collapse
|
36
|
Mornon JP, Lehn P, Callebaut I. Molecular models of the open and closed states of the whole human CFTR protein. Cell Mol Life Sci 2009; 66:3469-86. [PMID: 19707853 PMCID: PMC11115851 DOI: 10.1007/s00018-009-0133-0] [Citation(s) in RCA: 117] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2009] [Revised: 07/17/2009] [Accepted: 08/12/2009] [Indexed: 12/15/2022]
Abstract
Cystic fibrosis transmembrane conductance regulator (CFTR), involved in cystic fibrosis (CF), is a chloride channel belonging to the ATP-binding cassette (ABC) superfamily. Using the experimental structure of Sav1866 as template, we previously modeled the human CFTR structure, including membrane-spanning domains (MSD) and nucleotide-binding domains (NBD), in an outward-facing conformation (open channel state). Here, we constructed a model of the CFTR inward-facing conformation (closed channel) on the basis of the recent corrected structures of MsbA and compared the structural features of those two states of the channel. Interestingly, the MSD:NBD coupling interfaces including F508 (DeltaF508 being the most common CF mutation) are mainly left unchanged. This prediction, completed by the modeling of the regulatory R domain, is supported by experimental data and provides a molecular basis for a better understanding of the functioning of CFTR, especially of the structural features that make CFTR the unique channel among the ABC transporters.
Collapse
Affiliation(s)
- Jean-Paul Mornon
- IMPMC, UMR7590, CNRS, Universités Pierre et Marie Curie-Paris 6 et Denis Diderot-Paris 7, 140 rue de Lourmel, Paris, France
| | - Pierre Lehn
- INSERM U613, IFR148 ScInBioS, Université de Bretagne Occidentale, Brest, France
| | - Isabelle Callebaut
- IMPMC, UMR7590, CNRS, Universités Pierre et Marie Curie-Paris 6 et Denis Diderot-Paris 7, 140 rue de Lourmel, Paris, France
| |
Collapse
|
37
|
Damaj R, Pomel S, Bricheux G, Coffe G, Viguès B, Ravet V, Bouchard P. Cross-study analysis of genomic data defines the ciliate multigenic epiplasmin family: strategies for functional analysis in Paramecium tetraurelia. BMC Evol Biol 2009; 9:125. [PMID: 19493334 PMCID: PMC2709106 DOI: 10.1186/1471-2148-9-125] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2008] [Accepted: 06/03/2009] [Indexed: 05/25/2023] Open
Abstract
Background The sub-membranous skeleton of the ciliate Paramecium, the epiplasm, is composed of hundreds of epiplasmic scales centered on basal bodies, and presents a complex set of proteins, epiplasmins, which belong to a multigenic family. The repeated duplications observed in the P. tetraurelia genome present an interesting model of the organization and evolution of a multigenic family within a single cell. Results To study this multigenic family, we used phylogenetic, structural, and analytical transcriptional approaches. The phylogenetic method defines 5 groups of epiplasmins in the multigenic family. A refined analysis by Hydrophobic Cluster Analysis (HCA) identifies structural characteristics of 51 epiplasmins, defining five separate groups, and three classes. Depending on the sequential arrangement of their structural domains, the epiplasmins are defined as symmetric, asymmetric or atypical. The EST data aid in this classification, in the identification of putative regulating sequences such as TATA or CAAT boxes. When specific RNAi experiments were conducted using sequences from either symmetric or asymmetric classes, phenotypes were drastic. Local effects show either disrupted or ill-shaped epiplasmic scales. In either case, this results in aborted cell division. Using structural features, we show that 4 epiplasmins are also present in another ciliate, Tetrahymena thermophila. Their affiliation with the distinctive structural groups of Paramecium epiplasmins demonstrates an interspecific multigenic family. Conclusion The epiplasmin multigenic family illustrates the history of genomic duplication in Paramecium. This study provides a framework which can guide functional analysis of epiplasmins, the major components of the membrane skeleton in ciliates. We show that this set of proteins handles an important developmental information in Paramecium since maintenance of epiplasm organization is crucial for cell morphogenesis.
Collapse
Affiliation(s)
- Raghida Damaj
- Laboratoire Microorganismes: Génome et Environnement (ex, LBP) UMR CNRS 6023, Université Blaise Pascal, 63177, Aubière cedex, France.
| | | | | | | | | | | | | |
Collapse
|
38
|
Hoffman RMB, Sykes BD. Isoform-specific variation in the intrinsic disorder of troponin I. Proteins 2009; 73:338-50. [PMID: 18433059 DOI: 10.1002/prot.22063] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]
Abstract
Various intrinsic disorder (ID) prediction algorithms were applied to the three tissue isoforms of troponin I (TnI). The results were interpreted in terms of the known structure and dynamics of troponin. In line with previous results, all isoforms of TnI were predicted to have large stretches of ID. The predictions show that the C-termini of all isoforms are extensively disordered as is the N-terminal extension of the cardiac isoform. Cardiac TnI likely belongs to the group of intrinsically disordered signalling hub proteins. For a given portion of the protein sequence, most ID prediction approaches indicate isoform-dependent variations in the probability of disorder. Comparison of machine learning and physically based approaches suggests the ID variations are only partially attributable to local variations in the ratio of charged to hydrophobic residues. The VSL2B algorithm predicts the largest variations in ID across the isoforms, with the cardiac isoform having the highest probability of structured regions, and the fast-skeletal isoform having no intrinsic structure. The region corresponding to residues 57-95 of the fast-skeletal isoform, known to form a coiled coil substructure with troponin T, was highly variable between isoforms. The isoform-specific ID variations may have mechanistic significance, modulating the extent to which conformational fluctuations in tropomyosin are communicated to the troponin complex. We discuss structural mechanisms for this communication. Overall, the results motivate the development of predictors designed to address relative levels of disorder between highly similar proteins.
Collapse
Affiliation(s)
- Ryan M B Hoffman
- Department of Biochemistry, University of Alberta, Edmonton, Canada
| | | |
Collapse
|
39
|
de Villartay JP, Shimazaki N, Charbonnier JB, Fischer A, Mornon JP, Lieber MR, Callebaut I. A histidine in the β-CASP domain of Artemis is critical for its full in vitro and in vivo functions. DNA Repair (Amst) 2009; 8:202-8. [DOI: 10.1016/j.dnarep.2008.10.010] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2008] [Revised: 09/30/2008] [Accepted: 10/08/2008] [Indexed: 01/03/2023]
|
40
|
Vos JW, Pieuchot L, Evrard JL, Janski N, Bergdoll M, de Ronde D, Perez LH, Sardon T, Vernos I, Schmit AC. The plant TPX2 protein regulates prospindle assembly before nuclear envelope breakdown. THE PLANT CELL 2008; 20:2783-97. [PMID: 18941054 PMCID: PMC2590745 DOI: 10.1105/tpc.107.056796] [Citation(s) in RCA: 85] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/09/2007] [Revised: 10/02/2008] [Accepted: 10/09/2008] [Indexed: 05/18/2023]
Abstract
The Targeting Protein for Xklp2 (TPX2) is a central regulator of spindle assembly in vertebrate cells. The absence or excess of TPX2 inhibits spindle formation. We have defined a TPX2 signature motif that is present once in vertebrate sequences but twice in plants. Plant TPX2 is predominantly nuclear during interphase and is actively exported before nuclear envelope breakdown to initiate prospindle assembly. It localizes to the spindle microtubules but not to the interdigitating polar microtubules during anaphase or to the phragmoplast as it is rapidly degraded during telophase. We characterized the Arabidopsis thaliana TPX2-targeting domains and show that the protein is able to rescue microtubule assembly in TPX2-depleted Xenopus laevis egg extracts. Injection of antibodies to TPX2 into living plant cells inhibits the onset of mitosis. These results demonstrate that plant TPX2 already functions before nuclear envelope breakdown. Thus, plants have adapted nuclear-cytoplasmic shuttling of TPX2 to maintain proper spindle assembly without centrosomes.
Collapse
Affiliation(s)
- Jan W Vos
- Laboratory of Plant Cell Biology, Wageningen University, 6703 BD Wageningen, The Netherlands
| | | | | | | | | | | | | | | | | | | |
Collapse
|
41
|
Dufourny L, Levasseur A, Migaud M, Callebaut I, Pontarotti P, Malpaux B, Monget P. GPR50 is the mammalian ortholog of Mel1c: evidence of rapid evolution in mammals. BMC Evol Biol 2008; 8:105. [PMID: 18400093 PMCID: PMC2323367 DOI: 10.1186/1471-2148-8-105] [Citation(s) in RCA: 90] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2007] [Accepted: 04/09/2008] [Indexed: 12/15/2022] Open
Abstract
BACKGROUND The melatonin receptor subfamily contains three members Mel1a, Mel1b and Mel1c, found in all vertebrates except for Mel1c which is found only in fish, Xenopus species and the chicken. Another receptor, the melatonin related receptor known as GPR50, found exclusively in mammals and later identified as a member of the melatonin receptor subfamily because of its identity to the three melatonin receptors despite its absence of affinity for melatonin. The aim of this study was to describe the evolutionary relationships between GPR50 and the three other members of the melatonin receptor subfamily. RESULTS Using an in silico approach, we demonstrated that GPR50 is the ortholog of the high affinity Mel1c receptor. It was necessary to also study the synteny of this gene to reach this conclusion because classical mathematical models that estimate orthology and build phylogenetic trees were not sufficient. The receptor has been deeply remodelled through evolution by the mutation of numerous amino acids and by the addition of a long C-terminal tail. These alterations have modified its affinity for melatonin and probably affected its interactions with the other two known melatonin receptors MT1 and MT2 that are encoded by Mel1a and Mel1b genes respectively. Evolutionary studies provided evidence that the GPR50 group evolved under different selective pressure as compared to the orthologous groups Me11 a, b, and c. CONCLUSION This study demonstrated that there are only three members in the melatonin receptor subfamily with one of them (Me11c) undergoing rapid evolution from fishes and birds to mammals. Further studies are necessary to investigate the physiological roles of this receptor.
Collapse
Affiliation(s)
- Laurence Dufourny
- Physiologie de la Reproduction et des Comportements, UMR 6175 INRA-CNRS-Université François Rabelais de Tours-Haras Nationaux, 37380 Nouzilly, France.
| | | | | | | | | | | | | |
Collapse
|
42
|
Sagar GDV, Gereben B, Callebaut I, Mornon JP, Zeöld A, Curcio-Morelli C, Harney JW, Luongo C, Mulcahey MA, Larsen PR, Huang SA, Bianco AC. The thyroid hormone-inactivating deiodinase functions as a homodimer. Mol Endocrinol 2008; 22:1382-93. [PMID: 18356288 DOI: 10.1210/me.2007-0490] [Citation(s) in RCA: 41] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023] Open
Abstract
The type 3 deiodinase (D3) inactivates thyroid hormone action by catalyzing tissue-specific inner ring deiodination, predominantly during embryonic development. D3 has gained much attention as a player in the euthyroid sick syndrome, given its robust reactivation during injury and/or illness. Whereas much of the structure biology of the deiodinases is derived from studies with D2, a dimeric endoplasmic reticulum obligatory activating deiodinase, little is known about the holostructure of the plasma membrane resident D3, the deiodinase capable of thyroid hormone inactivation. Here we used fluorescence resonance energy transfer in live cells to demonstrate that D3 exists as homodimer. While D3 homodimerized in its native state, minor heterodimerization was also observed between D3:D1 and D3:D2 in intact cells, the significance of which remains elusive. Incubation with 0.5-1.2 m urea resulted in loss of D3 homodimerization as assessed by bioluminescence resonance energy transfer and a proportional loss of enzyme activity, to a maximum of approximately 50%. Protein modeling using a D2-based scaffold identified potential dimerization surfaces in the transmembrane and globular domains. Truncation of the transmembrane domain (DeltaD3) abrogated dimerization and deiodinase activity except when coexpressed with full-length catalytically inactive deiodinase, thus assembled as DeltaD3:D3 dimer; thus the D3 globular domain also exhibits dimerization surfaces. In conclusion, the inactivating deiodinase D3 exists as homo- or heterodimer in living intact cells, a feature that is critical for their catalytic activities.
Collapse
Affiliation(s)
- G D Vivek Sagar
- Brigham and Women's Hospital, Harvard Medical School , Boston, Massachusetts 02115, USA
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
43
|
Carrière C, Mornon JP, Venien-Bryan C, Boisset N, Callebaut I. Calcineurin B-like domains in the large regulatory α/β subunits of phosphorylase kinase. Proteins 2008; 71:1597-606. [DOI: 10.1002/prot.22006] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
|
44
|
Callebaut I, Mornon JP, Monget P. Isolated ZP-N domains constitute the N-terminal extensions of Zona Pellucida proteins. ACTA ACUST UNITED AC 2007; 23:1871-4. [PMID: 17510169 DOI: 10.1093/bioinformatics/btm265] [Citation(s) in RCA: 40] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/15/2023]
Abstract
Zona Pellucida (ZP) domains have been found in a wide variety of extracellular proteins, in which they play essential role for polymerization. They are shared by the ZP proteins, which constitute the extracellular coat of animal eggs. Except from ZP3, constituting the primary sperm receptor, the ZP proteins possess, in addition to their C-terminal ZP domains, N-terminal extensions, which are thought to play an important role in the species-specific gamete recognition. Here, we show that these extensions are made of single or multiple copies of a small globular domain, which can be significantly related to the N-terminal region of ZP domains (ZP-N domains). This finding brings new insights into the molecular evolution of ZP proteins, which may have evolved around a common ZP-N architecture, and more generally into the noticeable sequence diversity of ZP-N domains, which can be found as isolated subunits or tightly associated with ZP-C domains to form complete, canonical ZP domains.
Collapse
Affiliation(s)
- Isabelle Callebaut
- Département de Biologie Structurale, IMPMC UMR 7590, Universités Pierre et Marie Curie-Paris6, Paris, France.
| | | | | |
Collapse
|
45
|
Dulin F, Callebaut I, Colloc'h N, Mornon JP. Sequence-based modeling of Aβ42 soluble oligomers. Biopolymers 2007; 85:422-37. [PMID: 17211889 DOI: 10.1002/bip.20675] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
Abeta fibrils, which are central to the pathology of Alzheimer's disease, form a cross-beta-structure that contains likely parallel beta-sheets with a salt bridge between residues Asp23 and Lys28. Recent studies suggest that soluble oligomers of amyloid peptides have neurotoxic effects in cell cultures, raising the interest in studying the structures of these intermediate forms. Here, we present three models of possible soluble Abeta forms based on the sequences similarities, assumed to support local structural similarities, of the Abeta peptide with fragments of three proteins (adhesin, Semliki Forest virus capsid protein, and transthyretin). These three models share a similar structure in the C-terminal region composed of two beta-strands connected by a loop, which contain the Asp23-Lys28 salt bridge. This segment is also structurally well conserved in Abeta fibril forms. Differences between the three monomeric models occur in the N-terminal region and in the C-terminal tail. These three models might sample some of the most stable conformers of the soluble Abeta peptide within oligomeric assemblies, which were modeled here in the form of dimers, trimers, tetramers, and hexamers. The consistency of these models is discussed with respect to available experimental and theoretical data.
Collapse
Affiliation(s)
- Fabienne Dulin
- Département de Biologie Structurale, IMPMC, CNRS UMR7590, Universités Pierre et Marie Curie-Paris 6 et Denis Diderot-Paris 7, F-75005 France
| | | | | | | |
Collapse
|