1
|
Tanoz I, Timsit Y. Protein Fold Usages in Ribosomes: Another Glance to the Past. Int J Mol Sci 2024; 25:8806. [PMID: 39201491 PMCID: PMC11354259 DOI: 10.3390/ijms25168806] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2024] [Revised: 08/07/2024] [Accepted: 08/08/2024] [Indexed: 09/02/2024] Open
Abstract
The analysis of protein fold usage, similar to codon usage, offers profound insights into the evolution of biological systems and the origins of modern proteomes. While previous studies have examined fold distribution in modern genomes, our study focuses on the comparative distribution and usage of protein folds in ribosomes across bacteria, archaea, and eukaryotes. We identify the prevalence of certain 'super-ribosome folds,' such as the OB fold in bacteria and the SH3 domain in archaea and eukaryotes. The observed protein fold distribution in the ribosomes announces the future power-law distribution where only a few folds are highly prevalent, and most are rare. Additionally, we highlight the presence of three copies of proto-Rossmann folds in ribosomes across all kingdoms, showing its ancient and fundamental role in ribosomal structure and function. Our study also explores early mechanisms of molecular convergence, where different protein folds bind equivalent ribosomal RNA structures in ribosomes across different kingdoms. This comparative analysis enhances our understanding of ribosomal evolution, particularly the distinct evolutionary paths of the large and small subunits, and underscores the complex interplay between RNA and protein components in the transition from the RNA world to modern cellular life. Transcending the concept of folds also makes it possible to group a large number of ribosomal proteins into five categories of urfolds or metafolds, which could attest to their ancestral character and common origins. This work also demonstrates that the gradual acquisition of extensions by simple but ordered folds constitutes an inexorable evolutionary mechanism. This observation supports the idea that simple but structured ribosomal proteins preceded the development of their disordered extensions.
Collapse
Affiliation(s)
- Inzhu Tanoz
- Aix-Marseille Université, Université de Toulon, IRD, CNRS, Mediterranean Institute of Oceanography (MIO), UM 110, 13288 Marseille, France;
| | - Youri Timsit
- Aix-Marseille Université, Université de Toulon, IRD, CNRS, Mediterranean Institute of Oceanography (MIO), UM 110, 13288 Marseille, France;
- Research Federation for the Study of Global Ocean Systems Ecology and Evolution, FR2022/Tara GOSEE, 3 Rue Michel-Ange, 75016 Paris, France
| |
Collapse
|
2
|
Meadows J, Röder K. The Effect of Pulling and Twisting Forces on Chameleon Sequence Peptides. Chemphyschem 2023; 24:e202300351. [PMID: 37818741 DOI: 10.1002/cphc.202300351] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2023] [Revised: 10/06/2023] [Accepted: 10/06/2023] [Indexed: 10/13/2023]
Abstract
Chameleon sequences are amino acid sequences found in several distinct configurations in experiment. They challenge our understanding of the link between sequence and structure, and provide insight into structural competition in proteins. Here, we study the energy landscapes for three such sequences, and interrogate how pulling and twisting forces impact the available structural ensembles. Chameleon sequences do not necessarily exhibit multiple structural ensembles on a multifunnel energy landscape when we consider them in isolation. The application of even small forces leads to drastic changes in the energy landscapes. For pulling forces, we observe transitions from helical to extended structures in a very small span of forces. For twisting forces, the picture is much more complex, and highly dependent on the magnitude and handedness of the applied force as well as the reference angle for the twist. Depending on these parameters, more complex and more simplistic energy landscapes are observed alongside more and less diverse structural ensembles. The impact of even small forces is significant, confirming their likely role in folding events. In addition, small forces exerted by the remaining scaffold of a protein may be sufficient to lead to the adoption of a specific structural ensemble by a chameleon sequence.
Collapse
Affiliation(s)
- James Meadows
- Department of Chemistry, Durham University, Stockton Road, Durham, DH1 3LE, UK
- Previous affiliation: Yusuf Hamied Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge, CB2 1EW, UK
| | - Konstantin Röder
- Randall Centre for Cell & Molecular Biophysics, King's College London, Guy's Campus, Great Maze Pond, London, SE1 1UL, UK
- Previous affiliation: Yusuf Hamied Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge, CB2 1EW, UK
| |
Collapse
|
3
|
Dimer dissociation is a key energetic event in the fold-switch pathway of KaiB. Biophys J 2022; 121:943-955. [PMID: 35151633 PMCID: PMC8943816 DOI: 10.1016/j.bpj.2022.02.012] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2021] [Revised: 12/14/2021] [Accepted: 02/09/2022] [Indexed: 11/21/2022] Open
Abstract
Cyanobacteria possesses the simplest circadian clock, composed of three proteins that act as a phosphorylation oscillator: KaiA, KaiB, and KaiC. The timing of this oscillator is determined by the fold-switch of KaiB, a structural rearrangement of its C-terminal half that is accompanied by a change in the oligomerization state. During the day, KaiB forms a stable tetramer (gsKaiB), whereas it adopts a monomeric thioredoxin-like fold during the night (fsKaiB). Although the structures and functions of both native states are well studied, little is known about the sequence and structure determinants that control their structural interconversion. Here, we used confinement molecular dynamics (CCR-MD) and folding simulations using structure-based models to show that the dissociation of the gsKaiB dimer is a key energetic event for the fold-switch. Hydrogen-deuterium exchange mass spectrometry (HDXMS) recapitulates the local stability of protein regions reported by CCR-MD, with both approaches consistently indicating that the energy and backbone flexibility changes are solely associated with the region that fold-switches between gsKaiB and fsKaiB and that the localized regions that differentially stabilize gsKaiB also involve regions outside the dimer interface. Moreover, two mutants (R23C and R75C) previously reported to be relevant for altering the rhythmicity of the Kai clock were also studied by HDXMS. Particularly, R75C populates dimeric and monomeric states with a deuterium incorporation profile comparable to the one observed for fsKaiB, emphasizing the importance of the oligomerization state of KaiB for the fold-switch. These findings suggest that the information necessary to control the rhythmicity of the cyanobacterial biological clock is, to a great extent, encoded within the KaiB sequence.
Collapse
|
4
|
Sannigrahi A, Chattopadhyay K. Pore formation by pore forming membrane proteins towards infections. ADVANCES IN PROTEIN CHEMISTRY AND STRUCTURAL BIOLOGY 2021; 128:79-111. [PMID: 35034727 DOI: 10.1016/bs.apcsb.2021.09.001] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
Over the last 25 years, the biology of membrane proteins, including the PFPs-membranes interactions is seeking attention for the development of successful drug molecules against a number of infectious diseases. Pore forming toxins (PFTs), the largest family of PFPs are considered as a group of virulence factors produced in a large number of pathogenic systems which include streptococcus, pneumonia, Staphylococcus aureus, E. coli, Mycobacterium tuberculosis, group A and B streptococci, Corynebacterium diphtheria and many more. PFTs are generally utilized by the disease causing pathogens to disrupt the host first line of defense i.e. host cell membranes through pore formation strategy. Although, pore formation is the principal mode of action of the PFTs but they can have additional adverse effects on the hosts including immune evasion. Recently, structural investigation of different PFTs have imparted the molecular mechanistic insights into how PFTs get transformed from its inactive state to active toxic state. On the basis of their structural entity, PFTs have been classified in different types and their mode of actions alters in terms of pore formation and corresponding cellular toxicity. Although pathogen genome analysis can identify the probable PFTs depending upon their structural diversity, there are so many PFTs which utilize the local environmental conditions to generate their pore forming ability using a novel strategy which is known as "conformational switch" of a protein. This conformational switch is considered as characteristics of the phase shifting proteins which were often utilized by many pathogenic systems to protect them from the invaders through allosteric communication between distant regions of the protein. In this chapter, we discuss the structure function relationships of PFTs and how activity of PFTs varies with the change in the environmental conditions has been explored. Finally, we demonstrate these structural insights to develop therapeutic potential to treat the infections caused by multidrug resistant pathogens.
Collapse
Affiliation(s)
- Achinta Sannigrahi
- Department of Chemical Engineering, Indian Institute of Science, Bengaluru, Karnataka, India.
| | - Krishnananda Chattopadhyay
- Structural Biology and Bioinformatics Division, CSIR-Indian Institute of Chemical Biology, Kolkata, West Bengal, India.
| |
Collapse
|
5
|
Kuznetsov IB. Identification of non-random sequence properties in groups of signature peptides obtained in random sequence peptide microarray experiments. Biopolymers 2016; 106:318-29. [PMID: 27037995 DOI: 10.1002/bip.22845] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2015] [Revised: 02/16/2016] [Accepted: 03/28/2016] [Indexed: 11/09/2022]
Abstract
Immunosignaturing is an emerging experimental technique that uses random sequence peptide microarrays to detect antibodies produced by the immune system in response to a particular disease. Two important questions regarding immunosignaturing are "Do microarray peptides that exhibit a strong affinity to a given type of antibodies share common sequence properties?" and "If so, what are those properties?" In this work, three statistical tests designed to detect non-random patterns in the amino acid makeup of a group of microarray peptides are presented. One test detects patterns of significantly biased amino acid usage, whereas the other two detect patterns of significant bias in the biochemical properties. These tests do not require a large number of peptides per group. The tests were applied to analyze 19 groups of peptides identified in immunosignaturing experiments as being specific for antibodies produced in response to various types of cancer and other diseases. The positional distribution of the biochemical properties of the amino acids in these 19 peptide groups was also studied. Remarkably, despite the random nature of the sequence libraries used to design the microarrays, a unique group-specific non-random pattern was identified in the majority of the peptide groups studied. © 2016 Wiley Periodicals, Inc. Biopolymers (Pept Sci) 106: 318-329, 2016.
Collapse
Affiliation(s)
- Igor B Kuznetsov
- Cancer Research Center and Department of Epidemiology and Biostatistics, University at Albany, State University of New York, One Discovery Drive, Rensselaer, NY, 12144
| |
Collapse
|
6
|
Bahramali G, Goliaei B, Minuchehr Z, Salari A. Chameleon sequences in neurodegenerative diseases. Biochem Biophys Res Commun 2016; 472:209-16. [PMID: 26920059 PMCID: PMC7124260 DOI: 10.1016/j.bbrc.2016.01.187] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2016] [Accepted: 01/30/2016] [Indexed: 01/03/2023]
Abstract
Chameleon sequences can adopt either alpha helix sheet or a coil conformation. Defining chameleon sequences in PDB (Protein Data Bank) may yield to an insight on defining peptides and proteins responsible in neurodegeneration. In this research, we benefitted from the large PDB and performed a sequence analysis on Chameleons, where we developed an algorithm to extract peptide segments with identical sequences, but different structures. In order to find new chameleon sequences, we extracted a set of 8315 non-redundant protein sequences from the PDB with an identity less than 25%. Our data was classified to "helix to strand (HE)", "helix to coil (HC)" and "strand to coil (CE)" alterations. We also analyzed the occurrence of singlet and doublet amino acids and the solvent accessibility in the chameleon sequences; we then sorted out the proteins with the most number of chameleon sequences and named them Chameleon Flexible Proteins (CFPs) in our dataset. Our data revealed that Gly, Val, Ile, Tyr and Phe, are the major amino acids in Chameleons. We also found that there are proteins such as Insulin Degrading Enzyme IDE and GTP-binding nuclear protein Ran (RAN) with the most number of chameleons (640 and 405 respectively). These proteins have known roles in neurodegenerative diseases. Therefore it can be inferred that other CFP's can serve as key proteins in neurodegeneration, and a study on them can shed light on curing and preventing neurodegenerative diseases.
Collapse
Affiliation(s)
- Golnaz Bahramali
- Institute of Biochemistry and Biophysics, University of Tehran, Tehran, Iran
| | - Bahram Goliaei
- Institute of Biochemistry and Biophysics, University of Tehran, Tehran, Iran.
| | - Zarrin Minuchehr
- Department of Systems Biotechnology, National Institute of Genetic Engineering and Biotechnology, (NIGEB), Tehran, Iran.
| | - Ali Salari
- Department of Systems Biotechnology, National Institute of Genetic Engineering and Biotechnology, (NIGEB), Tehran, Iran
| |
Collapse
|
7
|
Li W, Kinch LN, Karplus PA, Grishin NV. ChSeq: A database of chameleon sequences. Protein Sci 2015; 24:1075-86. [PMID: 25970262 DOI: 10.1002/pro.2689] [Citation(s) in RCA: 37] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2015] [Revised: 04/15/2015] [Accepted: 04/24/2015] [Indexed: 11/11/2022]
Abstract
Chameleon sequences (ChSeqs) refer to sequence strings of identical amino acids that can adopt different conformations in protein structures. Researchers have detected and studied ChSeqs to understand the interplay between local and global interactions in protein structure formation. The different secondary structures adopted by one ChSeq challenge sequence-based secondary structure predictors. With increasing numbers of available Protein Data Bank structures, we here identify a large set of ChSeqs ranging from 6 to 10 residues in length. The homologous ChSeqs discovered highlight the structural plasticity involved in biological function. When compared with previous studies, the set of unrelated ChSeqs found represents an about 20-fold increase in the number of detected sequences, as well as an increase in the longest ChSeq length from 8 to 10 residues. We applied secondary structure predictors on our ChSeqs and found that methods based on a sequence profile outperformed methods based on a single sequence. For the unrelated ChSeqs, the evolutionary information provided by the sequence profile typically allows successful prediction of the prevailing secondary structure adopted in each protein family. Our dataset will facilitate future studies of ChSeqs, as well as interpretations of the interplay between local and nonlocal interactions. A user-friendly web interface for this ChSeq database is available at prodata.swmed.edu/chseq.
Collapse
Affiliation(s)
- Wenlin Li
- Department of Biophysics, University of Texas Southwestern Medical Center, Dallas, Texas, 75390-9050.,Department of Biochemistry, University of Texas Southwestern Medical Center, Dallas, Texas, 75390-9050
| | - Lisa N Kinch
- Howard Hughes Medical Institute, University of Texas Southwestern Medical Center, Dallas, Texas, 75390-9050
| | - P Andrew Karplus
- Department of Biochemistry and Biophysics, Oregon State University, Corvallis, Oregon, 97331
| | - Nick V Grishin
- Department of Biophysics, University of Texas Southwestern Medical Center, Dallas, Texas, 75390-9050.,Department of Biochemistry, University of Texas Southwestern Medical Center, Dallas, Texas, 75390-9050.,Howard Hughes Medical Institute, University of Texas Southwestern Medical Center, Dallas, Texas, 75390-9050
| |
Collapse
|
8
|
Yamamoto N. Hot Spot of Structural Ambivalence in Prion Protein Revealed by Secondary Structure Principal Component Analysis. J Phys Chem B 2014; 118:9826-33. [DOI: 10.1021/jp5034245] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Affiliation(s)
- Norifumi Yamamoto
- Department of Life and Environmental
Sciences, Faculty of Engineering, Chiba Institute of Technology, 2-17-1 Tsudanuma, Narashino 275-0016, Japan
| |
Collapse
|
9
|
Kaur H, Sasidhar YU. Molecular dynamics study of an insertion/duplication mutant of bacteriophage T4 lysozyme reveals the nature of α→β transition in full protein context. Phys Chem Chem Phys 2013; 15:7819-30. [PMID: 23598905 DOI: 10.1039/c3cp44327b] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/15/2023]
Abstract
An α→β transition underlies the first step of disease causing amyloidogenesis in many proteins. In view of this, many studies have been carried out using peptide models to characterize these secondary structural transitions. In this paper we show that an insertion/duplication mutant 'L20' of bacteriophage T4 lysozyme (M. Sagermann, W. A. Baase and B. W. Matthews, Proc. Natl. Acad. Sci. U.S.A., 1999, 96, 6078) displays an α→β transition. We performed molecular dynamics (MD) simulation of L20, using the GROMACS package of programs and united atom GROMOS 53a6 force field for a time period of 600 ns at 300 K, in explicit water. Our MD simulation demonstrated that the transition occurs in a duplicated α-helical region inserted tandemly at the N-terminus of the 'parent' helix. We show that a C-terminal β-sheet anchors the parent helix while the loosely held N-terminal loop in the duplicate region is vulnerable to solvent attack and thus undergoes an α→β transition. Main chain-solvent interactions were seen to stabilize the observed β-structure. Thus L20 serves as a good protein model for characterization of α→β transition in a full length protein.
Collapse
Affiliation(s)
- Harpreet Kaur
- Department of Chemistry, Indian Institute of Technology Bombay, Powai, Mumbai 400 076, India
| | | |
Collapse
|
10
|
Bhattacharjee N, Biswas P. Helical ambivalency induced by point mutations. BMC STRUCTURAL BIOLOGY 2013; 13:9. [PMID: 23675772 PMCID: PMC3683331 DOI: 10.1186/1472-6807-13-9] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/29/2012] [Accepted: 05/02/2013] [Indexed: 01/15/2023]
Abstract
Background Mutation of amino acid sequences in a protein may have diverse effects on its structure and function. Point mutations of even a single amino acid residue in the helices of the non-redundant database may lead to sequentially identical peptides which adopt different secondary structures in different proteins. However, various physico-chemical factors which govern the formation of these ambivalent helices generated by point mutations of a sequence are not clearly known. Results Sequences generated by point mutations of helices are mapped on to their non-helical counterparts in the SCOP database. The results show that short helices are prone to transform into non-helical conformations upon point mutations. Mutation of amino acid residues by helix breakers preferentially yield non-helical conformations, while mutation with residues of intermediate helix propensity display least preferences for non-helical conformations. Differences in the solvent accessibility of the mutating/mutated residues are found to be a major criteria for these sequences to conform to non-helical conformations. Even with minimal differences in the amino acid distributions of the sequences flanking the helical and non-helical conformations, helix-flanking sequences are found be more solvent accessible. Conclusions All types of mutations from helical to non-helical conformations are investigated. The primary factors attributing such changes in conformation can be: i) type/propensity of the mutating and mutant residues ii) solvent accessibility of the residue at the mutation site iii) context/environment dependence of the flanking sequences. The results from the present study may be used to design de novo proteins via point mutations.
Collapse
|
11
|
Seidler NW. Dynamic oligomeric properties. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2013; 985:207-47. [PMID: 22851451 DOI: 10.1007/978-94-007-4716-6_7] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/03/2023]
Abstract
This chapter provides a foundation for further research into the relationship between dynamic oligomeric properties and functional diversity. The structural basis that underlies the conformational sub-states of the GAPDH oligomer is discussed. The issue of protein stability is given a thorough analysis, since it is well-established that the primary strategy for protein oligomerization is to stabilize conformation. Several factors that affect oligomerization are described, including chemical modification by synthetic reagents. The effects of native substrates and coenzymes are also discussed. The curious feature of chloride ions having a de-stabilizing effect on native GAPDH structure is described. Additionally, the role of adenine dinucleotides in tetramer-dimer equilibrium dynamics is suggested to be a major part of the physiological regulation of GAPDH structure and function. This chapter also contends that a vast amount of useful information can come from comparative analyses of diverse species, particularly regarding protein stability and subunit-subunit interaction. Lastly, the concept of domain exchange is introduced as a means of understanding the stabilization of dynamic oligomers, suggesting that inter-subunit contacts may also be a way of masking docking sites to other proteins.
Collapse
Affiliation(s)
- Norbert W Seidler
- Department of Biochemistry, Kansas City University of Medicine and Biosciences, Kansas City, MO, USA
| |
Collapse
|
12
|
Establishment of a human cell line stably overexpressing mouse Nip45 and characterization of Nip45 subcellular localization. Biochem Biophys Res Commun 2012; 430:72-7. [PMID: 23159618 DOI: 10.1016/j.bbrc.2012.11.020] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2012] [Accepted: 11/06/2012] [Indexed: 11/24/2022]
Abstract
The nuclear factor of activated T cells, cytoplasmic, calcineurin dependent 2 interacting protein, Nfatc2ip (Nip45), has been implicated as a crucial coordinator of the immune response and of cellular differentiation in humans and mice, and contains SUMO-like domains in its C-terminal region. However, the significance of its N-terminal region and its correlation to the SUMO modification pathway remain largely uncharacterized. In this study, a human cultured cell line was established, in which FLAG-tagged mouse Nip45 (FLAG-mNip45) was stably overexpressed. Under standard, non-stressful conditions, we detected FLAG-mNip45 diffusely distributed in the nucleus. Intriguingly, proteasome inhibition by MG132 caused FLAG-mNip45, together with SUMOylated proteins, to localize in nuclear domains associated with promyelocytic leukemia protein. Finally, using an in vitro binding assay, we showed interaction of the N-terminal region of mNip45 with both free SUMO-3 and SUMO-3 chains, indicating that Nip45 may, in part, exert its function via interaction with SUMO/SUMOylated proteins. Taken together, our study provides novel information on a poorly characterized mammalian protein and suggests that our newly established cell line will be useful for elucidating the physiological role of Nip45.
Collapse
|
13
|
Abstract
This work is a first attempt to characterise the conformational preference of structurally ambivalent helices in terms of their backbone conformational entropy. Ambivalent sequences conform to two different secondary structures (helix-sheet or helix-random coil or sheet-random coil, etc.) in two different proteins. For variable ambivalent helices, the helical conformations are found to possess less conformational entropy as compared with their non-helical counterparts when the ϕ-ψ dihedral angle range of the entire peptide segment is used to calculate the backbone conformational entropy. The favourable number of native contacts is a primary stabilising factor for these helical conformations. However, an opposite trend is observed when the ϕ-ψ angles of the individual amino acids are used to calculate the backbone conformational entropy. The results show that these peptide segments are rather reluctant to form helices, but are driven to form helices due to the favourable number of native contacts and optimum range of ϕ-ψ angle of the segments. Both procedures are validated by applying on conserved helices in the non-redundant database and their corresponding counterparts in the Structural Classification of Proteins database. Although context is a major determinant in deciding conformations of ambivalent sequences, no significant difference in the conformational entropy of sequences flanking ambivalent helical sequences in helical and non-helical forms is observed in this study. The results may be useful in understanding the structural context and environmental factors which leads to the formation of ambivalent helices and designing de novo proteins.
Collapse
|
14
|
Saravanan KM, Selvaraj S. Search for identical octapeptides in unrelated proteins: Structural plasticity revisited. Biopolymers 2011; 98:11-26. [PMID: 23325556 DOI: 10.1002/bip.21676] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2011] [Revised: 03/31/2011] [Accepted: 05/10/2011] [Indexed: 12/22/2022]
Abstract
Since proteins are dynamic in nature, they can alter their local structure in response to changes in their environment factors such as temperature, pH, phosphorylation, and binding of other small molecules. These conformational changes are extremely important for the correct folding and functioning of proteins. There are also a number of diseases associated with protein conformational change such as amyloid diseases. To stimulate research into the above factors which specify one conformation over another, different theoretical models have been proposed and tested against sequence similar distant structure protein fragments. In order to simplify the computational complexity of identifying conformational changes in proteins, various local sequence search algorithms were employed and the structural plasticity in unrelated proteins was examined by various research groups. In the present work, we revisit the mechanism of structural plasticity in unrelated proteins with increased number of structures in Protein Data Bank by comparing identical octapeptides in unrelated proteins with dictionary of protein secondary structure extracted from existing experimental data. Our goal is to bring out the influence of hydrophobic residues, hydrophilic residues, flanking residues, difference in secondary structural propensities of surrounding residues, difference in phi-psi angles and local and nonlocal interactions in identical octapeptides adopting different conformations. Also we have used surrounding hydrophobicity, environment dependent interaction energy, atomic mean force potential, structural unit contacts and difference profiles models to explore the factors which cause structural plasticity. The results discussed here may provide insights into protein folding, design and function.
Collapse
Affiliation(s)
- K M Saravanan
- Department of Bioinformatics, School of Life Sciences, Bharathidasan University, Tiruchirappalli 620 024, Tamil Nadu, India
| | | |
Collapse
|
15
|
Ruvinsky AM, Vakser IA. Sequence composition and environment effects on residue fluctuations in protein structures. J Chem Phys 2011; 133:155101. [PMID: 20969427 DOI: 10.1063/1.3498743] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023] Open
Abstract
Structure fluctuations in proteins affect a broad range of cell phenomena, including stability of proteins and their fragments, allosteric transitions, and energy transfer. This study presents a statistical-thermodynamic analysis of relationship between the sequence composition and the distribution of residue fluctuations in protein-protein complexes. A one-node-per-residue elastic network model accounting for the nonhomogeneous protein mass distribution and the interatomic interactions through the renormalized inter-residue potential is developed. Two factors, a protein mass distribution and a residue environment, were found to determine the scale of residue fluctuations. Surface residues undergo larger fluctuations than core residues in agreement with experimental observations. Ranking residues over the normalized scale of fluctuations yields a distinct classification of amino acids into three groups: (i) highly fluctuating-Gly, Ala, Ser, Pro, and Asp, (ii) moderately fluctuating-Thr, Asn, Gln, Lys, Glu, Arg, Val, and Cys, and (iii) weakly fluctuating-Ile, Leu, Met, Phe, Tyr, Trp, and His. The structural instability in proteins possibly relates to the high content of the highly fluctuating residues and a deficiency of the weakly fluctuating residues in irregular secondary structure elements (loops), chameleon sequences, and disordered proteins. Strong correlation between residue fluctuations and the sequence composition of protein loops supports this hypothesis. Comparing fluctuations of binding site residues (interface residues) with other surface residues shows that, on average, the interface is more rigid than the rest of the protein surface and Gly, Ala, Ser, Cys, Leu, and Trp have a propensity to form more stable docking patches on the interface. The findings have broad implications for understanding mechanisms of protein association and stability of protein structures.
Collapse
Affiliation(s)
- Anatoly M Ruvinsky
- Center for Bioinformatics, The University of Kansas, Lawrence, Kansas 66047, USA.
| | | |
Collapse
|
16
|
Bhattacharjee N, Biswas P. Statistical analysis and molecular dynamics simulations of ambivalent α-helices. BMC Bioinformatics 2010; 11:519. [PMID: 20955581 PMCID: PMC2973962 DOI: 10.1186/1471-2105-11-519] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2010] [Accepted: 10/18/2010] [Indexed: 11/17/2022] Open
Abstract
Background Analysis of known protein structures reveals that identical sequence fragments in proteins can adopt different secondary structure conformations. The extent of this conformational diversity is influenced by various factors like the intrinsic sequence propensity, sequence context and other environmental factors such as pH, site directed mutations or alteration of the binding ligands. Understanding the mechanism by which the environment affects the structural ambivalence of these peptides has potential implications for protein design and reliable local structure prediction algorithms. Identification of the structurally ambivalent sequence fragments and determining the rules which dictate their conformational preferences play an important role in understanding the conformational changes observed in misfolding diseases. However, a systematic classification of their intrinsic sequence patterns or a statistical analysis of their properties and sequence context in relation to the origin of their structural diversity have largely remained unexplored. Results In this work, the conformational variability of α-helices is studied by mapping sequences from the non-redundant database to identical sequences across all classes of the SCOP (Structural Classification of Proteins) database. Some helices retain their conformations when mapped in the SCOP database while others exhibit a complete/partial switch to non-helical conformations. The results clearly depict the differences in the propensities of amino acids for the variable and conserved helices. Sequences flanking these ambivalent sequence fragments have anisotropic propensities at the N- and C-termini. This structural variability is depicted by molecular dynamics simulations in explicit solvent, which show that the short conserved helices retain their conformations while their longer counterparts fray into two or more shorter helices. Variable helices in the non-redundant database exhibit a trend of retaining helical conformations while their corresponding non-helical conformations in SCOP database show large deviations from their respective initial structures by adopting partial or full helical conformations. Partially ambivalent helices are also found to retain their respective conformations. Conclusions All sequence fragments which show structural diversity in different proteins of the non-redundant database are investigated. The final conformation of these ambivalent sequences are dictated by a fine tuning of their intrinsic sequence propensity and the anisotropic amino acid propensity of the flanking sequences. This analysis may unravel the connection between diverse secondary structures, which conserve the overall structural fold of the protein thus determining its function.
Collapse
|
17
|
Liu YC, Yang MH, Lin WL, Huang CK, Oyang YJ. A sequence-based hybrid predictor for identifying conformationally ambivalent regions in proteins. BMC Genomics 2009; 10 Suppl 3:S22. [PMID: 19958486 PMCID: PMC2788375 DOI: 10.1186/1471-2164-10-s3-s22] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022] Open
Abstract
Background Proteins are dynamic macromolecules which may undergo conformational transitions upon changes in environment. As it has been observed in laboratories that protein flexibility is correlated to essential biological functions, scientists have been designing various types of predictors for identifying structurally flexible regions in proteins. In this respect, there are two major categories of predictors. One category of predictors attempts to identify conformationally flexible regions through analysis of protein tertiary structures. Another category of predictors works completely based on analysis of the polypeptide sequences. As the availability of protein tertiary structures is generally limited, the design of predictors that work completely based on sequence information is crucial for advances of molecular biology research. Results In this article, we propose a novel approach to design a sequence-based predictor for identifying conformationally ambivalent regions in proteins. The novelty in the design stems from incorporating two classifiers based on two distinctive supervised learning algorithms that provide complementary prediction powers. Experimental results show that the overall performance delivered by the hybrid predictor proposed in this article is superior to the performance delivered by the existing predictors. Furthermore, the case study presented in this article demonstrates that the proposed hybrid predictor is capable of providing the biologists with valuable clues about the functional sites in a protein chain. The proposed hybrid predictor provides the users with two optional modes, namely, the high-sensitivity mode and the high-specificity mode. The experimental results with an independent testing data set show that the proposed hybrid predictor is capable of delivering sensitivity of 0.710 and specificity of 0.608 under the high-sensitivity mode, while delivering sensitivity of 0.451 and specificity of 0.787 under the high-specificity mode. Conclusion Though experimental results show that the hybrid approach designed to exploit the complementary prediction powers of distinctive supervised learning algorithms works more effectively than conventional approaches, there exists a large room for further improvement with respect to the achieved performance. In this respect, it is of interest to investigate the effects of exploiting additional physiochemical properties that are related to conformational ambivalence. Furthermore, it is of interest to investigate the effects of incorporating lately-developed machine learning approaches, e.g. the random forest design and the multi-stage design. As conformational transition plays a key role in carrying out several essential types of biological functions, the design of more advanced predictors for identifying conformationally ambivalent regions in proteins deserves our continuous attention.
Collapse
Affiliation(s)
- Yu-Cheng Liu
- Institute of Biomedical Engineering, National Taiwan University, Taipei, Taiwan, Republic of China.
| | | | | | | | | |
Collapse
|
18
|
Kuznetsov IB, Rackovsky S. CFP: a web-server for constructing sequence-based protein conformational flexibility profiles. Bioinformation 2009; 4:176-8. [PMID: 20461153 PMCID: PMC2859570 DOI: 10.6026/97320630004176] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2009] [Accepted: 10/14/2009] [Indexed: 11/23/2022] Open
Abstract
UNLABELLED Many proteins contain conformationally flexible segments that undergo significant changes in the backbone conformation or completely lack a well-defined conformation. Previously, we have developed the generalized local propensity (GLP), a quantitative sequence-based measure of the protein backbone flexibility. In this paper, we present the CFP (Conformational Flexibility Profile) web-server that constructs the GLP flexibility profile for a user-submitted sequence and uses this profile to identify segments with high backbone flexibility. The statistical significance of a flexible sequence segment is assessed using the discrete scan statistics based on the density of flexible residues observed in this segment. AVAILABILITY CFP is publicly available at http://cfp.rit.albany.edu.
Collapse
Affiliation(s)
- Igor B Kuznetsov
- Cancer Research Center, Department of Epidemiology and Biostatistics, University at Albany, One Discovery Drive, Rensselaer, NY 12144, USA.
| | | |
Collapse
|
19
|
Hu HY, Xue W, Hu ZQ, Xiang JF, Chen CF, He SG. Probing the Dynamic Environment-Associated Conformational Conversion from Secondary to Supersecondary Structures in Oligo(phenanthroline dicarboxamide)s. J Org Chem 2009; 74:4949-57. [DOI: 10.1021/jo900647d] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- Hai-Yu Hu
- Beijing National Laboratory for Molecular Sciences, CAS Key Laboratory of Molecular Recognition and Function, Institute of Chemistry, Chinese Academy of Sciences, Beijing 100190, China, and Graduate School, Chinese Academy of Sciences, Beijing 100049, China
| | - Wei Xue
- Beijing National Laboratory for Molecular Sciences, CAS Key Laboratory of Molecular Recognition and Function, Institute of Chemistry, Chinese Academy of Sciences, Beijing 100190, China, and Graduate School, Chinese Academy of Sciences, Beijing 100049, China
| | - Zhi-Qiang Hu
- Beijing National Laboratory for Molecular Sciences, CAS Key Laboratory of Molecular Recognition and Function, Institute of Chemistry, Chinese Academy of Sciences, Beijing 100190, China, and Graduate School, Chinese Academy of Sciences, Beijing 100049, China
| | - Jun-Feng Xiang
- Beijing National Laboratory for Molecular Sciences, CAS Key Laboratory of Molecular Recognition and Function, Institute of Chemistry, Chinese Academy of Sciences, Beijing 100190, China, and Graduate School, Chinese Academy of Sciences, Beijing 100049, China
| | - Chuan-Feng Chen
- Beijing National Laboratory for Molecular Sciences, CAS Key Laboratory of Molecular Recognition and Function, Institute of Chemistry, Chinese Academy of Sciences, Beijing 100190, China, and Graduate School, Chinese Academy of Sciences, Beijing 100049, China
| | - Sheng-Gui He
- Beijing National Laboratory for Molecular Sciences, CAS Key Laboratory of Molecular Recognition and Function, Institute of Chemistry, Chinese Academy of Sciences, Beijing 100190, China, and Graduate School, Chinese Academy of Sciences, Beijing 100049, China
| |
Collapse
|
20
|
Liu BQ, Jin L, Zhu L, Li J, Huang S, Yuan M. Phosphorylation of microtubule-associated protein SB401 from Solanum berthaultii regulates its effect on microtubules. JOURNAL OF INTEGRATIVE PLANT BIOLOGY 2009; 51:235-42. [PMID: 19261066 DOI: 10.1111/j.1744-7909.2008.00797.x] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/21/2023]
Abstract
We reported previously that the protein SB401 from Solanum berthaultii binds to and bundles both microtubules and F-actin. In the current study, we investigated the regulation of SB401 activity by its phosphorylation. Our experimental results showed that the phosphorylation of SB401 by casein kinase II (CKII) downregulates the activities of SB401, namely the bundling of microtubules and enhancement of the polymerization of tubulin. However, phosphorylation of SB401 had no observable effect on its bundling of F-actin. Further investigation using extract of potato pollen indicated that a CKII-like kinase may exist in potato pollen. Antibodies against CKII alpha recognized specifically a major band from the pollen extract and the pollen extract was able to phosphorylate the SB401 protein in vitro. The CKII-like kinase showed a similar ability to downregulate the bundling of microtubules. Our experiments demonstrated that phosphorylation plays an important role in the regulation of SB401 activity. We propose that this phosphorylation may regulate the effects of SB401 on microtubules and the actin cytoskeleton.
Collapse
Affiliation(s)
- Bao-Quan Liu
- State Key Laboratory of Plant Physiology and Biochemistry, Department of Plant Sciences, College of Biological Sciences, China Agricultural University, Beijing 100094, China
| | | | | | | | | | | |
Collapse
|
21
|
Kawazoe T, Park HK, Iwana S, Tsuge H, Fukui K. Human D-amino acid oxidase: an update and review. CHEM REC 2008; 7:305-15. [PMID: 17924443 DOI: 10.1002/tcr.20129] [Citation(s) in RCA: 42] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
The flavoprotein D-amino acid oxidase (DAO) degrades the gliotransmitter D-Ser, a potent activator of N-methyl-D-aspartate-type glutamate receptors. A body of evidence suggests that DAO, together with its activator, G72 protein, may play a key role in the pathophysiology of schizophrenia. It has also been suggested that 3,4-dihydroxy-D-phenylalanine (D-DOPA), the stereoisomer of 3,4-dihydroxy-L-phenylalanine (L-DOPA), is oxidized by DAO and converted to dopamine via an alternative biosynthetic pathway. We determined the crystal structures of human DAO in complex with the reaction products of two clinically important substrates, D-Ser and D-DOPA. Kinetic data show that the maximum velocity is much greater for D-DOPA than that for D-Ser, which strongly supports the proposed alternative pathway for dopamine biosynthesis in the treatment of Parkinson's disease. In addition, biochemical characterization of human DAO indicates that it binds FAD more weakly than does porcine D-amino acid oxidase (pDAO) and exists as a stable homodimer, even in the apoprotein form. Determination of the structures of human DAO in various states reveals that, in contrast to pDAO, the hydrophobic-Val-Ala-Ala-Gly-Leu (VAAGL) stretch (residues 47-51, structurally ambivalent peptide) located at the si-face of the flavin ring assumes a uniquely stable conformation, which provides a structural basis for the unique kinetic features of human DAO.
Collapse
Affiliation(s)
- Tomoya Kawazoe
- Institute for Enzyme Research, The University of Tokushima, 3-18-15 Kuramoto, Tokushima 770-8503, Japan
| | | | | | | | | |
Collapse
|
22
|
Kuznetsov IB. Ordered conformational change in the protein backbone: Prediction of conformationally variable positions from sequence and low-resolution structural data. Proteins 2008; 72:74-87. [DOI: 10.1002/prot.21899] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023]
|
23
|
Vries JK, Liu X, Bahar I. The relationship between n-gram patterns and protein secondary structure. Proteins 2007; 68:830-8. [PMID: 17523186 DOI: 10.1002/prot.21480] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
An n-gram pattern (NP{n,m}) in a protein sequence is a set of n residues and m wildcards in a window of size n+m. Each window of n+m amino acids is associated with a collection of NP{n,m} patterns based on the combinatorics of n+m objects taken m at a time. NP{n,m} patterns that are shared between sequences reflect evolutionary relationships. Recently the authors developed an alignment-independent protein classification algorithm based on shared NP{4,2} patterns that compared favorably to PSI-BLAST. Theoretically, NP{4,2} patterns should also reflect secondary structure propensity since they contain all possible n-grams for 1 < or = n < or = 4 and a window of 6 residues is wide enough to capture periodicities in the 2 < or = n < or = 5 range. This sparked interest in differentiating the information content in NP{4,2} patterns related to evolution from the content related to local propensity. The probability of alpha-, beta-, and coil components was determined for every NP{4,2} pattern over all the chains in the Protein Data Bank (PDB). An algorithm exclusively based on the Z-values of these distributions was developed, which accurately predicted 71-76% of alpha-helical segments and 62-67% of beta-sheets in rigorous jackknife tests. This provided evidence for the strong correlation between NP{4,2} patterns and secondary structure. By grouping PDB chains into subsets with increasing levels of sequence identity, it was also possible to separate the evolutionary and local propensity contributions to the classification process. The results showed that information derived from evolutionary relationships was more important for beta-sheet prediction than alpha-helix prediction.
Collapse
Affiliation(s)
- John K Vries
- Department of Computational Biology, School of Medicine, University of Pittsburgh, Pittsburgh, Pennsylvania 15213, USA.
| | | | | |
Collapse
|
24
|
Guo JT, Jaromczyk JW, Xu Y. Analysis of chameleon sequences and their implications in biological processes. Proteins 2007; 67:548-58. [PMID: 17299764 DOI: 10.1002/prot.21285] [Citation(s) in RCA: 49] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]
Abstract
Chameleon sequences have been implicated in amyloid related diseases. Here we report an analysis of two types of chameleon sequences, chameleon-HS (Helix vs. Strand) and chameleon-HE (Helix vs. Sheet), based on known structures in Protein Data Bank. Our survey shows that the longest chameleon-HS is eight residues while the longest chameleon-HE is seven residues. We have done a detailed analysis on the local and global environment that might contribute to the unique conformation of a chameleon sequence. We found that the existence of chameleon sequences does not present a problem for secondary structure prediction programs, including the first generation prediction programs, such as Chou-Fasman algorithm, and the third generation prediction programs that utilize evolution information. We have also investigated the possible implication of chameleon sequences in structural conservation and functional diversity of alternatively spliced protein isoforms.
Collapse
Affiliation(s)
- Jun-Tao Guo
- Computational Systems Biology Laboratory, Department of Biochemistry and Molecular Biology, University of Georgia, Athens, Georgia 30602, USA
| | | | | |
Collapse
|
25
|
Kawazoe T, Tsuge H, Imagawa T, Aki K, Kuramitsu S, Fukui K. Structural basis of D-DOPA oxidation by D-amino acid oxidase: alternative pathway for dopamine biosynthesis. Biochem Biophys Res Commun 2007; 355:385-91. [PMID: 17303072 DOI: 10.1016/j.bbrc.2007.01.181] [Citation(s) in RCA: 58] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2007] [Accepted: 01/30/2007] [Indexed: 11/17/2022]
Abstract
D-amino acid oxidase (DAO) degrades the gliotransmitter D-serine, a potent endogenous ligand of N-methyl-D-aspartate type glutamate receptors. It also has been suggested that D-DOPA, the stereoisomer of L-DOPA, is oxidized by DAO and then converted to dopamine via an alternative biosynthetic pathway. Here, we provide direct crystallographic evidence that D-DOPA is readily fitted into the active site of human DAO, where it is oxidized by the enzyme. Moreover, our kinetic data show that the maximal velocity for oxidation of D-DOPA is much greater than for D-serine, which strongly supports the proposed alternative pathway for dopamine biosynthesis in the treatment of Parkinson's disease. In addition, determination of the structures of human DAO in various states revealed that the conformation of the hydrophobic VAAGL stretch (residues 47-51) to be uniquely stable in the human enzyme, which provides a structural basis for the unique kinetic features of human DAO.
Collapse
Affiliation(s)
- Tomoya Kawazoe
- The Institute for Enzyme Research, The University of Tokushima, 3-18-15 Kuramoto, Tokushima 770-8503, Japan
| | | | | | | | | | | |
Collapse
|
26
|
Patel S, Balaji PV, Sasidhar YU. The sequence TGAAKAVALVL from glyceraldehyde-3-phosphate dehydrogenase displays structural ambivalence and interconverts between α-helical and β-hairpin conformations mediated by collapsed conformational states. J Pept Sci 2007; 13:314-26. [PMID: 17437248 DOI: 10.1002/psc.843] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
The peptide TGAAKAVALVL from glyceraldehyde-3-phosphate dehydrogenase adopts a helical conformation in the crystal structure and is a site for two hydrated helical segments, which are thought to be helical folding intermediates. Overlapping sequences of four to five residues from the peptide, sample both helical and strand conformations in known protein structures, which are dissimilar to glyceraldehyde-3-phosphate dehydrogenase suggesting that the peptide may have a structural ambivalence. Molecular dynamics simulations of the peptide sequence performed for a total simulation time of 1.2 micros, starting from the various initial conformations using GROMOS96 force field under NVT conditions, show that the peptide samples a large number of conformational forms with transitions from alpha-helix to beta-hairpin and vice versa. The peptide, therefore, displays a structural ambivalence. The mechanism from alpha-helix to beta-hairpin transition and vice versa reveals that the compact bends and turns conformational forms mediate such conformational transitions. These compact structures including helices and hairpins have similar hydrophobic radius of gyration (Rgh) values suggesting that similar hydrophobic interactions govern these conformational forms. The distribution of conformational energies is Gaussian with helix sampling lowest energy followed by the hairpins and coil. The lowest potential energy of the full helix may enable the peptide to take up helical conformation in the crystal structure of the glyceraldehyde-3-phosphate dehydrogenase, even though the peptide has a preference for hairpin too. The relevance of folding and unfolding events observed in our simulations to hydrophobic collapse model of protein folding are discussed.
Collapse
Affiliation(s)
- Sunita Patel
- Department of Chemistry, Indian Institute of Technology Bombay, Powai, Mumbai 400 076, India
| | | | | |
Collapse
|
27
|
Kawazoe T, Tsuge H, Pilone MS, Fukui K. Crystal structure of human D-amino acid oxidase: context-dependent variability of the backbone conformation of the VAAGL hydrophobic stretch located at the si-face of the flavin ring. Protein Sci 2006; 15:2708-17. [PMID: 17088322 PMCID: PMC2242440 DOI: 10.1110/ps.062421606] [Citation(s) in RCA: 86] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
Abstract
In the brain, the extensively studied FAD-dependent enzyme D-amino acid oxidase (DAO) degrades the gliotransmitter D-serine, a potent activator of N-methyl-D-aspartate type glutamate receptors, and evidence suggests that DAO, together with its activator G72 protein, may play a key role in the pathophysiology of schizophrenia. Indeed, its potential clinical importance highlights the need for structural and functional analyses of human DAO. We recently succeeded in purifying human DAO, and found that it weakly binds FAD and shows a significant slower rate of flavin reduction compared with porcine DAO. However, the molecular basis for the different kinetic features remains unclear because the active site of human DAO was considered to be virtually identical to that of porcine DAO, as would be expected from the 85% sequence identity. To address this issue, we determined the crystal structure of human DAO in complex with a competitive inhibitor benzoate, at a resolution of 2.5 Angstrom. The overall dimeric structure of human DAO is similar to porcine DAO, and the catalytic residues are fully conserved at the re-face of the flavin ring. However, at the si-face of the flavin ring, despite the strict sequence identity, a hydrophobic stretch (residues 47-51, VAAGL) exists in a significantly different conformation compared with both of the independently determined porcine DAO-benzoate structures. This suggests that a context-dependent conformational variability of the hydrophobic stretch accounts for the low affinity for FAD as well as the slower rate of flavin reduction, thus highlighting the unique features of the human enzyme.
Collapse
Affiliation(s)
- Tomoya Kawazoe
- The Institute for Enzyme Research, The University of Tokushima, 3-18-15 Kuramoto, Tokushima 770-8503, Japan
| | | | | | | |
Collapse
|
28
|
Ambroggio XI, Kuhlman B. Design of protein conformational switches. Curr Opin Struct Biol 2006; 16:525-30. [PMID: 16765587 DOI: 10.1016/j.sbi.2006.05.014] [Citation(s) in RCA: 94] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2006] [Revised: 05/03/2006] [Accepted: 05/26/2006] [Indexed: 11/25/2022]
Abstract
Protein conformational switches are ubiquitous in nature and often regulate key biological processes. To design new proteins that can switch conformation, protein designers have focused on the two key components of protein switches: the amino acid sequence must be compatible with the multiple target states and there must be a mechanism for perturbing the relative stability of these states. Proteins have been designed that can switch between folded and disordered states, between distinct folded states and between different aggregation states. A variety of trigger mechanisms have been used, including pH shifts, post-translational modification and ligand binding. Recently, computational protein design methods have been applied to switch design. These include algorithms for designing novel ligand-binding sites and simultaneously optimizing a sequence for multiple target structures.
Collapse
Affiliation(s)
- Xavier I Ambroggio
- Department of Biochemistry and Biophysics, University of North Carolina, Chapel Hill, NC 27599, USA
| | | |
Collapse
|
29
|
Bodén M, Bailey TL. Identifying sequence regions undergoing conformational change via predicted continuum secondary structure. Bioinformatics 2006; 22:1809-14. [PMID: 16720586 DOI: 10.1093/bioinformatics/btl198] [Citation(s) in RCA: 17] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
MOTIVATION Conformational flexibility is essential to the function of many proteins, e.g. catalytic activity. To assist efforts in determining and exploring the functional properties of a protein, it is desirable to automatically identify regions that are prone to undergo conformational changes. It was recently shown that a probabilistic predictor of continuum secondary structure is more accurate than categorical predictors for structurally ambivalent sequence regions, suggesting that such models are suited to characterize protein flexibility. RESULTS We develop a computational method for identifying regions that are prone to conformational change directly from the amino acid sequence. The method uses the entropy of the probabilistic output of an 8-class continuum secondary structure predictor. Results for 171 unique amino acid sequences with well-characterized variable structure (identified in the 'Macromolecular movements database') indicate that the method is highly sensitive at identifying flexible protein regions, but false positives remain a problem. The method can be used to explore conformational flexibility of proteins (including hypothetical or synthetic ones) whose structure is yet to be determined experimentally. AVAILABILITY The predictor, sequence data and supplementary studies are available at http://pprowler.itee.uq.edu.au/sspred/ and are free for academic use.
Collapse
Affiliation(s)
- Mikael Bodén
- School of Information Technology and Electrical Engineering, QLD 4072, The University of Queensland Australia.
| | | |
Collapse
|
30
|
Ganapathiraju M, Manoharan V, Klein-Seetharaman J. BLMT: statistical sequence analysis using N-grams. ACTA ACUST UNITED AC 2005; 3:193-200. [PMID: 15693744 DOI: 10.2165/00822942-200403020-00013] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]
Abstract
UNLABELLED Statistical analysis of amino acid and nucleotide sequences, especially sequence alignment, is one of the most commonly performed tasks in modern molecular biology. However, for many tasks in bioinformatics, the requirement for the features in an alignment to be consecutive is restrictive and "n-grams" (aka k-tuples) have been used as features instead. N-grams are usually short nucleotide or amino acid sequences of length n, but the unit for a gram may be chosen arbitrarily. The n-gram concept is borrowed from language technologies where n-grams of words form the fundamental units in statistical language models. Despite the demonstrated utility of n-gram statistics for the biology domain, there is currently no publicly accessible generic tool for the efficient calculation of such statistics. Most sequence analysis tools will disregard matches because of the lack of statistical significance in finding short sequences. This article presents the integrated Biological Language Modeling Toolkit (BLMT) that allows efficient calculation of n-gram statistics for arbitrary sequence datasets. AVAILABILITY BLMT can be downloaded from http://www.cs.cmu.edu/~blmt/source and installed for standalone use on any Unix platform or Unix shell emulation such as Cygwin on the Windows platform. Specific tools and usage details are described in a "readme" file. The n-gram computations carried out by the BLMT are part of a broader set of tools borrowed from language technologies and modified for statistical analysis of biological sequences; these are available at http://flan.blm.cs.cmu.edu/.
Collapse
Affiliation(s)
- Madhavi Ganapathiraju
- Language Technologies Institute, Carnegie Mellon University, 4400 Fifth Avenue, Pittsburgh, PA 15213, USA
| | | | | |
Collapse
|
31
|
Betschinger J, Eisenhaber F, Knoblich JA. Phosphorylation-induced autoinhibition regulates the cytoskeletal protein Lethal (2) giant larvae. Curr Biol 2005; 15:276-82. [PMID: 15694314 DOI: 10.1016/j.cub.2005.01.012] [Citation(s) in RCA: 127] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2004] [Revised: 12/03/2004] [Accepted: 12/06/2004] [Indexed: 11/28/2022]
Abstract
During asymmetric cell division, cell fate determinants localize asymmetrically and segregate into one of the two daughter cells. In Drosophila neuroblasts, the asymmetric localization of cell fate determinants to the basal cell cortex requires aPKC. aPKC localizes to the apical cell cortex and phosphorylates the cytoskeletal protein Lethal (2) giant larvae (Lgl). Upon phosphorylation, Lgl dissociates from the cytoskeleton and becomes inactive. Here, we show that phosphorylation regulates Lgl by allowing an autoinhibitory interaction of the N terminus with the C terminus of the protein. We demonstrate that interaction with the cytoskeleton is mediated by a C-terminal domain while the N terminus is not required. Instead, the N terminus can bind to the C terminus and can compete for binding to the cytoskeleton. Interaction between the N- and C-terminal domains requires phosphorylation of Lgl by aPKC. Our results suggest that unphosphorylated, active Lgl exists in an open conformation that interacts with the cytoskeleton while phosphorylation changes the protein to an autoinhibited state.
Collapse
Affiliation(s)
- Joerg Betschinger
- Institute of Molecular Biotechnology of the Austrian Academy of Sciences, Dr. Bohr Gasse 3-5, 1030 Vienna, Austria
| | | | | |
Collapse
|
32
|
Kuznetsov IB, Rackovsky S. Comparative computational analysis of prion proteins reveals two fragments with unusual structural properties and a pattern of increase in hydrophobicity associated with disease-promoting mutations. Protein Sci 2005; 13:3230-44. [PMID: 15557265 PMCID: PMC2287303 DOI: 10.1110/ps.04833404] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
Abstract
Prion diseases are a group of neurodegenerative disorders associated with conversion of a normal prion protein, PrPC, into a pathogenic conformation, PrPSc. The PrPSc is thought to promote the conversion of PrPC. The structure and stability of PrPC are well characterized, whereas little is known about the structure of PrPSc, what parts of PrPC undergo conformational transition, or how mutations facilitate this transition. We use a computational knowledge-based approach to analyze the intrinsic structural propensities of the C-terminal domain of PrP and gain insights into possible mechanisms of structural conversion. We compare the properties of PrP sequences to those of a PrP paralog, Doppel, and to the distributions of structural propensities observed in known protein structures from the Protein Data Bank. We show that the prion protein contains at least two sequence fragments with highly unusual intrinsic propensities, PrP(114-125) and helix B. No segments with unusual properties were found in Doppel protein, which is topologically identical to PrP but does not undergo structural rearrangements. Known disease-promoting PrP mutations form a statistically significant cluster in the region comprising helices B and C. Due to their unusual properties, PrP(114-125) and the C terminus of helix B may be considered as primary candidates for sites involved in conformational transition from PrPC to PrPSc. The results of our study also show that most PrP mutations associated with neurodegenerative disorders increase local hydrophobicity. We suggest that the observed increase in hydrophobicity may facilitate PrP-to-PrP or/and PrP-to-cofactor interactions, and thus promote structural conversion.
Collapse
Affiliation(s)
- Igor B Kuznetsov
- Department of Biomathematical Sciences, Mount Sinai School of Medicine, New York, NY 10029, USA.
| | | |
Collapse
|
33
|
Novatchkova M, Bachmair A, Eisenhaber B, Eisenhaber F. Proteins with two SUMO-like domains in chromatin-associated complexes: the RENi (Rad60-Esc2-NIP45) family. BMC Bioinformatics 2005; 6:22. [PMID: 15698469 PMCID: PMC549199 DOI: 10.1186/1471-2105-6-22] [Citation(s) in RCA: 45] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2004] [Accepted: 02/07/2005] [Indexed: 01/03/2023] Open
Abstract
BACKGROUND Post-translational modification by Small Ubiquitin-like Modifiers (SUMO) has been implicated in protein targeting, in the maintenance of genomic integrity and in transcriptional control. But the specific molecular effects of SUMO modification on many target proteins remain to be elucidated. Recent findings point at the importance of SUMO-mediated histone NAD-dependent deacetylase (HDAC) recruitment in transcriptional regulation. RESULTS We describe the RENi family of SUMO-like domain proteins (SDP) with the unique feature of typically containing two carboxy-terminal SUMO-like domains. Using sequence analytic evidence, we collect family members from animals, fungi and plants, most prominent being yeast Rad60, Esc2 and mouse NIP45 http://mendel.imp.univie.ac.at/SEQUENCES/reni/. Different proteins of the novel family are known to interact directly with histone NAD-dependent deacetylases (HDACs), structural maintenance of chromosomes (SMC) proteins, and transcription factors. In particular, the highly non-trivial designation of the first of the two successive SUMO-domains in non-plant RENi provides a rationale for previously published functionally impaired mutant variants. CONCLUSIONS Till now, SUMO-like proteins have been studied exclusively in the context of their covalent conjugation to target proteins. Here, we present the exciting possibility that SUMO domain proteins, similarly to ubiquitin modifiers, have also evolved in a second line - namely as multi-domain proteins that are non-covalently attached to their target proteins. We suggest that the SUMO stable fusion proteins of the RENi family, which we introduce in this work, might mimic SUMO and share its interaction motifs (in analogy to the way that ubiquitin-like domains mimic ubiquitin). This presumption is supported by parallels in the spectrum of modified or bound proteins e.g. transcription factors and chromatin-associated proteins and in the recruitment of HDAC-activity.
Collapse
Affiliation(s)
- Maria Novatchkova
- Gregor Mendel-Institut GMI, Austrian Academy of Sciences, Vienna Biocenter, A-1030 Vienna, Austria
| | - Andreas Bachmair
- Max Planck Institute for Plant Breeding Research, Carl-von-Linné-Weg 10, D-50829 Cologne, Germany
| | - Birgit Eisenhaber
- Research Institute of Molecular Pathology, Dr. Bohr-Gasse 7, A-1030 Vienna, Austria
| | - Frank Eisenhaber
- Research Institute of Molecular Pathology, Dr. Bohr-Gasse 7, A-1030 Vienna, Austria
| |
Collapse
|
34
|
Ganapathiraju M, Balakrishnan N, Reddy R, Klein-Seetharaman J. Computational Biology and Language. AMBIENT INTELLIGENCE FOR SCIENTIFIC DISCOVERY 2005. [DOI: 10.1007/978-3-540-32263-4_2] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
|
35
|
Gunasekaran K, Tsai CJ, Nussinov R. Analysis of ordered and disordered protein complexes reveals structural features discriminating between stable and unstable monomers. J Mol Biol 2004; 341:1327-41. [PMID: 15321724 DOI: 10.1016/j.jmb.2004.07.002] [Citation(s) in RCA: 131] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2004] [Revised: 07/01/2004] [Accepted: 07/02/2004] [Indexed: 12/26/2022]
Abstract
Most proteins exist in the cell as multi-component assemblies. However, which proteins need to be present simultaneously in order to perform a given function is frequently unknown. The first step toward this goal would be to predict proteins that can function only when in a complexed form. Here, we propose a scheme to distinguish whether the protein components are ordered (stable) or disordered when separated from their complexed partners. We analyze structural characteristics of several types of complexes, such as natively unstructured proteins, ribosomal proteins, two-state and three-state complexes, and crystal-packing dimers. Our analysis makes use of the fact that natively unstructured proteins, which undergo a disorder-to-order transition upon binding their partner, and stable monomeric proteins, which exist as dimers only in their crystal form, provide examples of two vastly different scenarios. We find that ordered monomers can be distinguished from disordered monomers on the basis of the per-residue surface and interface areas, which are significantly smaller for ordered proteins. With this scale, two-state dimers (where the monomers unfold upon dimer separation) and ribosomal proteins are shown to resemble disordered proteins. On the other hand, crystal-packing dimers, whose monomers are stable in solution, fall into the ordered protein category. While there should be a continuum in the distributions, nevertheless, the per-residue scale measures the confidence in the determination of whether a protein can exist as a stable monomer. Further analysis, focusing on the chemical and contact preferences at the interface, interior and exposed surface areas, reveals that disordered proteins lack a strong hydrophobic core and are composed of highly polar surface area. We discuss the implication of our results for de novo design of stable monomeric proteins and peptides.
Collapse
Affiliation(s)
- Kannan Gunasekaran
- Laboratory of Experimental and Computational Biology, Basic Research Program, SAIC-Frederick, Inc., NCI-Frederick, Frederick, MD 21702, USA.
| | | | | |
Collapse
|
36
|
Fuxreiter M, Simon I, Friedrich P, Tompa P. Preformed structural elements feature in partner recognition by intrinsically unstructured proteins. J Mol Biol 2004; 338:1015-26. [PMID: 15111064 DOI: 10.1016/j.jmb.2004.03.017] [Citation(s) in RCA: 423] [Impact Index Per Article: 21.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2003] [Revised: 03/02/2004] [Accepted: 03/02/2004] [Indexed: 12/25/2022]
Abstract
Intrinsically unstructured proteins (IUPs) are devoid of extensive structural order but often display signs of local and limited residual structure. To explain their effective functioning, we reasoned that such residual structure can be crucial in their interactions with their structured partner(s) in a way that preformed structural elements presage their final conformational state. To check this assumption, a database of 24 IUPs with known 3D structures in the bound state has been assembled and the distribution of secondary structure elements and backbone torsion angles have been analysed. The high proportion of residues in coil conformation and with phi, psi angles in the disallowed regions of the Ramachandran map compared to the reference set of globular proteins shows that IUPs are not fully ordered even in their bound form. To probe the effect of partner proteins on IUP folding, inherent conformational preferences of IUP sequences have been assessed by secondary structure predictions using the GOR, ALB and PROF algorithms. The accuracy of predicting secondary structure elements of IUPs is similar to that of their partner proteins and is significantly higher than the corresponding values for random sequences. We propose that strong conformational preferences mark regions in IUPs (mostly helices), which correspond to their final structural state, while regions with weak conformational preferences represent flexible linkers between them. In our interpretation, preformed elements could serve as initial contact points, the binding of which facilitates the reeling of the flexible regions onto the template. This finding implies that IUPs draw a functional advantage from preformed structural elements, as they enable their facile, kinetically and energetically less demanding, interaction with their physiological partner.
Collapse
Affiliation(s)
- Monika Fuxreiter
- Institute of Enzymology, Biological Research Center, Hungarian Academy of Sciences, Budapest, Hungary
| | | | | | | |
Collapse
|