1
|
Pourahmadi M, Shirdel A, Jamshidi N, Jafarian V, Khalifeh K. Comparing similar versions of a connecting helix on the structure of Chondroitinase ABC I. Enzyme Microb Technol 2022; 160:110073. [DOI: 10.1016/j.enzmictec.2022.110073] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2022] [Revised: 05/20/2022] [Accepted: 06/02/2022] [Indexed: 11/16/2022]
|
2
|
Bondarenko V, Wells MM, Chen Q, Tillman TS, Singewald K, Lawless MJ, Caporoso J, Brandon N, Coleman JA, Saxena S, Lindahl E, Xu Y, Tang P. Structures of highly flexible intracellular domain of human α7 nicotinic acetylcholine receptor. Nat Commun 2022; 13:793. [PMID: 35145092 PMCID: PMC8831596 DOI: 10.1038/s41467-022-28400-x] [Citation(s) in RCA: 22] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2021] [Accepted: 01/10/2022] [Indexed: 11/24/2022] Open
Abstract
The intracellular domain (ICD) of Cys-loop receptors mediates diverse functions. To date, no structure of a full-length ICD is available due to challenges stemming from its dynamic nature. Here, combining nuclear magnetic resonance (NMR) and electron spin resonance experiments with Rosetta computations, we determine full-length ICD structures of the human α7 nicotinic acetylcholine receptor in a resting state. We show that ~57% of the ICD residues are in highly flexible regions, primarily in a large loop (loop L) with the most mobile segment spanning ~50 Å from the central channel axis. Loop L is anchored onto the MA helix and virtually forms two smaller loops, thereby increasing its stability. Previously known motifs for cytoplasmic binding, regulation, and signaling are found in both the helices and disordered flexible regions, supporting the essential role of the ICD conformational plasticity in orchestrating a broad range of biological processes. The intracellular domain (ICD) of Cys-loop receptors mediates many of their functions, but no complete structure of a Cys-loop receptor ICD is available to date. Here, the authors combine NMR and ESR spectroscopy to determine the full-length ICD structures of the human α7 nicotinic acetylcholine receptor (α7nAChR).
Collapse
Affiliation(s)
- Vasyl Bondarenko
- Depatment of Anesthesiology and Perioperative Medicine, University of Pittsburgh, Pittsburgh, PA, 15260, USA
| | - Marta M Wells
- Depatment of Anesthesiology and Perioperative Medicine, University of Pittsburgh, Pittsburgh, PA, 15260, USA
| | - Qiang Chen
- Depatment of Anesthesiology and Perioperative Medicine, University of Pittsburgh, Pittsburgh, PA, 15260, USA
| | - Tommy S Tillman
- Depatment of Anesthesiology and Perioperative Medicine, University of Pittsburgh, Pittsburgh, PA, 15260, USA
| | - Kevin Singewald
- Department of Chemistry, University of Pittsburgh, Pittsburgh, PA, 15260, USA
| | - Matthew J Lawless
- Department of Chemistry, University of Pittsburgh, Pittsburgh, PA, 15260, USA
| | - Joel Caporoso
- Depatment of Anesthesiology and Perioperative Medicine, University of Pittsburgh, Pittsburgh, PA, 15260, USA
| | - Nicole Brandon
- Depatment of Anesthesiology and Perioperative Medicine, University of Pittsburgh, Pittsburgh, PA, 15260, USA
| | - Jonathan A Coleman
- Department of Structural Biology, University of Pittsburgh, Pittsburgh, PA, 15260, USA
| | - Sunil Saxena
- Department of Chemistry, University of Pittsburgh, Pittsburgh, PA, 15260, USA
| | - Erik Lindahl
- Department of Biochemistry and Biophysics, Science for Life Laboratory, Stockholm University, Solna, Sweden.,Department of Applied Physics, Swedish e-Science Research Center, KTH Royal Institute of Technology, Solna, Sweden
| | - Yan Xu
- Depatment of Anesthesiology and Perioperative Medicine, University of Pittsburgh, Pittsburgh, PA, 15260, USA.,Department of Structural Biology, University of Pittsburgh, Pittsburgh, PA, 15260, USA.,Department of Pharmacology and Chemical Biology, University of Pittsburgh, Pittsburgh, PA, 15260, USA.,Department of Physics and Astronomy, University of Pittsburgh, Pittsburgh, PA, 15260, USA
| | - Pei Tang
- Depatment of Anesthesiology and Perioperative Medicine, University of Pittsburgh, Pittsburgh, PA, 15260, USA. .,Department of Pharmacology and Chemical Biology, University of Pittsburgh, Pittsburgh, PA, 15260, USA. .,Department of Computational and Systems Biology, University of Pittsburgh, Pittsburgh, PA, 15260, USA.
| |
Collapse
|
3
|
Fenner K, Redgate A, Brancaleon L. A 200 nanoseconds all-atom simulation of the pH-dependent EF loop transition in bovine β-lactoglobulin. The role of the orientation of the E89 side chain. J Biomol Struct Dyn 2022; 40:549-564. [PMID: 32909899 PMCID: PMC8853732 DOI: 10.1080/07391102.2020.1817785] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023]
Abstract
In silico molecular dynamics (MD) using crystallographic and NMR data was used to simulate the effects of the protonation state of E89 on the pH-dependent conformational rearrangement of the EF loop, also known as the Tanford transition, in a series of apo-β-lactoglobulin (BLG) structures. Compared to existing studies these simulations were carried out over a much longer time scale (200 ns where the stability of the transition can be evaluated) and used an explicit water model. We considered eight different entries from the Brookhaven Protein Data Bank (PDB) separated into two groups. We observed that fixing the protonation state of E89 prompts the transition of the EF loop only when its side chain is oriented under the loop and into the entrance of the interior cavity. The motion of the EF loop occurs mostly as a step-function and its timing varies greatly from ∼ 20 ns to ∼170 ns from the beginning of the simulation. Once the transition is completed, the protein appears to reach a stable conformation as in a true two-state transition. We also observed novel findings. When the transition occurs, the hydrogen bond between E89 and S116 is replaced with a salt bridge with Lys residues in the βC-CD loop-βD motif. This electrostatic interaction causes the distortion of this motif as well as the protrusion of the GH loop into the aperture of the cavity with the result of limiting the increase of its contour area despite the rotation of the EF loop.Communicated by Ramaswamy H. Sarma.
Collapse
|
4
|
Fischer S, Stegmann F, Gnanapragassam VS, Lepenies B. From structure to function – Ligand recognition by myeloid C-type lectin receptors. Comput Struct Biotechnol J 2022; 20:5790-5812. [DOI: 10.1016/j.csbj.2022.10.019] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2022] [Revised: 10/14/2022] [Accepted: 10/14/2022] [Indexed: 11/29/2022] Open
|
5
|
Richaud AD, Zhao G, Hobloss S, Roche SP. Folding in Place: Design of β-Strap Motifs to Stabilize the Folding of Hairpins with Long Loops. J Org Chem 2021; 86:13535-13547. [PMID: 34499510 PMCID: PMC8576641 DOI: 10.1021/acs.joc.1c01442] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
Despite their pivotal role in defining antibody affinity and protein function, β-hairpins harboring long noncanonical loops remain synthetically challenging because of the large entropic penalty associated with their conformational folding. Little is known about the contribution and impact of stabilizing motifs on the folding of β-hairpins with loops of variable length and plasticity. Here, we report a design of minimalist β-straps (strap = strand + cap) that offset the entropic cost of long-loop folding. The judicious positioning of noncovalent interactions (hydrophobic cluster and salt-bridge) within the novel 8-mer β-strap design RW(V/H)W···WVWE stabilizes hairpins with up to 10-residue loops of varying degrees of plasticity (Tm up to 52 °C; 88 ± 1% folded at 18 °C). This "hyper" thermostable β-strap outperforms the previous gold-standard technology of β-strand-β-cap (16-mer) and provides a foundation for producing new classes of long hairpins as a viable and practical alternative to macrocyclic peptides.
Collapse
Affiliation(s)
- Alexis D Richaud
- Department of Chemistry and Biochemistry, Florida Atlantic University, Boca Raton, Florida 33431, United States
| | - Guangkuan Zhao
- Department of Chemistry and Biochemistry, Florida Atlantic University, Boca Raton, Florida 33431, United States
| | - Samir Hobloss
- Department of Chemistry and Biochemistry, Florida Atlantic University, Boca Raton, Florida 33431, United States
| | - Stéphane P Roche
- Department of Chemistry and Biochemistry, Florida Atlantic University, Boca Raton, Florida 33431, United States
| |
Collapse
|
6
|
Wang Y, Shin I, Li J, Liu A. Crystal structure of human cysteamine dioxygenase provides a structural rationale for its function as an oxygen sensor. J Biol Chem 2021; 297:101176. [PMID: 34508780 PMCID: PMC8503633 DOI: 10.1016/j.jbc.2021.101176] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2021] [Revised: 09/03/2021] [Accepted: 09/05/2021] [Indexed: 01/03/2023] Open
Abstract
Cysteamine dioxygenase (ADO) plays a vital role in regulating thiol metabolism and preserving oxygen homeostasis in humans by oxidizing the sulfur of cysteamine and N-terminal cysteine-containing proteins to their corresponding sulfinic acids using O2 as a cosubstrate. However, as the only thiol dioxygenase that processes both small-molecule and protein substrates, how ADO handles diverse substrates of disparate sizes to achieve various reactions is not understood. The knowledge gap is mainly due to the three-dimensional structure not being solved, as ADO cannot be directly compared with other known thiol dioxygenases. Herein, we report the first crystal structure of human ADO at a resolution of 1.78 Å with a nickel-bound metal center. Crystallization was achieved through both metal substitution and C18S/C239S double mutations. The metal center resides in a tunnel close to an entry site flanked by loops. While ADO appears to use extensive flexibility to handle substrates of different sizes, it also employs proline and proline pairs to maintain the core protein structure and to retain the residues critical for catalysis in place. This feature distinguishes ADO from thiol dioxygenases that only oxidize small-molecule substrates, possibly explaining its divergent substrate specificity. Our findings also elucidate the structural basis for ADO functioning as an oxygen sensor by modifying N-degron substrates to transduce responses to hypoxia. Thus, this work fills a gap in structure–function relationships of the thiol dioxygenase family and provides a platform for further mechanistic investigation and therapeutic intervention targeting impaired oxygen sensing.
Collapse
Affiliation(s)
- Yifan Wang
- Department of Chemistry, The University of Texas at San Antonio, Texas, USA
| | - Inchul Shin
- Department of Chemistry, The University of Texas at San Antonio, Texas, USA
| | - Jiasong Li
- Department of Chemistry, The University of Texas at San Antonio, Texas, USA
| | - Aimin Liu
- Department of Chemistry, The University of Texas at San Antonio, Texas, USA.
| |
Collapse
|
7
|
Saravanan KM, Peng Y, Wei Y. Systematic analysis of NO Regular Secondary structural regions (NORS) in membrane and non-membrane proteins. J Biomol Struct Dyn 2019; 38:268-274. [PMID: 30616457 DOI: 10.1080/07391102.2019.1566092] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
Affiliation(s)
- Konda Mani Saravanan
- Center for High Performance Computing, Joint Engineering Research Center for Health Big Data Intelligent Analysis Technology, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen, PR China
| | - Yin Peng
- Department of Pathology, The Shenzhen University School of Medicine, Shenzhen, PR China
| | - Yanjie Wei
- Center for High Performance Computing, Joint Engineering Research Center for Health Big Data Intelligent Analysis Technology, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen, PR China
| |
Collapse
|
8
|
Berezovsky IN, Guarnera E, Zheng Z. Basic units of protein structure, folding, and function. PROGRESS IN BIOPHYSICS AND MOLECULAR BIOLOGY 2016; 128:85-99. [PMID: 27697476 DOI: 10.1016/j.pbiomolbio.2016.09.009] [Citation(s) in RCA: 32] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/29/2016] [Revised: 09/05/2016] [Accepted: 09/26/2016] [Indexed: 10/20/2022]
Abstract
Study of the hierarchy of domain structure with alternative sets of domains and analysis of discontinuous domains, consisting of remote segments of the polypeptide chain, raised a question about the minimal structural unit of the protein domain. The hypothesis on the decisive role of the polypeptide backbone in determining the elementary units of globular proteins have led to the discovery of closed loops. It is reviewed here how closed loops form the loop-n-lock structure of proteins, providing the foundation for stability and designability of protein folds/domain and underlying their co-translational folding. Simplified protein sequences are considered here with the aim to explore the basic principles that presumably dominated the folding and stability of proteins in the early stages of structural evolution. Elementary functional loops (EFLs), closed loops with one or few catalytic residues, are, in turn, units of the protein function. They are apparent descendants of the prebiotic ring-like peptides, which gave rise to the first functional folds/domains being fused in the beginning of the evolution of protein structure. It is also shown how evolutionary relations between protein functional superfamilies and folds delineated with the help of EFLs can contribute to establishing the rules for design of desired enzymatic functions. Generalized descriptors of the elementary functions are proposed to be used as basic units in the future computational design.
Collapse
Affiliation(s)
- Igor N Berezovsky
- Bioinformatics Institute (BII), Agency for Science, Technology and Research (A*STAR), 30 Biopolis Street, #07-01, Matrix, 138671, Singapore; Department of Biological Sciences (DBS), National University of Singapore (NUS), 8 Medical Drive, 117579, Singapore.
| | - Enrico Guarnera
- Bioinformatics Institute (BII), Agency for Science, Technology and Research (A*STAR), 30 Biopolis Street, #07-01, Matrix, 138671, Singapore
| | - Zejun Zheng
- Bioinformatics Institute (BII), Agency for Science, Technology and Research (A*STAR), 30 Biopolis Street, #07-01, Matrix, 138671, Singapore
| |
Collapse
|
9
|
Das R. Atomic-accuracy prediction of protein loop structures through an RNA-inspired Ansatz. PLoS One 2013; 8:e74830. [PMID: 24204571 PMCID: PMC3804535 DOI: 10.1371/journal.pone.0074830] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2013] [Accepted: 08/07/2013] [Indexed: 11/18/2022] Open
Abstract
Consistently predicting biopolymer structure at atomic resolution from sequence alone remains a difficult problem, even for small sub-segments of large proteins. Such loop prediction challenges, which arise frequently in comparative modeling and protein design, can become intractable as loop lengths exceed 10 residues and if surrounding side-chain conformations are erased. Current approaches, such as the protein local optimization protocol or kinematic inversion closure (KIC) Monte Carlo, involve stages that coarse-grain proteins, simplifying modeling but precluding a systematic search of all-atom configurations. This article introduces an alternative modeling strategy based on a ‘stepwise ansatz’, recently developed for RNA modeling, which posits that any realistic all-atom molecular conformation can be built up by residue-by-residue stepwise enumeration. When harnessed to a dynamic-programming-like recursion in the Rosetta framework, the resulting stepwise assembly (SWA) protocol enables enumerative sampling of a 12 residue loop at a significant but achievable cost of thousands of CPU-hours. In a previously established benchmark, SWA recovers crystallographic conformations with sub-Angstrom accuracy for 19 of 20 loops, compared to 14 of 20 by KIC modeling with a comparable expenditure of computational power. Furthermore, SWA gives high accuracy results on an additional set of 15 loops highlighted in the biological literature for their irregularity or unusual length. Successes include cis-Pro touch turns, loops that pass through tunnels of other side-chains, and loops of lengths up to 24 residues. Remaining problem cases are traced to inaccuracies in the Rosetta all-atom energy function. In five additional blind tests, SWA achieves sub-Angstrom accuracy models, including the first such success in a protein/RNA binding interface, the YbxF/kink-turn interaction in the fourth ‘RNA-puzzle’ competition. These results establish all-atom enumeration as an unusually systematic approach to ab initio protein structure modeling that can leverage high performance computing and physically realistic energy functions to more consistently achieve atomic accuracy.
Collapse
Affiliation(s)
- Rhiju Das
- Departments of Biochemistry and Physics, Stanford University, Stanford, California, United States of America
- * E-mail:
| |
Collapse
|
10
|
Thukral L, Shenoy SR, Bhushan K, Jayaram B. ProRegIn: A regularity index for the selection of native-like tertiary structures of proteins. J Biosci 2007; 32:71-81. [PMID: 17426381 DOI: 10.1007/s12038-007-0007-2] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Automated protein tertiary structure prediction from sequence information alone remains an elusive goal to computational prescriptions. Dividing the problem into three stages viz. secondary structure prediction, generation of plausible main chain loop dihedrals and side chain dihedral optimization, considerable progress has been achieved in our laboratory (http://www.scfbio-iitd.res.in/bhageerath/index.jsp) and elsewhere for proteins with less than 100 amino acids. As a part of our on-going efforts in this direction and to facilitate tertiary structure selection/rejection in containing the combinatorial explosion of trial structures for a specified amino acid sequence, we describe here a web-enabled tool ProRegIn (Protein Regularity Index) developed based on the regularity in the Phi, Psi dihedral angles of the amino acids that constitute loop regions. We have analysed the dihedrals in loop regions in a non-redundant dataset of 7351 proteins drawn from the Protein Data Bank and categorized them as helix-like or sheet-like (regular) or irregular. We noticed that the regularity thus defined exceeds 86% for Phi barring glycine and 70% for Psi for all the amino acid side chains including glycine, compelling us to reexamine the conventional view that loops are irregular regions structurally. The regularity index is presented here as a simple tool that finds its application in protein structure analysis as a discriminatory scoring function for rapid screening before the more compute intensive atomic level energy calculations could be undertaken. The tool is made freely accessible over the internet at www.scfbio-iitd.res.in/software/proregin.jsp.
Collapse
Affiliation(s)
- Lipi Thukral
- Department of Chemistry and Supercomputing Facility for Bioinformatics and Computational Biology, Indian Institute of Technology Delhi, Hauz Khas, New Delhi 110 016, India
| | | | | | | |
Collapse
|
11
|
Szarecka A, Meirovitch H. Optimization of the GB/SA solvation model for predicting the structure of surface loops in proteins. J Phys Chem B 2006; 110:2869-80. [PMID: 16471897 PMCID: PMC1945207 DOI: 10.1021/jp055771+] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Implicit solvation models are commonly optimized with respect to experimental data or Poisson-Boltzmann (PB) results obtained for small molecules, where the force field is sometimes not considered. In previous studies, we have developed an optimization procedure for cyclic peptides and surface loops in proteins based on the entire system studied and the specific force field used. Thus, the loop has been modeled by the simplified solvation function E(tot) = E(FF) (epsilon = 2r) + Sigma(i) sigma(i)A(i), where E(FF) (epsilon = nr) is the AMBER force field energy with a distance-dependent dielectric function, epsilon = nr, A(i) is the solvent accessible surface area of atom i, and sigma(i) is its atomic solvation parameter. During the optimization process, the loop is free to move while the protein template is held fixed in its X-ray structure. To improve on the results of this model, in the present work we apply our optimization procedure to the physically more rigorous solvation model, the generalized Born with surface area (GB/SA) (together with the all-atom AMBER force field) as suggested by Still and co-workers (J. Phys. Chem. A 1997, 101, 3005). The six parameters of the GB/SA model, namely, P(1)-P(5) and the surface area parameter, sigma (programmed in the TINKER package) are reoptimized for a "training" group of nine loops, and a best-fit set is defined from the individual sets of optimized parameters. The best-fit set and Still's original set of parameters (where Lys, Arg, His, Glu, and Asp are charged or neutralized) were applied to the training group as well as to a "test" group of seven loops, and the energy gaps and the corresponding RMSD values were calculated. These GB/SA results based on the three sets of parameters have been found to be comparable; surprisingly, however, they are somewhat inferior (e.g, of larger energy gaps) to those obtained previously from the simplified model described above. We discuss recent results for loops obtained by other solvation models and potential directions for future studies.
Collapse
Affiliation(s)
- Agnieszka Szarecka
- Department of Computational Biology, University of Pittsburgh School of Medicine, Suite 3064, BST 3, 3501 Fifth Avenue, Pittsburgh, PA 15213
| | - Hagai Meirovitch
- Department of Computational Biology, University of Pittsburgh School of Medicine, Suite 3064, BST 3, 3501 Fifth Avenue, Pittsburgh, PA 15213
| |
Collapse
|
12
|
White RP, Meirovitch H. Minimalist explicit solvation models for surface loops in proteins. J Chem Theory Comput 2006; 2:1135-1151. [PMID: 17429495 PMCID: PMC1851699 DOI: 10.1021/ct0503217] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
We have performed molecular dynamics simulations of protein surface loops solvated by explicit water, where a prime focus of the study is the small numbers (e.g., ~100) of explicit water molecules employed. The models include only part of the protein (typically 500 - 1000 atoms), and the water molecules are restricted to a region surrounding the loop. In this study, the number of water molecules (N(w)) is systematically varied, and convergence with large N(w) is monitored to reveal N(w)(min), the minimum number required for the loop to exhibit realistic (fully hydrated) behavior. We have also studied protein surface coverage, as well as diffusion and residence times for water molecules as a function of N(w). A number of other modeling parameters are also tested. These include the number of environmental protein atoms explicitly considered in the model, as well as two ways to constrain the water molecules to the vicinity of the loop (where we find one of these methods to perform better when N(w) is small). The results (for RMSD and its fluctuations for four loops) are further compared to much larger, fully solvated systems (using ~10,000 water molecules under periodic boundary conditions and Ewald electrostatics), and to results for the GBSA implicit solvation model. We find that the loop backbone can stabilize with a surprisingly small number of water molecules (as low as 5 molecules per amino acid residue). The side chains of the loop require somewhat larger N(w), where the atomic fluctuations become too small if N(w) is further reduced. Thus, in general, we find adequate hydration to occur at roughly 12 water molecules per residue. This is an important result, because at this hydration level, computational times are comparable to those required for GBSA. Therefore these "minimalist explicit models" can provide a viable and potentially more accurate alternative. The importance of protein loop modeling is discussed in the context of these, and other, loop models, along with other challenges including the relevance of appropriate free energy simulation methodology for assessment of conformational stability.
Collapse
Affiliation(s)
- Ronald P. White
- Department of Computational Biology, University of Pittsburgh School of Medicine, Biomedical Science Tower3, 3064 Pittsburgh, PA 15260
| | - Hagai Meirovitch
- Department of Computational Biology, University of Pittsburgh School of Medicine, Biomedical Science Tower3, 3064 Pittsburgh, PA 15260
| |
Collapse
|
13
|
De S, Sur K, Dasgupta S. Characterization of the nonregular regions of proteins by a contortion index. Biopolymers 2005; 79:63-73. [PMID: 15962279 DOI: 10.1002/bip.20333] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Abstract
Nonstructured regions in proteins that provide the link between two regular structured regions play a significant role in maintaining the scaffold of the protein. Not only do they act as connectors between two regular secondary structural elements of proteins but they also provide the necessary turn or reversal in the polypeptide chain. This incorporates flexibility in the structure. Thus an understanding of the structural aspects of the nonregular regions is necessary to have a better insight into these features. We can assume the nonregular region to be a contorted polypeptide segment tethered by regular secondary structured regions at both ends. To describe the undulating nature of the nonregular regions, we introduce a parameter called the "contortion index." This index describes how tortuously the region is organized. Our analysis shows that the contortion index is related to other physicochemical parameters and can be used to characterize the nonregular regions of proteins.
Collapse
Affiliation(s)
- Subhajyoti De
- Department of Chemistry, Indian Institute of Technology, Kharagpur 721 302, India
| | | | | |
Collapse
|
14
|
Tendulkar AV, Joshi AA, Sohoni MA, Wangikar PP. Clustering of Protein Structural Fragments Reveals Modular Building Block Approach of Nature. J Mol Biol 2004; 338:611-29. [PMID: 15081817 DOI: 10.1016/j.jmb.2004.02.047] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2003] [Revised: 02/11/2004] [Accepted: 02/17/2004] [Indexed: 11/29/2022]
Abstract
Structures of peptide fragments drawn from a protein can potentially occupy a vast conformational continuum. We co-ordinatize this conformational space with the help of geometric invariants and demonstrate that the peptide conformations of the currently available protein structures are heavily biased in favor of a finite number of conformational types or structural building blocks. This is achieved by representing a peptides' backbone structure with geometric invariants and then clustering peptides based on closeness of the geometric invariants. This results in 12,903 clusters, of which 2207 are made up of peptides drawn from functionally and/or structurally related proteins. These are termed "functional" clusters and provide clues about potential functional sites. The rest of the clusters, including the largest few, are made up of peptides drawn from unrelated proteins and are termed "structural" clusters. The largest clusters are of regular secondary structures such as helices and beta strands as well as of beta hairpins. Several categories of helices and strands are discovered based on geometric differences. In addition to the known classes of loops, we discover several new classes, which will be useful in protein structure modeling. Our algorithm does not require assignment of secondary structure and, therefore, overcomes the limitations in loop classification due to ambiguity in secondary structure assignment at loop boundaries.
Collapse
Affiliation(s)
- Ashish V Tendulkar
- Kanwal Rekhi School of Information Technology, Indian Institute of Technology, Bombay, Powai, Mumbai 400 076, India
| | | | | | | |
Collapse
|
15
|
Zemlin M, Klinger M, Link J, Zemlin C, Bauer K, Engler JA, Schroeder HW, Kirkham PM. Expressed murine and human CDR-H3 intervals of equal length exhibit distinct repertoires that differ in their amino acid composition and predicted range of structures. J Mol Biol 2004; 334:733-49. [PMID: 14636599 DOI: 10.1016/j.jmb.2003.10.007] [Citation(s) in RCA: 261] [Impact Index Per Article: 13.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
Abstract
Immunoglobulin junctional diversity is concentrated in the third complementarity-determining region of the heavy chain (CDR-H3), which often plays a dominant role in antigen binding. The range of CDR-H3 lengths in mouse is shorter than in human, and thus the murine repertoire could be presumed to be a subset of the human one. To test this presumption, we analyzed 4751 human and 2170 murine unique, functional, published CDR-H3 intervals. Although tyrosine, glycine, and serine were found to predominate in both species, the human sequences contained fewer tyrosine residues, more proline residues, and more hydrophobic residues (p<0.001, respectively). While changes in amino acid utilization as a function of CDR-H3 length followed similar trends in both species, murine and human CDR-H3 intervals of identical length were found to differ from each other. These differences reflect both divergence of germline diversity and joining gene sequence and somatic selection. Together, these factors promote the production of a rather uniform repertoire in mice of tyrosine-enriched CDR-H3 loops with stabilized hydrogen bond-ladders versus a much more diverse repertoire in human that contains CDR-H3 loops sculpted by the presence of intra-chain disulfide bonds due to germline-encoded cysteine residues as well as the enhanced presence of somatically generated proline residues that preclude hydrogen bond ladder formation. Thus, despite the presumed need to recognize a similar range of antigen epitopes, the murine CDR-H3 repertoire is clearly distinct from its human counterpart in its amino acid composition and its predicted range of structures. These findings represent a benchmark to which CDR-H3 repertoires can be compared to better characterize and understand the shaping of the CDR-H3 repertoire over evolution and during immune responses. This information may also be useful for the design of species-specific CDR-H3 sequences in synthetic antibody libraries.
Collapse
Affiliation(s)
- Michael Zemlin
- Department of Medicine, University of Alabama at Birmingham, 1530 3rd Avenue South, Birmingham, AL 35294-3300, USA
| | | | | | | | | | | | | | | |
Collapse
|
16
|
Guruprasad K, Rao MJ, Adindla S, Guruprasad L. Combinations of turns in proteins. THE JOURNAL OF PEPTIDE RESEARCH : OFFICIAL JOURNAL OF THE AMERICAN PEPTIDE SOCIETY 2003; 62:167-74. [PMID: 12969196 DOI: 10.1034/j.1399-3011.2003.00086.x] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
Abstract
We observed that beta- and gamma-turns in protein structure may be associated as peptides representing combinations of turns that span between nine and 26 amino acid residues along the polypeptide backbone chain and often correspond to loops in the protein structure. Around 475 peptides resulted from the analysis of a non-redundant data set corresponding to 248 protein crystal structures selected from the Protein Data Bank. Nearly 40% protein chains are associated with two or more peptides and the peptides with nine and 10 amino acid residues are more frequent. A maximum of four distinct peptides varying in number of amino acid residues were observed in at least 10 proteins along the same protein chain. Nearly 80% peptides comprise type IV beta-turns that are associated with irregular dihedral angle values suggesting this may be important for the conformational diversity associated with the loops in proteins. In general, predominant interactions that possibly stabilize these peptides involve main-chain and side-chain interactions with solvent, in addition to hydrogen bond, salt-bridge and non-bonded interactions. Majority of the peptides were observed in hydrolase, oxidoreductase, transferase, serine proteinase/inhibitor complex, electron transport/electron transfer and lyase proteins.
Collapse
Affiliation(s)
- K Guruprasad
- Centre for Cellular and Molecular Biology, Uppal Road, Hyderabad 500 007, India.
| | | | | | | |
Collapse
|
17
|
Abstract
Most globular proteins are divisible by domains, distinct substructures of the globule. The notion of hierarchy of the domains was introduced earlier via van der Waals energy profiles that allow one to subdivide the proteins into domains (subdomains). The question remains open as to what is the possible structural connection of the energy profiles. The recent discovery of the loop-n-lock elements in the globular proteins suggests such a structural connection. A direct comparison of the segmentation by van der Waals energy criteria with the maps of the locked loops of nearly standard size reveals a striking correlation: domains in general appear to consist of one to several such loops. In addition, it was demonstrated that a variety of subdivisions of the same protein into domains is just a regrouping of the loop-n-lock elements.
Collapse
Affiliation(s)
- Igor N Berezovsky
- Department of Structural Biology, The Weizmann Institute of Science, P.O.B. 26, Rehovot 76100, Israel.
| |
Collapse
|
18
|
Collis AVJ, Brouwer AP, Martin ACR. Analysis of the antigen combining site: correlations between length and sequence composition of the hypervariable loops and the nature of the antigen. J Mol Biol 2003; 325:337-54. [PMID: 12488099 DOI: 10.1016/s0022-2836(02)01222-6] [Citation(s) in RCA: 194] [Impact Index Per Article: 9.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
Abstract
It has long been suggested that the overall shape of the antigen combining site (ACS) of antibodies is correlated with the nature of the antigen. For example, deep pockets are characteristic of antibodies that bind haptens, grooves indicate peptide binders, while antibodies that bind to proteins have relatively flat combining sites. In 1996, MacCallum, Martin and Thornton used a fractal shape descriptor and showed a strong correlation of the shape of the binding region with the general nature of the antigen.However, the shape of the ACS is determined primarily by the lengths of the six complementarity-determining regions (CDRs). Here, we make a direct correlation between the lengths of the CDRs and the nature of the antigen. In addition, we show significant differences in the residue composition of the CDRs of antibodies that bind to different antigen classes. As well as helping us to understand the process of antigen recognition, autoimmune disease and cross-reactivity, these results are of direct application in the design of antibody phage libraries and modification of affinity.
Collapse
Affiliation(s)
- Abigail V J Collis
- School of Animal and Microbial Sciences, University of Reading, Whiteknights, P.O. Box 228, UK
| | | | | |
Collapse
|
19
|
Berezovsky IN, Kirzhner VM, Kirzhner A, Trifonov EN. Protein folding: looping from hydrophobic nuclei. Proteins 2001; 45:346-50. [PMID: 11746682 DOI: 10.1002/prot.1155] [Citation(s) in RCA: 39] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
Protein structure can be viewed as a compact linear array of nearly standard size closed loops of 25-30 amino acid residues (Berezovsky et al., FEBS Letters 2000; 466: 283-286) irrespective of details of secondary structure. The end-to-end contacts in the loops are likely to be hydrophobic, which is a testable hypothesis. This notion could be verified by direct comparison of the loop maps with Kyte and Doolittle hydropathicity plots. This analysis reveals that most of the ends of the loops are hydrophobic, indeed. The same conclusion is reached on the basis of positional autocorrelation analysis of protein sequences of 23 fully sequenced bacterial genomes. Hydrophobic residues valine, alanine, glycine, leucine, and isoleucine appear preferentially at the 25-30 residues distance one from another. These observations open a new perspective in the understanding of protein structure and folding: a consecutive looping of the polypeptide chain with the loops ending primarily at hydrophobic nuclei.
Collapse
Affiliation(s)
- I N Berezovsky
- Department of Structural Biology, The Weizmann Institute of Science, Rehovot, Israel.
| | | | | | | |
Collapse
|
20
|
Abstract
Protein chains make numerous returns in globules, thus forming loops, closed by tight residue-to-residue contacts-closed loops. Previous statistical analysis of the sizes and locations of the closed loops in all major protein folds revealed that the loops have an almost standard contour length of 25-30 amino acid residues and follow one after another along the chain. In this work the closed loops of the major folds are presented in three dimensions. A special image filtering procedure is introduced that allows one to visualize the standard size closed loops for the first time. The loop positions along the sequences are verified by detection of loop-end clusters.
Collapse
Affiliation(s)
- I N Berezovsky
- Department of Structural Biology, The Weizmann Institute of Science, P.O.B. 26, Rehovot 76100, Israel.
| | | |
Collapse
|
21
|
Cleverley RM, Zheng N, Gierasch LM. The cost of exposing a hydrophobic loop and implications for the functional role of 4.5 S RNA in the Escherichia coli signal recognition particle. J Biol Chem 2001; 276:19327-31. [PMID: 11278844 DOI: 10.1074/jbc.m011130200] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
The signal recognition particle (SRP) is an RNA-protein complex that directs ribosomes to the rough endoplasmic reticulum membrane by binding to targeting signals found on the nascent chain of proteins destined for export to the endoplasmic reticulum. We found evidence from studies with fragments of the protein component of the Escherichia coli SRP that a long hydrophobic loop (the so-called "finger loop") is detrimental to the stability of its signal peptide-binding domain, the M domain. This hydrophobic loop is highly conserved and thus may have a critical role in the function of the SRP. Given our previously reported evidence that 4.5 S RNA stabilizes the tertiary fold of the M domain (Zheng, N., and Gierasch, L. M. (1997) Mol. Cell 1, 79-87), we now propose that the functional requirement for 4.5 S RNA resides in its ability to counteract the destabilizing influence of the finger loop.
Collapse
Affiliation(s)
- R M Cleverley
- Department of Biochemistry and Molecular Biology, University of Massachusetts, Amherst, Massachussets 01003, USA
| | | | | |
Collapse
|
22
|
Das B, Meirovitch H. Optimization of solvation models for predicting the structure of surface loops in proteins. Proteins 2001; 43:303-14. [PMID: 11288180 DOI: 10.1002/prot.1041] [Citation(s) in RCA: 19] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
A novel procedure for optimizing the atomic solvation parameters (ASPs) sigma(i) developed recently for cyclic peptides is extended to surface loops in proteins. The loop is free to move, whereas the protein template is held fixed in its X-ray structure. The energy is E(tot) = E(FF)(epsilon = nr) + summation operator sigma(i)A(i), where E(FF)(epsilon = nr) is the force-field energy of the loop-loop and loop-template interactions, epsilon = nr is a distance-dependent dielectric constant, and n is an additional parameter to be optimized. A(i) is the solvent-accessible surface area of atom i. The optimal sigma(i) and n are those for which the loop structure with the global minimum of E(tot)(n, sigma(i)) becomes the experimental X-ray structure. Thus, the ASPs depend on the force field and are optimized in the protein environment, unlike commonly used ASPs such as those of Wesson and Eisenberg (Protein Sci 1992;1:227-235). The latter are based on the free energy of transfer of small molecules from the gas phase to water and have been traditionally combined with various force fields without further calibration. We found that for loops the all-atom AMBER force field performed better than OPLS and CHARMM22. Two sets of ASPs [based on AMBER (n = 2)], optimized independently for loops 64-71 and 89-97 of ribonuclease A, were similar and thus enabled the definition of a best-fit set. All these ASPs were negative (hydrophilic), including those for carbon. Very good (i.e., small) root-mean-square-deviation values from the X-ray loop structure were obtained with the three sets of ASPs, suggesting that the best-fit set would be transferable to loops in other proteins as well. The structure of loop 13-24 is relatively stretched and was insensitive to the effect of the ASPs.
Collapse
Affiliation(s)
- B Das
- School of Computational Science and Information Technology, Florida State University, Tallahassee, FL 32306-4052, USA
| | | |
Collapse
|
23
|
Abstract
In a globular protein the polypeptide chain returns to itself many times, making numerous chain-to-chain contacts. The stability of these contacts is maintained primarily by van der Waals interactions. In this work we isolated and analysed van der Waals contacts that stabilise spatial structures of nine major folds. We suggest a specific way to identify the tightest contacts of prime importance for the stability of a given crystallized protein and introduce the notion of the van der Waals lock. The loops closed by the van der Waals interactions provide a basically novel view of protein globule organization: the loop-n-lock structure. This opens a new perspective in understanding protein folding as well: the consecutive looping of the polypeptide chain and the locking of the loop ends by tight van der Waals interactions.
Collapse
Affiliation(s)
- I N Berezovsky
- Department of Structural Biology, The Weizmann Institute of Science, Rehovot, 76100, Israel.
| | | |
Collapse
|
24
|
|
25
|
Hahin R, Chen Z, Reddy G. The isolation and characterization of a peptide that alters sodium channels from Buthus martensii Karsch. Toxicon 2000; 38:645-60. [PMID: 10673157 DOI: 10.1016/s0041-0101(99)00180-4] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
The peptides were purified using gel filtration, ion exchange, FPLC, and HPLC chromatography and found to greatly prolong action potentials at nanomolar concentrations when applied to frog and mouse nerves. The N-terminal primary amino acid sequence of one of the peptides, BMK 16(5), was determined. The first 23 amino acids of BMK 16(5) were found to be: VKDGYIADDRNCPYFCGRNAYYD. The two cysteine residues in the sequence appeared as Edman sequence cycle blanks; however, they were assigned to be cysteines due to sequence similarity to other peptide toxins that bind to sodium channels and identification of the presence of cysteines obtained from single time point amino acid analysis. The MW of BMK 16(5) was determined by a Perkin Elmer API 300 LC/MS/MS to be 3,695. The amino acid residues of BMK 16(5) show strong similarity with the first 23 amino acid residues of a number of scorpion alpha neurotoxins. Unlike these neurotoxins, BMK 16(5) possesses a proline residue at position 13 which will likely make it fold in a unique way so as to bind to and alter sodium channels.
Collapse
Affiliation(s)
- R Hahin
- Biological Sciences, Northern Illinois University, DeKalb 60115, USA
| | | | | |
Collapse
|
26
|
Berezovsky IN, Grosberg AY, Trifonov EN. Closed loops of nearly standard size: common basic element of protein structure. FEBS Lett 2000; 466:283-6. [PMID: 10682844 DOI: 10.1016/s0014-5793(00)01091-7] [Citation(s) in RCA: 112] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
Abstract
By screening the crystal protein structure database for close Calpha-Calpha contacts, a size distribution of the closed loops is generated. The distribution reveals a maximum at 27+/-5 residues, the same for eukaryotic and prokaryotic proteins. This is apparently a consequence of polymer statistic properties of protein chain trajectory. That is, closure into the loops depends on the flexibility (persistence length) of the chain. The observed preferential loop size is consistent with the theoretical optimal loop closure size. The mapping of the detected unit-size loops on the sequences of major typical folds reveals an almost regular compact consecutive arrangement of the loops. Thus, a novel basic element of protein architecture is discovered; structurally diverse closed loops of the particular size.
Collapse
Affiliation(s)
- I N Berezovsky
- Department of Structural Biology, The Weizmann Institute of Science, Rehovot, Israel.
| | | | | |
Collapse
|
27
|
Li W, Liang S, Wang R, Lai L, Han Y. Exploring the conformational diversity of loops on conserved frameworks. PROTEIN ENGINEERING 1999; 12:1075-86. [PMID: 10611401 DOI: 10.1093/protein/12.12.1075] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Abstract
Loops are structurally variable regions, but the secondary structural elements bracing loops are often conserved. Motifs with similar secondary structures exist in the same and different protein families. In this study, we made an all-PDB-based analysis and produced 495 motif families accessible from the Internet. Every motif family contains some variable loops spanning a common framework (a pair of secondary structures). The diversity of loops and the convergence of frameworks were examined. In addition, we also identified 119 loops with conformational changes in different PDB files. These materials can give some directions for functional loop design and flexible docking.
Collapse
Affiliation(s)
- W Li
- Institute of Physical Chemistry, Peking University, Beijing 100871, China
| | | | | | | | | |
Collapse
|
28
|
Pal L, Basu G. Novel protein structural motifs containing two-turn and longer 3(10)-helices. PROTEIN ENGINEERING 1999; 12:811-4. [PMID: 10556239 DOI: 10.1093/protein/12.10.811] [Citation(s) in RCA: 39] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Abstract
The 3(10)-helix constitutes a small but significant fraction of secondary structural elements in proteins. Protein data base surveys have shown these helices to be present as alpha-helical extensions, in loops and as connectors between beta-strands. The present work focuses on two-turn and longer 3(10)-helices where we establish that two-turn and longer 3(10) helices, unlike the more abundant single-turn 3(10)-helices, frequently occur independent of any other contiguous secondary structural elements. More importantly, a large fraction of these independent two-turn and longer 3(10)-helices, along with alpha-helices and beta-strands, are found to form novel super-secondary structural motifs in several proteins with possible implications for protein folding, local conformational relaxation and biological functions.
Collapse
Affiliation(s)
- L Pal
- Department of Biophysics, Bose Institute, P-1/12 CIT Scheme VIIM, Calcutta 700 054, India
| | | |
Collapse
|
29
|
Wojcik J, Mornon JP, Chomilier J. New efficient statistical sequence-dependent structure prediction of short to medium-sized protein loops based on an exhaustive loop classification. J Mol Biol 1999; 289:1469-90. [PMID: 10373380 DOI: 10.1006/jmbi.1999.2826] [Citation(s) in RCA: 71] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
A bank of 13,563 loops from three to eight amino acid residues long, representing motifs between two consecutive regular secondary structures, has been derived from protein structures presenting less than 95 % sequence identity. Statistical analyses of occurrences of conformations and residues revealed length-dependent over-representations of particular amino acids (glycine, proline, asparagine, serine, and aspartate) and conformations (alphaL, epsilon, betaPregions of the Ramachandran plot). A position-dependent distribution of these occurrences was observed for N and C-terminal residues, which are correlated to the nature of the flanking regions. Loops of the same length were clustered into statistically meaningful families on the basis of their backbone structures when placed in a common reference frame, independent of the flanks. These clusters present significantly different distributions of sequence, conformations, and endpoint residue Calphadistances. On the basis of the sequence-structure correlation of this clustering, an automatic loop modeling algorithm was developed. Based on the knowledge of its sequence and of its flank backbone structures each query loop is assigned to a family and target loop supports are selected in this family. The support backbones of these target loops are then adjusted on flanking structures by partial exploration of the conformational space. Loop closure is performed by energy minimization for each support and the final model is chosen among connected supports based upon energy criteria. The quality of the prediction is evaluated by the root-mean-square deviation (rmsd) between the final model and the native loops when the whole bank is re-attributed on itself with a Jackknife test. This average rmsd ranges from 1.1 A for three-residue loops to 3.8 A for eight-residue loops. A few poorly predicted loops are inescapable, considering the high level of diversity in loops and the lack of environment data. To overcome such modeling problems, a statistical reliability score was assigned for each prediction. This score is correlated to the quality of the prediction, in terms of rmsd, and thus improves the selection accuracy of the model. The algorithm efficiency was compared to CASP3 target loop predictions. Moreover, when tested on a test loop bank, this algorithm was shown to be robust when the loops are not precisely delimited, therefore proving to be a useful tool in practice for protein modeling.
Collapse
Affiliation(s)
- J Wojcik
- Systèmes Moléculaires et Biologie Structurale Laboratoire de Minéralogie-Cristallographie (LMCP), Universités Paris VI et Paris VII, Cedex 05, Paris, CNRS UMR7590, France
| | | | | |
Collapse
|
30
|
Abstract
A general problem in comparative modeling and protein design is the conformational evaluation of loops with a certain sequence in specific environmental protein frameworks. Loops of different sequences and structures on similar scaffolds are common in the Protein Data Bank (PDB). In order to explore both structural and sequential diversity of them, a data base of loops connecting similar secondary structure fragments is constructed by searching the data base of families of structurally similar proteins and PDB. A total of 84 loop families having 2-13 residues are found among the well-determined structures of resolution better than 2.5 A. Eight alpha-alpha, 20 alpha-beta, 19 beta-alpha, and 37 beta-beta families are identified. Every family contains more than 5 loop motifs. In each family, no loops share same sequence and all the frameworks are well superimposed. Forty-three new loop classes are distinguished in the data base. The structural variability of loops in homologous proteins are examined and shown in 44 families. Motif families are characterized with geometric parameters and sequence patterns. The conformations of loops in each family are clustered into subfamilies using average linkage cluster analysis method. Information such as geometric properties, sequence profile, sequential and structural variability in loop, structural alignment parameters, sequence similarities, and clustering results are provided. Correlations between the conformation of loops and loop sequence, motif sequence, and global sequence of PDB chain are examined in order to find how loop structures depend on their sequences and how they are affected by the local and global environment. Strong correlations (R > 0.75) are only found in 24 families. The best R value is 0.98. The data base is available through the Internet.
Collapse
Affiliation(s)
- W Li
- Institute of Physical Chemistry, Peking University, Beijing, P. R. China
| | | | | |
Collapse
|
31
|
Brancaccio A, Schulthess T, Gesemann M, Engel J. The N-terminal region of alpha-dystroglycan is an autonomous globular domain. EUROPEAN JOURNAL OF BIOCHEMISTRY 1997; 246:166-72. [PMID: 9210479 DOI: 10.1111/j.1432-1033.1997.00166.x] [Citation(s) in RCA: 53] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
Abstract
The structure of the N-terminal region of mouse alpha-dystroglycan (DGN) was investigated by expression of two protein fragments (residues 30-180 and 30-438) in Escherichia coli cells. Trypsin susceptibility experiments show the presence of a stable alpha-dystroglycan N-terminal region (approximately from residue 30 to 315). In addition, guanidinium hydrochloride (Gdn/HCl) denaturation of DGN-(30-438)-peptide, monitored by means of tryptophan fluorescence, produces a cooperative transition typical of folded protein structures. These results strongly suggest that the alpha-dystroglycan N-terminal is an autonomous folding unit preluding a flexible mucin-like region and that its folding is not influenced by the absence of glycosylation. In order to obtain more information on the structural features of the N-terminal domain we have also used circular dichroism, analytical sedimentation and electron microscopy analysis. Circular dichroic spectra show the absence of typical secondary structure (e.g. alpha-helix or beta-sheet) and closely resemble those recorded for loop-containing proteins. This is consistent with a sequence similarity of the alpha-dystroglycan domain with the loop-containing protein elastase. Analytical ultracentrifugation and electron microscopy analysis reveal that the N-terminal domain has a globular structure. DGN-(30-438)-peptide does not bind in the nanomolar range to an iodinated agrin fragment which binds with high affinity to tissue purified alpha-dystroglycan. No binding was detected also to laminin. This result suggests that the alpha-dystroglycan N-terminal domain does not contain the binding site to its extracellular matrix binding partners. It is less likely than the lack of glycosylation reduces its binding affinity, because the N-terminal globular domain only contains two glycosylation sites.
Collapse
Affiliation(s)
- A Brancaccio
- Department of Biophysical Chemistry, Biozentrum, University of Basel, Switzerland
| | | | | | | |
Collapse
|
32
|
Oliva B, Bates PA, Querol E, Avilés FX, Sternberg MJ. An automated classification of the structure of protein loops. J Mol Biol 1997; 266:814-30. [PMID: 9102471 DOI: 10.1006/jmbi.1996.0819] [Citation(s) in RCA: 161] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
Abstract
Conformational clusters and consensus sequences for protein loops have been derived by computational analysis of their structures in a non-redundant set of 233 proteins with less than 25% sequence homology (X-ray resolution better than 2.5 A). Loops have been classified into five types (alpha-alpha, beta-beta links, beta-beta hairpins, alpha-beta and beta-alpha) according to the secondary structures they embrace. Four variables have been used to describe the loop geometry, three angles and one distance between the secondary structure elements embracing the loop. Ramachandran angles (phi, psi) are used to define the loop conformations within each brace geometry. All loops from the non-redundant set have been clustered by means of these geometric features. A total of 56 classes (9 alpha-alpha, 11 beta-beta links, 14 beta-beta hairpins, 13 alpha-beta and 9 beta-alpha) were identified with consensus Ramachandran angles in the loops. These classes were divided into subclasses based on the brace geometry. This clustering procedure captures most of the clusters analysed by predominantly visual inspection methods and finds other clusters that have hitherto not been described. Consensus sequence patterns were identified for the subclasses. An extensive characterisation of loop conformations has therefore been achieved and the computational approach is readily open to the incorporation of information from newly determined structures. These clusters should also enhance model building by comparison studies.
Collapse
Affiliation(s)
- B Oliva
- Institut de Biologia Fonamental, Universitat Autònoma de Barcelona, Spain
| | | | | | | | | |
Collapse
|
33
|
Nagi AD, Regan L. An inverse correlation between loop length and stability in a four-helix-bundle protein. FOLDING & DESIGN 1997; 2:67-75. [PMID: 9080200 DOI: 10.1016/s1359-0278(97)00007-2] [Citation(s) in RCA: 158] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
Abstract
BACKGROUND The loops in proteins are less well characterized than the secondary structural elements that they connect. We have used the four-helix-bundle protein Rop as a model system in which to explore the role of loop length in protein folding and stability. RESULTS A natural two-residue loop was replaced with a series of glycine linkers up to 10 residues in length. All 10 mutants are highly helical dimers that retain wild-type RNA-binding activity. As loop length is increased, the stability of Rop toward thermal and chemical denaturation is progressively decreased. CONCLUSIONS All the mutants assume a wild-type-like structure, which suggests that the natural loop does not actively dictate the final protein fold. The strong inverse correlation observed between loop length and stability is well described by a simple polymer model in which the entropy of loop closure is the dominant energetic term. Our results emphasize the importance of optimization of loop length to successful protein design.
Collapse
Affiliation(s)
- A D Nagi
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT 06520, USA
| | | |
Collapse
|
34
|
Donate LE, Rufino SD, Canard LH, Blundell TL. Conformational analysis and clustering of short and medium size loops connecting regular secondary structures: a database for modeling and prediction. Protein Sci 1996; 5:2600-16. [PMID: 8976569 PMCID: PMC2143328 DOI: 10.1002/pro.5560051223] [Citation(s) in RCA: 82] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/03/2023]
Abstract
Loops are regions of nonrepetitive conformation connecting regular secondary structures. We identified 2,024 loops of one to eight residues in length, with acceptable main-chain bond lengths and peptide bond angles, from a database of 223 protein and protein-domain structures. Each loop is characterized by its sequence, main-chain conformation, and relative disposition of its bounding secondary structures as described by the separation between the tips of their axes and the angle between them. Loops, grouped according to their length and type of their bounding secondary structures, were superposed and clustered into 161 conformational classes, corresponding to 63% of all loops. Of these, 109 (51% of the loops) were populated by at least four nonhomologous loops or four loops sharing a low sequence identity. Another 52 classes, including 12% of the loops, were populated by at least three loops of low sequence similarity from three or fewer nonhomologous groups. Loop class suprafamilies resulting from variations in the termini of secondary structures are discussed in this article. Most previously described loop conformations were found among the classes. New classes included a 2:4 type IV hairpin, a helix-capping loop, and a loop that mediates dinucleotide-binding. The relative disposition of bounding secondary structures varies among loop classes, with some classes such as beta-hairpins being very restrictive. For each class, sequence preferences as key residues were identified; those most frequently at these conserved positions than in proteins were Gly, Asp, Pro, Phe, and Cys. Most of these residues are involved in stabilizing loop conformation, often through a positive phi conformation or secondary structure capping. Identification of helix-capping residues and beta-breakers among the highly conserved positions supported our decision to group loops according to their bounding secondary structures. Several of the identified loop classes were associated with specific functions, and all of the member loops had the same function; key residues were conserved for this purpose, as is the case for the parvalbumin-like calcium-binding loops. A significant number, but not all, of the member loops of other loop classes had the same function, as is the case for the helix-turn-helix DNA-binding loops. This article provides a systematic and coherent conformational classification of loops, covering a broad range of lengths and all four combinations of bounding secondary structure types, and supplies a useful basis for modelling of loop conformations where the bounding secondary structures are known or reliably predicted.
Collapse
Affiliation(s)
- L E Donate
- Imperial Cancer Research Fund, Department of Crystallography, Birkbeck College, University of London, United Kingdom
| | | | | | | |
Collapse
|