Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Zheng Q, Rosenfeld R, DeLisi C, Kyle DJ. Multiple copy sampling in protein loop modeling: computational efficiency and sensitivity to dihedral angle perturbations. Protein Sci 1994;3:493-506. [PMID: 8019420 PMCID: PMC2142699 DOI: 10.1002/pro.5560030315] [Citation(s) in RCA: 44] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023]

For:	Zheng Q, Rosenfeld R, DeLisi C, Kyle DJ. Multiple copy sampling in protein loop modeling: computational efficiency and sensitivity to dihedral angle perturbations. Protein Sci 1994;3:493-506. [PMID: 8019420 PMCID: PMC2142699 DOI: 10.1002/pro.5560030315] [Citation(s) in RCA: 44] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023]

Number

Cited by Other Article(s)

Kolodny R, Guibas L, Levitt M, Koehl P. Inverse Kinematics in Biology: The Protein Loop Closure Problem. Int J Rob Res 2016. [DOI: 10.1177/0278364905050352] [Citation(s) in RCA: 52] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]

Fiser A. Protein structure modeling in the proteomics era. Expert Rev Proteomics 2014;1:97-110. [PMID: 15966803 DOI: 10.1586/14789450.1.1.97] [Citation(s) in RCA: 56] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]

Modeling Structures and Motions of Loops in Protein Molecules. ENTROPY 2012. [DOI: 10.3390/e14020252] [Citation(s) in RCA: 37] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]

Application of biasing-potential replica-exchange simulations for loop modeling and refinement of proteins in explicit solvent. Proteins 2010;78:2809-19. [DOI: 10.1002/prot.22796] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]

Hixson CA, Wheeler RA. Pressure Annealing as a Complement to Temperature Annealing To Find Low-Energy Structures of Oligomeric Molecules. J Chem Theory Comput 2009;5:1883-94. [DOI: 10.1021/ct800451c] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]

Velez-Vega C, Fenwick MK, Escobedo FA. Simulated mutagenesis of the hypervariable loops of a llama VHH domain for the recovery of canonical conformations. J Phys Chem B 2009;113:1785-95. [PMID: 19132876 DOI: 10.1021/jp805866j] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]

Zhong S, Moix JM, Quirk S, Hernandez R. Dihedral-angle information entropy as a gauge of secondary structure propensity. Biophys J 2006;91:4014-23. [PMID: 16980371 PMCID: PMC1635691 DOI: 10.1529/biophysj.106.089243] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2006] [Accepted: 08/29/2006] [Indexed: 11/18/2022] Open

Shehu A, Clementi C, Kavraki LE. Modeling protein conformational ensembles: From missing loops to equilibrium fluctuations. Proteins 2006;65:164-79. [PMID: 16917941 DOI: 10.1002/prot.21060] [Citation(s) in RCA: 63] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]

Bastard K, Prévost C, Zacharias M. Accounting for loop flexibility during protein-protein docking. Proteins 2005;62:956-69. [PMID: 16372349 DOI: 10.1002/prot.20770] [Citation(s) in RCA: 70] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]

Cheng X, Cui G, Hornak V, Simmerling C. Modified replica exchange simulation methods for local structure refinement. J Phys Chem B 2005;109:8220-30. [PMID: 16851961 PMCID: PMC4805125 DOI: 10.1021/jp045437y] [Citation(s) in RCA: 96] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]

Abstract

Parallel tempering, also known as replica exchange molecular dynamics (REMD), has recently been successfully used to study the structure and thermodynamic properties of biomolecules such as peptides and small proteins. For large systems, however, applying REMD can be costly since the number of replicas needed increases as the square root of the number of degrees of freedom in the system. Often, enhanced sampling is only needed for a subset of atoms, such as a loop region of a large protein or a small ligand binding to a receptor. In such applications, it is often reasonable to assume a weak dependence of the structure of the larger region on the instantaneous conformation of the smaller region of interest. For these cases, we derived two variant replica exchange methods, partial replica exchange molecular dynamics (PREMD) and local replica exchange molecular dynamics (LREMD). The Hamiltonian for the system is separated, with replica exchange carried out only for terms involving the subsystem of interest while the remainder of the system is maintained at a single temperature. The number of replicas required for efficient exchange thus depends on the number of degrees of freedom in the fragment needing refinement rather than on the size of the full system. The method can be applied to much larger systems than was previously practical. This also provides a means to preserve the integrity of the structure outside the refinement region without introduction of restraints. LREMD takes this weak coupling approximation a step further, employing only a single representation of the large fragment that simultaneously interacts with all of the replicas of the subsystem of interest. This is obtained by combining replica exchange with the locally enhanced sampling approximation (LES), reducing the computational expense of replica exchange simulations to near that of a single standard molecular dynamics (MD) simulation. Use of LREMD also permits the use of LES without requiring the specification of a single temperature, a known difficulty for standard LES simulations. We tested these two methods on the loop region of an RNA hairpin model system and find significant advantages over standard MD and REMD simulations.

Collapse

Bastard K, Thureau A, Lavery R, Prévost C. Docking macromolecules with flexible segments. J Comput Chem 2003;24:1910-20. [PMID: 14515373 DOI: 10.1002/jcc.10329] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]

Fenwick MK, Escobedo FA. Hybrid Monte Carlo with multidimensional replica exchanges: conformational equilibria of the hypervariable regions of a llama VHH antibody domain. Biopolymers 2003;68:160-77. [PMID: 12548621 DOI: 10.1002/bip.10291] [Citation(s) in RCA: 19] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]

Abstract

Since the structural repertoire of the hypervariable regions of human antibodies is known to be more restricted than what is implied by sequence variability, a common approach to structural prediction is to use a knowledge-based (KB) method, such as the canonical structure model (C. Chothia and A. M. Lesk, Journal of Molecular Biology, 1987, Vol. 196, pp. 901-917). However, this model is less successful when applied to camelid heavy chain antibodies. In this study, molecular simulations were used to examine the conformational equilibria of the hypervariable regions (H1, H2, and H3) of a llama heavy chain variable domain, for which KB predictions are poor. Simulations were carried out using both conventional molecular dynamics (MD) and hybrid Monte Carlo with multidimensional replica exchanges (HYMREX). The advantage of the latter method is its ability to selectively target parts of the Hamiltonian that can most readily improve sampling. A novel variant of HYMREX was implemented in which, besides the temperature, torsional interactions and the range of nonbonded interactions were varied. To compare the sampling abilities of MD and this HYMREX scheme, simulations were started from a misfolded conformational state. Overall, MD yielded final conformations more similar to the initial state, implying quasi-ergodic sampling. In contrast, HYMREX achieved more ergodic sampling, and the majority of conformations that it sampled agreed well with the known crystal structure. The HYMREX simulation results were used to help identify the chief interactions governing the conformational equilibria and to reexamine the key assumptions underlying the KB predictions. The data show that the H1 region exhibited significant conformational freedom, in support of the hypothesis that main-chain structural variability in this region could play a greater role in antigen binding in camelid antibodies than it does in normal antibodies. Key H1 residues and associated inter-loop interactions are conjectured to account for the poor KB predictions.

Collapse

Mehler EL, Periole X, Hassan SA, Weinstein H. Key issues in the computational simulation of GPCR function: representation of loop domains. J Comput Aided Mol Des 2002;16:841-53. [PMID: 12825797 DOI: 10.1023/a:1023845015343] [Citation(s) in RCA: 47] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023]

Abstract

Some key concerns raised by molecular modeling and computational simulation of functional mechanisms for membrane proteins are discussed and illustrated for members of the family of G protein coupled receptors (GPCRs). Of particular importance are issues related to the modeling and computational treatment of loop regions. These are demonstrated here with results from different levels of computational simulations applied to the structures of rhodopsin and a model of the 5-HT2A serotonin receptor, 5-HT2AR. First, comparative Molecular Dynamics (MD) simulations are reported for rhodopsin in vacuum and embedded in an explicit representation of the membrane and water environment. It is shown that in spite of a partial accounting of solvent screening effects by neutralization of charged side chains, vacuum MD simulations can lead to severe distortions of the loop structures. The primary source of the distortion appears to be formation of artifactual H-bonds, as has been repeatedly observed in vacuum simulations. To address such shortcomings, a recently proposed approach that has been developed for calculating the structure of segments that connect elements of secondary structure with known coordinates, is applied to 5-HT2AR to obtain an initial representation of the loops connecting the transmembrane (TM) helices. The approach consists of a simulated annealing combined with biased scaled collective variables Monte Carlo technique, and is applied to loops connecting the TM segments on both the extra-cellular and the cytoplasmic sides of the receptor. Although this initial calculation treats the loops as independent structural entities, the final structure exhibits a number of interloop interactions that may have functional significance. Finally, it is shown here that in the case where a given loop from two different GPCRs (here rhodopsin and 5-HT2AR) has approximately the same length and some degree of sequence identity, the fold adopted by the loops can be similar. Thus, in such special cases homology modeling might be used to obtain initial structures of these loops. Notably, however, all other loops in these two receptors appear to be very different in sequence and structure, so that their conformations can be found reliably only by ab initio, energy based methods and not by homology modeling.

Collapse

Forster MJ. Molecular modelling in structural biology. Micron 2002;33:365-84. [PMID: 11814876 DOI: 10.1016/s0968-4328(01)00035-x] [Citation(s) in RCA: 39] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]

Abstract

Molecular modelling is a powerful methodology for analysing the three dimensional structure of biological macromolecules. There are many ways in which molecular modelling methods have been used to address problems in structural biology. It is not widely appreciated that modelling methods are often an integral component of structure determination by NMR spectroscopy and X-ray crystallography. In this review we consider some of the numerous ways in which modelling can be used to interpret and rationalise experimental data and in constructing hypotheses that can be tested by experiment. Genome sequencing projects are producing a vast wealth of data describing the protein coding regions of the genome under study. However, only a minority of the protein sequences thus identified will have a clear sequence homology to a known protein. In such cases valuable three-dimensional models of the protein coding sequence can be constructed by homology modelling methods. Threading methods, which used specialised schemes to relate protein sequences to a library of known structures, have been shown to be able to identify the likely protein fold even in cases where there is no clear sequence homology. The number of protein sequences that cannot be assigned to a structural class by homology or threading methods, simply because they belong to a previously unidentified protein folding class, will decrease in the future as collaborative efforts in systematic structure determination begin to develop. For this reason, modelling methods are likely to become increasingly useful in the near future. The role of the blind prediction contests, such as the Critical Assessment of techniques for protein Structure Prediction (CASP), will be briefly discussed. Methods for modelling protein-ligand and protein-protein complexes are also described and examples of their applications given.

Collapse

Tosatto SCE, Bindewald E, Hesser J, Männer R. A divide and conquer approach to fast loop modeling. Protein Eng Des Sel 2002;15:279-86. [PMID: 11983928 DOI: 10.1093/protein/15.4.279] [Citation(s) in RCA: 56] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open

Lafontaine I, Lavery R. ADAPT: a molecular mechanics approach for studying the structural properties of long DNA sequences. Biopolymers 2002;56:292-310. [PMID: 11754342 DOI: 10.1002/1097-0282(2000)56:4<292::aid-bip10028>3.0.co;2-9] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023]

Hassan SA, Mehler EL, Weinstein H. Structure Calculation of Protein Segments Connecting Domains with Defined Secondary Structure: A Simulated Annealing Monte Carlo Combined with Biased Scaled Collective Variables Technique. ACTA ACUST UNITED AC 2002. [DOI: 10.1007/978-3-642-56080-4_9] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/08/2023]

Galaktionov S, Nikiforovich GV, Marshall GR. Ab initio modeling of small, medium, and large loops in proteins. Biopolymers 2001;60:153-68. [PMID: 11455548 DOI: 10.1002/1097-0282(2001)60:2<153::aid-bip1010>3.0.co;2-6] [Citation(s) in RCA: 38] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]

Hixson CA, Wheeler RA. Rigorous classical-mechanical derivation of a multiple-copy algorithm for sampling statistical mechanical ensembles. PHYSICAL REVIEW E 2001;64:026701. [PMID: 11497738 DOI: 10.1103/physreve.64.026701] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/26/2001] [Indexed: 11/07/2022]

Fiser A, Do RK, Sali A. Modeling of loops in protein structures. Protein Sci 2000;9:1753-73. [PMID: 11045621 PMCID: PMC2144714 DOI: 10.1110/ps.9.9.1753] [Citation(s) in RCA: 1590] [Impact Index Per Article: 63.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]

Abstract

Comparative protein structure prediction is limited mostly by the errors in alignment and loop modeling. We describe here a new automated modeling technique that significantly improves the accuracy of loop predictions in protein structures. The positions of all nonhydrogen atoms of the loop are optimized in a fixed environment with respect to a pseudo energy function. The energy is a sum of many spatial restraints that include the bond length, bond angle, and improper dihedral angle terms from the CHARMM-22 force field, statistical preferences for the main-chain and side-chain dihedral angles, and statistical preferences for nonbonded atomic contacts that depend on the two atom types, their distance through space, and separation in sequence. The energy function is optimized with the method of conjugate gradients combined with molecular dynamics and simulated annealing. Typically, the predicted loop conformation corresponds to the lowest energy conformation among 500 independent optimizations. Predictions were made for 40 loops of known structure at each length from 1 to 14 residues. The accuracy of loop predictions is evaluated as a function of thoroughness of conformational sampling, loop length, and structural properties of native loops. When accuracy is measured by local superposition of the model on the native loop, 100, 90, and 30% of 4-, 8-, and 12-residue loop predictions, respectively, had <2 A RMSD error for the mainchain N, C(alpha), C, and O atoms; the average accuracies were 0.59 +/- 0.05, 1.16 +/- 0.10, and 2.61 +/- 0.16 A, respectively. To simulate real comparative modeling problems, the method was also evaluated by predicting loops of known structure in only approximately correct environments with errors typical of comparative modeling without misalignment. When the RMSD distortion of the main-chain stem atoms is 2.5 A, the average loop prediction error increased by 180, 25, and 3% for 4-, 8-, and 12-residue loops, respectively. The accuracy of the lowest energy prediction for a given loop can be estimated from the structural variability among a number of low energy predictions. The relative value of the present method is gauged by (1) comparing it with one of the most successful previously described methods, and (2) describing its accuracy in recent blind predictions of protein structure. Finally, it is shown that the average accuracy of prediction is limited primarily by the accuracy of the energy function rather than by the extent of conformational sampling.

Collapse

Van Belle D, De Maria L, Iurcu G, Wodak SJ. Pathways of ligand clearance in acetylcholinesterase by multiple copy sampling. J Mol Biol 2000;298:705-26. [PMID: 10788331 DOI: 10.1006/jmbi.2000.3698] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]

Abstract

The clearance of seven different ligands from the deeply buried active-site of Torpedo californica acetylcholinesterase is investigated by combining multiple copy sampling molecular dynamics simulations, with the analysis of protein-ligand interactions, protein motion and the electrostatic potential sampled by the ligand copies along their journey outwards. The considered ligands are the cations ammonium, methylammonium, and tetramethylammonium, the hydrophobic methane and neopentane, and the anionic product acetate and its neutral form, acetic acid. We find that the pathways explored by the different ligands vary with ligand size and chemical properties. Very small ligands, such as ammonium and methane, exit through several routes. One involves the main exit through the mouth of the enzyme gorge, another is through the so-called back door near Trp84, and a third uses a side door at a direction of approximately 45 degrees to the main exit. The larger polar ligands, methylammonium and acetic acid, leave through the main exit, but the bulkiest, tetramethylammonium and neopentane, as well as the smaller acetate ion, remain trapped in the enzyme gorge during the time of the simulations. The pattern of protein-ligand contacts during the diffusion process is highly non-random and differs for different ligands. A majority is made with aromatic side-chains, but classical H-bonds are also formed. In the case of acetate, but not acetic acid, the anionic and neutral form, respectively, of one of the reaction products, specific electrostatic interactions with protein groups, seem to slow ligand motion and interfere with protein flexibility; protonation of the acetate ion is therefore suggested to facilitate clearance. The Poisson-Boltzmann formalism is used to compute the electrostatic potential of the thermally fluctuating acetylcholinesterase protein at positions actually visited by the diffusing ligand copies. Ligands of different charge and size are shown to sample somewhat different electrostatic potentials during their migration, because they explore different microscopic routes. The potential along the clearance route of a cation such as methylammonium displays two clear minima at the active and peripheral anionic site. We find moreover that the electrostatic energy barrier that the cation needs to overcome when moving between these two sites is small in both directions, being of the order of the ligand kinetic energy. The peripheral site thus appears to play a role in trapping inbound cationic ligands as well as in cation clearance, and hence in product release.

Collapse

Kim ST, Shirai H, Nakajima N, Higo J, Nakamura H. Enhanced conformational diversity search of CDR-H3 in antibodies: Role of the first CDR-H3 residue. Proteins 1999. [DOI: 10.1002/(sici)1097-0134(19991201)37:4<683::aid-prot17>3.0.co;2-d] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]

Wojcik J, Mornon JP, Chomilier J. New efficient statistical sequence-dependent structure prediction of short to medium-sized protein loops based on an exhaustive loop classification. J Mol Biol 1999;289:1469-90. [PMID: 10373380 DOI: 10.1006/jmbi.1999.2826] [Citation(s) in RCA: 71] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]

Abstract

A bank of 13,563 loops from three to eight amino acid residues long, representing motifs between two consecutive regular secondary structures, has been derived from protein structures presenting less than 95 % sequence identity. Statistical analyses of occurrences of conformations and residues revealed length-dependent over-representations of particular amino acids (glycine, proline, asparagine, serine, and aspartate) and conformations (alphaL, epsilon, betaPregions of the Ramachandran plot). A position-dependent distribution of these occurrences was observed for N and C-terminal residues, which are correlated to the nature of the flanking regions. Loops of the same length were clustered into statistically meaningful families on the basis of their backbone structures when placed in a common reference frame, independent of the flanks. These clusters present significantly different distributions of sequence, conformations, and endpoint residue Calphadistances. On the basis of the sequence-structure correlation of this clustering, an automatic loop modeling algorithm was developed. Based on the knowledge of its sequence and of its flank backbone structures each query loop is assigned to a family and target loop supports are selected in this family. The support backbones of these target loops are then adjusted on flanking structures by partial exploration of the conformational space. Loop closure is performed by energy minimization for each support and the final model is chosen among connected supports based upon energy criteria. The quality of the prediction is evaluated by the root-mean-square deviation (rmsd) between the final model and the native loops when the whole bank is re-attributed on itself with a Jackknife test. This average rmsd ranges from 1.1 A for three-residue loops to 3.8 A for eight-residue loops. A few poorly predicted loops are inescapable, considering the high level of diversity in loops and the lack of environment data. To overcome such modeling problems, a statistical reliability score was assigned for each prediction. This score is correlated to the quality of the prediction, in terms of rmsd, and thus improves the selection accuracy of the model. The algorithm efficiency was compared to CASP3 target loop predictions. Moreover, when tested on a test loop bank, this algorithm was shown to be robust when the loops are not precisely delimited, therefore proving to be a useful tool in practice for protein modeling.

Collapse

Li W, Liu Z, Lai L. Protein loops on structurally similar scaffolds: database and conformational analysis. Biopolymers 1999;49:481-95. [PMID: 10193195 DOI: 10.1002/(sici)1097-0282(199905)49:6<481::aid-bip6>3.0.co;2-v] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]

Abstract

A general problem in comparative modeling and protein design is the conformational evaluation of loops with a certain sequence in specific environmental protein frameworks. Loops of different sequences and structures on similar scaffolds are common in the Protein Data Bank (PDB). In order to explore both structural and sequential diversity of them, a data base of loops connecting similar secondary structure fragments is constructed by searching the data base of families of structurally similar proteins and PDB. A total of 84 loop families having 2-13 residues are found among the well-determined structures of resolution better than 2.5 A. Eight alpha-alpha, 20 alpha-beta, 19 beta-alpha, and 37 beta-beta families are identified. Every family contains more than 5 loop motifs. In each family, no loops share same sequence and all the frameworks are well superimposed. Forty-three new loop classes are distinguished in the data base. The structural variability of loops in homologous proteins are examined and shown in 44 families. Motif families are characterized with geometric parameters and sequence patterns. The conformations of loops in each family are clustered into subfamilies using average linkage cluster analysis method. Information such as geometric properties, sequence profile, sequential and structural variability in loop, structural alignment parameters, sequence similarities, and clustering results are provided. Correlations between the conformation of loops and loop sequence, motif sequence, and global sequence of PDB chain are examined in order to find how loop structures depend on their sequences and how they are affected by the local and global environment. Strong correlations (R > 0.75) are only found in 24 families. The best R value is 0.98. The data base is available through the Internet.

Collapse

Maroun RC. Molecular modeling of an active loop structure in lysozyme. Sequence effects or crystal packing? J Biomol Struct Dyn 1999;16:873-89. [PMID: 10217456 DOI: 10.1080/07391102.1999.10508299] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/28/2022]

Shirai H, Nakajima N, Higo J, Kidera A, Nakamura H. Conformational sampling of CDR-H3 in antibodies by multicanonical molecular dynamics simulation. J Mol Biol 1998;278:481-96. [PMID: 9571065 DOI: 10.1006/jmbi.1998.1698] [Citation(s) in RCA: 55] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]

Computational screening of combinatorial libraries via multicopy sampling. Drug Discov Today 1997. [DOI: 10.1016/s1359-6446(97)01046-5] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]

van Vlijmen HW, Karplus M. PDB-based protein loop prediction: parameters for selection and methods for optimization. J Mol Biol 1997;267:975-1001. [PMID: 9135125 DOI: 10.1006/jmbi.1996.0857] [Citation(s) in RCA: 113] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]

Abstract

An approach to loop prediction that starts with a database search is presented and analyzed. To obtain meaningful statistics, 130 loops from 21 proteins were studied. The correlation between the internal conformation of the loop and the conformation of the neighboring stem residues was examined. Distances between C(alpha) and C(beta) of the immediate neighbor residues at each end select template loops as well as more complex (e.g. three residues on either side) matching criteria. To have a high probability that the best possible loop candidate in the database is included in the set, relatively large cutoffs for matching the interatomic distances of the stem residues have to be used in the template loop selection procedure; for loops of length 5, this results in an average of 1000 loops and for loops of length 9, the number is about 1500. The required number increases only slowly with loop length, in contrast to the exponential time increase involved in direct searches of the conformational space. The best loops among the large number of candidates can be determined by ranking them with the standard CHARMM non-bonded energy function (without electrostatics) applied to the backbone and C(beta) atoms. The same representation (backbone plus C(beta)) can be used to optimize the loop orientations relative to the rest of the protein by constrained energy minimization. Target loops that have many non-bonded contacts with the protein yield better results so that analysis of the non-bonded contacts of the selected template loops is useful in determining the expected accuracy of a prediction. The method for loop selection and optimization predicted eight (out of 18) loops of up to nine residues to an RMSD better than 1.07 A relative to the crystal structure; for 17 of the 18 loops, one of the three lowest energy template loops had an RMSD of less than 1.79 A. The prediction of antibody loops from a database search is more effective than that for non-antibody loops. Provided that they belong to one of the canonical classes, very similar antibody loops are certain to exist in the database. Superposition of the stem residues for antibody loops also results in a better orientation than with arbitrary target loops because the neighboring residues tend to have a more similar beta-strand structure. Two H3 loops (for which no canonical structures have been proposed) were predicted with reasonable accuracy (RMSD of 0.49 A and 1.07 A) even though no corresponding antibody loops were in the database.

Collapse

Zheng WM, Zheng Q. An analytical derivation of the locally enhanced sampling approximation. J Chem Phys 1997. [DOI: 10.1063/1.473216] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open

Shirai H, Kidera A, Nakamura H. Structural classification of CDR-H3 in antibodies. FEBS Lett 1996;399:1-8. [PMID: 8980108 DOI: 10.1016/s0014-5793(96)01252-5] [Citation(s) in RCA: 188] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/03/2023]

Mazur J, Jernigan RL, Sarai A. Constructing optimal backbone segments for joining fixed DNA base pairs. Biophys J 1996;71:1493-506. [PMID: 8874023 PMCID: PMC1233616 DOI: 10.1016/s0006-3495(96)79352-9] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023] Open

Zheng Q, Kyle DJ. Computational screening of combinatorial libraries. Bioorg Med Chem 1996;4:631-8. [PMID: 8804526 DOI: 10.1016/0968-0896(96)00056-9] [Citation(s) in RCA: 18] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023]

Koehl P, Delarue M. Mean-field minimization methods for biological macromolecules. Curr Opin Struct Biol 1996;6:222-6. [PMID: 8728655 DOI: 10.1016/s0959-440x(96)80078-9] [Citation(s) in RCA: 66] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]

Vásquez M. Modeling side-chain conformation. Curr Opin Struct Biol 1996;6:217-21. [PMID: 8728654 DOI: 10.1016/s0959-440x(96)80077-7] [Citation(s) in RCA: 55] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]

Zheng Q, Kyle DJ. Accuracy and reliability of the scaling-relaxation method for loop closure: an evaluation based on extensive and multiple copy conformational samplings. Proteins 1996;24:209-17. [PMID: 8820487 DOI: 10.1002/(sici)1097-0134(199602)24:2<209::aid-prot7>3.0.co;2-d] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023]

Sali A. Modeling mutations and homologous proteins. Curr Opin Biotechnol 1995;6:437-51. [PMID: 7579655 DOI: 10.1016/0958-1669(95)80074-3] [Citation(s) in RCA: 123] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/26/2023]

Koehl P, Delarue M. A self consistent mean field approach to simultaneous gap closure and side-chain positioning in homology modelling. NATURE STRUCTURAL BIOLOGY 1995;2:163-70. [PMID: 7538429 DOI: 10.1038/nsb0295-163] [Citation(s) in RCA: 84] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/25/2023]

Zheng Q, Kyle DJ. Multiple copy sampling: rigid versus flexible protein. Proteins 1994;19:324-9. [PMID: 7527150 DOI: 10.1002/prot.340190407] [Citation(s) in RCA: 27] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/25/2023]