1
|
Behkamal B, Naghibzadeh M, Pagnani A, Saberi MR, Al Nasr K. LPTD: a novel linear programming-based topology determination method for cryo-EM maps. Bioinformatics 2022; 38:2734-2741. [PMID: 35561171 PMCID: PMC9306757 DOI: 10.1093/bioinformatics/btac170] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2021] [Revised: 03/01/2022] [Accepted: 03/18/2022] [Indexed: 02/03/2023] Open
Abstract
SUMMARY Topology determination is one of the most important intermediate steps toward building the atomic structure of proteins from their medium-resolution cryo-electron microscopy (cryo-EM) map. The main goal in the topology determination is to identify correct matches (i.e. assignment and direction) between secondary structure elements (SSEs) (α-helices and β-sheets) detected in a protein sequence and cryo-EM density map. Despite many recent advances in molecular biology technologies, the problem remains a challenging issue. To overcome the problem, this article proposes a linear programming-based topology determination (LPTD) method to solve the secondary structure topology problem in three-dimensional geometrical space. Through modeling of the protein's sequence with the aid of extracting highly reliable features and a distance-based scoring function, the secondary structure matching problem is transformed into a complete weighted bipartite graph matching problem. Subsequently, an algorithm based on linear programming is developed as a decision-making strategy to extract the true topology (native topology) between all possible topologies. The proposed automatic framework is verified using 12 experimental and 15 simulated α-β proteins. Results demonstrate that LPTD is highly efficient and extremely fast in such a way that for 77% of cases in the dataset, the native topology has been detected in the first rank topology in <2 s. Besides, this method is able to successfully handle large complex proteins with as many as 65 SSEs. Such a large number of SSEs have never been solved with current tools/methods. AVAILABILITY AND IMPLEMENTATION The LPTD package (source code and data) is publicly available at https://github.com/B-Behkamal/LPTD. Moreover, two test samples as well as the instruction of utilizing the graphical user interface have been provided in the shared readme file. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Bahareh Behkamal
- Department of Computer Engineering, Faculty of Engineering, Ferdowsi University of Mashhad, Mashhad 9177948974, Iran
| | - Mahmoud Naghibzadeh
- Department of Computer Engineering, Faculty of Engineering, Ferdowsi University of Mashhad, Mashhad 9177948974, Iran
| | - Andrea Pagnani
- Department of Applied Science and Technology (DISAT), Politecnico di Torino, Torino I-10129, Italy
- Italian Institute for Genomic Medicine (IIGM), IRCC-Candiolo, Candiolo (TO) I-10060, Italy
- INFN Sezione di Torino, Torino I-10125, Italy
| | - Mohammad Reza Saberi
- Medicinal Chemistry Department, School of Pharmacy, Mashhad University of Medical Sciences, Mashhad 9177899191, Iran
- Bioinformatics Research Group, Mashhad University of Medical Sciences, Mashhad 9177899191, Iran
| | - Kamal Al Nasr
- Department of Computer Science, Tennessee State University, Nashville, TN 37209, USA
| |
Collapse
|
2
|
Wen C, Zhang J, Duan Y, Zhang H, Ma H. A Mini‐Review on Brewer's Spent Grain Protein: Isolation, Physicochemical Properties, Application of Protein, and Functional Properties of Hydrolysates. J Food Sci 2019; 84:3330-3340. [DOI: 10.1111/1750-3841.14906] [Citation(s) in RCA: 31] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2019] [Revised: 08/30/2019] [Accepted: 10/01/2019] [Indexed: 01/17/2023]
Affiliation(s)
- Chaoting Wen
- School of Food and Biological EngineeringJiangsu Univ. Zhenjiang 212013 China
| | - Jixian Zhang
- School of Food and Biological EngineeringJiangsu Univ. Zhenjiang 212013 China
| | - Yuqing Duan
- School of Food and Biological EngineeringJiangsu Univ. Zhenjiang 212013 China
- Inst. of Food Physical ProcessingJiangsu Univ. Zhenjiang 212013 China
| | - Haihui Zhang
- School of Food and Biological EngineeringJiangsu Univ. Zhenjiang 212013 China
| | - Haile Ma
- School of Food and Biological EngineeringJiangsu Univ. Zhenjiang 212013 China
- Inst. of Food Physical ProcessingJiangsu Univ. Zhenjiang 212013 China
| |
Collapse
|
3
|
Kumar KK, Devi BU, Neeraja P. Molecular activities and ligand-binding specificities of StAR-related lipid transfer domains: exploring integrated in silico methods and ensemble-docking approaches. SAR AND QSAR IN ENVIRONMENTAL RESEARCH 2018; 29:483-501. [PMID: 29688061 DOI: 10.1080/1062936x.2018.1462847] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/03/2018] [Accepted: 04/05/2018] [Indexed: 06/08/2023]
Abstract
In this study, cholesterol biotransformation gene-set of human steroidogenic acute regulatory protein-related lipid transfer (START) domains were evaluated from high-throughput gene screening approaches. It was shown that STARD1, STARD3 and STARD4 proteins are better effective transporters of cholesterol than STARD5 and STARD6 domains. Docking studies show a strong agreement with gene ontology enrichment data. According to both complementary strategies, it was found that only STARD1, STARD3 and STARD4 are potentially involved in cholesterol biotransformation in mitochondria through Ω1-loop of C-terminal α4-helical domain. Ensemble docking assessment for a set of selected chemicals of protein-chemical networks has shown possible binding probabilities with START domains. Among those, reproductive toxicity evoked drugs (mifepristone), insecticides (rotenone), tobacco pulmonary carcinogens (benzo(a)pyrene) and endocrine disruptor chemicals (EDCs) including perfluorooctanesulfonic acid (PFOS) and aflatoxin B1 (AFB1) potentially bound with novel hotspot residues of the α4-helical domain. Compound representation space and clustering approaches reveal that the START proteins show more sensitivity with these lead scaffolds, so they could provide probable barrier assets in cholesterol and steroidogenic acute regulatory (StAR) binding and leads adverse consequences in steroidogenesis. These findings indicate potential START domains and their binding levels with toxic chemicals; sorted viewpoints could be useful as a promising way to identify chemicals with related steroidogenisis impacts on human health.
Collapse
Affiliation(s)
- K Kranthi Kumar
- a Department of Zoology , Sri Venkateswara University , Tirupati , 517502 - A.P . India
| | - B Uma Devi
- a Department of Zoology , Sri Venkateswara University , Tirupati , 517502 - A.P . India
| | - P Neeraja
- a Department of Zoology , Sri Venkateswara University , Tirupati , 517502 - A.P . India
| |
Collapse
|
4
|
Computational modeling of protein assemblies. Curr Opin Struct Biol 2017; 44:179-189. [DOI: 10.1016/j.sbi.2017.04.006] [Citation(s) in RCA: 39] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2016] [Revised: 04/07/2017] [Accepted: 04/11/2017] [Indexed: 01/18/2023]
|
5
|
Joseph AP, Lagerstedt I, Patwardhan A, Topf M, Winn M. Improved metrics for comparing structures of macromolecular assemblies determined by 3D electron-microscopy. J Struct Biol 2017; 199:12-26. [PMID: 28552721 PMCID: PMC5479444 DOI: 10.1016/j.jsb.2017.05.007] [Citation(s) in RCA: 37] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2016] [Revised: 05/19/2017] [Accepted: 05/23/2017] [Indexed: 11/28/2022]
Abstract
Recent developments in 3-dimensional electron microcopy (3D-EM) techniques and a concomitant drive to look at complex molecular structures, have led to a rapid increase in the amount of volume data available for biomolecules. This creates a demand for better methods to analyse the data, including improved scores for comparison, classification and integration of data at different resolutions. To this end, we developed and evaluated a set of scoring functions that compare 3D-EM volumes. To test our scores we used a benchmark set of volume alignments derived from the Electron Microscopy Data Bank. We find that the performance of different scores vary with the map-type, resolution and the extent of overlap between volumes. Importantly, adding the overlap information to the local scoring functions can significantly improve their precision and accuracy in a range of resolutions. A combined score involving the local mutual information and overlap (LMI_OV) performs best overall, irrespective of the map category, resolution or the extent of overlap, and we recommend this score for general use. The local mutual information score itself is found to be more discriminatory than cross-correlation coefficient for intermediate-to-low resolution maps or when the map size and density distribution differ significantly. For comparing map surfaces, we implemented two filters to detect the surface points, including one based on the 'extent of surface exposure'. We show that scores that compare surfaces are useful at low resolutions and for maps with evident surface features. All the scores discussed are implemented in TEMPy (http://tempy.ismb.lon.ac.uk/).
Collapse
Affiliation(s)
- Agnel Praveen Joseph
- Institute of Structural and Molecular Biology, Department of Biological Sciences, Birkbeck College, University of London, Malet Street, London WC1E 7HX, United Kingdom; Scientific Computing Department, Science and Technology Facilities Council, Research Complex at Harwell, Didcot OX11 0FA, United Kingdom
| | - Ingvar Lagerstedt
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom; Computational Chemistry and Cheminformatics, Lilly UK, Windlesham GU20 6PH, United Kingdom
| | - Ardan Patwardhan
- European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom
| | - Maya Topf
- Institute of Structural and Molecular Biology, Department of Biological Sciences, Birkbeck College, University of London, Malet Street, London WC1E 7HX, United Kingdom.
| | - Martyn Winn
- Scientific Computing Department, Science and Technology Facilities Council, Research Complex at Harwell, Didcot OX11 0FA, United Kingdom.
| |
Collapse
|
6
|
Samsó M. A guide to the 3D structure of the ryanodine receptor type 1 by cryoEM. Protein Sci 2016; 26:52-68. [PMID: 27671094 DOI: 10.1002/pro.3052] [Citation(s) in RCA: 24] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2016] [Revised: 09/21/2016] [Accepted: 09/22/2016] [Indexed: 01/04/2023]
Abstract
Signal transduction by the ryanodine receptor (RyR) is essential in many excitable cells including all striated contractile cells and some types of neurons. While its transmembrane domain is a classic tetrameric, six-transmembrane cation channel, the cytoplasmic domain is uniquely large and complex, hosting a multiplicity of specialized domains. The overall outline and substructure readily recognizable by electron microscopy make RyR a geometrically well-behaved specimen. Hence, for the last two decades, the 3D structural study of the RyR has tracked closely the technological advances in electron microscopy, cryo-electron microscopy (cryoEM), and computerized 3D reconstruction. This review summarizes the progress in the structural determination of RyR by cryoEM and, bearing in mind the leap in resolution provided by the recent implementation of direct electron detection, analyzes the first near-atomic structures of RyR. These reveal a complex orchestration of domains controlling the channel's function, and help to understand how this could break down as a consequence of disease-causing mutations.
Collapse
Affiliation(s)
- Montserrat Samsó
- Department of Physiology and Biophysics, Virginia Commonwealth University, Richmond, Virginia
| |
Collapse
|
7
|
Matthew Allen Bullock J, Schwab J, Thalassinos K, Topf M. The Importance of Non-accessible Crosslinks and Solvent Accessible Surface Distance in Modeling Proteins with Restraints From Crosslinking Mass Spectrometry. Mol Cell Proteomics 2016; 15:2491-500. [PMID: 27150526 PMCID: PMC4937519 DOI: 10.1074/mcp.m116.058560] [Citation(s) in RCA: 66] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2016] [Indexed: 11/06/2022] Open
Abstract
Crosslinking mass spectrometry (XL-MS) is becoming an increasingly popular technique for modeling protein monomers and complexes. The distance restraints garnered from these experiments can be used alone or as part of an integrative modeling approach, incorporating data from many sources. However, modeling practices are varied and the difference in their usefulness is not clear. Here, we develop a new scoring procedure for models based on crosslink data—Matched and Nonaccessible Crosslink score (MNXL). We compare its performance with that of other commonly-used scoring functions (Number of Violations and Sum of Violation Distances) on a benchmark of 14 protein domains, each with 300 corresponding models (at various levels of quality) and associated, previously published, experimental crosslinks (XLdb). The distances between crosslinked lysines are calculated either as Euclidean distances or Solvent Accessible Surface Distances (SASD) using a newly-developed method (Jwalk). MNXL takes into account whether a crosslink is nonaccessible, i.e. an experimentally observed crosslink has no corresponding SASD in a model due to buried lysines. This metric alone is shown to have a significant impact on modeling performance and is a concept that is not considered at present if only Euclidean distances are used. Additionally, a comparison between modeling with SASD or Euclidean distance shows that SASD is superior, even when factoring out the effect of the nonaccessible crosslinks. Our benchmarking also shows that MNXL outperforms the other tested scoring functions in terms of precision and correlation to Cα-RMSD from the crystal structure. We finally test the MNXL at different levels of crosslink recovery (i.e. the percentage of crosslinks experimentally observed out of all theoretical ones) and set a target recovery of ∼20% after which the performance plateaus.
Collapse
Affiliation(s)
- Joshua Matthew Allen Bullock
- From the ‡Institute of Structural and Molecular Biology, Birkbeck College, University of London, Malet street, London, WC1E 7HX, UK
| | - Jannik Schwab
- From the ‡Institute of Structural and Molecular Biology, Birkbeck College, University of London, Malet street, London, WC1E 7HX, UK; §Gene Center Munich, Ludwig-Maximilians-Universität (LMU) Munich, Feodor-Lynen-Strasse 25, 81377 Munich, Germany
| | - Konstantinos Thalassinos
- From the ‡Institute of Structural and Molecular Biology, Birkbeck College, University of London, Malet street, London, WC1E 7HX, UK; ¶Institute of Structural and Molecular Biology, Division of Biosciences, University College London, London WC1E 6BT, UK
| | - Maya Topf
- From the ‡Institute of Structural and Molecular Biology, Birkbeck College, University of London, Malet street, London, WC1E 7HX, UK;
| |
Collapse
|
8
|
Rakesh R, Srinivasan N. Improving the Accuracy of Fitted Atomic Models in Cryo-EM Density Maps of Protein Assemblies Using Evolutionary Information from Aligned Homologous Proteins. Methods Mol Biol 2016; 1415:193-209. [PMID: 27115634 DOI: 10.1007/978-1-4939-3572-7_10] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]
Abstract
Cryo-Electron Microscopy (cryo-EM) has become an important technique to obtain structural insights into large macromolecular assemblies. However the resolution of the density maps do not allow for its interpretation at atomic level. Hence they are combined with high resolution structures along with information from other experimental or bioinformatics techniques to obtain pseudo-atomic models. Here, we describe the use of evolutionary conservation of residues as obtained from protein structures and alignments of homologous proteins to detect errors in the fitting of atomic structures as well as improve accuracy of the protein-protein interfacial regions in the cryo-EM density maps.
Collapse
Affiliation(s)
- Ramachandran Rakesh
- Molecular Biophysics Unit, Indian Institute of Science, Bangalore, 560012, India
| | | |
Collapse
|
9
|
Mass spectrometry coupled experiments and protein structure modeling methods. Int J Mol Sci 2013; 14:20635-57. [PMID: 24132151 PMCID: PMC3821635 DOI: 10.3390/ijms141020635] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2013] [Revised: 09/17/2013] [Accepted: 09/19/2013] [Indexed: 01/02/2023] Open
Abstract
With the accumulation of next generation sequencing data, there is increasing interest in the study of intra-species difference in molecular biology, especially in relation to disease analysis. Furthermore, the dynamics of the protein is being identified as a critical factor in its function. Although accuracy of protein structure prediction methods is high, provided there are structural templates, most methods are still insensitive to amino-acid differences at critical points that may change the overall structure. Also, predicted structures are inherently static and do not provide information about structural change over time. It is challenging to address the sensitivity and the dynamics by computational structure predictions alone. However, with the fast development of diverse mass spectrometry coupled experiments, low-resolution but fast and sensitive structural information can be obtained. This information can then be integrated into the structure prediction process to further improve the sensitivity and address the dynamics of the protein structures. For this purpose, this article focuses on reviewing two aspects: the types of mass spectrometry coupled experiments and structural data that are obtainable through those experiments; and the structure prediction methods that can utilize these data as constraints. Also, short review of current efforts in integrating experimental data in the structural modeling is provided.
Collapse
|
10
|
iMODFIT: efficient and robust flexible fitting based on vibrational analysis in internal coordinates. J Struct Biol 2013; 184:261-70. [PMID: 23999189 DOI: 10.1016/j.jsb.2013.08.010] [Citation(s) in RCA: 127] [Impact Index Per Article: 11.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2013] [Revised: 08/20/2013] [Accepted: 08/22/2013] [Indexed: 12/31/2022]
Abstract
Here, we employed the collective motions extracted from Normal Mode Analysis (NMA) in internal coordinates (torsional space) for the flexible fitting of atomic-resolution structures into electron microscopy (EM) density maps. The proposed methodology was validated using a benchmark of simulated cases, highlighting its robustness over the full range of EM resolutions and even over coarse-grained representations. A systematic comparison with other methods further showcased the advantages of this proposed methodology, especially at medium to lower resolutions. Using this method, computational costs and potential overfitting problems are naturally reduced by constraining the search in low-frequency NMA space, where covalent geometry is implicitly maintained. This method also effectively captures the macromolecular changes of a representative set of experimental test cases. We believe that this novel approach will extend the currently available EM hybrid methods to the atomic-level interpretation of large conformational changes and their functional implications.
Collapse
|
11
|
Esquivel-Rodríguez J, Kihara D. Computational methods for constructing protein structure models from 3D electron microscopy maps. J Struct Biol 2013; 184:93-102. [PMID: 23796504 DOI: 10.1016/j.jsb.2013.06.008] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2012] [Revised: 06/11/2013] [Accepted: 06/13/2013] [Indexed: 12/31/2022]
Abstract
Protein structure determination by cryo-electron microscopy (EM) has made significant progress in the past decades. Resolutions of EM maps have been improving as evidenced by recently reported structures that are solved at high resolutions close to 3Å. Computational methods play a key role in interpreting EM data. Among many computational procedures applied to an EM map to obtain protein structure information, in this article we focus on reviewing computational methods that model protein three-dimensional (3D) structures from a 3D EM density map that is constructed from two-dimensional (2D) maps. The computational methods we discuss range from de novo methods, which identify structural elements in an EM map, to structure fitting methods, where known high resolution structures are fit into a low-resolution EM map. A list of available computational tools is also provided.
Collapse
Affiliation(s)
- Juan Esquivel-Rodríguez
- Department of Computer Science, College of Science, Purdue University, West Lafayette, IN 47907, USA
| | | |
Collapse
|
12
|
Consensus among multiple approaches as a reliability measure for flexible fitting into cryo-EM data. J Struct Biol 2013; 182:67-77. [PMID: 23416197 DOI: 10.1016/j.jsb.2013.02.002] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2012] [Revised: 01/29/2013] [Accepted: 02/01/2013] [Indexed: 12/14/2022]
Abstract
Cryo-electron microscopy (cryo-EM) can provide low-resolution density maps of large macromolecular assemblies. As the number of structures deposited in the Protein Data Bank by fitting a high-resolution structure into a low-resolution cryo-EM map is increasing, there is a need to revise the protocols and improve the measures for fitting. A recent study suggested using a combination of multiple automated flexible fitting approaches to improve the interpretation of cryo-EM data. The current work further explores the use of multiple approaches by validating this "consensus" fitting approach and deriving a local reliability measure. Here four different flexible fitting approaches are applied for fitting an initial structure into a simulated density map of known target structure from a dataset of proteins. It is found that the models produced from different approaches often have a consensus in conformation and are also near to the target structure, whereas cases not showing consensus are away from the target. A high correlation is also observed between the RMSF profiles calculated with respect to the average and the target structures, which indicates that the relation between consensus and accuracy can also be extended to a per-residue level. Therefore, the RMSF among the fitted models is proposed as a local reliability measure, which can be used to assess the reliability of the fit at specific regions. Hence, we encourage the community to use consensus flexible fitting with different methods to report on local reliability of the resulting models and improve the interpretation of cryo-EM data.
Collapse
|
13
|
Badia-Martinez D, Oksanen HM, Stuart DI, Abrescia NGA. Combined approaches to study virus structures. Subcell Biochem 2013; 68:203-246. [PMID: 23737053 DOI: 10.1007/978-94-007-6552-8_7] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/02/2023]
Abstract
A virus particle must work as a safe box for protecting its genome, but at the same time it has to undergo dramatic conformational changes in order to preserve itself by propagating in a cell infection. Thus, viruses are miniaturized wonders whose structural complexity requires them to be investigated by a combination of different techniques that can tackle both static and dynamic processes. In this chapter we will illustrate how major structural techniques such as X-ray crystallography and electron microscopy have been and can be combined with other techniques to determine the structure of complex viruses. The power of these hybrid method approaches are revealed through the various examples provided.
Collapse
Affiliation(s)
- Daniel Badia-Martinez
- Structural Biology Unit, CICbioGUNE, CIBERehd, Bizkaia Technology Park, 48160, Derio, Spain
| | | | | | | |
Collapse
|
14
|
Conventional electron microscopy, cryo-electron microscopy and cryo-electron tomography of viruses. Subcell Biochem 2013; 68:79-115. [PMID: 23737049 DOI: 10.1007/978-94-007-6552-8_3] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
Electron microscopy (EM) techniques have been crucial for understanding the structure of biological specimens such as cells, tissues and macromolecular assemblies. Viruses and related viral assemblies are ideal targets for structural studies that help to define essential biological functions. Whereas conventional EM methods use chemical fixation, dehydration, and staining of the specimens, cryo-electron microscopy (cryo-EM) preserves the native hydrated state. Combined with image processing and three-dimensional reconstruction techniques, cryo-EM provides 3D maps of these macromolecular complexes from projection images, at subnanometer to near-atomic resolutions. Cryo-EM is also a major technique in structural biology for dynamic studies of functional complexes, which are often unstable, flexible, scarce or transient in their native environments. As a tool, cryo-EM complements high-resolution techniques such as X-ray diffraction and NMR spectroscopy; these synergistic hybrid approaches provide important new information. Three-dimensional cryo-electron tomography goes further, and allows the study of viruses not only in their physiological state, but also in their natural environment in the cell, thereby bridging structural studies at the molecular and cellular levels.
Collapse
|
15
|
Ceres N, Lavery R. Coarse-grain Protein Models. INNOVATIONS IN BIOMOLECULAR MODELING AND SIMULATIONS 2012. [DOI: 10.1039/9781849735049-00219] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/30/2023]
Abstract
Coarse-graining is a powerful approach for modeling biomolecules that, over the last few decades, has been extensively applied to proteins. Coarse-grain models offer access to large systems and to slow processes without becoming computationally unmanageable. In addition, they are very versatile, enabling both the protein representation and the energy function to be adapted to the biological problem in hand. This review concentrates on modeling soluble proteins and their assemblies. It presents an overview of the coarse-grain representations, of the associated interaction potentials, and of the optimization procedures used to define them. It then shows how coarse-grain models have been used to understand processes involving proteins, from their initial folding to their functional properties, their binary interactions, and the assembly of large complexes.
Collapse
Affiliation(s)
- N. Ceres
- Bases Moléculaires et Structurales des Systèmes Infectieux Université Lyon1/CNRS UMR 5086, IBCP, 7 Passage du Vercors, 69367, Lyon France
| | - R. Lavery
- Bases Moléculaires et Structurales des Systèmes Infectieux Université Lyon1/CNRS UMR 5086, IBCP, 7 Passage du Vercors, 69367, Lyon France
| |
Collapse
|
16
|
Finding rigid bodies in protein structures: Application to flexible fitting into cryoEM maps. J Struct Biol 2012; 177:520-31. [DOI: 10.1016/j.jsb.2011.10.011] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2011] [Revised: 10/22/2011] [Accepted: 10/27/2011] [Indexed: 11/18/2022]
|
17
|
Reconstructing virus structures from nanometer to near-atomic resolutions with cryo-electron microscopy and tomography. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2012; 726:49-90. [PMID: 22297510 DOI: 10.1007/978-1-4614-0980-9_4] [Citation(s) in RCA: 37] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/02/2022]
Abstract
The past few decades have seen tremendous advances in single-particle electron -cryo-microscopy (cryo-EM). The field has matured to the point that near-atomic resolution density maps can be generated for icosahedral viruses without the need for crystallization. In parallel, substantial progress has been made in determining the structures of nonicosahedrally arranged proteins in viruses by employing either single-particle cryo-EM or cryo-electron tomography (cryo-ET). Implicit in this course have been the availability of a new generation of electron cryo-microscopes and the development of the computational tools that are essential for generating these maps and models. This methodology has enabled structural biologists to analyze structures in increasing detail for virus particles that are in different morphogenetic states. Furthermore, electron imaging of frozen, hydrated cells, in the process of being infected by viruses, has also opened up a new avenue for studying virus structures "in situ". Here we present the common techniques used to acquire and process cryo-EM and cryo-ET data and discuss their implications for structural virology both now and in the future.
Collapse
|
18
|
LU YONGGANG, HE JING, STRAUSS CHARLIEEM. DERIVING TOPOLOGY AND SEQUENCE ALIGNMENT FOR THE HELIX SKELETON IN LOW-RESOLUTION PROTEIN DENSITY MAPS. J Bioinform Comput Biol 2011; 6:183-201. [DOI: 10.1142/s0219720008003357] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2007] [Revised: 10/07/2007] [Accepted: 10/13/2007] [Indexed: 11/18/2022]
Abstract
Cryoelectron microscopy (cryoEM) is an experimental technique to determine the three-dimensional (3D) structure of large protein complexes. Currently, this technique is able to generate protein density maps at 6–9 Å resolution, at which the skeleton of the structure (which is composed of α-helices and β-sheets) can be visualized. As a step towards predicting the entire backbone of the protein from the protein density map, we developed a method to predict the topology and sequence alignment for the skeleton helices. Our method combines the geometrical information of the skeleton helices with the Rosetta ab initio structure prediction method to derive a consensus topology and sequence alignment for the skeleton helices. We tested the method with 60 proteins. For 45 proteins, the majority of the skeleton helices were assigned a correct topology from one of our top ten predictions. The offsets of the alignment for most of the assigned helices were within ±2 amino acids in the sequence. We also analyzed the use of the skeleton helices as a clustering tool for the decoy structures generated by Rosetta. Our comparison suggests that the topology clustering is a better method than a general overlap clustering method to enrich the ranking of decoys, particularly when the decoy pool is small.
Collapse
Affiliation(s)
- YONGGANG LU
- Department of Computer Science, New Mexico State University, Las Cruces, NM 88003, USA
| | - JING HE
- Department of Computer Science, New Mexico State University, Las Cruces, NM 88003, USA
| | - CHARLIE E. M. STRAUSS
- Bioscience Division, M888, Los Alamos National Laboratory, Los Alamos, NM 87545, USA
| |
Collapse
|
19
|
Ahmed A, Whitford PC, Sanbonmatsu KY, Tama F. Consensus among flexible fitting approaches improves the interpretation of cryo-EM data. J Struct Biol 2011; 177:561-70. [PMID: 22019767 DOI: 10.1016/j.jsb.2011.10.002] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2011] [Revised: 10/05/2011] [Accepted: 10/06/2011] [Indexed: 12/31/2022]
Abstract
Cryo-elecron microscopy (cryo-EM) can provide important structural information of large macromolecular assemblies in different conformational states. Recent years have seen an increase in structures deposited in the Protein Data Bank (PDB) by fitting a high-resolution structure into its low-resolution cryo-EM map. A commonly used protocol for accommodating the conformational changes between the X-ray structure and the cryo-EM map is rigid body fitting of individual domains. With the emergence of different flexible fitting approaches, there is a need to compare and revise these different protocols for the fitting. We have applied three diverse automated flexible fitting approaches on a protein dataset for which rigid domain fitting (RDF) models have been deposited in the PDB. In general, a consensus is observed in the conformations, which indicates a convergence from these theoretically different approaches to the most probable solution corresponding to the cryo-EM map. However, the result shows that the convergence might not be observed for proteins with complex conformational changes or with missing densities in cryo-EM map. In contrast, RDF structures deposited in the PDB can represent conformations that not only differ from the consensus obtained by flexible fitting but also from X-ray crystallography. Thus, this study emphasizes that a "consensus" achieved by the use of several automated flexible fitting approaches can provide a higher level of confidence in the modeled configurations. Following this protocol not only increases the confidence level of fitting, but also highlights protein regions with uncertain fitting. Hence, this protocol can lead to better interpretation of cryo-EM data.
Collapse
Affiliation(s)
- Aqeel Ahmed
- Department of Chemistry and Biochemistry, The University of Arizona, 1041 E. Lowell Street, Tucson, AZ 85721, USA.
| | | | | | | |
Collapse
|
20
|
Sun W, He J. From isotropic to anisotropic side chain representations: comparison of three models for residue contact estimation. PLoS One 2011; 6:e19238. [PMID: 21552527 PMCID: PMC3084275 DOI: 10.1371/journal.pone.0019238] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2010] [Accepted: 03/29/2011] [Indexed: 11/19/2022] Open
Abstract
The criterion to determine residue contact is a fundamental problem in deriving knowledge-based mean-force potential energy calculations for protein structures. A frequently used criterion is to require the side chain center-to-center distance or the -to- atom distance to be within a pre-determined cutoff distance. However, the spatially anisotropic nature of the side chain determines that it is challenging to identify the contact pairs. This study compares three side chain contact models: the Atom Distance criteria (ADC) model, the Isotropic Sphere Side chain (ISS) model and the Anisotropic Ellipsoid Side chain (AES) model using 424 high resolution protein structures in the Protein Data Bank. The results indicate that the ADC model is the most accurate and ISS is the worst. The AES model eliminates about 95% of the incorrectly counted contact-pairs in the ISS model. Algorithm analysis shows that AES model is the most computational intensive while ADC model has moderate computational cost. We derived a dataset of the mis-estimated contact pairs by AES model. The most misjudged pairs are Arg-Glu, Arg-Asp and Arg-Tyr. Such a dataset can be useful for developing the improved AES model by incorporating the pair-specific information for the cutoff distance.
Collapse
Affiliation(s)
- Weitao Sun
- Zhou Pei-Yuan Center for Applied Mathematics, Tsinghua University, Beijing, China.
| | | |
Collapse
|
21
|
Beck M, Topf M, Frazier Z, Tjong H, Xu M, Zhang S, Alber F. Exploring the spatial and temporal organization of a cell's proteome. J Struct Biol 2011; 173:483-96. [PMID: 21094684 PMCID: PMC3784337 DOI: 10.1016/j.jsb.2010.11.011] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2010] [Revised: 11/05/2010] [Accepted: 11/08/2010] [Indexed: 10/18/2022]
Abstract
To increase our current understanding of cellular processes, such as cell signaling and division, knowledge is needed about the spatial and temporal organization of the proteome at different organizational levels. These levels cover a wide range of length and time scales: from the atomic structures of macromolecules for inferring their molecular function, to the quantitative description of their abundance, and spatial distribution in the cell. Emerging new experimental technologies are greatly increasing the availability of such spatial information on the molecular organization in living cells. This review addresses three fields that have significantly contributed to our understanding of the proteome's spatial and temporal organization: first, methods for the structure determination of individual macromolecular assemblies, specifically the fitting of atomic structures into density maps generated from electron microscopy techniques; second, research that visualizes the spatial distributions of these complexes within the cellular context using cryo electron tomography techniques combined with computational image processing; and third, methods for the spatial modeling of the dynamic organization of the proteome, specifically those methods for simulating reaction and diffusion of proteins and complexes in crowded intracellular fluids. The long-term goal is to integrate the varied data about a proteome's organization into a spatially explicit, predictive model of cellular processes.
Collapse
Affiliation(s)
- Martin Beck
- European Molecular Biology Laboratory, Meyerhofstr. 1, 69117 Heidelberg, Germany
| | - Maya Topf
- Molecular Biology, Crystallography, Department of Biological Sciences, Birkbeck College, University of London, London, UK
| | - Zachary Frazier
- Program in Molecular and Computational Biology, University of Southern California, 1050 Childs Way, RRI 413E, Los Angeles, CA 90068, USA
| | - Harianto Tjong
- Program in Molecular and Computational Biology, University of Southern California, 1050 Childs Way, RRI 413E, Los Angeles, CA 90068, USA
| | - Min Xu
- Program in Molecular and Computational Biology, University of Southern California, 1050 Childs Way, RRI 413E, Los Angeles, CA 90068, USA
| | - Shihua Zhang
- Program in Molecular and Computational Biology, University of Southern California, 1050 Childs Way, RRI 413E, Los Angeles, CA 90068, USA
| | - Frank Alber
- Program in Molecular and Computational Biology, University of Southern California, 1050 Childs Way, RRI 413E, Los Angeles, CA 90068, USA
| |
Collapse
|
22
|
Lasker K, Sali A, Wolfson HJ. Determining macromolecular assembly structures by molecular docking and fitting into an electron density map. Proteins 2011; 78:3205-11. [PMID: 20827723 DOI: 10.1002/prot.22845] [Citation(s) in RCA: 60] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
Structural models of macromolecular assemblies are instrumental for gaining a mechanistic understanding of cellular processes. Determining these structures is a major challenge for experimental techniques, such as X-ray crystallography, NMR spectroscopy and electron microscopy (EM). Thus, computational modeling techniques, including molecular docking, are required. The development of most molecular docking methods has so far been focused on modeling of binary complexes. We have recently introduced the MultiFit method for modeling the structure of a multisubunit complex by simultaneously optimizing the fit of the model into an EM density map of the entire complex and the shape complementarity between interacting subunits. Here, we report algorithmic advances of the MultiFit method that result in an efficient and accurate assembly of the input subunits into their density map. The successful predictions and the increasing number of complexes being characterized by EM suggests that the CAPRI challenge could be extended to include docking-based modeling of macromolecular assemblies guided by EM.
Collapse
Affiliation(s)
- Keren Lasker
- Raymond and Beverly Sackler Faculty of Exact Sciences, Blavatnik School of Computer Science, Tel Aviv University, Tel Aviv 69978, Israel
| | | | | |
Collapse
|
23
|
Yin S, Dokholyan NV. Fingerprint-based structure retrieval using electron density. Proteins 2011; 79:1002-9. [PMID: 21287628 DOI: 10.1002/prot.22941] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2010] [Revised: 10/08/2010] [Accepted: 11/05/2010] [Indexed: 12/14/2022]
Abstract
We present a computational approach that can quickly search a large protein structural database to identify structures that fit a given electron density, such as determined by cryo-electron microscopy. We use geometric invariants (fingerprints) constructed using 3D Zernike moments to describe the electron density, and reduce the problem of fitting of the structure to the electron density to simple fingerprint comparison. Using this approach, we are able to screen the entire Protein Data Bank and identify structures that fit two experimental electron densities determined by cryo-electron microscopy.
Collapse
Affiliation(s)
- Shuangye Yin
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599-7260, USA
| | | |
Collapse
|
24
|
Automated protein structure modeling with SWISS-MODEL Workspace and the Protein Model Portal. Methods Mol Biol 2011; 857:107-36. [PMID: 22323219 DOI: 10.1007/978-1-61779-588-6_5] [Citation(s) in RCA: 103] [Impact Index Per Article: 7.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023]
Abstract
Comparative protein structure modeling is a computational approach to build three-dimensional structural models for proteins using experimental structures of related protein family members as templates. Regular blind assessments of modeling accuracy have demonstrated that comparative protein structure modeling is currently the most reliable technique to model protein structures. Homology models are often sufficiently accurate to substitute for experimental structures in a wide variety of applications. Since the usefulness of a model for specific application is determined by its accuracy, model quality estimation is an essential component of protein structure prediction. Comparative protein modeling has become a routine approach in many areas of life science research since fully automated modeling systems allow also nonexperts to build reliable models. In this chapter, we describe practical approaches for automated protein structure modeling with SWISS-MODEL Workspace and the Protein Model Portal.
Collapse
|
25
|
Gorba C, Tama F. Normal Mode Flexible Fitting of High-Resolution Structures of Biological Molecules Toward SAXS Data. Bioinform Biol Insights 2010; 4:43-54. [PMID: 20634984 PMCID: PMC2901630 DOI: 10.4137/bbi.s4551] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
Abstract
We present a method to reconstruct a three-dimensional protein structure from an atomic pair distribution function derived from the scattering intensity profile from SAXS data by flexibly fitting known x-ray structures. This method uses a linear combination of low-frequency normal modes from an elastic network description of the molecule in an iterative manner to deform the structure to conform optimally to the target pair distribution function derived from SAXS data. For computational efficiency, the protein and water molecules included in the protein first hydration shell are coarse-grained. In this paper, we demonstrate the validity of our coarse-graining approach to study SAXS data. Illustrative results of our flexible fitting studies on simulated SAXS data from five different proteins are presented.
Collapse
Affiliation(s)
- Christian Gorba
- Department of Chemistry and Biochemistry, The University of Arizona, 1041 E. Lowell Street, Tucson, AZ, 85721
| | | |
Collapse
|
26
|
Rawi R, Whitmore L, Topf M. CHOYCE: a web server for constrained homology modelling with cryoEM maps. ACTA ACUST UNITED AC 2010; 26:1673-4. [PMID: 20444836 PMCID: PMC2887048 DOI: 10.1093/bioinformatics/btq237] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
SUMMARY CHOYCE is a web server for homology modelling of protein components and the fitting of those components into cryo electron microscopy (cryoEM) maps of their assemblies. It provides an interactive approach to improving the selection of models based on the quality of their fit into the EM map. AVAILABILITY http://choyce.ismb.lon.ac.uk/ CONTACT m.topf@cryst.bbk.ac.uk; reda.rawi@uni-due.de SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Reda Rawi
- Institute of Structural and Molecular Biology, Crystallography, Department of Biological Sciences, Birkbeck College, University of London, London, UK.
| | | | | |
Collapse
|
27
|
Zhu J, Cheng L, Fang Q, Zhou ZH, Honig B. Building and refining protein models within cryo-electron microscopy density maps based on homology modeling and multiscale structure refinement. J Mol Biol 2010; 397:835-51. [PMID: 20109465 DOI: 10.1016/j.jmb.2010.01.041] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2009] [Revised: 01/04/2010] [Accepted: 01/20/2010] [Indexed: 11/16/2022]
Abstract
Automatic modeling methods using cryoelectron microscopy (cryoEM) density maps as constraints are promising approaches to building atomic models of individual proteins or protein domains. However, their application to large macromolecular assemblies has not been possible largely due to computational limitations inherent to such unsupervised methods. Here we describe a new method, EM-IMO (electron microscopy-iterative modular optimization), for building, modifying and refining local structures of protein models using cryoEM maps as a constraint. As a supervised refinement method, EM-IMO allows users to specify parameters derived from inspections so as to guide, and as a consequence, significantly speed up the refinement. An EM-IMO-based refinement protocol is first benchmarked on a data set of 50 homology models using simulated density maps. A multiscale refinement strategy that combines EM-IMO-based and molecular dynamics-based refinement is then applied to build backbone models for the seven conformers of the five capsid proteins in our near-atomic-resolution cryoEM map of the grass carp reovirus virion, a member of the Aquareovirus genus of the Reoviridae family. The refined models allow us to reconstruct a backbone model of the entire grass carp reovirus capsid and provide valuable functional insights that are described in the accompanying publication [Cheng, L., Zhu, J., Hui, W. H., Zhang, X., Honig, B., Fang, Q. & Zhou, Z. H. (2010). Backbone model of an aquareovirus virion by cryo-electron microscopy and bioinformatics. J. Mol. Biol. (this issue). doi:10.1016/j.jmb.2009.12.027.]. Our study demonstrates that the integrated use of homology modeling and a multiscale refinement protocol that combines supervised and automated structure refinement offers a practical strategy for building atomic models based on medium- to high-resolution cryoEM density maps.
Collapse
Affiliation(s)
- Jiang Zhu
- Howard Hughes Medical Institute, Department of Biochemistry and Molecular Biophysics, Columbia University, New York, NY 10032, USA
| | | | | | | | | |
Collapse
|
28
|
Al Nasr K, Sun W, He J. Structure prediction for the helical skeletons detected from the low resolution protein density map. BMC Bioinformatics 2010; 11 Suppl 1:S44. [PMID: 20122218 PMCID: PMC3009517 DOI: 10.1186/1471-2105-11-s1-s44] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022] Open
Abstract
Background The current advances in electron cryo-microscopy technique have made it possible to obtain protein density maps at about 6-10 Å resolution. Although it is hard to derive the protein chain directly from such a low resolution map, the location of the secondary structures such as helices and strands can be computationally detected. It has been demonstrated that such low-resolution map can be used during the protein structure prediction process to enhance the structure prediction. Results We have developed an approach to predict the 3-dimensional structure for the helical skeletons that can be detected from the low resolution protein density map. This approach does not require the construction of the entire chain and distinguishes the structures based on the conformation of the helices. A test with 35 low resolution density maps shows that the highest ranked structure with the correct topology can be found within the top 1% of the list ranked by the effective energy formed by the helices. Conclusion The results in this paper suggest that it is possible to eliminate the great majority of the bad conformations of the helices even without the construction of the entire chain of the protein. For many proteins, the effective contact energy formed by the secondary structures alone can distinguish a small set of likely structures from the pool.
Collapse
Affiliation(s)
- Kamal Al Nasr
- Department of Computer Science, Old Dominion University, Norfolk, VA 23529, USA.
| | | | | |
Collapse
|
29
|
Lindert S, Staritzbichler R, Wötzel N, Karakaş M, Stewart PL, Meiler J. EM-fold: De novo folding of alpha-helical proteins guided by intermediate-resolution electron microscopy density maps. Structure 2009; 17:990-1003. [PMID: 19604479 DOI: 10.1016/j.str.2009.06.001] [Citation(s) in RCA: 69] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2009] [Revised: 05/31/2009] [Accepted: 06/02/2009] [Indexed: 01/22/2023]
Abstract
In medium-resolution (7-10 A) cryo-electron microscopy (cryo-EM) density maps, alpha helices can be identified as density rods whereas beta-strand or loop regions are not as easily discerned. We are proposing a computational protein structure prediction algorithm "EM-Fold" that resolves the density rod connectivity ambiguity by placing predicted alpha helices into the density rods and adding missing backbone coordinates in loop regions. In a benchmark of 11 mainly alpha-helical proteins of known structure a native-like model is identified in eight cases (rmsd 3.9-7.9 A). The three failures can be attributed to inaccuracies in the secondary structure prediction step that precedes EM-Fold. EM-Fold has been applied to the approximately 6 A resolution cryo-EM density map of protein IIIa from human adenovirus. We report the first topological model for the alpha-helical 400 residue N-terminal region of protein IIIa. EM-Fold also has the potential to interpret medium-resolution density maps in X-ray crystallography.
Collapse
Affiliation(s)
- Steffen Lindert
- Department of Chemistry, Vanderbilt University, Nashville, TN 37212, USA
| | | | | | | | | | | |
Collapse
|
30
|
|
31
|
Etzkorn M, Kneuper H, Dünnwald P, Vijayan V, Krämer J, Griesinger C, Becker S, Unden G, Baldus M. Plasticity of the PAS domain and a potential role for signal transduction in the histidine kinase DcuS. Nat Struct Mol Biol 2008; 15:1031-9. [DOI: 10.1038/nsmb.1493] [Citation(s) in RCA: 74] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2008] [Accepted: 08/29/2008] [Indexed: 11/09/2022]
|
32
|
Rusu M, Birmanns S, Wriggers W. Biomolecular pleiomorphism probed by spatial interpolation of coarse models. ACTA ACUST UNITED AC 2008; 24:2460-6. [PMID: 18757874 PMCID: PMC2732278 DOI: 10.1093/bioinformatics/btn461] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022]
Abstract
In low resolution structures of biological assemblies one can often observe conformational deviations that require a flexible rearrangement of structural domains fitted at the atomic level. We are evaluating interpolation methods for the flexible alignment of atomic models based on coarse models. Spatial interpolation is well established in image-processing and visualization to describe the overall deformation or warping of an object or an image. Combined with a coarse representation of the biological system by feature vectors, such methods can provide a flexible approximation of the molecular structure. We have compared three well-known interpolation techniques and evaluated the results by comparing them with constrained molecular dynamics. One method, inverse distance weighting interpolation, consistently produced models that were nearly indistinguishable on the alpha carbon level from the molecular dynamics results. The method is simple to apply and enables flexing of structures by non-expert modelers. This is useful for the basic interpretation of volumetric data in biological applications such as electron microscopy. The method can be used as a general interpretation tool for sparsely sampled motions derived from coarse models.
Collapse
Affiliation(s)
- Mirabela Rusu
- School of Health Information Sciences, University of Texas Health Science Center at Houston, 7000 Fannin St, Suite 600, Houston, TX 77030, USA
| | | | | |
Collapse
|
33
|
Abstract
In fitting atomic structures into EM maps, it often happens that the map corresponds to a different conformation of the structure. We have developed a new methodology to handle these situations that preserves the covalent geometry of the structure and allows the modeling of large deformations. The first goal is achieved by working in generalized coordinates (positional and internal coordinates), and the second by avoiding harmonic potentials. Instead, we use dampers (shock absorbers) between every pair of atoms, combined with a force field that attracts the atomic structure toward incompletely occupied regions of the EM map. The trajectory obtained by integrating the resulting equations of motion converges to a conformation that, in our validation cases, was very close to the target atomic structure. Compared to current methods, our approach is more efficient and robust against wrong solutions and to overfitting, and does not require user intervention or subjective decisions. Applications to the computation of transition pathways between known conformers, homology and loop modeling, as well as protein docking, are also discussed.
Collapse
|
34
|
Abstract
![]()
The objective of molecular electron microscopy (EM) is to use electron
microscopes to visualize the structure of biological molecules. This
Review provides a brief overview of the methods used in molecular
EM, their respective strengths and successes, and current developments
that promise an even more exciting future for molecular EM in the
structural investigation of proteins and macromolecular complexes,
studied in isolation or in the context of cells and tissues.
Collapse
Affiliation(s)
- Henning Stahlberg
- Molecular and Cellular Biology,
College of Biological Sciences, University of California at Davis,
Briggs Hall, 1 Shields Avenue, Davis, California 95616
| | - Thomas Walz
- Department of Cell Biology, Harvard Medical School, 240 Longwood Avenue, Boston, Massachusetts 02115
| |
Collapse
|
35
|
Han R, Leo-Macias A, Zerbino D, Bastolla U, Contreras-Moreira B, Ortiz AR. An efficient conformational sampling method for homology modeling. Proteins 2008; 71:175-88. [PMID: 17985353 DOI: 10.1002/prot.21672] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
The structural refinement of protein models is a challenging problem in protein structure prediction (Moult et al., Proteins 2003;53(Suppl 6):334-339). Most attempts to refine comparative models lead to degradation rather than improvement in model quality, so most current comparative modeling procedures omit the refinement step. However, it has been shown that even in the absence of alignment errors and using optimal templates, methods based on a single template have intrinsic limitations, and that refinement is needed to improve model accuracy. It is thought that failure of current methods originates on one hand from the inaccuracy of the effective free energy functions adopted, which do not represent properly the energetic balance in the native state, and on the other hand from the difficulty to sample the high dimensional and rugged free energy landscape of protein folding, in the search for the global minimum. Here, we address this second issue. We define the evolutionary and vibrational armonics subspace (EVA), a reduced sampling subspace that consists of a combination of evolutionarily favored directions, defined by the principal components of the structural variation within a homologous family, plus topologically favored directions, derived from the low frequency normal modes of the vibrational dynamics, up to 50 dimensions. This subspace is accurate enough so that the cores of most proteins can be represented within 1 A accuracy, and reduced enough so that Replica Exchange Monte Carlo (Hukushima and Nemoto, J Phys Soc Jpn 1996;65:1604-1608; Hukushima et al., Int J Mod Phys C: Phys Comput 1996;7:337-344; Mitsutake et al., J Chem Phys 2003;118:6664-6675; Mitsutake et al., J Chem Phys 2003;118:6676-6688) (REMC) can be applied. REMC is one of the best sampling methods currently available, but its applicability is restricted to spaces of small dimensionality. We show that the combination of the EVA subspace and REMC can essentially solve the optimization problem for backbone atoms in the reduced sampling subspace, even for rather rugged free energy landscapes. Applications and limitations of this methodology are finally discussed.
Collapse
Affiliation(s)
- Rongsheng Han
- Bioinformatics Unit, Centro de Biología Molecular "Severo Ochoa" (CSIC-UAM), Universidad Autónoma de Madrid, Cantoblanco, Madrid, Spain
| | | | | | | | | | | |
Collapse
|
36
|
Topf M, Lasker K, Webb B, Wolfson H, Chiu W, Sali A. Protein structure fitting and refinement guided by cryo-EM density. Structure 2008; 16:295-307. [PMID: 18275820 PMCID: PMC2409374 DOI: 10.1016/j.str.2007.11.016] [Citation(s) in RCA: 265] [Impact Index Per Article: 16.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2007] [Revised: 11/20/2007] [Accepted: 11/26/2007] [Indexed: 11/23/2022]
Abstract
For many macromolecular assemblies, both a cryo-electron microscopy map and atomic structures of its component proteins are available. Here we describe a method for fitting and refining a component structure within its map at intermediate resolution (<15 A). The atomic positions are optimized with respect to a scoring function that includes the crosscorrelation coefficient between the structure and the map as well as stereochemical and nonbonded interaction terms. A heuristic optimization that relies on a Monte Carlo search, a conjugate-gradients minimization, and simulated annealing molecular dynamics is applied to a series of subdivisions of the structure into progressively smaller rigid bodies. The method was tested on 15 proteins of known structure with 13 simulated maps and 3 experimentally determined maps. At approximately 10 A resolution, Calpha rmsd between the initial and final structures was reduced on average by approximately 53%. The method is automated and can refine both experimental and predicted atomic structures.
Collapse
Affiliation(s)
- Maya Topf
- School of Crystallography, Birkbeck College, University of London, Malet Street, London WC1E 7HX, United Kingdom.
| | | | | | | | | | | |
Collapse
|
37
|
Normal-mode flexible fitting of high-resolution structure of biological molecules toward one-dimensional low-resolution data. Biophys J 2007; 94:1589-99. [PMID: 17993489 DOI: 10.1529/biophysj.107.122218] [Citation(s) in RCA: 53] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
We present a method for reconstructing a 3D structure from a pair distribution function by flexibly fitting known x-ray structures toward a conformation that agrees with the low-resolution data. This method uses a linear combination of low-frequency normal modes from elastic-network description of the molecule in an iterative manner to deform the structure optimally to conform to the target pair distribution function. A simple function, pair distance distribution function between atoms, is chosen as a test model to establish computational algorithms-optimization algorithm and scoring function-that can utilize low-resolution 1D data. To select a correct structural model based on less information, we developed a scoring function that takes into account a characteristic of pair distribution functions. In addition, we employ a new optimization algorithm, the trusted region method, that relies on both first and second derivatives of the scoring function. Illustrative results of our studies on simulated 1D data from five different proteins, for which large conformational changes are known to occur, are presented.
Collapse
|
38
|
Brock K, Talley K, Coley K, Kundrotas P, Alexov E. Optimization of electrostatic interactions in protein-protein complexes. Biophys J 2007; 93:3340-52. [PMID: 17693468 PMCID: PMC2072065 DOI: 10.1529/biophysj.107.112367] [Citation(s) in RCA: 45] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open
Abstract
In this article, we present a statistical analysis of the electrostatic properties of 298 protein-protein complexes and 356 domain-domain structures extracted from the previously developed database of protein complexes (ProtCom, http://www.ces.clemson.edu/compbio/protcom). For each structure in the dataset we calculated the total electrostatic energy of the binding and its two components, Coulombic and reaction field energy. It was found that in a vast majority of the cases (>90%), the total electrostatic component of the binding energy was unfavorable. At the same time, the Coulombic component of the binding energy was found to favor the complex formation while the reaction field component of the binding energy opposed the binding. It was also demonstrated that the components in a wild-type (WT) structure are optimized/anti-optimized with respect to the corresponding distributions, arising from random shuffling of the charged side chains. The degree of this optimization was assessed through the Z-score of WT energy in respect to the random distribution. It was found that the Z-scores of Coulombic interactions peak at a considerably negative value for all 654 cases considered while the Z-score of the reaction field energy varied among different types of complexes. All these findings indicate that the Coulombic interactions within WT protein-protein complexes are optimized to favor the complex formation while the total electrostatic energy predominantly opposes the binding. This observation was used to discriminate WT structures among sets of structural decoys and showed that the electrostatic component of the binding energy is not a good discriminator of the WT; while, Coulombic or reaction field energies perform better depending upon the decoy set used.
Collapse
Affiliation(s)
- Kelly Brock
- South Carolina Governor School for Science and Mathematics, Hartsville, South Carolina, USA
| | | | | | | | | |
Collapse
|
39
|
Stein M, Gabdoulline RR, Wade RC. Bridging from molecular simulation to biochemical networks. Curr Opin Struct Biol 2007; 17:166-72. [PMID: 17395455 DOI: 10.1016/j.sbi.2007.03.014] [Citation(s) in RCA: 39] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2006] [Revised: 01/11/2007] [Accepted: 03/16/2007] [Indexed: 11/15/2022]
Abstract
How can we make the connection between the three-dimensional structures of individual proteins and understanding how complex biological systems involving many proteins work? The modelling and simulation of protein structures can help to answer this question for systems ranging from multimacromolecular complexes to organelles and cells. On one hand, multiscale modelling and simulation techniques are advancing to permit the spatial and temporal properties of large systems to be simulated using atomic-detail structures. On the other hand, the estimation of kinetic parameters for the mathematical modelling of biochemical pathways using protein structure information provides a basis for iterative manipulation of biochemical pathways guided by protein structure. Recent advances include the structural modelling of protein complexes on the genomic level, novel coarse-graining strategies to increase the size of the system and the time span that can be simulated, and comparative molecular field analyses to estimate enzyme kinetic parameters.
Collapse
Affiliation(s)
- Matthias Stein
- Molecular and Cellular Modeling Group, EML Research, Schloss-Wolfsbrunnenweg 33, Heidelberg, Germany
| | | | | |
Collapse
|
40
|
Shen MY, Sali A. Statistical potential for assessment and prediction of protein structures. Protein Sci 2007; 15:2507-24. [PMID: 17075131 PMCID: PMC2242414 DOI: 10.1110/ps.062416606] [Citation(s) in RCA: 1765] [Impact Index Per Article: 103.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
Abstract
Protein structures in the Protein Data Bank provide a wealth of data about the interactions that determine the native states of proteins. Using the probability theory, we derive an atomic distance-dependent statistical potential from a sample of native structures that does not depend on any adjustable parameters (Discrete Optimized Protein Energy, or DOPE). DOPE is based on an improved reference state that corresponds to noninteracting atoms in a homogeneous sphere with the radius dependent on a sample native structure; it thus accounts for the finite and spherical shape of the native structures. The DOPE potential was extracted from a nonredundant set of 1472 crystallographic structures. We tested DOPE and five other scoring functions by the detection of the native state among six multiple target decoy sets, the correlation between the score and model error, and the identification of the most accurate non-native structure in the decoy set. For all decoy sets, DOPE is the best performing function in terms of all criteria, except for a tie in one criterion for one decoy set. To facilitate its use in various applications, such as model assessment, loop modeling, and fitting into cryo-electron microscopy mass density maps combined with comparative protein structure modeling, DOPE was incorporated into the modeling package MODELLER-8.
Collapse
Affiliation(s)
- Min-Yi Shen
- Department of Biopharmaceutical Sciences, Department of Pharmaceutical Chemistry, University of California at San Francisco, San Francisco, California 94158, USA.
| | | |
Collapse
|
41
|
Shacham E, Sheehan B, Volkmann N. Density-based score for selecting near-native atomic models of unknown structures. J Struct Biol 2006; 158:188-95. [PMID: 17296314 PMCID: PMC2175034 DOI: 10.1016/j.jsb.2006.12.005] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2006] [Revised: 10/20/2006] [Accepted: 12/07/2006] [Indexed: 11/15/2022]
Abstract
We present a low-resolution density-based scoring scheme for selecting high-quality models from a large pool of lesser quality models. We use pre-configured decoy data sets that contain large numbers of models with different degrees of correctness to evaluate the performance of the strategy. We find that the scoring scheme consistently identifies one of the highest quality models for a wide variety of target structures, resolution ranges, and noise models. Tests with experimental data yield similar results.
Collapse
Affiliation(s)
| | | | - Niels Volkmann
- *Correspondence should be addressed to Niels Volkmann, The Burnham Institute for Medical Research, Bioinformatics and Systems Biology Program, 10901 North Torrey Pines Road, La Jolla, CA 92037, Phone: 858 646 3187, Fax: 858 646 3195, e-mail:
| |
Collapse
|
42
|
Garzón JI, Kovacs J, Abagyan R, Chacón P. ADP_EM: fast exhaustive multi-resolution docking for high-throughput coverage. Bioinformatics 2006; 23:427-33. [PMID: 17150992 DOI: 10.1093/bioinformatics/btl625] [Citation(s) in RCA: 83] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
MOTIVATION Efficient fitting tools are needed to take advantage of a fast growth of atomic models of protein domains from crystallography or comparative modeling, and low-resolution density maps of larger molecular assemblies. Here, we report a novel fitting algorithm for the exhaustive and fast overlay of partial high-resolution models into a low-resolution density map. The method incorporates a fast rotational search based on spherical harmonics (SH) combined with a simple translational scanning. RESULTS This novel combination makes it possible to accurately dock atomic structures into low-resolution electron-density maps in times ranging from seconds to a few minutes. The high-efficiency achieved with simulated and experimental test cases preserves the exhaustiveness needed in these heterogeneous-resolution merging tools. The results demonstrate its efficiency, robustness and high-throughput coverage. AVAILABILITY http://sbg.cib.csic.es/Software/ADP_EM. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- José Ignacio Garzón
- Centro de Investigaciones Biológicas, CSIC Ramiro de Maeztu 9, 28040 Madrid, Spain
| | | | | | | |
Collapse
|
43
|
Kundrotas PJ, Alexov E. PROTCOM: searchable database of protein complexes enhanced with domain-domain structures. Nucleic Acids Res 2006; 35:D575-9. [PMID: 17071962 PMCID: PMC1635331 DOI: 10.1093/nar/gkl768] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
The database of protein complexes (PROTCOM) is a compilation of known 3D structures of protein–protein complexes enriched with artificially created domain–domain structures using the available entries in the Protein Data Bank. The domain–domain structures are generated by parsing single chain structures into loosely connected domains and are important features of the database. The database () could be used for benchmarking purposes of the docking and other algorithms for predicting 3D structures of protein–protein complexes. The database can be utilized as a template database in the homology or threading methods for modeling the 3D structures of unknown protein–protein complexes. PROTCOM provides the scientific community with an integrated set of tools for browsing, searching, visualizing and downloading a pool of protein complexes. The user is given the option to select a subset of entries using a combination of up to 10 different criteria. As on July 2006 the database contains 1770 entries, each of which consists of the known 3D structures and additional relevant information that can be displayed either in text-only or in visual mode.
Collapse
Affiliation(s)
| | - Emil Alexov
- To whom correspondence should be addressed. Tel: +1 864 656 5307; Fax: +1 864 656 0805;
| |
Collapse
|
44
|
Velazquez-Muriel JA, Carazo JMA. Flexible fitting in 3D-EM with incomplete data on superfamily variability. J Struct Biol 2006; 158:165-81. [PMID: 17257856 DOI: 10.1016/j.jsb.2006.10.014] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2006] [Revised: 09/20/2006] [Accepted: 10/13/2006] [Indexed: 11/26/2022]
Abstract
We present a substantial improvement of S-flexfit, our recently proposed method for flexible fitting in three dimensional electron microscopy (3D-EM) at a resolution range of 8-12A, together with a comparison of the method capabilities with Normal Mode Analysis (NMA), application examples and a user's guide. S-flexfit uses the evolutionary information contained in protein domain databases like CATH, by means of the structural alignment of the elements of a protein superfamily. The added development is based on a recent extension of the Singular Value Decomposition (SVD) algorithm specifically designed for situations with missing data: Incremental Singular Value Decomposition (ISVD). ISVD can manage gaps and allows considering more aminoacids in the structural alignment of a superfamily, extending the range of application and producing better models for the fitting step of our methodology. Our previous SVD-based flexible fitting approach can only take into account positions with no gaps in the alignment, being appropriate when the superfamily members are relatively similar and there are few gaps. However, with new data coming from structural proteomics works, the later situation is becoming less likely, making ISVD the technique of choice for further works. We present the results of using ISVD in the process of flexible fitting with both simulated and experimental 3D-EM maps (GroEL and Poliovirus 135S cell entry intermediate).
Collapse
Affiliation(s)
- Javier A Velazquez-Muriel
- Biocomputing Unit, National Center for Biotechnology, Campus Universidad Autónoma de Madrid, 28049 Madrid, Spain
| | | |
Collapse
|
45
|
Berman HM, Burley SK, Chiu W, Sali A, Adzhubei A, Bourne PE, Bryant SH, Dunbrack RL, Fidelis K, Frank J, Godzik A, Henrick K, Joachimiak A, Heymann B, Jones D, Markley JL, Moult J, Montelione GT, Orengo C, Rossmann MG, Rost B, Saibil H, Schwede T, Standley DM, Westbrook JD. Outcome of a workshop on archiving structural models of biological macromolecules. Structure 2006; 14:1211-7. [PMID: 16955948 DOI: 10.1016/j.str.2006.06.005] [Citation(s) in RCA: 48] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Affiliation(s)
- Helen M Berman
- The Research Collaboratory for Structural Bioinformatics Protein Data Bank, Rutgers, The State University of New Jersey, Piscataway, New Jersey 08854, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
46
|
Llorca O, Trujillo A, Blanco FJ, Bernabeu C. Structural model of human endoglin, a transmembrane receptor responsible for hereditary hemorrhagic telangiectasia. J Mol Biol 2006; 365:694-705. [PMID: 17081563 DOI: 10.1016/j.jmb.2006.10.015] [Citation(s) in RCA: 72] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2006] [Revised: 09/08/2006] [Accepted: 10/05/2006] [Indexed: 12/15/2022]
Abstract
Endoglin is a type I membrane protein expressed as a disulphide-linked homodimer on human vascular endothelial cells whose haploinsufficiency is responsible for the dominant vascular dysplasia known as hereditary hemorrhagic telangiectasia (HHT). Structurally, endoglin belongs to the zona pellucida (ZP) family of proteins that share a ZP domain of approximately 260 amino acid residues at their extracellular region. Endoglin is a component of the TGF-beta receptor complex, interacts with the TGF-beta signalling receptors types I and II, and modulates cellular responses to TGF-beta. Here, we have determined for the first time the three-dimensional structure of the approximately 140 kDa extracellular domain of endoglin at 25 A resolution, using single-particle electron microscopy (EM). This reconstruction provides the general architecture of endoglin, which arranges as a dome made of antiparallel oriented monomers enclosing a cavity at one end. A high-resolution structure of endoglin has also been modelled de novo and found to be consistent with the experimental reconstruction. Each subunit comprises three well-defined domains, two of them corresponding to ZP regions, organised into an open U-shaped monomer. This domain arrangement was found to closely resemble the overall structure derived experimentally and the three modelled de novo domains were tentatively assigned to the domains observed in the EM reconstruction. This molecular model was further tested by tagging endoglin's C terminus with an IgG Fc fragment visible after 3D reconstruction of the labelled protein. Combined, these data provide the structural framework to interpret endoglin's functional domains and mutations found in HHT patients.
Collapse
MESH Headings
- Amino Acid Sequence
- Animals
- Antigens, CD/chemistry
- Antigens, CD/ultrastructure
- CHO Cells
- Cricetinae
- Cricetulus
- Endoglin
- Humans
- Membrane Proteins/chemistry
- Membrane Proteins/ultrastructure
- Microscopy, Electron
- Models, Molecular
- Molecular Sequence Data
- Mutation, Missense/genetics
- Protein Structure, Secondary
- Protein Structure, Tertiary
- Receptors, Cell Surface/chemistry
- Receptors, Cell Surface/ultrastructure
- Receptors, Fc/chemistry
- Receptors, Fc/ultrastructure
- Recombinant Proteins/chemistry
- Recombinant Proteins/ultrastructure
- Sequence Homology
- Solubility
- Telangiectasia, Hereditary Hemorrhagic/genetics
- Telangiectasia, Hereditary Hemorrhagic/pathology
Collapse
Affiliation(s)
- Oscar Llorca
- Centro de Investigaciones Biologicas, Consejo Superior de Investigaciones Cientificas (CSIC), Ramiro de Maetzu 9, 28040 Madrid, Spain
| | | | | | | |
Collapse
|
47
|
Velazquez-Muriel JA, Valle M, Santamaría-Pang A, Kakadiaris IA, Carazo JM. Flexible Fitting in 3D-EM Guided by the Structural Variability of Protein Superfamilies. Structure 2006; 14:1115-26. [PMID: 16843893 DOI: 10.1016/j.str.2006.05.013] [Citation(s) in RCA: 35] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2006] [Revised: 04/27/2006] [Accepted: 05/01/2006] [Indexed: 10/24/2022]
Abstract
A method for flexible fitting of molecular models into three-dimensional electron microscopy (3D-EM) reconstructions at a resolution range of 8-12 A is proposed. The approach uses the evolutionarily related structural variability existing among the protein domains of a given superfamily, according to structural databases such as CATH. A structural alignment of domains belonging to the superfamily, followed by a principal components analysis, is performed, and the first three principal components of the decomposition are explored. Using rigid body transformations for the secondary structure elements (SSEs) plus the cyclic coordinate descent algorithm to close the loops, stereochemically correct models are built for the structure to fit. All of the models are fitted into the 3D-EM map, and the best one is selected based on crosscorrelation measures. This work applies the method to both simulated and experimental data and shows that the flexible fitting was able to produce better results than rigid body fitting.
Collapse
Affiliation(s)
- Javier-Angel Velazquez-Muriel
- Biocomputing Unit, National Center for Biotechnology-CSIC, Campus Universidad Autónoma de Madrid, 28049 Madrid, Spain
| | | | | | | | | |
Collapse
|
48
|
Abstract
Statistical electrostatic analysis of 37 protein-protein complexes extracted from the previously developed database of protein complexes (ProtCom, http://www.ces.clemson.edu/compbio/protcom) is presented. It is shown that small interfaces have a higher content of charged and polar groups compared to large interfaces. In a vast majority of the cases the average pKa shifts for acidic residues induced by the complex formation are negative, indicating that complex formation stabilizes their ionizable states, whereas the histidines are predicted to destabilize the complex. The individual pKa shifts show the same tendency since 80% of the interfacial acidic groups were found to lower their pKas, whereas only 25% of histidines raise their pKa upon the complex formation. The interfacial groups have been divided into three sets according to the mechanism of their pKa shift, and statistical analysis of each set was performed. It was shown that the optimum pH values (pH of maximal stability) of the complex tend to be the same as the optimum pH values of the complex components. This finding can be used in the homology-based prediction of the 3D structures of protein complexes, especially when one needs to evaluate and rank putative models. It is more likely for a model to be correct if both components of the model complex and the entire complex have the same or at least similar values of the optimum pH.
Collapse
Affiliation(s)
- Petras J Kundrotas
- Computational Biophysics and Bioinformatics, Department of Physics and Astronomy, Clemson University, Clemson, South Carolina 29634, USA
| | | |
Collapse
|
49
|
Davis FP, Braberg H, Shen MY, Pieper U, Sali A, Madhusudhan M. Protein complex compositions predicted by structural similarity. Nucleic Acids Res 2006; 34:2943-52. [PMID: 16738133 PMCID: PMC1474056 DOI: 10.1093/nar/gkl353] [Citation(s) in RCA: 52] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022] Open
Abstract
Proteins function through interactions with other molecules. Thus, the network of physical interactions among proteins is of great interest to both experimental and computational biologists. Here we present structure-based predictions of 3387 binary and 1234 higher order protein complexes in Saccharomyces cerevisiae involving 924 and 195 proteins, respectively. To generate candidate complexes, comparative models of individual proteins were built and combined together using complexes of known structure as templates. These candidate complexes were then assessed using a statistical potential, derived from binary domain interfaces in PIBASE (). The statistical potential discriminated a benchmark set of 100 interface structures from a set of sequence-randomized negative examples with a false positive rate of 3% and a true positive rate of 97%. Moreover, the predicted complexes were also filtered using functional annotation and sub-cellular localization data. The ability of the method to select the correct binding mode among alternates is demonstrated for three camelid VHH domain—porcine α–amylase interactions. We also highlight the prediction of co-complexed domain superfamilies that are not present in template complexes. Through integration with MODBASE, the application of the method to proteomes that are less well characterized than that of S.cerevisiae will contribute to expansion of the structural and functional coverage of protein interaction space. The predicted complexes are deposited in MODBASE ().
Collapse
Affiliation(s)
- Fred P. Davis
- Department of Biopharmaceutical Sciences, California Institute for Quantitative Biomedical Research, University of CaliforniaSan Francisco, 1700 4th Street, Byers Hall, San Francisco, CA 94143-2552, USA
- Department of Pharmaceutical Chemistry, California Institute for Quantitative Biomedical Research, University of CaliforniaSan Francisco, 1700 4th Street, Byers Hall, San Francisco, CA 94143-2552, USA
| | - Hannes Braberg
- Department of Biopharmaceutical Sciences, California Institute for Quantitative Biomedical Research, University of CaliforniaSan Francisco, 1700 4th Street, Byers Hall, San Francisco, CA 94143-2552, USA
- Department of Pharmaceutical Chemistry, California Institute for Quantitative Biomedical Research, University of CaliforniaSan Francisco, 1700 4th Street, Byers Hall, San Francisco, CA 94143-2552, USA
| | - Min-Yi Shen
- Department of Biopharmaceutical Sciences, California Institute for Quantitative Biomedical Research, University of CaliforniaSan Francisco, 1700 4th Street, Byers Hall, San Francisco, CA 94143-2552, USA
- Department of Pharmaceutical Chemistry, California Institute for Quantitative Biomedical Research, University of CaliforniaSan Francisco, 1700 4th Street, Byers Hall, San Francisco, CA 94143-2552, USA
| | - Ursula Pieper
- Department of Biopharmaceutical Sciences, California Institute for Quantitative Biomedical Research, University of CaliforniaSan Francisco, 1700 4th Street, Byers Hall, San Francisco, CA 94143-2552, USA
- Department of Pharmaceutical Chemistry, California Institute for Quantitative Biomedical Research, University of CaliforniaSan Francisco, 1700 4th Street, Byers Hall, San Francisco, CA 94143-2552, USA
| | - Andrej Sali
- Department of Biopharmaceutical Sciences, California Institute for Quantitative Biomedical Research, University of CaliforniaSan Francisco, 1700 4th Street, Byers Hall, San Francisco, CA 94143-2552, USA
- Department of Pharmaceutical Chemistry, California Institute for Quantitative Biomedical Research, University of CaliforniaSan Francisco, 1700 4th Street, Byers Hall, San Francisco, CA 94143-2552, USA
- Correspondence may also be addressed to A. Sali. Tel: +1 415 514 4227; Fax: +1 415 514 4231;
| | - M.S. Madhusudhan
- Department of Biopharmaceutical Sciences, California Institute for Quantitative Biomedical Research, University of CaliforniaSan Francisco, 1700 4th Street, Byers Hall, San Francisco, CA 94143-2552, USA
- Department of Pharmaceutical Chemistry, California Institute for Quantitative Biomedical Research, University of CaliforniaSan Francisco, 1700 4th Street, Byers Hall, San Francisco, CA 94143-2552, USA
- To whom correspondence should be addressed. Tel: + 1 415 514 4232; Fax: +1 415 514 4231;
| |
Collapse
|
50
|
de Bakker PIW, Furnham N, Blundell TL, DePristo MA. Conformer generation under restraints. Curr Opin Struct Biol 2006; 16:160-5. [PMID: 16483766 DOI: 10.1016/j.sbi.2006.02.001] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2005] [Revised: 01/17/2006] [Accepted: 02/06/2006] [Indexed: 10/25/2022]
Abstract
Conformational sampling by direct optimization of an all-atom energy function is ineffective and inefficient because of the ruggedness of the energy landscape. Discrete sampling schemes represent an attractive alternative for generating ensembles of conformers consistent with spatial restraints derived from empirical data. Conformational sampling is becoming increasingly important for structure prediction as the bottleneck in accurate prediction shifts from energy functions to the methods used to find low-energy conformers. Experimental structure determination remains a perennial challenge as investigators tackle larger macromolecular systems, and begin to incorporate more complete descriptions of uncertainty, heterogeneity and dynamics into their models. Computational approaches that combine dense, discrete sampling with all-atom energy evaluation and refinement may help to overcome the remaining barriers to solving these problems.
Collapse
Affiliation(s)
- Paul I W de Bakker
- Department of Molecular Biology and Center for Human Genetic Research, Massachusetts General Hospital, and Department of Genetics, Harvard Medical School, Boston, MA 02114-2790, USA
| | | | | | | |
Collapse
|