1
|
Cretin G, Périn C, Zimmermann N, Galochkina T, Gelly JC. ICARUS: flexible protein structural alignment based on Protein Units. Bioinformatics 2023; 39:btad459. [PMID: 37498544 PMCID: PMC10400377 DOI: 10.1093/bioinformatics/btad459] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2022] [Revised: 07/04/2023] [Accepted: 07/26/2023] [Indexed: 07/28/2023] Open
Abstract
MOTIVATION Alignment of protein structures is a major problem in structural biology. The first approach commonly used is to consider proteins as rigid bodies. However, alignment of protein structures can be very complex due to conformational variability, or complex evolutionary relationships between proteins such as insertions, circular permutations or repetitions. In such cases, introducing flexibility becomes useful for two reasons: (i) it can help compare two protein chains which adopted two different conformational states, such as due to proteins/ligands interaction or post-translational modifications, and (ii) it aids in the identification of conserved regions in proteins that may have distant evolutionary relationships. RESULTS We propose ICARUS, a new approach for flexible structural alignment based on identification of Protein Units, evolutionarily preserved structural descriptors of intermediate size, between secondary structures and domains. ICARUS significantly outperforms reference methods on a dataset of very difficult structural alignments. AVAILABILITY AND IMPLEMENTATION Code is freely available online at https://github.com/DSIMB/ICARUS.
Collapse
Affiliation(s)
- Gabriel Cretin
- Université Paris Cité and Université des Antilles and Université de la Réunion, INSERM, BIGR, F-75015 Paris, France
- Laboratoire d’Excellence GR-Ex, 75015 Paris, France
| | - Charlotte Périn
- Université Paris Cité and Université des Antilles and Université de la Réunion, INSERM, BIGR, F-75015 Paris, France
- Laboratoire d’Excellence GR-Ex, 75015 Paris, France
- TBI, Université de Toulouse, CNRS, INRAE, INSA, 31077 Toulouse, France
| | - Nicolas Zimmermann
- Université Paris Cité and Université des Antilles and Université de la Réunion, INSERM, BIGR, F-75015 Paris, France
- Laboratoire d’Excellence GR-Ex, 75015 Paris, France
| | - Tatiana Galochkina
- Université Paris Cité and Université des Antilles and Université de la Réunion, INSERM, BIGR, F-75015 Paris, France
- Laboratoire d’Excellence GR-Ex, 75015 Paris, France
| | - Jean-Christophe Gelly
- Université Paris Cité and Université des Antilles and Université de la Réunion, INSERM, BIGR, F-75015 Paris, France
- Laboratoire d’Excellence GR-Ex, 75015 Paris, France
| |
Collapse
|
2
|
Zhao H, Xu C, Wang T, Liu J. Biomimetic Construction of Artificial Selenoenzymes. Biomimetics (Basel) 2023; 8:biomimetics8010054. [PMID: 36810385 PMCID: PMC9944854 DOI: 10.3390/biomimetics8010054] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2023] [Revised: 01/23/2023] [Accepted: 01/24/2023] [Indexed: 01/31/2023] Open
Abstract
Selenium exists in the form of selenocysteines in selenoproteins and plays a pivotal role in the catalytic process of the antioxidative enzymes. In order to study the structural and functional properties of selenium in selenoproteins, explore the significance of the role of selenium in the fields of biology and chemistry, scientists conducted a series of artificial simulations on selenoproteins. In this review, we sum up the progress and developed strategies in the construction of artificial selenoenzyme. Using different mechanisms from different catalytic angles, selenium-containing catalytic antibodies, semi-synthetic selenonezyme, and the selenium-containing molecularly imprinted enzymes have been constructed. A variety of synthetic selenoenzyme models have been designed and constructed by selecting host molecules such as cyclodextrins, dendrimers, and hyperbranched polymers as the main scaffolds. Then, a variety of selenoprotein assemblies as well as cascade antioxidant nanoenzymes were built by using electrostatic interaction, metal coordination, and host-guest interaction. The unique redox properties of selenoenzyme glutathione peroxidase (GPx) can be reproduced.
Collapse
|
3
|
Taheri-Ledari M, Zandieh A, Shariatpanahi SP, Eslahchi C. Assignment of structural domains in proteins using diffusion kernels on graphs. BMC Bioinformatics 2022; 23:369. [PMID: 36076174 PMCID: PMC9461149 DOI: 10.1186/s12859-022-04902-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2021] [Accepted: 08/23/2022] [Indexed: 11/10/2022] Open
Abstract
Though proposing algorithmic approaches for protein domain decomposition has been of high interest, the inherent ambiguity to the problem makes it still an active area of research. Besides, accurate automated methods are in high demand as the number of solved structures for complex proteins is on the rise. While majority of the previous efforts for decomposition of 3D structures are centered on the developing clustering algorithms, employing enhanced measures of proximity between the amino acids has remained rather uncharted. If there exists a kernel function that in its reproducing kernel Hilbert space, structural domains of proteins become well separated, then protein structures can be parsed into domains without the need to use a complex clustering algorithm. Inspired by this idea, we developed a protein domain decomposition method based on diffusion kernels on protein graphs. We examined all combinations of four graph node kernels and two clustering algorithms to investigate their capability to decompose protein structures. The proposed method is tested on five of the most commonly used benchmark datasets for protein domain assignment plus a comprehensive non-redundant dataset. The results show a competitive performance of the method utilizing one of the diffusion kernels compared to four of the best automatic methods. Our method is also able to offer alternative partitionings for the same structure which is in line with the subjective definition of protein domain. With a competitive accuracy and balanced performance for the simple and complex structures despite relying on a relatively naive criterion to choose optimal decomposition, the proposed method revealed that diffusion kernels on graphs in particular, and kernel functions in general are promising measures to facilitate parsing proteins into domains and performing different structural analysis on proteins. The size and interconnectedness of the protein graphs make them promising targets for diffusion kernels as measures of affinity between amino acids. The versatility of our method allows the implementation of future kernels with higher performance. The source code of the proposed method is accessible at https://github.com/taherimo/kludo . Also, the proposed method is available as a web application from https://cbph.ir/tools/kludo .
Collapse
Affiliation(s)
- Mohammad Taheri-Ledari
- Department of Bioinformatics, Institute of Biochemistry and Biophysics (IBB), University of Tehran, Tehran, Iran
| | - Amirali Zandieh
- Department of Biophysics, Institute of Biochemistry and Biophysics (IBB), University of Tehran, Tehran, Iran
| | - Seyed Peyman Shariatpanahi
- Department of Biophysics, Institute of Biochemistry and Biophysics (IBB), University of Tehran, Tehran, Iran
| | - Changiz Eslahchi
- Department of Computer and Data Sciences, Faculty of Mathematical Sciences, Shahid Beheshti University, Tehran, Iran. .,School of Biological Sciences, Institute for Research in Fundamental Sciences (IPM), Tehran, Iran.
| |
Collapse
|
4
|
Cretin G, Galochkina T, Vander Meersche Y, de Brevern AG, Postic G, Gelly JC. SWORD2: hierarchical analysis of protein 3D structures. Nucleic Acids Res 2022; 50:W732-W738. [PMID: 35580056 PMCID: PMC9252838 DOI: 10.1093/nar/gkac370] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2022] [Revised: 04/19/2022] [Accepted: 04/29/2022] [Indexed: 11/27/2022] Open
Abstract
Understanding the functions and origins of proteins requires splitting these macromolecules into fragments that could be independent in terms of folding, activity, or evolution. For that purpose, structural domains are the typical level of analysis, but shorter segments, such as subdomains and supersecondary structures, are insightful as well. Here, we propose SWORD2, a web server for exploring how an input protein structure may be decomposed into ‘Protein Units’ that can be hierarchically assembled to delimit structural domains. For each partitioning solution, the relevance of the identified substructures is estimated through different measures. This multilevel analysis is achieved by integrating our previous work on domain delineation, ‘protein peeling’ and model quality assessment. We hope that SWORD2 will be useful to biologists searching for key regions in their proteins of interest and to bioinformaticians building datasets of protein structures. The web server is freely available online: https://www.dsimb.inserm.fr/SWORD2.
Collapse
Affiliation(s)
- Gabriel Cretin
- Université Paris Cité and Université des Antilles and Université de la Réunion, INSERM, BIGR, F-75015 Paris, France.,Laboratoire d'Excellence GR-Ex, 75015 Paris, France
| | - Tatiana Galochkina
- Université Paris Cité and Université des Antilles and Université de la Réunion, INSERM, BIGR, F-75015 Paris, France.,Laboratoire d'Excellence GR-Ex, 75015 Paris, France
| | - Yann Vander Meersche
- Université Paris Cité and Université des Antilles and Université de la Réunion, INSERM, BIGR, F-75015 Paris, France.,Laboratoire d'Excellence GR-Ex, 75015 Paris, France
| | - Alexandre G de Brevern
- Université Paris Cité and Université des Antilles and Université de la Réunion, INSERM, BIGR, F-75015 Paris, France.,Laboratoire d'Excellence GR-Ex, 75015 Paris, France
| | - Guillaume Postic
- Université Paris-Saclay, Univ Evry, IBISC, 91020 Evry-Courcouronnes, France
| | - Jean-Christophe Gelly
- Université Paris Cité and Université des Antilles and Université de la Réunion, INSERM, BIGR, F-75015 Paris, France.,Laboratoire d'Excellence GR-Ex, 75015 Paris, France
| |
Collapse
|
5
|
Analysis of Integrin α IIb Subunit Dynamics Reveals Long-Range Effects of Missense Mutations on Calf Domains. Int J Mol Sci 2022; 23:ijms23020858. [PMID: 35055046 PMCID: PMC8776176 DOI: 10.3390/ijms23020858] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2021] [Revised: 12/23/2021] [Accepted: 12/30/2021] [Indexed: 11/17/2022] Open
Abstract
Integrin αIIbβ3, a glycoprotein complex expressed at the platelet surface, is involved in platelet aggregation and contributes to primary haemostasis. Several integrin αIIbβ3 polymorphisms prevent the aggregation that causes haemorrhagic syndromes, such as Glanzmann thrombasthenia (GT). Access to 3D structure allows understanding the structural effects of polymorphisms related to GT. In a previous analysis using Molecular Dynamics (MD) simulations of αIIbCalf-1 domain structure, it was observed that GT associated with single amino acid variation affects distant loops, but not the mutated position. In this study, experiments are extended to Calf-1, Thigh, and Calf-2 domains. Two loops in Calf-2 are unstructured and therefore are modelled expertly using biophysical restraints. Surprisingly, MD revealed the presence of rigid zones in these loops. Detailed analysis with structural alphabet, the Proteins Blocks (PBs), allowed observing local changes in highly flexible regions. The variant P741R located at C-terminal of Calf-1 revealed that the Calf-2 presence did not affect the results obtained with isolated Calf-1 domain. Simulations for Calf-1 + Calf-2, and Thigh + Calf-1 variant systems are designed to comprehend the impact of five single amino acid variations in these domains. Distant conformational changes are observed, thus highlighting the potential role of allostery in the structural basis of GT.
Collapse
|
6
|
Pal A, Mulumudy R, Mitra P. Modularity-based parallel protein design algorithm with an implementation using shared memory programming. Proteins 2021; 90:658-669. [PMID: 34651333 DOI: 10.1002/prot.26263] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2021] [Revised: 09/23/2021] [Accepted: 10/01/2021] [Indexed: 01/08/2023]
Abstract
Given a target protein structure, the prime objective of protein design is to find amino acid sequences that will fold/acquire to the given three-dimensional structure. The protein design problem belongs to the non-deterministic polynomial-time-hard class as sequence search space increases exponentially with protein length. To ensure better search space exploration and faster convergence, we propose a protein modularity-based parallel protein design algorithm. The modular architecture of the protein structure is exploited by considering an intermediate structural organization between secondary structure and domain defined as protein unit (PU). Here, we have incorporated a divide-and-conquer approach where a protein is split into PUs and each PU region is explored in a parallel fashion. It has been further analyzed that our shared memory implementation of modularity-based parallel sequence search leads to better search space exploration compared to the case of traditional full protein design. Sequence-based analysis on design sequences depicts an average of 39.7% sequence similarity on the benchmark data set. Structure-based comparison of the modeled structures of the design protein with the target structure exhibited an average root-mean-square deviation of 1.17 Å and an average template modeling score of 0.89. The selected modeled structures of the design protein sequences are validated using 100 ns molecular dynamics simulations where 80% of the proteins have shown better or similar stability to the respective target proteins. Our study informs that our modularity-based protein design algorithm can be extended to protein interaction design as well.
Collapse
Affiliation(s)
- Abantika Pal
- Department of Computer Science and Engineering, Indian Institute of Technology Kharagpur, Kharagpur, West Bengal, India
| | - Rohith Mulumudy
- Department of Computer Science and Engineering, Indian Institute of Technology Kharagpur, Kharagpur, West Bengal, India
| | - Pralay Mitra
- Department of Computer Science and Engineering, Indian Institute of Technology Kharagpur, Kharagpur, West Bengal, India
| |
Collapse
|
7
|
Abstract
The Covid-19 a pandemic infectious disease and affected life across the world resulting in over 188.65 million confirmed cases across 223 countries, territories and areas with 4.06 million deaths. It is caused by a severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) and spike (S) protein of SARS-CoV-2, which plays a key role in the receptor recognition and cell membrane fusion process, is composed of two subunits, S1 and S2. The S1 subunit contains a receptor-binding domain (RBD) that recognizes and binds to the host receptor angiotensin-converting enzyme 2 (ACE2), while the S2 subunit mediates viral cell membrane fusion. Hence, it is a key target for developing neutralizing antibodies. Here, we have performed phylogenetic analysis and structural modeling of the SARS-CoV-2 spike glycoprotein, which is found highly conserved. The overall percent protein sequence identity from the SARS-CoV-2 spike protein sequences from the NCBI database was 99.68%. The functional domains of the S protein reveal that the S1 subunit was highly conserved (99.70%) than the S2 subunit (99.66%). Further, the 319–541 residues (RBD) of amino acids within the S1 domain were 100% similar among the spike protein. The 3D modeling of SARS-CoV-2 spike glycoprotein indicated that S protein has four domains with five protein units and the S1 subunit from 1 to 289 amino acid of domain 1 is highly conserved without any change in the ligand interaction site. This analysis clearly suggests that the S1 subunit (RBD 319–541) can be used as a target region for stable and safe vaccine development.
Collapse
|
8
|
Laine E, Grudinin S. HOPMA: Boosting Protein Functional Dynamics with Colored Contact Maps. J Phys Chem B 2021; 125:2577-2588. [PMID: 33687221 DOI: 10.1021/acs.jpcb.0c11633] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/20/2023]
Abstract
In light of the recent very rapid progress in protein structure prediction, accessing the multitude of functional protein states is becoming more central than ever before. Indeed, proteins are flexible macromolecules, and they often perform their function by switching between different conformations. However, high-resolution experimental techniques such as X-ray crystallography and cryogenic electron microscopy can catch relatively few protein functional states. Many others are only accessible under physiological conditions in solution. Therefore, there is a pressing need to fill this gap with computational approaches. We present HOPMA, a novel method to predict protein functional states and transitions by using a modified elastic network model. The method exploits patterns in a protein contact map, taking its 3D structure as input, and excludes some disconnected patches from the elastic network. Combined with nonlinear normal mode analysis, this strategy boosts the protein conformational space exploration, especially when the input structure is highly constrained, as we demonstrate on a set of more than 400 transitions. Our results let us envision the discovery of new functional conformations, which were unreachable previously, starting from the experimentally known protein structures. The method is computationally efficient and available at https://github.com/elolaine/HOPMA and https://team.inria.fr/nano-d/software/nolb-normal-modes.
Collapse
Affiliation(s)
- Elodie Laine
- CNRS, IBPS, Laboratoire de Biologie Computationnelle et Quantitative (LCQB), Sorbonne Université, 75005 Paris, France
| | - Sergei Grudinin
- CNRS, Inria, Grenoble INP, LJK, Univ. Grenoble Alpes, 38000 Grenoble, France
| |
Collapse
|
9
|
Craveur P, Narwani TJ, Rebehmed J, de Brevern AG. Investigation of the impact of PTMs on the protein backbone conformation. Amino Acids 2019; 51:1065-1079. [DOI: 10.1007/s00726-019-02747-w] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2018] [Accepted: 05/18/2019] [Indexed: 12/17/2022]
|
10
|
Lai JI, Verma D, Bailey-Kellogg C, Ackerman ME. Towards conformational fidelity of a quaternary HIV-1 epitope: computational design and directed evolution of a minimal V1V2 antigen. Protein Eng Des Sel 2018; 31:121-133. [PMID: 29897567 PMCID: PMC6030936 DOI: 10.1093/protein/gzy010] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2017] [Revised: 04/16/2018] [Accepted: 04/24/2018] [Indexed: 12/11/2022] Open
Abstract
Structure-based approaches to antigen design utilize insights from antibody (Ab):antigen interactions and a refined understanding of protective Ab responses to engineer novel antigens presenting epitopes with conformations relevant to eliciting or discovering protective humoral responses. For human immunodeficiency virus-1 (HIV-1), one model of protection is provided by broadly neutralizing Abs (bnAbs) against epitopes present in the closed prefusion trimeric conformation of HIV-1 envelope glycoprotein, such as the variable loops 1-2 (V1V2) apex. Here, computational design and directed evolution yielded a novel V1V2 sequence variant with potential utility for inclusion in an immunogen for eliciting bnAbs, or as an epitope probe for their detection. The computational design goal was to engineer a minimal single-chain antigen with three copies of the V1V2 loops to support maintenance of closed prefusion V1V2 trimeric conformation and presentation of bnAb epitopes. Via directed evolution of this computationally designed single-chain antigen, we isolated a V1V2 sequence variant that in monomeric form exhibited preferential recognition by quaternary-preferring and conformation-dependent mAbs. Structural context and transferability of this phenotype to V1V2 sequences from all strains of HIV-1 tested suggest a conformation-stabilizing effect. This example demonstrates the potential utility of computational design and directed evolution-based protein engineering strategies to develop minimal, conformation-stabilized epitope-specific antigens.
Collapse
Affiliation(s)
- Jennifer I Lai
- Thayer School of Engineering, Dartmouth College, 14 Engineering Dr, Hanover NH, USA
| | - Deeptak Verma
- Department of Computer Science, Dartmouth College, 9 Maynard St, Hanover NH, USA
| | - Chris Bailey-Kellogg
- Department of Computer Science, Dartmouth College, 9 Maynard St, Hanover NH, USA
| | - Margaret E Ackerman
- Thayer School of Engineering, Dartmouth College, 14 Engineering Dr, Hanover NH, USA
- Department of Microbiology and Immunology, Geisel School of Medicine, Dartmouth College, 1 Medical Center Dr, Lebanon NH, USA
| |
Collapse
|
11
|
In silico analysis of Glanzmann variants of Calf-1 domain of α IIbβ 3 integrin revealed dynamic allosteric effect. Sci Rep 2017; 7:8001. [PMID: 28808266 PMCID: PMC5556033 DOI: 10.1038/s41598-017-08408-w] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2017] [Accepted: 07/12/2017] [Indexed: 11/08/2022] Open
Abstract
Integrin αIIbβ3 mediates platelet aggregation and thrombus formation. In a rare hereditary bleeding disorder, Glanzmann thrombasthenia (GT), αIIbβ3 expression / function are impaired. The impact of deleterious missense mutations on the complex structure remains unclear. Long independent molecular dynamics (MD) simulations were performed for 7 GT variants and reference structure of the Calf-1 domain of αIIb. Simulations were analysed using a structural alphabet to describe local protein conformations. Common and flexible regions as well as deformable zones were observed in all the structures. The most flexible region of Calf-1 (with highest B-factor) is rather a rigid region encompassed into two deformable zones. Each mutated structure barely showed any modifications at the mutation sites while distant conformational changes were observed. These unexpected results question the relationship between molecular dynamics and allostery; and the role of these long-range effects in the impaired αIIbβ3 expression. This method is aimed at studying all αIIbβ3 sub-domains and impact of missense mutations at local and global structural level.
Collapse
|
12
|
Postic G, Ghouzam Y, Chebrek R, Gelly JC. An ambiguity principle for assigning protein structural domains. SCIENCE ADVANCES 2017; 3:e1600552. [PMID: 28097215 PMCID: PMC5235333 DOI: 10.1126/sciadv.1600552] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/18/2016] [Accepted: 11/28/2016] [Indexed: 05/20/2023]
Abstract
Ambiguity is the quality of being open to several interpretations. For an image, it arises when the contained elements can be delimited in two or more distinct ways, which may cause confusion. We postulate that it also applies to the analysis of protein three-dimensional structure, which consists in dividing the molecule into subunits called domains. Because different definitions of what constitutes a domain can be used to partition a given structure, the same protein may have different but equally valid domain annotations. However, knowledge and experience generally displace our ability to accept more than one way to decompose the structure of an object-in this case, a protein. This human bias in structure analysis is particularly harmful because it leads to ignoring potential avenues of research. We present an automated method capable of producing multiple alternative decompositions of protein structure (web server and source code available at www.dsimb.inserm.fr/sword/). Our innovative algorithm assigns structural domains through the hierarchical merging of protein units, which are evolutionarily preserved substructures that describe protein architecture at an intermediate level, between domain and secondary structure. To validate the use of these protein units for decomposing protein structures into domains, we set up an extensive benchmark made of expert annotations of structural domains and including state-of-the-art domain parsing algorithms. The relevance of our "multipartitioning" approach is shown through numerous examples of applications covering protein function, evolution, folding, and structure prediction. Finally, we introduce a measure for the structural ambiguity of protein molecules.
Collapse
Affiliation(s)
- Guillaume Postic
- INSERM U1134, Paris, France
- Université Paris Diderot, Sorbonne Paris Cité, UMR_S 1134, Paris, France
- Institut National de la Transfusion Sanguine, Paris, France
- Laboratory of Excellence GR-Ex, Paris, France
- Corresponding author. (G.P.); (J.-C.G.)
| | - Yassine Ghouzam
- INSERM U1134, Paris, France
- Université Paris Diderot, Sorbonne Paris Cité, UMR_S 1134, Paris, France
- Institut National de la Transfusion Sanguine, Paris, France
- Laboratory of Excellence GR-Ex, Paris, France
| | - Romain Chebrek
- INSERM U1134, Paris, France
- Université Paris Diderot, Sorbonne Paris Cité, UMR_S 1134, Paris, France
- Institut National de la Transfusion Sanguine, Paris, France
- Laboratory of Excellence GR-Ex, Paris, France
| | - Jean-Christophe Gelly
- INSERM U1134, Paris, France
- Université Paris Diderot, Sorbonne Paris Cité, UMR_S 1134, Paris, France
- Institut National de la Transfusion Sanguine, Paris, France
- Laboratory of Excellence GR-Ex, Paris, France
- Corresponding author. (G.P.); (J.-C.G.)
| |
Collapse
|
13
|
Esque J, Léonard S, de Brevern AG, Oguey C. VLDP web server: a powerful geometric tool for analysing protein structures in their environment. Nucleic Acids Res 2013; 41:W373-8. [PMID: 23761450 PMCID: PMC3692094 DOI: 10.1093/nar/gkt509] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022] Open
Abstract
Protein structures are an ensemble of atoms determined experimentally mostly by X-ray crystallography or Nuclear Magnetic Resonance. Studying 3D protein structures is a key point for better understanding protein function at a molecular level. We propose a set of accurate tools, for analysing protein structures, based on the reliable method of Voronoi–Laguerre tessellations. The Voronoi Laguerre Delaunay Protein web server (VLDPws) computes the Laguerre tessellation on a whole given system first embedded in solvent. Through this fine description, VLDPws gives the following data: (i) Amino acid volumes evaluated with high precision, as confirmed by good correlations with experimental data. (ii) A novel definition of inter-residue contacts within the given protein. (iii) A measure of the residue exposure to solvent that significantly improves the standard notion of accessibility in some cases. At present, no equivalent web server is available. VLDPws provides output in two complementary forms: direct visualization of the Laguerre tessellation, mostly its polygonal molecular surfaces; files of volumes; and areas, contacts and similar data for each residue and each atom. These files are available for download for further analysis. VLDPws can be accessed at http://www.dsimb.inserm.fr/dsimb_tools/vldp.
Collapse
Affiliation(s)
- Jérémy Esque
- LPTM, CNRS UMR 8089, Université Cergy-Pontoise, F-95302 Cergy-Pontoise, France
| | | | | | | |
Collapse
|
14
|
Léonard S, Joseph AP, Srinivasan N, Gelly JC, de Brevern AG. mulPBA: an efficient multiple protein structure alignment method based on a structural alphabet. J Biomol Struct Dyn 2013; 32:661-8. [PMID: 23659291 DOI: 10.1080/07391102.2013.787026] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
Abstract
The increasing number of available protein structures requires efficient tools for multiple structure comparison. Indeed, multiple structural alignments are essential for the analysis of function, evolution and architecture of protein structures. For this purpose, we proposed a new web server called multiple Protein Block Alignment (mulPBA). This server implements a method based on a structural alphabet to describe the backbone conformation of a protein chain in terms of dihedral angles. This 'sequence-like' representation enables the use of powerful sequence alignment methods for primary structure comparison, followed by an iterative refinement of the structural superposition. This approach yields alignments superior to most of the rigid-body alignment methods and highly comparable with the flexible structure comparison approaches. We implement this method in a web server designed to do multiple structure superimpositions from a set of structures given by the user. Outputs are given as both sequence alignment and superposed 3D structures visualized directly by static images generated by PyMol or through a Jmol applet allowing dynamic interaction. Multiple global quality measures are given. Relatedness between structures is indicated by a distance dendogram. Superimposed structures in PDB format can be also downloaded, and the results are quickly obtained. mulPBA server can be accessed at www.dsimb.inserm.fr/dsimb_tools/mulpba/ .
Collapse
Affiliation(s)
- Sylvain Léonard
- a INSERM UMR-S 665, DSIMB , 6, rue Alexandre Cabanel, F-75739 , Paris , France
| | | | | | | | | |
Collapse
|
15
|
Rebehmed J, Alphand V, de Berardinis V, de Brevern AG. Evolution study of the Baeyer-Villiger monooxygenases enzyme family: functional importance of the highly conserved residues. Biochimie 2013; 95:1394-402. [PMID: 23523772 DOI: 10.1016/j.biochi.2013.03.005] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2013] [Accepted: 03/08/2013] [Indexed: 11/19/2022]
Abstract
Baeyer-Villiger monooxygenases (BVMOs) catalyze the transformation of linear and cyclic ketones into their corresponding esters and lactones by introducing an oxygen atom into a C-C bond. This bioreaction has numerous advantages compared to its chemical version; it does not induce the use of potentially harmful reagents (i.e., green chemistry) and displays significant better enantio- and regio-selectivity. New potential BVMOs were searched using sequence homology for type I BVMO proteins. 116 new sequences were identified as new putative BVMOs respecting the defined selection criteria. Multiple sequence alignments were carried out on the selected sequences to study the conservation of structurally and/or functionally important amino acids during evolution. Type I BVMO signature motif was found to be conserved in 94.8% of the sequences. We noticed also the highly conserved - but previously unnoticed - Threonine 167 (93.1%), located in the signature motif; this position could be added in the pattern used to characterize specific Type I enzymes. Amino acids at the vicinity of the FAD and NADPH cofactors were found also to be highly conserved and the details of the interactions were emphasized. Interestingly, residues at the enzyme binding site were found less conserved in terms of sequence evolution, leading sometimes to some important amino acid changes. These behaviors could explain the enzyme selectivity and specificity for different ligands.
Collapse
|
16
|
Gelly JC, Lin HY, de Brevern AG, Chuang TJ, Chen FC. Selective constraint on human pre-mRNA splicing by protein structural properties. Genome Biol Evol 2012; 4:966-75. [PMID: 22936073 PMCID: PMC3468958 DOI: 10.1093/gbe/evs071] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
Abstract
Alternative splicing (AS) is a major mechanism of increasing proteome diversity in complex organisms. Different AS transcript isoforms may be translated into peptide sequences of significantly different lengths and amino acid compositions. One important question, then, is how AS is constrained by protein structural requirements while peptide sequences may be significantly changed in AS events. Here, we address this issue by examining whether the intactness of three-dimensional protein structural units (compact units in protein structures, namely protein units [PUs]) tends to be preserved in AS events in human. We show that PUs tend to occur in constitutively spliced exons and to overlap constitutive exon boundaries. Furthermore, when PUs are located at the boundaries between two alternatively spliced exons (ASEs), these neighboring ASEs tend to co-occur in different transcript isoforms. In addition, such PU-spanned ASE pairs tend to have a higher frequency of being included in transcript isoforms. ASE regions that overlap with PUs also have lower nonsynonymous-to-synonymous substitution rate ratios than those that do not overlap with PUs, indicating stronger negative selection pressure in PU-overlapped ASE regions. Of note, we show that PUs have protein domain- and structural orderness-independent effects on messenger RNA (mRNA) splicing. Overall, our results suggest that fine-scale protein structural requirements have significant influences on the splicing patterns of human mRNAs.
Collapse
Affiliation(s)
- Jean-Christophe Gelly
- INSERM, UMR-S 665, Dynamique des Structures et Interactions des Macromolécules Biologiques, Paris, France
| | | | | | | | | |
Collapse
|