1
|
Schweke H, Pacesa M, Levin T, Goverde CA, Kumar P, Duhoo Y, Dornfeld LJ, Dubreuil B, Georgeon S, Ovchinnikov S, Woolfson DN, Correia BE, Dey S, Levy ED. An atlas of protein homo-oligomerization across domains of life. Cell 2024; 187:999-1010.e15. [PMID: 38325366 DOI: 10.1016/j.cell.2024.01.022] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2023] [Revised: 11/03/2023] [Accepted: 01/15/2024] [Indexed: 02/09/2024]
Abstract
Protein structures are essential to understanding cellular processes in molecular detail. While advances in artificial intelligence revealed the tertiary structure of proteins at scale, their quaternary structure remains mostly unknown. We devise a scalable strategy based on AlphaFold2 to predict homo-oligomeric assemblies across four proteomes spanning the tree of life. Our results suggest that approximately 45% of an archaeal proteome and a bacterial proteome and 20% of two eukaryotic proteomes form homomers. Our predictions accurately capture protein homo-oligomerization, recapitulate megadalton complexes, and unveil hundreds of homo-oligomer types, including three confirmed experimentally by structure determination. Integrating these datasets with omics information suggests that a majority of known protein complexes are symmetric. Finally, these datasets provide a structural context for interpreting disease mutations and reveal coiled-coil regions as major enablers of quaternary structure evolution in human. Our strategy is applicable to any organism and provides a comprehensive view of homo-oligomerization in proteomes.
Collapse
Affiliation(s)
- Hugo Schweke
- Department of Chemical and Structural Biology, Weizmann Institute of Science, Rehovot, Israel
| | - Martin Pacesa
- Laboratory of Protein Design and Immunoengineering, École Polytechnique Fédérale de Lausanne and Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Tal Levin
- Department of Chemical and Structural Biology, Weizmann Institute of Science, Rehovot, Israel
| | - Casper A Goverde
- Laboratory of Protein Design and Immunoengineering, École Polytechnique Fédérale de Lausanne and Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Prasun Kumar
- School of Chemistry, University of Bristol, Bristol BS8 1TS, UK; School of Biochemistry, University of Bristol, Bristol BS8 1TD, UK; Bristol BioDesign Institute, University of Bristol, Life Sciences Building, Bristol BS8 1TQ, UK; Max Planck-Bristol Centre for Minimal Biology, University of Bristol, Cantock's Close, Bristol BS8 1TS, UK
| | - Yoan Duhoo
- Protein Production and Structure Characterization Core Facility (PTPSP), School of Life Sciences, École polytechnique Fédérale de Lausanne, Lausanne, Switzerland
| | - Lars J Dornfeld
- Laboratory of Protein Design and Immunoengineering, École Polytechnique Fédérale de Lausanne and Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Benjamin Dubreuil
- Department of Chemical and Structural Biology, Weizmann Institute of Science, Rehovot, Israel
| | - Sandrine Georgeon
- Laboratory of Protein Design and Immunoengineering, École Polytechnique Fédérale de Lausanne and Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Sergey Ovchinnikov
- John Harvard Distinguished Science Fellowship Program, Harvard University, Cambridge, MA, USA
| | - Derek N Woolfson
- School of Chemistry, University of Bristol, Bristol BS8 1TS, UK; School of Biochemistry, University of Bristol, Bristol BS8 1TD, UK; Bristol BioDesign Institute, University of Bristol, Life Sciences Building, Bristol BS8 1TQ, UK; Max Planck-Bristol Centre for Minimal Biology, University of Bristol, Cantock's Close, Bristol BS8 1TS, UK.
| | - Bruno E Correia
- Laboratory of Protein Design and Immunoengineering, École Polytechnique Fédérale de Lausanne and Swiss Institute of Bioinformatics, Lausanne, Switzerland.
| | - Sucharita Dey
- Department of Bioscience and Bioengineering, Indian Institute of Technology Jodhpur, Rajasthan, India.
| | - Emmanuel D Levy
- Department of Chemical and Structural Biology, Weizmann Institute of Science, Rehovot, Israel.
| |
Collapse
|
2
|
Coqueiro A, Fernandes DC, Danuello A, Regasini LO, Cardoso-Lopes EM, Young MCM, Brandão Torres LM, Campos VP, Silva DHS, da Silva Bolzani V, de Oliveira DF. Nematostatic activity of isoprenylated guanidine alkaloids from Pterogyne nitens and their interaction with acetylcholinesterase. Exp Parasitol 2023; 250:108542. [PMID: 37178971 DOI: 10.1016/j.exppara.2023.108542] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2023] [Revised: 04/03/2023] [Accepted: 05/03/2023] [Indexed: 05/15/2023]
Abstract
Although new nematicides have appeared, the demand for new products less toxic and more efficient for the control of plant-parasitic nematodes are still high. Consequently, studies on natural secondary metabolites from plants, to develop new nematicides, have increased. In this work, nineteen extracts from eleven Brazilian plant species were screened for activity against Meloidogyne incognita. Among them, the extracts of Piterogyne nitens showed a potent nematostatic activity. The alkaloid fraction obtained from the ethanol extract of leaves of P. nitens was more active than the coming extract. Due to the promising activity from the alkaloid fraction, three isoprenylated guanidine alkaloids isolated from this fraction, galegine (1), pterogynidine (2), and pterogynine (3) were tested, showing similar activity to the alkaloid fraction, which was comparable to that of the positive control Temik at 250 μg/mL. At lower concentrations (125-50 μg/mL), compound 2 showed to be the most active one. As several nematicides act through inhibition of acetylcholinesterase (AChE), the guanidine alkaloids were also employed in two in vitro AChE assays. In both cases, compound 2 was more active than compounds 1 and 3. Its activity was considered moderated compared to the control (physostigmine). Compound 2 was selected for an in silico study with the electric eel (Electrophorus electricus) AChE, showing to bind mostly to the same site of physostigmine in the AChEs, pointing out that this could be the mechanism of action for this compound. These results suggested that the guanidine alkaloids 1,2 and 3 from P. nitens are promising for the development of new products to control M. incognita, especially guanidine 2, and encourage new investigations to confirm the mechanism of action, as well as to determine the structure-activity relationship of the guanidine alkaloids.
Collapse
Affiliation(s)
- Aline Coqueiro
- Nuclei of Bioassays, Biosynthesis and Ecophysiology of Natural Products (NuBBE), Department of Organic Chemistry, Institute of Chemistry, São Paulo State University (UNESP), Araraquara, SP, 14801-970, Brazil; Department of Chemistry, Federal University of Technology - Paraná (UTFPR), Ponta Grossa, PR, 84017-220, Brazil.
| | - Daniara Cristina Fernandes
- Nuclei of Bioassays, Biosynthesis and Ecophysiology of Natural Products (NuBBE), Department of Organic Chemistry, Institute of Chemistry, São Paulo State University (UNESP), Araraquara, SP, 14801-970, Brazil; Federal Institute of Education, Science and Technology of São Paulo (IFSP), Matão, SP, 15991-502, Brazil
| | - Amanda Danuello
- Nuclei of Bioassays, Biosynthesis and Ecophysiology of Natural Products (NuBBE), Department of Organic Chemistry, Institute of Chemistry, São Paulo State University (UNESP), Araraquara, SP, 14801-970, Brazil; Institute of Chemistry, Federal University of Uberlândia (UFU), Uberlândia, MG, 38408-100, Brazil
| | - Luis Octávio Regasini
- Nuclei of Bioassays, Biosynthesis and Ecophysiology of Natural Products (NuBBE), Department of Organic Chemistry, Institute of Chemistry, São Paulo State University (UNESP), Araraquara, SP, 14801-970, Brazil
| | | | | | | | - Vicente Paulo Campos
- Department of Phytopathology, Federal University of Lavras (UFLA), Lavras, MG, 37200-000, Brazil
| | - Dulce Helena Siqueira Silva
- Nuclei of Bioassays, Biosynthesis and Ecophysiology of Natural Products (NuBBE), Department of Organic Chemistry, Institute of Chemistry, São Paulo State University (UNESP), Araraquara, SP, 14801-970, Brazil
| | - Vanderlan da Silva Bolzani
- Nuclei of Bioassays, Biosynthesis and Ecophysiology of Natural Products (NuBBE), Department of Organic Chemistry, Institute of Chemistry, São Paulo State University (UNESP), Araraquara, SP, 14801-970, Brazil
| | | |
Collapse
|
3
|
López-Blanco JR, Dehouck Y, Bastolla U, Chacón P. Local Normal Mode Analysis for Fast Loop Conformational Sampling. J Chem Inf Model 2022; 62:4561-4568. [PMID: 36099639 PMCID: PMC9516680 DOI: 10.1021/acs.jcim.2c00870] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
![]()
We propose and validate
a novel method to efficiently
explore local
protein loop conformations based on a new formalism for constrained
normal mode analysis (NMA) in internal coordinates. The manifold of
possible loop configurations imposed by the position and orientation
of the fixed loop ends is reduced to an orthogonal set of motions
(or modes) encoding concerted rotations of all the backbone dihedral
angles. We validate the sampling power on a set of protein loops with
highly variable experimental structures and demonstrate that our approach
can efficiently explore the conformational space of closed loops.
We also show an acceptable resemblance of the ensembles around equilibrium
conformations generated by long molecular simulations and constrained
NMA on a set of exposed and diverse loops. In comparison with other
methods, the main advantage is the lack of restrictions on the number
of dihedrals that can be altered simultaneously. Furthermore, the
method is computationally efficient since it only requires the diagonalization
of a tiny matrix, and the modes of motions are energetically contextualized
by the elastic network model, which includes both the loop and the
neighboring residues.
Collapse
Affiliation(s)
- José Ramón López-Blanco
- Department of Biological Physical Chemistry, Rocasolano Institute of Physical Chemistry, CSIC, Serrano 119, 28006 Madrid, Spain
| | - Yves Dehouck
- Centro de Biología Molecular "Severo Ochoa," CSIC-UAM, Cantoblanco, 28049 Madrid, Spain
| | - Ugo Bastolla
- Centro de Biología Molecular "Severo Ochoa," CSIC-UAM, Cantoblanco, 28049 Madrid, Spain
| | - Pablo Chacón
- Department of Biological Physical Chemistry, Rocasolano Institute of Physical Chemistry, CSIC, Serrano 119, 28006 Madrid, Spain
| |
Collapse
|
4
|
Montiel-Garcia D, Rojas-Labra O, Santoyo-Rivera N, Reddy VS. Epitope-Analyzer: A structure-based webtool to analyze broadly neutralizing epitopes. J Struct Biol 2022; 214:107839. [PMID: 35134530 PMCID: PMC8829422 DOI: 10.1016/j.jsb.2022.107839] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2021] [Revised: 01/27/2022] [Accepted: 02/01/2022] [Indexed: 11/23/2022]
Abstract
The antigenic epitope regions of pathogens (e.g., viruses) are recognized by antibodies (Abs) and subsequently cleared by the host immune system, thereby protecting us from disease. Some of these epitopes are conserved among different variants or subgroups of pathogens (e.g., Influenza (FLU) viruses, Coronaviruses), hence can be targeted for potential broad-neutralization. Here we report a web-based tool, Epitope Analyzer (EA), that rapidly identifies conformational epitope and paratope residues in an antigen-antibody complex structure. Furthermore, the tool provides the ways and means to analyze broadly neutralizing epitopes by comparing the equivalent epitope residues in similar antigen structures. The similarity in the epitope residues between (multiple) pairs of similar antigen molecules suggest the presence of conserved epitopes that can be targeted by broadly neutralizing antibodies. These details can be used as a guide in developing effective treatments, such as the design of novel vaccines and formulation of cocktail of broadly neutralizing antibodies, against multiple variants or subgroups of viruses. The web application can be freely accessed from the URL, http://viperdb.scripps.edu/ea.php.
Collapse
Affiliation(s)
- Daniel Montiel-Garcia
- Department of Integrative Structural and Computational Biology, The Scripps Research Institute, La Jolla, CA 92037, USA.
| | - Oscar Rojas-Labra
- Departments of Computer Systems and Information Technologies, Tecnologico Nacional de Mexico & Instituto Tecnológico Superior de Irapuato, Irapuato, Guanajuato, Mexico
| | - Nelly Santoyo-Rivera
- Departments of Computer Systems and Information Technologies, Tecnologico Nacional de Mexico & Instituto Tecnológico Superior de Irapuato, Irapuato, Guanajuato, Mexico
| | - Vijay S Reddy
- Department of Integrative Structural and Computational Biology, The Scripps Research Institute, La Jolla, CA 92037, USA.
| |
Collapse
|
5
|
Kadukova M, Machado KDS, Chacón P, Grudinin S. KORP-PL: a coarse-grained knowledge-based scoring function for protein-ligand interactions. Bioinformatics 2021; 37:943-950. [PMID: 32840574 DOI: 10.1093/bioinformatics/btaa748] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2020] [Revised: 07/27/2020] [Accepted: 08/18/2020] [Indexed: 11/12/2022] Open
Abstract
MOTIVATION Despite the progress made in studying protein-ligand interactions and the widespread application of docking and affinity prediction tools, improving their precision and efficiency still remains a challenge. Computational approaches based on the scoring of docking conformations with statistical potentials constitute a popular alternative to more accurate but costly physics-based thermodynamic sampling methods. In this context, a minimalist and fast sidechain-free knowledge-based potential with a high docking and screening power can be very useful when screening a big number of putative docking conformations. RESULTS Here, we present a novel coarse-grained potential defined by a 3D joint probability distribution function that only depends on the pairwise orientation and position between protein backbone and ligand atoms. Despite its extreme simplicity, our approach yields very competitive results with the state-of-the-art scoring functions, especially in docking and screening tasks. For example, we observed a twofold improvement in the median 5% enrichment factor on the DUD-E benchmark compared to Autodock Vina results. Moreover, our results prove that a coarse sidechain-free potential is sufficient for a very successful docking pose prediction. AVAILABILITYAND IMPLEMENTATION The standalone version of KORP-PL with the corresponding tests and benchmarks are available at https://team.inria.fr/nano-d/korp-pl/ and https://chaconlab.org/modeling/korp-pl. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Maria Kadukova
- Univ. Grenoble Alpes, CNRS, Inria, Grenoble INP, LJK, 38000 Grenoble, France.,Research Center for Molecular Mechanisms of Aging and Age-Related Diseases, Moscow Institute of Physics and Technology, 141701 Dolgoprudniy, Russia
| | - Karina Dos Santos Machado
- Univ. Grenoble Alpes, CNRS, Inria, Grenoble INP, LJK, 38000 Grenoble, France.,Computational Biology Laboratory, Centro de Ciências Computacionais, Universidade Federal do Rio Grande - FURG, Rio Grande, RS 96201-090, Brazil
| | - Pablo Chacón
- Department of Biological Physical Chemistry, Rocasolano Institute of Physical Chemistry C.S.I.C, Madrid 28006, Spain
| | - Sergei Grudinin
- Univ. Grenoble Alpes, CNRS, Inria, Grenoble INP, LJK, 38000 Grenoble, France
| |
Collapse
|
6
|
Zhao Z, Bourne PE. Structural Insights into the Binding Modes of Viral RNA-Dependent RNA Polymerases Using a Function-Site Interaction Fingerprint Method for RNA Virus Drug Discovery. J Proteome Res 2020; 19:4698-4705. [PMID: 32946692 PMCID: PMC7640976 DOI: 10.1021/acs.jproteome.0c00623] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2020] [Indexed: 01/18/2023]
Abstract
The coronavirus disease of 2019 (COVID-19) pandemic speaks to the need for drugs that not only are effective but also remain effective given the mutation rate of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). To this end, we describe structural binding-site insights for facilitating COVID-19 drug design when targeting RNA-dependent RNA polymerase (RDRP), a common conserved component of RNA viruses. We combined an RDRP structure data set, including 384 RDRP PDB structures and all corresponding RDRP-ligand interaction fingerprints, thereby revealing the structural characteristics of the active sites for application to RDRP-targeted drug discovery. Specifically, we revealed the intrinsic ligand-binding modes and associated RDRP structural characteristics. Four types of binding modes with corresponding binding pockets were determined, suggesting two major subpockets available for drug discovery. We screened a drug data set of 7894 compounds against these binding pockets and presented the top-10 small molecules as a starting point in further exploring the prevention of virus replication. In summary, the binding characteristics determined here help rationalize RDRP-targeted drug discovery and provide insights into the specific binding mechanisms important for containing the SARS-CoV-2 virus.
Collapse
Affiliation(s)
- Zheng Zhao
- School
of Data Science, University of Virginia, Charlottesville, Virginia 22904, United States of America
- Department
of Biomedical Engineering, University of
Virginia, Charlottesville, Virginia 22904, United States of America
| | - Philip E. Bourne
- School
of Data Science, University of Virginia, Charlottesville, Virginia 22904, United States of America
- Department
of Biomedical Engineering, University of
Virginia, Charlottesville, Virginia 22904, United States of America
| |
Collapse
|
7
|
Simsir M, Broutin I, Mus-Veteau I, Cazals F. Studying dynamics without explicit dynamics: A structure-based study of the export mechanism by AcrB. Proteins 2020; 89:259-275. [PMID: 32960482 DOI: 10.1002/prot.26012] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2020] [Revised: 07/25/2020] [Accepted: 09/13/2020] [Indexed: 12/14/2022]
Abstract
Resistance-nodulation-cell division family proteins are transmembrane proteins identified as large spectrum drug transporters involved in multidrug resistance. A prototypical case in this superfamily, responsible for antibiotic resistance in selected gram-negative bacteria, is AcrB. AcrB forms a trimer using the proton motive force to efflux drugs, implementing a functional rotation mechanism. Unfortunately, the size of the system (1049 amino acid per monomer and membrane) has prevented a systematic dynamical exploration, so that the mild understanding of this coupled transport jeopardizes our ability to counter it. The large number of crystal structures of AcrB prompts studies to further our understanding of the mechanism. To this end, we present a novel strategy based on two key ingredients, which are to study dynamics by exploiting information embodied in the numerous crystal structures obtained to date, and to systematically consider subdomains, their dynamics, and their interactions. Along the way, we identify the subdomains responsible for dynamic events, refine the states (A, B, E) of the functional rotation mechanism, and analyze the evolution of intramonomer and intermonomer interfaces along the functional cycle. Our analysis shows the relevance of AcrB's efflux mechanism as a template within the HAE1 family but not beyond. It also paves the way to targeted simulations exploiting the most relevant degrees of freedom at certain steps, and to a targeting of specific interfaces to block the drug efflux. Our work shows that complex dynamics can be unveiled from static snapshots, a strategy that may be used on a variety of molecular machines of large size.
Collapse
|
8
|
Wen Z, He J, Huang SY. Topology-independent and global protein structure alignment through an FFT-based algorithm. Bioinformatics 2020; 36:478-486. [PMID: 31384919 DOI: 10.1093/bioinformatics/btz609] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2019] [Revised: 07/22/2019] [Accepted: 08/02/2019] [Indexed: 12/12/2022] Open
Abstract
MOTIVATION Protein structure alignment is one of the fundamental problems in computational structure biology. A variety of algorithms have been developed to address this important issue in the past decade. However, due to their heuristic nature, current structure alignment methods may suffer from suboptimal alignment and/or over-fragmentation and thus lead to a biologically wrong alignment in some cases. To overcome these limitations, we have developed an accurate topology-independent and global structure alignment method through an FFT-based exhaustive search algorithm, which is referred to as FTAlign. RESULTS Our FTAlign algorithm was extensively tested on six commonly used datasets and compared with seven state-of-the-art structure alignment approaches, TMalign, DeepAlign, Kpax, 3DCOMB, MICAN, SPalignNS and CLICK. It was shown that FTAlign outperformed the other methods in reproducing manually curated alignments and obtained a high success rate of 96.7 and 90.0% on two gold-standard benchmarks, MALIDUP and MALISAM, respectively. Moreover, FTAlign also achieved the overall best performance in terms of biologically meaningful structure overlap (SO) and TMscore on both the sequential alignment test sets including MALIDUP, MALISAM and 64 difficult cases from HOMSTRAD, and the non-sequential sets including MALIDUP-NS, MALISAM-NS, 199 topology-different cases, where FTAlign especially showed more advantage for non-sequential alignment. Despite its global search feature, FTAlign is also computationally efficient and can normally complete a pairwise alignment within one second. AVAILABILITY AND IMPLEMENTATION http://huanglab.phys.hust.edu.cn/ftalign/.
Collapse
Affiliation(s)
- Zeyu Wen
- School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei 430074, People's Republic of China
| | - Jiahua He
- School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei 430074, People's Republic of China
| | - Sheng-You Huang
- School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei 430074, People's Republic of China
| |
Collapse
|
9
|
Carpentier M, Chomilier J. Protein multiple alignments: sequence-based versus structure-based programs. Bioinformatics 2020; 35:3970-3980. [PMID: 30942864 DOI: 10.1093/bioinformatics/btz236] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2018] [Revised: 03/05/2019] [Accepted: 04/02/2019] [Indexed: 11/14/2022] Open
Abstract
MOTIVATION Multiple sequence alignment programs have proved to be very useful and have already been evaluated in the literature yet not alignment programs based on structure or both sequence and structure. In the present article we wish to evaluate the added value provided through considering structures. RESULTS We compared the multiple alignments resulting from 25 programs either based on sequence, structure or both, to reference alignments deposited in five databases (BALIBASE 2 and 3, HOMSTRAD, OXBENCH and SISYPHUS). On the whole, the structure-based methods compute more reliable alignments than the sequence-based ones, and even than the sequence+structure-based programs whatever the databases. Two programs lead, MAMMOTH and MATRAS, nevertheless the performances of MUSTANG, MATT, 3DCOMB, TCOFFEE+TM_ALIGN and TCOFFEE+SAP are better for some alignments. The advantage of structure-based methods increases at low levels of sequence identity, or for residues in regular secondary structures or buried ones. Concerning gap management, sequence-based programs set less gaps than structure-based programs. Concerning the databases, the alignments of the manually built databases are more challenging for the programs. AVAILABILITY AND IMPLEMENTATION All data and results presented in this study are available at: http://wwwabi.snv.jussieu.fr/people/mathilde/download/AliMulComp/. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Mathilde Carpentier
- Institut Systématique Evolution Biodiversité (ISYEB), Sorbonne Université, MNHN, CNRS, EPHE, Paris, France
| | - Jacques Chomilier
- Sorbonne Université, MNHN, CNRS, IRD, Institut de Minéralogie, de Physique des Matériaux et de Cosmochimie (IMPMC), BiBiP, Paris, France
| |
Collapse
|
10
|
Andreani J, Quignot C, Guerois R. Structural prediction of protein interactions and docking using conservation and coevolution. WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL MOLECULAR SCIENCE 2020. [DOI: 10.1002/wcms.1470] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Affiliation(s)
- Jessica Andreani
- Université Paris‐Saclay CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC) Gif‐sur‐Yvette France
| | - Chloé Quignot
- Université Paris‐Saclay CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC) Gif‐sur‐Yvette France
| | - Raphael Guerois
- Université Paris‐Saclay CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC) Gif‐sur‐Yvette France
| |
Collapse
|
11
|
John A, Vetrivel U, Sivashanmugam M, Natarajan SK. Microsecond Simulation of the Proteoglycan-like Region of Carbonic Anhydrase IX and Design of Chemical Inhibitors Targeting pH Homeostasis in Cancer Cells. ACS OMEGA 2020; 5:4270-4281. [PMID: 32149257 PMCID: PMC7057697 DOI: 10.1021/acsomega.9b04203] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/09/2019] [Accepted: 01/08/2020] [Indexed: 05/09/2023]
Abstract
Carbonic anhydrase IX (CAIX) is a membrane-bound enzyme associated with tumor hypoxia and found to be over expressed in various tumor conditions. Targeting CAIX catalytic activity is proven to be efficient modality in modulating pH homeostasis in cancer cells. Proteoglycan-like (PG) region is unique to CAIX and is proposed to serve as an antenna enhancing the export of protons in conjunction with facilitated efflux of lactate ions via monocarboxylate transporters. Moreover, the PG region is also reported to contribute to the assembly and maturation of focal adhesion links during cellular attachment and dispersion on solid supports. Thus, drug targeting of this region shall efficiently modulate pH homeostasis and cell adhesion in cancer cells. As the PG region is intrinsically disordered, the complete crystal structure is not elucidated. Hence, in this study, we intend to sample the conformational landscape of the PG region at microsecond scale simulation in order to sample the most probable conformations that shall be utilized for structure-based drug design. In addition, the sampled conformations were subjected to high-throughput virtual screening against NCI and Maybridge datasets to identify potential hits based on consensus scoring and validation by molecular dynamics simulation. Further, the identified hits were experimentally validated for efficacy by in vitro and direct enzymatic assays. The results reveal 5-(2-aminoethyl)-1,2,3-benzenetriol to be the most promising hit as it showed significant CAIX inhibition at all levels of in silico and experimental validation.
Collapse
Affiliation(s)
- Arun John
- Centre
for Bioinformatics, Kamalnayan Bajaj Institute for Research in Vision
and Ophthalmology, Vision Research Foundation,
Sankara Nethralaya, Chennai 600 006, Tamil Nadu, India
- School
of Chemical and Biotechnology, SASTRA Deemed
University, Thanjavur, Tamil Nadu, India
| | - Umashankar Vetrivel
- Centre
for Bioinformatics, Kamalnayan Bajaj Institute for Research in Vision
and Ophthalmology, Vision Research Foundation,
Sankara Nethralaya, Chennai 600 006, Tamil Nadu, India
- E-mail: . Phone: +91-44-28271616. Fax: +91-44-28254180
| | - Muthukumaran Sivashanmugam
- Centre
for Bioinformatics, Kamalnayan Bajaj Institute for Research in Vision
and Ophthalmology, Vision Research Foundation,
Sankara Nethralaya, Chennai 600 006, Tamil Nadu, India
| | - Sulochana Konerirajapuram Natarajan
- R.S.
Mehta Jain Department of Biochemistry and Cell Biology, Kamalnayan
Bajaj Institute for Research in Vision and Ophthalmology, Vision Research Foundation, Sankara Nethralaya, Chennai 600 006, Tamil Nadu, India
| |
Collapse
|
12
|
Geometric description of self-interaction potential in symmetric protein complexes. Sci Data 2019; 6:64. [PMID: 31101822 PMCID: PMC6525250 DOI: 10.1038/s41597-019-0058-x] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2018] [Accepted: 03/29/2019] [Indexed: 01/23/2023] Open
Abstract
Proteins can self-associate with copies of themselves to form symmetric complexes called homomers. Homomers are widespread in all kingdoms of life and allow for unique geometric and functional properties, as reflected in viral capsids or allostery. Once a protein forms a homomer, however, its internal symmetry can compound the effect of point mutations and trigger uncontrolled self-assembly into high-order structures. We identified mutation hot spots for supramolecular assembly, which are predictable by geometry. Here, we present a dataset of descriptors that characterize these hot spot positions both geometrically and chemically, as well as computer scripts allowing the calculation and visualization of these properties for homomers of choice. Since the biological relevance of homomers is not readily available from their X-ray crystallographic structure, we also provide reliability estimates obtained by methods we recently developed. These data have implications in the study of disease-causing mutations, protein evolution and can be exploited in the design of biomaterials. Design Type(s) | protein interaction analysis objective • protein structure prediction objective • modeling and simulation objective | Measurement Type(s) | protein complex | Technology Type(s) | computational modeling technique | Factor Type(s) | Filtering • source | Sample Characteristic(s) | laboratory environment |
Machine-accessible metadata file describing the reported data (ISA-Tab format)
Collapse
|
13
|
Identification of Novel Interaction Partners of Ets-1: Focus on DNA Repair. Genes (Basel) 2019; 10:genes10030206. [PMID: 30857266 PMCID: PMC6470857 DOI: 10.3390/genes10030206] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2019] [Revised: 03/04/2019] [Accepted: 03/05/2019] [Indexed: 12/03/2022] Open
Abstract
The transcription factor Ets-1 (ETS proto-oncogene 1) shows low expression levels except in specific biological processes like haematopoiesis or angiogenesis. Elevated levels of expression are observed in tumor progression, resulting in Ets-1 being named an oncoprotein. It has recently been shown that Ets-1 interacts with two DNA repair enzymes, PARP-1 (poly(ADP-ribose) polymerase 1) and DNA-PK (DNA-dependent protein kinase), through two different domains and that these interactions play a role in cancer. Considering that Ets-1 can bind to distinctly different domains of two DNA repair enzymes, we hypothesized that the interaction can be transposed onto homologs of the respective domains. We have searched for sequence and structure homologs of the interacting ETS(Ets-1), BRCT(PARP-1) and SAP(DNA-PK) domains, and have identified several candidate binding pairs that are currently not annotated as such. Many of the Ets-1 partners are associated to DNA repair mechanisms. We have applied protein-protein docking to establish putative interaction poses and investigated these using centrality analyses at the protein residue level. Most of the identified poses are virtually similar to our recently established interaction model for Ets-1/PARP-1 and Ets-1/DNA-PK. Our work illustrates the potentially high number of interactors of Ets-1, in particular involved in DNA repair mechanisms, which shows the oncoprotein as a potential important regulator of the mechanism.
Collapse
|
14
|
Pagès G, Charmettant B, Grudinin S. Protein model quality assessment using 3D oriented convolutional neural networks. Bioinformatics 2019; 35:3313-3319. [DOI: 10.1093/bioinformatics/btz122] [Citation(s) in RCA: 50] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2018] [Revised: 01/17/2019] [Accepted: 02/13/2019] [Indexed: 11/13/2022] Open
Abstract
Abstract
Motivation
Protein model quality assessment (QA) is a crucial and yet open problem in structural bioinformatics. The current best methods for single-model QA typically combine results from different approaches, each based on different input features constructed by experts in the field. Then, the prediction model is trained using a machine-learning algorithm. Recently, with the development of convolutional neural networks (CNN), the training paradigm has changed. In computer vision, the expert-developed features have been significantly overpassed by automatically trained convolutional filters. This motivated us to apply a three-dimensional (3D) CNN to the problem of protein model QA.
Results
We developed Ornate (Oriented Routed Neural network with Automatic Typing)—a novel method for single-model QA. Ornate is a residue-wise scoring function that takes as input 3D density maps. It predicts the local (residue-wise) and the global model quality through a deep 3D CNN. Specifically, Ornate aligns the input density map, corresponding to each residue and its neighborhood, with the backbone topology of this residue. This circumvents the problem of ambiguous orientations of the initial models. Also, Ornate includes automatic identification of atom types and dynamic routing of the data in the network. Established benchmarks (CASP 11 and CASP 12) demonstrate the state-of-the-art performance of our approach among single-model QA methods.
Availability and implementation
The method is available at https://team.inria.fr/nano-d/software/Ornate/. It consists of a C++ executable that transforms molecular structures into volumetric density maps, and a Python code based on the TensorFlow framework for applying the Ornate model to these maps.
Supplementary information
Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Guillaume Pagès
- Université Grenoble Alpes, Inria, CNRS, Grenoble INP, LJK, Grenoble, France
| | - Benoit Charmettant
- Université Grenoble Alpes, Inria, CNRS, Grenoble INP, LJK, Grenoble, France
| | - Sergei Grudinin
- Université Grenoble Alpes, Inria, CNRS, Grenoble INP, LJK, Grenoble, France
| |
Collapse
|
15
|
Cazals F, Tetley R. Characterizing molecular flexibility by combining least root mean square deviation measures. Proteins 2019; 87:380-389. [DOI: 10.1002/prot.25658] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2018] [Revised: 12/20/2018] [Accepted: 01/09/2019] [Indexed: 12/29/2022]
Affiliation(s)
- Frédéric Cazals
- Inria (Algorithms‐Biology‐Structure), Université Côte d'Azur Sophia Antipolis France
| | - Romain Tetley
- Inria (Algorithms‐Biology‐Structure), Université Côte d'Azur Sophia Antipolis France
| |
Collapse
|
16
|
Krieger C, Roselli S, Kellner-Thielmann S, Galati G, Schneider B, Grosjean J, Olry A, Ritchie D, Matern U, Bourgaud F, Hehn A. The CYP71AZ P450 Subfamily: A Driving Factor for the Diversification of Coumarin Biosynthesis in Apiaceous Plants. FRONTIERS IN PLANT SCIENCE 2018; 9:820. [PMID: 29971079 PMCID: PMC6018538 DOI: 10.3389/fpls.2018.00820] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/19/2018] [Accepted: 05/28/2018] [Indexed: 05/19/2023]
Abstract
The production of coumarins and furanocoumarins (FCs) in higher plants is widely considered a model illustration of the adaptation of plants to their environment. In this report, we show that the multiplication of cytochrome P450 variants within the CYP71AZ subfamily has contributed to the diversification of these molecules. Multiple copies of genes encoding this enzyme family are found in Apiaceae, and their phylogenetic analysis suggests that they have different functions within these plants. CYP71AZ1 from Ammi majus and CYP71AZ3, 4, and 6 from Pastinaca sativa were functionally characterized. While CYP71AZ3 merely hydroxylated esculetin, the other enzymes accepted both simple coumarins and FCs. Superimposing in silico models of these enzymes led to the identification of different conformations of three regions in the enzyme active site. These sequences were subsequently utilized to mutate CYP71AZ4 to resemble CYP71AZ3. The swapping of these regions lead to significantly modified substrate specificity. Simultaneous mutations of all three regions shifted the specificity of CYP71AZ4 to that of CYP71AZ3, exclusively accepting esculetin. This approach may explain the evolution of this cytochrome P450 family regarding the appearance of FCs in parsnip and possibly in the Apiaceae.
Collapse
Affiliation(s)
- Célia Krieger
- Laboratoire Agronomie et Environnement, Institut National de la Recherche Agronomique, Université de Lorraine, Nancy, France
| | - Sandro Roselli
- Laboratoire Agronomie et Environnement, Institut National de la Recherche Agronomique, Université de Lorraine, Nancy, France
| | - Sandra Kellner-Thielmann
- Institut für Pharmazeutische Biologie und Biotechnologie, Philipps-Universität Marburg, Marburg, Germany
| | - Gianni Galati
- Laboratoire Agronomie et Environnement, Institut National de la Recherche Agronomique, Université de Lorraine, Nancy, France
| | | | - Jérémy Grosjean
- Laboratoire Agronomie et Environnement, Institut National de la Recherche Agronomique, Université de Lorraine, Nancy, France
| | - Alexandre Olry
- Laboratoire Agronomie et Environnement, Institut National de la Recherche Agronomique, Université de Lorraine, Nancy, France
| | - David Ritchie
- INRIA Nancy, Grand-Est Research Centre, Laboratoire Lorrain De Recherche En Informatique Et Ses Applications, Nancy, France
| | - Ulrich Matern
- Institut für Pharmazeutische Biologie und Biotechnologie, Philipps-Universität Marburg, Marburg, Germany
| | | | - Alain Hehn
- Laboratoire Agronomie et Environnement, Institut National de la Recherche Agronomique, Université de Lorraine, Nancy, France
- *Correspondence: Alain Hehn,
| |
Collapse
|
17
|
Dey S, Levy ED. Inferring and Using Protein Quaternary Structure Information from Crystallographic Data. Methods Mol Biol 2018; 1764:357-375. [PMID: 29605927 DOI: 10.1007/978-1-4939-7759-8_23] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]
Abstract
A precise knowledge of the quaternary structure of proteins is essential to illuminate both their function and their evolution. The major part of our knowledge on quaternary structure is inferred from X-ray crystallography data, but this inference process is hard and error-prone. The difficulty lies in discriminating fortuitous protein contacts, which make up the lattice of protein crystals, from biological protein contacts that exist in the native cellular environment. Here, we review methods devised to discriminate between both types of contacts and describe resources for downloading protein quaternary structure information and identifying high-confidence quaternary structures. The use of high-confidence datasets of quaternary structures will be critical for the analysis of structural, functional, and evolutionary properties of proteins.
Collapse
Affiliation(s)
- Sucharita Dey
- Department of Structural Biology, Weizmann Institute of Science, Rehovot, Israel
| | - Emmanuel D Levy
- Department of Structural Biology, Weizmann Institute of Science, Rehovot, Israel.
| |
Collapse
|
18
|
Abstract
Comparing and classifying protein domain interactions according to their three-dimensional (3D) structures can help to understand protein structure-function and evolutionary relationships. Additionally, structural knowledge of existing domain-domain interactions can provide a useful way to find structural templates with which to model the 3D structures of unsolved protein complexes. Here we present a straightforward guide to using the "Kbdock" protein domain structure database and its associated web site for exploring and comparing protein domain-domain interactions (DDIs) and domain-peptide interactions (DPIs) at the Pfam domain family level. We also briefly explain how the Kbdock web site works, and we provide some notes and suggestions which should help to avoid some common pitfalls when working with 3D protein domain structures.
Collapse
|
19
|
PDB-wide identification of biological assemblies from conserved quaternary structure geometry. Nat Methods 2017; 15:67-72. [DOI: 10.1038/nmeth.4510] [Citation(s) in RCA: 50] [Impact Index Per Article: 7.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2017] [Accepted: 10/17/2017] [Indexed: 02/07/2023]
|
20
|
Fotoohifiroozabadi S, Mohamad MS, Deris S. Samira-VP: A simple protein alignment method with rechecking the alphabet vector positions. J Bioinform Comput Biol 2017; 15:1750004. [PMID: 28274174 DOI: 10.1142/s0219720017500044] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
Protein structure alignment and comparisons that are based on an alphabetical demonstration of protein structure are more simple to run with faster evaluation processes; thus, their accuracy is not as reliable as three-dimension (3D)-based tools. As a 1D method candidate, TS-AMIR used the alphabetic demonstration of secondary-structure elements (SSE) of proteins and compared the assigned letters to each SSE using the [Formula: see text]-gram method. Although the results were comparable to those obtained via geometrical methods, the SSE length and accuracy of adjacency between SSEs were not considered in the comparison process. Therefore, to obtain further information on accuracy of adjacency between SSE vectors, the new approach of assigning text to vectors was adopted according to the spherical coordinate system in the present study. Moreover, dynamic programming was applied in order to account for the length of SSE vectors. Five common datasets were selected for method evaluation. The first three datasets were small, but difficult to align, and the remaining two datasets were used to compare the capability of the proposed method with that of other methods on a large protein dataset. The results showed that the proposed method, as a text-based alignment approach, obtained results comparable to both 1D and 3D methods. It outperformed 1D methods in terms of accuracy and 3D methods in terms of runtime.
Collapse
Affiliation(s)
- Samira Fotoohifiroozabadi
- 1 Artificial Intelligence and Bioinformatics Research Group, Faculty of Computing, Universiti Teknologi Malaysia, Skudai 81310 Johor, Malaysia
| | - Mohd Saberi Mohamad
- 1 Artificial Intelligence and Bioinformatics Research Group, Faculty of Computing, Universiti Teknologi Malaysia, Skudai 81310 Johor, Malaysia
| | - Safaai Deris
- 2 Faculty of Creative Technology & Heritage, Universiti Malaysia Kelantan, Locked Bag 01, 16300 Bachok, Kota Bharu, Kelantan, Malaysia
| |
Collapse
|
21
|
Roselli S, Olry A, Vautrin S, Coriton O, Ritchie D, Galati G, Navrot N, Krieger C, Vialart G, Bergès H, Bourgaud F, Hehn A. A bacterial artificial chromosome (BAC) genomic approach reveals partial clustering of the furanocoumarin pathway genes in parsnip. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2017; 89:1119-1132. [PMID: 27943460 DOI: 10.1111/tpj.13450] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/29/2016] [Revised: 12/02/2016] [Accepted: 12/06/2016] [Indexed: 05/06/2023]
Abstract
Furanocoumarins are specialized metabolites that are involved in the defense of plants against phytophagous insects. The molecular and functional characterization of the genes involved in their biosynthetic pathway is only partially complete. Many recent reports have described gene clusters responsible for the biosynthesis of specialized metabolites in plants. To investigate possible co-localization of the genes involved in the furanocoumarin pathway, we sequenced parsnip BAC clones spanning two different gene loci. We found that two genes previously identified in this pathway, CYP71AJ3 and CYP71AJ4, were located on the same BAC, whereas a third gene, PsPT1, belonged to a different BAC clone. Chromosome mapping using fluorescence in situ hybridization (FISH) indicated that PsPT1 and the CYP71AJ3-CYP71AJ4 clusters are located on two different chromosomes. Sequencing the BAC clone harboring PsPT1 led to the identification of a gene encoding an Fe(II) α-ketoglutarate-dependent dioxygenase (PsDIOX) situated in the neighborhood of PsPT1 and confirmed the occurrence of a second gene cluster involved in the furanocoumarin pathway. This enzyme metabolizes p-coumaroyl CoA, leading exclusively to the synthesis of umbelliferone, an important intermediate compound in furanocoumarin synthesis. This work provides an insight into the genomic organization of genes from the furanocoumarin biosynthesis pathway organized in more than one gene cluster. It also confirms that the screening of a genomic library and the sequencing of BAC clones represent a valuable tool to identify genes involved in biosynthetic pathways dedicated to specialized metabolite synthesis.
Collapse
Affiliation(s)
- Sandro Roselli
- Laboratoire Agronomie et Environnement, INRA UMR1121, 2 avenue de la forêt de Haye TSA 40602, 54518, Vandœuvre-lès-Nancy, France
- Laboratoire Agronomie et Environnement, Université de Lorraine UMR1121, 2 avenue de la forêt de Haye - TSA 40602, 54518, Vandœuvre-lès-Nancy, France
| | - Alexandre Olry
- Laboratoire Agronomie et Environnement, INRA UMR1121, 2 avenue de la forêt de Haye TSA 40602, 54518, Vandœuvre-lès-Nancy, France
- Laboratoire Agronomie et Environnement, Université de Lorraine UMR1121, 2 avenue de la forêt de Haye - TSA 40602, 54518, Vandœuvre-lès-Nancy, France
| | - Sonia Vautrin
- Centre National de Ressources Génomiques Végétales - INRA - 24 Chemin de Borde Rouge - Auzeville CS 52627, 31326, Castanet Tolosan Cedex, France
| | - Olivier Coriton
- Plate-Forme de Cytogénétique Moléculaire - UMR1349 IGEPP INRA - Agrocampus Ouest-Université de Rennes 1, 35653, Le Rheu, France
| | - Dave Ritchie
- INRIA Nancy Grand Est, 615 rue du Jardin Botanique, 54600, Villers-lès-Nancy, France
| | - Gianni Galati
- Laboratoire Agronomie et Environnement, INRA UMR1121, 2 avenue de la forêt de Haye TSA 40602, 54518, Vandœuvre-lès-Nancy, France
- Laboratoire Agronomie et Environnement, Université de Lorraine UMR1121, 2 avenue de la forêt de Haye - TSA 40602, 54518, Vandœuvre-lès-Nancy, France
| | - Nicolas Navrot
- Institut de biologie moléculaire des plantes - UPR2357 CNRS, Université de Strasbourg, 12 rue du Général Zimmer - 67084, Strasbourg Cedex, France
| | - Célia Krieger
- Laboratoire Agronomie et Environnement, INRA UMR1121, 2 avenue de la forêt de Haye TSA 40602, 54518, Vandœuvre-lès-Nancy, France
- Laboratoire Agronomie et Environnement, Université de Lorraine UMR1121, 2 avenue de la forêt de Haye - TSA 40602, 54518, Vandœuvre-lès-Nancy, France
| | - Guilhem Vialart
- Laboratoire Agronomie et Environnement, INRA UMR1121, 2 avenue de la forêt de Haye TSA 40602, 54518, Vandœuvre-lès-Nancy, France
- Laboratoire Agronomie et Environnement, Université de Lorraine UMR1121, 2 avenue de la forêt de Haye - TSA 40602, 54518, Vandœuvre-lès-Nancy, France
| | - Hélène Bergès
- Centre National de Ressources Génomiques Végétales - INRA - 24 Chemin de Borde Rouge - Auzeville CS 52627, 31326, Castanet Tolosan Cedex, France
| | - Frédéric Bourgaud
- Laboratoire Agronomie et Environnement, INRA UMR1121, 2 avenue de la forêt de Haye TSA 40602, 54518, Vandœuvre-lès-Nancy, France
- Laboratoire Agronomie et Environnement, Université de Lorraine UMR1121, 2 avenue de la forêt de Haye - TSA 40602, 54518, Vandœuvre-lès-Nancy, France
| | - Alain Hehn
- Laboratoire Agronomie et Environnement, INRA UMR1121, 2 avenue de la forêt de Haye TSA 40602, 54518, Vandœuvre-lès-Nancy, France
- Laboratoire Agronomie et Environnement, Université de Lorraine UMR1121, 2 avenue de la forêt de Haye - TSA 40602, 54518, Vandœuvre-lès-Nancy, France
| |
Collapse
|
22
|
El Houasli M, Maigret B, Devignes MD, Ghoorah AW, Grudinin S, Ritchie DW. Modeling and minimizing CAPRI round 30 symmetrical protein complexes from CASP-11 structural models. Proteins 2016; 85:463-469. [PMID: 27701764 DOI: 10.1002/prot.25182] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2016] [Revised: 09/13/2016] [Accepted: 09/24/2016] [Indexed: 11/06/2022]
Abstract
Many of the modeling targets in the blind CASP-11/CAPRI-30 experiment were protein homo-dimers and homo-tetramers. Here, we perform a retrospective docking-based analysis of the perfectly symmetrical CAPRI Round 30 targets whose crystal structures have been published. Starting from the CASP "stage-2" fold prediction models, we show that using our recently developed "SAM" polar Fourier symmetry docking algorithm combined with NAMD energy minimization often gives acceptable or better 3D models of the target complexes. We also use SAM to analyze the overall quality of all CASP structural models for the selected targets from a docking-based perspective. We demonstrate that docking only CASP "center" structures for the selected targets provides a fruitful and economical docking strategy. Furthermore, our results show that many of the CASP models are dockable in the sense that they can lead to acceptable or better models of symmetrical complexes. Even though SAM is very fast, using docking and NAMD energy minimization to pull out acceptable docking models from a large ensemble of docked CASP models is computationally expensive. Nonetheless, thanks to our SAM docking algorithm, we expect that applying our docking protocol on a modern computer cluster will give us the ability to routinely model 3D structures of symmetrical protein complexes from CASP-quality models. Proteins 2017; 85:463-469. © 2016 Wiley Periodicals, Inc.
Collapse
Affiliation(s)
- Marwa El Houasli
- INRIA, Equipe Capsid, Campus Scientique, BP 239, 54506, Vandoeuvre-lès-Nancy, France
| | | | | | - Anisah W Ghoorah
- Department of Computer Science and Engineering, University of Mauritius
| | | | - David W Ritchie
- INRIA, Equipe Capsid, Campus Scientique, BP 239, 54506, Vandoeuvre-lès-Nancy, France
| |
Collapse
|
23
|
Ritchie DW. Calculating and scoring high quality multiple flexible protein structure alignments. Bioinformatics 2016; 32:2650-8. [PMID: 27187202 DOI: 10.1093/bioinformatics/btw300] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2015] [Accepted: 05/07/2016] [Indexed: 11/14/2022] Open
Abstract
MOTIVATION Calculating multiple protein structure alignments (MSAs) is important for understanding functional and evolutionary relationships between protein families, and for modeling protein structures by homology. While incorporating backbone flexibility promises to circumvent many of the limitations of rigid MSA algorithms, very few flexible MSA algorithms exist today. This article describes several novel improvements to the Kpax algorithm which allow high quality flexible MSAs to be calculated. This article also introduces a new Gaussian-based MSA quality measure called 'M-score', which circumvents the pitfalls of RMSD-based quality measures. RESULTS As well as calculating flexible MSAs, the new version of Kpax can also score MSAs from other aligners and from previously aligned reference datasets. Results are presented for a large-scale evaluation of the Homstrad, SABmark and SISY benchmark sets using Kpax and Matt as examples of state-of-the-art flexible aligners and 3DCOMB as an example of a state-of-the-art rigid aligner. These results demonstrate the utility of the M-score as a measure of MSA quality and show that high quality MSAs may be achieved when structural flexibility is properly taken into account. AVAILABILITY AND IMPLEMENTATION Kpax 5.0 may be downloaded for academic use at http://kpax.loria.fr/ CONTACT dave.ritchie@inria.fr SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
|
24
|
Maghawry HA, Mostafa MGM, Gharib TF. A new protein structure representation for efficient protein function prediction. J Comput Biol 2015; 21:936-46. [PMID: 25343279 DOI: 10.1089/cmb.2014.0137] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023] Open
Abstract
One of the challenging problems in bioinformatics is the prediction of protein function. Protein function is the main key that can be used to classify different proteins. Protein function can be inferred experimentally with very small throughput or computationally with very high throughput. Computational methods are sequence based or structure based. Structure-based methods produce more accurate protein function prediction. In this article, we propose a new protein structure representation for efficient protein function prediction. The representation is based on three-dimensional patterns of protein residues. In the analysis, we used protein function based on enzyme activity through six mechanistically diverse enzyme superfamilies: amidohydrolase, crotonase, haloacid dehalogenase, isoprenoid synthase type I, and vicinal oxygen chelate. We applied three different classification methods, naïve Bayes, k-nearest neighbors, and random forest, to predict the enzyme superfamily of a given protein. The prediction accuracy using the proposed representation outperforms a recently introduced representation method that is based only on the distance patterns. The results show that the proposed representation achieved prediction accuracy up to 98%, with improvement of about 10% on average.
Collapse
Affiliation(s)
- Huda A Maghawry
- 1 Department of Information Systems, Faculty of Computer and Information Sciences, Ain Shams University , Cairo, Egypt
| | | | | |
Collapse
|
25
|
Aronchik I, Kundu A, Quirit JG, Firestone GL. The antiproliferative response of indole-3-carbinol in human melanoma cells is triggered by an interaction with NEDD4-1 and disruption of wild-type PTEN degradation. Mol Cancer Res 2014; 12:1621-1634. [PMID: 25009292 DOI: 10.1158/1541-7786.mcr-14-0018] [Citation(s) in RCA: 55] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
Abstract
UNLABELLED Human melanoma cells displaying distinct PTEN genotypes were used to assess the cellular role of this important tumor-suppressor protein in the antiproliferative response induced by the chemopreventative agent indole-3-carbinol (I3C), a natural indolecarbinol compound derived from the breakdown of glucobrassicin produced in cruciferous vegetables such as broccoli and Brussels sprouts. I3C induced a G1-phase cell-cycle arrest and apoptosis by stabilization of PTEN in human melanoma cells that express wild-type PTEN, but not in cells with mutant or null PTEN genotypes. Importantly, normal human epidermal melanocytes were unaffected by I3C treatment. In wild-type PTEN-expressing melanoma xenografts, formed in athymic mice, I3C inhibited the in vivo tumor growth rate and increased PTEN protein levels in the residual tumors. Mechanistically, I3C disrupted the ubiquitination of PTEN by NEDD4-1 (NEDD4), which prevented the proteasome-mediated degradation of PTEN without altering its transcript levels. RNAi-mediated knockdown of PTEN prevented the I3C-induced apoptotic response, whereas knockdown of NEDD4-1 mimicked the I3C apoptotic response, stabilized PTEN protein levels, and downregulated phosphorylated AKT-1 levels. Co-knockdown of PTEN and NEDD4-1 revealed that I3C-regulated apoptotic signaling through NEDD4-1 requires the presence of the wild-type PTEN protein. Finally, in silico structural modeling, in combination with isothermal titration calorimetry analysis, demonstrated that I3C directly interacts with purified NEDD4-1 protein. IMPLICATIONS This study identifies NEDD4-1 as a new I3C target protein, and that the I3C disruption of NEDD4-1 ubiquitination activity triggers the stabilization of the wild-type PTEN tumor suppressor to induce an antiproliferative response in melanoma. Mol Cancer Res; 12(11); 1621-34. ©2014 AACR.
Collapse
Affiliation(s)
- Ida Aronchik
- Department of Molecular and Cell Biology and The Cancer Research Laboratory, Univ. of California at Berkeley, Berkeley, CA 94720-3200
| | - Aishwarya Kundu
- Department of Molecular and Cell Biology and The Cancer Research Laboratory, Univ. of California at Berkeley, Berkeley, CA 94720-3200
| | - Jeanne G Quirit
- Department of Molecular and Cell Biology and The Cancer Research Laboratory, Univ. of California at Berkeley, Berkeley, CA 94720-3200
| | - Gary L Firestone
- Department of Molecular and Cell Biology and The Cancer Research Laboratory, Univ. of California at Berkeley, Berkeley, CA 94720-3200
| |
Collapse
|
26
|
Dhifli W, Saidi R, Nguifo EM. Smoothing 3D Protein Structure Motifs Through Graph Mining and Amino Acid Similarities. J Comput Biol 2014; 21:162-72. [DOI: 10.1089/cmb.2013.0092] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022] Open
Affiliation(s)
- Wajdi Dhifli
- LIMOS, Blaise Pascal University, Clermont University, Clermont-Ferrand, France
- LIMOS, CNRS UMR 6158, Aubière, France
| | - Rabie Saidi
- European Bioinformatics Institute, Hinxton, Cambridge, United Kingdom
| | - Engelbert Mephu Nguifo
- LIMOS, Blaise Pascal University, Clermont University, Clermont-Ferrand, France
- LIMOS, CNRS UMR 6158, Aubière, France
| |
Collapse
|
27
|
Ghoorah AW, Devignes MD, Smaïl-Tabbone M, Ritchie DW. KBDOCK 2013: a spatial classification of 3D protein domain family interactions. Nucleic Acids Res 2013; 42:D389-95. [PMID: 24271397 PMCID: PMC3964971 DOI: 10.1093/nar/gkt1199] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Abstract
Comparing, classifying and modelling protein structural interactions can enrich our understanding of many biomolecular processes. This contribution describes Kbdock (http://kbdock.loria.fr/), a database system that combines the Pfam domain classification with coordinate data from the PDB to analyse and model 3D domain–domain interactions (DDIs). Kbdock can be queried using Pfam domain identifiers, protein sequences or 3D protein structures. For a given query domain or pair of domains, Kbdock retrieves and displays a non-redundant list of homologous DDIs or domain–peptide interactions in a common coordinate frame. Kbdock may also be used to search for and visualize interactions involving different, but structurally similar, Pfam families. Thus, structural DDI templates may be proposed even when there is little or no sequence similarity to the query domains.
Collapse
Affiliation(s)
- Anisah W Ghoorah
- Université de Lorraine, LORIA, Campus Scientifique, BP 239, 54506 Villers-lès-Nancy, France, CNRS, LORIA, Campus Scientifique, BP 239, 54506 Villers-lès-Nancy, France and INRIA Nancy Grand Est, LORIA, Campus Scientifique, BP 239, 54506 Villers-lès-Nancy, France
| | | | | | | |
Collapse
|
28
|
Holder A, Simon J, Strauser J, Taylor J, Shibberu Y. Dynamic Programming Used to Align Protein Structures with a Spectrum Is Robust. BIOLOGY 2013; 2:1296-310. [PMID: 24833226 PMCID: PMC4009789 DOI: 10.3390/biology2041296] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/03/2013] [Revised: 10/23/2013] [Accepted: 11/08/2013] [Indexed: 11/16/2022]
Abstract
Several efficient algorithms to conduct pairwise comparisons among large databases of protein structures have emerged in the recent literature. The central theme is the design of a measure between the Cα atoms of two protein chains, from which dynamic programming is used to compute an alignment. The efficiency and efficacy of these algorithms allows large-scale computational studies that would have been previously impractical. The computational study herein shows that the structural alignment algorithm eigen-decomposition alignment with the spectrum (EIGAs) is robust against both parametric and structural variation.
Collapse
Affiliation(s)
- Allen Holder
- Department of Mathematics, Rose-Hulman Institute of Technology, Terre Haute, IN 47803, USA.
| | - Jacqueline Simon
- Department of Mathematics, Rose-Hulman Institute of Technology, Terre Haute, IN 47803, USA.
| | | | - Jonathan Taylor
- Department of Mathematics, Rose-Hulman Institute of Technology, Terre Haute, IN 47803, USA.
| | - Yosi Shibberu
- Department of Mathematics, Rose-Hulman Institute of Technology, Terre Haute, IN 47803, USA.
| |
Collapse
|