1
|
Hwang Y, Cornman AL, Kellogg EH, Ovchinnikov S, Girguis PR. Genomic language model predicts protein co-regulation and function. Nat Commun 2024; 15:2880. [PMID: 38570504 PMCID: PMC10991518 DOI: 10.1038/s41467-024-46947-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2023] [Accepted: 03/13/2024] [Indexed: 04/05/2024] Open
Abstract
Deciphering the relationship between a gene and its genomic context is fundamental to understanding and engineering biological systems. Machine learning has shown promise in learning latent relationships underlying the sequence-structure-function paradigm from massive protein sequence datasets. However, to date, limited attempts have been made in extending this continuum to include higher order genomic context information. Evolutionary processes dictate the specificity of genomic contexts in which a gene is found across phylogenetic distances, and these emergent genomic patterns can be leveraged to uncover functional relationships between gene products. Here, we train a genomic language model (gLM) on millions of metagenomic scaffolds to learn the latent functional and regulatory relationships between genes. gLM learns contextualized protein embeddings that capture the genomic context as well as the protein sequence itself, and encode biologically meaningful and functionally relevant information (e.g. enzymatic function, taxonomy). Our analysis of the attention patterns demonstrates that gLM is learning co-regulated functional modules (i.e. operons). Our findings illustrate that gLM's unsupervised deep learning of the metagenomic corpus is an effective and promising approach to encode functional semantics and regulatory syntax of genes in their genomic contexts and uncover complex relationships between genes in a genomic region.
Collapse
Affiliation(s)
- Yunha Hwang
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA, USA.
| | | | - Elizabeth H Kellogg
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY, USA
- Department of Structural Biology, St. Jude Children's Research Hospital, Memphis, TN, USA
| | - Sergey Ovchinnikov
- John Harvard Distinguished Science Fellowship Program, Harvard University, Cambridge, MA, USA.
- Department of Biology, Massachusetts Institute of Technology, Cambridge, MA, USA.
| | - Peter R Girguis
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA, USA.
| |
Collapse
|
2
|
Robinson SL. Structure-guided metagenome mining to tap microbial functional diversity. Curr Opin Microbiol 2023; 76:102382. [PMID: 37741262 DOI: 10.1016/j.mib.2023.102382] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2023] [Revised: 05/21/2023] [Accepted: 08/22/2023] [Indexed: 09/25/2023]
Abstract
Scientists now have access to millions of accurate three-dimensional (3D) models of protein structures. How do we leverage 3D structural models to learn about microbial functions encoded in metagenomes? Here, we review recent developments using protein structural features to mine metagenomes from diverse environments ranging from the human gut to soil and ocean viromes. We compare 3D protein structural methods to characterize antibiotic resistance phenotypes, nutrient cycling, and host-drug-microbe interactions. Broadly, we encourage the scientific community to look beyond global sequence and structure alignments by considering fine-grained descriptors such as distance to ligand, active site, and tertiary interactions between amino acid residues scaling to microbiomes. Finally, we highlight structure-inspired approaches to chart new areas of microbial protein-coding sequence space.
Collapse
Affiliation(s)
- Serina L Robinson
- Department of Environmental Microbiology, Eawag, Swiss Federal Institute of Aquatic Science and Technology, Ueberlandstrasse 133, 8600 Dübendorf, Switzerland.
| |
Collapse
|
3
|
Sankar S, Preeti P, Ravikumar K, Kumar A, Prasad Y, Pal S, Rao DN, Savithri HS, Chandra N. Structural similarities between SAM and ATP recognition motifs and detection of ATP binding in a SAM binding DNA methyltransferase. Curr Res Struct Biol 2023; 6:100108. [PMID: 38106461 PMCID: PMC10724544 DOI: 10.1016/j.crstbi.2023.100108] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2023] [Revised: 10/05/2023] [Accepted: 10/24/2023] [Indexed: 12/19/2023] Open
Abstract
S-adenosylmethionine (SAM) is a ubiquitous co-factor that serves as a donor for methylation reactions and additionally serves as a donor of other functional groups such as amino and ribosyl moieties in a variety of other biochemical reactions. Such versatility in function is enabled by the ability of SAM to be recognized by a wide variety of protein molecules that vary in their sequences and structural folds. To understand what gives rise to specific SAM binding in diverse proteins, we set out to study if there are any structural patterns at their binding sites. A comprehensive analysis of structures of the binding sites of SAM by all-pair comparison and clustering, indicated the presence of 4 different site-types, only one among them being well studied. For each site-type we decipher the common minimum principle involved in SAM recognition by diverse proteins and derive structural motifs that are characteristic of SAM binding. The presence of the structural motifs with precise three-dimensional arrangement of amino acids in SAM sites that appear to have evolved independently, indicates that these are winning arrangements of residues to bring about SAM recognition. Further, we find high similarity between one of the SAM site types and a well known ATP binding site type. We demonstrate using in vitro experiments that a known SAM binding protein, HpyAII.M1, a type 2 methyltransferase can bind and hydrolyse ATP. We find common structural motifs that explain this, further supported through site-directed mutagenesis. Observation of similar motifs for binding two of the most ubiquitous ligands in multiple protein families with diverse sequences and structural folds presents compelling evidence at the molecular level in favour of convergent evolution.
Collapse
Affiliation(s)
- Santhosh Sankar
- Department of Biochemistry, Indian Institute of Science, Bangalore, 560012, Karnataka, India
| | - Preeti Preeti
- Department of Biochemistry, Indian Institute of Science, Bangalore, 560012, Karnataka, India
| | - Kavya Ravikumar
- Department of Biochemistry, Indian Institute of Science, Bangalore, 560012, Karnataka, India
| | - Amrendra Kumar
- Department of Biochemistry, Indian Institute of Science, Bangalore, 560012, Karnataka, India
| | - Yedu Prasad
- Department of Biochemistry, Indian Institute of Science, Bangalore, 560012, Karnataka, India
| | - Sukriti Pal
- Department of Biochemistry, Indian Institute of Science, Bangalore, 560012, Karnataka, India
| | - Desirazu N. Rao
- Department of Biochemistry, Indian Institute of Science, Bangalore, 560012, Karnataka, India
| | - Handanahal S. Savithri
- Department of Biochemistry, Indian Institute of Science, Bangalore, 560012, Karnataka, India
| | - Nagasuma Chandra
- Department of Biochemistry, Indian Institute of Science, Bangalore, 560012, Karnataka, India
- Department of BioEngineering, Indian Institute of Science, Bangalore, 560012, Karnataka, India
| |
Collapse
|
4
|
Burroughs A, Aravind L. New biochemistry in the Rhodanese-phosphatase superfamily: emerging roles in diverse metabolic processes, nucleic acid modifications, and biological conflicts. NAR Genom Bioinform 2023; 5:lqad029. [PMID: 36968430 PMCID: PMC10034599 DOI: 10.1093/nargab/lqad029] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2022] [Revised: 02/10/2023] [Accepted: 03/09/2023] [Indexed: 03/25/2023] Open
Abstract
The protein-tyrosine/dual-specificity phosphatases and rhodanese domains constitute a sprawling superfamily of Rossmannoid domains that use a conserved active site with a cysteine to catalyze a range of phosphate-transfer, thiotransfer, selenotransfer and redox activities. While these enzymes have been extensively studied in the context of protein/lipid head group dephosphorylation and various thiotransfer reactions, their overall diversity and catalytic potential remain poorly understood. Using comparative genomics and sequence/structure analysis, we comprehensively investigate and develop a natural classification for this superfamily. As a result, we identified several novel clades, both those which retain the catalytic cysteine and those where a distinct active site has emerged in the same location (e.g. diphthine synthase-like methylases and RNA 2' OH ribosyl phosphate transferases). We also present evidence that the superfamily has a wider range of catalytic capabilities than previously known, including a set of parallel activities operating on various sugar/sugar alcohol groups in the context of NAD+-derivatives and RNA termini, and potential phosphate transfer activities involving sugars and nucleotides. We show that such activities are particularly expanded in the RapZ-C-DUF488-DUF4326 clade, defined here for the first time. Some enzymes from this clade are predicted to catalyze novel DNA-end processing activities as part of nucleic-acid-modifying systems that are likely to function in biological conflicts between viruses and their hosts.
Collapse
Affiliation(s)
- A Maxwell Burroughs
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| | - L Aravind
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894, USA
| |
Collapse
|
5
|
Ribeiro AJM, Riziotis IG, Tyzack JD, Borkakoti N, Thornton JM. Using mechanism similarity to understand enzyme evolution. Biophys Rev 2022; 14:1273-1280. [PMID: 36659981 PMCID: PMC9842563 DOI: 10.1007/s12551-022-01022-9] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2022] [Accepted: 11/21/2022] [Indexed: 12/04/2022] Open
Abstract
Enzyme reactions take place in the active site through a series of catalytic steps, which are collectively termed the enzyme mechanism. The catalytic step is thereby the individual unit to consider for the purposes of building new enzyme mechanisms - i.e. through the mix and match of individual catalytic steps, new enzyme mechanisms and reactions can be conceived. In the case of natural evolution, it has been shown that new enzyme functions have emerged through the tweaking of existing mechanisms by the addition, removal, or modification of some catalytic steps, while maintaining other steps of the mechanism intact. Recently, we have extracted and codified the information on the catalytic steps of hundreds of enzymes in a machine-readable way, with the aim of automating this kind of evolutionary analysis. In this paper, we illustrate how these data, which we called the "rules of enzyme catalysis", can be used to identify similar catalytic steps across enzymes that differ in their overall function and/or structural folds. A discussion on a set of three enzymes that share part of their mechanism is used as an exemplar to illustrate how this approach can reveal divergent and convergent evolution of enzymes at the mechanistic level. Supplementary Information The online version contains supplementary material available at 10.1007/s12551-022-01022-9.
Collapse
Affiliation(s)
- António J. M. Ribeiro
- European Bioinformatics Institute - European Molecular Biology Laboratory, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD UK
| | - Ioannis G. Riziotis
- European Bioinformatics Institute - European Molecular Biology Laboratory, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD UK
| | - Jonathan D. Tyzack
- European Bioinformatics Institute - European Molecular Biology Laboratory, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD UK
| | - Neera Borkakoti
- European Bioinformatics Institute - European Molecular Biology Laboratory, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD UK
| | - Janet M. Thornton
- European Bioinformatics Institute - European Molecular Biology Laboratory, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD UK
| |
Collapse
|
6
|
Sanchez Granel ML, Siburu NG, Fricska A, Maldonado LL, Gargiulo LB, Nudel CB, Uttaro AD, Nusblat AD. A novel Tetrahymena thermophila sterol C-22 desaturase belongs to the Fatty Acid Hydroxylase/Desaturase superfamily. J Biol Chem 2022; 298:102397. [PMID: 35988640 PMCID: PMC9485055 DOI: 10.1016/j.jbc.2022.102397] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2022] [Revised: 08/09/2022] [Accepted: 08/15/2022] [Indexed: 11/27/2022] Open
Abstract
Sterols in eukaryotic cells play important roles in modulating membrane fluidity and in cell signaling and trafficking. During evolution, a combination of gene losses and acquisitions gave rise to an extraordinary diversity of sterols in different organisms. The sterol C-22 desaturase identified in plants and fungi as a cytochrome P-450 monooxygenase evolved from the first eukaryotic cytochrome P450 and was lost in many lineages. Although the ciliate Tetrahymena thermophila desaturates sterols at the C-22 position, no cytochrome P-450 orthologs are present in the genome. Here, we aim to identify the genes responsible for the desaturation as well as their probable origin. We used gene knockout and yeast heterologous expression approaches to identify two putative genes, retrieved from a previous transcriptomic analysis, as sterol C-22 desaturases. Furthermore, we demonstrate using bioinformatics and evolutionary analyses that both genes encode a novel type of sterol C-22 desaturase that belongs to the large fatty acid hydroxylase/desaturase superfamily and the genes originated by genetic duplication prior to functional diversification. These results stress the widespread existence of nonhomologous isofunctional enzymes among different lineages of the tree of life as well as the suitability for the use of T. thermophila as a valuable model to investigate the evolutionary process of large enzyme families.
Collapse
Affiliation(s)
- María L Sanchez Granel
- Instituto de Nanobiotecnología (NANOBIOTEC), CONICET, Facultad de Farmacia y Bioquímica, Universidad de Buenos Aires, Junín 956, C1113AAD, Buenos Aires, Argentina
| | - Nicolás G Siburu
- Instituto de Biología Molecular y Celular de Rosario, CONICET, Facultad de Ciencias Bioquímicas y Farmacéuticas, Universidad Nacional de Rosario, Ocampo y Esmeralda s/n, S2000FHQ, Rosario, Argentina
| | - Annamária Fricska
- Instituto de Nanobiotecnología (NANOBIOTEC), CONICET, Facultad de Farmacia y Bioquímica, Universidad de Buenos Aires, Junín 956, C1113AAD, Buenos Aires, Argentina
| | - Lucas L Maldonado
- Instituto de Investigaciones en Microbiología y Parasitología Médica (IMPaM), CONICET, Facultad de Medicina, Universidad de Buenos Aires, Junín 956, C1113AAD, Buenos Aires, Argentina
| | - Laura B Gargiulo
- Instituto de Nanobiotecnología (NANOBIOTEC), CONICET, Facultad de Farmacia y Bioquímica, Universidad de Buenos Aires, Junín 956, C1113AAD, Buenos Aires, Argentina
| | - Clara B Nudel
- Instituto de Nanobiotecnología (NANOBIOTEC), CONICET, Facultad de Farmacia y Bioquímica, Universidad de Buenos Aires, Junín 956, C1113AAD, Buenos Aires, Argentina
| | - Antonio D Uttaro
- Instituto de Biología Molecular y Celular de Rosario, CONICET, Facultad de Ciencias Bioquímicas y Farmacéuticas, Universidad Nacional de Rosario, Ocampo y Esmeralda s/n, S2000FHQ, Rosario, Argentina.
| | - Alejandro D Nusblat
- Instituto de Nanobiotecnología (NANOBIOTEC), CONICET, Facultad de Farmacia y Bioquímica, Universidad de Buenos Aires, Junín 956, C1113AAD, Buenos Aires, Argentina.
| |
Collapse
|
7
|
Dobó J, Kocsis A, Dani R, Gál P. Proprotein Convertases and the Complement System. Front Immunol 2022; 13:958121. [PMID: 35874789 PMCID: PMC9296861 DOI: 10.3389/fimmu.2022.958121] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2022] [Accepted: 06/13/2022] [Indexed: 11/27/2022] Open
Abstract
Proteins destined for secretion - after removal of the signal sequence - often undergo further proteolytic processing by proprotein convertases (PCs). Prohormones are typically processed in the regulated secretory pathway, while most plasma proteins travel though the constitutive pathway. The complement system is a major proteolytic cascade in the blood, serving as a first line of defense against microbes and also contributing to the immune homeostasis. Several complement components, namely C3, C4, C5 and factor I (FI), are multi-chain proteins that are apparently processed by PCs intracellularly. Cleavage occurs at consecutive basic residues and probably also involves the action of carboxypeptidases. The most likely candidate for the intracellular processing of complement proteins is furin, however, because of the overlapping specificities of basic amino acid residue-specific proprotein convertases, other PCs might be involved. To our surprise, we have recently discovered that processing of another complement protein, mannan-binding lectin-associated serine protease-3 (MASP-3) occurs in the blood by PCSK6 (PACE4). A similar mechanism had been described for the membrane protease corin, which is also activated extracellularly by PCSK6. In this review we intend to point out that the proper functioning of the complement system intimately depends on the action of proprotein convertases. In addition to the non-enzymatic components (C3, C4, C5), two constitutively active complement proteases are directly activated by PCs either intracellularly (FI), or extracellularly (MASP-3), moreover indirectly, through the constitutive activation of pro-factor D by MASP-3, the activity of the alternative pathway also depends on a PC present in the blood.
Collapse
Affiliation(s)
| | | | | | - Péter Gál
- *Correspondence: József Dobó, ; Péter Gál,
| |
Collapse
|
8
|
Riziotis IG, Thornton JM. Capturing the geometry, function, and evolution of enzymes with 3D templates. Protein Sci 2022; 31:e4363. [PMID: 35762726 PMCID: PMC9207746 DOI: 10.1002/pro.4363] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2022] [Revised: 05/06/2022] [Accepted: 05/14/2022] [Indexed: 11/05/2022]
Abstract
Structural templates are 3D signatures representing protein functional sites, such as ligand binding cavities, metal coordination motifs, or catalytic sites. Here we explore methods to generate template libraries and algorithms to query structures for conserved 3D motifs. Applications of templates are discussed, as well as some exemplar cases for examining evolutionary links in enzymes. We also introduce the concept of using more than one template per structure to represent flexible sites, as an approach to better understand catalysis through snapshots captured in enzyme structures. Functional annotation from structure is an important topic that has recently resurfaced due to the new more accurate methods of protein structure prediction. Therefore, we anticipate that template‐based functional site detection will be a powerful tool in the task of characterizing a vast number of new protein models.
Collapse
|
9
|
Riziotis IG, Ribeiro AJ, Borkakoti N, Thornton JM. Conformational variation in enzyme catalysis: A structural study on catalytic residues. J Mol Biol 2022; 434:167517. [PMID: 35240125 PMCID: PMC9005782 DOI: 10.1016/j.jmb.2022.167517] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2021] [Revised: 02/21/2022] [Accepted: 02/23/2022] [Indexed: 11/26/2022]
Abstract
We introduce a pipeline to compare and contrast active sites from homologous enzymes in 3D. Comprehensive structural study covering enzymes from a large functional space. High heterogeneity in magnitude of active site flexibililty between enzyme families. Diffferent catalytic residue types and functions relate to different degrees of flexibility. Four paradigms classify enzymes according to the structural behaviour during catalysis.
Conformational variation in catalytic residues can be captured as alternative snapshots in enzyme crystal structures. Addressing the question of whether active site flexibility is an intrinsic and essential property of enzymes for catalysis, we present a comprehensive study on the 3D variation of active sites of 925 enzyme families, using explicit catalytic residue annotations from the Mechanism and Catalytic Site Atlas and structural data from the Protein Data Bank. Through weighted pairwise superposition of the functional atoms of active sites, we captured structural variability at single-residue level and examined the geometrical changes as ligands bind or as mutations occur. We demonstrate that catalytic centres of enzymes can be inherently rigid or flexible to various degrees according to the function they perform, and structural variability most often involves a subset of the catalytic residues, usually those not directly involved in the formation or cleavage of bonds. Moreover, data suggest that 2/3 of active sites are flexible, and in half of those, flexibility is only observed in the side chain. The goal of this work is to characterise our current knowledge of the extent of flexibility at the heart of catalysis and ultimately place our findings in the context of the evolution of catalysis as enzymes evolve new functions and bind different substrates.
Collapse
|
10
|
Lewis AJO, Hegde RS. A unified evolutionary origin for the ubiquitous protein transporters SecY and YidC. BMC Biol 2021; 19:266. [PMID: 34911545 PMCID: PMC8675477 DOI: 10.1186/s12915-021-01171-5] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2021] [Accepted: 10/21/2021] [Indexed: 02/08/2023] Open
Abstract
BACKGROUND Protein transporters translocate hydrophilic segments of polypeptide across hydrophobic cell membranes. Two protein transporters are ubiquitous and date back to the last universal common ancestor: SecY and YidC. SecY consists of two pseudosymmetric halves, which together form a membrane-spanning protein-conducting channel. YidC is an asymmetric molecule with a protein-conducting hydrophilic groove that partially spans the membrane. Although both transporters mediate insertion of membrane proteins with short translocated domains, only SecY transports secretory proteins and membrane proteins with long translocated domains. The evolutionary origins of these ancient and essential transporters are not known. RESULTS The features conserved by the two halves of SecY indicate that their common ancestor was an antiparallel homodimeric channel. Structural searches with SecY's halves detect exceptional similarity with YidC homologs. The SecY halves and YidC share a fold comprising a three-helix bundle interrupted by a helical hairpin. In YidC, this hairpin is cytoplasmic and facilitates substrate delivery, whereas in SecY, it is transmembrane and forms the substrate-binding lateral gate helices. In both transporters, the three-helix bundle forms a protein-conducting hydrophilic groove delimited by a conserved hydrophobic residue. Based on these similarities, we propose that SecY originated as a YidC homolog which formed a channel by juxtaposing two hydrophilic grooves in an antiparallel homodimer. We find that archaeal YidC and its eukaryotic descendants use this same dimerisation interface to heterodimerise with a conserved partner. YidC's sufficiency for the function of simple cells is suggested by the results of reductive evolution in mitochondria and plastids, which tend to retain SecY only if they require translocation of large hydrophilic domains. CONCLUSIONS SecY and YidC share previously unrecognised similarities in sequence, structure, mechanism, and function. Our delineation of a detailed correspondence between these two essential and ancient transporters enables a deeper mechanistic understanding of how each functions. Furthermore, key differences between them help explain how SecY performs its distinctive function in the recognition and translocation of secretory proteins. The unified theory presented here explains the evolution of these features, and thus reconstructs a key step in the origin of cells.
Collapse
Affiliation(s)
- Aaron J O Lewis
- MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge, CB2 0QH, UK.
| | - Ramanujan S Hegde
- MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge, CB2 0QH, UK.
| |
Collapse
|
11
|
MohammadiPeyhani H, Chiappino-Pepe A, Haddadi K, Hafner J, Hadadi N, Hatzimanikatis V. NICEdrug.ch, a workflow for rational drug design and systems-level analysis of drug metabolism. eLife 2021; 10:e65543. [PMID: 34340747 PMCID: PMC8331181 DOI: 10.7554/elife.65543] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2020] [Accepted: 07/07/2021] [Indexed: 12/30/2022] Open
Abstract
The discovery of a drug requires over a decade of intensive research and financial investments - and still has a high risk of failure. To reduce this burden, we developed the NICEdrug.ch resource, which incorporates 250,000 bioactive molecules, and studied their enzymatic metabolic targets, fate, and toxicity. NICEdrug.ch includes a unique fingerprint that identifies reactive similarities between drug-drug and drug-metabolite pairs. We validated the application, scope, and performance of NICEdrug.ch over similar methods in the field on golden standard datasets describing drugs and metabolites sharing reactivity, drug toxicities, and drug targets. We use NICEdrug.ch to evaluate inhibition and toxicity by the anticancer drug 5-fluorouracil, and suggest avenues to alleviate its side effects. We propose shikimate 3-phosphate for targeting liver-stage malaria with minimal impact on the human host cell. Finally, NICEdrug.ch suggests over 1300 candidate drugs and food molecules to target COVID-19 and explains their inhibitory mechanism for further experimental screening. The NICEdrug.ch database is accessible online to systematically identify the reactivity of small molecules and druggable enzymes with practical applications in lead discovery and drug repurposing.
Collapse
Affiliation(s)
- Homa MohammadiPeyhani
- Laboratory of Computational Systems Biotechnology, École Polytechnique Fédérale de Lausanne, EPFLLausanneSwitzerland
| | - Anush Chiappino-Pepe
- Laboratory of Computational Systems Biotechnology, École Polytechnique Fédérale de Lausanne, EPFLLausanneSwitzerland
| | - Kiandokht Haddadi
- Laboratory of Computational Systems Biotechnology, École Polytechnique Fédérale de Lausanne, EPFLLausanneSwitzerland
| | - Jasmin Hafner
- Laboratory of Computational Systems Biotechnology, École Polytechnique Fédérale de Lausanne, EPFLLausanneSwitzerland
| | - Noushin Hadadi
- Laboratory of Computational Systems Biotechnology, École Polytechnique Fédérale de Lausanne, EPFLLausanneSwitzerland
| | - Vassily Hatzimanikatis
- Laboratory of Computational Systems Biotechnology, École Polytechnique Fédérale de Lausanne, EPFLLausanneSwitzerland
| |
Collapse
|
12
|
Guterres H, Park SJ, Zhang H, Im W. CHARMM-GUI LBS Finder & Refiner for Ligand Binding Site Prediction and Refinement. J Chem Inf Model 2021; 61:3744-3751. [PMID: 34296608 DOI: 10.1021/acs.jcim.1c00561] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
Abstract
A protein performs its task by binding a variety of ligands in its local region that is also known as the ligand-binding-site (LBS). Therefore, accurate prediction, characterization, and refinement of LBS can facilitate protein functional annotations and structure-based drug design. In this work, we present CHARMM-GUI LBS Finder & Refiner (https://www.charmm-gui.org/input/lbsfinder) that predicts potential LBS, offers interactive features for local LBS structure analysis, and prepares various molecular dynamics (MD) systems and inputs by setting up distance restraint potentials for LBS structure refinement. LBS Finder & Refiner supports 5 different commonly used simulation programs, such as NAMD, AMBER, GROMACS, GENESIS, and OpenMM, for LBS structure refinement together with hydrogen mass repartitioning. The capability of LBS Finder & Refiner is illustrated through LBS structure predictions and refinements of 48 modeled and 20 apo benchmark target proteins. Overall, successful LBS structure predictions and refinements are seen in our benchmark tests. We hope that LBS Finder & Refiner is useful to predict, characterize, and refine potential LBS on any given protein of interest.
Collapse
Affiliation(s)
- Hugo Guterres
- Departments of Biological Sciences, Chemistry, Bioengineering, and Computer Science and Engineering, Lehigh University, Bethlehem, Pennsylvania 18015, United States
| | - Sang-Jun Park
- Departments of Biological Sciences, Chemistry, Bioengineering, and Computer Science and Engineering, Lehigh University, Bethlehem, Pennsylvania 18015, United States
| | - Han Zhang
- Departments of Biological Sciences, Chemistry, Bioengineering, and Computer Science and Engineering, Lehigh University, Bethlehem, Pennsylvania 18015, United States
| | - Wonpil Im
- Departments of Biological Sciences, Chemistry, Bioengineering, and Computer Science and Engineering, Lehigh University, Bethlehem, Pennsylvania 18015, United States
| |
Collapse
|
13
|
Galmés MÀ, Nödling AR, Luk L, Świderek K, Moliner V. Combined Theoretical and Experimental Study to Unravel the Differences in Promiscuous Amidase Activity of Two Nonhomologous Enzymes. ACS Catal 2021; 11:8635-8644. [PMID: 35875595 PMCID: PMC9299431 DOI: 10.1021/acscatal.1c02150] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/24/2023]
Abstract
![]()
Convergent evolution has resulted in nonhomologous enzymes that
contain similar active sites that catalyze the same primary and secondary
reactions. Comparing how these enzymes achieve their reaction promiscuity
can yield valuable insights to develop functions from the optimization
of latent activities. In this work, we have focused on the promiscuous
amidase activity in the esterase from Bacillus subtilis (Bs2) and compared with the same activity in the promiscuous lipase
B from Candida antarctica (CALB). The
study, combining multiscale quantum mechanics/molecular mechanics
(QM/MM) simulations, deep machine learning approaches, and experimental
characterization of Bs2 kinetics, confirms the amidase activity of
Bs2 and CALB. The computational results indicate that both enzymes
offer a slightly different reaction environment reflected by electrostatic
effects within the active site, thus resulting in a different reaction
mechanism during the acylation step. A convolutional neural network
(CNN) has been used to understand the conserved amino acids among
the evolved protein family and suggest that Bs2 provides a more robust
protein scaffold to perform future mutagenesis studies. Results derived
from this work will help reveal the origin of enzyme promiscuity,
which will find applications in enzyme (re)design, particularly in
creating a highly active amidase.
Collapse
Affiliation(s)
- Miquel À. Galmés
- Institute of Advanced Materials (INAM), Universitat Jaume I, 12071 Castellón, Spain
| | - Alexander R. Nödling
- Cardiff Catalysis Institute, School of Chemistry, Cardiff University, Main Building, Park Place, Cardiff CF10 3AT, United Kingdom
| | - Louis Luk
- Cardiff Catalysis Institute, School of Chemistry, Cardiff University, Main Building, Park Place, Cardiff CF10 3AT, United Kingdom
| | - Katarzyna Świderek
- Institute of Advanced Materials (INAM), Universitat Jaume I, 12071 Castellón, Spain
| | - Vicent Moliner
- Institute of Advanced Materials (INAM), Universitat Jaume I, 12071 Castellón, Spain
| |
Collapse
|
14
|
Beyer HM, Virtanen SI, Aranko AS, Mikula KM, Lountos GT, Wlodawer A, Ollila OHS, Iwaï H. The Convergence of the Hedgehog/Intein Fold in Different Protein Splicing Mechanisms. Int J Mol Sci 2020; 21:ijms21218367. [PMID: 33171880 PMCID: PMC7664689 DOI: 10.3390/ijms21218367] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2020] [Revised: 11/01/2020] [Accepted: 11/05/2020] [Indexed: 11/16/2022] Open
Abstract
Protein splicing catalyzed by inteins utilizes many different combinations of amino-acid types at active sites. Inteins have been classified into three classes based on their characteristic sequences. We investigated the structural basis of the protein splicing mechanism of class 3 inteins by determining crystal structures of variants of a class 3 intein from Mycobacterium chimaera and molecular dynamics simulations, which suggested that the class 3 intein utilizes a different splicing mechanism from that of class 1 and 2 inteins. The class 3 intein uses a bond cleavage strategy reminiscent of proteases but share the same Hedgehog/INTein (HINT) fold of other intein classes. Engineering of class 3 inteins from a class 1 intein indicated that a class 3 intein would unlikely evolve directly from a class 1 or 2 intein. The HINT fold appears as structural and functional solution for trans-peptidyl and trans-esterification reactions commonly exploited by diverse mechanisms using different combinations of amino-acid types for the active-site residues.
Collapse
Affiliation(s)
- Hannes M. Beyer
- Institute of Biotechnology, University of Helsinki, P.O. Box 65, FIN-00014 Helsinki, Finland; (H.M.B.); (S.I.V.); (A.S.A.); (K.M.M.); (O.H.S.O.)
| | - Salla I. Virtanen
- Institute of Biotechnology, University of Helsinki, P.O. Box 65, FIN-00014 Helsinki, Finland; (H.M.B.); (S.I.V.); (A.S.A.); (K.M.M.); (O.H.S.O.)
| | - A. Sesilja Aranko
- Institute of Biotechnology, University of Helsinki, P.O. Box 65, FIN-00014 Helsinki, Finland; (H.M.B.); (S.I.V.); (A.S.A.); (K.M.M.); (O.H.S.O.)
| | - Kornelia M. Mikula
- Institute of Biotechnology, University of Helsinki, P.O. Box 65, FIN-00014 Helsinki, Finland; (H.M.B.); (S.I.V.); (A.S.A.); (K.M.M.); (O.H.S.O.)
| | - George T. Lountos
- Basic Science Program, Frederick National Laboratory for Cancer Research, Frederick, MD 21702, USA;
| | - Alexander Wlodawer
- Macromolecular Crystallography Laboratory, National Cancer Institute, Frederick, MD 21702, USA;
| | - O. H. Samuli Ollila
- Institute of Biotechnology, University of Helsinki, P.O. Box 65, FIN-00014 Helsinki, Finland; (H.M.B.); (S.I.V.); (A.S.A.); (K.M.M.); (O.H.S.O.)
| | - Hideo Iwaï
- Institute of Biotechnology, University of Helsinki, P.O. Box 65, FIN-00014 Helsinki, Finland; (H.M.B.); (S.I.V.); (A.S.A.); (K.M.M.); (O.H.S.O.)
- Correspondence: ; Tel.: +358-2941-59752
| |
Collapse
|
15
|
Keshri V, Chabrière E, Pinault L, Colson P, Diene SM, Rolain JM, Raoult D, Pontarotti P. Promiscuous Enzyme Activity as a Driver of Allo and Iso Convergent Evolution, Lessons from the β-Lactamases. Int J Mol Sci 2020; 21:E6260. [PMID: 32872436 PMCID: PMC7504333 DOI: 10.3390/ijms21176260] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2020] [Revised: 08/17/2020] [Accepted: 08/26/2020] [Indexed: 02/06/2023] Open
Abstract
The probability of the evolution of a character depends on two factors: the probability of moving from one character state to another character state and the probability of the new character state fixation. The more the evolution of a character is probable, the more the convergent evolution will be witnessed, and consequently, convergent evolution could mean that the convergent character evolution results as a combination of these two factors. We investigated this phenomenon by studying the convergent evolution of biochemical functions. For the investigation we used the case of β-lactamases. β-lactamases hydrolyze β-lactams, which are antimicrobials able to block the DD-peptidases involved in bacterial cell wall synthesis. β-lactamase activity is present in two different superfamilies: the metallo-β-lactamase and the serine β-lactamase. The mechanism used to hydrolyze the β-lactam is different for the two superfamilies. We named this kind of evolution an allo-convergent evolution. We further showed that the β-lactamase activity evolved several times within each superfamily, a convergent evolution type that we named iso-convergent evolution. Both types of convergent evolution can be explained by the two evolutionary mechanisms discussed above. The probability of moving from one state to another is explained by the promiscuous β-lactamase activity present in the ancestral sequences of each superfamily, while the probability of fixation is explained in part by positive selection, as the organisms having β-lactamase activity allows them to resist organisms that secrete β-lactams. Indeed, an organism that has a mutation that increases the β-lactamase activity will be selected, as the organisms having this activity will have an advantage over the others.
Collapse
Affiliation(s)
- Vivek Keshri
- Aix-Marseille Univ IRD, APHM, MEPHI, IHU Méditerranée Infection, 19-21 Boulevard Jean Moulin, 13005 Marseille, France; (V.K.); (E.C.); (L.P.); (P.C.); (S.M.D.); (J.-M.R.); (D.R.)
| | - Eric Chabrière
- Aix-Marseille Univ IRD, APHM, MEPHI, IHU Méditerranée Infection, 19-21 Boulevard Jean Moulin, 13005 Marseille, France; (V.K.); (E.C.); (L.P.); (P.C.); (S.M.D.); (J.-M.R.); (D.R.)
| | - Lucile Pinault
- Aix-Marseille Univ IRD, APHM, MEPHI, IHU Méditerranée Infection, 19-21 Boulevard Jean Moulin, 13005 Marseille, France; (V.K.); (E.C.); (L.P.); (P.C.); (S.M.D.); (J.-M.R.); (D.R.)
| | - Philippe Colson
- Aix-Marseille Univ IRD, APHM, MEPHI, IHU Méditerranée Infection, 19-21 Boulevard Jean Moulin, 13005 Marseille, France; (V.K.); (E.C.); (L.P.); (P.C.); (S.M.D.); (J.-M.R.); (D.R.)
| | - Seydina M Diene
- Aix-Marseille Univ IRD, APHM, MEPHI, IHU Méditerranée Infection, 19-21 Boulevard Jean Moulin, 13005 Marseille, France; (V.K.); (E.C.); (L.P.); (P.C.); (S.M.D.); (J.-M.R.); (D.R.)
| | - Jean-Marc Rolain
- Aix-Marseille Univ IRD, APHM, MEPHI, IHU Méditerranée Infection, 19-21 Boulevard Jean Moulin, 13005 Marseille, France; (V.K.); (E.C.); (L.P.); (P.C.); (S.M.D.); (J.-M.R.); (D.R.)
| | - Didier Raoult
- Aix-Marseille Univ IRD, APHM, MEPHI, IHU Méditerranée Infection, 19-21 Boulevard Jean Moulin, 13005 Marseille, France; (V.K.); (E.C.); (L.P.); (P.C.); (S.M.D.); (J.-M.R.); (D.R.)
| | - Pierre Pontarotti
- Aix-Marseille Univ IRD, APHM, MEPHI, IHU Méditerranée Infection, 19-21 Boulevard Jean Moulin, 13005 Marseille, France; (V.K.); (E.C.); (L.P.); (P.C.); (S.M.D.); (J.-M.R.); (D.R.)
- SNC5039 CNRS, 19-21 Boulevard Jean Moulin, 13005 Marseille, France
| |
Collapse
|
16
|
Morris JS, Yu L, Facchini PJ. A single residue determines substrate preference in benzylisoquinoline alkaloid N-methyltransferases. PHYTOCHEMISTRY 2020; 170:112193. [PMID: 31765874 DOI: 10.1016/j.phytochem.2019.112193] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/27/2019] [Revised: 10/29/2019] [Accepted: 10/30/2019] [Indexed: 06/10/2023]
Abstract
N-methylation is a recurring feature in the biosynthesis of many plant specialized metabolites, including alkaloids. A crucial step in the conserved central pathway that provides intermediates for the biosynthesis of benzylisoquinoline alkaloids (BIAs) involves conversion of the secondary amine (S)-coclaurine into the tertiary amine (S)-N-methylcoclaurine by coclaurine N-methyltransferase (CNMT). Subsequent enzymatic steps yield the core intermediate (S)-reticuline, from which various branch pathways for the biosynthesis of major BIAs such as morphine, noscapine and sanguinarine diverge. An additional N-methylation yielding quaternary BIAs is catalyzed by reticuline N-methyltransferase (RNMT), such as in the branch pathway leading to the taxonomically widespread and ecologically significant alkaloid magnoflorine. Despite their functional differences, analysis of primary sequence information has been unable to accurately distinguish between CNMT-like and RNMT-like enzymes, necessitating laborious in vitro screening. Furthermore, despite a recent emphasis on structural characterization of BIA NMTs, the features and mechanisms underlying the CNMT-RNMT functional dichotomy were unknown. We report the identification of structural variants tightly correlated with function in known BIA NMTs and show through reciprocal mutagenesis that a single residue acts as a switch between CNMT- and RNMT-like functions. We use yeast in vivo screening to show that this discovery allows for accurate prediction of activity strictly from primary sequence information and, on this basis, improve the annotation of previously reported putative BIA NMTs. Our results highlight the unusually short mutational distance separating ancestral CNMT-like enzymes from more evolutionarily advanced RNMT-like enzymes, and thus help explain the widespread yet sporadic occurrence of quaternary BIAs in plants. While this is the first report of structural variants controlling mono-versus di-methylation activity among plant NMT enzymes, comparison with bacterial MT enzymes also suggests possible convergent evolution.
Collapse
Affiliation(s)
- Jeremy S Morris
- University of Calgary, Department of Biological Sciences, Calgary, Alberta, T2N 1N4, Canada
| | - Lisa Yu
- University of Calgary, Department of Biological Sciences, Calgary, Alberta, T2N 1N4, Canada
| | - Peter J Facchini
- University of Calgary, Department of Biological Sciences, Calgary, Alberta, T2N 1N4, Canada.
| |
Collapse
|
17
|
Computational methods and tools for binding site recognition between proteins and small molecules: from classical geometrical approaches to modern machine learning strategies. J Comput Aided Mol Des 2019; 33:887-903. [PMID: 31628659 DOI: 10.1007/s10822-019-00235-7] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2019] [Accepted: 10/11/2019] [Indexed: 10/25/2022]
Abstract
In the current "genomic era" the number of identified genes is growing exponentially. However, the biological function of a large number of the corresponding proteins is still unknown. Recognition of small molecule ligands (e.g., substrates, inhibitors, allosteric regulators, etc.) is pivotal for protein functions in the vast majority of the cases and knowledge of the region where these processes take place is essential for protein function prediction and drug design. In this regard, computational methods represent essential tools to tackle this problem. A significant number of software tools have been developed in the last few years which exploit either protein sequence information, structure information or both. This review describes the most recent developments in protein function recognition and binding site prediction, in terms of both freely-available and commercial solutions and tools, detailing the main characteristics of the considered tools and providing a comparative analysis of their performance.
Collapse
|
18
|
Keshri V, Arbuckle K, Chabrol O, Rolain J, Raoult D, Pontarotti P. The functional convergence of antibiotic resistance in β-lactamases is not conferred by a simple convergent substitution of amino acid. Evol Appl 2019; 12:1812-1822. [PMID: 31548859 PMCID: PMC6752183 DOI: 10.1111/eva.12835] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2018] [Revised: 06/06/2019] [Accepted: 06/12/2019] [Indexed: 01/25/2023] Open
Abstract
Bacterial resistance to antibiotics is a serious medical and public health concern worldwide. Such resistance is conferred by a variety of mechanisms, but the extensive variability in levels of resistance across bacteria is a common finding. Understanding the underlying evolutionary processes governing this functional variation in antibiotic resistance is important as it may allow the development of appropriate strategies to improve treatment options for bacterial infections. The main objective of this study was to examine the functional evolution of β-lactamases, a common mechanism of enzymatic resistance that inactivates a widely used class of antibiotics. We first obtained β-lactamase protein sequences and minimal inhibitory concentration (MIC), a measure of antibiotic function, from previously published literature. We then used a molecular phylogenetic framework to examine the evolution of β-lactamase functional activity. We found that the functional activity of antibiotic resistance mediated by β-lactamase has evolved in a convergent manner within molecular classes, but is not associated with any single amino acid substitution. This suggests that the dynamics of convergent evolution in this system can vary between the functional and molecular (sequence) levels. Such disassociation may hamper bioinformatic approaches to antibiotic resistance determination and underscore the need for (less efficient but more effective) activity assays as an essential step in evaluating resistance in a given case.
Collapse
Affiliation(s)
- Vivek Keshri
- Aix‐Marseille Université, IRD, APHM, Microbe, Evolution, PHylogenie, Infection, IHU ‐Méditerranée InfectionMarseilleFrance
| | - Kevin Arbuckle
- Department of Evolution, Ecology and BehaviourUniversity of LiverpoolLiverpoolUK
- Department of Biosciences, College of ScienceSwansea UniversitySwanseaUK
| | - Olivier Chabrol
- Aix‐Marseille Université, I2M, UMR‐CNRS 7373, Evolution Biologique et ModélisationMarseilleFrance
| | - Jean‐Marc Rolain
- Aix‐Marseille Université, IRD, APHM, Microbe, Evolution, PHylogenie, Infection, IHU ‐Méditerranée InfectionMarseilleFrance
| | - Didier Raoult
- Aix‐Marseille Université, IRD, APHM, Microbe, Evolution, PHylogenie, Infection, IHU ‐Méditerranée InfectionMarseilleFrance
| | - Pierre Pontarotti
- Aix‐Marseille Université, IRD, APHM, Microbe, Evolution, PHylogenie, Infection, IHU ‐Méditerranée InfectionMarseilleFrance
- CNRSMarseilleFrance
| |
Collapse
|
19
|
Chouhan BPS, Maimaiti S, Gade M, Laurino P. Rossmann-Fold Methyltransferases: Taking a "β-Turn" around Their Cofactor, S-Adenosylmethionine. Biochemistry 2018; 58:166-170. [PMID: 30406995 DOI: 10.1021/acs.biochem.8b00994] [Citation(s) in RCA: 31] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
Abstract
Methyltransferases (MTases) are superfamilies of enzymes that catalyze the transfer of a methyl group from S-adenosylmethionine (SAM), a nucleoside-based cofactor, to a wide variety of substrates such as DNA, RNA, proteins, small molecules, and lipids. Depending upon their structural features, the MTases can be further classified into different classes; we consider exclusively the largest class of MTases, the Rossmann-fold MTases. It has been shown that the nucleoside cofactor-binding Rossmann enzymes, particularly the nicotinamide adenine dinucleotide (NAD)-, flavin adenine dinucleotide (FAD)-, and SAM-binding MTases enzymes, share common binding motifs that include a Gly-rich loop region that interacts with the cofactor and a highly conserved acidic residue (Asp/Glu) that interacts with the ribose moiety of the cofactor. Here, we observe that the Gly-rich loop region of the Rossmann MTases adapts a specific type II' β-turn in the proximity of the cofactor (<4 Å), and it appears to be a key feature of these superfamilies. Additionally, we demonstrate that the conservation of this β-turn could play a critical role in the enzyme-cofactor interaction, thereby shedding new light on the structural conformation of the Gly-rich loop region from Rossmann MTases.
Collapse
Affiliation(s)
- Bhanu Pratap Singh Chouhan
- Okinawa Institute of Science and Technology Graduate University , 1919-1 Tancha, Onna-son , Okinawa 904-0412 , Japan
| | - Shayida Maimaiti
- Okinawa Institute of Science and Technology Graduate University , 1919-1 Tancha, Onna-son , Okinawa 904-0412 , Japan
| | - Madhuri Gade
- Okinawa Institute of Science and Technology Graduate University , 1919-1 Tancha, Onna-son , Okinawa 904-0412 , Japan
| | - Paola Laurino
- Okinawa Institute of Science and Technology Graduate University , 1919-1 Tancha, Onna-son , Okinawa 904-0412 , Japan
| |
Collapse
|
20
|
Silveira MC, Azevedo da Silva R, Faria da Mota F, Catanho M, Jardim R, R Guimarães AC, de Miranda AB. Systematic Identification and Classification of β-Lactamases Based on Sequence Similarity Criteria: β-Lactamase Annotation. Evol Bioinform Online 2018; 14:1176934318797351. [PMID: 30210232 PMCID: PMC6131288 DOI: 10.1177/1176934318797351] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2018] [Accepted: 08/08/2018] [Indexed: 12/11/2022] Open
Abstract
β-lactamases, the enzymes responsible for resistance to β-lactam antibiotics, are
widespread among prokaryotic genera. However, current β-lactamase classification
schemes do not represent their present diversity. Here, we propose a workflow to
identify and classify β-lactamases. Initially, a set of curated sequences was
used as a model for the construction of profiles Hidden Markov Models (HMM),
specific for each β-lactamase class. An extensive, nonredundant set of
β-lactamase sequences was constructed from 7 different resistance proteins
databases to test the methodology. The profiles HMM were improved for their
specificity and sensitivity and then applied to fully assembled genomes. Five
hierarchical classification levels are described, and a new class of
β-lactamases with fused domains is proposed. Our profiles HMM provide a better
annotation of β-lactamases, with classes and subclasses defined by objective
criteria such as sequence similarity. This classification offers a solid base to
the elaboration of studies on the diversity, dispersion, prevalence, and
evolution of the different classes and subclasses of this critical enzymatic
activity.
Collapse
Affiliation(s)
- Melise Chaves Silveira
- Laboratório de Biologia Computacional e Sistemas, Instituto Oswaldo Cruz, Fiocruz, Rio de Janeiro, Brazil
| | - Rangeline Azevedo da Silva
- Laboratório de Biologia Computacional e Sistemas, Instituto Oswaldo Cruz, Fiocruz, Rio de Janeiro, Brazil
| | - Fábio Faria da Mota
- Laboratório de Biologia Computacional e Sistemas, Instituto Oswaldo Cruz, Fiocruz, Rio de Janeiro, Brazil
| | - Marcos Catanho
- Laboratório de Genômica Funcional e Bioinformática, Instituto Oswaldo Cruz, Fiocruz, Rio de Janeiro, Brazil
| | - Rodrigo Jardim
- Laboratório de Biologia Computacional e Sistemas, Instituto Oswaldo Cruz, Fiocruz, Rio de Janeiro, Brazil
| | - Ana Carolina R Guimarães
- Laboratório de Genômica Funcional e Bioinformática, Instituto Oswaldo Cruz, Fiocruz, Rio de Janeiro, Brazil
| | - Antonio B de Miranda
- Laboratório de Biologia Computacional e Sistemas, Instituto Oswaldo Cruz, Fiocruz, Rio de Janeiro, Brazil
| |
Collapse
|
21
|
Davidi D, Longo LM, Jabłońska J, Milo R, Tawfik DS. A Bird’s-Eye View of Enzyme Evolution: Chemical, Physicochemical, and Physiological Considerations. Chem Rev 2018; 118:8786-8797. [DOI: 10.1021/acs.chemrev.8b00039] [Citation(s) in RCA: 68] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
22
|
da Silva RA, Pereira LDM, Silveira MC, Jardim R, de Miranda AB. Mining of potential drug targets through the identification of essential and analogous enzymes in the genomes of pathogens of Glycine max, Zea mays and Solanum lycopersicum. PLoS One 2018; 13:e0197511. [PMID: 29799863 PMCID: PMC5969768 DOI: 10.1371/journal.pone.0197511] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2017] [Accepted: 05/03/2018] [Indexed: 01/19/2023] Open
Abstract
Pesticides are one of the most widely used pest and disease control measures in plant crops and their indiscriminate use poses a direct risk to the health of populations and environment around the world. As a result, there is a great need for the development of new, less toxic molecules to be employed against plant pathogens. In this work, we employed an in silico approach to study the genes coding for enzymes of the genomes of three commercially important plants, soybean (Glycine max), tomato (Solanum lycopersicum) and corn (Zea mays), as well as 15 plant pathogens (4 bacteria and 11 fungi), focusing on revealing a set of essential and non-homologous isofunctional enzymes (NISEs) that could be prioritized as drug targets. By combining sequence and structural data, we obtained an initial set of 568 cases of analogy, of which 97 were validated and further refined, revealing a subset of 29 essential enzymatic activities with a total of 119 different structural forms, most belonging to central metabolic routes, including the carbohydrate metabolism, the metabolism of amino acids, among others. Further, another subset of 26 enzymatic activities possess a tertiary structure specific for the pathogen, not present in plants, men and Apis mellifera, which may be of importance for the development of specific enzymatic inhibitors against plant diseases that are less harmful to humans and the environment.
Collapse
Affiliation(s)
| | | | | | - Rodrigo Jardim
- Instituto Oswaldo Cruz, Fundação Oswaldo Cruz, Rio de Janeiro, Rio de Janeiro, Brazil
| | | |
Collapse
|
23
|
Underground metabolism: network-level perspective and biotechnological potential. Curr Opin Biotechnol 2018; 49:108-114. [DOI: 10.1016/j.copbio.2017.07.015] [Citation(s) in RCA: 36] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2017] [Revised: 07/20/2017] [Accepted: 07/21/2017] [Indexed: 12/18/2022]
|
24
|
Lee J, Konc J, Janežič D, Brooks BR. Global organization of a binding site network gives insight into evolution and structure-function relationships of proteins. Sci Rep 2017; 7:11652. [PMID: 28912495 PMCID: PMC5599562 DOI: 10.1038/s41598-017-10412-z] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2017] [Accepted: 08/07/2017] [Indexed: 01/06/2023] Open
Abstract
The global organization of protein binding sites is analyzed by constructing a weighted network of binding sites based on their structural similarities and detecting communities of structurally similar binding sites based on the minimum description length principle. The analysis reveals that there are two central binding site communities that play the roles of the network hubs of smaller peripheral communities. The sizes of communities follow a power-law distribution, which indicates that the binding sites included in larger communities may be older and have been evolutionary structural scaffolds of more recent ones. Structurally similar binding sites in the same community bind to diverse ligands promiscuously and they are also embedded in diverse domain structures. Understanding the general principles of binding site interplay will pave the way for improved drug design and protein design.
Collapse
Affiliation(s)
- Juyong Lee
- Department of Chemistry, Kangwon National University, 1 Kangwondaehak-gil, Chuncheon, 24341, Republic of Korea. .,Laboratory of Computational Biology, National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, Maryland, 20892, United States.
| | - Janez Konc
- Faculty of Mathematics, Natural Sciences and Information Technologies, University of Primorska, Glagoljaška 8, SI-6000, Koper, Slovenia.,National Institute of Chemistry, Hajdrihova 19, SI-1000, Ljubljana, Slovenia
| | - Dušanka Janežič
- Faculty of Mathematics, Natural Sciences and Information Technologies, University of Primorska, Glagoljaška 8, SI-6000, Koper, Slovenia
| | - Bernard R Brooks
- Laboratory of Computational Biology, National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, Maryland, 20892, United States
| |
Collapse
|
25
|
Yadav PK, Antonyraj CB, Basheer Ahamed SI, Srinivas S. Understanding Russell's viper venom factor V activator's substrate specificity by surface plasmon resonance and in-silico studies. PLoS One 2017; 12:e0181216. [PMID: 28732041 PMCID: PMC5521794 DOI: 10.1371/journal.pone.0181216] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2017] [Accepted: 06/28/2017] [Indexed: 11/18/2022] Open
Abstract
Blood coagulation factor V (FV) is activated either by Factor X or thrombin, cleaving at three different sites viz., Site I (Arg709-Ser710), site II (Arg1018-Thr1019), and site III (Arg1545-Ser1546). Russell's viper venom factor V activator (RVV-V) is a thrombin-like serine proteinase that activates FV with selective, single cleavage at site III. A long lasting effort is being pending in understanding the 'selective' binding specificity of the RVV-V towards site III. Here, we present the binding kinetic study of RVV-V with two designed peptides corresponding to the regions from site I (Gln699-Asn713) and site II (1008Lys-Pro1022), respectively, that include 15 amino acids. Our investigation for justifying the binding efficacy and kinetics of peptides includes SPR method, protein-peptide docking, molecular dynamics simulation, and principal component analysis (PCA). Surprisingly, the SPR experiment disclosed that the Peptide II showed a lower binding affinity with KD of 2.775 mM while the Peptide I showed none. Docking and simulation of both the peptides with RVV-V engaged either rooted or shallow binding for Peptide II and Peptide I respectively. The peptide binding resulted in global conformational changes in the native fold of RVV-V, whereas the similar studies for thrombin failed to make major changes in the native fold. In support, the PCA analysis for RVV-V showed the dislocation of catalytic triad upon binding both the peptides. Hence, RVV-V, a serine protease, is incompetent in cleaving these two sites. This study suggests a transition in RVV-V from the native rigid to the distorted flexible structure and paves a way to design a new peptide substrate/inhibitor.
Collapse
Affiliation(s)
- Pradeep K. Yadav
- Centre for Bioinformatics, Pondicherry University, Pondicherry, India
| | | | | | - Sistla Srinivas
- GE Healthcare Life Sciences, John F Welch Technology Centre, EPIP, Bengaluru, India
| |
Collapse
|
26
|
Piergiorge RM, de Miranda AB, Guimarães AC, Catanho M. Functional Analogy in Human Metabolism: Enzymes with Different Biological Roles or Functional Redundancy? Genome Biol Evol 2017; 9:1624-1636. [PMID: 28854631 PMCID: PMC5737724 DOI: 10.1093/gbe/evx119] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/04/2017] [Indexed: 12/12/2022] Open
Abstract
Since enzymes catalyze almost all chemical reactions that occur in living organisms, it is crucial that genes encoding such activities are correctly identified and functionally characterized. Several studies suggest that the fraction of enzymatic activities in which multiple events of independent origin have taken place during evolution is substantial. However, this topic is still poorly explored, and a comprehensive investigation of the occurrence, distribution, and implications of these events has not been done so far. Fundamental questions, such as how analogous enzymes originate, why so many events of independent origin have apparently occurred during evolution, and what are the reasons for the coexistence in the same organism of distinct enzymatic forms catalyzing the same reaction, remain unanswered. Also, several isofunctional enzymes are still not recognized as nonhomologous, even with substantial evidence indicating different evolutionary histories. In this work, we begin to investigate the biological significance of the cooccurrence of nonhomologous isofunctional enzymes in human metabolism, characterizing functional analogous enzymes identified in metabolic pathways annotated in the human genome. Our hypothesis is that the coexistence of multiple enzymatic forms might not be interpreted as functional redundancy. Instead, these enzymatic forms may be implicated in distinct (and probably relevant) biological roles.
Collapse
Affiliation(s)
- Rafael Mina Piergiorge
- Laboratório de Genômica Funcional e Bioinformática, Fiocruz, Instituto Oswaldo Cruz, Manguinhos, Rio de Janeiro, Brazil
| | - Antonio Basílio de Miranda
- Laboratório de Biologia Computacional e Sistemas, Fiocruz, Instituto Oswaldo Cruz, Manguinhos, Rio de Janeiro, Brazil
| | - Ana Carolina Guimarães
- Laboratório de Genômica Funcional e Bioinformática, Fiocruz, Instituto Oswaldo Cruz, Manguinhos, Rio de Janeiro, Brazil
| | - Marcos Catanho
- Laboratório de Genômica Funcional e Bioinformática, Fiocruz, Instituto Oswaldo Cruz, Manguinhos, Rio de Janeiro, Brazil
| |
Collapse
|
27
|
Mudgal R, Srinivasan N, Chandra N. Resolving protein structure-function-binding site relationships from a binding site similarity network perspective. Proteins 2017; 85:1319-1335. [PMID: 28342236 DOI: 10.1002/prot.25293] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2016] [Revised: 03/18/2017] [Accepted: 03/20/2017] [Indexed: 11/05/2022]
Abstract
Functional annotation is seldom straightforward with complexities arising due to functional divergence in protein families or functional convergence between non-homologous protein families, leading to mis-annotations. An enzyme may contain multiple domains and not all domains may be involved in a given function, adding to the complexity in function annotation. To address this, we use binding site information from bound cognate ligands and catalytic residues, since it can help in resolving fold-function relationships at a finer level and with higher confidence. A comprehensive database of 2,020 fold-function-binding site relationships has been systematically generated. A network-based approach is employed to capture the complexity in these relationships, from which different types of associations are deciphered, that identify versatile protein folds performing diverse functions, same function associated with multiple folds and one-to-one relationships. Binding site similarity networks integrated with fold, function, and ligand similarity information are generated to understand the depth of these relationships. Apart from the observed continuity in the functional site space, network properties of these revealed versatile families with topologically different or dissimilar binding sites and structural families that perform very similar functions. As a case study, subtle changes in the active site of a set of evolutionarily related superfamilies are studied using these networks. Tracing of such similarities in evolutionarily related proteins provide clues into the transition and evolution of protein functions. Insights from this study will be helpful in accurate and reliable functional annotations of uncharacterized proteins, poly-pharmacology, and designing enzymes with new functional capabilities. Proteins 2017; 85:1319-1335. © 2017 Wiley Periodicals, Inc.
Collapse
Affiliation(s)
- Richa Mudgal
- IISc Mathematics Initiative, Indian Institute of Science, Bangalore, Karnataka, 560 012, India
| | | | - Nagasuma Chandra
- Department of Biochemistry, Indian Institute of Science, Bangalore, Karnataka, 560 012, India
| |
Collapse
|
28
|
Road Map to Study Convergent Evolution: A Proposition for Evolutionary Systems Biology Approaches. Evol Biol 2016. [DOI: 10.1007/978-3-319-41324-2_1] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/25/2023]
|
29
|
Chakraborty S, Nascimento R, Zaini PA, Gouran H, Rao BJ, Goulart LR, Dandekar AM. Sequence/structural analysis of xylem proteome emphasizes pathogenesis-related proteins, chitinases and β-1, 3-glucanases as key players in grapevine defense against Xylella fastidiosa. PeerJ 2016; 4:e2007. [PMID: 27257535 PMCID: PMC4888286 DOI: 10.7717/peerj.2007] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2015] [Accepted: 04/13/2016] [Indexed: 11/20/2022] Open
Abstract
Background. Xylella fastidiosa, the causative agent of various plant diseases including Pierce’s disease in the US, and Citrus Variegated Chlorosis in Brazil, remains a continual source of concern and economic losses, especially since almost all commercial varieties are sensitive to this Gammaproteobacteria. Differential expression of proteins in infected tissue is an established methodology to identify key elements involved in plant defense pathways. Methods. In the current work, we developed a methodology named CHURNER that emphasizes relevant protein functions from proteomic data, based on identification of proteins with similar structures that do not necessarily have sequence homology. Such clustering emphasizes protein functions which have multiple copies that are up/down-regulated, and highlights similar proteins which are differentially regulated. As a working example we present proteomic data enumerating differentially expressed proteins in xylem sap from grapevines that were infected with X. fastidiosa. Results. Analysis of this data by CHURNER highlighted pathogenesis related PR-1 proteins, reinforcing this as the foremost protein function in xylem sap involved in the grapevine defense response to X. fastidiosa. β-1, 3-glucanase, which has both anti-microbial and anti-fungal activities, is also up-regulated. Simultaneously, chitinases are found to be both up and down-regulated by CHURNER, and thus the net gain of this protein function loses its significance in the defense response. Discussion. We demonstrate how structural data can be incorporated in the pipeline of proteomic data analysis prior to making inferences on the importance of individual proteins to plant defense mechanisms. We expect CHURNER to be applicable to any proteomic data set.
Collapse
Affiliation(s)
- Sandeep Chakraborty
- Department of Plant Sciences, University of California, Davis (UC Davis) , CA , United States of America
| | - Rafael Nascimento
- Department of Plant Sciences, University of California, Davis (UC Davis), CA, United States of America; Institute of Genetics and Biochemistry, Federal University of Uberlândia, Campus Umuarama, Uberlândia Minas Gerais, Brazil
| | - Paulo A Zaini
- Institute of Genetics and Biochemistry, Federal University of Uberlândia, Campus Umuarama , Uberlândia Minas Gerais , Brazil
| | - Hossein Gouran
- Department of Plant Sciences, University of California, Davis (UC Davis) , CA , United States of America
| | - Basuthkar J Rao
- Department of Biological Sciences, Tata Institute of Fundamental Research , Mumbai, Maharashtra , India
| | - Luiz R Goulart
- Institute of Genetics and Biochemistry, Federal University of Uberlândia, Campus Umuarama, Uberlândia Minas Gerais, Brazil; Department of Medical Microbiology and Immunology, University of California, Davis (UC Davis), CA, United States of America
| | - Abhaya M Dandekar
- Department of Plant Sciences, University of California, Davis (UC Davis) , CA , United States of America
| |
Collapse
|
30
|
Laurino P, Tóth-Petróczy Á, Meana-Pañeda R, Lin W, Truhlar DG, Tawfik DS. An Ancient Fingerprint Indicates the Common Ancestry of Rossmann-Fold Enzymes Utilizing Different Ribose-Based Cofactors. PLoS Biol 2016; 14:e1002396. [PMID: 26938925 PMCID: PMC4777477 DOI: 10.1371/journal.pbio.1002396] [Citation(s) in RCA: 60] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2015] [Accepted: 01/29/2016] [Indexed: 01/30/2023] Open
Abstract
Nucleoside-based cofactors are presumed to have preceded proteins. The Rossmann fold is one of the most ancient and functionally diverse protein folds, and most Rossmann enzymes utilize nucleoside-based cofactors. We analyzed an omnipresent Rossmann ribose-binding interaction: a carboxylate side chain at the tip of the second β-strand (β2-Asp/Glu). We identified a canonical motif, defined by the β2-topology and unique geometry. The latter relates to the interaction being bidentate (both ribose hydroxyls interacting with the carboxylate oxygens), to the angle between the carboxylate and the ribose, and to the ribose's ring configuration. We found that this canonical motif exhibits hallmarks of divergence rather than convergence. It is uniquely found in Rossmann enzymes that use different cofactors, primarily SAM (S-adenosyl methionine), NAD (nicotinamide adenine dinucleotide), and FAD (flavin adenine dinucleotide). Ribose-carboxylate bidentate interactions in other folds are not only rare but also have a different topology and geometry. We further show that the canonical geometry is not dictated by a physical constraint--geometries found in noncanonical interactions have similar calculated bond energies. Overall, these data indicate the divergence of several major Rossmann-fold enzyme classes, with different cofactors and catalytic chemistries, from a common pre-LUCA (last universal common ancestor) ancestor that possessed the β2-Asp/Glu motif.
Collapse
Affiliation(s)
- Paola Laurino
- Department of Biological Chemistry, Weizmann Institute of Science, Rehovot, Israel
| | - Ágnes Tóth-Petróczy
- Department of Biological Chemistry, Weizmann Institute of Science, Rehovot, Israel
| | - Rubén Meana-Pañeda
- Department of Chemistry, Chemical Theory Center, and Supercomputing Institute, University of Minnesota, Minneapolis, Minnesota, United States of America
| | - Wei Lin
- Department of Chemistry, Chemical Theory Center, and Supercomputing Institute, University of Minnesota, Minneapolis, Minnesota, United States of America
| | - Donald G. Truhlar
- Department of Chemistry, Chemical Theory Center, and Supercomputing Institute, University of Minnesota, Minneapolis, Minnesota, United States of America
| | - Dan S. Tawfik
- Department of Biological Chemistry, Weizmann Institute of Science, Rehovot, Israel
- * E-mail:
| |
Collapse
|
31
|
Chakraborty S, Nascimento R, Zaini PA, Gouran H, Rao BJ, Goulart LR, Dandekar AM. Sequence/structural analysis of xylem proteome emphasizes pathogenesis-related proteins, chitinases and β-1, 3-glucanases as key players in grapevine defense against Xylella fastidiosa. PeerJ 2016. [PMID: 27257535 DOI: 10.7717/peerj2007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/12/2023] Open
Abstract
Background. Xylella fastidiosa, the causative agent of various plant diseases including Pierce's disease in the US, and Citrus Variegated Chlorosis in Brazil, remains a continual source of concern and economic losses, especially since almost all commercial varieties are sensitive to this Gammaproteobacteria. Differential expression of proteins in infected tissue is an established methodology to identify key elements involved in plant defense pathways. Methods. In the current work, we developed a methodology named CHURNER that emphasizes relevant protein functions from proteomic data, based on identification of proteins with similar structures that do not necessarily have sequence homology. Such clustering emphasizes protein functions which have multiple copies that are up/down-regulated, and highlights similar proteins which are differentially regulated. As a working example we present proteomic data enumerating differentially expressed proteins in xylem sap from grapevines that were infected with X. fastidiosa. Results. Analysis of this data by CHURNER highlighted pathogenesis related PR-1 proteins, reinforcing this as the foremost protein function in xylem sap involved in the grapevine defense response to X. fastidiosa. β-1, 3-glucanase, which has both anti-microbial and anti-fungal activities, is also up-regulated. Simultaneously, chitinases are found to be both up and down-regulated by CHURNER, and thus the net gain of this protein function loses its significance in the defense response. Discussion. We demonstrate how structural data can be incorporated in the pipeline of proteomic data analysis prior to making inferences on the importance of individual proteins to plant defense mechanisms. We expect CHURNER to be applicable to any proteomic data set.
Collapse
Affiliation(s)
- Sandeep Chakraborty
- Department of Plant Sciences, University of California, Davis (UC Davis) , CA , United States of America
| | - Rafael Nascimento
- Department of Plant Sciences, University of California, Davis (UC Davis), CA, United States of America; Institute of Genetics and Biochemistry, Federal University of Uberlândia, Campus Umuarama, Uberlândia Minas Gerais, Brazil
| | - Paulo A Zaini
- Institute of Genetics and Biochemistry, Federal University of Uberlândia, Campus Umuarama , Uberlândia Minas Gerais , Brazil
| | - Hossein Gouran
- Department of Plant Sciences, University of California, Davis (UC Davis) , CA , United States of America
| | - Basuthkar J Rao
- Department of Biological Sciences, Tata Institute of Fundamental Research , Mumbai, Maharashtra , India
| | - Luiz R Goulart
- Institute of Genetics and Biochemistry, Federal University of Uberlândia, Campus Umuarama, Uberlândia Minas Gerais, Brazil; Department of Medical Microbiology and Immunology, University of California, Davis (UC Davis), CA, United States of America
| | - Abhaya M Dandekar
- Department of Plant Sciences, University of California, Davis (UC Davis) , CA , United States of America
| |
Collapse
|
32
|
Žváček C, Friedrichs G, Heizinger L, Merkl R. An assessment of catalytic residue 3D ensembles for the prediction of enzyme function. BMC Bioinformatics 2015; 16:359. [PMID: 26538500 PMCID: PMC4634577 DOI: 10.1186/s12859-015-0807-6] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2015] [Accepted: 10/29/2015] [Indexed: 12/03/2022] Open
Abstract
Background The central element of each enzyme is the catalytic site, which commonly catalyzes a single biochemical reaction with high specificity. It was unclear to us how often sites that catalyze the same or highly similar reactions evolved on different, i. e. non-homologous protein folds and how similar their 3D poses are. Both similarities are key criteria for assessing the usability of pose comparison for function prediction. Results We have analyzed the SCOP database on the superfamily level in order to estimate the number of non-homologous enzymes possessing the same function according to their EC number. 89 % of the 873 substrate-specific functions (four digit EC number) assigned to mono-functional, single-domain enzymes were only found in one superfamily. For a reaction-specific grouping (three digit EC number), this value dropped to 35 %, indicating that in approximately 65 % of all enzymes the same function evolved in two or more non-homologous proteins. For these isofunctional enzymes, structural similarity of the catalytic sites may help to predict function, because neither high sequence similarity nor identical folds are required for a comparison. To assess the specificity of catalytic 3D poses, we compiled the redundancy-free set ENZ_SITES, which comprises 695 sites, whose composition and function are well-defined. We compared their poses with the help of the program Superpose3D and determined classification performance. If the sites were from different superfamilies, the number of true and false positive predictions was similarly high, both for a coarse and a detailed grouping of enzyme function. Moreover, classification performance did not improve drastically, if we additionally used homologous sites to predict function. Conclusions For a large number of enzymatic functions, dissimilar sites evolved that catalyze the same reaction and it is the individual substrate that determines the arrangement of the catalytic site and its local environment. These substrate-specific requirements turn the comparison of catalytic residues into a weak classifier for the prediction of enzyme function. Electronic supplementary material The online version of this article (doi:10.1186/s12859-015-0807-6) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Clemens Žváček
- Faculty of Mathematics and Computer Science, University of Hagen, D-58084, Hagen, Germany.
| | - Gerald Friedrichs
- Faculty of Mathematics and Computer Science, University of Hagen, D-58084, Hagen, Germany.
| | - Leonhard Heizinger
- Institute of Biophysics and Physical Biochemistry, University of Regensburg, D-93040, Regensburg, Germany.
| | - Rainer Merkl
- Institute of Biophysics and Physical Biochemistry, University of Regensburg, D-93040, Regensburg, Germany.
| |
Collapse
|
33
|
Bartolowits M, Davisson VJ. Considerations of Protein Subpockets in Fragment-Based Drug Design. Chem Biol Drug Des 2015; 87:5-20. [PMID: 26307335 DOI: 10.1111/cbdd.12631] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
While the fragment-based drug design approach continues to gain importance, gaps in the tools and methods available in the identification and accurate utilization of protein subpockets have limited the scope. The importance of these features of small molecule-protein recognition is highlighted with several examples. A generalized solution for the identification of subpockets and corresponding chemical fragments remains elusive, but there are numerous advancements in methods that can be used in combination to address subpockets. Finally, additional examples of approaches that consider the relative importance of small-molecule co-dependence of protein conformations are highlighted to emphasize an increased significance of subpockets, especially at protein interfaces.
Collapse
Affiliation(s)
- Matthew Bartolowits
- Department of Medicinal Chemistry and Molecular Pharmacology, College of Pharmacy, Purdue University, 575 Stadium Mall Dr., West Lafayette, IN, 47907, USA
| | - V Jo Davisson
- Department of Medicinal Chemistry and Molecular Pharmacology, College of Pharmacy, Purdue University, 575 Stadium Mall Dr., West Lafayette, IN, 47907, USA
| |
Collapse
|
34
|
Sudan AK, Vakhlu J. Isolation and in silico characterization of novel esterase gene with β-lactamase fold isolated from metagenome of north western Himalayas. 3 Biotech 2015; 5:553-559. [PMID: 28324560 PMCID: PMC4522730 DOI: 10.1007/s13205-014-0254-5] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2014] [Accepted: 09/17/2014] [Indexed: 11/30/2022] Open
Abstract
An esterase-producing clone Aph2 was isolated from the Apharwat soil metagenomic library, a mountain peak in NW Himalayas. ORF 2 (Est Ac) of clone Aph2 corresponds to 271 aa protein and showed 26 % sequence similarity to carboxylesterase gene of Synechococcus sp. JA-2-3B. Est Ac contains nucleophilic Ser in S68-X-X-K71 motif of β-lactamases with Tyr Y103. The conserved sequences are common with family VIII carboxylesterase and class C β-lactamase sequences. Phylogenetic analysis revealed that Est Ac sequence is closely related to esterase than to β-lactamases. In silico 3D protein structure of Est Ac was generated using MODELLER software (9.10 version). Model was generated on the basis of carboxylesterase template (PDB:1CI8) of Est B (Burkholderia gladioli) and the stereochemical parameters of the model generated were satisfactory. Docking with diisopropyl-fluorophosphate confirmed catalytic activity of Ser68 present in S-X-X-K motif.
Collapse
Affiliation(s)
| | - Jyoti Vakhlu
- School of Biotechnology, University of Jammu, Jammu, 180006 India
| |
Collapse
|
35
|
Martínez Cuesta S, Rahman SA, Furnham N, Thornton JM. The Classification and Evolution of Enzyme Function. Biophys J 2015; 109:1082-6. [PMID: 25986631 DOI: 10.1016/j.bpj.2015.04.020] [Citation(s) in RCA: 60] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2015] [Revised: 04/16/2015] [Accepted: 04/17/2015] [Indexed: 11/30/2022] Open
Abstract
Enzymes are the proteins responsible for the catalysis of life. Enzymes sharing a common ancestor as defined by sequence and structure similarity are grouped into families and superfamilies. The molecular function of enzymes is defined as their ability to catalyze biochemical reactions; it is manually classified by the Enzyme Commission and robust approaches to quantitatively compare catalytic reactions are just beginning to appear. Here, we present an overview of studies at the interface of the evolution and function of enzymes.
Collapse
Affiliation(s)
- Sergio Martínez Cuesta
- European Molecular Biology Laboratory, European Bioinformatics Institute EMBL-EBI, Wellcome Trust Genome Campus, Hinxton, Cambridge, United Kingdom
| | - Syed Asad Rahman
- European Molecular Biology Laboratory, European Bioinformatics Institute EMBL-EBI, Wellcome Trust Genome Campus, Hinxton, Cambridge, United Kingdom
| | - Nicholas Furnham
- Department of Pathogen Molecular Biology, London School of Hygiene & Tropical Medicine, London, United Kingdom
| | - Janet M Thornton
- European Molecular Biology Laboratory, European Bioinformatics Institute EMBL-EBI, Wellcome Trust Genome Campus, Hinxton, Cambridge, United Kingdom.
| |
Collapse
|
36
|
Currin A, Swainston N, Day PJ, Kell DB. Synthetic biology for the directed evolution of protein biocatalysts: navigating sequence space intelligently. Chem Soc Rev 2015; 44:1172-239. [PMID: 25503938 PMCID: PMC4349129 DOI: 10.1039/c4cs00351a] [Citation(s) in RCA: 251] [Impact Index Per Article: 27.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2014] [Indexed: 12/21/2022]
Abstract
The amino acid sequence of a protein affects both its structure and its function. Thus, the ability to modify the sequence, and hence the structure and activity, of individual proteins in a systematic way, opens up many opportunities, both scientifically and (as we focus on here) for exploitation in biocatalysis. Modern methods of synthetic biology, whereby increasingly large sequences of DNA can be synthesised de novo, allow an unprecedented ability to engineer proteins with novel functions. However, the number of possible proteins is far too large to test individually, so we need means for navigating the 'search space' of possible protein sequences efficiently and reliably in order to find desirable activities and other properties. Enzymologists distinguish binding (Kd) and catalytic (kcat) steps. In a similar way, judicious strategies have blended design (for binding, specificity and active site modelling) with the more empirical methods of classical directed evolution (DE) for improving kcat (where natural evolution rarely seeks the highest values), especially with regard to residues distant from the active site and where the functional linkages underpinning enzyme dynamics are both unknown and hard to predict. Epistasis (where the 'best' amino acid at one site depends on that or those at others) is a notable feature of directed evolution. The aim of this review is to highlight some of the approaches that are being developed to allow us to use directed evolution to improve enzyme properties, often dramatically. We note that directed evolution differs in a number of ways from natural evolution, including in particular the available mechanisms and the likely selection pressures. Thus, we stress the opportunities afforded by techniques that enable one to map sequence to (structure and) activity in silico, as an effective means of modelling and exploring protein landscapes. Because known landscapes may be assessed and reasoned about as a whole, simultaneously, this offers opportunities for protein improvement not readily available to natural evolution on rapid timescales. Intelligent landscape navigation, informed by sequence-activity relationships and coupled to the emerging methods of synthetic biology, offers scope for the development of novel biocatalysts that are both highly active and robust.
Collapse
Affiliation(s)
- Andrew Currin
- Manchester Institute of Biotechnology , The University of Manchester , 131, Princess St , Manchester M1 7DN , UK . ; http://dbkgroup.org/; @dbkell ; Tel: +44 (0)161 306 4492
- School of Chemistry , The University of Manchester , Manchester M13 9PL , UK
- Centre for Synthetic Biology of Fine and Speciality Chemicals (SYNBIOCHEM) , The University of Manchester , 131, Princess St , Manchester M1 7DN , UK
| | - Neil Swainston
- Manchester Institute of Biotechnology , The University of Manchester , 131, Princess St , Manchester M1 7DN , UK . ; http://dbkgroup.org/; @dbkell ; Tel: +44 (0)161 306 4492
- Centre for Synthetic Biology of Fine and Speciality Chemicals (SYNBIOCHEM) , The University of Manchester , 131, Princess St , Manchester M1 7DN , UK
- School of Computer Science , The University of Manchester , Manchester M13 9PL , UK
| | - Philip J. Day
- Manchester Institute of Biotechnology , The University of Manchester , 131, Princess St , Manchester M1 7DN , UK . ; http://dbkgroup.org/; @dbkell ; Tel: +44 (0)161 306 4492
- Centre for Synthetic Biology of Fine and Speciality Chemicals (SYNBIOCHEM) , The University of Manchester , 131, Princess St , Manchester M1 7DN , UK
- Faculty of Medical and Human Sciences , The University of Manchester , Manchester M13 9PT , UK
| | - Douglas B. Kell
- Manchester Institute of Biotechnology , The University of Manchester , 131, Princess St , Manchester M1 7DN , UK . ; http://dbkgroup.org/; @dbkell ; Tel: +44 (0)161 306 4492
- School of Chemistry , The University of Manchester , Manchester M13 9PL , UK
- Centre for Synthetic Biology of Fine and Speciality Chemicals (SYNBIOCHEM) , The University of Manchester , 131, Princess St , Manchester M1 7DN , UK
| |
Collapse
|
37
|
Roche DB, Brüls T. The enzymatic nature of an anonymous protein sequence cannot reliably be inferred from superfamily level structural information alone. Protein Sci 2015; 24:643-50. [PMID: 25559918 DOI: 10.1002/pro.2635] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2014] [Revised: 12/13/2014] [Accepted: 12/29/2014] [Indexed: 11/12/2022]
Abstract
As the largest fraction of any proteome does not carry out enzymatic functions, and in order to leverage 3D structural data for the annotation of increasingly higher volumes of sequence data, we wanted to assess the strength of the link between coarse grained structural data (i.e., homologous superfamily level) and the enzymatic versus non-enzymatic nature of protein sequences. To probe this relationship, we took advantage of 41 phylogenetically diverse (encompassing 11 distinct phyla) genomes recently sequenced within the GEBA initiative, for which we integrated structural information, as defined by CATH, with enzyme level information, as defined by Enzyme Commission (EC) numbers. This analysis revealed that only a very small fraction (about 1%) of domain sequences occurring in the analyzed genomes was found to be associated with homologous superfamilies strongly indicative of enzymatic function. Resorting to less stringent criteria to define enzyme versus non-enzyme biased structural classes or excluding highly prevalent folds from the analysis had only modest effect on this proportion. Thus, the low genomic coverage by structurally anchored protein domains strongly associated to catalytic activities indicates that, on its own, the power of coarse grained structural information to infer the general property of being an enzyme is rather limited.
Collapse
Affiliation(s)
- Daniel Barry Roche
- Laboratoire de génomique et biochimie du métabolisme, Genoscope, Institut de Génomique, Commissariat à l'Energie Atomique et aux Energies Alternatives, Evry, Essonne, 91057, France; UMR 8030 - Génomique Métabolique, Centre National de la Recherche Scientifique, Evry, Essonne, 91057, France; Départment de Biologie, Université d'Evry-Val-d'Essonne, Evry, Essonne, 91000, France; PRES UniverSud Paris, Saint-Aubin, Essonne, 91190, France
| | | |
Collapse
|
38
|
Abstract
Ligand binding is required for many proteins to function properly. A large number of bioinformatics tools have been developed to predict ligand binding sites as a first step in understanding a protein's function or to facilitate docking computations in virtual screening based drug design. The prediction usually requires only the three-dimensional structure (experimentally determined or computationally modeled) of the target protein to be searched for ligand binding site(s), and Web servers have been built, allowing the free and simple use of prediction tools. In this chapter, we review the underlying concepts of the methods used by various tools, and discuss their different features and the related issues of ligand binding site prediction. Some cautionary notes about the use of these tools are also provided.
Collapse
Affiliation(s)
- Zhong-Ru Xie
- Institute of Biomedical Sciences, Academia Sinica, 128 Academia Road, Section 2, Nankang, Taipei, 115, Taiwan
| | | |
Collapse
|
39
|
Ortegon P, Poot-Hernández AC, Perez-Rueda E, Rodriguez-Vazquez K. Comparison of Metabolic Pathways in Escherichia coli by Using Genetic Algorithms. Comput Struct Biotechnol J 2015; 13:277-85. [PMID: 25973143 PMCID: PMC4423528 DOI: 10.1016/j.csbj.2015.04.001] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2015] [Revised: 03/31/2015] [Accepted: 04/01/2015] [Indexed: 11/21/2022] Open
Abstract
In order to understand how cellular metabolism has taken its modern form, the conservation and variations between metabolic pathways were evaluated by using a genetic algorithm (GA). The GA approach considered information on the complete metabolism of the bacterium Escherichia coli K-12, as deposited in the KEGG database, and the enzymes belonging to a particular pathway were transformed into enzymatic step sequences by using the breadth-first search algorithm. These sequences represent contiguous enzymes linked to each other, based on their catalytic activities as they are encoded in the Enzyme Commission numbers. In a posterior step, these sequences were compared using a GA in an all-against-all (pairwise comparisons) approach. Individual reactions were chosen based on their measure of fitness to act as parents of offspring, which constitute the new generation. The sequences compared were used to construct a similarity matrix (of fitness values) that was then considered to be clustered by using a k-medoids algorithm. A total of 34 clusters of conserved reactions were obtained, and their sequences were finally aligned with a multiple-sequence alignment GA optimized to align all the reaction sequences included in each group or cluster. From these comparisons, maps associated with the metabolism of similar compounds also contained similar enzymatic step sequences, reinforcing the Patchwork Model for the evolution of metabolism in E. coli K-12, an observation that can be expanded to other organisms, for which there is metabolism information. Finally, our mapping of these reactions is discussed, with illustrations from a particular case.
Collapse
Affiliation(s)
- Patricia Ortegon
- Departamento de Ingeniería de Sistemas Computacionales y Automatización, IIMAS, Universidad Nacional Autónoma de México, Mexico
| | - Augusto C. Poot-Hernández
- Departamento de Ingeniería de Sistemas Computacionales y Automatización, IIMAS, Universidad Nacional Autónoma de México, Mexico
- Departamento de Ingeniería Celular y Biocatálisis, Instituto de Biotecnología, Universidad Nacional Autónoma de México, Cuernavaca, Morelos, Mexico
| | - Ernesto Perez-Rueda
- Departamento de Ingeniería Celular y Biocatálisis, Instituto de Biotecnología, Universidad Nacional Autónoma de México, Cuernavaca, Morelos, Mexico
- Unidad Multidisciplinaria de Docencia e Investigación, Sisal Facultad de Ciencias, Sisal, Yucatán, UNAM, Mexico
- Correspondence to: E. Perez-Rueda, Departamento de Ingeniería Celular y Biocatálisis, Instituto de Biotecnología, Universidad Nacional Autónoma de México, Cuernavaca, Morelos, Mexico.
| | - Katya Rodriguez-Vazquez
- Departamento de Ingeniería de Sistemas Computacionales y Automatización, IIMAS, Universidad Nacional Autónoma de México, Mexico
- Correspondence to: K. Rodriguez-Vazquez, Departamento de Ingeniería de Sistemas Computacionales y Automatización, IIMAS, Universidad Nacional Autónoma de México, Mexico.
| |
Collapse
|
40
|
Alderson RG, Barker D, Mitchell JBO. One origin for metallo-β-lactamase activity, or two? An investigation assessing a diverse set of reconstructed ancestral sequences based on a sample of phylogenetic trees. J Mol Evol 2014; 79:117-29. [PMID: 25185655 PMCID: PMC4185109 DOI: 10.1007/s00239-014-9639-7] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2014] [Accepted: 08/11/2014] [Indexed: 01/04/2023]
Abstract
Bacteria use metallo-β-lactamase enzymes to hydrolyse lactam rings found in many antibiotics, rendering them ineffective. Metallo-β-lactamase activity is thought to be polyphyletic, having arisen on more than one occasion within a single functionally diverse homologous superfamily. Since discovery of multiple origins of enzymatic activity conferring antibiotic resistance has broad implications for the continued clinical use of antibiotics, we test the hypothesis of polyphyly further; if lactamase function has arisen twice independently, the most recent common ancestor (MRCA) is not expected to possess lactam-hydrolysing activity. Two major problems present themselves. Firstly, even with a perfectly known phylogeny, ancestral sequence reconstruction is error prone. Secondly, the phylogeny is not known, and in fact reconstructing a single, unambiguous phylogeny for the superfamily has proven impossible. To obtain a more statistical view of the strength of evidence for or against MRCA lactamase function, we reconstructed a sample of 98 MRCAs of the metallo-β-lactamases, each based on a different tree in a bootstrap sample of reconstructed phylogenies. InterPro sequence signatures and homology modelling were then used to assess our sample of MRCAs for lactamase functionality. Only 5 % of these models conform to our criteria for metallo-β-lactamase functionality, suggesting that the ancestor was unlikely to have been a metallo-β-lactamase. On the other hand, given that ancestral proteins may have had metallo-β-lactamase functionality with variation in sequence and structural properties compared with extant enzymes, our criteria are conservative, estimating a lower bound of evidence for metallo-β-lactamase functionality but not an upper bound.
Collapse
Affiliation(s)
- Rosanna G. Alderson
- Biomedical Sciences Research Complex and EaStCHEM School of Chemistry, Purdie Building, University of St Andrews, North Haugh, St Andrews, KY16 9ST Scotland, UK
| | - Daniel Barker
- Sir Harold Mitchell Building, School of Biology, University of St Andrews, St Andrews, KY16 9TH Scotland, UK
| | - John B. O. Mitchell
- Biomedical Sciences Research Complex and EaStCHEM School of Chemistry, Purdie Building, University of St Andrews, North Haugh, St Andrews, KY16 9ST Scotland, UK
| |
Collapse
|
41
|
Martinez Cuesta S, Furnham N, Rahman SA, Sillitoe I, Thornton JM. The evolution of enzyme function in the isomerases. Curr Opin Struct Biol 2014; 26:121-30. [PMID: 25000289 PMCID: PMC4139412 DOI: 10.1016/j.sbi.2014.06.002] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2014] [Revised: 06/02/2014] [Accepted: 06/10/2014] [Indexed: 01/14/2023]
Abstract
The advent of computational approaches to measure functional similarity between enzymes adds a new dimension to existing evolutionary studies based on sequence and structure. This paper reviews research efforts aiming to understand the evolution of enzyme function in superfamilies, presenting a novel strategy to provide an overview of the evolution of enzymes belonging to an individual EC class, using the isomerases as an exemplar.
Collapse
Affiliation(s)
- Sergio Martinez Cuesta
- European Molecular Biology Laboratory, European Bioinformatics Institute EMBL-EBI, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, United Kingdom.
| | - Nicholas Furnham
- Department of Pathogen Molecular Biology, London School of Hygiene & Tropical Medicine, Keppel Street, London, WC1E 7HT, United Kingdom
| | - Syed Asad Rahman
- European Molecular Biology Laboratory, European Bioinformatics Institute EMBL-EBI, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, United Kingdom
| | - Ian Sillitoe
- Institute of Structural and Molecular Biology, Division of Biosciences, University College London, Gower Street, London, WC1E 6BT, United Kingdom
| | - Janet M Thornton
- European Molecular Biology Laboratory, European Bioinformatics Institute EMBL-EBI, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, United Kingdom.
| |
Collapse
|
42
|
Alderson RG, De Ferrari L, Mavridis L, McDonagh JL, Mitchell JBO, Nath N. Enzyme informatics. Curr Top Med Chem 2014; 12:1911-23. [PMID: 23116471 DOI: 10.2174/156802612804547353] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2012] [Revised: 09/12/2012] [Accepted: 09/15/2012] [Indexed: 12/18/2022]
Abstract
Over the last 50 years, sequencing, structural biology and bioinformatics have completely revolutionised biomolecular science, with millions of sequences and tens of thousands of three dimensional structures becoming available. The bioinformatics of enzymes is well served by, mostly free, online databases. BRENDA describes the chemistry, substrate specificity, kinetics, preparation and biological sources of enzymes, while KEGG is valuable for understanding enzymes and metabolic pathways. EzCatDB, SFLD and MACiE are key repositories for data on the chemical mechanisms by which enzymes operate. At the current rate of genome sequencing and manual annotation, human curation will never finish the functional annotation of the ever-expanding list of known enzymes. Hence there is an increasing need for automated annotation, though it is not yet widespread for enzyme data. In contrast, functional ontologies such as the Gene Ontology already profit from automation. Despite our growing understanding of enzyme structure and dynamics, we are only beginning to be able to design novel enzymes. One can now begin to trace the functional evolution of enzymes using phylogenetics. The ability of enzymes to perform secondary functions, albeit relatively inefficiently, gives clues as to how enzyme function evolves. Substrate promiscuity in enzymes is one example of imperfect specificity in protein-ligand interactions. Similarly, most drugs bind to more than one protein target. This may sometimes result in helpful polypharmacology as a drug modulates plural targets, but also often leads to adverse side-effects. Many chemoinformatics approaches can be used to model the interactions between druglike molecules and proteins in silico. We can even use quantum chemical techniques like DFT and QM/MM to compute the structural and energetic course of enzyme catalysed chemical reaction mechanisms, including a full description of bond making and breaking.
Collapse
Affiliation(s)
- Rosanna G Alderson
- Biomedical Sciences Research Complex and EaStCHEM School of Chemistry, Purdie Building, University of St Andrews, North Haugh, St Andrews, Scotland, UK
| | | | | | | | | | | |
Collapse
|
43
|
Sorci L, Brunetti L, Cialabrini L, Mazzola F, Kazanov MD, D'Auria S, Ruggieri S, Raffaelli N. Characterization of bacterial NMN deamidase as a Ser/Lys hydrolase expands diversity of serine amidohydrolases. FEBS Lett 2014; 588:1016-23. [PMID: 24530526 DOI: 10.1016/j.febslet.2014.01.063] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2013] [Revised: 01/23/2014] [Accepted: 01/26/2014] [Indexed: 10/25/2022]
Abstract
NMN deamidase (PncC) is a bacterial enzyme involved in NAD biosynthesis. We have previously demonstrated that PncC is structurally distinct from other known amidohydrolases. Here, we extended PncC characterization by mutating all potential catalytic residues and assessing their individual roles in catalysis through kinetic analyses. Inspection of these residues' spatial arrangement in the active site, allowed us to conclude that PncC is a serine-amidohydrolase, employing a Ser/Lys dyad for catalysis. Analysis of the PncC structure in complex with a modeled NMN substrate supported our conclusion, and enabled us to propose the catalytic mechanism.
Collapse
Affiliation(s)
- Leonardo Sorci
- Department of Clinical Sciences, Polytechnic University of Marche, Ancona, Italy
| | - Lucia Brunetti
- Department of Agricultural, Food and Environmental Sciences, Polytechnic University of Marche, Ancona, Italy
| | - Lucia Cialabrini
- Department of Agricultural, Food and Environmental Sciences, Polytechnic University of Marche, Ancona, Italy
| | - Francesca Mazzola
- Department of Clinical Sciences, Polytechnic University of Marche, Ancona, Italy
| | - Marat D Kazanov
- A.A. Kharkevich Institute for Information Transmission Problems, Russian Academy of Sciences, Moscow, Russia
| | - Sabato D'Auria
- Laboratory for Molecular Sensing, IBP-CNR, Napoli, Italy
| | - Silverio Ruggieri
- Department of Agricultural, Food and Environmental Sciences, Polytechnic University of Marche, Ancona, Italy
| | - Nadia Raffaelli
- Department of Agricultural, Food and Environmental Sciences, Polytechnic University of Marche, Ancona, Italy.
| |
Collapse
|
44
|
Chakraborty S, Rendón-Ramírez A, Ásgeirsson B, Dutta M, Ghosh AS, Oda M, Venkatramani R, Rao BJ, Dandekar AM, Goñi FM. The dipeptidyl peptidase IV inhibitors vildagliptin and K-579 inhibit a phospholipase C: a case of promiscuous scaffolds in proteins. F1000Res 2013. [PMID: 25671081 DOI: 10.12688/f1000research.2-286.v1] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 12/23/2022] Open
Abstract
The long term side effects of any newly introduced drug is a subject of intense research, and often raging controversies. One such example is the dipeptidyl peptidase-IV (DPP4) inhibitor used for treating type 2 diabetes, which is inconclusively implicated in increased susceptibility to acute pancreatitis. Previously, based on a computational analysis of the spatial and electrostatic properties of active site residues, we have demonstrated that phosphoinositide-specific phospholipase C (PI-PLC) from Bacillus cereus is a prolyl peptidase using in vivo experiments. In the current work, we first report the inhibition of the native activity of PI-PLC by two DPP4 inhibitors - vildagliptin (LAF-237) and K-579. While vildagliptin inhibited PI-PLC at micromolar concentrations, K-579 was a potent inhibitor even at nanomolar concentrations. Subsequently, we queried a comprehensive, non-redundant set of 5000 human proteins (50% similarity cutoff) with known structures using serine protease (SPASE) motifs derived from trypsin and DPP4. A pancreatic lipase and a gastric lipase are among the proteins that are identified as proteins having promiscuous SPASE scaffolds that could interact with DPP4 inhibitors. The presence of such scaffolds in human lipases is expected since they share the same catalytic mechanism with PI-PLC. However our methodology also detects other proteins, often with a completely different enzymatic mechanism, that have significantly congruent domains with the SPASE motifs. The reported elevated levels of serum lipase, although contested, could be rationalized by inhibition of lipases reported here. In an effort to further our understanding of the spatial and electrostatic basis of DPP4 inhibitors, we have also done a comprehensive analysis of all 76 known DPP4 structures liganded to inhibitors till date. Also, the methodology presented here can be easily adopted for other drugs, and provide the first line of filtering in the identification of pathways that might be inadvertently affected due to promiscuous scaffolds in proteins.
Collapse
Affiliation(s)
- Sandeep Chakraborty
- Department of Biological Sciences, Tata Institute of Fundamental Research, Mumbai, 400 005, India ; Plant Sciences Department, University of California, Davis, CA, 95616, USA
| | | | - Bjarni Ásgeirsson
- Science Institute, Department of Biochemistry, University of Iceland, IS-107 Reykjavik, Iceland
| | - Mouparna Dutta
- Department of Biotechnology, Indian Institute of Technology Kharagpur, Kharagpur, 721302, India
| | - Anindya S Ghosh
- Department of Biotechnology, Indian Institute of Technology Kharagpur, Kharagpur, 721302, India
| | - Masataka Oda
- Division of Microbiology and Infectious Diseases, Niigata University Graduate School of Medical and Dental Sciences, Niigata, 951-8514, Japan
| | - Ravindra Venkatramani
- Department of Chemical Sciences, Tata Institute of Fundamental Research, Mumbai, 400 005, India
| | - Basuthkar J Rao
- Department of Biological Sciences, Tata Institute of Fundamental Research, Mumbai, 400 005, India
| | - Abhaya M Dandekar
- Plant Sciences Department, University of California, Davis, CA, 95616, USA
| | - Félix M Goñi
- Unidad de Bio, Universidad del Pais Vasco, Bilbao, Spain
| |
Collapse
|
45
|
Chakraborty S, Rendón-Ramírez A, Ásgeirsson B, Dutta M, Ghosh AS, Oda M, Venkatramani R, Rao BJ, Dandekar AM, Goñi FM. The dipeptidyl peptidase IV inhibitors vildagliptin and K-579 inhibit a phospholipase C: a case of promiscuous scaffolds in proteins. F1000Res 2013; 2:286. [PMID: 25671081 PMCID: PMC4309170 DOI: 10.12688/f1000research.2-286.v3] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 01/29/2015] [Indexed: 12/25/2022] Open
Abstract
The long term side effects of any newly introduced drug is a subject of intense research, and often raging controversies. One such example is the dipeptidyl peptidase-IV (DPP4) inhibitor used for treating type 2 diabetes, which is inconclusively implicated in increased susceptibility to acute pancreatitis. Previously, based on a computational analysis of the spatial and electrostatic properties of active site residues, we have demonstrated that phosphoinositide-specific phospholipase C (PI-PLC) from
Bacillus cereus is a prolyl peptidase using
in vivo experiments. In the current work, we first report the inhibition of the native activity of PI-PLC by two DPP4 inhibitors - vildagliptin (LAF-237) and K-579. While vildagliptin inhibited PI-PLC at micromolar concentrations, K-579 was a potent inhibitor even at nanomolar concentrations. Subsequently, we queried a comprehensive, non-redundant set of 5000 human proteins (50% similarity cutoff) with known structures using serine protease (SPASE) motifs derived from trypsin and DPP4. A pancreatic lipase and a gastric lipase are among the proteins that are identified as proteins having promiscuous SPASE scaffolds that could interact with DPP4 inhibitors. The presence of such scaffolds in human lipases is expected since they share the same catalytic mechanism with PI-PLC. However our methodology also detects other proteins, often with a completely different enzymatic mechanism, that have significantly congruent domains with the SPASE motifs. The reported elevated levels of serum lipase, although contested, could be rationalized by inhibition of lipases reported here. In an effort to further our understanding of the spatial and electrostatic basis of DPP4 inhibitors, we have also done a comprehensive analysis of all 76 known DPP4 structures liganded to inhibitors till date. Also, the methodology presented here can be easily adopted for other drugs, and provide the first line of filtering in the identification of pathways that might be inadvertently affected due to promiscuous scaffolds in proteins.
Collapse
Affiliation(s)
- Sandeep Chakraborty
- Department of Biological Sciences, Tata Institute of Fundamental Research, Mumbai, 400 005, India ; Plant Sciences Department, University of California, Davis, CA, 95616, USA
| | | | - Bjarni Ásgeirsson
- Science Institute, Department of Biochemistry, University of Iceland, IS-107 Reykjavik, Iceland
| | - Mouparna Dutta
- Department of Biotechnology, Indian Institute of Technology Kharagpur, Kharagpur, 721302, India
| | - Anindya S Ghosh
- Department of Biotechnology, Indian Institute of Technology Kharagpur, Kharagpur, 721302, India
| | - Masataka Oda
- Division of Microbiology and Infectious Diseases, Niigata University Graduate School of Medical and Dental Sciences, Niigata, 951-8514, Japan
| | - Ravindra Venkatramani
- Department of Chemical Sciences, Tata Institute of Fundamental Research, Mumbai, 400 005, India
| | - Basuthkar J Rao
- Department of Biological Sciences, Tata Institute of Fundamental Research, Mumbai, 400 005, India
| | - Abhaya M Dandekar
- Plant Sciences Department, University of California, Davis, CA, 95616, USA
| | - Félix M Goñi
- Unidad de Bio, Universidad del Pais Vasco, Bilbao, Spain
| |
Collapse
|
46
|
Jalencas X, Mestres J. Identification of Similar Binding Sites to Detect Distant Polypharmacology. Mol Inform 2013; 32:976-90. [PMID: 27481143 DOI: 10.1002/minf.201300082] [Citation(s) in RCA: 41] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2013] [Accepted: 07/29/2013] [Indexed: 01/19/2023]
Abstract
The ability of small molecules to interact with multiple proteins is referred to as polypharmacology. This property is often linked to the therapeutic action of drugs but it is known also to be responsible for many of their side effects. Because of its importance, the development of computational methods that can predict drug polypharmacology has become an important line of research that led recently to the identification of many novel targets for known drugs. Nowadays, the majority of these methods are based on measuring the similarity of a query molecule against the hundreds of thousands of molecules for which pharmacological data on thousands of proteins are available in public sources. However, similarity-based methods are inherently biased by the chemical coverage offered by the active molecules present in those public repositories, which limits significantly their capacity to predict interactions with proteins structurally and functionally unrelated to any of the already known targets for drugs. It is in this respect that structure-based methods aiming at identifying similar binding sites may offer an alternative complementary means to ligand-based methods for detecting distant polypharmacology. The different existing approaches to binding site detection, representation, comparison, and fragmentation are reviewed and recent successful applications presented.
Collapse
Affiliation(s)
- Xavier Jalencas
- Systems Pharmacology, Research Program on Biomedical Informatics (GRIB), IMIM Hospital del Mar Research Institute & University Pompeu Fabra, Parc de Recerca Biomèdica, Doctor Aiguader 88, 08003 Barcelona, Catalonia, Spain fax: +34 93 3160550
| | - Jordi Mestres
- Systems Pharmacology, Research Program on Biomedical Informatics (GRIB), IMIM Hospital del Mar Research Institute & University Pompeu Fabra, Parc de Recerca Biomèdica, Doctor Aiguader 88, 08003 Barcelona, Catalonia, Spain fax: +34 93 3160550.
| |
Collapse
|
47
|
Piontek K, Strittmatter E, Ullrich R, Gröbe G, Pecyna MJ, Kluge M, Scheibner K, Hofrichter M, Plattner DA. Structural basis of substrate conversion in a new aromatic peroxygenase: cytochrome P450 functionality with benefits. J Biol Chem 2013; 288:34767-76. [PMID: 24126915 DOI: 10.1074/jbc.m113.514521] [Citation(s) in RCA: 92] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
Aromatic peroxygenases (APOs) represent a unique oxidoreductase sub-subclass of heme proteins with peroxygenase and peroxidase activity and were thus recently assigned a distinct EC classification (EC 1.11.2.1). They catalyze, inter alia, oxyfunctionalization reactions of aromatic and aliphatic hydrocarbons with remarkable regio- and stereoselectivities. When compared with cytochrome P450, APOs appear to be the choice enzymes for oxyfunctionalizations in organic synthesis due to their independence from a cellular environment and their greater chemical versatility. Here, the first two crystal structures of a heavily glycosylated fungal aromatic peroxygenase (AaeAPO) are described. They reveal different pH-dependent ligand binding modes. We model the fitting of various substrates in AaeAPO, illustrating the way the enzyme oxygenates polycyclic aromatic hydrocarbons. Spatial restrictions by a phenylalanine pentad in the active-site environment govern substrate specificity in AaeAPO.
Collapse
Affiliation(s)
- Klaus Piontek
- From the Institute of Organic Chemistry, University of Freiburg, Albertstrasse 21, 79104 Freiburg
| | | | | | | | | | | | | | | | | |
Collapse
|
48
|
Bartlett ME, Whipple CJ. Protein change in plant evolution: tracing one thread connecting molecular and phenotypic diversity. FRONTIERS IN PLANT SCIENCE 2013; 4:382. [PMID: 24124420 PMCID: PMC3794426 DOI: 10.3389/fpls.2013.00382] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/01/2013] [Accepted: 09/06/2013] [Indexed: 05/29/2023]
Abstract
Proteins change over the course of evolutionary time. New protein-coding genes and gene families emerge and diversify, ultimately affecting an organism's phenotype and interactions with its environment. Here we survey the range of structural protein change observed in plants and review the role these changes have had in the evolution of plant form and function. Verified examples tying evolutionary change in protein structure to phenotypic change remain scarce. We will review the existing examples, as well as draw from investigations into domestication, and quantitative trait locus (QTL) cloning studies searching for the molecular underpinnings of natural variation. The evolutionary significance of many cloned QTL has not been assessed, but all the examples identified so far have begun to reveal the extent of protein structural diversity tolerated in natural systems. This molecular (and phenotypic) diversity could come to represent part of natural selection's source material in the adaptive evolution of novel traits. Protein structure and function can change in many distinct ways, but the changes we identified in studies of natural diversity and protein evolution were predicted to fall primarily into one of six categories: altered active and binding sites; altered protein-protein interactions; altered domain content; altered activity as an activator or repressor; altered protein stability; and hypomorphic and hypermorphic alleles. There was also variability in the evolutionary scale at which particular changes were observed. Some changes were detected at both micro- and macroevolutionary timescales, while others were observed primarily at deep or shallow phylogenetic levels. This variation might be used to determine the trajectory of future investigations in structural molecular evolution.
Collapse
Affiliation(s)
| | - Clinton J. Whipple
- *Correspondence: Clinton J. Whipple, Biology Department, Brigham Young University, 401 WIDB, Provo, UT 84602, USA e-mail:
| |
Collapse
|
49
|
Abstract
Rhodopsins are photochemically reactive membrane proteins that covalently bind retinal chromophores. Type I rhodopsins are found in both prokaryotes and eukaryotic microbes, whereas type II rhodopsins function as photoactivated G-protein coupled receptors (GPCRs) in animal vision. Both rhodopsin families share the seven transmembrane α-helix GPCR fold and a Schiff base linkage from a conserved lysine to retinal in helix G. Nevertheless, rhodopsins are widely cited as a striking example of evolutionary convergence, largely because the two families lack detectable sequence similarity and differ in many structural and mechanistic details. Convergence entails that the shared rhodopsin fold is so especially suited to photosensitive function that proteins from separate origins were selected for this architecture twice. Here we show, however, that the rhodopsin fold is not required for photosensitive activity. We engineered functional bacteriorhodopsin variants with novel folds, including radical noncircular permutations of the α-helices, circular permutations of an eight-helix construct, and retinal linkages relocated to other helices. These results contradict a key prediction of convergence and thereby provide an experimental attack on one of the most intractable problems in molecular evolution: how to establish structural homology for proteins devoid of discernible sequence similarity.
Collapse
|
50
|
Convergent evolution of chromatin modification by structurally distinct enzymes: comparative enzymology of histone H3 Lys²⁷ methylation by human polycomb repressive complex 2 and vSET. Biochem J 2013; 453:241-7. [PMID: 23679895 DOI: 10.1042/bj20130439] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
H3K27 (histone H3 Lys27) methylation is an important epigenetic modification that regulates gene transcription. In humans, EZH (enhancer of zeste homologue) 1 and EZH2 are the only enzymes capable of catalysing methylation of H3K27. There is great interest in understanding structure-function relationships for EZH2, as genetic alterations in this enzyme are thought to play a causal role in a number of human cancers. EZH2 is challenging to study because it is only active in the context of the multi-subunit PRC2 (polycomb repressive complex 2). vSET is a viral lysine methyltransferase that represents the smallest protein unit capable of catalysing H3K27 methylation. The crystal structure of this minimal catalytic protein has been solved and researchers have suggested that vSET might prove useful as an EZH2 surrogate for the development of active site-directed inhibitors. To test this proposition, we conducted comparative enzymatic analysis of human EZH2 and vSET and report that, although both enzymes share similar preferences for methylation of H3K27, they diverge in terms of their permissiveness for catalysing methylation of alternative histone lysine sites, their relative preferences for utilization of multimeric macromolecular substrates, their active site primary sequences and, most importantly, their sensitivity to inhibition by drug-like small molecules. The cumulative data led us to suggest that EZH2 and vSET have very distinct active site structures, despite the commonality of the reaction catalysed by the two enzymes. Hence, the EZH2 and vSET pair of enzymes represent an example of convergent evolution in which distinct structural solutions have developed to solve a common catalytic need.
Collapse
|