Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Rahman SA, Cuesta SM, Furnham N, Holliday GL, Thornton JM. EC-BLAST: a tool to automatically search and compare enzyme reactions. Nat Methods 2014;11:171-4. [PMID: 24412978 PMCID: PMC4122987 DOI: 10.1038/nmeth.2803] [Citation(s) in RCA: 88] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2013] [Accepted: 12/10/2013] [Indexed: 11/08/2022]

For:	Rahman SA, Cuesta SM, Furnham N, Holliday GL, Thornton JM. EC-BLAST: a tool to automatically search and compare enzyme reactions. Nat Methods 2014;11:171-4. [PMID: 24412978 PMCID: PMC4122987 DOI: 10.1038/nmeth.2803] [Citation(s) in RCA: 88] [Impact Index Per Article: 8.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2013] [Accepted: 12/10/2013] [Indexed: 11/08/2022]

Number

Cited by Other Article(s)

Gricourt G, Meyer P, Duigou T, Faulon JL. Artificial Intelligence Methods and Models for Retro-Biosynthesis: A Scoping Review. ACS Synth Biol 2024. [PMID: 39047143 DOI: 10.1021/acssynbio.4c00091] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/27/2024]

Shi Z, Wang D, Li Y, Deng R, Lin J, Liu C, Li H, Wang R, Zhao M, Mao Z, Yuan Q, Liao X, Ma H. REME: an integrated platform for reaction enzyme mining and evaluation. Nucleic Acids Res 2024;52:W299-W305. [PMID: 38769057 PMCID: PMC11223788 DOI: 10.1093/nar/gkae405] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2024] [Revised: 04/16/2024] [Accepted: 05/01/2024] [Indexed: 05/22/2024] Open

Affiliation(s)

Zhenkun Shi Biodesign Center, Key Laboratory of Engineering Biology for Low-carbon Manufacturing, Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin 300308, PR China
Dehang Wang Biodesign Center, Key Laboratory of Engineering Biology for Low-carbon Manufacturing, Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin 300308, PR China College of Biotechnology, Tianjin University of Science and Technology, Tianjin 300457, PR China
Yang Li Biodesign Center, Key Laboratory of Engineering Biology for Low-carbon Manufacturing, Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin 300308, PR China University of Chinese Academy of Sciences, Beijing 101408, PR China
Rui Deng Biodesign Center, Key Laboratory of Engineering Biology for Low-carbon Manufacturing, Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin 300308, PR China College of Biotechnology, Tianjin University of Science and Technology, Tianjin 300457, PR China
Jiawei Lin Biodesign Center, Key Laboratory of Engineering Biology for Low-carbon Manufacturing, Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin 300308, PR China College of Biotechnology, Tianjin University of Science and Technology, Tianjin 300457, PR China
Cui Liu Biodesign Center, Key Laboratory of Engineering Biology for Low-carbon Manufacturing, Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin 300308, PR China
Haoran Li Biodesign Center, Key Laboratory of Engineering Biology for Low-carbon Manufacturing, Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin 300308, PR China
Ruoyu Wang Biodesign Center, Key Laboratory of Engineering Biology for Low-carbon Manufacturing, Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin 300308, PR China
Muqiang Zhao Biodesign Center, Key Laboratory of Engineering Biology for Low-carbon Manufacturing, Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin 300308, PR China
Zhitao Mao Biodesign Center, Key Laboratory of Engineering Biology for Low-carbon Manufacturing, Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin 300308, PR China
Qianqian Yuan Biodesign Center, Key Laboratory of Engineering Biology for Low-carbon Manufacturing, Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin 300308, PR China
Xiaoping Liao Biodesign Center, Key Laboratory of Engineering Biology for Low-carbon Manufacturing, Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin 300308, PR China Haihe Laboratory of Synthetic Biology, Tianjin 300308, PR China
Hongwu Ma Biodesign Center, Key Laboratory of Engineering Biology for Low-carbon Manufacturing, Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin 300308, PR China

Collapse

Ferreira S, Balola A, Sveshnikova A, Hatzimanikatis V, Vilaça P, Maia P, Carreira R, Stoney R, Carbonell P, Souza CS, Correia J, Lousa D, Soares CM, Rocha I. Computer-aided design and implementation of efficient biosynthetic pathways to produce high added-value products derived from tyrosine in Escherichia coli. Front Bioeng Biotechnol 2024;12:1360740. [PMID: 38978715 PMCID: PMC11228882 DOI: 10.3389/fbioe.2024.1360740] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/24/2023] [Accepted: 06/03/2024] [Indexed: 07/10/2024] Open

Abstract

Developing efficient bioprocesses requires selecting the best biosynthetic pathways, which can be challenging and time-consuming due to the vast amount of data available in databases and literature. The extension of the shikimate pathway for the biosynthesis of commercially attractive molecules often involves promiscuous enzymes or lacks well-established routes. To address these challenges, we developed a computational workflow integrating enumeration/retrosynthesis algorithms, a toolbox for pathway analysis, enzyme selection tools, and a gene discovery pipeline, supported by manual curation and literature review. Our focus has been on implementing biosynthetic pathways for tyrosine-derived compounds, specifically L-3,4-dihydroxyphenylalanine (L-DOPA) and dopamine, with significant applications in health and nutrition. We selected one pathway to produce L-DOPA and two different pathways for dopamine-one already described in the literature and a novel pathway. Our goal was either to identify the most suitable gene candidates for expression in Escherichia coli for the known pathways or to discover innovative pathways. Although not all implemented pathways resulted in the accumulation of target compounds, in our shake-flask experiments we achieved a maximum L-DOPA titer of 0.71 g/L and dopamine titers of 0.29 and 0.21 g/L for known and novel pathways, respectively. In the case of L-DOPA, we utilized, for the first time, a mutant version of tyrosinase from Ralstonia solanacearum. Production of dopamine via the known biosynthesis route was accomplished by coupling the L-DOPA pathway with the expression of DOPA decarboxylase from Pseudomonas putida, resulting in a unique biosynthetic pathway never reported in literature before. In the context of the novel pathway, dopamine was produced using tyramine as the intermediate compound. To achieve this, tyrosine was initially converted into tyramine by expressing TDC from Levilactobacillus brevis, which, in turn, was converted into dopamine through the action of the enzyme encoded by ppoMP from Mucuna pruriens. This marks the first time that an alternative biosynthetic pathway for dopamine has been validated in microbes. These findings underscore the effectiveness of our computational workflow in facilitating pathway enumeration and selection, offering the potential to uncover novel biosynthetic routes, thus paving the way for other target compounds of biotechnological interest.

Collapse

Affiliation(s)

Sofia Ferreira Systems and Synthetic Biology Laboratory, ITQB Nova-Instituto de Tecnologia Química e Biológica António Xavier, Oeiras, Portugal
Alexandra Balola Systems and Synthetic Biology Laboratory, ITQB Nova-Instituto de Tecnologia Química e Biológica António Xavier, Oeiras, Portugal
Anastasia Sveshnikova Laboratory of Computational Systems Biotechnology, École Polytechnique Fédérale de Lausanne, EPFL, Lausanne, Switzerland
Vassily Hatzimanikatis Laboratory of Computational Systems Biotechnology, École Polytechnique Fédérale de Lausanne, EPFL, Lausanne, Switzerland
Paulo Vilaça SilicoLife-Computational Biology Solutions for the Life Sciences, Braga, Portugal
Paulo Maia SilicoLife-Computational Biology Solutions for the Life Sciences, Braga, Portugal
Rafael Carreira SilicoLife-Computational Biology Solutions for the Life Sciences, Braga, Portugal
Ruth Stoney Manchester Institute of Biotechnology, School of Chemistry, Faculty of Science and Engineering, University of Manchester, Manchester, United Kingdom
Pablo Carbonell Institute of Industrial Control Systems and Computing (AI2), Universitat Politècnica de València (UPV), Valencia, Spain Institute for Integrative Systems Biology I2SysBio, Universitat de València-CSIC: Consejo Superior de Investigaciones Científicas, Paterna, Spain
Caio Silva Souza Protein Modelling Laboratory, ITQB Nova-Instituto de Tecnologia Química e Biológica António Xavier, Oeiras, Portugal
João Correia Protein Modelling Laboratory, ITQB Nova-Instituto de Tecnologia Química e Biológica António Xavier, Oeiras, Portugal
Diana Lousa Protein Modelling Laboratory, ITQB Nova-Instituto de Tecnologia Química e Biológica António Xavier, Oeiras, Portugal
Cláudio M Soares Protein Modelling Laboratory, ITQB Nova-Instituto de Tecnologia Química e Biológica António Xavier, Oeiras, Portugal
Isabel Rocha Systems and Synthetic Biology Laboratory, ITQB Nova-Instituto de Tecnologia Química e Biológica António Xavier, Oeiras, Portugal

Collapse

Ribeiro AJM, Riziotis IG, Borkakoti N, Thornton JM. Enzyme function and evolution through the lens of bioinformatics. Biochem J 2023;480:1845-1863. [PMID: 37991346 PMCID: PMC10754289 DOI: 10.1042/bcj20220405] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2023] [Revised: 11/09/2023] [Accepted: 11/14/2023] [Indexed: 11/23/2023]

Probst D. An explainability framework for deep learning on chemical reactions exemplified by enzyme-catalysed reaction classification. J Cheminform 2023;15:113. [PMID: 37996942 PMCID: PMC10668483 DOI: 10.1186/s13321-023-00784-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2023] [Accepted: 11/13/2023] [Indexed: 11/25/2023] Open

Ryu G, Kim GB, Yu T, Lee SY. Deep learning for metabolic pathway design. Metab Eng 2023;80:130-141. [PMID: 37734652 DOI: 10.1016/j.ymben.2023.09.012] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2023] [Revised: 09/17/2023] [Accepted: 09/19/2023] [Indexed: 09/23/2023]

Affiliation(s)

Gahyeon Ryu Metabolic and Biomolecular Engineering National Research Laboratory, Department of Chemical and Biomolecular Engineering (BK21 Four), KAIST Institute for BioCentury, Korea Advanced Institute of Science and Technology (KAIST), Daejeon, 34141, Republic of Korea; Systems Metabolic Engineering and Systems Healthcare Cross-Generation Collaborative Laboratory, KAIST, Daejeon, 34141, Republic of Korea
Gi Bae Kim Metabolic and Biomolecular Engineering National Research Laboratory, Department of Chemical and Biomolecular Engineering (BK21 Four), KAIST Institute for BioCentury, Korea Advanced Institute of Science and Technology (KAIST), Daejeon, 34141, Republic of Korea; Systems Metabolic Engineering and Systems Healthcare Cross-Generation Collaborative Laboratory, KAIST, Daejeon, 34141, Republic of Korea
Taeho Yu Metabolic and Biomolecular Engineering National Research Laboratory, Department of Chemical and Biomolecular Engineering (BK21 Four), KAIST Institute for BioCentury, Korea Advanced Institute of Science and Technology (KAIST), Daejeon, 34141, Republic of Korea; Systems Metabolic Engineering and Systems Healthcare Cross-Generation Collaborative Laboratory, KAIST, Daejeon, 34141, Republic of Korea
Sang Yup Lee Metabolic and Biomolecular Engineering National Research Laboratory, Department of Chemical and Biomolecular Engineering (BK21 Four), KAIST Institute for BioCentury, Korea Advanced Institute of Science and Technology (KAIST), Daejeon, 34141, Republic of Korea; Systems Metabolic Engineering and Systems Healthcare Cross-Generation Collaborative Laboratory, KAIST, Daejeon, 34141, Republic of Korea; BioProcess Engineering Research Center and BioInformatics Research Center, KAIST, Daejeon, 34141, Republic of Korea; Graduate School of Engineering Biology, KAIST, Daejeon, 34141, Republic of Korea.

Collapse

Riziotis IG, Ribeiro AJM, Borkakoti N, Thornton JM. The 3D Modules of Enzyme Catalysis: Deconstructing Active Sites into Distinct Functional Entities. J Mol Biol 2023;435:168254. [PMID: 37652131 DOI: 10.1016/j.jmb.2023.168254] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2023] [Revised: 08/20/2023] [Accepted: 08/22/2023] [Indexed: 09/02/2023]

Sarker B, Khare N, Devignes MD, Aridhi S. Improving automatic GO annotation with semantic similarity. BMC Bioinformatics 2022;23:433. [PMID: 36510133 PMCID: PMC9743508 DOI: 10.1186/s12859-022-04958-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2022] [Accepted: 09/19/2022] [Indexed: 12/14/2022] Open

Abstract

BACKGROUND

Automatic functional annotation of proteins is an open research problem in bioinformatics. The growing number of protein entries in public databases, for example in UniProtKB, poses challenges in manual functional annotation. Manual annotation requires expert human curators to search and read related research articles, interpret the results, and assign the annotations to the proteins. Thus, it is a time-consuming and expensive process. Therefore, designing computational tools to perform automatic annotation leveraging the high quality manual annotations that already exist in UniProtKB/SwissProt is an important research problem RESULTS: In this paper, we extend and adapt the GrAPFI (graph-based automatic protein function inference) (Sarker et al. in BMC Bioinform 21, 2020; Sarker et al., in: Proceedings of 7th international conference on complex networks and their applications, Cambridge, 2018) method for automatic annotation of proteins with gene ontology (GO) terms renaming it as GrAPFI-GO. The original GrAPFI method uses label propagation in a similarity graph where proteins are linked through the domains, families, and superfamilies that they share. Here, we also explore various types of similarity measures based on common neighbors in the graph. Moreover, GO terms are arranged in a hierarchical manner according to semantic parent-child relations. Therefore, we propose an efficient pruning and post-processing technique that integrates both semantic similarity and hierarchical relations between the GO terms. We produce experimental results comparing the GrAPFI-GO method with and without considering common neighbors similarity. We also test the performance of GrAPFI-GO and other annotation tools for GO annotation on a benchmark of proteins with and without the proposed pruning and post-processing procedure.

CONCLUSION

Our results show that the proposed semantic hierarchical post-processing potentially improves the performance of GrAPFI-GO and of other annotation tools as well. Thus, GrAPFI-GO exposes an original efficient and reusable procedure, to exploit the semantic relations among the GO terms in order to improve the automatic annotation of protein functions.

Collapse

The automated Galaxy-SynBioCAD pipeline for synthetic biology design and engineering. Nat Commun 2022;13:5082. [PMID: 36038542 PMCID: PMC9424320 DOI: 10.1038/s41467-022-32661-x] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2022] [Accepted: 08/11/2022] [Indexed: 11/27/2022] Open

Cho JS, Kim GB, Eun H, Moon CW, Lee SY. Designing Microbial Cell Factories for the Production of Chemicals. JACS AU 2022;2:1781-1799. [PMID: 36032533 PMCID: PMC9400054 DOI: 10.1021/jacsau.2c00344] [Citation(s) in RCA: 34] [Impact Index Per Article: 17.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/09/2022] [Revised: 07/26/2022] [Accepted: 07/26/2022] [Indexed: 05/24/2023]

Affiliation(s)

Jae Sung Cho Metabolic and Biomolecular Engineering National Research Laboratory and Systems Metabolic Engineering and Systems Healthcare Cross-Generation Collaborative Laboratory, Department of Chemical and Biomolecular Engineering (BK21 four), Korea Advanced Institute of Science and Technology (KAIST), Daejeon 34141, Republic of Korea KAIST Institute for the BioCentury and KAIST Institute for Artificial Intelligence, Korea Advanced Institute of Science and Technology (KAIST), Daejeon 34141, Republic of Korea BioProcess Engineering Research Center and BioInformatics Research Center, Korea Advanced Institute of Science and Technology (KAIST), Daejeon 34141, Republic of Korea
Gi Bae Kim Metabolic and Biomolecular Engineering National Research Laboratory and Systems Metabolic Engineering and Systems Healthcare Cross-Generation Collaborative Laboratory, Department of Chemical and Biomolecular Engineering (BK21 four), Korea Advanced Institute of Science and Technology (KAIST), Daejeon 34141, Republic of Korea KAIST Institute for the BioCentury and KAIST Institute for Artificial Intelligence, Korea Advanced Institute of Science and Technology (KAIST), Daejeon 34141, Republic of Korea
Hyunmin Eun Metabolic and Biomolecular Engineering National Research Laboratory and Systems Metabolic Engineering and Systems Healthcare Cross-Generation Collaborative Laboratory, Department of Chemical and Biomolecular Engineering (BK21 four), Korea Advanced Institute of Science and Technology (KAIST), Daejeon 34141, Republic of Korea KAIST Institute for the BioCentury and KAIST Institute for Artificial Intelligence, Korea Advanced Institute of Science and Technology (KAIST), Daejeon 34141, Republic of Korea
Cheon Woo Moon Metabolic and Biomolecular Engineering National Research Laboratory and Systems Metabolic Engineering and Systems Healthcare Cross-Generation Collaborative Laboratory, Department of Chemical and Biomolecular Engineering (BK21 four), Korea Advanced Institute of Science and Technology (KAIST), Daejeon 34141, Republic of Korea KAIST Institute for the BioCentury and KAIST Institute for Artificial Intelligence, Korea Advanced Institute of Science and Technology (KAIST), Daejeon 34141, Republic of Korea
Sang Yup Lee Metabolic and Biomolecular Engineering National Research Laboratory and Systems Metabolic Engineering and Systems Healthcare Cross-Generation Collaborative Laboratory, Department of Chemical and Biomolecular Engineering (BK21 four), Korea Advanced Institute of Science and Technology (KAIST), Daejeon 34141, Republic of Korea KAIST Institute for the BioCentury and KAIST Institute for Artificial Intelligence, Korea Advanced Institute of Science and Technology (KAIST), Daejeon 34141, Republic of Korea BioProcess Engineering Research Center and BioInformatics Research Center, Korea Advanced Institute of Science and Technology (KAIST), Daejeon 34141, Republic of Korea

Collapse

Warrier T, El Farran C, Zeng Y, Ho B, Bao Q, Zheng Z, Bi X, Ng HH, Ong D, Chu J, Sanyal A, Fullwood MJ, Collins J, Li H, Xu J, Loh YH. SETDB1 acts as a topological accessory to Cohesin via an H3K9me3-independent, genomic shunt for regulating cell fates. Nucleic Acids Res 2022;50:7326-7349. [PMID: 35776115 PMCID: PMC9303280 DOI: 10.1093/nar/gkac531] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2021] [Revised: 05/30/2022] [Accepted: 06/30/2022] [Indexed: 11/13/2022] Open

Affiliation(s)

Tushar Warrier Cell Fate Engineering and Therapeutics Lab, Cell Biology and Therapies Division, A*STAR Institute of Molecular and Cell Biology, Singapore 138673, Singapore Department of Biological Sciences, National University of Singapore, Singapore 117543, Singapore
Chadi El Farran Cell Fate Engineering and Therapeutics Lab, Cell Biology and Therapies Division, A*STAR Institute of Molecular and Cell Biology, Singapore 138673, Singapore Department of Biological Sciences, National University of Singapore, Singapore 117543, Singapore
Yingying Zeng Cell Fate Engineering and Therapeutics Lab, Cell Biology and Therapies Division, A*STAR Institute of Molecular and Cell Biology, Singapore 138673, Singapore School of Biological Sciences, Nanyang Technological University, 60 Nanyang Drive 637551, Singapore
Benedict Shao Quan Ho Cell Fate Engineering and Therapeutics Lab, Cell Biology and Therapies Division, A*STAR Institute of Molecular and Cell Biology, Singapore 138673, Singapore
Qiuye Bao Cell Fate Engineering and Therapeutics Lab, Cell Biology and Therapies Division, A*STAR Institute of Molecular and Cell Biology, Singapore 138673, Singapore
Zi Hao Zheng Cell Fate Engineering and Therapeutics Lab, Cell Biology and Therapies Division, A*STAR Institute of Molecular and Cell Biology, Singapore 138673, Singapore Department of Physiology, Yong Loo Lin School of Medicine, National University of Singapore, Singapore 117593, Singapore
Xuezhi Bi Proteomics Group, Bioprocessing Technology Institute, A*STAR, Singapore 138668, Singapore
Huck Hui Ng Gene Regulation Laboratory, Genome Institute of Singapore, Singapore 138672, Singapore
Derrick Sek Tong Ong Department of Physiology, Yong Loo Lin School of Medicine, National University of Singapore, Singapore 117593, Singapore
Justin Jang Hann Chu Department of Microbiology and Immunology, Yong Loo Lin School of Medicine, National University of Singapore, Singapore 117593, Singapore Infectious Disease Translational Research Programme, National University of Singapore, Singapore 117597, Singapore
Amartya Sanyal School of Biological Sciences, Nanyang Technological University, 60 Nanyang Drive 637551, Singapore
Melissa Jane Fullwood School of Biological Sciences, Nanyang Technological University, 60 Nanyang Drive 637551, Singapore Cancer Science Institute of Singapore, National University of Singapore, 14 Medical Drive, Singapore 117599, Singapore
James J Collins Howard Hughes Medical Institute, Boston, MA 02114, USA Institute for Medical Engineering and Science Department of Biological Engineering, and Synthetic Biology Center, Massachusetts Institute of Technology, Cambridge, MA 02114, USA Broad Institute of MIT and Harvard, Cambridge, MA 02139, USA Wyss Institute for Biologically Inspired Engineering, Harvard University, Boston, MA, USA
Hu Li Center for Individualized Medicine, Department of Molecular Pharmacology & Experimental Therapeutics, Mayo Clinic, Rochester, MN 55905, USA
Jian Xu Department of Biological Sciences, National University of Singapore, Singapore 117543, Singapore Department of Plant Systems Physiology, Radboud Institute for Biological and Environmental Sciences, Radboud University, Heyendaalseweg 135, 6525 AJ, Nijmegen, The Netherlands
Yuin-Han Loh Cell Fate Engineering and Therapeutics Lab, Cell Biology and Therapies Division, A*STAR Institute of Molecular and Cell Biology, Singapore 138673, Singapore Department of Biological Sciences, National University of Singapore, Singapore 117543, Singapore Department of Physiology, Yong Loo Lin School of Medicine, National University of Singapore, Singapore 117593, Singapore NUS Graduate School for Integrative Sciences and Engineering, National University of Singapore, 28 MedicalDrive, Singapore 117456, Singapore

Collapse

Furse S, Watkins AJ, Williams HEL, Snowden SG, Chiarugi D, Koulman A. Paternal nutritional programming of lipid metabolism is propagated through sperm and seminal plasma. Metabolomics 2022;18:13. [PMID: 35141784 PMCID: PMC8828597 DOI: 10.1007/s11306-022-01869-9] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 08/16/2021] [Accepted: 01/04/2022] [Indexed: 12/12/2022]

Tao YM, Bu CY, Zou LH, Hu YL, Zheng ZJ, Ouyang J. A comprehensive review on microbial production of 1,2-propanediol: micro-organisms, metabolic pathways, and metabolic engineering. BIOTECHNOLOGY FOR BIOFUELS 2021;14:216. [PMID: 34794503 PMCID: PMC8600716 DOI: 10.1186/s13068-021-02067-w] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/25/2021] [Accepted: 11/07/2021] [Indexed: 06/13/2023]

Lin A, Dyubankova N, Madzhidov TI, Nugmanov RI, Verhoeven J, Gimadiev TR, Afonina VA, Ibragimova Z, Rakhimbekova A, Sidorov P, Gedich A, Suleymanov R, Mukhametgaleev R, Wegner J, Ceulemans H, Varnek A. Atom-to-atom Mapping: A Benchmarking Study of Popular Mapping Algorithms and Consensus Strategies. Mol Inform 2021;41:e2100138. [PMID: 34726834 DOI: 10.1002/minf.202100138] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2021] [Accepted: 10/15/2021] [Indexed: 01/23/2023]

Affiliation(s)

Arkadii Lin Laboratory of Chemoinformatics, UMR 7140 CNRS, University of Strasbourg4, Blaise Pascal str., 67081, Strasbourg, France
Natalia Dyubankova Janssen Pharmaceutica, 30, Turnhoutseweg str., 2340, Beerse, Belgium
Timur I Madzhidov Laboratory of Chemoinformatics and Molecular Modeling, Butlerov Institute of Chemistry, Kazan Federal University, 18, Kremlyovskaya str., 420008, Kazan, Russia
Ramil I Nugmanov Laboratory of Chemoinformatics and Molecular Modeling, Butlerov Institute of Chemistry, Kazan Federal University, 18, Kremlyovskaya str., 420008, Kazan, Russia
Jonas Verhoeven Janssen Pharmaceutica, 30, Turnhoutseweg str., 2340, Beerse, Belgium
Timur R Gimadiev Institute for Chemical Reaction Design and Discovery (WPI-ICReDD), Hokkaido University, Kita 21 Nishi 10, Sapporo, Kita-ku, 001-0021, Sapporo, Japan
Valentina A Afonina Laboratory of Chemoinformatics and Molecular Modeling, Butlerov Institute of Chemistry, Kazan Federal University, 18, Kremlyovskaya str., 420008, Kazan, Russia
Zarina Ibragimova Laboratory of Chemoinformatics and Molecular Modeling, Butlerov Institute of Chemistry, Kazan Federal University, 18, Kremlyovskaya str., 420008, Kazan, Russia
Assima Rakhimbekova Laboratory of Chemoinformatics and Molecular Modeling, Butlerov Institute of Chemistry, Kazan Federal University, 18, Kremlyovskaya str., 420008, Kazan, Russia
Pavel Sidorov Institute for Chemical Reaction Design and Discovery (WPI-ICReDD), Hokkaido University, Kita 21 Nishi 10, Sapporo, Kita-ku, 001-0021, Sapporo, Japan
Andrei Gedich Arcadia Inc., 28 k2, Bolshoy Sampsonievskiy pr., St. Petersburg, 194044, Russia
Rail Suleymanov Arcadia Inc., 28 k2, Bolshoy Sampsonievskiy pr., St. Petersburg, 194044, Russia
Ravil Mukhametgaleev Laboratory of Chemoinformatics and Molecular Modeling, Butlerov Institute of Chemistry, Kazan Federal University, 18, Kremlyovskaya str., 420008, Kazan, Russia
Joerg Wegner Janssen Pharmaceutica, 30, Turnhoutseweg str., 2340, Beerse, Belgium
Hugo Ceulemans Janssen Pharmaceutica, 30, Turnhoutseweg str., 2340, Beerse, Belgium
Alexandre Varnek Laboratory of Chemoinformatics, UMR 7140 CNRS, University of Strasbourg4, Blaise Pascal str., 67081, Strasbourg, France.,Institute for Chemical Reaction Design and Discovery (WPI-ICReDD), Hokkaido University, Kita 21 Nishi 10, Sapporo, Kita-ku, 001-0021, Sapporo, Japan

Collapse

Dong J, Zhao M, Liu Y, Su Y, Zeng X. Deep learning in retrosynthesis planning: datasets, models and tools. Brief Bioinform 2021;23:6375056. [PMID: 34571535 DOI: 10.1093/bib/bbab391] [Citation(s) in RCA: 32] [Impact Index Per Article: 10.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2021] [Revised: 08/16/2021] [Accepted: 08/30/2021] [Indexed: 12/29/2022] Open

Visani GM, Hughes MC, Hassoun S. Enzyme Promiscuity Prediction Using Hierarchy-Informed Multi-Label Classification. Bioinformatics 2021;37:2017–2024. [PMID: 33515234 PMCID: PMC8337005 DOI: 10.1093/bioinformatics/btab054] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2020] [Revised: 12/30/2020] [Accepted: 01/22/2021] [Indexed: 11/25/2022] Open

Hafner J, Mohammadi‐Peyhani H, Hatzimanikatis V. Pathway Design. Metab Eng 2021. [DOI: 10.1002/9783527823468.ch8] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]

Fenner K, Elsner M, Lueders T, McLachlan MS, Wackett LP, Zimmermann M, Drewes JE. Methodological Advances to Study Contaminant Biotransformation: New Prospects for Understanding and Reducing Environmental Persistence? ACS ES&T WATER 2021;1:1541-1554. [PMID: 34278380 PMCID: PMC8276273 DOI: 10.1021/acsestwater.1c00025] [Citation(s) in RCA: 22] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/23/2021] [Revised: 06/11/2021] [Accepted: 06/11/2021] [Indexed: 05/14/2023]

Habib MAH, Ismail MN. Extraction and identification of biologically important proteins from the medicinal plant God's crown (Phaleria macrocarpa). J Food Biochem 2021;45:e13817. [PMID: 34137461 DOI: 10.1111/jfbc.13817] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2021] [Revised: 05/24/2021] [Accepted: 05/28/2021] [Indexed: 11/30/2022]

Jiang J, Liu LP, Hassoun S. Learning graph representations of biochemical networks and its application to enzymatic link prediction. Bioinformatics 2021;37:793-799. [PMID: 33051674 PMCID: PMC8097755 DOI: 10.1093/bioinformatics/btaa881] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2020] [Revised: 08/01/2020] [Accepted: 09/29/2020] [Indexed: 11/20/2022] Open

Erhardt P, Bachmann K, Birkett D, Boberg M, Bodor N, Gibson G, Hawkins D, Hawksworth G, Hinson J, Koehler D, Kress B, Luniwal A, Masumoto H, Novak R, Portoghese P, Sarver J, Serafini MT, Trabbic C, Vermeulen N, Wrighton S. Glossary and tutorial of xenobiotic metabolism terms used during small molecule drug discovery and development (IUPAC Technical Report). PURE APPL CHEM 2021. [DOI: 10.1515/pac-2018-0208] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022]

Affiliation(s)

Paul Erhardt Center for Drug Design and Development , University of Toledo , Toledo , Ohio , USA
Kenneth Bachmann Ceuticare, Inc. , Sylvania , Ohio , USA, (TGM and ST)
Donald Birkett Department of Clinical Pharmacology , Flinders University , Adelaide , Australia (now Emeritus), (TGM)
Michael Boberg Metabolism and Isotope Chemistry , Bayer , AG , Germany (now undetermined), (TGM)
Nicholas Bodor Center for Drug Discovery , University of Florida , Belle Glade , FL , USA (now Emeritus Grad Res Prof/CEO Bodor Labs), (TGM)
Gordon Gibson School of Biomedical and Life Sciences, University of Surrey , Surrey , UK (now deceased), (TGM)
David Hawkins Huntingdon Life Sciences , Huntingdon , UK (now retired), (TGM)
Gabrielle Hawksworth Department of Medicine and Therapeutics , University Aberdeen , Aberdeen , UK (now deceased), (TGM)
Jack Hinson Division of Toxicology , University Arkansas for Medical Sciences , Little Rock , Arkansas , USA (now Emeritus Dist Prof), (TGM)
Daniel Koehler Department of Pharmacology , University of Toledo , Toledo , Ohio , USA, (ST)
Brian Kress Department of Medicinal and Biological Chemistry , University of Toledo , Toledo , Ohio , USA, (ST)
Amarjit Luniwal NAmSA, Inc. , Northwood , Ohio , USA, (ST)
Hiroshi Masumoto Drug Metabolism , Daiichi Pharm. Corp., Ltd. , Chuo , Tokyo , Japan (now retired), (TGM)
Raymond Novak Institute of Environmental Health Science, Wayne State University , Detroit , Michigan , USA (now undetermined), (TGM)
Phillip Portoghese Department of Medicinal Chemistry , University of Minnesota , Minneapolis , Minnesota , USA (now same), (TGM)
Jeffrey Sarver Department of Pharmacology , University of Toledo , Toledo , Ohio , USA, (ST)
M. Teresa Serafini Department of Pharmacokinetics and Drug Metabolism , Laboratories Dr. Esteve, S.A. , Barcelona , Spain (now Head Early ADME), (TGM)
Christopher Trabbic MPI Research, Inc. , Mattawan , Michigan , USA, (ST)
Nico Vermeulen Department of Pharmacochemistry , Vrije University , Amsterdam , Netherlands (now Emeritus Section Molecular Toxicology), (TGM)
Steven Wrighton Eli Lilly, Inc. , Indianapolis , Indiana , USA (now retired), (TGM)

Collapse

Hafner J, Payne J, MohammadiPeyhani H, Hatzimanikatis V, Smolke C. A computational workflow for the expansion of heterologous biosynthetic pathways to natural product derivatives. Nat Commun 2021;12:1760. [PMID: 33741955 PMCID: PMC7979880 DOI: 10.1038/s41467-021-22022-5] [Citation(s) in RCA: 27] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2020] [Accepted: 02/24/2021] [Indexed: 01/31/2023] Open

Lipid Traffic Analysis reveals the impact of high paternal carbohydrate intake on offsprings' lipid metabolism. Commun Biol 2021;4:163. [PMID: 33547386 PMCID: PMC7864968 DOI: 10.1038/s42003-021-01686-1] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2020] [Accepted: 01/08/2021] [Indexed: 12/12/2022] Open

Otero-Muras I, Carbonell P. Automated engineering of synthetic metabolic pathways for efficient biomanufacturing. Metab Eng 2020;63:61-80. [PMID: 33316374 DOI: 10.1016/j.ymben.2020.11.012] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2020] [Revised: 11/15/2020] [Accepted: 11/20/2020] [Indexed: 12/19/2022]

Carbonell P, Le Feuvre R, Takano E, Scrutton NS. In silico design and automated learning to boost next-generation smart biomanufacturing. Synth Biol (Oxf) 2020;5:ysaa020. [PMID: 33344778 PMCID: PMC7737007 DOI: 10.1093/synbio/ysaa020] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2020] [Revised: 09/08/2020] [Accepted: 09/28/2020] [Indexed: 02/07/2023] Open

Sun D, Cheng X, Tian Y, Ding S, Zhang D, Cai P, Hu QN. EnzyMine: a comprehensive database for enzyme function annotation with enzymatic reaction chemical feature. Database (Oxford) 2020;2023:baaa065. [PMID: 33002112 PMCID: PMC10755256 DOI: 10.1093/database/baaa065] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2020] [Revised: 07/19/2020] [Accepted: 07/24/2020] [Indexed: 11/14/2022]

Chen F, Yuan L, Ding S, Tian Y, Hu QN. Data-driven rational biosynthesis design: from molecules to cell factories. Brief Bioinform 2020;21:1238-1248. [PMID: 31243440 DOI: 10.1093/bib/bbz065] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2019] [Revised: 04/28/2019] [Accepted: 05/08/2019] [Indexed: 11/12/2022] Open

Sarker B, Ritchie DW, Aridhi S. GrAPFI: predicting enzymatic function of proteins from domain similarity graphs. BMC Bioinformatics 2020;21:168. [PMID: 32349654 PMCID: PMC7191693 DOI: 10.1186/s12859-020-3460-7] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2019] [Accepted: 03/19/2020] [Indexed: 01/20/2023] Open

Holliday GL, Brown SD, Mischel D, Polacco BJ, Babbitt PC. A strategy for large-scale comparison of evolutionary- and reaction-based classifications of enzyme function. Database (Oxford) 2020;2020:baaa034. [PMID: 32449511 PMCID: PMC7246345 DOI: 10.1093/database/baaa034] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2019] [Revised: 03/18/2020] [Accepted: 04/27/2020] [Indexed: 12/12/2022]

Abstract

Determining the molecular function of enzymes discovered by genome sequencing represents a primary foundation for understanding many aspects of biology. Historically, classification of enzyme reactions has used the enzyme nomenclature system developed to describe the overall reactions performed by biochemically characterized enzymes, irrespective of their associated sequences. In contrast, functional classification and assignment for the millions of protein sequences of unknown function now available is largely done in two computational steps, first by similarity-based assignment of newly obtained sequences to homologous groups, followed by transferring to them the known functions of similar biochemically characterized homologs. Due to the fundamental differences in their etiologies and practice, `how' these chemistry- and evolution-centric functional classification systems relate to each other has been difficult to explore on a large scale. To investigate this issue in a new way, we integrated two published ontologies that had previously described each of these classification systems independently. The resulting infrastructure was then used to compare the functional assignments obtained from each classification system for the well-studied and functionally diverse enolase superfamily. Mapping these function assignments to protein structure and reaction similarity networks shows a profound and complex disconnect between the homology- and chemistry-based classification systems. This conclusion mirrors previous observations suggesting that except for closely related sequences, facile annotation transfer from small numbers of characterized enzymes to the huge number uncharacterized homologs to which they are related is problematic. Our extension of these comparisons to large enzyme superfamilies in a computationally intelligent manner provides a foundation for new directions in protein function prediction for the huge proportion of sequences of unknown function represented in major databases. Interactive sequence, reaction, substrate and product similarity networks computed for this work for the enolase and two other superfamilies are freely available for download from the Structure Function Linkage Database Archive (http://sfld.rbvi.ucsf.edu).

Collapse

Chung NC, Miasojedow B, Startek M, Gambin A. Jaccard/Tanimoto similarity test and estimation methods for biological presence-absence data. BMC Bioinformatics 2019;20:644. [PMID: 31874610 PMCID: PMC6929325 DOI: 10.1186/s12859-019-3118-5] [Citation(s) in RCA: 73] [Impact Index Per Article: 14.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2019] [Accepted: 09/27/2019] [Indexed: 11/12/2022] Open

Abstract

Background

A survey of presences and absences of specific species across multiple biogeographic units (or bioregions) are used in a broad area of biological studies from ecology to microbiology. Using binary presence-absence data, we evaluate species co-occurrences that help elucidate relationships among organisms and environments. To summarize similarity between occurrences of species, we routinely use the Jaccard/Tanimoto coefficient, which is the ratio of their intersection to their union. It is natural, then, to identify statistically significant Jaccard/Tanimoto coefficients, which suggest non-random co-occurrences of species. However, statistical hypothesis testing using this similarity coefficient has been seldom used or studied.

Results

We introduce a hypothesis test for similarity for biological presence-absence data, using the Jaccard/Tanimoto coefficient. Several key improvements are presented including unbiased estimation of expectation and centered Jaccard/Tanimoto coefficients, that account for occurrence probabilities. The exact and asymptotic solutions are derived. To overcome a computational burden due to high-dimensionality, we propose the bootstrap and measurement concentration algorithms to efficiently estimate statistical significance of binary similarity. Comprehensive simulation studies demonstrate that our proposed methods produce accurate p-values and false discovery rates. The proposed estimation methods are orders of magnitude faster than the exact solution, particularly with an increasing dimensionality. We showcase their applications in evaluating co-occurrences of bird species in 28 islands of Vanuatu and fish species in 3347 freshwater habitats in France. The proposed methods are implemented in an open source R package called jaccard (https://cran.r-project.org/package=jaccard).

Conclusion

We introduce a suite of statistical methods for the Jaccard/Tanimoto similarity coefficient for binary data, that enable straightforward incorporation of probabilistic measures in analysis for species co-occurrences. Due to their generality, the proposed methods and implementations are applicable to a wide range of binary data arising from genomics, biochemistry, and other areas of science.

Collapse

Ribeiro AJM, Tyzack JD, Borkakoti N, Holliday GL, Thornton JM. A global analysis of function and conservation of catalytic residues in enzymes. J Biol Chem 2019;295:314-324. [PMID: 31796628 DOI: 10.1074/jbc.rev119.006289] [Citation(s) in RCA: 58] [Impact Index Per Article: 11.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open

Carboxylic Ester Hydrolases in Bacteria: Active Site, Structure, Function and Application. CRYSTALS 2019. [DOI: 10.3390/cryst9110597] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]

Tyzack JD, Ribeiro AJM, Borkakoti N, Thornton JM. Transform-MinER: transforming molecules in enzyme reactions. Bioinformatics 2019;34:3597-3599. [PMID: 29762650 PMCID: PMC6184704 DOI: 10.1093/bioinformatics/bty394] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2018] [Accepted: 05/09/2018] [Indexed: 11/12/2022] Open

Ribeiro AJM, Holliday GL, Furnham N, Tyzack JD, Ferris K, Thornton JM. Mechanism and Catalytic Site Atlas (M-CSA): a database of enzyme reaction mechanisms and active sites. Nucleic Acids Res 2019;46:D618-D623. [PMID: 29106569 PMCID: PMC5753290 DOI: 10.1093/nar/gkx1012] [Citation(s) in RCA: 111] [Impact Index Per Article: 22.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2017] [Accepted: 10/13/2017] [Indexed: 12/28/2022] Open

Automatic mapping of atoms across both simple and complex chemical reactions. Nat Commun 2019;10:1434. [PMID: 30926819 PMCID: PMC6441094 DOI: 10.1038/s41467-019-09440-2] [Citation(s) in RCA: 48] [Impact Index Per Article: 9.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2018] [Accepted: 03/01/2019] [Indexed: 11/08/2022] Open

Enzyme annotation for orphan and novel reactions using knowledge of substrate reactive sites. Proc Natl Acad Sci U S A 2019;116:7298-7307. [PMID: 30910961 PMCID: PMC6462048 DOI: 10.1073/pnas.1818877116] [Citation(s) in RCA: 45] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open

Abstract

Recent advances in synthetic biochemistry have resulted in a wealth of novel hypothetical enzymatic reactions that are not matched to protein-encoding genes, deeming them “orphan.” A large number of known metabolic enzymes are also orphan, leaving important gaps in metabolic network maps. Proposing genes for the catalysis of orphan reactions is critical for applications ranging from biotechnology to medicine. In this work, the computational method BridgIT identified potential enzymes of orphan reactions and nearly all theoretically possible biochemical transformations, providing candidate genes to catalyze these reactions to the research community. The BridgIT online tool will allow researchers to fill the knowledge gaps in metabolic networks and will act as a starting point for designing novel enzymes to catalyze nonnatural transformations.

Thousands of biochemical reactions with characterized activities are “orphan,” meaning they cannot be assigned to a specific enzyme, leaving gaps in metabolic pathways. Novel reactions predicted by pathway-generation tools also lack associated sequences, limiting protein engineering applications. Associating orphan and novel reactions with known biochemistry and suggesting enzymes to catalyze them is a daunting problem. We propose the method BridgIT to identify candidate genes and catalyzing proteins for these reactions. This method introduces information about the enzyme binding pocket into reaction-similarity comparisons. BridgIT assesses the similarity of two reactions, one orphan and one well-characterized nonorphan reaction, using their substrate reactive sites, their surrounding structures, and the structures of the generated products to suggest enzymes that catalyze the most-similar nonorphan reactions as candidates for also catalyzing the orphan ones. We performed two large-scale validation studies to test BridgIT predictions against experimental biochemical evidence. For the 234 orphan reactions from the Kyoto Encyclopedia of Genes and Genomes (KEGG) 2011 (a comprehensive enzymatic-reaction database) that became nonorphan in KEGG 2018, BridgIT predicted the exact or a highly related enzyme for 211 of them. Moreover, for 334 of 379 novel reactions in 2014 that were later cataloged in KEGG 2018, BridgIT predicted the exact or highly similar enzymes. BridgIT requires knowledge about only four connecting bonds around the atoms of the reactive sites to correctly annotate proteins for 93% of analyzed enzymatic reactions. Increasing to seven connecting bonds allowed for the accurate identification of a sequence for nearly all known enzymatic reactions.

Collapse

In defence of taxonomic governance. ORG DIVERS EVOL 2019. [DOI: 10.1007/s13127-019-00391-6] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]

Tyzack JD, Furnham N, Sillitoe I, Orengo CM, Thornton JM. Exploring Enzyme Evolution from Changes in Sequence, Structure, and Function. Methods Mol Biol 2019;1851:263-275. [PMID: 30298402 DOI: 10.1007/978-1-4939-8736-8_14] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]

Zhang J, Kwong S, Wong KC. ToBio: Global Pathway Similarity Search Based on Topological and Biological Features. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2019;16:336-349. [PMID: 29990160 DOI: 10.1109/tcbb.2017.2769642] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]

TAMMiCol: Tool for analysis of the morphology of microbial colonies. PLoS Comput Biol 2018;14:e1006629. [PMID: 30507938 PMCID: PMC6292648 DOI: 10.1371/journal.pcbi.1006629] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2018] [Revised: 12/13/2018] [Accepted: 11/08/2018] [Indexed: 01/21/2023] Open

Abstract

Many microbes are studied by examining colony morphology via two-dimensional top-down images. The quantification of such images typically requires each pixel to be labelled as belonging to either the colony or background, producing a binary image. While this may be achieved manually for a single colony, this process is infeasible for large datasets containing thousands of images. The software Tool for Analysis of the Morphology of Microbial Colonies (TAMMiCol) has been developed to efficiently and automatically convert colony images to binary. TAMMiCol exploits the structure of the images to choose a thresholding tolerance and produce a binary image of the colony. The images produced are shown to compare favourably with images processed manually, while TAMMiCol is shown to outperform standard segmentation methods. Multiple images may be imported together for batch processing, while the binary data may be exported as a CSV or MATLAB MAT file for quantification, or analysed using statistics built into the software. Using the in-built statistics, it is found that images produced by TAMMiCol yield values close to those computed from binary images processed manually. Analysis of a new large dataset using TAMMiCol shows that colonies of Saccharomyces cerevisiae reach a maximum level of filamentous growth once the concentration of ammonium sulfate is reduced to 200 μM. TAMMiCol is accessed through a graphical user interface, making it easy to use for those without specialist knowledge of image processing, statistical methods or coding.

Many microbes are studied by examining the colony morphology via a two-dimensional top-down image. In order to quantify such images, we typically need to label each pixel as belonging either to the colony or the background, creating a binary image. This task is laborious when performed manually and proves infeasible for large datasets. To overcome this, we have developed the software Tool for Analysis of the Morphology of Microbial Colonies (TAMMiCol), which automatically and efficiently converts colony images to binary. Multiple images may be imported and processed simultaneously, and TAMMiCol exploits the structure of the images to identify an appropriate threshold for the binary conversion of each image. The images produced by TAMMiCol, which take around 20 seconds each to process, compare favourably with images processed manually, which take anywhere up to 15 minutes, while TAMMiCol outperforms several standard image segmentation methods. After processing, the images may be exported as a CSV or MATLAB MAT file for further analysis, or may be quantified by TAMMiCol using the in-built statistics. Using TAMMiCol, we have found that colonies of S. cerevisiae reach a maximum level of filamentous growth once the concentration of ammonium sulfate is reduced to 200 μM.

Collapse

Li Y, Wang S, Umarov R, Xie B, Fan M, Li L, Gao X. DEEPre: sequence-based enzyme EC number prediction by deep learning. Bioinformatics 2018;34:760-769. [PMID: 29069344 PMCID: PMC6030869 DOI: 10.1093/bioinformatics/btx680] [Citation(s) in RCA: 124] [Impact Index Per Article: 20.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2017] [Accepted: 10/20/2017] [Indexed: 11/15/2022] Open

Holliday GL, Akiva E, Meng EC, Brown SD, Calhoun S, Pieper U, Sali A, Booker SJ, Babbitt PC. Atlas of the Radical SAM Superfamily: Divergent Evolution of Function Using a "Plug and Play" Domain. Methods Enzymol 2018;606:1-71. [PMID: 30097089 DOI: 10.1016/bs.mie.2018.06.004] [Citation(s) in RCA: 86] [Impact Index Per Article: 14.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023]

Abstract

The radical SAM superfamily contains over 100,000 homologous enzymes that catalyze a remarkably broad range of reactions required for life, including metabolism, nucleic acid modification, and biogenesis of cofactors. While the highly conserved SAM-binding motif responsible for formation of the key 5'-deoxyadenosyl radical intermediate is a key structural feature that simplifies identification of superfamily members, our understanding of their structure-function relationships is complicated by the modular nature of their structures, which exhibit varied and complex domain architectures. To gain new insight about these relationships, we classified the entire set of sequences into similarity-based subgroups that could be visualized using sequence similarity networks. This superfamily-wide analysis reveals important features that had not previously been appreciated from studies focused on one or a few members. Functional information mapped to the networks indicates which members have been experimentally or structurally characterized, their known reaction types, and their phylogenetic distribution. Despite the biological importance of radical SAM chemistry, the vast majority of superfamily members have never been experimentally characterized in any way, suggesting that many new reactions remain to be discovered. In addition to 20 subgroups with at least one known function, we identified additional subgroups made up entirely of sequences of unknown function. Importantly, our results indicate that even general reaction types fail to track well with our sequence similarity-based subgroupings, raising major challenges for function prediction for currently identified and new members that continue to be discovered. Interactive similarity networks and other data from this analysis are available from the Structure-Function Linkage Database.

Collapse

Affiliation(s)

Gemma L Holliday Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA, United States.
Eyal Akiva Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA, United States
Elaine C Meng Resource for Biocomputing, Visualization, and Informatics, Department of Pharmaceutical Chemistry, School of Pharmacy, University of California, San Francisco, CA, United States
Shoshana D Brown Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA, United States
Sara Calhoun Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA, United States; Graduate Program in Biophysics, University of California, San Francisco, CA, United States
Ursula Pieper Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA, United States
Andrej Sali Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA, United States; Department of Pharmaceutical Chemistry, University of California San Francisco, San Francisco, CA, United States; Quantitative Biosciences Institute, University of California, San Francisco, CA, United States
Squire J Booker Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, PA, United States; Department of Chemistry, The Pennsylvania State University, University Park, PA, United States; The Howard Hughes Medical Institute, The Pennsylvania State University, University Park, PA, United States
Patricia C Babbitt Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA, United States; Department of Pharmaceutical Chemistry, University of California San Francisco, San Francisco, CA, United States; Quantitative Biosciences Institute, University of California, San Francisco, CA, United States.

Collapse

Sivakumar TV, Bhaduri A, Duvvuru Muni RR, Park JH, Kim TY. SimCAL: a flexible tool to compute biochemical reaction similarity. BMC Bioinformatics 2018;19:254. [PMID: 29969981 PMCID: PMC6029250 DOI: 10.1186/s12859-018-2248-5] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2017] [Accepted: 06/14/2018] [Indexed: 11/29/2022] Open

Abstract

Background

Computation of reaction similarity is a pre-requisite for several bioinformatics applications including enzyme identification for specific biochemical reactions, enzyme classification and mining for specific inhibitors. Reaction similarity is often assessed at either two levels: (i) comparison across all the constituent substrates and products of a reaction, reaction level similarity, (ii) comparison at the transformation center with various degrees of neighborhood, transformation level similarity. Existing reaction similarity computation tools are designed for specific applications and use different features and similarity measures. A single system integrating these diverse features enables comparison of the impact of different molecular properties on similarity score computation.

Results

To address these requirements, we present SimCAL, an integrated system to calculate reaction similarity with novel features and capability to perform comparative assessment. SimCAL provides reaction similarity computation at both whole reaction level and transformation level. Novel physicochemical features such as stereochemistry, mass, volume and charge are included in computing reaction fingerprint. Users can choose from four different fingerprint types and nine molecular similarity measures. Further, a comparative assessment of these features is also enabled. The performance of SimCAL is assessed on 3,688,122 reaction pairs with Enzyme Commission (EC) number from MetaCyc and achieved an area under the curve (AUC) of > 0.9. In addition, SimCAL results showed strong correlation with state-of-the-art EC-BLAST and molecular signature based reaction similarity methods.

Conclusions

SimCAL is developed in java and is available as a standalone tool, with intuitive, user-friendly graphical interface and also as a console application. With its customizable feature selection and similarity calculations, it is expected to cater a wide audience interested in studying and analyzing biochemical reactions and metabolic networks.

Electronic supplementary material

The online version of this article (10.1186/s12859-018-2248-5) contains supplementary material, which is available to authorized users.

Collapse

Vazquez-Hernandez C, Loza A, Peguero-Sanchez E, Segovia L, Gutierrez-Rios RM. Identification of reaction organization patterns that naturally cluster enzymatic transformations. BMC SYSTEMS BIOLOGY 2018;12:63. [PMID: 29848336 PMCID: PMC5977463 DOI: 10.1186/s12918-018-0583-9] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/13/2017] [Accepted: 05/09/2018] [Indexed: 11/10/2022]

Abstract

BACKGROUND

Metabolic reactions are chemical transformations commonly catalyzed by enzymes. In recent years, the explosion of genomic data and individual experimental characterizations have contributed to the construction of databases and methodologies for the analysis of metabolic networks. Some methodologies based on graph theory organize compound networks into metabolic functional categories without preserving biochemical pathways. Other methods based on chemical group exchange and atom flow trace the conversion of substrates into products in detail, which is useful for inferring metabolic pathways.

METHODS

Here, we present a novel rule-based approach incorporating both methods that decomposes each reaction into architectures of compound pairs and loner compounds that can be organized into tree structures. We compared the tree structure-compound pairs to those reported in the KEGG-RPAIR dataset and obtained a match precision of 81%. The generated tree structures naturally clustered all reactions into general reaction patterns of compounds with similar chemical transformations. The match precision of each cluster was calculated and used to suggest reactant-pairs for which manual curation can be avoided because this is the main goal of the method. We evaluated catalytic processes in the clusters based on Enzyme Commission categories that revealed preferential use of enzyme classes.

CONCLUSIONS

We demonstrate that the application of simple rules can enable the identification of reaction patterns reflecting metabolic reactions that transform substrates into products and the types of catalysis involved in these transformations. Our rule-based approach can be incorporated as the input in pathfinders or as a tool for the construction of reaction classifiers, indicating its usefulness for predicting enzyme catalysis.

Collapse

Fast and Flexible Synthesis of Combinatorial Libraries for Directed Evolution. Methods Enzymol 2018;608:59-79. [PMID: 30173773 DOI: 10.1016/bs.mie.2018.04.006] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023]

Plehiers PP, Marin GB, Stevens CV, Van Geem KM. Automated reaction database and reaction network analysis: extraction of reaction templates using cheminformatics. J Cheminform 2018. [PMID: 29524042 PMCID: PMC5845084 DOI: 10.1186/s13321-018-0269-8] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open

Carbonell P, Wong J, Swainston N, Takano E, Turner NJ, Scrutton NS, Kell DB, Breitling R, Faulon JL. Selenzyme: enzyme selection tool for pathway design. Bioinformatics 2018;34:2153-2154. [PMID: 29425325 PMCID: PMC9881682 DOI: 10.1093/bioinformatics/bty065] [Citation(s) in RCA: 49] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2017] [Accepted: 02/06/2018] [Indexed: 02/02/2023] Open

Mallory EK, Acharya A, Rensi SE, Turnbaugh PJ, Bright RA, Altman RB. Chemical reaction vector embeddings: towards predicting drug metabolism in the human gut microbiome. PACIFIC SYMPOSIUM ON BIOCOMPUTING. PACIFIC SYMPOSIUM ON BIOCOMPUTING 2018;23:56-67. [PMID: 29218869 PMCID: PMC5771676] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]

Carbonell P, Koch M, Duigou T, Faulon JL. Enzyme Discovery: Enzyme Selection and Pathway Design. Methods Enzymol 2018;608:3-27. [PMID: 30173766 DOI: 10.1016/bs.mie.2018.04.005] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2023]

Delépine B, Duigou T, Carbonell P, Faulon JL. RetroPath2.0: A retrosynthesis workflow for metabolic engineers. Metab Eng 2018;45:158-170. [DOI: 10.1016/j.ymben.2017.12.002] [Citation(s) in RCA: 128] [Impact Index Per Article: 21.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2017] [Revised: 11/03/2017] [Accepted: 12/05/2017] [Indexed: 12/01/2022]