Reference Citation Analysis

For: Hanson AD, Pribat A, Waller JC, de Crécy-Lagard V. 'Unknown' proteins and 'orphan' enzymes: the missing half of the engineering parts list--and how to find it. Biochem J 2009;425:1-11. [PMID: 20001958 DOI: 10.1042/BJ20091328] [Citation(s) in RCA: 135] [Impact Index Per Article: 9.0] [Indexed: 11/17/2022]



Cited by Other Article(s)

Hameleers L, Pijning T, Gray BB, Fauré R, Jurak E. Novel β-galactosidase activity and first crystal structure of Glycoside Hydrolase family 154. N Biotechnol 2024;80:1-11. [PMID: 38163476 DOI: 10.1016/j.nbt.2023.12.011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/31/2023] [Revised: 12/27/2023] [Accepted: 12/27/2023] [Indexed: 01/03/2024]

Tannock GW. Understanding the gut microbiota by considering human evolution: a story of fire, cereals, cooking, molecular ingenuity, and functional cooperation. Microbiol Mol Biol Rev 2024;88:e0012722. [PMID: 38126754 PMCID: PMC10966955 DOI: 10.1128/mmbr.00127-22] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 12/23/2023]

de Crécy-Lagard V, Hutinet G, Cediel-Becerra JDD, Yuan Y, Zallot R, Chevrette MG, Ratnayake RMMN, Jaroch M, Quaiyum S, Bruner S. Biosynthesis and function of 7-deazaguanine derivatives in bacteria and phages. Microbiol Mol Biol Rev 2024;88:e0019923. [PMID: 38421302 PMCID: PMC10966956 DOI: 10.1128/mmbr.00199-23] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 03/02/2024]

Michael-Pitschaze T, Cohen N, Ofer D, Hoshen Y, Linial M. Detecting anomalous proteins using deep representations. NAR Genom Bioinform 2024;6:lqae021. [PMID: 38486884 PMCID: PMC10939404 DOI: 10.1093/nargab/lqae021] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/14/2023] [Revised: 11/17/2023] [Accepted: 02/23/2024] [Indexed: 03/17/2024]

Reed CJ, Denise R, Hourihan J, Babor J, Jaroch M, Martinelli M, Hutinet G, de Crécy-Lagard V. Beyond Blast: Enabling Microbiologists to Better Extract Literature, Taxonomic Distributions and Gene Neighborhood Information for Protein Families. bioRxiv 2024:2023.05.03.539116. [PMID: 37205517 PMCID: PMC10187207 DOI: 10.1101/2023.05.03.539116] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 05/21/2023]

Knoshaug EP, Sun P, Nag A, Nguyen H, Mattoon EM, Zhang N, Liu J, Chen C, Cheng J, Zhang R, St. John P, Umen J. Identification and preliminary characterization of conserved uncharacterized proteins from Chlamydomonas reinhardtii, Arabidopsis thaliana, and Setaria viridis. Plant Direct 2023;7:e527. [PMID: 38044962 PMCID: PMC10690477 DOI: 10.1002/pld3.527] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/27/2023] [Revised: 08/03/2023] [Accepted: 08/11/2023] [Indexed: 12/05/2023]

Bou-Nader C, Pecqueur L, de Crécy-Lagard V, Hamdane D. Integrative Approach to Probe Alternative Redox Mechanisms in RNA Modifications. Acc Chem Res 2023;56:3142-3152. [PMID: 37916403 PMCID: PMC10999249 DOI: 10.1021/acs.accounts.3c00418] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/03/2023]

Abstract

RNA modifications found in most RNAs, particularly in tRNAs and rRNAs, reveal an abundance of chemical alterations of nucleotides. Over 150 distinct RNA modifications are known, emphasizing a remarkable diversity of chemical moieties in RNA molecules. These modifications play pivotal roles in RNA maturation, structural integrity, and the fidelity and efficiency of translation processes. The catalysts responsible for these modifications are RNA-modifying enzymes that use a striking array of chemistries to directly influence the chemical landscape of RNA. This diversity is further underscored by instances where the same modification is introduced by distinct enzymes that use unique catalytic mechanisms and cofactors across different domains of life. This phenomenon of convergent evolution highlights the biological importance of RNA modification and the vast potential within the chemical repertoire for nucleotide alteration. While shared RNA modifications can hint at conserved enzymatic pathways, a major bottleneck is to identify alternative routes within species that possess a modified RNA but are devoid of known RNA-modifying enzymes. To address this challenge, a combination of bioinformatic and experimental strategies proves invaluable in pinpointing new genes responsible for RNA modifications. This integrative approach not only unveils new chemical insights but also serves as a wellspring of inspiration for biocatalytic applications and drug design. In this Account, we present how comparative genomics and genome mining, combined with biomimetic synthetic chemistry, biochemistry, and anaerobic crystallography, can be judiciously implemented to address unprecedented and alternative chemical mechanisms in the world of RNA modification. 
We illustrate these integrative methodologies through the study of tRNA and rRNA modifications, dihydrouridine, 5-methyluridine, queuosine, 8-methyladenosine, 5-carboxymethylamino-methyluridine, or 5-taurinomethyluridine, each dependent on a diverse array of redox chemistries, often involving organic compounds, organometallic complexes, and metal coenzymes. We explore how vast genome and tRNA databases empower comparative genomic analyses and enable the identification of novel genes that govern RNA modification. Subsequently, we describe how the isolation of a stable reaction intermediate can guide the synthesis of a biomimetic to unveil new enzymatic pathways. We then discuss the usefulness of a biochemical "shunt" strategy to study catalytic mechanisms and to directly visualize reactive intermediates bound within active sites. While we primarily focus on various RNA-modifying enzymes studied in our laboratory, with a particular emphasis on the discovery of a SAM-independent methylation mechanism, the strategies and rationale presented herein are broadly applicable for the identification of new enzymes and the elucidation of their intricate chemistries. This Account offers a comprehensive glimpse into the evolving landscape of RNA modification research and highlights the pivotal role of integrated approaches to identify novel enzymatic pathways.


Pathira Kankanamge LS, Ruffner LA, Touch MM, Pina M, Beuning PJ, Ondrechen MJ. Functional annotation of haloacid dehalogenase superfamily structural genomics proteins. Biochem J 2023;480:1553-1569. [PMID: 37747786 DOI: 10.1042/bcj20230057] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/02/2023] [Revised: 09/20/2023] [Accepted: 09/25/2023] [Indexed: 09/26/2023]

Sajid S, Mashkoor M, Jørgensen MG, Christensen LP, Hansen PR, Franzyk H, Mirza O, Prabhala BK. The Y-ome Conundrum: Insights into Uncharacterized Genes and Approaches for Functional Annotation. Mol Cell Biochem 2023:10.1007/s11010-023-04827-8. [PMID: 37610616 DOI: 10.1007/s11010-023-04827-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/07/2023] [Accepted: 08/09/2023] [Indexed: 08/24/2023]

Gu X, Cao Z, Zhao L, Seswita-Zilda D, Zhang Q, Fu L, Li J. Metagenomic Insights Reveal the Microbial Diversity and Associated Algal-Polysaccharide-Degrading Enzymes on the Surface of Red Algae among Remote Regions. Int J Mol Sci 2023;24:11019. [PMID: 37446198 DOI: 10.3390/ijms241311019] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/22/2023] [Revised: 06/27/2023] [Accepted: 06/28/2023] [Indexed: 07/15/2023]

Abstract

Macroalgae and macroalgae-associated bacteria together constitute the most efficient metabolic cycling system in the ocean. Their interactions, especially the responses of macroalgae-associated bacteria communities to algae in different geographical locations, are mostly unknown. In this study, metagenomics was used to analyze the microbial diversity and associated algal-polysaccharide-degrading enzymes on the surface of red algae among three remote regions. There were significant differences in the macroalgae-associated bacteria community composition and diversity among the different regions. At the phylum level, Proteobacteria, Bacteroidetes, and Actinobacteria had a significantly high relative abundance among the regions. From the perspective of species diversity, samples from China had the highest macroalgae-associated bacteria diversity, followed by those from Antarctica and Indonesia. In addition, in the functional prediction of the bacterial community, genes associated with amino acid metabolism, carbohydrate metabolism, energy metabolism, metabolism of cofactors and vitamins, and membrane transport had a high relative abundance. Canonical correspondence analysis and redundancy analysis of environmental factors showed that, without considering algae species and composition, pH and temperature were the main environmental factors affecting bacterial community structure. Furthermore, there were significant differences in algal-polysaccharide-degrading enzymes among the regions. Samples from China and Antarctica had high abundances of algal-polysaccharide-degrading enzymes, while those from Indonesia had extremely low abundances. The environmental differences between these three regions may impose a strong geographic differentiation regarding the biodiversity of algal microbiomes and their expressed enzyme genes. 
This work expands our knowledge of algal microbial ecology, and contributes to an in-depth study of their metabolic characteristics, ecological functions, and applications.


Zeng X, Kahng A, Xue L, Mahamid J, Chang YW, Xu M. High-throughput cryo-ET structural pattern mining by unsupervised deep iterative subtomogram clustering. Proc Natl Acad Sci U S A 2023;120:e2213149120. [PMID: 37027429 PMCID: PMC10104553 DOI: 10.1073/pnas.2213149120] [Citation(s) in RCA: 9] [Impact Index Per Article: 9.0] [Received: 08/01/2022] [Accepted: 02/24/2023] [Indexed: 04/08/2023]

Makarova KS, Wolf YI, Koonin EV. In Silico Approaches for Prediction of Anti-CRISPR Proteins. J Mol Biol 2023;435:168036. [PMID: 36868398 PMCID: PMC10073340 DOI: 10.1016/j.jmb.2023.168036] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/26/2022] [Revised: 02/18/2023] [Accepted: 02/23/2023] [Indexed: 03/05/2023]

Denise R, Babor J, Gerlt JA, de Crécy-Lagard V. Pyridoxal 5'-phosphate synthesis and salvage in Bacteria and Archaea: predicting pathway variant distributions and holes. Microb Genom 2023;9:mgen000926. [PMID: 36729913 PMCID: PMC9997740 DOI: 10.1099/mgen.0.000926] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 02/03/2023]

Thirumalai A, Ganapathy Raman P, Jayavelu T, Subramanian R. Bridging the gap between maleate hydratase, citraconase and isopropylmalate isomerase: Insights into the single broad-specific enzyme. Enzyme Microb Technol 2023;162:110140. [DOI: 10.1016/j.enzmictec.2022.110140] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/14/2022] [Revised: 09/23/2022] [Accepted: 10/08/2022] [Indexed: 11/13/2022]

Brown DC, Aggarwal N, Turner RJ. Exploration of the presence and abundance of multidrug resistance efflux genes in oil and gas environments. Microbiology (Reading) 2022;168. [PMID: 36190831 DOI: 10.1099/mic.0.001248] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 12/30/2022]

Exploring Bacterial Attributes That Underpin Symbiont Life in the Monogastric Gut. Appl Environ Microbiol 2022;88:e0112822. [PMID: 36036591 PMCID: PMC9499014 DOI: 10.1128/aem.01128-22] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Indexed: 11/20/2022]

Rhee KY, Jansen RS, Grundner C. Activity-based annotation: the emergence of systems biochemistry. Trends Biochem Sci 2022;47:785-794. [PMID: 35430135 PMCID: PMC9378515 DOI: 10.1016/j.tibs.2022.03.017] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Received: 01/11/2022] [Revised: 03/10/2022] [Accepted: 03/22/2022] [Indexed: 01/21/2023]

Yu M. Computational analysis on two putative mitochondrial protein-coding genes from the Emydura subglobosa genome: A functional annotation approach. PLoS One 2022;17:e0268031. [PMID: 35981005 PMCID: PMC9387794 DOI: 10.1371/journal.pone.0268031] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/17/2021] [Accepted: 04/21/2022] [Indexed: 11/19/2022]

de Crécy-Lagard V, Amorin de Hegedus R, Arighi C, Babor J, Bateman A, Blaby I, Blaby-Haas C, Bridge AJ, Burley SK, Cleveland S, Colwell LJ, Conesa A, Dallago C, Danchin A, de Waard A, Deutschbauer A, Dias R, Ding Y, Fang G, Friedberg I, Gerlt J, Goldford J, Gorelik M, Gyori BM, Henry C, Hutinet G, Jaroch M, Karp PD, Kondratova L, Lu Z, Marchler-Bauer A, Martin MJ, McWhite C, Moghe GD, Monaghan P, Morgat A, Mungall CJ, Natale DA, Nelson WC, O’Donoghue S, Orengo C, O’Toole KH, Radivojac P, Reed C, Roberts RJ, Rodionov D, Rodionova IA, Rudolf JD, Saleh L, Sheynkman G, Thibaud-Nissen F, Thomas PD, Uetz P, Vallenet D, Carter EW, Weigele PR, Wood V, Wood-Charlson EM, Xu J. A roadmap for the functional annotation of protein families: a community perspective. Database (Oxford) 2022;2022:6663924. [PMID: 35961013 PMCID: PMC9374478 DOI: 10.1093/database/baac062] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Received: 06/07/2022] [Revised: 06/28/2022] [Accepted: 08/03/2022] [Indexed: 12/23/2022]

Affiliation(s)

Valérie de Crécy-Lagard, Department of Microbiology and Cell Sciences, University of Florida, Gainesville, FL 32611, USA
Rocio Amorin de Hegedus, Genetics Institute, University of Florida, Gainesville, FL 32611, USA
Cecilia Arighi, Department of Computer and Information Sciences, University of Delaware, Newark, DE 19713, USA
Jill Babor, Department of Microbiology and Cell Sciences, University of Florida, Gainesville, FL 32611, USA
Alex Bateman, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton CB10 1SD, UK
Ian Blaby, US Department of Energy Joint Genome Institute, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
Crysten Blaby-Haas, Biology Department, Brookhaven National Laboratory, Upton, NY 11973, USA
Alan J Bridge, Swiss-Prot group, SIB Swiss Institute of Bioinformatics, Centre Medical Universitaire, Geneva 4 CH-1211, Switzerland
Stephen K Burley, RCSB Protein Data Bank, Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA
Stacey Cleveland, Department of Microbiology and Cell Sciences, University of Florida, Gainesville, FL 32611, USA
Lucy J Colwell, Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, UK
Ana Conesa, Spanish National Research Council, Institute for Integrative Systems Biology, Paterna, Valencia 46980, Spain
Christian Dallago, TUM (Technical University of Munich) Department of Informatics, Bioinformatics & Computational Biology, i12, Boltzmannstr. 3, Garching/Munich 85748, Germany
Antoine Danchin, School of Biomedical Sciences, Li Ka Shing Faculty of Medicine, The University of Hong Kong, 21 Sassoon Road, Pokfulam, SAR Hong Kong 999077, China
Anita de Waard, Research Collaboration Unit, Elsevier, Jericho, VT 05465, USA
Adam Deutschbauer, Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
Raquel Dias, Department of Microbiology and Cell Sciences, University of Florida, Gainesville, FL 32611, USA
Yousong Ding, Department of Medicinal Chemistry, Center for Natural Products, Drug Discovery and Development, University of Florida, Gainesville, FL 32610, USA
Gang Fang, NYU-Shanghai, Shanghai 200120, China
Iddo Friedberg, Department of Veterinary Microbiology and Preventive Medicine, Iowa State University, Ames, IA 50011, USA
John Gerlt, Institute for Genomic Biology and Departments of Biochemistry and Chemistry, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA
Joshua Goldford, Physics of Living Systems, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
Mark Gorelik, Department of Microbiology and Cell Sciences, University of Florida, Gainesville, FL 32611, USA
Benjamin M Gyori, Laboratory of Systems Pharmacology, Harvard Medical School, Boston, MA 02115, USA
Christopher Henry, Mathematics and Computer Science Division, Argonne National Laboratory, Argonne, IL 60439, USA
Geoffrey Hutinet, Department of Microbiology and Cell Sciences, University of Florida, Gainesville, FL 32611, USA
Marshall Jaroch, Department of Microbiology and Cell Sciences, University of Florida, Gainesville, FL 32611, USA
Peter D Karp, Bioinformatics Research Group, SRI International, Menlo Park, CA 94025, USA
Liudmyla Kondratova, Genetics Institute, University of Florida, Gainesville, FL 32611, USA
Zhiyong Lu, National Center for Biotechnology Information (NCBI), National Library of Medicine (NLM), National Institutes of Health (NIH), 8600 Rockville Pike, Bethesda, MD 20817, USA
Aron Marchler-Bauer, National Center for Biotechnology Information (NCBI), National Library of Medicine (NLM), National Institutes of Health (NIH), 8600 Rockville Pike, Bethesda, MD 20817, USA
Maria-Jesus Martin, European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton CB10 1SD, UK
Claire McWhite, Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ 08540, USA
Gaurav D Moghe, Plant Biology Section, School of Integrative Plant Science, Cornell University, Ithaca, NY 14853, USA
Paul Monaghan, Department of Agricultural Education and Communication, University of Florida, Gainesville, FL 32611, USA
Anne Morgat, Swiss-Prot group, SIB Swiss Institute of Bioinformatics, Centre Medical Universitaire, Geneva 4 CH-1211, Switzerland
Christopher J Mungall, Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
Darren A Natale, Georgetown University Medical Center, Washington, DC 20007, USA
William C Nelson, Biological Sciences Division, Pacific Northwest National Laboratories, Richland, WA 99354, USA
Seán O’Donoghue, School of Biotechnology and Biomolecular Sciences, University of NSW, Sydney, NSW 2052, Australia
Christine Orengo, Department of Structural and Molecular Biology, University College London, London WC1E 6BT, UK
Katherine H O’Toole, New England Biolabs, Ipswich, MA 01938, USA
Predrag Radivojac, Khoury College of Computer Sciences, Northeastern University, Boston, MA 02115, USA
Colbie Reed, Department of Microbiology and Cell Sciences, University of Florida, Gainesville, FL 32611, USA
Richard J Roberts, New England Biolabs, Ipswich, MA 01938, USA
Dmitri Rodionov, Sanford Burnham Prebys Medical Discovery Institute, La Jolla, CA 92037, USA
Irina A Rodionova, Department of Bioengineering, Division of Engineering, University of California at San Diego, La Jolla, CA 92093-0412, USA
Jeffrey D Rudolf, Department of Chemistry, University of Florida, Gainesville, FL 32611, USA
Lana Saleh, New England Biolabs, Ipswich, MA 01938, USA
Gloria Sheynkman, Department of Molecular Physiology and Biological Physics, University of Virginia, Charlottesville, VA, USA
Francoise Thibaud-Nissen, National Center for Biotechnology Information (NCBI), National Library of Medicine (NLM), National Institutes of Health (NIH), 8600 Rockville Pike, Bethesda, MD 20817, USA
Paul D Thomas, Department of Population and Public Health Sciences, University of Southern California, Los Angeles, CA 90033, USA
Peter Uetz, Center for Biological Data Science, Virginia Commonwealth University, Richmond, VA 23284, USA
David Vallenet, LABGeM, Génomique Métabolique, CEA, Genoscope, Institut François Jacob, Université d’Évry, Université Paris-Saclay, CNRS, Evry 91057, France
Erica Watson Carter, Department of Plant Pathology, University of Florida Citrus Research and Education Center, 700 Experiment Station Rd., Lake Alfred, FL 33850, USA
Peter R Weigele, New England Biolabs, Ipswich, MA 01938, USA
Valerie Wood, Department of Biochemistry, University of Cambridge, Cambridge CB2 1GA, UK
Elisha M Wood-Charlson, Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA 94720, USA
Jin Xu, Department of Plant Pathology, University of Florida Citrus Research and Education Center, 700 Experiment Station Rd., Lake Alfred, FL 33850, USA


Cho KT, Sen TZ, Andorf CM. Predicting Tissue-Specific mRNA and Protein Abundance in Maize: A Machine Learning Approach. Front Artif Intell 2022;5:830170. [PMID: 35719692 PMCID: PMC9204276 DOI: 10.3389/frai.2022.830170] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/06/2021] [Accepted: 04/26/2022] [Indexed: 11/13/2022]

Vanni C, Schechter MS, Acinas SG, Barberán A, Buttigieg PL, Casamayor EO, Delmont TO, Duarte CM, Eren AM, Finn RD, Kottmann R, Mitchell A, Sánchez P, Siren K, Steinegger M, Gloeckner FO, Fernàndez-Guerra A. Unifying the known and unknown microbial coding sequence space. eLife 2022;11:67667. [PMID: 35356891 PMCID: PMC9132574 DOI: 10.7554/elife.67667] [Citation(s) in RCA: 24] [Impact Index Per Article: 12.0] [Received: 02/18/2021] [Accepted: 03/30/2022] [Indexed: 12/02/2022]

Abstract

Genes of unknown function are among the biggest challenges in molecular biology, especially in microbial systems, where 40–60% of the predicted genes are unknown. Despite previous attempts, systematic approaches to include the unknown fraction into analytical workflows are still lacking. Here, we present a conceptual framework, its translation into the computational workflow AGNOSTOS and a demonstration on how we can bridge the known-unknown gap in genomes and metagenomes. By analyzing 415,971,742 genes predicted from 1749 metagenomes and 28,941 bacterial and archaeal genomes, we quantify the extent of the unknown fraction, its diversity, and its relevance across multiple organisms and environments. The unknown sequence space is exceptionally diverse, phylogenetically more conserved than the known fraction and predominantly taxonomically restricted at the species level. From the 71 M genes identified to be of unknown function, we compiled a collection of 283,874 lineage-specific genes of unknown function for Cand. Patescibacteria (also known as Candidate Phyla Radiation, CPR), which provides a significant resource to expand our understanding of their unusual biology. Finally, by identifying a target gene of unknown function for antibiotic resistance, we demonstrate how we can enable the generation of hypotheses that can be used to augment experimental data.

It is estimated that scientists do not know what half of microbial genes actually do. When these genes are discovered in microorganisms grown in the lab or found in environmental samples, it is not possible to identify what their roles are. Many of these genes are excluded from further analyses for these reasons, meaning that the study of microbial genes tends to be limited to genes that have already been described.

These limitations hinder research into microbiology, because information from newly discovered genes cannot be integrated to better understand how these organisms work. Experiments to understand what role these genes have in the microorganisms are labor-intensive, so new analytical strategies are needed.

To do this, Vanni et al. developed a new framework to categorize genes with unknown roles, and a computational workflow to integrate them into traditional analyses. When this approach was applied to over 400 million microbial genes (both with known and unknown roles), it showed that the share of genes with unknown functions is only about 30 per cent, smaller than previously thought. The analysis also showed that these genes are very diverse, revealing a huge space for future research and potential applications. Combining their approach with experimental data, Vanni et al. were able to identify a gene with a previously unknown purpose that could be involved in antibiotic resistance.

This system could be useful for other scientists studying microorganisms to get a more complete view of microbial systems. In future, it may also be used to analyze the genetics of other organisms, such as plants and animals.


Affiliation(s)

Chiara Vanni, Microbial Genomics and Bioinformatics Research Group, Max Planck Institute for Marine Microbiology, Bremen, Germany
Matthew S Schechter, Department of Medicine, University of Chicago, Chicago, United States
Silvia G Acinas, Department of Marine Biology and Oceanography, Institut de Ciències del Mar-CMIMA (CSIC), Barcelona, Spain
Albert Barberán, Department of Environmental Science, University of Arizona, Tucson, United States
Pier Luigi Buttigieg, Helmholtz Centre for Polar and Marine Research, Alfred Wegener Institute, Bremerhaven, Germany
Emilio O Casamayor, Center for Advanced Studies of Blanes CEAB-CSIC, Spanish Council for Research, Blanes, Spain
Tom O Delmont, Génomique Métabolique, Genoscope, Institut François Jacob, CEA, CNRS, Paris, France
Carlos M Duarte, Computational Bioscience Research Center, King Abdullah University of Science and Technology, Thuwal, Saudi Arabia
A Murat Eren, Department of Medicine, University of Chicago, Chicago, United States
Robert D Finn, European Bioinformatics Institute (EMBL-EBI), European Molecular Biology Laboratory, Hinxton, United Kingdom
Renzo Kottmann, Microbial Genomics and Bioinformatics Research Group, Max Planck Institute for Marine Microbiology, Bremen, Germany
Alex Mitchell, European Bioinformatics Institute (EMBL-EBI), European Molecular Biology Laboratory, Hinxton, United Kingdom
Pablo Sánchez, Department of Marine Biology and Oceanography, Institut de Ciències del Mar-CMIMA (CSIC), Barcelona, Spain
Kimmo Siren, Section for Evolutionary Genomics, The GLOBE Institute, University of Copenhagen, Copenhagen, Denmark
Martin Steinegger, School of Biological Sciences, Seoul National University, Seoul, Republic of Korea
Frank Oliver Gloeckner, MARUM, Helmholtz Center for Polar and Marine Research, University of Bremen, Bremen, Germany
Antonio Fernàndez-Guerra, Lundbeck Foundation GeoGenetics Centre, GLOBE Institute, University of Copenhagen, Copenhagen, Denmark


A deep learning model to detect novel pore-forming proteins. Sci Rep 2022;12:2013. [PMID: 35132124 PMCID: PMC8821639 DOI: 10.1038/s41598-022-05970-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/24/2021] [Accepted: 01/12/2022] [Indexed: 11/09/2022]

Takihara H, Miura N, Aoki-Kinoshita KF, Okuda S. Functional glyco-metagenomics elucidates the role of glycan-related genes in environments. BMC Bioinformatics 2021;22:505. [PMID: 34663219 PMCID: PMC8522060 DOI: 10.1186/s12859-021-04425-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/19/2021] [Accepted: 10/04/2021] [Indexed: 11/20/2022]

Abstract

BACKGROUND

Glycan-related genes play a fundamental role in various processes for energy acquisition and homeostasis maintenance while adapting to the environment in which the organism exists; however, their role in the microbiome in the environment is unclear.

METHODS

Sequence alignment was performed between known glycan-related genes and complete genomes of microorganisms, and optimal parameters for identifying glycan-related genes were determined based on the alignments. Using the constructed scheme (> 90% of identity and > 25 aa of alignment length), glycan-related genes in various environments were identified from 198 different metagenome data.

RESULTS

As a result, we identified 86.73 million glycan-related genes from the metagenome data. Among the 12 environments classified in this study, the percentage of glycan-related genes was high in the human-associated environment, suggesting that these environments utilize glycan metabolism better than other environments. On the other hand, the relative abundances of both glycoside hydrolases and glycosyltransferases surprisingly had a coverage of over 80% in all the environments. These glycoside hydrolases and glycosyltransferases were classified into two groups of (1) general enzyme families identified in various environments and (2) specific enzymes found only in certain environments. The general enzyme families were mostly from genes involved in monosaccharide metabolism, and most of the specific enzymes were polysaccharide degrading enzymes.

CONCLUSION

These findings suggest that environmental microorganisms could change the composition of their glycan-related genes to adapt the processes involved in acquiring energy from glycans in their environments. Our functional glyco-metagenomics approach has made it possible to clarify the relationship between the environment and genes from the perspective of carbohydrates, and the existence of glycan-related genes that exist specifically in the environment.
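The identification scheme quoted in the Methods above (> 90% identity and > 25 aa alignment length) amounts to a two-threshold filter over alignment hits. The following is a minimal sketch of such a filter, not the authors' pipeline: it assumes hits arrive as tab-separated rows in the common BLAST-style tabular layout (query, subject, percent identity, alignment length, ...), and the names `Hit`, `parse_hits`, `glycan_related`, and the example rows are hypothetical.

```python
from typing import Iterable, Iterator, List, NamedTuple

# Thresholds quoted in the abstract; both comparisons are strict (">").
MIN_IDENTITY = 90.0   # percent identity
MIN_ALIGN_LEN = 25    # alignment length in amino acids


class Hit(NamedTuple):
    query: str
    subject: str
    identity: float
    length: int


def parse_hits(lines: Iterable[str]) -> Iterator[Hit]:
    """Parse tab-separated rows: query, subject, % identity, alignment length, ...

    The column layout is an assumption (BLAST-style tabular output);
    the abstract does not state which aligner or format was used.
    """
    for line in lines:
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comment lines
        fields = line.split("\t")
        yield Hit(fields[0], fields[1], float(fields[2]), int(fields[3]))


def glycan_related(hits: Iterable[Hit]) -> List[Hit]:
    """Keep only hits that exceed both thresholds."""
    return [h for h in hits
            if h.identity > MIN_IDENTITY and h.length > MIN_ALIGN_LEN]


# Hypothetical example rows, for illustration only.
example = [
    "read1\tGH13_ref\t95.2\t40",   # passes both thresholds
    "read2\tGT2_ref\t88.0\t120",   # identity too low
    "read3\tGH2_ref\t99.0\t20",    # alignment too short
]
kept = glycan_related(parse_hits(example))
print([h.query for h in kept])
```

Requiring both conditions at once discards long but divergent matches as well as short, near-identical ones, which is presumably the point of pairing a high identity cutoff with a minimum length.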

Zeng Y, Howe G, Yi K, Zeng X, Zhang J, Chang YW, Xu M. UNSUPERVISED DOMAIN ALIGNMENT BASED OPEN SET STRUCTURAL RECOGNITION OF MACROMOLECULES CAPTURED BY CRYO-ELECTRON TOMOGRAPHY. PROCEEDINGS. INTERNATIONAL CONFERENCE ON IMAGE PROCESSING 2021;2021:106-110. [PMID: 35350462 PMCID: PMC8959888 DOI: 10.1109/icip42928.2021.9506205] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]

Discovery and mining of enzymes from the human gut microbiome. Trends Biotechnol 2021;40:240-254. [PMID: 34304905 DOI: 10.1016/j.tibtech.2021.06.008] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2021] [Revised: 06/24/2021] [Accepted: 06/25/2021] [Indexed: 12/19/2022]

de Rond T, Asay JE, Moore BS. Co-occurrence of enzyme domains guides the discovery of an oxazolone synthetase. Nat Chem Biol 2021;17:794-799. [PMID: 34099916 PMCID: PMC8238888 DOI: 10.1038/s41589-021-00808-4] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2020] [Accepted: 04/29/2021] [Indexed: 02/04/2023]

Key amino acid residues in homoserine-acetyltransferase from M. tuberculosis give insight into the evolution of MetX family of enzymes - HAT, SAT and HST. Biochimie 2021;189:13-25. [PMID: 34090964 DOI: 10.1016/j.biochi.2021.05.016] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2021] [Revised: 05/23/2021] [Accepted: 05/30/2021] [Indexed: 11/22/2022]

Poudel S, Cope AL, O'Dell KB, Guss AM, Seo H, Trinh CT, Hettich RL. Identification and characterization of proteins of unknown function (PUFs) in Clostridium thermocellum DSM 1313 strains as potential genetic engineering targets. BIOTECHNOLOGY FOR BIOFUELS 2021;14:116. [PMID: 33971924 PMCID: PMC8112048 DOI: 10.1186/s13068-021-01964-4] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/29/2020] [Accepted: 04/26/2021] [Indexed: 05/13/2023]

Black KA, Duan L, Mandyoli L, Selbach BP, Xu W, Ehrt S, Sacchettini JC, Rhee KY. Metabolic bifunctionality of Rv0812 couples folate and peptidoglycan biosynthesis in Mycobacterium tuberculosis. J Exp Med 2021;218:212052. [PMID: 33950161 PMCID: PMC8105722 DOI: 10.1084/jem.20191957] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2019] [Revised: 02/16/2021] [Accepted: 03/30/2021] [Indexed: 11/04/2022] Open

Bergès C, Cahoreau E, Millard P, Enjalbert B, Dinclaux M, Heuillet M, Kulyk H, Gales L, Butin N, Chazalviel M, Palama T, Guionnet M, Sokol S, Peyriga L, Bellvert F, Heux S, Portais JC. Exploring the Glucose Fluxotype of the E. coli y-ome Using High-Resolution Fluxomics. Metabolites 2021;11:metabo11050271. [PMID: 33926117 PMCID: PMC8145925 DOI: 10.3390/metabo11050271] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2021] [Revised: 04/16/2021] [Accepted: 04/23/2021] [Indexed: 01/26/2023] Open

Affiliation(s)

Cécilia Bergès: Toulouse Biotechnology Institute (TBI), Université de Toulouse, CNRS, INRAE, INSA, 31077 Toulouse, France; MetaToul-MetaboHUB, National Infrastructure of Metabolomics & Fluxomics (ANR-11-INBS-0010), 31077 Toulouse, France
Edern Cahoreau: Toulouse Biotechnology Institute (TBI), Université de Toulouse, CNRS, INRAE, INSA, 31077 Toulouse, France; MetaToul-MetaboHUB, National Infrastructure of Metabolomics & Fluxomics (ANR-11-INBS-0010), 31077 Toulouse, France
Pierre Millard: Toulouse Biotechnology Institute (TBI), Université de Toulouse, CNRS, INRAE, INSA, 31077 Toulouse, France
Brice Enjalbert: Toulouse Biotechnology Institute (TBI), Université de Toulouse, CNRS, INRAE, INSA, 31077 Toulouse, France
Mickael Dinclaux: Toulouse Biotechnology Institute (TBI), Université de Toulouse, CNRS, INRAE, INSA, 31077 Toulouse, France
Maud Heuillet: Toulouse Biotechnology Institute (TBI), Université de Toulouse, CNRS, INRAE, INSA, 31077 Toulouse, France; MetaToul-MetaboHUB, National Infrastructure of Metabolomics & Fluxomics (ANR-11-INBS-0010), 31077 Toulouse, France
Hanna Kulyk: Toulouse Biotechnology Institute (TBI), Université de Toulouse, CNRS, INRAE, INSA, 31077 Toulouse, France; MetaToul-MetaboHUB, National Infrastructure of Metabolomics & Fluxomics (ANR-11-INBS-0010), 31077 Toulouse, France
Lara Gales: Toulouse Biotechnology Institute (TBI), Université de Toulouse, CNRS, INRAE, INSA, 31077 Toulouse, France; MetaToul-MetaboHUB, National Infrastructure of Metabolomics & Fluxomics (ANR-11-INBS-0010), 31077 Toulouse, France
Noémie Butin: Toulouse Biotechnology Institute (TBI), Université de Toulouse, CNRS, INRAE, INSA, 31077 Toulouse, France; MetaToul-MetaboHUB, National Infrastructure of Metabolomics & Fluxomics (ANR-11-INBS-0010), 31077 Toulouse, France; RESTORE, Université de Toulouse, Inserm U1031, CNRS 5070, UPS, EFS, 31100 Toulouse, France
Maxime Chazalviel: Toxalim (Research Centre in Food Toxicology), UMR1331, Université de Toulouse, INRAE, ENVT, INP-Purpan, UPS, 31300 Toulouse, France
Tony Palama: Toulouse Biotechnology Institute (TBI), Université de Toulouse, CNRS, INRAE, INSA, 31077 Toulouse, France; MetaToul-MetaboHUB, National Infrastructure of Metabolomics & Fluxomics (ANR-11-INBS-0010), 31077 Toulouse, France
Matthieu Guionnet: Toulouse Biotechnology Institute (TBI), Université de Toulouse, CNRS, INRAE, INSA, 31077 Toulouse, France; MetaToul-MetaboHUB, National Infrastructure of Metabolomics & Fluxomics (ANR-11-INBS-0010), 31077 Toulouse, France
Sergueï Sokol: Toulouse Biotechnology Institute (TBI), Université de Toulouse, CNRS, INRAE, INSA, 31077 Toulouse, France
Lindsay Peyriga: Toulouse Biotechnology Institute (TBI), Université de Toulouse, CNRS, INRAE, INSA, 31077 Toulouse, France; MetaToul-MetaboHUB, National Infrastructure of Metabolomics & Fluxomics (ANR-11-INBS-0010), 31077 Toulouse, France
Floriant Bellvert: Toulouse Biotechnology Institute (TBI), Université de Toulouse, CNRS, INRAE, INSA, 31077 Toulouse, France; MetaToul-MetaboHUB, National Infrastructure of Metabolomics & Fluxomics (ANR-11-INBS-0010), 31077 Toulouse, France
Stéphanie Heux: Toulouse Biotechnology Institute (TBI), Université de Toulouse, CNRS, INRAE, INSA, 31077 Toulouse, France
Jean-Charles Portais (corresponding author): Toulouse Biotechnology Institute (TBI), Université de Toulouse, CNRS, INRAE, INSA, 31077 Toulouse, France; MetaToul-MetaboHUB, National Infrastructure of Metabolomics & Fluxomics (ANR-11-INBS-0010), 31077 Toulouse, France; RESTORE, Université de Toulouse, Inserm U1031, CNRS 5070, UPS, EFS, 31100 Toulouse, France

Current knowledge and recent advances in understanding metabolism of the model cyanobacterium Synechocystis sp. PCC 6803. Biosci Rep 2021;40:222317. [PMID: 32149336 PMCID: PMC7133116 DOI: 10.1042/bsr20193325] [Citation(s) in RCA: 31] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2020] [Revised: 03/05/2020] [Accepted: 03/06/2020] [Indexed: 02/06/2023] Open

Abstract

Cyanobacteria are key organisms in the global ecosystem, useful models for studying metabolic and physiological processes conserved in photosynthetic organisms, and potential renewable platforms for production of chemicals. Characterizing cyanobacterial metabolism and physiology is key to understanding their role in the environment and unlocking their potential for biotechnology applications. Many aspects of cyanobacterial biology differ from heterotrophic bacteria. For example, most cyanobacteria incorporate a series of internal thylakoid membranes where both oxygenic photosynthesis and respiration occur, while CO2 fixation takes place in specialized compartments termed carboxysomes. In this review, we provide a comprehensive summary of our knowledge on cyanobacterial physiology and the pathways in Synechocystis sp. PCC 6803 (Synechocystis) involved in biosynthesis of sugar-based metabolites, amino acids, nucleotides, lipids, cofactors, vitamins, isoprenoids, pigments and cell wall components, in addition to the proteins involved in metabolite transport. While some pathways are conserved between model cyanobacteria, such as Synechocystis, and model heterotrophic bacteria like Escherichia coli, many enzymes and/or pathways involved in the biosynthesis of key metabolites in cyanobacteria have not been completely characterized. These include pathways required for biosynthesis of chorismate and membrane lipids, nucleotides, several amino acids, vitamins and cofactors, and isoprenoids such as plastoquinone, carotenoids, and tocopherols. Moreover, our understanding of photorespiration, lipopolysaccharide assembly and transport, and degradation of lipids, sucrose, most vitamins and amino acids, and haem, is incomplete. We discuss tools that may aid our understanding of cyanobacterial metabolism, notably CyanoSource, a barcoded library of targeted Synechocystis mutants, which will significantly accelerate characterization of individual proteins.

Bioinformatic and experimental evidence for suicidal and catalytic plant THI4s. Biochem J 2020;477:2055-2069. [PMID: 32441748 DOI: 10.1042/bcj20200297] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2020] [Revised: 05/20/2020] [Accepted: 05/21/2020] [Indexed: 12/14/2022]

Kornfuehrer T, Romanowski S, de Crécy-Lagard V, Hanson AD, Eustáquio AS. An Enzyme Containing the Conserved Domain of Unknown Function DUF62 Acts as a Stereoselective (R_s,S_c)-S-Adenosylmethionine Hydrolase. Chembiochem 2020;21:3495-3499. [PMID: 32776704 DOI: 10.1002/cbic.202000349] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2020] [Revised: 08/07/2020] [Indexed: 11/09/2022]

Thioproline formation as a driver of formaldehyde toxicity in Escherichia coli. Biochem J 2020;477:1745-1757. [PMID: 32301498 DOI: 10.1042/bcj20200198] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2020] [Revised: 04/14/2020] [Accepted: 04/17/2020] [Indexed: 12/14/2022]

Abstract

Formaldehyde (HCHO) is a reactive carbonyl compound that formylates and cross-links proteins, DNA, and small molecules. It is of specific concern as a toxic intermediate in the design of engineered pathways involving methanol oxidation or formate reduction. The interest in engineering these pathways is not, however, matched by engineering-relevant information on precisely why HCHO is toxic or on what damage-control mechanisms cells deploy to manage HCHO toxicity. The only well-defined mechanism for managing HCHO toxicity is formaldehyde dehydrogenase-mediated oxidation to formate, which is counterproductive if HCHO is a desired pathway intermediate. We therefore sought alternative HCHO damage-control mechanisms via comparative genomic analysis. This analysis associated homologs of the Escherichia coli pepP gene with HCHO-related one-carbon metabolism. Furthermore, deleting pepP increased the sensitivity of E. coli to supplied HCHO but not other carbonyl compounds. PepP is a proline aminopeptidase that cleaves peptides of the general formula X-Pro-Y, yielding X + Pro-Y. HCHO is known to react spontaneously with cysteine to form the close proline analog thioproline (thiazolidine-4-carboxylate), which is incorporated into proteins and hence into proteolytic peptides. We therefore hypothesized that certain thioproline-containing peptides are toxic and that PepP cleaves these aberrant peptides. Supporting this hypothesis, PepP cleaved the model peptide Ala-thioproline-Ala as efficiently as Ala-Pro-Ala in vitro and in vivo, and deleting pepP increased sensitivity to supplied thioproline. Our data thus (i) provide biochemical genetic evidence that thioproline formation contributes substantially to HCHO toxicity and (ii) make PepP a candidate damage-control enzyme for engineered pathways having HCHO as an intermediate.

Liu Z, Feng J, Yu B, Ma Q, Liu B. The functional determinants in the organization of bacterial genomes. Brief Bioinform 2020;22:5892344. [PMID: 32793986 DOI: 10.1093/bib/bbaa172] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2020] [Revised: 06/30/2020] [Accepted: 07/07/2020] [Indexed: 12/13/2022] Open

Prifti E, Chevaleyre Y, Hanczar B, Belda E, Danchin A, Clément K, Zucker JD. Interpretable and accurate prediction models for metagenomics data. Gigascience 2020;9:giaa010. [PMID: 32150601 PMCID: PMC7062144 DOI: 10.1093/gigascience/giaa010] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2019] [Revised: 09/12/2019] [Accepted: 01/27/2020] [Indexed: 01/28/2023] Open

Abstract

BACKGROUND

Microbiome biomarker discovery for patient diagnosis, prognosis, and risk evaluation is attracting broad interest. Selected groups of microbial features provide signatures that characterize host disease states such as cancer or cardio-metabolic diseases. Yet, the current predictive models stemming from machine learning still behave as black boxes and seldom generalize well. Their interpretation is challenging for physicians and biologists, which makes them difficult to trust and use routinely in the physician-patient decision-making process. Novel methods that provide interpretability and biological insight are needed. Here, we introduce "predomics", an original machine learning approach inspired by microbial ecosystem interactions that is tailored for metagenomics data. It discovers accurate predictive signatures and provides unprecedented interpretability. The decision provided by the predictive model is based on a simple, yet powerful score computed by adding, subtracting, or dividing cumulative abundance of microbiome measurements.

RESULTS

Tested on >100 datasets, we demonstrate that predomics models are simple and highly interpretable. Even with such simplicity, they are at least as accurate as state-of-the-art methods. The family of best models, discovered during the learning process, offers the ability to distil biological information and to decipher the predictability signatures of the studied condition. In a proof-of-concept experiment, we successfully predicted body corpulence and metabolic improvement after bariatric surgery using pre-surgery microbiome data.

CONCLUSIONS

Predomics is a new algorithm that helps in providing reliable and trustworthy diagnostic decisions in the microbiome field. Predomics is in accord with societal and legal requirements that plead for an explainable artificial intelligence approach in the medical field.
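The abstract above describes a decision score "computed by adding, subtracting, or dividing cumulative abundance of microbiome measurements". A minimal sketch of that idea follows, under stated assumptions: it is not the predomics package or its API, and the function names, feature names, abundances, and threshold are all hypothetical.

```python
from typing import Mapping, Sequence


def cumulative(sample: Mapping[str, float], features: Sequence[str]) -> float:
    """Sum the abundances of the selected features (absent features count as 0)."""
    return sum(sample.get(f, 0.0) for f in features)


def difference_score(sample: Mapping[str, float],
                     plus: Sequence[str],
                     minus: Sequence[str]) -> float:
    """Cumulative abundance of one feature group minus that of another."""
    return cumulative(sample, plus) - cumulative(sample, minus)


def ratio_score(sample: Mapping[str, float],
                numerator: Sequence[str],
                denominator: Sequence[str],
                eps: float = 1e-9) -> float:
    """Ratio of two cumulative abundances; eps (an assumption) avoids division by zero."""
    return cumulative(sample, numerator) / (cumulative(sample, denominator) + eps)


def predict(score: float, threshold: float) -> bool:
    """Assign the positive class when the score exceeds the threshold."""
    return score > threshold


# Hypothetical relative abundances for one sample, for illustration only.
sample = {"species_A": 0.30, "species_B": 0.10, "species_C": 0.05}

diff = difference_score(sample, plus=["species_A"], minus=["species_B", "species_C"])
ratio = ratio_score(sample, ["species_A"], ["species_B", "species_C"])
print(predict(diff, threshold=0.1))
```

A score of this form can be read directly: each feature either adds to or subtracts from the total, which is the kind of interpretability the abstract contrasts with black-box models.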

Wang PH, Fujishima K, Berhanu S, Kuruma Y, Jia TZ, Khusnutdinova AN, Yakunin AF, McGlynn SE. A Bifunctional Polyphosphate Kinase Driving the Regeneration of Nucleoside Triphosphate and Reconstituted Cell-Free Protein Synthesis. ACS Synth Biol 2020;9:36-42. [PMID: 31829622 DOI: 10.1021/acssynbio.9b00456] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]

Taxonomic Distribution of Cytochrome P450 Monooxygenases (CYPs) among the Budding Yeasts (Sub-Phylum Saccharomycotina). Microorganisms 2019;7:microorganisms7080247. [PMID: 31398949 PMCID: PMC6723986 DOI: 10.3390/microorganisms7080247] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2019] [Revised: 08/06/2019] [Accepted: 08/07/2019] [Indexed: 12/14/2022] Open

Discovery of novel carbohydrate-active enzymes through the rational exploration of the protein sequences space. Proc Natl Acad Sci U S A 2019;116:6063-6068. [PMID: 30850540 PMCID: PMC6442616 DOI: 10.1073/pnas.1815791116] [Citation(s) in RCA: 128] [Impact Index Per Article: 25.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/16/2023] Open

Sun J, Sigler CL, Beaudoin GAW, Joshi J, Patterson JA, Cho KH, Ralat MA, Gregory JF, Clark DG, Deng Z, Colquhoun TA, Hanson AD. Parts-Prospecting for a High-Efficiency Thiamin Thiazole Biosynthesis Pathway. PLANT PHYSIOLOGY 2019;179:958-968. [PMID: 30337452 PMCID: PMC6393793 DOI: 10.1104/pp.18.01085] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/31/2018] [Accepted: 10/10/2018] [Indexed: 05/04/2023]

Towards functional characterization of archaeal genomic dark matter. Biochem Soc Trans 2019;47:389-398. [PMID: 30710061 PMCID: PMC6393860 DOI: 10.1042/bst20180560] [Citation(s) in RCA: 29] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2018] [Revised: 01/08/2019] [Accepted: 01/09/2019] [Indexed: 01/07/2023]

Griesemer M, Kimbrel JA, Zhou CE, Navid A, D'haeseleer P. Combining multiple functional annotation tools increases coverage of metabolic annotation. BMC Genomics 2018;19:948. [PMID: 30567498 PMCID: PMC6299973 DOI: 10.1186/s12864-018-5221-9] [Citation(s) in RCA: 27] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2018] [Accepted: 11/05/2018] [Indexed: 12/15/2022] Open

Abstract

Background

Genome-scale metabolic modeling is a cornerstone of systems biology analysis of microbial organisms and communities, yet these genome-scale modeling efforts are invariably based on incomplete functional annotations. Annotated genomes typically contain 30–50% of genes without functional annotation, severely limiting our knowledge of the “parts lists” that the organisms have at their disposal. These incomplete annotations may be sufficient to derive a model of a core set of well-studied metabolic pathways that support growth in pure culture. However, pathways important for growth on unusual metabolites exchanged in complex microbial communities are often less understood, resulting in missing functional annotations in newly sequenced genomes.

Results

Here, we present results on a comprehensive reannotation of 27 bacterial reference genomes, focusing on enzymes with EC numbers annotated by KEGG, RAST, EFICAz, and the BRENDA enzyme database, and on membrane transport annotations by TransportDB, KEGG and RAST. Our analysis shows that annotation using multiple tools can result in a drastically larger metabolic network reconstruction, adding on average 40% more EC numbers, 3–8 times more substrate-specific transporters, and 37% more metabolic genes. These results are even more pronounced for bacterial species that are phylogenetically distant from well-studied model organisms such as E. coli.

Conclusions

Metabolic annotations are often incomplete and inconsistent. Combining multiple functional annotation tools can greatly improve genome coverage and metabolic network size, especially for non-model organisms and non-core pathways.

Electronic supplementary material

The online version of this article (10.1186/s12864-018-5221-9) contains supplementary material, which is available to authorized users.

Linder T. Phenotypical characterisation of a putative ω-amino acid transaminase in the yeast Scheffersomyces stipitis. Arch Microbiol 2018;201:185-192. [PMID: 30519708 PMCID: PMC6514085 DOI: 10.1007/s00203-018-1608-x] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2018] [Revised: 11/30/2018] [Accepted: 12/03/2018] [Indexed: 01/05/2023]

Molecular Factors of Hypochlorite Tolerance in the Hypersaline Archaeon Haloferax volcanii. Genes (Basel) 2018;9:genes9110562. [PMID: 30463375 PMCID: PMC6267482 DOI: 10.3390/genes9110562] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2018] [Revised: 11/07/2018] [Accepted: 11/13/2018] [Indexed: 12/17/2022] Open

Rapid, Parallel Identification of Catabolism Pathways of Lignin-Derived Aromatic Compounds in Novosphingobium aromaticivorans. Appl Environ Microbiol 2018;84:AEM.01185-18. [PMID: 30217841 DOI: 10.1128/aem.01185-18] [Citation(s) in RCA: 29] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2018] [Accepted: 09/05/2018] [Indexed: 11/20/2022] Open

A novel chlorination-induced ribonuclease YabJ from Staphylococcus aureus. Biosci Rep 2018;38:BSR20180768. [PMID: 30201692 PMCID: PMC6435465 DOI: 10.1042/bsr20180768] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2018] [Revised: 08/15/2018] [Accepted: 08/23/2018] [Indexed: 01/09/2023] Open

de Crécy-Lagard V, Haas D, Hanson AD. Newly-discovered enzymes that function in metabolite damage-control. Curr Opin Chem Biol 2018;47:101-108. [PMID: 30268903 DOI: 10.1016/j.cbpa.2018.09.014] [Citation(s) in RCA: 27] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2018] [Revised: 08/19/2018] [Accepted: 09/11/2018] [Indexed: 01/26/2023]

Zallot R, Oberg NO, Gerlt JA. 'Democratized' genomic enzymology web tools for functional assignment. Curr Opin Chem Biol 2018;47:77-85. [PMID: 30268904 DOI: 10.1016/j.cbpa.2018.09.009] [Citation(s) in RCA: 94] [Impact Index Per Article: 15.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2018] [Revised: 09/10/2018] [Accepted: 09/11/2018] [Indexed: 12/24/2022]

Gibson CL, Codreanu SG, Schrimpe-Rutledge AC, Retzlaff CL, Wright J, Mortlock DP, Sherrod SD, McLean JA, Blakely RD. Global untargeted serum metabolomic analyses nominate metabolic pathways responsive to loss of expression of the orphan metallo β-lactamase, MBLAC1. Mol Omics 2018;14:142-155. [PMID: 29868674 PMCID: PMC6015503 DOI: 10.1039/c7mo00022g] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023]

Komárek J, Ivanov Kavková E, Houser J, Horáčková A, Ždánská J, Demo G, Wimmerová M. Structure and properties of AB21, a novel Agaricus bisporus protein with structural relation to bacterial pore-forming toxins. Proteins 2018;86:897-911. [DOI: 10.1002/prot.25522] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2018] [Revised: 04/23/2018] [Accepted: 04/26/2018] [Indexed: 12/13/2022]