Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Freilich S, Spriggs RV, George RA, Al-Lazikani B, Swindells M, Thornton JM. The complement of enzymatic sets in different species. J Mol Biol 2005;349:745-63. [PMID: 15896806 DOI: 10.1016/j.jmb.2005.04.027] [Citation(s) in RCA: 32] [Impact Index Per Article: 1.7] [Received: 01/19/2005] [Revised: 04/10/2005] [Accepted: 04/12/2005] [Indexed: 11/17/2022]

Cited by Other Article(s)

Solano YJ, Kiser PD. Double-duty isomerases: a case study of isomerization-coupled enzymatic catalysis. Trends Biochem Sci 2024:S0968-0004(24)00107-5. [PMID: 38760195 DOI: 10.1016/j.tibs.2024.04.007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/15/2024] [Revised: 04/08/2024] [Accepted: 04/23/2024] [Indexed: 05/19/2024]

Bartuv R, Berihu M, Medina S, Salim S, Feygenberg O, Faigenboim-Doron A, Zhimo VY, Abdelfattah A, Piombo E, Wisniewski M, Freilich S, Droby S. Functional analysis of the apple fruit microbiome based on shotgun metagenomic sequencing of conventional and organic orchard samples. Environ Microbiol 2023;25:1728-1746. [PMID: 36807446 DOI: 10.1111/1462-2920.16353] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/18/2022] [Accepted: 02/16/2023] [Indexed: 02/23/2023]

Berihu M, Somera TS, Malik A, Medina S, Piombo E, Tal O, Cohen M, Ginatt A, Ofek-Lalzar M, Doron-Faigenboim A, Mazzola M, Freilich S. A framework for the targeted recruitment of crop-beneficial soil taxa based on network analysis of metagenomics data. Microbiome 2023;11:8. [PMID: 36635724 PMCID: PMC9835355 DOI: 10.1186/s40168-022-01438-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/14/2022] [Accepted: 11/28/2022] [Indexed: 06/17/2023]

Abstract

BACKGROUND

The design of ecologically sustainable and plant-beneficial soil systems is a key goal in actively manipulating root-associated microbiomes. Community engineering efforts commonly seek to harness the potential of the indigenous microbiome through substrate-mediated recruitment of beneficial members. In most sustainable practices, microbial recruitment mechanisms rely on the application of complex organic mixtures where the resources/metabolites that act as direct stimulants of beneficial groups are not characterized. Outcomes of such indirect amendments are unpredictable regarding engineering the microbiome and achieving a plant-beneficial environment.

RESULTS

This study applied network analysis of metagenomics data to explore amendment-derived transformations in the soil microbiome, which lead to the suppression of pathogens affecting apple root systems. Shotgun metagenomic analysis was conducted with data from 'sick' vs 'healthy/recovered' rhizosphere soil microbiomes. The data was then converted into community-level metabolic networks. Simulations examined the functional contribution of treatment-associated taxonomic groups and linked them with specific amendment-induced metabolites. This analysis enabled the selection of specific metabolites that were predicted to amplify or diminish the abundance of targeted microbes functional in the healthy soil system. Many of these predictions were corroborated by experimental evidence from the literature. The potential of two of these metabolites (dopamine and vitamin B₁₂) to either stimulate or suppress targeted microbial groups was evaluated in a follow-up set of soil microcosm experiments. The results corroborated the stimulant's potential (but not the suppressor) to act as a modulator of plant beneficial bacteria, paving the way for future development of knowledge-based (rather than trial and error) metabolic-defined amendments. Our pipeline for generating predictions for the selective targeting of microbial groups based on processing assembled and annotated metagenomics data is available at https://github.com/ot483/NetCom2 .

CONCLUSIONS

This research demonstrates how genomic-based algorithms can be used to formulate testable hypotheses for strategically engineering the rhizosphere microbiome by identifying specific compounds that may act as selective modulators of microbial communities. Applying this framework to reduce unpredictable elements in amendment-based solutions promotes the development of ecologically sound methods for re-establishing a functional microbiome in agro and other ecosystems.

de Oliveira Almeida R, Valente GT. Predicting metabolic pathways of plant enzymes without using sequence similarity: Models from machine learning. Plant Genome 2020;13:e20043. [PMID: 33217216 DOI: 10.1002/tpg2.20043] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Received: 11/08/2019] [Revised: 06/03/2020] [Accepted: 06/10/2020] [Indexed: 06/11/2023]

Hidden resources in the Escherichia coli genome restore PLP synthesis and robust growth after deletion of the essential gene pdxB. Proc Natl Acad Sci U S A 2019;116:24164-24173. [PMID: 31712440 PMCID: PMC6883840 DOI: 10.1073/pnas.1915569116] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Indexed: 11/28/2022]

Abstract

The evolution of new metabolic pathways has been a driver of diversification from the last universal common ancestor 3.8 billion y ago to the present. Bioinformatic evidence suggests that many pathways were assembled by recruiting promiscuous enzymes to serve new functions. However, the processes by which new pathways have emerged are lost in time. We have little information about the environmental conditions that fostered emergence of new pathways, the genome context in which new pathways emerged, and the types of mutations that elevated flux through inefficient new pathways. Experimental laboratory evolution has allowed us to evolve a new pathway and identify mechanisms by which mutations increase fitness when an inefficient new pathway becomes important for survival.

PdxB (erythronate 4-phosphate dehydrogenase) is expected to be required for synthesis of the essential cofactor pyridoxal 5′-phosphate (PLP) in Escherichia coli. Surprisingly, incubation of the ∆pdxB strain in medium containing glucose as a sole carbon source for 10 d resulted in visible turbidity, suggesting that PLP is being produced by some alternative pathway. Continued evolution of parallel lineages for 110 to 150 generations produced several strains that grow robustly in glucose. We identified a 4-step bypass pathway patched together from promiscuous enzymes that restores PLP synthesis in strain JK1. None of the mutations in JK1 occurs in a gene encoding an enzyme in the new pathway. Two mutations indirectly enhance the ability of SerA (3-phosphoglycerate dehydrogenase) to perform a new function in the bypass pathway. Another disrupts a gene encoding a PLP phosphatase, thus preserving PLP levels. These results demonstrate that a functional pathway can be patched together from promiscuous enzymes in the proteome, even without mutations in the genes encoding those enzymes.

Katsir L, Zhepu R, Santos Garcia D, Piasezky A, Jiang J, Sela N, Freilich S, Bahar O. Genome Analysis of Haplotype D of Candidatus Liberibacter Solanacearum. Front Microbiol 2018;9:2933. [PMID: 30619106 PMCID: PMC6295461 DOI: 10.3389/fmicb.2018.02933] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Received: 08/15/2018] [Accepted: 11/14/2018] [Indexed: 11/20/2022]

Towards a Stochastic Paradigm: From Fuzzy Ensembles to Cellular Functions. Molecules 2018;23:molecules23113008. [PMID: 30453632 PMCID: PMC6278454 DOI: 10.3390/molecules23113008] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Received: 10/21/2018] [Revised: 11/11/2018] [Accepted: 11/16/2018] [Indexed: 01/03/2023]

Copley SD. Shining a light on enzyme promiscuity. Curr Opin Struct Biol 2017;47:167-175. [DOI: 10.1016/j.sbi.2017.11.001] [Citation(s) in RCA: 104] [Impact Index Per Article: 14.9] [Received: 02/21/2017] [Revised: 08/14/2017] [Accepted: 11/02/2017] [Indexed: 11/16/2022]

Martínez-Núñez MA, Rodríguez-Escamilla Z, Rodríguez-Vázquez K, Pérez-Rueda E. Tracing the Repertoire of Promiscuous Enzymes along the Metabolic Pathways in Archaeal Organisms. Life (Basel) 2017;7:life7030030. [PMID: 28703743 PMCID: PMC5617955 DOI: 10.3390/life7030030] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Received: 04/12/2017] [Revised: 07/09/2017] [Accepted: 07/10/2017] [Indexed: 01/10/2023]

Multiple nucleophilic elbows leading to multiple active sites in a single module esterase from Sorangium cellulosum. J Struct Biol 2015;190:314-27. [DOI: 10.1016/j.jsb.2015.04.009] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Received: 09/05/2014] [Revised: 03/25/2015] [Accepted: 04/10/2015] [Indexed: 11/17/2022]

Martínez Cuesta S, Rahman SA, Furnham N, Thornton JM. The Classification and Evolution of Enzyme Function. Biophys J 2015;109:1082-6. [PMID: 25986631 DOI: 10.1016/j.bpj.2015.04.020] [Citation(s) in RCA: 60] [Impact Index Per Article: 6.7] [Received: 03/09/2015] [Revised: 04/16/2015] [Accepted: 04/17/2015] [Indexed: 11/30/2022]

Zhou F, Toivonen H, King RD. The use of weighted graphs for large-scale genome analysis. PLoS One 2014;9:e89618. [PMID: 24619061 PMCID: PMC3949676 DOI: 10.1371/journal.pone.0089618] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Received: 07/06/2013] [Accepted: 01/23/2014] [Indexed: 11/18/2022]

Comparative genomics approaches to understanding and manipulating plant metabolism. Curr Opin Biotechnol 2013;24:278-84. [DOI: 10.1016/j.copbio.2012.07.005] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.8] [Received: 07/18/2012] [Revised: 07/29/2012] [Accepted: 07/30/2012] [Indexed: 12/11/2022]

Chen TW, Gan RCR, Wu TH, Huang PJ, Lee CY, Chen YYM, Chen CC, Tang P. FastAnnotator--an efficient transcript annotation web tool. BMC Genomics 2012;13 Suppl 7:S9. [PMID: 23281853 PMCID: PMC3521244 DOI: 10.1186/1471-2164-13-s7-s9] [Citation(s) in RCA: 49] [Impact Index Per Article: 4.1] [Indexed: 11/10/2022]

Abstract

BACKGROUND

Recent developments in high-throughput sequencing (HTS) technologies have made it feasible to sequence the complete transcriptomes of non-model organisms or metatranscriptomes from environmental samples. The challenge after generating hundreds of millions of sequences is to annotate these transcripts and classify the transcripts based on their putative functions. Because many biological scientists lack the knowledge to install Linux-based software packages or maintain databases used for transcript annotation, we developed an automatic annotation tool with an easy-to-use interface.

METHODS

To elucidate the potential functions of gene transcripts, we integrated well-established annotation tools: Blast2GO, PRIAM and RPS BLAST in a web-based service, FastAnnotator, which can assign Gene Ontology (GO) terms, Enzyme Commission numbers (EC numbers) and functional domains to query sequences.

RESULTS

Using six transcriptome sequence datasets as examples, we demonstrated the ability of FastAnnotator to assign functional annotations. FastAnnotator annotated 88.1% and 81.3% of the transcripts from the well-studied organisms Caenorhabditis elegans and Streptococcus parasanguinis, respectively. Furthermore, FastAnnotator annotated 62.9%, 20.4%, 53.1% and 42.0% of the sequences from the transcriptomes of sweet potato, clam, amoeba, and Trichomonas vaginalis, respectively, which lack reference genomes. We demonstrated that FastAnnotator can complete the annotation process in a reasonable amount of time and is suitable for the annotation of transcriptomes from model organisms or organisms for which annotated reference genomes are not available.

CONCLUSIONS

The sequencing process no longer represents the bottleneck in the study of genomics, and automatic annotation tools have become invaluable as the annotation procedure has become the limiting step. We present FastAnnotator, an automated annotation web tool designed to efficiently annotate sequences with their gene functions, enzyme functions or domains. FastAnnotator is useful in transcriptome studies, especially those focusing on non-model organisms or metatranscriptomes. FastAnnotator does not require local installation and is freely available at http://fastannotator.cgu.edu.tw.

Klein CC, Cottret L, Kielbassa J, Charles H, Gautier C, Ribeiro de Vasconcelos AT, Lacroix V, Sagot MF. Exploration of the core metabolism of symbiotic bacteria. BMC Genomics 2012;13:438. [PMID: 22938206 PMCID: PMC3543179 DOI: 10.1186/1471-2164-13-438] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.9] [Received: 06/20/2012] [Accepted: 08/18/2012] [Indexed: 12/01/2022]

Abstract

Background

A large number of genome-scale metabolic networks is now available for many organisms, mostly bacteria. Previous works on minimal gene sets, when analysing host-dependent bacteria, found small common sets of metabolic genes. When such analyses are restricted to bacteria with similar lifestyles, larger portions of metabolism are expected to be shared and their composition is worth investigating. Here we report a comparative analysis of the small molecule metabolism of symbiotic bacteria, exploring common and variable portions as well as the contribution of different lifestyle groups to the reduction of a common set of metabolic capabilities.

Results

We found no reaction shared by all the bacteria analysed. Disregarding those with the smallest genomes, we still did not find a reaction core; however, we did find a core of biochemical capabilities. While obligate intracellular symbionts have no core of reactions within their group, extracellular and cell-associated symbionts do have a small core composed of disconnected fragments. In agreement with previous findings in Escherichia coli, their cores are enriched in biosynthetic processes whereas the variable metabolisms have similar ratios of biosynthetic and degradation reactions. Conversely, the variable metabolism of obligate intracellular symbionts is enriched in anabolism.

Conclusion

Even when removing the symbionts with the most reduced genomes, there is no core of reactions common to the analysed symbiotic bacteria. The main reason is the very high specialisation of obligate intracellular symbionts; however, host-dependence alone is not an explanation for such absence. The composition of the metabolism of cell-associated and extracellular bacteria shows that while they have similar needs in terms of the building blocks of their cells, they have to adapt to very distinct environments. On the other hand, in obligate intracellular bacteria, catabolism has largely disappeared, whereas synthetic routes appear to have been selected for depending on the nature of the symbiosis. As more genomes are added, we expect, based on our simulations, that the core of cell-associated and extracellular bacteria will continue to diminish, converging to approximately 60 reactions.
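The notion of a "reaction core" used in this abstract can be illustrated as a set intersection over per-organism reaction sets. The sketch below is only a toy illustration under that assumption: the organism labels, the reaction identifiers (R1, R2, ...) and the function names are hypothetical, and it is not the authors' pipeline.

```python
from typing import Dict, Set


def reaction_core(networks: Dict[str, Set[str]]) -> Set[str]:
    """Return the reactions present in every organism's network."""
    if not networks:
        return set()
    return set.intersection(*networks.values())


def variable_reactions(networks: Dict[str, Set[str]]) -> Set[str]:
    """Return the reactions present in at least one, but not all, networks."""
    union = set().union(*networks.values())
    return union - reaction_core(networks)


# Hypothetical toy data: one reaction set per symbiont.
toy_networks = {
    "extracellular_A": {"R1", "R2", "R3"},
    "cell_associated_B": {"R2", "R3", "R4"},
    "obligate_intracellular_C": {"R5"},
}

# Core over all organisms, then over the non-obligate subset only.
core_all = reaction_core(toy_networks)
core_subset = reaction_core(
    {k: v for k, v in toy_networks.items() if k != "obligate_intracellular_C"}
)
print(sorted(core_all), sorted(core_subset))
```

In this toy case the core over all three organisms is empty, while the two non-obligate organisms share a small core, which mirrors the qualitative pattern the abstract reports.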

Suen S, Lu HHS, Yeang CH. Evolution of domain architectures and catalytic functions of enzymes in metabolic systems. Genome Biol Evol 2012;4:976-93. [PMID: 22936075 PMCID: PMC3468959 DOI: 10.1093/gbe/evs072] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Indexed: 12/11/2022]

Abstract

Domain architectures and catalytic functions of enzymes constitute the centerpieces of a metabolic network. These types of information are formulated as a two-layered network consisting of domains, proteins, and reactions: a domain-protein-reaction (DPR) network. We propose an algorithm to reconstruct the evolutionary history of DPR networks across multiple species and categorize the mechanisms of metabolic systems evolution in terms of network changes. The reconstructed history reveals distinct patterns of evolutionary mechanisms between prokaryotic and eukaryotic networks. Although the evolutionary mechanisms in early ancestors of prokaryotes and eukaryotes are quite similar, more novel and duplicated domain compositions with identical catalytic functions arise along the eukaryotic lineage. In contrast, prokaryotic enzymes become more versatile by catalyzing multiple reactions with similar chemical operations. Moreover, different metabolic pathways are enriched with distinct network evolution mechanisms. For instance, although the pathways of steroid biosynthesis, protein kinases, and glycosaminoglycan biosynthesis all constitute prominent features of animal-specific physiology, their evolution of domain architectures and catalytic functions follows distinct patterns. Steroid biosynthesis is enriched with reaction creations but retains a relatively conserved repertoire of domain compositions and proteins. Protein kinases retain conserved reactions but possess many novel domains and proteins. In contrast, glycosaminoglycan biosynthesis has high rates of reaction/protein creations and domain recruitments. Finally, we elicit and validate two general principles underlying the evolution of DPR networks: 1) duplicated enzyme proteins possess similar catalytic functions and 2) the majority of novel domains arise to catalyze novel reactions. These results shed new light on the evolution of metabolic systems.

Seaver SMD, Henry CS, Hanson AD. Frontiers in metabolic reconstruction and modeling of plant genomes. J Exp Bot 2012;63:2247-58. [PMID: 22238452 DOI: 10.1093/jxb/err371] [Citation(s) in RCA: 44] [Impact Index Per Article: 3.7] [Indexed: 05/20/2023]

Copley SD. Toward a systems biology perspective on enzyme evolution. J Biol Chem 2012;287:3-10. [PMID: 22069330 PMCID: PMC3249082 DOI: 10.1074/jbc.r111.254714] [Citation(s) in RCA: 54] [Impact Index Per Article: 4.5] [Indexed: 11/06/2022]

Three serendipitous pathways in E. coli can bypass a block in pyridoxal-5'-phosphate synthesis. Mol Syst Biol 2011;6:436. [PMID: 21119630 PMCID: PMC3010111 DOI: 10.1038/msb.2010.88] [Citation(s) in RCA: 97] [Impact Index Per Article: 7.5] [Received: 07/02/2010] [Accepted: 09/30/2010] [Indexed: 11/28/2022]

Abstract

Overexpression of seven different genes restores growth of a ΔpdxB strain of E. coli, which cannot make pyridoxal phosphate (PLP), on M9/glucose.

None of the enzymes encoded by these genes has a promiscuous 4-phosphoerythronate dehydrogenase activity that can replace the activity of PdxB.

Overexpression of these genes restores PLP synthesis by three different serendipitous pathways that feed into the normal PLP synthesis pathway downstream of the blocked step.

Reactions in one of these pathways are catalyzed by low-level activities of enzymes of unknown function and a promiscuous activity of an enzyme that normally has a role in another pathway; one reaction appears to be non-enzymatic.

Most metabolic enzymes are prodigious catalysts that have evolved to accelerate chemical reactions with high efficiency and specificity. However, many enzymes have inefficient promiscuous activities, as well, as a result of the assemblage of highly reactive catalytic residues and cofactors in active sites. Although promiscuous activities are generally orders of magnitude less efficient than well-evolved activities (O'Brien and Herschlag, 1998, 2001; Wang et al, 2003; Taylor Ringia et al, 2004), they often enhance reaction rates by orders of magnitude relative to those of uncatalyzed reactions (O'Brien and Herschlag, 1998, 2001). Thus, promiscuous activities provide a reservoir of novel catalytic activities that can be recruited to serve new functions.

The evolutionary potential of promiscuous enzymes extends beyond the recruitment of single enzymes to serve new functions. Microbes contain hundreds of enzymes (E. coli contains about 1700; Freilich et al, 2005), raising the possibility that promiscuous enzymes can be patched together to generate 'serendipitous' pathways that are not part of normal metabolism. We distinguish serendipitous pathways from latent or cryptic pathways, which are bona fide pathways involving dedicated enzymes that are produced only under particular environmental circumstances. In contrast, serendipitous pathways are patched together from enzymes that normally serve other functions and are not regulated in a coordinated manner in response to the need to synthesize or degrade a metabolite.

In this study, we describe the discovery of three serendipitous pathways that allow synthesis of pyridoxal phosphate (PLP) in a strain of E. coli that lacks 4-phosphoerythronate dehydrogenase (PdxB) when one of seven different genes is overexpressed. These genes were identified in a multicopy suppression experiment in which a library of E. coli genes (from the ASKA collection) was introduced into a ΔpdxB strain of E. coli that is unable to synthesize PLP. Surprisingly, none of the enzymes encoded by these genes has a promiscuous 4-phosphoerythronate (4PE) dehydrogenase activity that can substitute for the missing PdxB. Rather, overproduction of these enzymes appears to facilitate at least three serendipitous pathways that draw material from other metabolic pathways and feed into the normal PLP synthesis pathway downstream of the blocked step (Figure 1).

We have characterized one of these pathways in detail (Figure 3). The first step, dephosphorylation of 3-phosphohydroxypyruvate, is catalyzed by YeaB, a predicted NUDIX hydrolase of unknown function. Although catalysis is inefficient (k_cat=5.7×10⁻⁵ s⁻¹ and k_cat/K_M>0.028 M⁻¹ s⁻¹), the enzymatic rate is 4×10⁷-fold faster than the rate of the uncatalyzed reaction, and is sufficient to support PLP synthesis when YeaB is overproduced. The second step in the pathway is decarboxylation of 3-hydroxypyruvate (3HP). Although we found two enzymes (1-deoxyxylulose-5-phosphate synthase and the catalytic domain of α-ketoglutarate dehydrogenase) that catalyze this reaction with low but respectable activity in vitro, their involvement in pathway 1 was ruled out by genetic methods. Surprisingly, the non-enzymatic rate of decarboxylation of 3HP appears to be sufficient to support PLP synthesis. The third step in the pathway, condensation of glycolaldehyde and glycine to form 4-hydroxy-L-threonine, is catalyzed by LtaE, a low-specificity threonine aldolase whose physiological role is not known. The final step, phosphorylation of 4-hydroxy-L-threonine, is catalyzed by homoserine kinase (ThrB), which is required for synthesis of threonine. The promiscuous phosphorylation of 4-hydroxy-L-threonine is 80-fold slower than the physiological phosphorylation of homoserine. The involvement of LtaE and ThrB in pathway 1 was confirmed by genetic experiments showing that overexpression of yeaB no longer restores growth of ΔpdxB strains lacking either ltaE or thrB.
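The kinetic figures quoted for YeaB can be put in perspective with a little arithmetic. The sketch below assumes that the 4×10⁷-fold enhancement is the ratio of k_cat to a first-order uncatalyzed rate constant; that definition, and the variable and function names, are assumptions made for illustration rather than details given in the text.

```python
import math

# Values quoted in the text for the YeaB-catalyzed dephosphorylation.
K_CAT_PER_S = 5.7e-5      # k_cat in s^-1
RATE_ENHANCEMENT = 4e7    # enzymatic rate relative to the uncatalyzed rate

SECONDS_PER_HOUR = 3600.0
SECONDS_PER_YEAR = 365.25 * 24 * 3600.0

# Assumption: enhancement = k_cat / k_uncat, with k_uncat a first-order constant.
k_uncat_per_s = K_CAT_PER_S / RATE_ENHANCEMENT


def mean_turnover_hours(k_per_s: float) -> float:
    """Mean time per catalytic turnover (1/k), in hours."""
    return 1.0 / k_per_s / SECONDS_PER_HOUR


def half_life_years(k_per_s: float) -> float:
    """Half-life of a first-order process (ln 2 / k), in years."""
    return math.log(2) / k_per_s / SECONDS_PER_YEAR


print(f"implied k_uncat: {k_uncat_per_s:.2e} s^-1")
print(f"time per YeaB turnover: {mean_turnover_hours(K_CAT_PER_S):.1f} h")
print(f"uncatalyzed half-life: {half_life_years(k_uncat_per_s):.0f} years")
```

Under these assumptions a single YeaB active site turns over roughly once every few hours, yet that is still many orders of magnitude faster than the implied uncatalyzed reaction, which is consistent with the claim that overproduction of YeaB can supply enough flux for PLP synthesis.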

Although pathway 1 is inefficient, it provides the ΔpdxB strain with the ability to grow under conditions in which survival is otherwise impossible. In general, serendipitous assembly of an inefficient pathway from promiscuous activities of available enzymes will be important whenever the pathway provides increased fitness. This might occur when a critical metabolite is no longer available from the environment, and survival depends on assembly of a new biosynthetic pathway. A second circumstance in which an inefficient serendipitous pathway might improve fitness is the appearance of a novel compound in the environment that can be exploited as a source of carbon, nitrogen or phosphorus. Finally, chemotherapeutic agents that block metabolic pathways in bacteria or cancer cells could provide selective pressure for assembly of serendipitous pathways that allow synthesis of the end product of the blocked pathway and thus a previously unappreciated source of drug resistance. In all of these cases, even an inefficient pathway can provide a selective advantage over other cells in a particular environmental niche, allowing survival and subsequent mutations that elevate the efficiency of the pathway.

Our work is consistent with the hypothesis that the recognized metabolic network of E. coli is underlain by a denser network of reactions due to promiscuous enzymes that use and generate recognized metabolites, but also unusual metabolites that normally have no physiological role. The findings reported here highlight the abundance of cryptic capabilities in the E. coli proteome that can be drawn on to generate novel pathways. Such pathways could provide a starting place for assembly of more efficient pathways, both in nature and in the hands of metabolic engineers.

Bacterial genomes encode hundreds to thousands of enzymes, most of which are specialized for particular functions. However, most enzymes have inefficient promiscuous activities, as well, that generally serve no purpose. Promiscuous reactions can be patched together to form multistep metabolic pathways. Mutations that increase expression or activity of enzymes in such serendipitous pathways can elevate flux through the pathway to a physiologically significant level. In this study, we describe the discovery of three serendipitous pathways that allow synthesis of pyridoxal-5′-phosphate (PLP) in a strain of E. coli that lacks 4-phosphoerythronate (4PE) dehydrogenase (PdxB) when one of seven different genes is overexpressed. We have characterized one of these pathways in detail. This pathway diverts material from serine biosynthesis and generates an intermediate in the normal PLP synthesis pathway downstream of the block caused by lack of PdxB. Steps in the pathway are catalyzed by a protein of unknown function, a broad-specificity enzyme whose physiological role is unknown, and a promiscuous activity of an enzyme that normally serves another function. One step in the pathway may be non-enzymatic.

Loss of genetic redundancy in reductive genome evolution. PLoS Comput Biol 2011;7:e1001082. [PMID: 21379323 PMCID: PMC3040653 DOI: 10.1371/journal.pcbi.1001082] [Citation(s) in RCA: 51] [Impact Index Per Article: 3.9] [Received: 06/01/2010] [Accepted: 01/12/2011] [Indexed: 01/14/2023]

Sonavane S, Chakrabarti P. Prediction of active site cleft using support vector machines. J Chem Inf Model 2010;50:2266-73. [PMID: 21080689 DOI: 10.1021/ci1002922] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Indexed: 01/28/2023]

Glasner ME, Gerlt JA, Babbitt PC. Mechanisms of protein evolution and their application to protein engineering. Adv Enzymol Relat Areas Mol Biol 2010;75:193-239, xii-xiii. [PMID: 17124868 DOI: 10.1002/9780471224464.ch3] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.2] [Indexed: 12/14/2022]

Freilich S, Goldovsky L, Gottlieb A, Blanc E, Tsoka S, Ouzounis CA. Stratification of co-evolving genomic groups using ranked phylogenetic profiles. BMC Bioinformatics 2009;10:355. [PMID: 19860884 PMCID: PMC2775751 DOI: 10.1186/1471-2105-10-355] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Received: 05/18/2009] [Accepted: 10/27/2009] [Indexed: 01/12/2023]

Abstract

BACKGROUND

Previous methods of detecting the taxonomic origins of arbitrary sequence collections, with a significant impact on genome analysis and in particular metagenomics, have primarily focused on compositional features of genomes. The evolutionary patterns of phylogenetic distribution of genes or proteins, represented by phylogenetic profiles, provide an alternative approach for the detection of taxonomic origins, but typically suffer from low accuracy. Herein, we present rank-BLAST, a novel approach for the assignment of protein sequences into genomic groups of the same taxonomic origin, based on the ranking order of phylogenetic profiles of target genes or proteins across the reference database.

RESULTS

The rank-BLAST approach is validated by computing the phylogenetic profiles of all sequences for five distinct microbial species of varying degrees of phylogenetic proximity, against a reference database of 243 fully sequenced genomes. The approach - a combination of sequence searches, statistical estimation and clustering - analyses the degree of sequence divergence between sets of protein sequences and allows the classification of protein sequences according to the species of origin with high accuracy, allowing taxonomic classification of 64% of the proteins studied. In most cases, a main cluster is detected, representing the corresponding species. Secondary, functionally distinct and species-specific clusters exhibit different patterns of phylogenetic distribution, thus flagging gene groups of interest. Detailed analyses of such cases are provided as examples.

CONCLUSION

Our results indicate that the rank-BLAST approach can capture the taxonomic origins of sequence collections in an accurate and efficient manner. The approach can be useful both for the analysis of genome evolution and the detection of species groups in metagenomics samples.

Wagner A. Evolutionary constraints permeate large metabolic networks. BMC Evol Biol 2009;9:231. [PMID: 19747381 PMCID: PMC2753571 DOI: 10.1186/1471-2148-9-231] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2009] [Accepted: 09/11/2009] [Indexed: 11/22/2022] Open

Evolution of efficient pathways for degradation of anthropogenic chemicals. Nat Chem Biol 2009;5:559-66. [PMID: 19620997 DOI: 10.1038/nchembio.197] [Citation(s) in RCA: 123] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]

Arakaki AK, Huang Y, Skolnick J. EFICAz2: enzyme function inference by a combined approach enhanced by machine learning. BMC Bioinformatics 2009;10:107. [PMID: 19361344 PMCID: PMC2670841 DOI: 10.1186/1471-2105-10-107] [Citation(s) in RCA: 51] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2008] [Accepted: 04/13/2009] [Indexed: 12/22/2022] Open

Abstract

BACKGROUND

We previously developed EFICAz, an enzyme function inference approach that combines predictions from non-completely overlapping component methods. Two of the four components in the original EFICAz are based on the detection of functionally discriminating residues (FDRs). FDRs distinguish between members of an enzyme family that are homofunctional (classified under the EC number of interest) or heterofunctional (annotated with another EC number or lacking enzymatic activity). Each of the two FDR-based components is associated with one of two specific kinds of enzyme families. EFICAz exhibits high precision, except when the maximal test to training sequence identity (MTTSI) is lower than 30%. To improve EFICAz's performance in this regime, we: i) increased the number of predictive components and ii) took advantage of consensual information from the different components to make the final EC number assignment.

RESULTS

We have developed two new EFICAz components, analogous to the two FDR-based components, where the discrimination between homofunctional and heterofunctional members is based on the evaluation, via Support Vector Machine models, of all the aligned positions between the query sequence and the multiple sequence alignments associated with the enzyme families. Benchmark results indicate that: i) the new SVM-based components outperform their FDR-based counterparts, and ii) both SVM-based and FDR-based components generate unique predictions. We developed classification tree models to optimally combine the results from the six EFICAz components into a final EC number prediction. The new implementation of our approach, EFICAz2, exhibits a highly improved prediction precision at MTTSI < 30% compared to the original EFICAz, with only a slight decrease in prediction recall. A comparative analysis of enzyme function annotation of the human proteome by EFICAz2 and KEGG shows that: i) when both sources make EC number assignments for the same protein sequence, the assignments tend to be consistent and ii) EFICAz2 generates considerably more unique assignments than KEGG.

CONCLUSION

Performance benchmarks and the comparison with KEGG demonstrate that EFICAz2 is a powerful and precise tool for enzyme function annotation, with multiple applications in genome analysis and metabolic pathway reconstruction. The EFICAz2 web service is available at: http://cssb.biology.gatech.edu/skolnick/webservice/EFICAz2/index.html.

Yu C, Zavaljevski N, Desai V, Reifman J. Genome-wide enzyme annotation with precision control: catalytic families (CatFam) databases. Proteins 2009;74:449-60. [PMID: 18636476 DOI: 10.1002/prot.22167] [Citation(s) in RCA: 52] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]

Abstract

In this article, we present a new method termed CatFam (Catalytic Families) to automatically infer the functions of catalytic proteins, which account for 20-40% of all proteins in living organisms and play a critical role in a variety of biological processes. CatFam is a sequence-based method that generates sequence profiles to represent and infer protein catalytic functions. CatFam generates profiles through a stepwise procedure that carefully controls profile quality and employs nonenzymes as negative samples to establish profile-specific thresholds associated with a predefined nominal false-positive rate (FPR) of predictions. The adjustable FPR allows for fine precision control of each profile and enables the generation of profile databases that meet different needs: function annotation with high precision and hypothesis generation with moderate precision but better recall. Multiple tests of CatFam databases (generated with distinct nominal FPRs) against enzyme and nonenzyme datasets show that the method's predictions have consistently high precision and recall. For example, a 1% FPR database predicts protein catalytic functions for a dataset of enzymes and nonenzymes with 98.6% precision and 95.0% recall. Comparisons of CatFam databases against other established profile-based methods for the functional annotation of 13 bacterial genomes indicate that CatFam consistently achieves higher precision and (in most cases) higher recall, and that (on average) CatFam provides 21.9% additional catalytic functions not inferred by the other similarly reliable methods. These results strongly suggest that the proposed method provides a valuable contribution to the automated prediction of protein catalytic functions. The CatFam databases and the database search program are freely available at http://www.bhsai.org/downloads/catfam.tar.gz.
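The idea of using nonenzymes as negative samples to set a profile-specific threshold at a nominal false-positive rate can be sketched as follows. This is a minimal illustration under stated assumptions, not the CatFam implementation: the function names (`threshold_for_fpr`, `precision_recall`), the rule that a sequence is a hit when its score is strictly above the cutoff, and the toy scores are all hypothetical.

```python
import math

def threshold_for_fpr(negative_scores, nominal_fpr):
    """Pick a score cutoff from negative (nonenzyme) scores.

    A sequence counts as a hit when its score is strictly greater than
    the cutoff, so at most floor(nominal_fpr * N) negatives are hits.
    (Illustrative rule; the actual CatFam procedure may differ.)
    """
    ranked = sorted(negative_scores, reverse=True)
    allowed = math.floor(nominal_fpr * len(ranked))  # tolerated false positives
    if allowed >= len(ranked):
        return float("-inf")  # every negative may pass
    return ranked[allowed]

def precision_recall(positive_scores, negative_scores, cutoff):
    """Precision and recall of the rule 'score > cutoff'."""
    tp = sum(s > cutoff for s in positive_scores)
    fp = sum(s > cutoff for s in negative_scores)
    precision = tp / (tp + fp) if (tp + fp) else 1.0
    recall = tp / len(positive_scores) if positive_scores else 0.0
    return precision, recall

# Toy profile scores (made up for illustration only).
nonenzyme_scores = [1, 2, 3, 4, 5, 6, 7, 8, 9, 10]
enzyme_scores = [7, 9, 11, 12]

cutoff = threshold_for_fpr(nonenzyme_scores, nominal_fpr=0.2)
precision, recall = precision_recall(enzyme_scores, nonenzyme_scores, cutoff)
```

Lowering `nominal_fpr` raises the cutoff, which trades recall for precision. That matches the abstract's distinction between high-precision annotation databases and moderate-precision, higher-recall databases for hypothesis generation.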

Freilich S, Goldovsky L, Ouzounis CA, Thornton JM. Metabolic innovations towards the human lineage. BMC Evol Biol 2008;8:247. [PMID: 18782449 PMCID: PMC2553087 DOI: 10.1186/1471-2148-8-247] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2008] [Accepted: 09/09/2008] [Indexed: 01/09/2023] Open

Caetano-Anollés G, Yafremava LS, Gee H, Caetano-Anollés D, Kim HS, Mittenthal JE. The origin and evolution of modern metabolism. Int J Biochem Cell Biol 2008;41:285-97. [PMID: 18790074 DOI: 10.1016/j.biocel.2008.08.022] [Citation(s) in RCA: 69] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2008] [Revised: 08/09/2008] [Accepted: 08/11/2008] [Indexed: 10/21/2022]

Sanjuán R, Nebot MR. A network model for the correlation between epistasis and genomic complexity. PLoS One 2008;3:e2663. [PMID: 18648534 PMCID: PMC2481279 DOI: 10.1371/journal.pone.0002663] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2008] [Accepted: 06/12/2008] [Indexed: 01/28/2023] Open

Sun J, Lu X, Rinas U, Zeng AP. Metabolic peculiarities of Aspergillus niger disclosed by comparative metabolic genomics. Genome Biol 2008;8:R182. [PMID: 17784953 PMCID: PMC2375020 DOI: 10.1186/gb-2007-8-9-r182] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2007] [Revised: 07/13/2007] [Accepted: 09/04/2007] [Indexed: 11/10/2022] Open

Abstract

A genome-scale metabolic network and an in-depth genomic comparison of Aspergillus niger with seven other fungi are presented, revealing more than 1,100 enzyme-coding genes that are unique to A. niger.

Background

Aspergillus niger is an important industrial microorganism for the production of both metabolites, such as citric acid, and proteins, such as fungal enzymes or heterologous proteins. Despite its extensive industrial applications, the genetic inventory of this fungus is only partially understood. The recently released genome sequence opens a new horizon for both scientific studies and biotechnological applications.

Results

Here, we present the first genome-scale metabolic network for A. niger and an in-depth genomic comparison of this species to seven other fungi to disclose its metabolic peculiarities. The raw genomic sequences of A. niger ATCC 9029 were first annotated. The reconstructed metabolic network is based on the annotation of two A. niger genomes, CBS 513.88 and ATCC 9029, including enzymes with 988 unique EC numbers, 2,443 reactions and 2,349 metabolites. More than 1,100 enzyme-coding genes are unique to A. niger in comparison to the other seven fungi. For example, we identified additional copies of genes such as those encoding alternative mitochondrial oxidoreductase and citrate synthase in A. niger, which might contribute to the high citric acid production efficiency of this species. Moreover, nine genes were identified as encoding enzymes with EC numbers exclusively found in A. niger, mostly involved in the biosynthesis of complex secondary metabolites and degradation of aromatic compounds.

Conclusion

The genome-level reconstruction of the metabolic network and genome-based metabolic comparison disclose peculiarities of A. niger highly relevant to its biotechnological applications and should contribute to future rational metabolic design and systems biology studies of this black mold and related species.

Andreini C, Banci L, Bertini I, Rosato A. Zinc through the three domains of life. J Proteome Res 2007;5:3173-8. [PMID: 17081069 DOI: 10.1021/pr0603699] [Citation(s) in RCA: 433] [Impact Index Per Article: 25.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]

High precision multi-genome scale reannotation of enzyme function by EFICAz. BMC Genomics 2006;7:315. [PMID: 17166279 PMCID: PMC1764738 DOI: 10.1186/1471-2164-7-315] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2006] [Accepted: 12/13/2006] [Indexed: 11/10/2022] Open

Abstract

BACKGROUND

The functional annotation of most genes in newly sequenced genomes is inferred from similarity to previously characterized sequences, an annotation strategy that often leads to erroneous assignments. We have performed a reannotation of 245 genomes using an updated version of EFICAz, a highly precise method for enzyme function prediction.

RESULTS

Based on our three-field EC number predictions, we have obtained lower-bound estimates for the average enzyme content in Archaea (29%), Bacteria (30%) and Eukarya (18%). Most annotations added in KEGG from 2005 to 2006 agree with EFICAz predictions made in 2005. The coverage of EFICAz predictions is significantly higher than that of KEGG, especially for eukaryotes. Thousands of our novel predictions correspond to hypothetical proteins. We have identified a subset of 64 hypothetical proteins with low sequence identity to EFICAz training enzymes, whose biochemical functions have been recently characterized, and we find that in 96% (84%) of the cases we correctly identified their three-field (four-field) EC numbers. For two of the 64 hypothetical proteins, PA1167 from Pseudomonas aeruginosa, an alginate lyase (EC 4.2.2.3), and Rv1700 of Mycobacterium tuberculosis H37Rv, an ADP-ribose diphosphatase (EC 3.6.1.13), we have detected an annotation lag of more than two years in databases. Two examples are presented where EFICAz predictions act as hypothesis generators for understanding the functional roles of hypothetical proteins: FLJ11151, a human protein overexpressed in cancer that EFICAz identifies as an endopolyphosphatase (EC 3.6.1.10), and MW0119, a protein of Staphylococcus aureus strain MW2 that we propose as a candidate virulence factor based on its EFICAz predicted activity, sphingomyelin phosphodiesterase (EC 3.1.4.12).

CONCLUSION

Our results suggest that we have generated enzyme function annotations of high precision and recall. These predictions can be mined and correlated with other information sources to generate biologically significant hypotheses and can be useful for comparative genome analysis and automated metabolic pathway reconstruction.
