1
|
Jones DR, Thomas D, Alger N, Ghavidel A, Inglis GD, Abbott DW. SACCHARIS: an automated pipeline to streamline discovery of carbohydrate active enzyme activities within polyspecific families and de novo sequence datasets. BIOTECHNOLOGY FOR BIOFUELS 2018; 11:27. [PMID: 29441125 PMCID: PMC5798181 DOI: 10.1186/s13068-018-1027-x] [Citation(s) in RCA: 41] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/21/2017] [Accepted: 01/18/2018] [Indexed: 05/19/2023]
Abstract
BACKGROUND Deposition of new genetic sequences in online databases is expanding at an unprecedented rate. As a result, sequence identification continues to outpace functional characterization of carbohydrate active enzymes (CAZymes). In this paradigm, the discovery of enzymes with novel functions is often hindered by high volumes of uncharacterized sequences particularly when the enzyme sequence belongs to a family that exhibits diverse functional specificities (i.e., polyspecificity). Therefore, to direct sequence-based discovery and characterization of new enzyme activities we have developed an automated in silico pipeline entitled: Sequence Analysis and Clustering of CarboHydrate Active enzymes for Rapid Informed prediction of Specificity (SACCHARIS). This pipeline streamlines the selection of uncharacterized sequences for discovery of new CAZyme or CBM specificity from families currently maintained on the CAZy website or within user-defined datasets. RESULTS SACCHARIS was used to generate a phylogenetic tree of a GH43, a CAZyme family with defined subfamily designations. This analysis confirmed that large datasets can be organized into sequence clusters of manageable sizes that possess related functions. Seeding this tree with a GH43 sequence from Bacteroides dorei DSM 17855 (BdGH43b, revealed it partitioned as a single sequence within the tree. This pattern was consistent with it possessing a unique enzyme activity for GH43 as BdGH43b is the first described α-glucanase described for this family. The capacity of SACCHARIS to extract and cluster characterized carbohydrate binding module sequences was demonstrated using family 6 CBMs (i.e., CBM6s). This CBM family displays a polyspecific ligand binding profile and contains many structurally determined members. Using SACCHARIS to identify a cluster of divergent sequences, a CBM6 sequence from a unique clade was demonstrated to bind yeast mannan, which represents the first description of an α-mannan binding CBM. Additionally, we have performed a CAZome analysis of an in-house sequenced bacterial genome and a comparative analysis of B. thetaiotaomicron VPI-5482 and B. thetaiotaomicron 7330, to demonstrate that SACCHARIS can generate "CAZome fingerprints", which differentiate between the saccharolytic potential of two related strains in silico. CONCLUSIONS Establishing sequence-function and sequence-structure relationships in polyspecific CAZyme families are promising approaches for streamlining enzyme discovery. SACCHARIS facilitates this process by embedding CAZyme and CBM family trees generated from biochemically to structurally characterized sequences, with protein sequences that have unknown functions. In addition, these trees can be integrated with user-defined datasets (e.g., genomics, metagenomics, and transcriptomics) to inform experimental characterization of new CAZymes or CBMs not currently curated, and for researchers to compare differential sequence patterns between entire CAZomes. In this light, SACCHARIS provides an in silico tool that can be tailored for enzyme bioprospecting in datasets of increasing complexity and for diverse applications in glycobiotechnology.
Collapse
Affiliation(s)
- Darryl R. Jones
- Lethbridge Research and Development Centre, Agriculture and Agri-Food Canada, 5403-1st Avenue South, Lethbridge, AB T1J 4B1 Canada
| | - Dallas Thomas
- Lethbridge Research and Development Centre, Agriculture and Agri-Food Canada, 5403-1st Avenue South, Lethbridge, AB T1J 4B1 Canada
| | - Nicholas Alger
- Lethbridge Research and Development Centre, Agriculture and Agri-Food Canada, 5403-1st Avenue South, Lethbridge, AB T1J 4B1 Canada
| | - Ata Ghavidel
- Lethbridge Research and Development Centre, Agriculture and Agri-Food Canada, 5403-1st Avenue South, Lethbridge, AB T1J 4B1 Canada
| | - G. Douglas Inglis
- Lethbridge Research and Development Centre, Agriculture and Agri-Food Canada, 5403-1st Avenue South, Lethbridge, AB T1J 4B1 Canada
| | - D. Wade Abbott
- Lethbridge Research and Development Centre, Agriculture and Agri-Food Canada, 5403-1st Avenue South, Lethbridge, AB T1J 4B1 Canada
| |
Collapse
|
2
|
Exploring the past and the future of protein evolution with ancestral sequence reconstruction: the 'retro' approach to protein engineering. Biochem J 2017; 474:1-19. [PMID: 28008088 DOI: 10.1042/bcj20160507] [Citation(s) in RCA: 84] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2016] [Revised: 11/07/2016] [Accepted: 11/10/2016] [Indexed: 12/22/2022]
Abstract
A central goal in molecular evolution is to understand the ways in which genes and proteins evolve in response to changing environments. In the absence of intact DNA from fossils, ancestral sequence reconstruction (ASR) can be used to infer the evolutionary precursors of extant proteins. To date, ancestral proteins belonging to eubacteria, archaea, yeast and vertebrates have been inferred that have been hypothesized to date from between several million to over 3 billion years ago. ASR has yielded insights into the early history of life on Earth and the evolution of proteins and macromolecular complexes. Recently, however, ASR has developed from a tool for testing hypotheses about protein evolution to a useful means for designing novel proteins. The strength of this approach lies in the ability to infer ancestral sequences encoding proteins that have desirable properties compared with contemporary forms, particularly thermostability and broad substrate range, making them good starting points for laboratory evolution. Developments in technologies for DNA sequencing and synthesis and computational phylogenetic analysis have led to an escalation in the number of ancient proteins resurrected in the last decade and greatly facilitated the use of ASR in the burgeoning field of synthetic biology. However, the primary challenge of ASR remains in accurately inferring ancestral states, despite the uncertainty arising from evolutionary models, incomplete sequences and limited phylogenetic trees. This review will focus, firstly, on the use of ASR to uncover links between sequence and phenotype and, secondly, on the practical application of ASR in protein engineering.
Collapse
|
3
|
Ogawa T, Shirai T. Tracing ancestral specificity of lectins: ancestral sequence reconstruction method as a new approach in protein engineering. Methods Mol Biol 2014; 1200:539-551. [PMID: 25117263 DOI: 10.1007/978-1-4939-1292-6_44] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/03/2023]
Abstract
Protein evolution is a process of molecular design leading to the diversity of functional proteins found in nature. Recent advances in bioinformatics and structural biology, in addition to recombinant protein expression techniques, enable us to analyze more directly the molecular evolution of proteins by a new method using ancestral sequence reconstruction (ASR), the so-called experimental molecular archaeology. ASR has been used to reveal molecular properties and structures correlating with changing geology, ecology, and physiology, and to identify the structure elements important to changing physiological functions to fill substantial gaps in the processes of protein evolution. In this chapter, we describe ASR as a new method of protein engineering studies, and their application to analyzing lectins, of which evolutionary processes and structural features contributing to molecular stability, specificity, and unique functions have been elucidated. Experimental molecular archeology using ASR and crystal structures of full-length ancestral proteins is useful in understanding the evolutionary process of the functional and structural diversified lectins by tracing ancestral specificities.
Collapse
Affiliation(s)
- Tomohisa Ogawa
- Department of Biomolecular Science, Graduate School of Life Sciences, Tohoku University, Sendai, 980-8577, Japan,
| | | |
Collapse
|
4
|
Watanabe M, Nakamura O, Muramoto K, Ogawa T. Allosteric regulation of the carbohydrate-binding ability of a novel conger eel galectin by D-mannoside. J Biol Chem 2012; 287:31061-72. [PMID: 22810239 DOI: 10.1074/jbc.m112.346213] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
Conger eel has two galectins, termed congerins I and II (Con I and II), that function in mucus as biodefense molecules. Con I and II have acquired a novel protein fold via domain swapping and a new ligand-binding site by accelerated evolution, which enables recognition of some marine bacteria. In this study, we identified a new congerin isotype, congerin P (Con-P), from the peritoneal cells of conger eel. Although Con-P displayed obvious homology with galectins, we observed substitution of 7 out of 8 amino acid residues in the carbohydrate recognition domain that are conserved in all other known galectins. To understand the structure-function relationships of this unique galectin, recombinant Con-P was successfully expressed in Escherichia coli by using a Con II-tagged fusion protein system and subsequently characterized. In the presence of D-mannose, Con-P displayed 30-fold greater hemagglutinating activity than Con I; however, no activity was observed without mannose, indicating that D-mannoside can act as a modulator of Con-P. Frontal affinity chromatography analysis showed that activated Con-P, allosterically induced by mannose, displayed affinity for oligomannose-type sugars as well as N-acetyllactosamine-type β-galactosides. Thus, Con-P represents a new member of the galectin family with unique properties.
Collapse
Affiliation(s)
- Mizuki Watanabe
- Department of Biomolecular Sciences, Graduate School of Life Sciences, Tohoku University, Sendai 980-8577, Japan
| | | | | | | |
Collapse
|
5
|
Ogawa T, Watanabe M, Naganuma T, Muramoto K. Diversified carbohydrate-binding lectins from marine resources. JOURNAL OF AMINO ACIDS 2011; 2011:838914. [PMID: 22312473 PMCID: PMC3269628 DOI: 10.4061/2011/838914] [Citation(s) in RCA: 68] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/12/2011] [Accepted: 08/13/2011] [Indexed: 12/20/2022]
Abstract
Marine bioresources produce a great variety of specific and potent bioactive molecules including natural organic compounds such as fatty acids, polysaccharides, polyether, peptides, proteins, and enzymes. Lectins are also one of the promising candidates for useful therapeutic agents because they can recognize the specific carbohydrate structures such as proteoglycans, glycoproteins, and glycolipids, resulting in the regulation of various cells via glycoconjugates and their physiological and pathological phenomenon through the host-pathogen interactions and cell-cell communications. Here, we review the multiple lectins from marine resources including fishes and sea invertebrate in terms of their structure-activity relationships and molecular evolution. Especially, we focus on the unique structural properties and molecular evolution of C-type lectins, galectin, F-type lectin, and rhamnose-binding lectin families.
Collapse
Affiliation(s)
- Tomohisa Ogawa
- Department of Biomolecular Sciences, Graduate School of Life Sciences, Tohoku University, Sendai 980-8577, Japan
| | | | | | | |
Collapse
|
6
|
Hobbs JK, Shepherd C, Saul DJ, Demetras NJ, Haaning S, Monk CR, Daniel RM, Arcus VL. On the Origin and Evolution of Thermophily: Reconstruction of Functional Precambrian Enzymes from Ancestors of Bacillus. Mol Biol Evol 2011; 29:825-35. [DOI: 10.1093/molbev/msr253] [Citation(s) in RCA: 68] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022] Open
|
7
|
Konno A, Kitagawa A, Watanabe M, Ogawa T, Shirai T. Tracing protein evolution through ancestral structures of fish galectin. Structure 2011; 19:711-21. [PMID: 21565705 DOI: 10.1016/j.str.2011.02.014] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2010] [Revised: 02/08/2011] [Accepted: 02/16/2011] [Indexed: 11/16/2022]
Abstract
Ancestral structures of fish galectins (congerins) were determined. The extant isoforms I and II of congerin are the components of a fish biological defense system and have rapidly differentiated under natural selection pressure, by which congerin I has experienced a protein-fold evolution. The dimer structure of the ancestral congerin demonstrated intermediate features of the extant isoforms. The protein-fold evolution was not observed in the ancestral structure, indicating it specifically occurred in congerin I lineage. Details of hydrogen bonding pattern at the dimer interface and the carbohydrate-binding site of the ancestor were different from the current proteins. The differences implied these proteins were under selection pressure for stabilizing dimer structure and differentiation in carbohydrate specificity. The ancestor had rather low cytotoxic activity than offspring, indicating selection was made to enhance this activity of congerins. Combined with functional analyses, the structure revealed atomic details of the differentiation process of the proteins.
Collapse
Affiliation(s)
- Ayumu Konno
- Department of Biomolecular Science, Graduate School of Life Sciences, Tohoku University, Sendai 980-8577, Japan
| | | | | | | | | |
Collapse
|
8
|
Abstract
Like other RNA viruses, coxsackievirus B5 (CVB5) exists as circulating heterogeneous populations of genetic variants. In this study, we present the reconstruction and characterization of a probable ancestral virion of CVB5. Phylogenetic analyses based on capsid protein-encoding regions (the VP1 gene of 41 clinical isolates and the entire P1 region of eight clinical isolates) of CVB5 revealed two major cocirculating lineages. Ancestral capsid sequences were inferred from sequences of these contemporary CVB5 isolates by using maximum likelihood methods. By using Bayesian phylodynamic analysis, the inferred VP1 ancestral sequence dated back to 1854 (1807 to 1898). In order to study the properties of the putative ancestral capsid, the entire ancestral P1 sequence was synthesized de novo and inserted into the replicative backbone of an infectious CVB5 cDNA clone. Characterization of the recombinant virus in cell culture showed that fully functional infectious virus particles were assembled and that these viruses displayed properties similar to those of modern isolates in terms of receptor preferences, plaque phenotypes, growth characteristics, and cell tropism. This is the first report describing the resurrection and characterization of a picornavirus with a putative ancestral capsid. Our approach, including a phylogenetics-based reconstruction of viral predecessors, could serve as a starting point for experimental studies of viral evolution and might also provide an alternative strategy for the development of vaccines.
Collapse
|
9
|
Konno A, Yonemaru S, Kitagawa A, Muramoto K, Shirai T, Ogawa T. Protein engineering of conger eel galectins by tracing of molecular evolution using probable ancestral mutants. BMC Evol Biol 2010; 10:43. [PMID: 20152053 PMCID: PMC2843614 DOI: 10.1186/1471-2148-10-43] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2009] [Accepted: 02/14/2010] [Indexed: 01/10/2023] Open
Abstract
Background Conger eel galectins, congerin I (ConI) and congerin II (ConII), show the different molecular characteristics resulting from accelerating evolution. We recently reconstructed a probable ancestral form of congerins, Con-anc. It showed properties similar to those of ConII in terms of thermostability and carbohydrate recognition specificity, although it shares a higher sequence similarity with ConI than ConII. Results In this study, we have focused on the different amino acid residues between Con-anc and ConI, and have performed the protein engineering of Con-anc through site-directed mutagenesis, followed by the molecular evolution analysis of the mutants. This approach revealed the functional importance of loop structures of congerins: (1) N- and C-terminal and loop 5 regions that are involved in conferring a high thermostability to ConI; (2) loops 3, 5, and 6 that are responsible for stronger binding of ConI to most sugars; and (3) loops 5 and 6, and Thr38 residue in loop 3 contribute the specificity of ConI toward lacto-N-fucopentaose-containing sugars. Conclusions Thus, this methodology, with tracing of the molecular evolution using ancestral mutants, is a powerful tool for the analysis of not only the molecular evolutionary process, but also the structural elements of a protein responsible for its various functions.
Collapse
Affiliation(s)
- Ayumu Konno
- Department of Biomolecular Science, Graduate School of Life Sciences, Tohoku University, Sendai 981-8555, Japan
| | | | | | | | | | | |
Collapse
|
10
|
Rich RL, Myszka DG. Survey of the year 2007 commercial optical biosensor literature. J Mol Recognit 2008; 21:355-400. [DOI: 10.1002/jmr.928] [Citation(s) in RCA: 144] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
|