1
Gado JE, Harrison BE, Sandgren M, Ståhlberg J, Beckham GT, Payne CM. Machine learning reveals sequence-function relationships in family 7 glycoside hydrolases. J Biol Chem 2021; 297:100931. [PMID: 34216620 PMCID: PMC8329511 DOI: 10.1016/j.jbc.2021.100931] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Received: 11/05/2020] [Revised: 06/18/2021] [Accepted: 06/29/2021] [Indexed: 11/28/2022]
Abstract
Family 7 glycoside hydrolases (GH7) are among the principal enzymes for cellulose degradation in nature and industrially. These enzymes are often bimodular, including a catalytic domain and carbohydrate-binding module (CBM) attached via a flexible linker, and exhibit an active site that binds cello-oligomers of up to ten glucosyl moieties. GH7 cellulases consist of two major subtypes: cellobiohydrolases (CBH) and endoglucanases (EG). Despite the critical importance of GH7 enzymes, there remain gaps in our understanding of how GH7 sequence and structure relate to function. Here, we employed machine learning to gain data-driven insights into relationships between sequence, structure, and function across the GH7 family. Machine-learning models, trained only on the number of residues in the active-site loops as features, were able to discriminate GH7 CBHs and EGs with up to 99% accuracy, demonstrating that the lengths of loops A4, B2, B3, and B4 strongly correlate with functional subtype across the GH7 family. Classification rules were derived such that specific residues at 42 different sequence positions each predicted the functional subtype with accuracies surpassing 87%. A random forest model trained on residues at 19 positions in the catalytic domain predicted the presence of a CBM with 89.5% accuracy. Our machine learning results recapitulate, as top-performing features, a substantial number of the sequence positions determined by previous experimental studies to play vital roles in GH7 activity. We surmise that the yet-to-be-explored sequence positions among the top-performing features also contribute to GH7 functional variation and may be exploited to understand and manipulate function.
Affiliation(s)
- Japheth E Gado
  - Department of Chemical and Materials Engineering, University of Kentucky, Lexington, Kentucky, USA; Renewable Resources and Enabling Sciences Center, National Renewable Energy Laboratory, Golden, Colorado, USA
- Brent E Harrison
  - Department of Computer Science, University of Kentucky, Lexington, Kentucky, USA
- Mats Sandgren
  - Department of Molecular Sciences, Swedish University of Agricultural Sciences, Uppsala, Sweden
- Jerry Ståhlberg
  - Department of Molecular Sciences, Swedish University of Agricultural Sciences, Uppsala, Sweden
- Gregg T Beckham
  - Renewable Resources and Enabling Sciences Center, National Renewable Energy Laboratory, Golden, Colorado, USA
- Christina M Payne
  - Department of Chemical and Materials Engineering, University of Kentucky, Lexington, Kentucky, USA
2
Molecular recognition in the product site of cellobiohydrolase Cel7A regulates processive step length. Biochem J 2020; 477:99-110. [PMID: 31816027 DOI: 10.1042/bcj20190770] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Received: 10/16/2019] [Revised: 12/06/2019] [Accepted: 12/09/2019] [Indexed: 11/17/2022]
Abstract
Cellobiohydrolase Cel7A is an industrially important enzyme that breaks down cellulose by a complex processive mechanism. The enzyme threads the reducing end of a cellulose strand into its tunnel-shaped catalytic domain and progresses along the strand while sequentially releasing the disaccharide cellobiose. While some molecular details of this intricate process have emerged, general structure-function relationships for Cel7A remain poorly elucidated. One interesting aspect is the occurrence of particularly strong ligand interactions in the product binding site. In this work, we analyze these interactions in Cel7A from Trichoderma reesei with special emphasis on the Arg251 and Arg394 residues. We performed extensive biochemical characterization of enzymes mutated at these two positions and showed that the arginine residues contribute strongly to product binding. Specifically, ∼50% of the total standard free energy of product binding could be ascribed to four hydrogen bonds to Arg251 and Arg394, which had previously been identified in crystal structures. Mutation of either Arg251 or Arg394 lowered product inhibition of Cel7A, but at the same time altered the enzyme's product profile and resulted in a ∼50% reduction in both processivity and hydrolytic activity. The position of the two arginine residues closely matches the two-fold screw axis symmetry of the substrate, and this energetically favors the productive enzyme-substrate complex. Our results indicate that the strong and specific ligand interactions of Arg251 and Arg394 provide a simple proofreading system that controls the step length during consecutive hydrolysis and minimizes dead time associated with transient, non-productive complexes.
3
Schiano‐di‐Cola C, Kołaczkowski B, Sørensen TH, Christensen SJ, Cavaleiro AM, Windahl MS, Borch K, Morth JP, Westh P. Structural and biochemical characterization of a family 7 highly thermostable endoglucanase from the fungus Rasamsonia emersonii. FEBS J 2019; 287:2577-2596. [DOI: 10.1111/febs.15151] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Received: 09/13/2019] [Revised: 11/01/2019] [Accepted: 11/20/2019] [Indexed: 01/21/2023]
Affiliation(s)
- Trine Holst Sørensen
  - Department of Science and Environment, Roskilde University, Denmark
  - Novozymes A/S, Lyngby, Denmark
- Michael Skovbo Windahl
  - Department of Science and Environment, Roskilde University, Denmark
  - Novozymes A/S, Lyngby, Denmark
- Jens Preben Morth
  - Department of Biotechnology and Biomedicine, Technical University of Denmark, Lyngby, Denmark
- Peter Westh
  - Department of Science and Environment, Roskilde University, Denmark
  - Department of Biotechnology and Biomedicine, Technical University of Denmark, Lyngby, Denmark
4
Gesteira TF, Coulson-Thomas VJ. Structural basis of oligosaccharide processing by glycosaminoglycan sulfotransferases. Glycobiology 2019; 28:885-897. [PMID: 29878110 DOI: 10.1093/glycob/cwy055] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Received: 02/01/2018] [Accepted: 06/06/2018] [Indexed: 02/04/2023]
Abstract
Heparan sulfate (HS) is a sulfated polysaccharide that plays a key role in morphogenesis, physiology and pathogenesis. The biosynthesis of HS takes place in the Golgi apparatus by a group of enzymes that polymerize, epimerize and sulfate the sugar chain. This biosynthetic process introduces varying degrees of sulfate substitution, which are tightly regulated and directly dictate binding specificity to different cytokines, morphogens and growth factors. Here, we report the use of molecular dynamics simulations to investigate the dynamics of substrate recognition by two glycosaminoglycan (GAG) sulfotransferases, N-deacetylase-N-sulfotransferase and 2-O-sulfotransferase, of the HS chain during the biosynthetic process. We performed multiple simulations of the binding of the sulfotransferase domains to both the HS oligosaccharide substrate and the sulfate donor, 3'-phosphoadenosine-5'-phosphosulfate. Analysis of extended simulations provides detailed and useful insights into the atomic interactions that are at work during oligosaccharide processing. The fast information matching method was used to detect the enzyme global dynamics and to predict the pairwise contacts of residues responsible for GAG-enzyme binding and unbinding. The correlation between HS displacement and the location of the modified GAG chain was calculated, indicating a possible route for HS and heparin during sulfotransferase processing. Our data also show that sulfotransferases contain conserved, interspaced, positively charged amino acid residues that form a patch which controls the protein-GAG binding equilibrium. Together, our findings provide further understanding of the fine-tuned, complex mechanism of GAG biosynthesis. Our findings can also be extrapolated to other systems for calculating rates of protein-GAG binding.
Affiliation(s)
- Tarsis F Gesteira
  - College of Optometry, University of Houston, 4901 Calhoun Rd, Houston, TX, USA; Department of Biochemistry, Universidade Federal de São Paulo, Rua Três de Maio, 100 - 6o andar, 04044-020 São Paulo, SP, Brazil
5
Rabinovich ML, Melnik MS, Herner ML, Voznyi YV, Vasilchenko LG. Predominant Nonproductive Substrate Binding by Fungal Cellobiohydrolase I and Implications for Activity Improvement. Biotechnol J 2018; 14:e1700712. [PMID: 29781240 DOI: 10.1002/biot.201700712] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Received: 11/21/2017] [Revised: 05/08/2018] [Indexed: 12/20/2022]
Abstract
Enzymatic conversion of cellulose, the most abundant renewable source of organic compounds, to fermentable sugars is attractive for production of green fuels and chemicals. The major component of industrial enzyme systems, cellobiohydrolase I from Hypocrea jecorina (Trichoderma reesei) (HjCel7A), processively splits disaccharide units from the reducing ends of tightly packed cellulose chains. HjCel7A consists of a catalytic domain (CD) and a carbohydrate-binding module (CBM) separated by a linker peptide. A tunnel-shaped substrate-binding site in the CD includes nine subsites for β-D-glucose units, seven of which (-7 to -1) precede the catalytic center. Low catalytic activity of Cel7A is the bottleneck and the primary target for improvement. Here it is shown for the first time that, in spite of the much lower apparent kcat of HjCel7A in the hydrolysis of β-1,4-glucosidic linkages in the fluorogenic cellotetra- and -pentaose compared to the structurally related endoglucanase I (HjCel7B), the specificity constants (catalytic efficiency) kcat/Km for both enzymes are almost equal in these reactions. The observed activity difference arises from strong nonproductive substrate binding by HjCel7A, which is particularly significant for MU-β-cellotetraose (MUG4). Interaction of substrates with subsites -6 and -5, proximal to the nonconserved Gln101 residue in HjCel7A, decreases Km,ap by >1500 times. HjCel7A can bind nonproductively to the cellulose surface with Kd ≈ 2-9 nM via the CBM and the CD, which captures six terminal glucose units of a cellulose chain. Decomposition of this nonproductive complex can determine the rate of cellulose conversion. MUG4 is a promising substrate for selecting active cellobiohydrolase I variants with reduced nonproductive substrate binding.
Affiliation(s)
- Mikhail L Rabinovich
  - Bach Institute of Biochemistry, Research Center of Biotechnology of the Russian Academy of Sciences, 33, bld. 2 Leninsky Ave., Moscow 119071, Russia
- Maria S Melnik
  - Bach Institute of Biochemistry, Research Center of Biotechnology of the Russian Academy of Sciences, 33, bld. 2 Leninsky Ave., Moscow 119071, Russia
- Mikhail L Herner
  - Bach Institute of Biochemistry, Research Center of Biotechnology of the Russian Academy of Sciences, 33, bld. 2 Leninsky Ave., Moscow 119071, Russia
- Yakov V Voznyi
  - Zelinsky Institute of Organic Chemistry, Russian Academy of Sciences, Moscow 119991, Russia
- Lilia G Vasilchenko
  - Bach Institute of Biochemistry, Research Center of Biotechnology of the Russian Academy of Sciences, 33, bld. 2 Leninsky Ave., Moscow 119071, Russia
6
Kari J, Kont R, Borch K, Buskov S, Olsen JP, Cruys-Bagger N, Väljamäe P, Westh P. Anomeric Selectivity and Product Profile of a Processive Cellulase. Biochemistry 2016; 56:167-178. [PMID: 28026938 DOI: 10.1021/acs.biochem.6b00636] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Indexed: 11/30/2022]
Abstract
Cellobiohydrolases (CBHs) make up an important group of enzymes for both natural carbon cycling and industrial deconstruction of lignocellulosic biomass. The consecutive hydrolysis of one cellulose strand relies on an intricate pattern of enzyme-substrate interactions in the long, tunnel-shaped binding site of the CBH. In this work, we have investigated the initial complexation mode with cellulose of the most thoroughly studied CBH, Cel7A from Hypocrea jecorina (HjCel7A). We found that HjCel7A predominantly produces glucose when it initiates a processive run on insoluble microcrystalline cellulose, confirming the validity of an even and odd product ratio as an estimate of processivity. Moreover, the glucose released from cellulose was predominantly α-glucose. A link between the initial binding mode of the enzyme and the reducing end configuration was investigated by inhibition studies with the two anomers of cellobiose. A clear preference for β-cellobiose in product binding site +2 was observed for HjCel7A, but not for the homologous endoglucanase, HjCel7B. Possible relationships between this anomeric preference in the product site and the prevalence of odd-numbered initial-cut products are discussed, and a correlation between processivity and anomer selectivity is proposed.
Affiliation(s)
- Jeppe Kari
  - Research Unit for Functional Biomaterials, Roskilde University, Roskilde, Denmark
- Riin Kont
  - Institute of Molecular and Cell Biology, University of Tartu, Tartu, Estonia
- Kim Borch
  - Novozymes A/S, Krogshøjvej 36, DK-2880 Bagsværd, Denmark
- Steen Buskov
  - Novozymes A/S, Krogshøjvej 36, DK-2880 Bagsværd, Denmark
- Johan Pelck Olsen
  - Research Unit for Functional Biomaterials, Roskilde University, Roskilde, Denmark
- Priit Väljamäe
  - Institute of Molecular and Cell Biology, University of Tartu, Tartu, Estonia
- Peter Westh
  - Research Unit for Functional Biomaterials, Roskilde University, Roskilde, Denmark