1
|
Redelings BD, Holmes I, Lunter G, Pupko T, Anisimova M. Insertions and Deletions: Computational Methods, Evolutionary Dynamics, and Biological Applications. Mol Biol Evol 2024; 41:msae177. [PMID: 39172750 PMCID: PMC11385596 DOI: 10.1093/molbev/msae177] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2024] [Revised: 07/02/2024] [Accepted: 07/09/2024] [Indexed: 08/24/2024] Open
Abstract
Insertions and deletions constitute the second most important source of natural genomic variation. Insertions and deletions make up to 25% of genomic variants in humans and are involved in complex evolutionary processes including genomic rearrangements, adaptation, and speciation. Recent advances in long-read sequencing technologies allow detailed inference of insertions and deletion variation in species and populations. Yet, despite their importance, evolutionary studies have traditionally ignored or mishandled insertions and deletions due to a lack of comprehensive methodologies and statistical models of insertions and deletion dynamics. Here, we discuss methods for describing insertions and deletion variation and modeling insertions and deletions over evolutionary time. We provide practical advice for tackling insertions and deletions in genomic sequences and illustrate our discussion with examples of insertions and deletion-induced effects in human and other natural populations and their contribution to evolutionary processes. We outline promising directions for future developments in statistical methodologies that would allow researchers to analyze insertions and deletion variation and their effects in large genomic data sets and to incorporate insertions and deletions in evolutionary inference.
Collapse
Affiliation(s)
| | - Ian Holmes
- Department of Bioengineering, University of California, Berkeley, CA 94720, USA
- Calico Life Sciences LLC, South San Francisco, CA 94080, USA
| | - Gerton Lunter
- Department of Epidemiology, University Medical Center Groningen, University of Groningen, Groningen 9713 GZ, The Netherlands
| | - Tal Pupko
- The Shmunis School of Biomedicine and Cancer Research, George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv 6997801, Israel
| | - Maria Anisimova
- Institute of Computational Life Sciences, Zurich University of Applied Sciences, Wädenswil, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| |
Collapse
|
2
|
Iglhaut C, Pečerska J, Gil M, Anisimova M. Please Mind the Gap: Indel-Aware Parsimony for Fast and Accurate Ancestral Sequence Reconstruction and Multiple Sequence Alignment Including Long Indels. Mol Biol Evol 2024; 41:msae109. [PMID: 38842253 PMCID: PMC11221656 DOI: 10.1093/molbev/msae109] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2024] [Revised: 05/30/2024] [Accepted: 06/03/2024] [Indexed: 06/07/2024] Open
Abstract
Despite having important biological implications, insertion, and deletion (indel) events are often disregarded or mishandled during phylogenetic inference. In multiple sequence alignment, indels are represented as gaps and are estimated without considering the distinct evolutionary history of insertions and deletions. Consequently, indels are usually excluded from subsequent inference steps, such as ancestral sequence reconstruction and phylogenetic tree search. Here, we introduce indel-aware parsimony (indelMaP), a novel way to treat gaps under the parsimony criterion by considering insertions and deletions as separate evolutionary events and accounting for long indels. By identifying the precise location of an evolutionary event on the tree, we can separate overlapping indel events and use affine gap penalties for long indel modeling. Our indel-aware approach harnesses the phylogenetic signal from indels, including them into all inference stages. Validation and comparison to state-of-the-art inference tools on simulated data show that indelMaP is most suitable for densely sampled datasets with closely to moderately related sequences, where it can reach alignment quality comparable to probabilistic methods and accurately infer ancestral sequences, including indel patterns. Due to its remarkable speed, our method is well suited for epidemiological datasets, eliminating the need for downsampling and enabling the exploitation of the additional information provided by dense taxonomic sampling. Moreover, indelMaP offers new insights into the indel patterns of biologically significant sequences and advances our understanding of genetic variability by considering gaps as crucial evolutionary signals rather than mere artefacts.
Collapse
Affiliation(s)
- Clara Iglhaut
- Institute of Computational Life Science, Zurich University of Applied Science, Wädenswil, Switzerland
- Faculty of Mathematics and Science, University of Zurich, Zürich, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Jūlija Pečerska
- Institute of Computational Life Science, Zurich University of Applied Science, Wädenswil, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Manuel Gil
- Institute of Computational Life Science, Zurich University of Applied Science, Wädenswil, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Maria Anisimova
- Institute of Computational Life Science, Zurich University of Applied Science, Wädenswil, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| |
Collapse
|
3
|
Wilson LA, Melville JN, Pedroso MM, Krco S, Hoelzle R, Zaugg J, Southam G, Virdis B, Evans P, Supper J, Harmer JR, Tyson G, Clark A, Schenk G, Bernhardt PV. Kinetic, electrochemical and spectral characterization of bacterial and archaeal rusticyanins; unexpected stability issues and consequences for applications in biotechnology. J Inorg Biochem 2024; 256:112539. [PMID: 38593609 DOI: 10.1016/j.jinorgbio.2024.112539] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2024] [Revised: 03/21/2024] [Accepted: 03/21/2024] [Indexed: 04/11/2024]
Abstract
Motivated by the ambition to establish an enzyme-driven bioleaching pathway for copper extraction, properties of the Type-1 copper protein rusticyanin from Acidithiobacillus ferrooxidans (AfR) were compared with those from an ancestral form of this enzyme (N0) and an archaeal enzyme identified in Ferroplasma acidiphilum (FaR). While both N0 and FaR show redox potentials similar to that of AfR their electron transport rates were significantly slower. The lack of a correlation between the redox potentials and electron transfer rates indicates that AfR and its associated electron transfer chain evolved to specifically facilitate the efficient conversion of the energy of iron oxidation to ATP formation. In F. acidiphilum this pathway is not as efficient unless it is up-regulated by an as of yet unknown mechanism. In addition, while the electrochemical properties of AfR were consistent with previous data, previously unreported behavior was found leading to a form that is associated with a partially unfolded form of the protein. The cyclic voltammetry (CV) response of AfR immobilized onto an electrode showed limited stability, which may be connected to the presence of the partially unfolded state of this protein. Insights gained in this study may thus inform the engineering of optimized rusticyanin variants for bioleaching processes as well as enzyme-catalyzed solubilization of copper-containing ores such as chalcopyrite.
Collapse
Affiliation(s)
- Liam A Wilson
- School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, QLD 4072, Australia
| | - Jamie N Melville
- School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, QLD 4072, Australia
| | - Marcelo M Pedroso
- School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, QLD 4072, Australia
| | - Stefan Krco
- School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, QLD 4072, Australia
| | - Robert Hoelzle
- School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, QLD 4072, Australia; Australian Centre for Ecogenomics, The University of Queensland, Brisbane, QLD 4072, Australia
| | - Julian Zaugg
- School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, QLD 4072, Australia; Australian Centre for Ecogenomics, The University of Queensland, Brisbane, QLD 4072, Australia
| | - Gordon Southam
- School of the Environment, The University of Queensland, Brisbane, QLD 4072, Australia
| | - Bernardino Virdis
- Australian Centre for Water and Environmental Biotechnology, The University of Queensland, Brisbane, QLD 4072, Australia
| | - Paul Evans
- School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, QLD 4072, Australia; Australian Centre for Ecogenomics, The University of Queensland, Brisbane, QLD 4072, Australia
| | - Jenna Supper
- School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, QLD 4072, Australia; Australian Centre for Ecogenomics, The University of Queensland, Brisbane, QLD 4072, Australia
| | - Jeffrey R Harmer
- Australian Institute for Bioengineering and Nanotechnology, The University of Queensland, Brisbane, QLD 4072, Australia
| | - Gene Tyson
- Centre for Microbiome Research, Queensland University of Technology, Woolloongabba, QLD 4102, Australia
| | - Alice Clark
- Sustainable Minerals Institute, The University of Queensland, Brisbane, QLD 4072, Australia
| | - Gerhard Schenk
- School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, QLD 4072, Australia; Australian Centre for Ecogenomics, The University of Queensland, Brisbane, QLD 4072, Australia; Australian Institute for Bioengineering and Nanotechnology, The University of Queensland, Brisbane, QLD 4072, Australia; Sustainable Minerals Institute, The University of Queensland, Brisbane, QLD 4072, Australia.
| | - Paul V Bernhardt
- School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, QLD 4072, Australia.
| |
Collapse
|
4
|
Tule S, Foley G, Zhao C, Forbes M, Bodén M. Optimal phylogenetic reconstruction of insertion and deletion events. Bioinformatics 2024; 40:i277-i286. [PMID: 38940131 PMCID: PMC11211827 DOI: 10.1093/bioinformatics/btae254] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/29/2024] Open
Abstract
MOTIVATION Insertions and deletions (indels) influence the genetic code in fundamentally distinct ways from substitutions, significantly impacting gene product structure and function. Despite their influence, the evolutionary history of indels is often neglected in phylogenetic tree inference and ancestral sequence reconstruction, hindering efforts to comprehend biological diversity determinants and engineer variants for medical and industrial applications. RESULTS We frame determining the optimal history of indel events as a single Mixed-Integer Programming (MIP) problem, across all branch points in a phylogenetic tree adhering to topological constraints, and all sites implied by a given set of aligned, extant sequences. By disentangling the impact on ancestral sequences at each branch point, this approach identifies the minimal indel events that jointly explain the diversity in sequences mapped to the tips of that tree. MIP can recover alternate optimal indel histories, if available. We evaluated MIP for indel inference on a dataset comprising 15 real phylogenetic trees associated with protein families ranging from 165 to 2000 extant sequences, and on 60 synthetic trees at comparable scales of data and reflecting realistic rates of mutation. Across relevant metrics, MIP outperformed alternative parsimony-based approaches and reported the fewest indel events, on par or below their occurrence in synthetic datasets. MIP offers a rational justification for indel patterns in extant sequences; importantly, it uniquely identifies global optima on complex protein data sets without making unrealistic assumptions of independence or evolutionary underpinnings, promising a deeper understanding of molecular evolution and aiding novel protein design. AVAILABILITY AND IMPLEMENTATION The implementation is available via GitHub at https://github.com/santule/indelmip.
Collapse
Affiliation(s)
- Sanjana Tule
- School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, QLD 4072, Australia
| | - Gabriel Foley
- School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, QLD 4072, Australia
| | - Chongting Zhao
- School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, QLD 4072, Australia
| | - Michael Forbes
- School of Mathematics and Physics, The University of Queensland, Brisbane, QLD 4072, Australia
| | - Mikael Bodén
- School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, QLD 4072, Australia
| |
Collapse
|
5
|
Meador K, Castells-Graells R, Aguirre R, Sawaya MR, Arbing MA, Sherman T, Senarathne C, Yeates TO. A suite of designed protein cages using machine learning and protein fragment-based protocols. Structure 2024; 32:751-765.e11. [PMID: 38513658 PMCID: PMC11162342 DOI: 10.1016/j.str.2024.02.017] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2023] [Revised: 01/22/2024] [Accepted: 02/23/2024] [Indexed: 03/23/2024]
Abstract
Designed protein cages and related materials provide unique opportunities for applications in biotechnology and medicine, but their creation remains challenging. Here, we apply computational approaches to design a suite of tetrahedrally symmetric, self-assembling protein cages. For the generation of docked conformations, we emphasize a protein fragment-based approach, while for sequence design of the de novo interface, a comparison of knowledge-based and machine learning protocols highlights the power and increased experimental success achieved using ProteinMPNN. An analysis of design outcomes provides insights for improving interface design protocols, including prioritizing fragment-based motifs, balancing interface hydrophobicity and polarity, and identifying preferred polar contact patterns. In all, we report five structures for seven protein cages, along with two structures of intermediate assemblies, with the highest resolution reaching 2.0 Å using cryo-EM. This set of designed cages adds substantially to the body of available protein nanoparticles, and to methodologies for their creation.
Collapse
Affiliation(s)
- Kyle Meador
- Department of Chemistry and Biochemistry, University of California, Los Angeles, CA 90095, USA
| | | | - Roman Aguirre
- Department of Chemistry and Biochemistry, University of California, Los Angeles, CA 90095, USA
| | - Michael R Sawaya
- UCLA-DOE Institute for Genomics and Proteomics, Los Angeles, CA 90095, USA
| | - Mark A Arbing
- UCLA-DOE Institute for Genomics and Proteomics, Los Angeles, CA 90095, USA
| | - Trent Sherman
- Department of Chemistry and Biochemistry, University of California, Los Angeles, CA 90095, USA
| | - Chethaka Senarathne
- Department of Chemistry and Biochemistry, University of California, Los Angeles, CA 90095, USA
| | - Todd O Yeates
- Department of Chemistry and Biochemistry, University of California, Los Angeles, CA 90095, USA; UCLA-DOE Institute for Genomics and Proteomics, Los Angeles, CA 90095, USA.
| |
Collapse
|
6
|
Zhang S, Meng F, Pan X, Qiu X, Li C, Lu S. Chromosome-level genome assembly of Prunella vulgaris L. provides insights into pentacyclic triterpenoid biosynthesis. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2024; 118:731-752. [PMID: 38226777 DOI: 10.1111/tpj.16629] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/01/2023] [Revised: 12/08/2023] [Accepted: 12/29/2023] [Indexed: 01/17/2024]
Abstract
Prunella vulgaris is one of the bestselling and widely used medicinal herbs. It is recorded as an ace medicine for cleansing and protecting the liver in Chinese Pharmacopoeia and has been used as the main constitutions of many herbal tea formulas in China for centuries. It is also a traditional folk medicine in Europe and other countries of Asia. Pentacyclic triterpenoids are a major class of bioactive compounds produced in P. vulgaris. However, their biosynthetic mechanism remains to be elucidated. Here, we report a chromosome-level reference genome of P. vulgaris using an approach combining Illumina, ONT, and Hi-C technologies. It is 671.95 Mb in size with a scaffold N50 of 49.10 Mb and a complete BUSCO of 98.45%. About 98.31% of the sequence was anchored into 14 pseudochromosomes. Comparative genome analysis revealed a recent WGD in P. vulgaris. Genome-wide analysis identified 35 932 protein-coding genes (PCGs), of which 59 encode enzymes involved in 2,3-oxidosqualene biosynthesis. In addition, 10 PvOSC, 358 PvCYP, and 177 PvUGT genes were identified, of which five PvOSCs, 25 PvCYPs, and 9 PvUGTs were predicted to be involved in the biosynthesis of pentacyclic triterpenoids. Biochemical activity assay of PvOSC2, PvOSC4, and PvOSC6 recombinant proteins showed that they were mixed amyrin synthase (MAS), lupeol synthase (LUS), and β-amyrin synthase (BAS), respectively. The results provide a solid foundation for further elucidating the biosynthetic mechanism of pentacyclic triterpenoids in P. vulgaris.
Collapse
Affiliation(s)
- Sixuan Zhang
- Key Lab of Chinese Medicine Resources Conservation, State Administration of Traditional Chinese Medicine of the People's Republic of China, Institute of Medicinal Plant Development, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, 100193, China
- Engineering Research Center of Chinese Medicine Resource, Ministry of Education, Beijing, 100193, China
| | - Fanqi Meng
- Key Lab of Chinese Medicine Resources Conservation, State Administration of Traditional Chinese Medicine of the People's Republic of China, Institute of Medicinal Plant Development, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, 100193, China
- Engineering Research Center of Chinese Medicine Resource, Ministry of Education, Beijing, 100193, China
| | - Xian Pan
- Key Lab of Chinese Medicine Resources Conservation, State Administration of Traditional Chinese Medicine of the People's Republic of China, Institute of Medicinal Plant Development, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, 100193, China
- Engineering Research Center of Chinese Medicine Resource, Ministry of Education, Beijing, 100193, China
| | - Xiaoxiao Qiu
- Key Lab of Chinese Medicine Resources Conservation, State Administration of Traditional Chinese Medicine of the People's Republic of China, Institute of Medicinal Plant Development, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, 100193, China
- Engineering Research Center of Chinese Medicine Resource, Ministry of Education, Beijing, 100193, China
| | - Caili Li
- Key Lab of Chinese Medicine Resources Conservation, State Administration of Traditional Chinese Medicine of the People's Republic of China, Institute of Medicinal Plant Development, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, 100193, China
- Engineering Research Center of Chinese Medicine Resource, Ministry of Education, Beijing, 100193, China
| | - Shanfa Lu
- Key Lab of Chinese Medicine Resources Conservation, State Administration of Traditional Chinese Medicine of the People's Republic of China, Institute of Medicinal Plant Development, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing, 100193, China
- Engineering Research Center of Chinese Medicine Resource, Ministry of Education, Beijing, 100193, China
| |
Collapse
|
7
|
Jones BS, Ross CM, Foley G, Pozhydaieva N, Sharratt JW, Kress N, Seibt LS, Thomson RES, Gumulya Y, Hayes MA, Gillam EMJ, Flitsch SL. Engineering Biocatalysts for the C-H Activation of Fatty Acids by Ancestral Sequence Reconstruction. Angew Chem Int Ed Engl 2024; 63:e202314869. [PMID: 38163289 DOI: 10.1002/anie.202314869] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2023] [Revised: 12/28/2023] [Accepted: 12/29/2023] [Indexed: 01/03/2024]
Abstract
Selective, one-step C-H activation of fatty acids from biomass is an attractive concept in sustainable chemistry. Biocatalysis has shown promise for generating high-value hydroxy acids, but to date enzyme discovery has relied on laborious screening and produced limited hits, which predominantly oxidise the subterminal positions of fatty acids. Herein we show that ancestral sequence reconstruction (ASR) is an effective tool to explore the sequence-activity landscape of a family of multidomain, self-sufficient P450 monooxygenases. We resurrected 11 catalytically active CYP116B ancestors, each with a unique regioselectivity fingerprint that varied from subterminal in the older ancestors to mid-chain in the lineage leading to the extant, P450-TT. In lineages leading to extant enzymes in thermophiles, thermostability increased from ancestral to extant forms, as expected if thermophily had arisen de novo. Our studies show that ASR can be applied to multidomain enzymes to develop active, self-sufficient monooxygenases as regioselective biocatalysts for fatty acid hydroxylation.
Collapse
Affiliation(s)
- Bethan S Jones
- School of Chemistry, The University of Manchester, Manchester Institute of Biotechnology (MIB), 131 Princess Street, Manchester, M1 7DN, UK
| | - Connie M Ross
- School of Chemistry and Molecular Biosciences, The University of Queensland, St. Lucia, Brisbane, 4072, Australia
| | - Gabriel Foley
- School of Chemistry and Molecular Biosciences, The University of Queensland, St. Lucia, Brisbane, 4072, Australia
| | - Nadiia Pozhydaieva
- School of Chemistry, The University of Manchester, Manchester Institute of Biotechnology (MIB), 131 Princess Street, Manchester, M1 7DN, UK
| | - Joseph W Sharratt
- School of Chemistry, The University of Manchester, Manchester Institute of Biotechnology (MIB), 131 Princess Street, Manchester, M1 7DN, UK
| | - Nico Kress
- School of Chemistry, The University of Manchester, Manchester Institute of Biotechnology (MIB), 131 Princess Street, Manchester, M1 7DN, UK
| | - Lisa S Seibt
- School of Chemistry, The University of Manchester, Manchester Institute of Biotechnology (MIB), 131 Princess Street, Manchester, M1 7DN, UK
| | - Raine E S Thomson
- School of Chemistry and Molecular Biosciences, The University of Queensland, St. Lucia, Brisbane, 4072, Australia
| | - Yosephine Gumulya
- School of Chemistry and Molecular Biosciences, The University of Queensland, St. Lucia, Brisbane, 4072, Australia
| | - Martin A Hayes
- Compound Synthesis and Management, Discovery Sciences, R&D, AstraZeneca, Gothenburg, SE
| | - Elizabeth M J Gillam
- School of Chemistry and Molecular Biosciences, The University of Queensland, St. Lucia, Brisbane, 4072, Australia
| | - Sabine L Flitsch
- School of Chemistry, The University of Manchester, Manchester Institute of Biotechnology (MIB), 131 Princess Street, Manchester, M1 7DN, UK
| |
Collapse
|
8
|
Johnson SR, Fu X, Viknander S, Goldin C, Monaco S, Zelezniak A, Yang KK. Computational scoring and experimental evaluation of enzymes generated by neural networks. Nat Biotechnol 2024:10.1038/s41587-024-02214-2. [PMID: 38653796 DOI: 10.1038/s41587-024-02214-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2023] [Accepted: 03/20/2024] [Indexed: 04/25/2024]
Abstract
In recent years, generative protein sequence models have been developed to sample novel sequences. However, predicting whether generated proteins will fold and function remains challenging. We evaluate a set of 20 diverse computational metrics to assess the quality of enzyme sequences produced by three contrasting generative models: ancestral sequence reconstruction, a generative adversarial network and a protein language model. Focusing on two enzyme families, we expressed and purified over 500 natural and generated sequences with 70-90% identity to the most similar natural sequences to benchmark computational metrics for predicting in vitro enzyme activity. Over three rounds of experiments, we developed a computational filter that improved the rate of experimental success by 50-150%. The proposed metrics and models will drive protein engineering research by serving as a benchmark for generative protein sequence models and helping to select active variants for experimental testing.
Collapse
Affiliation(s)
| | - Xiaozhi Fu
- Department of Life Sciences, Chalmers University of Technology, Gothenburg, Sweden
| | - Sandra Viknander
- Department of Life Sciences, Chalmers University of Technology, Gothenburg, Sweden
| | - Clara Goldin
- Department of Life Sciences, Chalmers University of Technology, Gothenburg, Sweden
| | | | - Aleksej Zelezniak
- Department of Life Sciences, Chalmers University of Technology, Gothenburg, Sweden.
- Institute of Biotechnology, Life Sciences Centre, Vilnius University, Vilnius, Lithuania.
- Randall Centre for Cell & Molecular Biophysics, King's College London, Guy's Campus, London, UK.
| | | |
Collapse
|
9
|
Myrtollari K, Calderini E, Kracher D, Schöngaßner T, Galušić S, Slavica A, Taden A, Mokos D, Schrüfer A, Wirnsberger G, Gruber K, Daniel B, Kourist R. Stability Increase of Phenolic Acid Decarboxylase by a Combination of Protein and Solvent Engineering Unlocks Applications at Elevated Temperatures. ACS SUSTAINABLE CHEMISTRY & ENGINEERING 2024; 12:3575-3584. [PMID: 38456190 PMCID: PMC10915792 DOI: 10.1021/acssuschemeng.3c06513] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/09/2023] [Revised: 12/16/2023] [Accepted: 01/25/2024] [Indexed: 03/09/2024]
Abstract
Enzymatic decarboxylation of biobased hydroxycinnamic acids gives access to phenolic styrenes for adhesive production. Phenolic acid decarboxylases are proficient enzymes that have been applied in aqueous systems, organic solvents, biphasic systems, and deep eutectic solvents, which makes stability a key feature. Stabilization of the enzyme would increase the total turnover number and thus reduce the energy consumption and waste accumulation associated with biocatalyst production. In this study, we used ancestral sequence reconstruction to generate thermostable decarboxylases. Investigation of a set of 16 ancestors resulted in the identification of a variant with an unfolding temperature of 78.1 °C and a half-life time of 45 h at 60 °C. Crystal structures were determined for three selected ancestors. Structural attributes were calculated to fit different regression models for predicting the thermal stability of variants that have not yet been experimentally explored. The models rely on hydrophobic clusters, salt bridges, hydrogen bonds, and surface properties and can identify more stable proteins out of a pool of candidates. Further stabilization was achieved by the application of mixtures of natural deep eutectic solvents and buffers. Our approach is a straightforward option for enhancing the industrial application of the decarboxylation process.
Collapse
Affiliation(s)
- Kamela Myrtollari
- Institute
of Molecular Biotechnology, Graz University
of Technology, Petersgasse
14, 8010 Graz, Austria
- Austrian
Centre of Industrial Biotechnology, ACIB GmbH, Petersgasse 14/1, 8010 Graz, Austria
- Adhesive
Technologies, Henkel AG & Co. KGaA, Henkelstr. 67, 40191 Düsseldorf, Germany
| | - Elia Calderini
- Institute
of Molecular Biotechnology, Graz University
of Technology, Petersgasse
14, 8010 Graz, Austria
| | - Daniel Kracher
- Institute
of Molecular Biotechnology, Graz University
of Technology, Petersgasse
14, 8010 Graz, Austria
- BioTechMed-Graz, Mozartgasse
12/II, 8010 Graz, Austria
| | - Tobias Schöngaßner
- Institute
of Molecular Biotechnology, Graz University
of Technology, Petersgasse
14, 8010 Graz, Austria
| | - Stela Galušić
- Institute
of Molecular Biotechnology, Graz University
of Technology, Petersgasse
14, 8010 Graz, Austria
| | - Anita Slavica
- Faculty
of Food Technology and Biotechnology, Department of Biochemical Engineering, University of Zagreb, Pierottijeva 6, HR-10000 Zagreb, Croatia
| | - Andreas Taden
- Adhesive
Technologies, Henkel AG & Co. KGaA, Henkelstr. 67, 40191 Düsseldorf, Germany
| | - Daniel Mokos
- Institute
of Molecular Biosciences, University of
Graz, NAWI Graz, Humboldtstraße
50/3, 8010 Graz, Austria
| | - Anna Schrüfer
- Institute
of Molecular Biosciences, University of
Graz, NAWI Graz, Humboldtstraße
50/3, 8010 Graz, Austria
| | - Gregor Wirnsberger
- Institute
of Molecular Biosciences, University of
Graz, NAWI Graz, Humboldtstraße
50/3, 8010 Graz, Austria
| | - Karl Gruber
- BioTechMed-Graz, Mozartgasse
12/II, 8010 Graz, Austria
- Institute
of Molecular Biosciences, University of
Graz, NAWI Graz, Humboldtstraße
50/3, 8010 Graz, Austria
| | - Bastian Daniel
- BioTechMed-Graz, Mozartgasse
12/II, 8010 Graz, Austria
- Institute
of Molecular Biosciences, University of
Graz, NAWI Graz, Humboldtstraße
50/3, 8010 Graz, Austria
| | - Robert Kourist
- Institute
of Molecular Biotechnology, Graz University
of Technology, Petersgasse
14, 8010 Graz, Austria
- Austrian
Centre of Industrial Biotechnology, ACIB GmbH, Petersgasse 14/1, 8010 Graz, Austria
- BioTechMed-Graz, Mozartgasse
12/II, 8010 Graz, Austria
| |
Collapse
|
10
|
Franceus J, Rivas-Fernández JP, Lormans J, Rovira C, Desmet T. Evolution of Phosphorylase Activity in an Ancestral Glycosyltransferase. ACS Catal 2024; 14:3103-3114. [PMID: 38449530 PMCID: PMC10913872 DOI: 10.1021/acscatal.3c05819] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2023] [Revised: 01/08/2024] [Accepted: 01/11/2024] [Indexed: 03/08/2024]
Abstract
The reconstruction of ancestral sequences can offer a glimpse into the fascinating process of molecular evolution by exposing the adaptive pathways that shape the proteins found in nature today. Here, we track the evolution of the carbohydrate-active enzymes responsible for the synthesis and turnover of mannogen, a critical carbohydrate reserve in Leishmania parasites. Biochemical characterization of resurrected enzymes demonstrated that mannoside phosphorylase activity emerged in an ancestral bacterial mannosyltransferase, and later disappeared in the process of horizontal gene transfer and gene duplication in Leishmania. By shuffling through plausible historical sequence space in an ancestral mannosyltransferase, we found that mannoside phosphorylase activity could be toggled on through various combinations of mutations at positions outside of the active site. Molecular dynamics simulations showed that such mutations can affect loop rigidity and shield the active site from water molecules that disrupt key interactions, allowing α-mannose 1-phosphate to adopt a catalytically productive conformation. These findings highlight the importance of subtle distal mutations in protein evolution and suggest that the vast collection of natural glycosyltransferases may be a promising source of engineering templates for the design of tailored phosphorylases.
Collapse
Affiliation(s)
- Jorick Franceus
- Centre
for Synthetic Biology (CSB), Department of Biotechnology, Ghent University, Coupure Links 653, Ghent 9000, Belgium
| | - José Pablo Rivas-Fernández
- Departament
de Química Inorgànica i Orgànica (Secció
de Química Orgànica) and Institut de Química
Teòrica i Computacional (IQTCUB), Universitat de Barcelona, Martí i Franquès 1, Barcelona 08028, Spain
| | - Jolien Lormans
- Centre
for Synthetic Biology (CSB), Department of Biotechnology, Ghent University, Coupure Links 653, Ghent 9000, Belgium
| | - Carme Rovira
- Departament
de Química Inorgànica i Orgànica (Secció
de Química Orgànica) and Institut de Química
Teòrica i Computacional (IQTCUB), Universitat de Barcelona, Martí i Franquès 1, Barcelona 08028, Spain
- Institució
Catalana de Recerca i Estudis Avançats (ICREA), 08010 Barcelona, Spain
| | - Tom Desmet
- Centre
for Synthetic Biology (CSB), Department of Biotechnology, Ghent University, Coupure Links 653, Ghent 9000, Belgium
| |
Collapse
|
11
|
Yang J, Li FZ, Arnold FH. Opportunities and Challenges for Machine Learning-Assisted Enzyme Engineering. ACS CENTRAL SCIENCE 2024; 10:226-241. [PMID: 38435522 PMCID: PMC10906252 DOI: 10.1021/acscentsci.3c01275] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/17/2023] [Revised: 12/26/2023] [Accepted: 01/16/2024] [Indexed: 03/05/2024]
Abstract
Enzymes can be engineered at the level of their amino acid sequences to optimize key properties such as expression, stability, substrate range, and catalytic efficiency-or even to unlock new catalytic activities not found in nature. Because the search space of possible proteins is vast, enzyme engineering usually involves discovering an enzyme starting point that has some level of the desired activity followed by directed evolution to improve its "fitness" for a desired application. Recently, machine learning (ML) has emerged as a powerful tool to complement this empirical process. ML models can contribute to (1) starting point discovery by functional annotation of known protein sequences or generating novel protein sequences with desired functions and (2) navigating protein fitness landscapes for fitness optimization by learning mappings between protein sequences and their associated fitness values. In this Outlook, we explain how ML complements enzyme engineering and discuss its future potential to unlock improved engineering outcomes.
Collapse
Affiliation(s)
- Jason Yang
- Division
of Chemistry and Chemical Engineering, California
Institute of Technology, Pasadena, California 91125, United States
| | - Francesca-Zhoufan Li
- Division
of Biology and Biological Engineering, California
Institute of Technology, Pasadena, California 91125, United States
| | - Frances H. Arnold
- Division
of Chemistry and Chemical Engineering, California
Institute of Technology, Pasadena, California 91125, United States
- Division
of Biology and Biological Engineering, California
Institute of Technology, Pasadena, California 91125, United States
| |
Collapse
|
12
|
Teshima M, Sutiono S, Döring M, Beer B, Boden M, Schenk G, Sieber V. Development of a Highly Selective NAD + -Dependent Glyceraldehyde Dehydrogenase and its Application in Minimal Cell-Free Enzyme Cascades. CHEMSUSCHEM 2024; 17:e202301132. [PMID: 37872118 DOI: 10.1002/cssc.202301132] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/31/2023] [Revised: 10/18/2023] [Accepted: 10/23/2023] [Indexed: 10/25/2023]
Abstract
Anthropogenic climate change has been caused by over-exploitation of fossil fuels and CO2 emissions. To counteract this, the chemical industry has shifted its focus to sustainable chemical production and the valorization of renewable resources. However, the biggest challenges in biomanufacturing are technical efficiency and profitability. In our minimal cell-free enzyme cascade generating pyruvate as the central intermediate, the NAD+ -dependent, selective oxidation of D-glyceraldehyde was identified as a key reaction step to improve the overall cascade flux. Successive genome mining identified one candidate enzyme with 24-fold enhanced activity and another whose stability is unaffected in 10 % (v/v) ethanol, the final product of our model cascade. Semi-rational engineering improved the substrate selectivity of the enzyme up to 21-fold, thus minimizing side reactions in the one-pot enzyme cascade. The final biotransformation of D-glucose showed a continuous linear production of ethanol (via pyruvate) to a final titer of 4.9 % (v/v) with a molar product yield of 98.7 %. Due to the central role of pyruvate in diverse biotransformations, the optimized production module has great potential for broad biomanufacturing applications.
Collapse
Affiliation(s)
- Mariko Teshima
- Chair of Chemistry of Biogenic Resources, Technical University of Munich, Campus Straubing for Biotechnology and Sustainability, Schulgasse 16, 94315, Straubing, Germany
| | - Samuel Sutiono
- Chair of Chemistry of Biogenic Resources, Technical University of Munich, Campus Straubing for Biotechnology and Sustainability, Schulgasse 16, 94315, Straubing, Germany
- Current address: CarboCode Germany GmbH, Byk-Gulden-Straße 2, 78467, Constance, Germany
| | - Manuel Döring
- Chair of Chemistry of Biogenic Resources, Technical University of Munich, Campus Straubing for Biotechnology and Sustainability, Schulgasse 16, 94315, Straubing, Germany
| | - Barbara Beer
- Chair of Chemistry of Biogenic Resources, Technical University of Munich, Campus Straubing for Biotechnology and Sustainability, Schulgasse 16, 94315, Straubing, Germany
- Current address: CASCAT GmbH, Europaring 4, 94315, Straubing, Germany
| | - Mikael Boden
- School of Chemistry and Molecular Biosciences, The University of Queensland, 68 Cooper Rd, St. Lucia, 4072, Brisbane, Australia
| | - Gerhard Schenk
- School of Chemistry and Molecular Biosciences, The University of Queensland, 68 Cooper Rd, St. Lucia, 4072, Brisbane, Australia
- Australian Institute for Bioengineering and Nanotechnology, The University of Queensland, Corner of College and Cooper Rds, St. Lucia, 4072, Brisbane, Australia
- Sustainable Minerals Institute, The University of Queensland, Corner of College and Staff House Rds, St. Lucia, 4072, Brisbane, Australia
| | - Volker Sieber
- Chair of Chemistry of Biogenic Resources, Technical University of Munich, Campus Straubing for Biotechnology and Sustainability, Schulgasse 16, 94315, Straubing, Germany
- School of Chemistry and Molecular Biosciences, The University of Queensland, 68 Cooper Rd, St. Lucia, 4072, Brisbane, Australia
- SynBioFoundry@TUM, Technical University of Munich, Schulgasse 22, 94315, Straubing, Germany
- Catalytic Research Center, Technical University of Munich, Ernst-Otto-Fischer Straße 1, 85748, Garching, Germany
| |
Collapse
|
13
|
Kostelac A, Taborda A, Martins LO, Haltrich D. Evolution and separation of actinobacterial pyranose and C-glycoside-3-oxidases. Appl Environ Microbiol 2024; 90:e0167623. [PMID: 38179968 PMCID: PMC10807413 DOI: 10.1128/aem.01676-23] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2023] [Accepted: 11/21/2023] [Indexed: 01/06/2024] Open
Abstract
FAD-dependent pyranose oxidase (POx) and C-glycoside-3-oxidase (CGOx) are both members of the glucose-methanol-choline superfamily of oxidoreductases and belong to the same sequence space. Pyranose oxidases had been studied for their oxidation of monosaccharides such as D-glucose, but recently, a bacterial C-glycoside-3-oxidase that is phylogenetically related to POx and that reacts with C-glycosides such as carminic acid, mangiferin or puerarin has been described. Since these actinobacterial CGOx enzymes belong to the same sequence space as bacterial POx, they must have evolved from the same ancestor. Here, we performed a phylogenetic analysis of actinobacterial sequences and resurrected seven ancestral enzymes of the POx/CGOx sequence space to study the evolutionary trajectory of substrate preferences for monosaccharides and C-glycosides. Clade I, with its dimeric member POx from Kitasatospora aureofaciens, shows strict preference for monosaccharides (D-glucose and D-xylose) and does not react with any of the glycosides tested. No extant member of clade II has been studied to date. The two extant members of clades III and IV, monomeric POx/CGOx from Pseudoarthrobacter siccitolerans and Streptomyces canus, oxidized both monosaccharides as well as various C-glycosides (homoorientin, isovitexin, mangiferin, and puerarin). Steady-state kinetic parameters of several clades III and IV ancestral enzymes indicate that the generalist ancestor N35 slowly evolved to present-day enzymes with a much higher preference for C-glycosides than monosaccharides. Based on structural predictions of ancestors, we hypothesize that the strict specificity of bacterial clade I POx (and also fungal POx) is the result of oligomerization, which in turn results from the evolution of protein segments that were shown to be important for oligomerization, the arm, and the head domain.IMPORTANCEC-Glycosides often form active compounds in various plants. Breakage of the C-C bond in these glycosides to release the aglycone is challenging and proceeds via a two-step reaction, the oxidation of the sugar and subsequent cleavage of the C-C bond. Recently, an enzyme from a soil bacterium, FAD-dependent C-glycoside-3-oxidase (CGOx), was shown to catalyze the initial oxidation reaction. Here, we show that CGOx belongs to the same sequence space as pyranose oxidase (POx), and that an actinobacterial ancestor of the POx/CGOx family evolved into four clades, two of which show a high preference for C-glycosides.
Collapse
Affiliation(s)
- Anja Kostelac
- Department of Food Science and Technology, BOKU—University of Natural Resources and Life Sciences, Vienna, Austria
- Doctoral Programme BioToP—Biomolecular Technology of Proteins, BOKU—University of Natural Resources and Life Sciences, Vienna, Austria
| | - André Taborda
- Instituto de Tecnologia Química e Biológica António Xavier, Universidade NOVA de Lisboa, Oeiras, Portugal
| | - Lígia O. Martins
- Instituto de Tecnologia Química e Biológica António Xavier, Universidade NOVA de Lisboa, Oeiras, Portugal
| | - Dietmar Haltrich
- Department of Food Science and Technology, BOKU—University of Natural Resources and Life Sciences, Vienna, Austria
| |
Collapse
|
14
|
Meador K, Castells-Graells R, Aguirre R, Sawaya MR, Arbing MA, Sherman T, Senarathne C, Yeates TO. A Suite of Designed Protein Cages Using Machine Learning Algorithms and Protein Fragment-Based Protocols. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.10.09.561468. [PMID: 37873110 PMCID: PMC10592684 DOI: 10.1101/2023.10.09.561468] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/25/2023]
Abstract
Designed protein cages and related materials provide unique opportunities for applications in biotechnology and medicine, while methods for their creation remain challenging and unpredictable. In the present study, we apply new computational approaches to design a suite of new tetrahedrally symmetric, self-assembling protein cages. For the generation of docked poses, we emphasize a protein fragment-based approach, while for de novo interface design, a comparison of computational protocols highlights the power and increased experimental success achieved using the machine learning program ProteinMPNN. In relating information from docking and design, we observe that agreement between fragment-based sequence preferences and ProteinMPNN sequence inference correlates with experimental success. Additional insights for designing polar interactions are highlighted by experimentally testing larger and more polar interfaces. In all, using X-ray crystallography and cryo-EM, we report five structures for seven protein cages, with atomic resolution in the best case reaching 2.0 Å. We also report structures of two incompletely assembled protein cages, providing unique insights into one type of assembly failure. The new set of designed cages and their structures add substantially to the body of available protein nanoparticles, and to methodologies for their creation.
Collapse
Affiliation(s)
- Kyle Meador
- Department of Chemistry and Biochemistry, University of California, Los Angeles, CA, USA 90095
| | | | - Roman Aguirre
- Department of Chemistry and Biochemistry, University of California, Los Angeles, CA, USA 90095
| | - Michael R. Sawaya
- UCLA-DOE Institute for Genomics and Proteomics, Los Angeles, CA, USA 90095
| | - Mark A. Arbing
- UCLA-DOE Institute for Genomics and Proteomics, Los Angeles, CA, USA 90095
| | - Trent Sherman
- Department of Chemistry and Biochemistry, University of California, Los Angeles, CA, USA 90095
| | - Chethaka Senarathne
- Department of Chemistry and Biochemistry, University of California, Los Angeles, CA, USA 90095
| | - Todd O. Yeates
- Department of Chemistry and Biochemistry, University of California, Los Angeles, CA, USA 90095
- UCLA-DOE Institute for Genomics and Proteomics, Los Angeles, CA, USA 90095
| |
Collapse
|
15
|
Derbyshire MC, Raffaele S. Surface frustration re-patterning underlies the structural landscape and evolvability of fungal orphan candidate effectors. Nat Commun 2023; 14:5244. [PMID: 37640704 PMCID: PMC10462633 DOI: 10.1038/s41467-023-40949-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2023] [Accepted: 08/09/2023] [Indexed: 08/31/2023] Open
Abstract
Pathogens secrete effector proteins to subvert host physiology and cause disease. Effectors are engaged in a molecular arms race with the host resulting in conflicting evolutionary constraints to manipulate host cells without triggering immune responses. The molecular mechanisms allowing effectors to be at the same time robust and evolvable remain largely enigmatic. Here, we show that 62 conserved structure-related families encompass the majority of fungal orphan effector candidates in the Pezizomycotina subphylum. These effectors diversified through changes in patterns of thermodynamic frustration at surface residues. The underlying mutations tended to increase the robustness of the overall effector protein structure while switching potential binding interfaces. This mechanism could explain how conserved effector families maintained biological activity over long evolutionary timespans in different host environments and provides a model for the emergence of sequence-unrelated effector families with conserved structures.
Collapse
Affiliation(s)
- Mark C Derbyshire
- Centre for Crop and Disease Management, School of Molecular and Life Sciences, Curtin University, Perth, Australia
| | - Sylvain Raffaele
- Laboratoire des Interactions Plantes Micro-organismes Environnement (LIPME), INRAE, CNRS, Université de Toulouse, 31326, Castanet-Tolosan, France.
| |
Collapse
|
16
|
Auerbach AA, Becker JT, Moraes SN, Moghadasi SA, Duda JM, Salamango DJ, Harris RS. Ancestral APOBEC3B Nuclear Localization Is Maintained in Humans and Apes and Altered in Most Other Old World Primate Species. mSphere 2022; 7:e0045122. [PMID: 36374108 PMCID: PMC9769932 DOI: 10.1128/msphere.00451-22] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2022] [Accepted: 10/24/2022] [Indexed: 11/16/2022] Open
Abstract
APOBEC3B is an innate immune effector enzyme capable of introducing mutations in viral genomes through DNA cytosine-to-uracil editing. Recent studies have shown that gamma-herpesviruses, such as Epstein-Barr virus (EBV), have evolved a potent APOBEC3B neutralization mechanism to protect lytic viral DNA replication intermediates in the nuclear compartment. APOBEC3B is additionally unique as the only human DNA deaminase family member that is constitutively nuclear. Nuclear localization has therefore been inferred to be essential for innate antiviral function. Here, we combine evolutionary, molecular, and cell biology approaches to address whether nuclear localization is a conserved feature of APOBEC3B in primates. Despite the relatively recent emergence of APOBEC3B approximately 30 to 40 million years ago (MYA) in Old World primates by genetic recombination (after the split from the New World monkey lineage 40 to 50 MYA), we find that the hallmark nuclear localization of APOBEC3B shows variability. For instance, although human and several nonhuman primate APOBEC3B enzymes are predominantly nuclear, rhesus macaque and other Old World primate APOBEC3B proteins are clearly cytoplasmic or cell wide. A series of human/rhesus macaque chimeras and mutants combined to map localization determinants to the N-terminal half of the protein with residues 15, 19, and 24 proving critical. Ancestral APOBEC3B reconstructed from present-day primate species also shows strong nuclear localization. Together, these results indicate that the ancestral nuclear localization of APOBEC3B is maintained in present-day human and ape proteins, but nuclear localization is not conserved in all Old World monkey species despite a need for antiviral functions in the nuclear compartment. IMPORTANCE APOBEC3 enzymes are single-stranded DNA cytosine-to-uracil deaminases with beneficial roles in antiviral immunity and detrimental roles in cancer mutagenesis. Regarding viral infection, all seven human APOBEC3 enzymes have overlapping roles in restricting virus types that require DNA for replication, including EBV, HIV, human papillomavirus (HPV), and human T-cell leukemia virus (HTLV). Regarding cancer, at least two APOBEC3 enzymes, APOBEC3B and APOBEC3A, are prominent sources of mutation capable of influencing clinical outcomes. Here, we combine evolutionary, molecular, and cell biology approaches to characterize primate APOBEC3B enzymes. We show that nuclear localization is an ancestral property of APOBEC3B that is maintained in present-day human and ape enzymes, but not conserved in other nonhuman primates. This partial mechanistic conservation indicates that APOBEC3B is important for limiting the replication of DNA-based viruses in the nuclear compartment. Understanding these pathogen-host interactions may contribute to the development of future antiviral and antitumor therapies.
Collapse
Affiliation(s)
- Ashley A Auerbach
- Department of Biochemistry and Structural Biology, University of Texas Health San Antonio, San Antonio, Texas, USA
- Institute for Molecular Virology, University of Minnesotagrid.17635.36 - Twin Cities, Minneapolis, Minnesota, USA
- Department of Biochemistry, Molecular Biology, and Biophysics, University of Minnesotagrid.17635.36 - Twin Cities, Minneapolis, Minnesota, USA
| | - Jordan T Becker
- Institute for Molecular Virology, University of Minnesotagrid.17635.36 - Twin Cities, Minneapolis, Minnesota, USA
- Department of Biochemistry, Molecular Biology, and Biophysics, University of Minnesotagrid.17635.36 - Twin Cities, Minneapolis, Minnesota, USA
- Department of Microbiology and Immunology, University of Minnesotagrid.17635.36 - Twin Cities, Minneapolis, Minnesota, USA
| | - Sofia N Moraes
- Institute for Molecular Virology, University of Minnesotagrid.17635.36 - Twin Cities, Minneapolis, Minnesota, USA
- Department of Biochemistry, Molecular Biology, and Biophysics, University of Minnesotagrid.17635.36 - Twin Cities, Minneapolis, Minnesota, USA
| | - Seyed Arad Moghadasi
- Institute for Molecular Virology, University of Minnesotagrid.17635.36 - Twin Cities, Minneapolis, Minnesota, USA
- Department of Biochemistry, Molecular Biology, and Biophysics, University of Minnesotagrid.17635.36 - Twin Cities, Minneapolis, Minnesota, USA
| | - Jolene M Duda
- Department of Biochemistry, Molecular Biology, and Biophysics, University of Minnesotagrid.17635.36 - Twin Cities, Minneapolis, Minnesota, USA
| | - Daniel J Salamango
- Institute for Molecular Virology, University of Minnesotagrid.17635.36 - Twin Cities, Minneapolis, Minnesota, USA
- Department of Biochemistry, Molecular Biology, and Biophysics, University of Minnesotagrid.17635.36 - Twin Cities, Minneapolis, Minnesota, USA
- Department of Microbiology and Immunology, Stony Brook University, Stony Brook, New York, USA
| | - Reuben S Harris
- Department of Biochemistry and Structural Biology, University of Texas Health San Antonio, San Antonio, Texas, USA
- Howard Hughes Medical Institute, University of Texas Health San Antonio, San Antonio, Texas, USA
| |
Collapse
|
17
|
Moraes SN, Becker JT, Moghadasi SA, Shaban NM, Auerbach AA, Cheng AZ, Harris RS. Evidence linking APOBEC3B genesis and evolution of innate immune antagonism by gamma-herpesvirus ribonucleotide reductases. eLife 2022; 11:83893. [PMID: 36458685 DOI: 10.7554/elife.83893] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2022] [Accepted: 10/12/2022] [Indexed: 12/04/2022] Open
Abstract
Viruses have evolved diverse mechanisms to antagonize host immunity such as direct inhibition and relocalization of cellular APOBEC3B (A3B) by the ribonucleotide reductase (RNR) of Epstein-Barr virus. Here, we investigate the mechanistic conservation and evolutionary origin of this innate immune counteraction strategy. First, we find that human gamma-herpesvirus RNRs engage A3B via largely distinct surfaces. Second, we show that RNR-mediated enzymatic inhibition and relocalization of A3B depend upon binding to different regions of the catalytic domain. Third, we show that the capability of viral RNRs to antagonize A3B is conserved among gamma-herpesviruses that infect humans and Old World monkeys that encode this enzyme but absent in homologous viruses that infect New World monkeys that naturally lack the A3B gene. Finally, we reconstruct the ancestral primate A3B protein and demonstrate that it is active and similarly engaged by the RNRs from viruses that infect humans and Old World monkeys but not by the RNRs from viruses that infect New World monkeys. These results combine to indicate that the birth of A3B at a critical branchpoint in primate evolution may have been a driving force in selecting for an ancestral gamma-herpesvirus with an expanded RNR functionality through counteraction of this antiviral enzyme.
Collapse
Affiliation(s)
- Sofia N Moraes
- Department of Biochemistry, Molecular Biology, and Biophysics, Institute for Molecular Virology, Masonic Cancer Center, University of Minnesota, Minneapolis, United States
| | - Jordan T Becker
- Department of Biochemistry, Molecular Biology, and Biophysics, Institute for Molecular Virology, Masonic Cancer Center, University of Minnesota, Minneapolis, United States
| | - Seyed Arad Moghadasi
- Department of Biochemistry, Molecular Biology, and Biophysics, Institute for Molecular Virology, Masonic Cancer Center, University of Minnesota, Minneapolis, United States
| | - Nadine M Shaban
- Department of Biochemistry, Molecular Biology, and Biophysics, Institute for Molecular Virology, Masonic Cancer Center, University of Minnesota, Minneapolis, United States
| | - Ashley A Auerbach
- Department of Biochemistry, Molecular Biology, and Biophysics, Institute for Molecular Virology, Masonic Cancer Center, University of Minnesota, Minneapolis, United States.,Department of Biochemistry and Structural Biology, University of Texas Health San Antonio, San Antonio, United States
| | - Adam Z Cheng
- Department of Biochemistry, Molecular Biology, and Biophysics, Institute for Molecular Virology, Masonic Cancer Center, University of Minnesota, Minneapolis, United States
| | - Reuben S Harris
- Department of Biochemistry, Molecular Biology, and Biophysics, Institute for Molecular Virology, Masonic Cancer Center, University of Minnesota, Minneapolis, United States.,Department of Biochemistry and Structural Biology, University of Texas Health San Antonio, San Antonio, United States.,Howard Hughes Medical Institute, University of Texas Health San Antonio, San Antonio, United States
| |
Collapse
|