1
|
Berríos KN, Barka A, Gill J, Serrano JC, Bailer PF, Parker JB, Evitt NH, Gajula KS, Shi J, Kohli RM. Cooperativity between Cas9 and hyperactive AID establishes broad and diversifying mutational footprints in base editors. Nucleic Acids Res 2024; 52:2078-2090. [PMID: 38261989 PMCID: PMC10899762 DOI: 10.1093/nar/gkae024] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2023] [Revised: 12/20/2023] [Accepted: 01/04/2024] [Indexed: 01/25/2024] Open
Abstract
The partnership of DNA deaminase enzymes with CRISPR-Cas nucleases is now a well-established method to enable targeted genomic base editing. However, an understanding of how Cas9 and DNA deaminases collaborate to shape base editor (BE) outcomes has been lacking. Here, we support a novel mechanistic model of base editing by deriving a range of hyperactive activation-induced deaminase (AID) base editors (hBEs) and exploiting their characteristic diversifying activity. Our model involves multiple layers of previously underappreciated cooperativity in BE steps including: (i) Cas9 binding can potentially expose both DNA strands for 'capture' by the deaminase, a feature that is enhanced by guide RNA mismatches; (ii) after strand capture, the intrinsic activity of the DNA deaminase can tune window size and base editing efficiency; (iii) Cas9 defines the boundaries of editing on each strand, with deamination blocked by Cas9 binding to either the PAM or the protospacer and (iv) non-canonical edits on the guide RNA bound strand can be further elicited by changing which strand is nicked by Cas9. Leveraging insights from our mechanistic model, we create novel hBEs that can remarkably generate simultaneous C > T and G > A transitions over >65 bp with significant potential for targeted gene diversification.
Collapse
Affiliation(s)
- Kiara N Berríos
- Graduate Group in Biochemistry and Molecular Biophysics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Aleksia Barka
- Graduate Group in Biochemistry and Molecular Biophysics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Jasleen Gill
- Department of Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Juan C Serrano
- Graduate Group in Biochemistry and Molecular Biophysics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Peter F Bailer
- Graduate Group in Biochemistry and Molecular Biophysics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Jared B Parker
- Department of Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Niklaus H Evitt
- Graduate Group in Cell and Molecular Biology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Kiran S Gajula
- Department of Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Junwei Shi
- Department of Cancer Biology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Rahul M Kohli
- Department of Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
| |
Collapse
|
2
|
Notin P, Kollasch AW, Ritter D, van Niekerk L, Paul S, Spinner H, Rollins N, Shaw A, Weitzman R, Frazer J, Dias M, Franceschi D, Orenbuch R, Gal Y, Marks DS. ProteinGym: Large-Scale Benchmarks for Protein Design and Fitness Prediction. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.12.07.570727. [PMID: 38106144 PMCID: PMC10723403 DOI: 10.1101/2023.12.07.570727] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/19/2023]
Abstract
Predicting the effects of mutations in proteins is critical to many applications, from understanding genetic disease to designing novel proteins that can address our most pressing challenges in climate, agriculture and healthcare. Despite a surge in machine learning-based protein models to tackle these questions, an assessment of their respective benefits is challenging due to the use of distinct, often contrived, experimental datasets, and the variable performance of models across different protein families. Addressing these challenges requires scale. To that end we introduce ProteinGym, a large-scale and holistic set of benchmarks specifically designed for protein fitness prediction and design. It encompasses both a broad collection of over 250 standardized deep mutational scanning assays, spanning millions of mutated sequences, as well as curated clinical datasets providing high-quality expert annotations about mutation effects. We devise a robust evaluation framework that combines metrics for both fitness prediction and design, factors in known limitations of the underlying experimental methods, and covers both zero-shot and supervised settings. We report the performance of a diverse set of over 70 high-performing models from various subfields (eg., alignment-based, inverse folding) into a unified benchmark suite. We open source the corresponding codebase, datasets, MSAs, structures, model predictions and develop a user-friendly website that facilitates data access and analysis.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | - Ada Shaw
- Applied Mathematics, Harvard University
| | | | | | - Mafalda Dias
- Centre for Genomic Regulation, Universitat Pompeu Fabra
| | | | | | - Yarin Gal
- Computer Science, University of Oxford
| | | |
Collapse
|
3
|
Fu Y, Bedő J, Papenfuss AT, Rubin AF. Integrating deep mutational scanning and low-throughput mutagenesis data to predict the impact of amino acid variants. Gigascience 2022; 12:giad073. [PMID: 37721410 PMCID: PMC10506130 DOI: 10.1093/gigascience/giad073] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2023] [Revised: 07/02/2023] [Accepted: 08/23/2023] [Indexed: 09/19/2023] Open
Abstract
BACKGROUND Evaluating the impact of amino acid variants has been a critical challenge for studying protein function and interpreting genomic data. High-throughput experimental methods like deep mutational scanning (DMS) can measure the effect of large numbers of variants in a target protein, but because DMS studies have not been performed on all proteins, researchers also model DMS data computationally to estimate variant impacts by predictors. RESULTS In this study, we extended a linear regression-based predictor to explore whether incorporating data from alanine scanning (AS), a widely used low-throughput mutagenesis method, would improve prediction results. To evaluate our model, we collected 146 AS datasets, mapping to 54 DMS datasets across 22 distinct proteins. CONCLUSIONS We show that improved model performance depends on the compatibility of the DMS and AS assays, and the scale of improvement is closely related to the correlation between DMS and AS results.
Collapse
Affiliation(s)
- Yunfan Fu
- The Walter and Eliza Hall Institute of Medical Research, Bioinformatics Division, 1G Royal Pde, Parkville, Victoria 3052, Australia
- The University of Melbourne, Department of Medical Biology, Parkville, Victoria 3010, Australia
| | - Justin Bedő
- The Walter and Eliza Hall Institute of Medical Research, Bioinformatics Division, 1G Royal Pde, Parkville, Victoria 3052, Australia
- The University of Melbourne, Department of Medical Biology, Parkville, Victoria 3010, Australia
| | - Anthony T Papenfuss
- The Walter and Eliza Hall Institute of Medical Research, Bioinformatics Division, 1G Royal Pde, Parkville, Victoria 3052, Australia
- The University of Melbourne, Department of Medical Biology, Parkville, Victoria 3010, Australia
- Peter MacCallum Cancer Centre, Melbourne, Victoria 3000, Australia
| | - Alan F Rubin
- The Walter and Eliza Hall Institute of Medical Research, Bioinformatics Division, 1G Royal Pde, Parkville, Victoria 3052, Australia
- The University of Melbourne, Department of Medical Biology, Parkville, Victoria 3010, Australia
| |
Collapse
|
4
|
Controllable genome editing with split-engineered base editors. Nat Chem Biol 2021; 17:1262-1270. [PMID: 34663942 PMCID: PMC8981362 DOI: 10.1038/s41589-021-00880-w] [Citation(s) in RCA: 26] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2021] [Accepted: 08/12/2021] [Indexed: 12/26/2022]
Abstract
DNA deaminase enzymes play key roles in immunity and have recently been harnessed for their biotechnological applications. In base editors (BEs), the combination of DNA deaminase mutator activity with CRISPR-Cas localization confers the powerful ability to directly convert one target DNA base into another. While efforts have been made to improve targeting efficiency and precision, all BEs so far use a constitutively active DNA deaminase. The absence of regulatory control over promiscuous deaminase activity remains a major limitation to accessing the widespread potential of BEs. Here, we reveal sites that permit splitting of DNA cytosine deaminases into two inactive fragments, whose reapproximation reconstitutes activity. These findings allow for the development of split-engineered BEs (seBEs), which newly enable small-molecule control over targeted mutator activity. We show that the seBE strategy facilitates robust regulated editing with BE scaffolds containing diverse deaminases, offering a generalizable solution for temporally controlling precision genome editing.
Collapse
|
5
|
Esposito D, Weile J, Shendure J, Starita LM, Papenfuss AT, Roth FP, Fowler DM, Rubin AF. MaveDB: an open-source platform to distribute and interpret data from multiplexed assays of variant effect. Genome Biol 2019; 20:223. [PMID: 31679514 PMCID: PMC6827219 DOI: 10.1186/s13059-019-1845-6] [Citation(s) in RCA: 103] [Impact Index Per Article: 20.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2019] [Accepted: 10/01/2019] [Indexed: 11/10/2022] Open
Abstract
Multiplex assays of variant effect (MAVEs), such as deep mutational scans and massively parallel reporter assays, test thousands of sequence variants in a single experiment. Despite the importance of MAVE data for basic and clinical research, there is no standard resource for their discovery and distribution. Here, we present MaveDB ( https://www.mavedb.org ), a public repository for large-scale measurements of sequence variant impact, designed for interoperability with applications to interpret these datasets. We also describe the first such application, MaveVis, which retrieves, visualizes, and contextualizes variant effect maps. Together, the database and applications will empower the community to mine these powerful datasets.
Collapse
Affiliation(s)
- Daniel Esposito
- Bioinformatics Division, The Walter and Eliza Hall Institute of Medical Research, Parkville, VIC, Australia
| | - Jochen Weile
- The Donnelly Centre, University of Toronto, Toronto, ON, Canada
- Lunenfeld-Tanenbaum Research Institute, Sinai Health System, Toronto, ON, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada
- Department of Computer Science, University of Toronto, Toronto, ON, Canada
| | - Jay Shendure
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
- Brotman Baty Institute for Precision Medicine, Seattle, WA, USA
- Howard Hughes Medical Institute, University of Washington, Seattle, WA, USA
| | - Lea M Starita
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
- Brotman Baty Institute for Precision Medicine, Seattle, WA, USA
| | - Anthony T Papenfuss
- Bioinformatics Division, The Walter and Eliza Hall Institute of Medical Research, Parkville, VIC, Australia
- Department of Medical Biology, University of Melbourne, Melbourne, VIC, Australia
- Bioinformatics and Cancer Genomics Laboratory, Peter MacCallum Cancer Centre, Melbourne, VIC, Australia
- Sir Peter MacCallum Department of Oncology, University of Melbourne, Melbourne, VIC, Australia
- Department of Mathematics and Statistics, University of Melbourne, Melbourne, VIC, Australia
| | - Frederick P Roth
- The Donnelly Centre, University of Toronto, Toronto, ON, Canada.
- Lunenfeld-Tanenbaum Research Institute, Sinai Health System, Toronto, ON, Canada.
- Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada.
- Department of Computer Science, University of Toronto, Toronto, ON, Canada.
- Canadian Institute for Advanced Research, Toronto, ON, Canada.
| | - Douglas M Fowler
- Department of Genome Sciences, University of Washington, Seattle, WA, USA.
- Canadian Institute for Advanced Research, Toronto, ON, Canada.
- Department of Bioengineering, University of Washington, Seattle, WA, USA.
| | - Alan F Rubin
- Bioinformatics Division, The Walter and Eliza Hall Institute of Medical Research, Parkville, VIC, Australia.
- Department of Medical Biology, University of Melbourne, Melbourne, VIC, Australia.
- Bioinformatics and Cancer Genomics Laboratory, Peter MacCallum Cancer Centre, Melbourne, VIC, Australia.
| |
Collapse
|
6
|
Pereira GRC, Tellini GHAS, De Mesquita JF. In silico analysis of PFN1 related to amyotrophic lateral sclerosis. PLoS One 2019; 14:e0215723. [PMID: 31216283 PMCID: PMC6583998 DOI: 10.1371/journal.pone.0215723] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2018] [Accepted: 04/09/2019] [Indexed: 12/11/2022] Open
Abstract
Profilin 1 (PFN1) protein plays key roles in neuronal growth and differentiation, membrane trafficking, and regulation of the actin cytoskeleton. Four natural variants of PFN1 were described as related to ALS, the most common adult-onset motor neuron disorder. However, the pathological mechanism of PFN1 in ALS is not yet completely understood. The goal of this work is to thoroughly analyze the effects of the ALS-related mutations on PFN1 structure and function using computational simulations. Here, PhD-SNP, PMUT, PolyPhen-2, SIFT, SNAP, SNPS&GO, SAAP, nsSNPAnalyzer, SNPeffect4.0 and I-Mutant2.0 were used to predict the functional and stability effects of PFN1 mutations. ConSurf was used for the evolutionary conservation analysis, and GROMACS was used to perform the MD simulations. The mutations C71G, M114T, and G118V, but not E117G, were predicted as deleterious by most of the functional prediction algorithms that were used. The stability prediction indicated that the ALS-related mutations could destabilize PFN1. The ConSurf analysis indicated that the mutation C71G, M114T, E117G, and G118V occur in highly conserved positions. The MD results indicated that the studied mutations could affect the PFN1 flexibility at the actin and PLP-binding domains, and consequently, their intermolecular interactions. It may be therefore related to the functional impairment of PFN1 upon C71G, M114T, E117G and G118V mutations, and their involvement in ALS development. We also developed a database, SNPMOL (http://www.snpmol.org/), containing the results presented on this paper for biologists and clinicians to exploit PFN1 and its natural variants.
Collapse
Affiliation(s)
- Gabriel Rodrigues Coutinho Pereira
- Department of Genetics and Molecular Biology, Bioinformatics and Computational Biology Laboratory, Federal University of the State of Rio de Janeiro (UNIRIO), Rio de Janeiro, Rio de Janeiro, Brazil
| | - Giovanni Henrique Almeida Silva Tellini
- Department of Genetics and Molecular Biology, Bioinformatics and Computational Biology Laboratory, Federal University of the State of Rio de Janeiro (UNIRIO), Rio de Janeiro, Rio de Janeiro, Brazil
| | - Joelma Freire De Mesquita
- Department of Genetics and Molecular Biology, Bioinformatics and Computational Biology Laboratory, Federal University of the State of Rio de Janeiro (UNIRIO), Rio de Janeiro, Rio de Janeiro, Brazil
- * E-mail:
| |
Collapse
|
7
|
De Oliveira CCS, Pereira GRC, De Alcantara JYS, Antunes D, Caffarena ER, De Mesquita JF. In silico analysis of the V66M variant of human BDNF in psychiatric disorders: An approach to precision medicine. PLoS One 2019; 14:e0215508. [PMID: 30998730 PMCID: PMC6472887 DOI: 10.1371/journal.pone.0215508] [Citation(s) in RCA: 21] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2018] [Accepted: 04/04/2019] [Indexed: 11/19/2022] Open
Abstract
Brain-derived neurotrophic factor (BDNF) plays an important role in neurogenesis and synapse formation. The V66M is the most prevalent BDNF mutation in humans and impairs the function and distribution of BDNF. This mutation is related to several psychiatric disorders. The pro-region of BDNF, particularly position 66 and its adjacent residues, are determinant for the intracellular sorting and activity-dependent secretion of BDNF. However, it has not yet been fully elucidated. The present study aims to analyze the effects of the V66M mutation on BDNF structure and function. Here, we applied nine algorithms, including SIFT and PolyPhen-2, for functional and stability prediction of the V66M mutation. The complete theoretical model of BNDF was generated by Rosetta and validated by PROCHECK, RAMPAGE, ProSa, QMEAN and Verify-3D algorithms. Structural alignment was performed using TM-align. Phylogenetic analysis was performed using the ConSurf server. Molecular dynamics (MD) simulations were performed and analyzed using the GROMACS 2018.2 package. The V66M mutation was predicted as deleterious by PolyPhen-2 and SIFT in addition to being predicted as destabilizing by I-Mutant. According to SNPeffect, the V66M mutation does not affect protein aggregation, amyloid propensity, and chaperone binding. The complete theoretical structure of BDNF proved to be a reliable model. Phylogenetic analysis indicated that the V66M mutation of BDNF occurs at a non-conserved position of the protein. MD analyses indicated that the V66M mutation does not affect the BDNF flexibility and surface-to-volume ratio, but affects the BDNF essential motions, hydrogen-bonding and secondary structure particularly at its pre and pro-domain, which are crucial for its activity and distribution. Thus, considering that these parameters are determinant for protein interactions and, consequently, protein function; the alterations observed throughout the MD analyses may be related to the functional impairment of BDNF upon V66M mutation, as well as its involvement in psychiatric disorders.
Collapse
Affiliation(s)
- Clara Carolina Silva De Oliveira
- Department of Genetics and Molecular Biology, Bioinformatics and Computational Biology Laboratory, Federal University of the State of Rio de Janeiro (UNIRIO), Rio de Janeiro, Rio de Janeiro, Brazil
| | - Gabriel Rodrigues Coutinho Pereira
- Department of Genetics and Molecular Biology, Bioinformatics and Computational Biology Laboratory, Federal University of the State of Rio de Janeiro (UNIRIO), Rio de Janeiro, Rio de Janeiro, Brazil
| | - Jamile Yvis Santos De Alcantara
- Department of Genetics and Molecular Biology, Bioinformatics and Computational Biology Laboratory, Federal University of the State of Rio de Janeiro (UNIRIO), Rio de Janeiro, Rio de Janeiro, Brazil
| | - Deborah Antunes
- Computational Biophysics and Molecular Modeling Group, Scientific Computing Program (PROCC), Fundação Oswaldo Cruz, Manguinhos, Rio de Janeiro, Brazil
| | - Ernesto Raul Caffarena
- Computational Biophysics and Molecular Modeling Group, Scientific Computing Program (PROCC), Fundação Oswaldo Cruz, Manguinhos, Rio de Janeiro, Brazil
| | - Joelma Freire De Mesquita
- Department of Genetics and Molecular Biology, Bioinformatics and Computational Biology Laboratory, Federal University of the State of Rio de Janeiro (UNIRIO), Rio de Janeiro, Rio de Janeiro, Brazil
- * E-mail:
| |
Collapse
|
8
|
DeNizio JE, Schutsky EK, Berrios KN, Liu MY, Kohli RM. Harnessing natural DNA modifying activities for editing of the genome and epigenome. Curr Opin Chem Biol 2018; 45:10-17. [PMID: 29452938 DOI: 10.1016/j.cbpa.2018.01.016] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2017] [Revised: 01/12/2018] [Accepted: 01/28/2018] [Indexed: 12/27/2022]
Abstract
The introduction of site-specific DNA modifications to the genome or epigenome presents great opportunities for manipulating biological systems. Such changes are now possible through the combination of DNA-modifying enzymes with targeting modules, including dCas9, that can localize the enzymes to specific sites. In this review, we take a DNA modifying enzyme-centric view of recent advances. We highlight the variety of natural DNA-modifying enzymes-including DNA methyltransferases, oxygenases, deaminases, and glycosylases-that can be used for targeted editing and discuss how insights into the structure and function of these enzymes has further expanded editing potential by introducing enzyme variants with altered activities or by improving spatiotemporal control of modifications.
Collapse
Affiliation(s)
- Jamie E DeNizio
- Graduate Group in Biochemistry and Molecular Biophysics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Emily K Schutsky
- Graduate Group in Biochemistry and Molecular Biophysics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Kiara N Berrios
- Graduate Group in Biochemistry and Molecular Biophysics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Monica Yun Liu
- Graduate Group in Biochemistry and Molecular Biophysics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Rahul M Kohli
- Department of Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA; Department of Biochemistry and Biophysics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA.
| |
Collapse
|
9
|
Budzko L, Jackowiak P, Kamel K, Sarzynska J, Bujnicki JM, Figlerowicz M. Mutations in human AID differentially affect its ability to deaminate cytidine and 5-methylcytidine in ssDNA substrates in vitro. Sci Rep 2017. [PMID: 28634398 PMCID: PMC5478644 DOI: 10.1038/s41598-017-03936-x] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023] Open
Abstract
Activation-induced cytidine deaminase (AID) is known for its established role in antibody production. AID induces the diversification of antibodies by deaminating deoxycytidine (C) within immunoglobulin genes. The capacity of AID to deaminate 5-methyldeoxycytidine (5 mC) and/or 5-hydroxymethyldeoxycytidine (5 hmC), and consequently AID involvement in active DNA demethylation, is not fully resolved. For instance, structural determinants of AID activity on different substrates remain to be identified. To better understand the latter issue, we tested how mutations in human AID (hAID) influence its ability to deaminate C, 5 mC, and 5 hmC in vitro. We showed that each of the selected mutations differentially affects hAID’s ability to deaminate C and 5 mC. At the same time, we did not observe hAID activity on 5 hmC. Surprisingly, we found that the N51A hAID mutant, with no detectable activity on C, efficiently deaminated 5 mC, which may suggest different requirements for C and 5 mC deamination. Homology modeling and molecular dynamics simulations revealed that the pattern of enzyme-substrate recognition is one of the important factors determining enzyme activity on C and 5 mC. Consequently, we have proposed mechanisms that explain why wild type hAID more efficiently deaminates C than 5 mC in vitro and why 5 hmC is not deaminated.
Collapse
Affiliation(s)
- Lucyna Budzko
- Institute of Bioorganic Chemistry, Polish Academy of Sciences, Noskowskiego 12/14, 61-704, Poznan, Poland
| | - Paulina Jackowiak
- Institute of Bioorganic Chemistry, Polish Academy of Sciences, Noskowskiego 12/14, 61-704, Poznan, Poland
| | - Karol Kamel
- Institute of Bioorganic Chemistry, Polish Academy of Sciences, Noskowskiego 12/14, 61-704, Poznan, Poland
| | - Joanna Sarzynska
- Institute of Bioorganic Chemistry, Polish Academy of Sciences, Noskowskiego 12/14, 61-704, Poznan, Poland
| | - Janusz M Bujnicki
- Laboratory of Bioinformatics and Protein Engineering, International Institute of Molecular and Cell Biology in Warsaw, Trojdena 4, 02-109, Warsaw, Poland.,Laboratory of Bioinformatics, Institute of Molecular Biology and Biotechnology, Faculty of Biology, Adam Mickiewicz University, Umultowska 89, 61-614, Poznan, Poland
| | - Marek Figlerowicz
- Institute of Bioorganic Chemistry, Polish Academy of Sciences, Noskowskiego 12/14, 61-704, Poznan, Poland. .,Institute of Computing Science, Poznan University of Technology, Piotrowo 3A, 60-965, Poznan, Poland.
| |
Collapse
|
10
|
Abstract
The AID/APOBEC family enzymes convert cytosines in single-stranded DNA to uracils, causing base substitutions and strand breaks. They are induced by cytokines produced during the body's inflammatory response to infections, and they help combat infections through diverse mechanisms. AID is essential for the maturation of antibodies and causes mutations and deletions in antibody genes through somatic hypermutation (SHM) and class-switch recombination (CSR) processes. One member of the APOBEC family, APOBEC1, edits mRNA for a protein involved in lipid transport. Members of the APOBEC3 subfamily in humans (APOBEC3A, APOBEC3B, APOBEC3C, APOBEC3D, APOBEC3F, APOBEC3G, and APOBEC3H) inhibit infections of viruses such as HIV-1, HBV, and HCV, and retrotransposition of endogenous retroelements through mutagenic and nonmutagenic mechanisms. There is emerging consensus that these enzymes can cause mutations in the cellular genome at replication forks or within transcription bubbles depending on the physiological state of the cell and the phase of the cell cycle during which they are expressed. We describe here the state of knowledge about the structures of these enzymes, regulation of their expression, and both the advantageous and deleterious consequences of their expression, including carcinogenesis. We highlight similarities among them and present a holistic view of their regulation and function.
Collapse
Affiliation(s)
- Sachini U Siriwardena
- Department of Chemistry, Wayne State University , Detroit, Michigan 48202, United States
| | - Kang Chen
- Department of Obstetrics and Gynecology, Wayne State University , Detroit, Michigan 48201, United States
- Mucosal Immunology Studies Team, National Institute of Allergy and Infectious Diseases, National Institutes of Health , Bethesda, Maryland 20892, United States
- Department of Immunology and Microbiology, Wayne State University School of Medicine , Detroit, Michigan 48201, United States
| | - Ashok S Bhagwat
- Department of Chemistry, Wayne State University , Detroit, Michigan 48202, United States
- Department of Immunology and Microbiology, Wayne State University School of Medicine , Detroit, Michigan 48201, United States
| |
Collapse
|
11
|
Wang Y, Beal PA. Probing RNA recognition by human ADAR2 using a high-throughput mutagenesis method. Nucleic Acids Res 2016; 44:9872-9880. [PMID: 27614075 PMCID: PMC5175354 DOI: 10.1093/nar/gkw799] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2016] [Revised: 08/30/2016] [Accepted: 08/31/2016] [Indexed: 01/05/2023] Open
Abstract
Adenosine deamination is one of the most prevalent post-transcriptional modifications in mRNA. In humans, ADAR1 and ADAR2 catalyze this modification and their malfunction correlates with disease. Recently our laboratory reported crystal structures of the human ADAR2 deaminase domain bound to duplex RNA revealing a protein loop that binds the RNA on the 5′ side of the modification site. This 5′ binding loop appears to be one contributor to substrate specificity differences between ADAR family members. In this study, we endeavored to reveal detailed structure–activity relationships in this loop to advance our understanding of RNA recognition by ADAR2. To achieve this goal, we established a high-throughput mutagenesis approach which allows rapid screening of ADAR variants in single yeast cells and provides quantitative evaluation for enzymatic activity. Using this approach, we determined the importance of specific amino acids at 19 different positions in the ADAR2 5′ binding loop and revealed six residues that provide essential structural elements supporting the fold of the loop and key RNA-binding functional groups. This work provided new insight into RNA recognition by ADAR2 and established a new tool for defining structure–function relationships in ADAR reactions.
Collapse
Affiliation(s)
- Yuru Wang
- Department of Chemistry, University of California, One Shields Ave, Davis, CA 95616, USA
| | - Peter A Beal
- Department of Chemistry, University of California, One Shields Ave, Davis, CA 95616, USA
| |
Collapse
|
12
|
The power of multiplexed functional analysis of genetic variants. Nat Protoc 2016; 11:1782-7. [PMID: 27583640 DOI: 10.1038/nprot.2016.135] [Citation(s) in RCA: 80] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2016] [Accepted: 07/13/2016] [Indexed: 12/30/2022]
Abstract
New technologies have recently enabled saturation mutagenesis and functional analysis of nearly all possible variants of regulatory elements or proteins of interest in single experiments. Here we discuss the past, present, and future of such multiplexed (functional) assays for variant effects (MAVEs). MAVEs provide detailed insight into sequence-function relationships, and they may prove critical for the prospective clinical interpretation of genetic variants.
Collapse
|
13
|
Zanotti KJ, Gearhart PJ. Antibody diversification caused by disrupted mismatch repair and promiscuous DNA polymerases. DNA Repair (Amst) 2016; 38:110-116. [PMID: 26719140 PMCID: PMC4740194 DOI: 10.1016/j.dnarep.2015.11.011] [Citation(s) in RCA: 35] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2015] [Accepted: 11/30/2015] [Indexed: 10/25/2022]
Abstract
The enzyme activation-induced deaminase (AID) targets the immunoglobulin loci in activated B cells and creates DNA mutations in the antigen-binding variable region and DNA breaks in the switch region through processes known, respectively, as somatic hypermutation and class switch recombination. AID deaminates cytosine to uracil in DNA to create a U:G mismatch. During somatic hypermutation, the MutSα complex binds to the mismatch, and the error-prone DNA polymerase η generates mutations at A and T bases. During class switch recombination, both MutSα and MutLα complexes bind to the mismatch, resulting in double-strand break formation and end-joining. This review is centered on the mechanisms of how the MMR pathway is commandeered by B cells to generate antibody diversity.
Collapse
Affiliation(s)
- Kimberly J Zanotti
- Laboratory of Molecular Biology and Immunology, National Institute on Aging, National Institutes of Health, Baltimore, MD 21224, USA
| | - Patricia J Gearhart
- Laboratory of Molecular Biology and Immunology, National Institute on Aging, National Institutes of Health, Baltimore, MD 21224, USA.
| |
Collapse
|
14
|
DNA cytosine and methylcytosine deamination by APOBEC3B: enhancing methylcytosine deamination by engineering APOBEC3B. Biochem J 2015. [PMID: 26195824 PMCID: PMC4613526 DOI: 10.1042/bj20150382] [Citation(s) in RCA: 43] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023]
Abstract
APOBEC (apolipoprotein B mRNA-editing enzyme catalytic polypeptide-like) is a family of enzymes that deaminates cytosine (C) to uracil (U) on nucleic acid. APOBEC3B (A3B) functions in innate immunity against intrinsic and invading retroelements and viruses. A3B can also induce genomic DNA mutations to cause cancer. A3B contains two cytosine deaminase domains (CD1, CD2), and there are conflicting reports about whether both domains are active. Here we demonstrate that only CD2 of A3B (A3BCD2) has C deamination activity. We also reveal that both A3B and A3BCD2 can deaminate methylcytosine (mC). Guided by structural and functional analysis, we successfully engineered A3BCD2 to gain over two orders of magnitude higher activity for mC deamination. Important determinants that contribute to the activity and selectivity for mC deamination have been identified, which reveals that multiple elements, rather than single ones, contribute to the mC deamination activity and selectivity in A3BCD2 and possibly other APOBECs.
Collapse
|