1
|
Katayama S, Watanabe M, Kato Y, Nomura W, Yamamoto T. Engineering of Zinc Finger Nucleases Through Structural Modeling Improves Genome Editing Efficiency in Cells. ADVANCED SCIENCE (WEINHEIM, BADEN-WURTTEMBERG, GERMANY) 2024; 11:e2310255. [PMID: 38600709 PMCID: PMC11187957 DOI: 10.1002/advs.202310255] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/27/2023] [Revised: 02/28/2024] [Indexed: 04/12/2024]
Abstract
Genome Editing is widely used in biomedical research and medicine. Zinc finger nucleases (ZFNs) are smaller in size than transcription activator-like effector (TALE) nucleases (TALENs) and CRISPR-Cas9. Therefore, ZFN-encoding DNAs can be easily packaged into a viral vector with limited cargo space, such as adeno-associated virus (AAV) vectors, for in vivo and clinical applications. ZFNs have great potential for translational research and clinical use. However, constructing functional ZFNs and improving their genome editing efficiency is extremely difficult. Here, the efficient construction of functional ZFNs and the improvement of their genome editing efficiency using AlphaFold, Coot, and Rosetta are described. Plasmids encoding ZFNs consisting of six fingers using publicly available zinc-finger resources are assembled. Two functional ZFNs from the ten ZFNs tested are successfully obtained. Furthermore, the engineering of ZFNs using AlphaFold, Coot, or Rosetta increases the efficiency of genome editing by 5%, demonstrating the effectiveness of engineering ZFNs based on structural modeling.
Collapse
Affiliation(s)
- Shota Katayama
- Genome Editing Innovation CenterHiroshima UniversityHigashi‐Hiroshima739‐0046Japan
| | - Masahiro Watanabe
- Research Institute for Sustainable ChemistryNational Institute of Advanced Industrial Science and Technology (AIST)Higashi‐Hiroshima739‐0046Japan
| | - Yoshio Kato
- Biomedical Research InstituteNational Institute of Advanced Industrial Science and Technology (AIST)Ibaraki305‐8566Japan
| | - Wataru Nomura
- Graduate School of Biomedical and Health SciencesHiroshima UniversityHiroshima734‐8553Japan
| | - Takashi Yamamoto
- Genome Editing Innovation CenterHiroshima UniversityHigashi‐Hiroshima739‐0046Japan
- Division of Integrated Sciences for LifeGraduate School of Integrated Sciences for LifeHiroshima UniversityHigashi‐Hiroshima739‐8526Japan
| |
Collapse
|
2
|
Park JU, Petassi MT, Hsieh SC, Mehrotra E, Schuler G, Budhathoki J, Truong VH, Thyme SB, Ke A, Kellogg EH, Peters JE. Multiple adaptations underly co-option of a CRISPR surveillance complex for RNA-guided DNA transposition. Mol Cell 2023; 83:1827-1838.e6. [PMID: 37267904 PMCID: PMC10693918 DOI: 10.1016/j.molcel.2023.05.005] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2022] [Revised: 02/23/2023] [Accepted: 05/03/2023] [Indexed: 06/04/2023]
Abstract
CRISPR-associated transposons (CASTs) are natural RNA-directed transposition systems. We demonstrate that transposon protein TniQ plays a central role in promoting R-loop formation by RNA-guided DNA-targeting modules. TniQ residues, proximal to CRISPR RNA (crRNA), are required for recognizing different crRNA categories, revealing an unappreciated role of TniQ to direct transposition into different classes of crRNA targets. To investigate adaptations allowing CAST elements to utilize attachment sites inaccessible to CRISPR-Cas surveillance complexes, we compared and contrasted PAM sequence requirements in both I-F3b CAST and I-F1 CRISPR-Cas systems. We identify specific amino acids that enable a wider range of PAM sequences to be accommodated in I-F3b CAST elements compared with I-F1 CRISPR-Cas, enabling CAST elements to access attachment sites as sequences drift and evade host surveillance. Together, this evidence points to the central role of TniQ in facilitating the acquisition of CRISPR effector complexes for RNA-guided DNA transposition.
Collapse
Affiliation(s)
- Jung-Un Park
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY 14853, USA
| | - Michael T Petassi
- Department of Microbiology, Cornell University, Ithaca, NY 14853, USA
| | - Shan-Chi Hsieh
- Department of Microbiology, Cornell University, Ithaca, NY 14853, USA
| | - Eshan Mehrotra
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY 14853, USA
| | - Gabriel Schuler
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY 14853, USA
| | - Jagat Budhathoki
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY 14853, USA
| | - Vinh H Truong
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY 14853, USA
| | - Summer B Thyme
- Department of Neurobiology, The University of Alabama at Birmingham, Birmingham, AL 35294, USA
| | - Ailong Ke
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY 14853, USA
| | - Elizabeth H Kellogg
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY 14853, USA.
| | - Joseph E Peters
- Department of Microbiology, Cornell University, Ithaca, NY 14853, USA.
| |
Collapse
|
3
|
Tang H, Yuan H, Du W, Li G, Xue D, Huang Q. Active-Site Models of Streptococcus pyogenes Cas9 in DNA Cleavage State. Front Mol Biosci 2021; 8:653262. [PMID: 33987202 PMCID: PMC8112549 DOI: 10.3389/fmolb.2021.653262] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2021] [Accepted: 03/22/2021] [Indexed: 11/13/2022] Open
Abstract
CRISPR-Cas9 is a powerful tool for target genome editing in living cells. Significant advances have been made to understand how this system cleaves target DNA. HNH is a nuclease domain, which shares structural similarity with the HNH endonuclease characterzied by a beta-beta-alpha-metal fold. Therefore, based on one- and two-metal-ion mechanisms, homology modeling and molecular dynamics (MD) simulation are suitable tools for building an atomic model of Cas9 in the DNA cleavage state. Here, by modeling and MD, we presented an atomic model of SpCas9-sgRNA-DNA complex with the cleavage state. This model shows that the HNH and RuvC conformations resemble their DNA cleavage state where the active-sites in the complex coordinate with DNA, Mg2+ ions, and water. Among them, residues D10, E762, H983, and D986 locate at the first shell of the RuvC active-site and interact with the ions directly, residues H982 or/and H985 are general (Lewis) bases, and the coordinated water is located at the positions for nucleophilic attack of the scissile phosphate. Meanwhile, this catalytic model led us to engineer a new SpCas9 variant (SpCas9-H982A + H983D) with reduced off-target effects. Thus, our study provided new mechanistic insights into the CRISPR-Cas9 system in the DNA cleavage state and offered useful guidance for engineering new CRISPR-Cas9 editing systems with improved specificity.
Collapse
Affiliation(s)
- Honghai Tang
- State Key Laboratory of Genetic Engineering, Shanghai Engineering Research Center of Industrial Microorganisms, Ministry of Education Engineering Research Centre of Gene Technology, School of Life Sciences, Fudan University, Shanghai, China
| | - Hui Yuan
- State Key Laboratory of Genetic Engineering, Shanghai Engineering Research Center of Industrial Microorganisms, Ministry of Education Engineering Research Centre of Gene Technology, School of Life Sciences, Fudan University, Shanghai, China
| | - Wenhao Du
- State Key Laboratory of Genetic Engineering, Shanghai Engineering Research Center of Industrial Microorganisms, Ministry of Education Engineering Research Centre of Gene Technology, School of Life Sciences, Fudan University, Shanghai, China
| | - Gan Li
- State Key Laboratory of Genetic Engineering, Shanghai Engineering Research Center of Industrial Microorganisms, Ministry of Education Engineering Research Centre of Gene Technology, School of Life Sciences, Fudan University, Shanghai, China
| | - Dongmei Xue
- State Key Laboratory of Genetic Engineering, Shanghai Engineering Research Center of Industrial Microorganisms, Ministry of Education Engineering Research Centre of Gene Technology, School of Life Sciences, Fudan University, Shanghai, China
| | - Qiang Huang
- State Key Laboratory of Genetic Engineering, Shanghai Engineering Research Center of Industrial Microorganisms, Ministry of Education Engineering Research Centre of Gene Technology, School of Life Sciences, Fudan University, Shanghai, China.,Multiscale Research Institute of Complex Systems, Fudan University, Shanghai, China
| |
Collapse
|
4
|
Ng JF, Fraternali F. Understanding the structural details of APOBEC3-DNA interactions using graph-based representations. Curr Res Struct Biol 2020; 2:130-143. [PMID: 34235473 PMCID: PMC8244423 DOI: 10.1016/j.crstbi.2020.07.001] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2020] [Revised: 07/17/2020] [Accepted: 07/21/2020] [Indexed: 12/22/2022] Open
Abstract
Human APOBEC3 (A3; apolipoprotein B mRNA editing catalytic polypeptide-like 3) is a family of seven enzymes involved in generating mutations in nascent reverse transcripts of many retroviruses, as well as the human genome in a range of cancer types. The structural details of the interaction between A3 proteins and DNA molecules are only available for a few family members. Here we use homology modelling techniques to address the difference in structural coverage of human A3 enzymes interacting with different DNA substrates. A3-DNA interfaces are represented as residue networks ("graphs"), based on which features at these interfaces are compared and quantified. We demonstrate that graph-based representations are effective in highlighting structural features of A3-DNA interfaces. By large-scale in silico mutagenesis of the bound DNA chain, we predicted the preference of substrate DNA sequence for multiple A3 domains. These data suggested that computational modelling approaches could contribute in the exploration of the structural basis for sequence specificity in A3 substrate selection, and demonstrated the utility of graph-based approaches in evaluating a large number of structural models generated in silico. APOBEC3(A3)-DNA structures have been resolved with modified deaminase domains. Structural modelling of interaction between wild-type A3 domains and DNA substrates. Graph-based representations reveal structural differences across A3-DNA interfaces. Using in silico mutagenesis we compared substrate preference of multiple A3 domains. Graph-based approaches can efficiently compare a large number of structural models.
Collapse
|
5
|
Bissonnette S, Del Grosso E, Simon AJ, Plaxco KW, Ricci F, Vallée-Bélisle A. Optimizing the Specificity Window of Biomolecular Receptors Using Structure-Switching and Allostery. ACS Sens 2020; 5:1937-1942. [PMID: 32297508 DOI: 10.1021/acssensors.0c00237] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
To ensure maximum specificity (i.e., minimize cross-reactivity with structurally similar analogues of the desired target), most bioassays invoke "stringency", the careful tuning of the conditions employed (e.g., pH, ionic strength, or temperature). Willingness to control assay conditions will fall, however, as quantitative, single-step biosensors begin to replace multistep analytical processes. This is especially true for sensors deployed in vivo, where the tuning of such parameters is not just inconvenient but impossible. In response, we describe here the rational adaptation of two strategies employed by nature to tune the affinity of biomolecular receptors so as to optimize the placement of their specificity "windows" without the need to alter measurement conditions: structure-switching and allosteric control. We quantitatively validate these approaches using two distinct, DNA-based receptors: a simple, linear-chain DNA suitable for detecting a complementary DNA strand and a structurally complex DNA aptamer used for the detection of a small-molecule drug. Using these models, we show that, without altering assay conditions, structure-switching and allostery can tune the concentration range over which a receptor achieves optimal specificity over orders of magnitude, thus optimally matching the specificity window with the range of target concentrations expected to be seen in a given application.
Collapse
Affiliation(s)
- Stéphanie Bissonnette
- Laboratory of Biosensors & Nanomachines, Département de Chimie, Département de Biochimie et Médecine Moléculaire, Université de Montréal, C.P. 6128, Succursale Centre-ville, Montréal, Québec H3C 3J7, Canada
| | - Erica Del Grosso
- Dipartimento di Scienze e Tecnologie Chimiche, University of Rome, Tor Vergata, Via della Ricerca Scientifica, 00133 Rome, Italy
- Consorzio Interuniversitario Biostrutture e Biosistemi “INBB”, Rome 00136, Italy
| | | | | | - Francesco Ricci
- Dipartimento di Scienze e Tecnologie Chimiche, University of Rome, Tor Vergata, Via della Ricerca Scientifica, 00133 Rome, Italy
- Consorzio Interuniversitario Biostrutture e Biosistemi “INBB”, Rome 00136, Italy
| | - Alexis Vallée-Bélisle
- Laboratory of Biosensors & Nanomachines, Département de Chimie, Département de Biochimie et Médecine Moléculaire, Université de Montréal, C.P. 6128, Succursale Centre-ville, Montréal, Québec H3C 3J7, Canada
| |
Collapse
|
6
|
Leman JK, Weitzner BD, Lewis SM, Adolf-Bryfogle J, Alam N, Alford RF, Aprahamian M, Baker D, Barlow KA, Barth P, Basanta B, Bender BJ, Blacklock K, Bonet J, Boyken SE, Bradley P, Bystroff C, Conway P, Cooper S, Correia BE, Coventry B, Das R, De Jong RM, DiMaio F, Dsilva L, Dunbrack R, Ford AS, Frenz B, Fu DY, Geniesse C, Goldschmidt L, Gowthaman R, Gray JJ, Gront D, Guffy S, Horowitz S, Huang PS, Huber T, Jacobs TM, Jeliazkov JR, Johnson DK, Kappel K, Karanicolas J, Khakzad H, Khar KR, Khare SD, Khatib F, Khramushin A, King IC, Kleffner R, Koepnick B, Kortemme T, Kuenze G, Kuhlman B, Kuroda D, Labonte JW, Lai JK, Lapidoth G, Leaver-Fay A, Lindert S, Linsky T, London N, Lubin JH, Lyskov S, Maguire J, Malmström L, Marcos E, Marcu O, Marze NA, Meiler J, Moretti R, Mulligan VK, Nerli S, Norn C, Ó'Conchúir S, Ollikainen N, Ovchinnikov S, Pacella MS, Pan X, Park H, Pavlovicz RE, Pethe M, Pierce BG, Pilla KB, Raveh B, Renfrew PD, Burman SSR, Rubenstein A, Sauer MF, Scheck A, Schief W, Schueler-Furman O, Sedan Y, Sevy AM, Sgourakis NG, Shi L, Siegel JB, Silva DA, Smith S, Song Y, Stein A, Szegedy M, Teets FD, Thyme SB, Wang RYR, Watkins A, Zimmerman L, Bonneau R. Macromolecular modeling and design in Rosetta: recent methods and frameworks. Nat Methods 2020; 17:665-680. [PMID: 32483333 PMCID: PMC7603796 DOI: 10.1038/s41592-020-0848-2] [Citation(s) in RCA: 454] [Impact Index Per Article: 90.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2019] [Accepted: 04/22/2020] [Indexed: 12/12/2022]
Abstract
The Rosetta software for macromolecular modeling, docking and design is extensively used in laboratories worldwide. During two decades of development by a community of laboratories at more than 60 institutions, Rosetta has been continuously refactored and extended. Its advantages are its performance and interoperability between broad modeling capabilities. Here we review tools developed in the last 5 years, including over 80 methods. We discuss improvements to the score function, user interfaces and usability. Rosetta is available at http://www.rosettacommons.org.
Collapse
Affiliation(s)
- Julia Koehler Leman
- Center for Computational Biology, Flatiron Institute, Simons Foundation, New York, NY, USA.
- Department of Biology, New York University, New York, New York, USA.
| | - Brian D Weitzner
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, USA
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- Lyell Immunopharma Inc., Seattle, WA, USA
| | - Steven M Lewis
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
- Department of Biochemistry, Duke University, Durham, NC, USA
- Cyrus Biotechnology, Seattle, WA, USA
| | - Jared Adolf-Bryfogle
- Department of Immunology and Microbiology, The Scripps Research Institute, La Jolla, CA, USA
| | - Nawsad Alam
- Department of Microbiology and Molecular Genetics, IMRIC, Ein Kerem Faculty of Medicine, Hebrew University of Jerusalem, Jerusalem, Israel
| | - Rebecca F Alford
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, USA
| | - Melanie Aprahamian
- Department of Chemistry and Biochemistry, Ohio State University, Columbus, OH, USA
| | - David Baker
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Kyle A Barlow
- Graduate Program in Bioinformatics, University of California San Francisco, San Francisco, CA, USA
| | - Patrick Barth
- Institute of Bioengineering, École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland
- Baylor College of Medicine, Department of Pharmacology, Houston, TX, USA
| | - Benjamin Basanta
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Biological Physics Structure and Design PhD Program, University of Washington, Seattle, WA, USA
| | - Brian J Bender
- Department of Pharmacology, Vanderbilt University, Nashville, TN, USA
| | - Kristin Blacklock
- Institute of Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ, USA
| | - Jaume Bonet
- Institute of Bioengineering, École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Scott E Boyken
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- Lyell Immunopharma Inc., Seattle, WA, USA
| | - Phil Bradley
- Fred Hutchinson Cancer Research Center, Seattle, WA, USA
| | - Chris Bystroff
- Department of Biological Sciences, Rensselaer Polytechnic Institute, Troy, NY, USA
| | - Patrick Conway
- Department of Biochemistry, University of Washington, Seattle, WA, USA
| | - Seth Cooper
- Khoury College of Computer Sciences, Northeastern University, Boston, MA, USA
| | - Bruno E Correia
- Institute of Bioengineering, École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Brian Coventry
- Department of Biochemistry, University of Washington, Seattle, WA, USA
| | - Rhiju Das
- Department of Biochemistry, Stanford University School of Medicine, Stanford, CA, USA
| | | | - Frank DiMaio
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Lorna Dsilva
- Khoury College of Computer Sciences, Northeastern University, Boston, MA, USA
| | - Roland Dunbrack
- Institute for Cancer Research, Fox Chase Cancer Center, Philadelphia, PA, USA
| | - Alexander S Ford
- Department of Biochemistry, University of Washington, Seattle, WA, USA
| | - Brandon Frenz
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- Cyrus Biotechnology, Seattle, WA, USA
| | - Darwin Y Fu
- Department of Chemistry, Vanderbilt University, Nashville, TN, USA
| | - Caleb Geniesse
- Department of Biochemistry, Stanford University School of Medicine, Stanford, CA, USA
| | | | - Ragul Gowthaman
- University of Maryland Institute for Bioscience and Biotechnology Research, Rockville, MD, USA
- Department of Cell Biology and Molecular Genetics, University of Maryland, College Park, MD, USA
| | - Jeffrey J Gray
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, USA
- Program in Molecular Biophysics, Johns Hopkins University, Baltimore, MD, USA
| | - Dominik Gront
- Faculty of Chemistry, Biological and Chemical Research Centre, University of Warsaw, Warsaw, Poland
| | - Sharon Guffy
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Scott Horowitz
- Department of Chemistry & Biochemistry, University of Denver, Denver, CO, USA
- The Knoebel Institute for Healthy Aging, University of Denver, Denver, CO, USA
| | - Po-Ssu Huang
- Department of Biochemistry, University of Washington, Seattle, WA, USA
| | - Thomas Huber
- Research School of Chemistry, Australian National University, Canberra, Australian Capital Territory, Australia
| | - Tim M Jacobs
- Program in Bioinformatics and Computational Biology, Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | | | - David K Johnson
- Center for Computational Biology, University of Kansas, Lawrence, KS, USA
| | - Kalli Kappel
- Biophysics Program, Stanford University, Stanford, CA, USA
| | - John Karanicolas
- Institute for Cancer Research, Fox Chase Cancer Center, Philadelphia, PA, USA
| | - Hamed Khakzad
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
- Institute for Computational Science, University of Zurich, Zurich, Switzerland
- S3IT, University of Zurich, Zurich, Switzerland
| | - Karen R Khar
- Cyrus Biotechnology, Seattle, WA, USA
- Center for Computational Biology, University of Kansas, Lawrence, KS, USA
| | - Sagar D Khare
- Institute of Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ, USA
- Department of Chemistry and Chemical Biology, Rutgers, The State University of New Jersey, Piscataway, NJ, USA
- Department of Chemistry and Chemical Biology, The State University of New Jersey, Piscataway, NJ, USA
- Center for Integrative Proteomics Research, Rutgers, The State University of New Jersey, Piscataway, NJ, USA
- Computational Biology and Molecular Biophysics Program, Rutgers, The State University of New Jersey, Piscataway, NJ, USA
| | - Firas Khatib
- Department of Computer and Information Science, University of Massachusetts Dartmouth, Dartmouth, MA, USA
| | - Alisa Khramushin
- Department of Microbiology and Molecular Genetics, IMRIC, Ein Kerem Faculty of Medicine, Hebrew University of Jerusalem, Jerusalem, Israel
| | - Indigo C King
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Cyrus Biotechnology, Seattle, WA, USA
| | - Robert Kleffner
- Khoury College of Computer Sciences, Northeastern University, Boston, MA, USA
| | - Brian Koepnick
- Department of Biochemistry, University of Washington, Seattle, WA, USA
| | - Tanja Kortemme
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, USA
| | - Georg Kuenze
- Department of Chemistry, Vanderbilt University, Nashville, TN, USA
- Center for Structural Biology, Vanderbilt University, Nashville, TN, USA
| | - Brian Kuhlman
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Daisuke Kuroda
- Medical Device Development and Regulation Research Center, School of Engineering, University of Tokyo, Tokyo, Japan
- Department of Bioengineering, School of Engineering, University of Tokyo, Tokyo, Japan
| | - Jason W Labonte
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, USA
- Department of Chemistry, Franklin & Marshall College, Lancaster, PA, USA
| | - Jason K Lai
- Baylor College of Medicine, Department of Pharmacology, Houston, TX, USA
| | - Gideon Lapidoth
- Department of Biomolecular Sciences, Weizmann Institute of Science, Rehovot, Israel
| | - Andrew Leaver-Fay
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Steffen Lindert
- Department of Chemistry and Biochemistry, Ohio State University, Columbus, OH, USA
| | - Thomas Linsky
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Nir London
- Department of Microbiology and Molecular Genetics, IMRIC, Ein Kerem Faculty of Medicine, Hebrew University of Jerusalem, Jerusalem, Israel
| | - Joseph H Lubin
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, USA
| | - Sergey Lyskov
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, USA
| | - Jack Maguire
- Program in Bioinformatics and Computational Biology, Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Lars Malmström
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
- Institute for Computational Science, University of Zurich, Zurich, Switzerland
- S3IT, University of Zurich, Zurich, Switzerland
- Division of Infection Medicine, Department of Clinical Sciences Lund, Faculty of Medicine, Lund University, Lund, Sweden
| | - Enrique Marcos
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Research in Biomedicine Barcelona, The Barcelona Institute of Science and Technology, Barcelona, Spain
| | - Orly Marcu
- Department of Microbiology and Molecular Genetics, IMRIC, Ein Kerem Faculty of Medicine, Hebrew University of Jerusalem, Jerusalem, Israel
| | - Nicholas A Marze
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, USA
| | - Jens Meiler
- Center for Structural Biology, Vanderbilt University, Nashville, TN, USA
- Departments of Chemistry, Pharmacology and Biomedical Informatics, Vanderbilt University, Nashville, TN, USA
- Institute for Chemical Biology, Vanderbilt University, Nashville, TN, USA
| | - Rocco Moretti
- Department of Chemistry, Vanderbilt University, Nashville, TN, USA
| | - Vikram Khipple Mulligan
- Center for Computational Biology, Flatiron Institute, Simons Foundation, New York, NY, USA
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Santrupti Nerli
- Department of Computer Science, University of California Santa Cruz, Santa Cruz, CA, USA
| | - Christoffer Norn
- Department of Biomolecular Sciences, Weizmann Institute of Science, Rehovot, Israel
| | - Shane Ó'Conchúir
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, USA
| | - Noah Ollikainen
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, USA
| | - Sergey Ovchinnikov
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- Molecular and Cellular Biology Program, University of Washington, Seattle, WA, USA
| | - Michael S Pacella
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, USA
| | - Xingjie Pan
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, USA
| | - Hahnbeom Park
- Department of Biochemistry, University of Washington, Seattle, WA, USA
| | - Ryan E Pavlovicz
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- Cyrus Biotechnology, Seattle, WA, USA
| | - Manasi Pethe
- Department of Chemistry and Chemical Biology, The State University of New Jersey, Piscataway, NJ, USA
- Center for Integrative Proteomics Research, Rutgers, The State University of New Jersey, Piscataway, NJ, USA
| | - Brian G Pierce
- University of Maryland Institute for Bioscience and Biotechnology Research, Rockville, MD, USA
- Department of Cell Biology and Molecular Genetics, University of Maryland, College Park, MD, USA
| | - Kala Bharath Pilla
- Research School of Chemistry, Australian National University, Canberra, Australian Capital Territory, Australia
| | - Barak Raveh
- Department of Microbiology and Molecular Genetics, IMRIC, Ein Kerem Faculty of Medicine, Hebrew University of Jerusalem, Jerusalem, Israel
| | - P Douglas Renfrew
- Center for Computational Biology, Flatiron Institute, Simons Foundation, New York, NY, USA
| | - Shourya S Roy Burman
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, USA
| | - Aliza Rubenstein
- Institute of Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ, USA
- Computational Biology and Molecular Biophysics Program, Rutgers, The State University of New Jersey, Piscataway, NJ, USA
| | - Marion F Sauer
- Chemical and Physical Biology Program, Vanderbilt Vaccine Center, Vanderbilt University, Nashville, TN, USA
| | - Andreas Scheck
- Institute of Bioengineering, École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - William Schief
- Department of Immunology and Microbiology, The Scripps Research Institute, La Jolla, CA, USA
| | - Ora Schueler-Furman
- Department of Microbiology and Molecular Genetics, IMRIC, Ein Kerem Faculty of Medicine, Hebrew University of Jerusalem, Jerusalem, Israel
| | - Yuval Sedan
- Department of Microbiology and Molecular Genetics, IMRIC, Ein Kerem Faculty of Medicine, Hebrew University of Jerusalem, Jerusalem, Israel
| | - Alexander M Sevy
- Chemical and Physical Biology Program, Vanderbilt Vaccine Center, Vanderbilt University, Nashville, TN, USA
| | - Nikolaos G Sgourakis
- Department of Chemistry and Biochemistry, University of California Santa Cruz, Santa Cruz, CA, USA
| | - Lei Shi
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Justin B Siegel
- Department of Chemistry, University of California, Davis, Davis, CA, USA
- Department of Biochemistry and Molecular Medicine, University of California, Davis, Davis, California, USA
- Genome Center, University of California, Davis, Davis, CA, USA
| | | | - Shannon Smith
- Department of Chemistry, Vanderbilt University, Nashville, TN, USA
| | - Yifan Song
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- Cyrus Biotechnology, Seattle, WA, USA
| | - Amelie Stein
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, USA
| | - Maria Szegedy
- Department of Chemistry and Chemical Biology, Rutgers, The State University of New Jersey, Piscataway, NJ, USA
| | - Frank D Teets
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Summer B Thyme
- Department of Biochemistry, University of Washington, Seattle, WA, USA
| | - Ray Yu-Ruei Wang
- Department of Biochemistry, University of Washington, Seattle, WA, USA
| | - Andrew Watkins
- Department of Biochemistry, Stanford University School of Medicine, Stanford, CA, USA
| | - Lior Zimmerman
- Department of Microbiology and Molecular Genetics, IMRIC, Ein Kerem Faculty of Medicine, Hebrew University of Jerusalem, Jerusalem, Israel
| | - Richard Bonneau
- Center for Computational Biology, Flatiron Institute, Simons Foundation, New York, NY, USA.
- Department of Biology, New York University, New York, New York, USA.
- Department of Computer Science, New York University, New York, NY, USA.
- Center for Data Science, New York University, New York, NY, USA.
| |
Collapse
|
7
|
Koehler Leman J, Weitzner BD, Renfrew PD, Lewis SM, Moretti R, Watkins AM, Mulligan VK, Lyskov S, Adolf-Bryfogle J, Labonte JW, Krys J, Bystroff C, Schief W, Gront D, Schueler-Furman O, Baker D, Bradley P, Dunbrack R, Kortemme T, Leaver-Fay A, Strauss CEM, Meiler J, Kuhlman B, Gray JJ, Bonneau R. Better together: Elements of successful scientific software development in a distributed collaborative community. PLoS Comput Biol 2020; 16:e1007507. [PMID: 32365137 PMCID: PMC7197760 DOI: 10.1371/journal.pcbi.1007507] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open
Abstract
Many scientific disciplines rely on computational methods for data analysis, model generation, and prediction. Implementing these methods is often accomplished by researchers with domain expertise but without formal training in software engineering or computer science. This arrangement has led to underappreciation of sustainability and maintainability of scientific software tools developed in academic environments. Some software tools have avoided this fate, including the scientific library Rosetta. We use this software and its community as a case study to show how modern software development can be accomplished successfully, irrespective of subject area. Rosetta is one of the largest software suites for macromolecular modeling, with 3.1 million lines of code and many state-of-the-art applications. Since the mid 1990s, the software has been developed collaboratively by the RosettaCommons, a community of academics from over 60 institutions worldwide with diverse backgrounds including chemistry, biology, physiology, physics, engineering, mathematics, and computer science. Developing this software suite has provided us with more than two decades of experience in how to effectively develop advanced scientific software in a global community with hundreds of contributors. Here we illustrate the functioning of this development community by addressing technical aspects (like version control, testing, and maintenance), community-building strategies, diversity efforts, software dissemination, and user support. We demonstrate how modern computational research can thrive in a distributed collaborative community. The practices described here are independent of subject area and can be readily adopted by other software development communities.
Collapse
Affiliation(s)
- Julia Koehler Leman
- Center for Computational Biology, Flatiron Institute, Simons Foundation, New York, NY, United States of America
- Dept of Biology, New York University, New York, NY, United States of America
| | - Brian D. Weitzner
- Dept of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, United States of America
- Dept of Biochemistry, University of Washington, Seattle, WA, United States of America
- Institute for Protein Design, University of Washington, Seattle, WA, United States of America
- Lyell Immunopharma, Seattle, WA, United States of America
| | - P. Douglas Renfrew
- Center for Computational Biology, Flatiron Institute, Simons Foundation, New York, NY, United States of America
| | - Steven M. Lewis
- Dept of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC, United States of America
- Dept of Biochemistry, Duke University, Durham, NC, United States of America
- Cyrus Biotechnology, Seattle, WA United States of America
| | - Rocco Moretti
- Dept of Chemistry, Vanderbilt University, Nashville, TN, United States of America
| | - Andrew M. Watkins
- Dept of Biochemistry, Stanford University School of Medicine, Stanford CA, United States of America
| | - Vikram Khipple Mulligan
- Center for Computational Biology, Flatiron Institute, Simons Foundation, New York, NY, United States of America
- Dept of Biochemistry, University of Washington, Seattle, WA, United States of America
- Institute for Protein Design, University of Washington, Seattle, WA, United States of America
| | - Sergey Lyskov
- Dept of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, United States of America
| | - Jared Adolf-Bryfogle
- Dept of Immunology and Microbiology, The Scripps Research Institute, La Jolla, CA, United States of America
| | - Jason W. Labonte
- Dept of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, United States of America
- Dept of Chemistry, Franklin & Marshall College, Lancaster, PA, United States of America
| | - Justyna Krys
- Dept of Chemistry, University of Warsaw, Warsaw, Poland
| | | | - Christopher Bystroff
- Dept of Biological Sciences, Rensselaer Polytechnic Institute, Troy, NY, United States of America
| | - William Schief
- Dept of Immunology and Microbiology, The Scripps Research Institute, La Jolla, CA, United States of America
| | - Dominik Gront
- Dept of Chemistry, University of Warsaw, Warsaw, Poland
| | - Ora Schueler-Furman
- Dept of Microbiology and Molecular Genetics, IMRIC, Ein Kerem Faculty of Medicine, Hebrew University of Jerusalem, Jerusalem, Israel
| | - David Baker
- Dept of Biochemistry, University of Washington, Seattle, WA, United States of America
- Institute for Protein Design, University of Washington, Seattle, WA, United States of America
| | - Philip Bradley
- Fred Hutchinson Cancer Research Center, Seattle, WA, United States of America
| | - Roland Dunbrack
- Institute for Cancer Research, Fox Chase Cancer Center, Philadelphia PA, United States of America
| | - Tanja Kortemme
- Dept of Bioengineering and Therapeutic Sciences, University of California San Francisco, CA, United States of America
| | - Andrew Leaver-Fay
- Dept of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC, United States of America
| | - Charlie E. M. Strauss
- Bioscience Division, Los Alamos National Laboratory, Los Alamos, NM, United States of America
| | - Jens Meiler
- Depts of Chemistry, Pharmacology and Biomedical Informatics, Vanderbilt University, Nashville, TN, United States of America
- Center for Structural Biology, Vanderbilt University, Nashville, TN, United States of America
- Institute for Chemical Biology, Vanderbilt University, Nashville, TN, United States of America
- Institute for Drug Discovery, Leipzig University, Leipzig, Germany
| | - Brian Kuhlman
- Dept of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC, United States of America
| | - Jeffrey J. Gray
- Dept of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, United States of America
| | - Richard Bonneau
- Center for Computational Biology, Flatiron Institute, Simons Foundation, New York, NY, United States of America
- Dept of Biology, New York University, New York, NY, United States of America
- Dept of Computer Science, New York University, New York, NY, United States of America
- Center for Data Science, New York University, New York, NY, United States of America
| |
Collapse
|
8
|
Ye W, Liu T, Zhu M, Zhang W, Huang Z, Li S, Li H, Kong Y, Chen Y. An Easy and Efficient Strategy for the Enhancement of Epothilone Production Mediated by TALE-TF and CRISPR/dcas9 Systems in Sorangium cellulosum. Front Bioeng Biotechnol 2019; 7:334. [PMID: 32039165 PMCID: PMC6988809 DOI: 10.3389/fbioe.2019.00334] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2019] [Accepted: 10/30/2019] [Indexed: 11/23/2022] Open
Abstract
Epothilones are a kind of macrolides with strong cytotoxicity toward cancer cells and relatively lower side effects compared with taxol. Epothilone B derivate ixabepilone has been used for the clinical treatment of advanced breast cancer. However, the low yield of epothilones and the difficulty in the genetic manipulation of Sorangium cellulosum limited their wider application. Transcription activator-like effectors-Trancriptional factor (TALE-TF)-VP64 and clustered regularly interspaced short palindromic repeats (CRISPR)/dCas9-VP64 have been demonstrated as effective systems for the transcriptional improvement. In this study, a promoter for the epothilone biosynthesis cluster was obtained and the function has been verified. The TALE-TF-VP64 and CRISPR/dcas9-VP64 target P3 promoter were electroporated into S. cellulosum strain So ce M4, and the transcriptional levels of epothilone biosynthesis-related genes were significantly upregulated. The yield of epothilone B was improved by 2.89- and 1.53-fold by the introduction of recombinant TALE-TF-VP64-P3 and dCas9-VP64-P3 elements into So ce M4, respectively. The epothilone D yield was also improved by 1.12- and 2.18-fold in recombinant dCas9-So ce M4 and TALE-VP64 strains, respectively. The transcriptional regulation mechanism of TALE-TF-VP64 and the competition mechanism with endogenous transcriptional factor were investigated by electrophoretic mobility shift assay (EMSA) and chromatin immunoprecipitation (ChIP), demonstrating the combination of the P3 promoter and TALE-TF element and the competition between TALE-TF and endogenous transcriptional protein. This is the first report on the transcriptional regulation of the epothilone biosynthetic gene cluster in S. cellulosum using the TALE-TF and dCas9-VP64 systems, and the regulatory mechanism of the TALE-TF system for epothilone biosynthesis in S. cellulosum was also firstly revealed, thus shedding light on the metabolic engineering of S. cellulosum to improve epothilone yields substantially and promoting the application of epothilones in the biomedical industry.
Collapse
Affiliation(s)
- Wei Ye
- State Key Laboratory of Applied Microbiology Southern China, Guangdong Provincial Key Laboratory of Microbial Culture Collection and Application, Guangdong Open Laboratory of Applied Microbiology, Guangdong Institute of Microbiology, Guangdong Academy of Sciences, Guangzhou, China
| | - Taomei Liu
- State Key Laboratory of Applied Microbiology Southern China, Guangdong Provincial Key Laboratory of Microbial Culture Collection and Application, Guangdong Open Laboratory of Applied Microbiology, Guangdong Institute of Microbiology, Guangdong Academy of Sciences, Guangzhou, China
| | - Muzi Zhu
- State Key Laboratory of Applied Microbiology Southern China, Guangdong Provincial Key Laboratory of Microbial Culture Collection and Application, Guangdong Open Laboratory of Applied Microbiology, Guangdong Institute of Microbiology, Guangdong Academy of Sciences, Guangzhou, China
| | - Weimin Zhang
- State Key Laboratory of Applied Microbiology Southern China, Guangdong Provincial Key Laboratory of Microbial Culture Collection and Application, Guangdong Open Laboratory of Applied Microbiology, Guangdong Institute of Microbiology, Guangdong Academy of Sciences, Guangzhou, China
| | - Zilei Huang
- State Key Laboratory of Applied Microbiology Southern China, Guangdong Provincial Key Laboratory of Microbial Culture Collection and Application, Guangdong Open Laboratory of Applied Microbiology, Guangdong Institute of Microbiology, Guangdong Academy of Sciences, Guangzhou, China
| | - Saini Li
- State Key Laboratory of Applied Microbiology Southern China, Guangdong Provincial Key Laboratory of Microbial Culture Collection and Application, Guangdong Open Laboratory of Applied Microbiology, Guangdong Institute of Microbiology, Guangdong Academy of Sciences, Guangzhou, China
| | - Haohua Li
- State Key Laboratory of Applied Microbiology Southern China, Guangdong Provincial Key Laboratory of Microbial Culture Collection and Application, Guangdong Open Laboratory of Applied Microbiology, Guangdong Institute of Microbiology, Guangdong Academy of Sciences, Guangzhou, China
| | - Yali Kong
- State Key Laboratory of Applied Microbiology Southern China, Guangdong Provincial Key Laboratory of Microbial Culture Collection and Application, Guangdong Open Laboratory of Applied Microbiology, Guangdong Institute of Microbiology, Guangdong Academy of Sciences, Guangzhou, China
| | - Yuchan Chen
- State Key Laboratory of Applied Microbiology Southern China, Guangdong Provincial Key Laboratory of Microbial Culture Collection and Application, Guangdong Open Laboratory of Applied Microbiology, Guangdong Institute of Microbiology, Guangdong Academy of Sciences, Guangzhou, China
| |
Collapse
|
9
|
Chen WF, Wei XB, Rety S, Huang LY, Liu NN, Dou SX, Xi XG. Structural analysis reveals a "molecular calipers" mechanism for a LATERAL ORGAN BOUNDARIES DOMAIN transcription factor protein from wheat. J Biol Chem 2018; 294:142-156. [PMID: 30425099 DOI: 10.1074/jbc.ra118.003956] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2018] [Revised: 11/10/2018] [Indexed: 01/13/2023] Open
Abstract
LATERAL ORGAN BOUNDARIES DOMAIN (LBD) proteins, a family of plant-specific transcription factors harboring a conserved Lateral Organ Boundaries (LOB) domain, are regulators of plant organ development. Recent studies have unraveled additional pivotal roles of the LBD protein family beyond defining lateral organ boundaries, such as pollen development and nitrogen metabolism. The structural basis for the molecular network of LBD-dependent processes remains to be deciphered. Here, we solved the first structure of the homodimeric LOB domain of Ramosa2 from wheat (TtRa2LD) to 1.9 Å resolution. Our crystal structure reveals structural features shared with other zinc-finger transcriptional factors, as well as some features unique to LBD proteins. Formation of the TtRa2LD homodimer relied on hydrophobic interactions of its coiled-coil motifs. Several specific motifs/domains of the LBD protein were also involved in maintaining its overall conformation. The intricate assembly within and between the monomers determined the precise spatial configuration of the two zinc fingers that recognize palindromic DNA sequences. Biochemical, molecular modeling, and small-angle X-ray scattering experiments indicated that dimerization is important for cooperative DNA binding and discrimination of palindromic DNA through a molecular calipers mechanism. Along with previously published data, this study enables us to establish an atomic-scale mechanistic model for LBD proteins as transcriptional regulators in plants.
Collapse
Affiliation(s)
- Wei-Fei Chen
- State Key Laboratory of Crop Stress Biology in Arid Areas, College of Life Sciences, Northwest A&F University, Yangling, Shaanxi 712100, China
| | - Xiao-Bin Wei
- State Key Laboratory of Crop Stress Biology in Arid Areas, College of Life Sciences, Northwest A&F University, Yangling, Shaanxi 712100, China; School of Life Science and Engineering, Henan University of Urban Construction, Pingdingshan, Henan, 467044, China
| | - Stephane Rety
- University Lyon, ENS de Lyon, University Claude Bernard, CNRS UMR 5239, INSERM U1210, LBMC, 46 Allée d'Italie Site Jacques Monod, F-69007, Lyon, France.
| | - Ling-Yun Huang
- State Key Laboratory of Crop Stress Biology in Arid Areas, College of Life Sciences, Northwest A&F University, Yangling, Shaanxi 712100, China
| | - Na-Nv Liu
- State Key Laboratory of Crop Stress Biology in Arid Areas, College of Life Sciences, Northwest A&F University, Yangling, Shaanxi 712100, China
| | - Shuo-Xing Dou
- Beijing National Laboratory for Condensed Matter Physics and CAS Key Laboratory of Soft Matter Physics, Institute of Physics, Chinese Academy of Sciences, Beijing 100190, China
| | - Xu-Guang Xi
- State Key Laboratory of Crop Stress Biology in Arid Areas, College of Life Sciences, Northwest A&F University, Yangling, Shaanxi 712100, China; LBPA, Ecole Normale Supérieure Paris-Saclay, CNRS, Université Paris Saclay, 61 Avenue du Président Wilson, F-94235 Cachan, France.
| |
Collapse
|
10
|
Huai C, Li G, Yao R, Zhang Y, Cao M, Kong L, Jia C, Yuan H, Chen H, Lu D, Huang Q. Structural insights into DNA cleavage activation of CRISPR-Cas9 system. Nat Commun 2017; 8:1375. [PMID: 29123204 PMCID: PMC5680257 DOI: 10.1038/s41467-017-01496-2] [Citation(s) in RCA: 93] [Impact Index Per Article: 11.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2017] [Accepted: 09/21/2017] [Indexed: 01/10/2023] Open
Abstract
CRISPR-Cas9 technology has been widely used for genome engineering. Its RNA-guided endonuclease Cas9 binds specifically to target DNA and then cleaves the two DNA strands with HNH and RuvC nuclease domains. However, structural information regarding the DNA cleavage-activating state of two nuclease domains remains sparse. Here, we report a 5.2 Å cryo-EM structure of Cas9 in complex with sgRNA and target DNA. This structure reveals a conformational state of Cas9 in which the HNH domain is closest to the DNA cleavage site. Compared with two known HNH states, our structure shows that the HNH active site moves toward the cleavage site by about 25 and 13 Å, respectively. In combination with EM-based molecular dynamics simulations, we show that residues of the nuclease domains in our structure could form cleavage-compatible conformations with the target DNA. Together, these results strongly suggest that our cryo-EM structure resembles a DNA cleavage-activating architecture of Cas9.
Collapse
Affiliation(s)
- Cong Huai
- State Key Laboratory of Genetic Engineering, School of Life Sciences, Fudan University, Shanghai, 200438, China
| | - Gan Li
- State Key Laboratory of Genetic Engineering, School of Life Sciences, Fudan University, Shanghai, 200438, China
| | - Ruijie Yao
- State Key Laboratory of Genetic Engineering, School of Life Sciences, Fudan University, Shanghai, 200438, China
| | - Yingyi Zhang
- National Center for Protein Science Shanghai, Institute of Biochemistry and Cell Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, 200031, China
- Shanghai Science Research Center, Chinese Academy of Sciences, Shanghai, 201204, China
| | - Mi Cao
- National Center for Protein Science Shanghai, Institute of Biochemistry and Cell Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, 200031, China
- Shanghai Science Research Center, Chinese Academy of Sciences, Shanghai, 201204, China
| | - Liangliang Kong
- National Center for Protein Science Shanghai, Institute of Biochemistry and Cell Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, 200031, China
- Shanghai Science Research Center, Chinese Academy of Sciences, Shanghai, 201204, China
| | - Chenqiang Jia
- State Key Laboratory of Genetic Engineering, School of Life Sciences, Fudan University, Shanghai, 200438, China
| | - Hui Yuan
- State Key Laboratory of Genetic Engineering, School of Life Sciences, Fudan University, Shanghai, 200438, China
| | - Hongyan Chen
- State Key Laboratory of Genetic Engineering, School of Life Sciences, Fudan University, Shanghai, 200438, China
| | - Daru Lu
- State Key Laboratory of Genetic Engineering, School of Life Sciences, Fudan University, Shanghai, 200438, China.
| | - Qiang Huang
- State Key Laboratory of Genetic Engineering, School of Life Sciences, Fudan University, Shanghai, 200438, China.
| |
Collapse
|
11
|
Corona RI, Guo JT. Statistical analysis of structural determinants for protein-DNA-binding specificity. Proteins 2016; 84:1147-61. [PMID: 27147539 DOI: 10.1002/prot.25061] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2016] [Revised: 04/21/2016] [Accepted: 04/28/2016] [Indexed: 12/27/2022]
Abstract
DNA-binding proteins play critical roles in biological processes including gene expression, DNA packaging and DNA repair. They bind to DNA target sequences with different degrees of binding specificity, ranging from highly specific (HS) to nonspecific (NS). Alterations of DNA-binding specificity, due to either genetic variation or somatic mutations, can lead to various diseases. In this study, a comparative analysis of protein-DNA complex structures was carried out to investigate the structural features that contribute to binding specificity. Protein-DNA complexes were grouped into three general classes based on degrees of binding specificity: HS, multispecific (MS), and NS. Our results show a clear trend of structural features among the three classes, including amino acid binding propensities, simple and complex hydrogen bonds, major/minor groove and base contacts, and DNA shape. We found that aspartate is enriched in HS DNA binding proteins and predominately binds to a cytosine through a single hydrogen bond or two consecutive cytosines through bidentate hydrogen bonds. Aromatic residues, histidine and tyrosine, are highly enriched in the HS and MS groups and may contribute to specific binding through different mechanisms. To further investigate the role of protein flexibility in specific protein-DNA recognition, we analyzed the conformational changes between the bound and unbound states of DNA-binding proteins and structural variations. The results indicate that HS and MS DNA-binding domains have larger conformational changes upon DNA-binding and larger degree of flexibility in both bound and unbound states. Proteins 2016; 84:1147-1161. © 2016 Wiley Periodicals, Inc.
Collapse
Affiliation(s)
- Rosario I Corona
- Department of Bioinformatics and Genomics, College of Computing and Informatics, The University of North Carolina at Charlotte, Charlotte, North Carolina, 28223
| | - Jun-Tao Guo
- Department of Bioinformatics and Genomics, College of Computing and Informatics, The University of North Carolina at Charlotte, Charlotte, North Carolina, 28223
| |
Collapse
|
12
|
Xiao X, Agris PF, Hall CK. Designing peptide sequences in flexible chain conformations to bind RNA: a search algorithm combining Monte Carlo, self-consistent mean field and concerted rotation techniques. J Chem Theory Comput 2016; 11:740-52. [PMID: 26579605 DOI: 10.1021/ct5008247] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/20/2023]
Abstract
A search algorithm combining Monte Carlo, self-consistent mean field, and concerted rotation techniques was developed to discover peptide sequences that are reasonable HIV drug candidates due to their exceptional binding to human tRNAUUU(Lys3), the primer of HIV replication. The search algorithm allows for iteration between sequence mutations and conformation changes during sequence evolution. Searches conducted for different classes of peptides identified several potential peptide candidates. Analysis of the energy revealed that the asparagine and cysteine at residues 11 and 12 play important roles in "recognizing" tRNA(Lys3) via van der Waals interactions, contributing to binding specificity. Arginines preferentially attract the phosphate linkage via charge-charge interaction, contributing to binding affinity. Evaluation of the RNA/peptide complex's structure revealed that adding conformation changes to the search algorithm yields peptides with better binding affinity and specificity to tRNA(Lys3) than a previous mutation-only algorithm.
Collapse
Affiliation(s)
- Xingqing Xiao
- Chemical and Biomolecular Engineering Department, North Carolina State University , Raleigh, North Carolina 27695-7905, United States
| | - Paul F Agris
- The RNA Institute, University at Albany, State University of New York , Albany, New York 12222, United States
| | - Carol K Hall
- Chemical and Biomolecular Engineering Department, North Carolina State University , Raleigh, North Carolina 27695-7905, United States
| |
Collapse
|
13
|
Tarry MJ, Schmeing TM. Specific disulfide cross-linking to constrict the mobile carrier domain of nonribosomal peptide synthetases. Protein Eng Des Sel 2015; 28:163-70. [PMID: 25713404 DOI: 10.1093/protein/gzv009] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2014] [Accepted: 02/03/2015] [Indexed: 01/03/2023] Open
Abstract
Nonribosomal peptide synthetases are large, multi-domain enzymes that produce peptide molecules with important biological activity such as antibiotic, antiviral, anti-tumor, siderophore and immunosuppressant action. The adenylation (A) domain catalyzes two reactions in the biosynthetic pathway. In the first reaction, it activates the substrate amino acid by adenylation and in the second reaction it transfers the amino acid onto the phosphopantetheine arm of the adjacent peptide carrier protein (PCP) domain. The conformation of the A domain differs significantly depending on which of these two reactions it is catalyzing. Recently, several structures of A-PCP di-domains have been solved using mechanism-based inhibitors to trap the PCP domain in the A domain active site. Here, we present an alternative strategy to stall the A-PCP di-domain, by engineering a disulfide bond between the native amino acid substrate and the A domain. Size exclusion studies showed a significant shift in apparent size when the mutant A-PCP was provided with cross-linking reagents, and this shift was reversible in the presence of high concentrations of reducing agent. The cross-linked protein crystallized readily in several of the conditions screened and the best crystals diffracted to ≈8 Å.
Collapse
Affiliation(s)
- Michael J Tarry
- Department of Biochemistry, McGill University, Montréal, QC, Canada H3G 0B1
| | - T Martin Schmeing
- Department of Biochemistry, McGill University, Montréal, QC, Canada H3G 0B1 Groupe de Recherche Axé sur la Structure des Protéines (GRASP), McGill University, Montréal, QC, Canada H3G 0B1
| |
Collapse
|
14
|
Ashworth J, Bernard B, Reynolds S, Plaisier CL, Shmulevich I, Baliga NS. Structure-based predictions broadly link transcription factor mutations to gene expression changes in cancers. Nucleic Acids Res 2014; 42:12973-83. [PMID: 25378323 PMCID: PMC4245936 DOI: 10.1093/nar/gku1031] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2014] [Revised: 10/09/2014] [Accepted: 10/10/2014] [Indexed: 12/24/2022] Open
Abstract
Thousands of unique mutations in transcription factors (TFs) arise in cancers, and the functional and biological roles of relatively few of these have been characterized. Here, we used structure-based methods developed specifically for DNA-binding proteins to systematically predict the consequences of mutations in several TFs that are frequently mutated in cancers. The explicit consideration of protein-DNA interactions was crucial to explain the roles and prevalence of mutations in TP53 and RUNX1 in cancers, and resulted in a higher specificity of detection for known p53-regulated genes among genetic associations between TP53 genotypes and genome-wide expression in The Cancer Genome Atlas, compared to existing methods of mutation assessment. Biophysical predictions also indicated that the relative prevalence of TP53 missense mutations in cancer is proportional to their thermodynamic impacts on protein stability and DNA binding, which is consistent with the selection for the loss of p53 transcriptional function in cancers. Structure and thermodynamics-based predictions of the impacts of missense mutations that focus on specific molecular functions may be increasingly useful for the precise and large-scale inference of aberrant molecular phenotypes in cancer and other complex diseases.
Collapse
Affiliation(s)
| | - Brady Bernard
- Institute for Systems Biology, Seattle, WA 98109, USA
| | | | | | | | | |
Collapse
|
15
|
Mota É, Sousa F, Queiroz JA, Cruz C. Quantitative analysis of the interaction between l-methionine derivative and oligonucleotides. J Biochem 2014; 157:261-70. [PMID: 25425656 DOI: 10.1093/jb/mvu073] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
This study explores the use of l-methionine derivative as a potential affinity ligand for nucleic acids purification. The l-methionine derivative is synthesized by activation of the carboxylic acid group with 1-ethyl-3-(3-dimethylaminopropyl) carbodiimide/N-hydroxysuccinimide follow by immobilization on amine sensor surface, previously activated and treated with ethylenediamine. Their affinity towards oligonucleotides has been determined by surface plasmon resonance biosensor. The highest affinity is found for cytosine and thymine, followed by adenine, whereas the lowest affinity is found for guanine. For hetero-oligonucleotides the affinity order is CCCTTT > CCCAAA ≈ AAATTT > GGGTTT, showing that nucleotides with cytosine have the highest affinity, and the presence of guanine reduces the affinity, corroborating with the results obtained with homo-oligonucleotides.
Collapse
Affiliation(s)
- Élia Mota
- CICS-UBI - Centro de Investigação em Ciências da Saúde, Universidade da Beira Interior, Av. Infante D. Henrique, 6200-506 Covilhã, Portugal
| | - Fani Sousa
- CICS-UBI - Centro de Investigação em Ciências da Saúde, Universidade da Beira Interior, Av. Infante D. Henrique, 6200-506 Covilhã, Portugal
| | - João A Queiroz
- CICS-UBI - Centro de Investigação em Ciências da Saúde, Universidade da Beira Interior, Av. Infante D. Henrique, 6200-506 Covilhã, Portugal
| | - Carla Cruz
- CICS-UBI - Centro de Investigação em Ciências da Saúde, Universidade da Beira Interior, Av. Infante D. Henrique, 6200-506 Covilhã, Portugal
| |
Collapse
|
16
|
Yu K, Liu C, Kim BG, Lee DY. Synthetic fusion protein design and applications. Biotechnol Adv 2014; 33:155-164. [PMID: 25450191 DOI: 10.1016/j.biotechadv.2014.11.005] [Citation(s) in RCA: 148] [Impact Index Per Article: 13.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2014] [Revised: 10/10/2014] [Accepted: 11/11/2014] [Indexed: 11/16/2022]
Abstract
Synthetic fusion proteins can be designed to achieve improved properties or new functionality by synergistically incorporating multiple proteins into one complex. The fusion of two or more protein domains enhances bioactivities or generates novel functional combinations with a wide range of biotechnological and (bio)pharmaceutical applications. In this review, initially, we summarize the commonly used approaches for constructing fusion proteins. For each approach, the design strategy and desired properties are elaborated with examples of recent studies in the areas of biocatalysts, protein switches and bio-therapeutics. Subsequently, the progress in structural prediction of fusion proteins is presented, which can potentially facilitate the structure-based systematic design of fusion proteins toward identifying the best combinations of fusion partners. Finally, the current challenges and future directions in this field are discussed.
Collapse
Affiliation(s)
- Kai Yu
- Department of Chemical and Biomolecular Engineering, Synthetic Biology Research Consortium, National University of Singapore, 4 Engineering Drive 4, Singapore 117585, Singapore
| | - Chengcheng Liu
- Bioprocessing Technology Institute, Agency for Science, Technology and Research (A*STAR), 20 Biopolis Way, #06-01, Singapore 138668, Singapore
| | - Byung-Gee Kim
- School of Chemical and Biological Engineering, Seoul National University, 599 Gwanak-ro, Gwanak-gu, Seoul 151742, South Korea
| | - Dong-Yup Lee
- Department of Chemical and Biomolecular Engineering, Synthetic Biology Research Consortium, National University of Singapore, 4 Engineering Drive 4, Singapore 117585, Singapore; Bioprocessing Technology Institute, Agency for Science, Technology and Research (A*STAR), 20 Biopolis Way, #06-01, Singapore 138668, Singapore.
| |
Collapse
|
17
|
Ashworth J, Plaisier CL, Lo FY, Reiss DJ, Baliga NS. Inference of expanded Lrp-like feast/famine transcription factor targets in a non-model organism using protein structure-based prediction. PLoS One 2014; 9:e107863. [PMID: 25255272 PMCID: PMC4177876 DOI: 10.1371/journal.pone.0107863] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2014] [Accepted: 08/16/2014] [Indexed: 11/18/2022] Open
Abstract
Widespread microbial genome sequencing presents an opportunity to understand the gene regulatory networks of non-model organisms. This requires knowledge of the binding sites for transcription factors whose DNA-binding properties are unknown or difficult to infer. We adapted a protein structure-based method to predict the specificities and putative regulons of homologous transcription factors across diverse species. As a proof-of-concept we predicted the specificities and transcriptional target genes of divergent archaeal feast/famine regulatory proteins, several of which are encoded in the genome of Halobacterium salinarum. This was validated by comparison to experimentally determined specificities for transcription factors in distantly related extremophiles, chromatin immunoprecipitation experiments, and cis-regulatory sequence conservation across eighteen related species of halobacteria. Through this analysis we were able to infer that Halobacterium salinarum employs a divergent local trans-regulatory strategy to regulate genes (carA and carB) involved in arginine and pyrimidine metabolism, whereas Escherichia coli employs an operon. The prediction of gene regulatory binding sites using structure-based methods is useful for the inference of gene regulatory relationships in new species that are otherwise difficult to infer.
Collapse
Affiliation(s)
- Justin Ashworth
- Institute for Systems Biology, Seattle, Washington, United States of America
- * E-mail: (JA); (NB)
| | | | - Fang Yin Lo
- Institute for Systems Biology, Seattle, Washington, United States of America
| | - David J. Reiss
- Institute for Systems Biology, Seattle, Washington, United States of America
| | - Nitin S. Baliga
- Institute for Systems Biology, Seattle, Washington, United States of America
- Department of Microbiology, University of Washington, Seattle, Washington, United States of America
- * E-mail: (JA); (NB)
| |
Collapse
|
18
|
Stittrich AB, Lehman A, Bodian D, Ashworth J, Zong Z, Li H, Lam P, Khromykh A, Iyer R, Vockley J, Baveja R, Silva E, Dixon J, Leon E, Solomon B, Glusman G, Niederhuber J, Roach J, Patel M. Mutations in NOTCH1 cause Adams-Oliver syndrome. Am J Hum Genet 2014; 95:275-84. [PMID: 25132448 DOI: 10.1016/j.ajhg.2014.07.011] [Citation(s) in RCA: 134] [Impact Index Per Article: 12.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2014] [Accepted: 07/22/2014] [Indexed: 12/16/2022] Open
Abstract
Notch signaling determines and reinforces cell fate in bilaterally symmetric multicellular eukaryotes. Despite the involvement of Notch in many key developmental systems, human mutations in Notch signaling components have mainly been described in disorders with vascular and bone effects. Here, we report five heterozygous NOTCH1 variants in unrelated individuals with Adams-Oliver syndrome (AOS), a rare disease with major features of aplasia cutis of the scalp and terminal transverse limb defects. Using whole-genome sequencing in a cohort of 11 families lacking mutations in the four genes with known roles in AOS pathology (ARHGAP31, RBPJ, DOCK6, and EOGT), we found a heterozygous de novo 85 kb deletion spanning the NOTCH1 5' region and three coding variants (c.1285T>C [p.Cys429Arg], c.4487G>A [p.Cys1496Tyr], and c.5965G>A [p.Asp1989Asn]), two of which are de novo, in four unrelated probands. In a fifth family, we identified a heterozygous canonical splice-site variant (c.743-1 G>T) in an affected father and daughter. These variants were not present in 5,077 in-house control genomes or in public databases. In keeping with the prominent developmental role described for Notch1 in mouse vasculature, we observed cardiac and multiple vascular defects in four of the five families. We propose that the limb and scalp defects might also be due to a vasculopathy in NOTCH1-related AOS. Our results suggest that mutations in NOTCH1 are the most common cause of AOS and add to a growing list of human diseases that have a vascular and/or bony component and are caused by alterations in the Notch signaling pathway.
Collapse
|
19
|
An accurate binding interaction model in de novo computational protein design of interactions: If you build it, they will bind. J Struct Biol 2014; 185:136-46. [DOI: 10.1016/j.jsb.2013.03.012] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2012] [Revised: 03/15/2013] [Accepted: 03/21/2013] [Indexed: 01/07/2023]
|
20
|
Abstract
Building protein tools that can selectively bind or cleave specific DNA sequences requires efficient technologies for modifying protein-DNA interactions. Computational design is one method for accomplishing this goal. In this chapter, we present the current state of protein-DNA interface design with the Rosetta macromolecular modeling program. The LAGLIDADG endonuclease family of DNA-cleaving enzymes, under study as potential gene therapy reagents, has been the main testing ground for these in silico protocols. At this time, the computational methods are most useful for designing endonuclease variants that can accommodate small numbers of target site substitutions. Attempts to engineer for more extensive interface changes will likely benefit from an approach that uses the computational design results in conjunction with a high-throughput directed evolution or screening procedure. The family of enzymes presents an engineering challenge because their interfaces are highly integrated and there is significant coordination between the binding and catalysis events. Future developments in the computational algorithms depend on experimental feedback to improve understanding and modeling of these complex enzymatic features. This chapter presents both the basic method of design that has been successfully used to modulate specificity and more advanced procedures that incorporate DNA flexibility and other properties that are likely necessary for reliable modeling of more extensive target site changes.
Collapse
Affiliation(s)
- Summer Thyme
- Department of Biological Sciences, University of Washington, Seattle, WA, USA
| | | |
Collapse
|
21
|
Thyme SB, Boissel SJS, Arshiya Quadri S, Nolan T, Baker DA, Park RU, Kusak L, Ashworth J, Baker D. Reprogramming homing endonuclease specificity through computational design and directed evolution. Nucleic Acids Res 2013; 42:2564-76. [PMID: 24270794 PMCID: PMC3936771 DOI: 10.1093/nar/gkt1212] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022] Open
Abstract
Homing endonucleases (HEs) can be used to induce targeted genome modification to reduce the fitness of pathogen vectors such as the malaria-transmitting Anopheles gambiae and to correct deleterious mutations in genetic diseases. We describe the creation of an extensive set of HE variants with novel DNA cleavage specificities using an integrated experimental and computational approach. Using computational modeling and an improved selection strategy, which optimizes specificity in addition to activity, we engineered an endonuclease to cleave in a gene associated with Anopheles sterility and another to cleave near a mutation that causes pyruvate kinase deficiency. In the course of this work we observed unanticipated context-dependence between bases which will need to be mechanistically understood for reprogramming of specificity to succeed more generally.
Collapse
Affiliation(s)
- Summer B Thyme
- Department of Biochemistry, University of Washington, UW Box 357350, 1705 NE Pacific St., Seattle, WA 98195, USA, Graduate Program in Biomolecular Structure and Design, University of Washington, UW Box 357350, 1705 NE Pacific St., Seattle, WA 98195, USA, Graduate Program in Molecular and Cellular Biology, University of Washington, UW Box 357275, 1959 NE Pacific St., Seattle, WA 98195, USA, Department of Life Sciences, Sir Alexander Fleming Building, Imperial College London, Imperial College Road, London SW7 2AZ, UK, Department of Genetics, University of Cambridge, Downing Street, Cambridge CB1 3QA, UK, Institute for Systems Biology, 401 Terry Avenue N, Seattle, WA 98109, USA and Howard Hughes Medical Institute, University of Washington, UW Box 357350, 1705 NE Pacific St., Seattle, WA 98195, USA
| | | | | | | | | | | | | | | | | |
Collapse
|
22
|
Bach C, Patra PK, Pallis JM, Sherman WB, Bajwa H. Strategy for naturelike designer transcription factors with reduced toxicity. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2013; 10:1340-1343. [PMID: 24384718 DOI: 10.1109/tcbb.2013.107] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/03/2023]
Abstract
For clinical applications, the biological functions of DNA-binding proteins require that they interact with their target binding site with high affinity and specificity. Advances in randomized production and target-oriented selection of engineered artificial DNA-binding domains incited a rapidly expanding field of designer transcription factors (TFs). Engineered transcription factors are used in zinc-finger nuclease (ZFN) technology that allows targeted genome editing. Zinc-finger-binding domains fabricated by modular assembly display an unexpectedly high failure rate having either a lack of activity as ZFNs in human cells or activity at "off-targetâ binding sites on the human genome causing cell death. To address these shortcomings, we created new binding domains using a targeted modification strategy. We produced two SP1 mutants by exchanging amino acid residues in the alpha-helical region of the transcription factor SP1. We identified their best target binding sites and searched the NCBI HuRef genome for matches of the nine-base-pair consensus binding site of SP1 and the best binding sites of its mutants. Our research concludes that we can alter the binding preference of existing zinc-finger domains without altering its biological functionalities.
Collapse
|
23
|
Abstract
Predicting binding sites of a transcription factor in the genome is an important, but challenging, issue in studying gene regulation. In the past decade, a large number of protein–DNA co-crystallized structures available in the Protein Data Bank have facilitated the understanding of interacting mechanisms between transcription factors and their binding sites. Recent studies have shown that both physics-based and knowledge-based potential functions can be applied to protein–DNA complex structures to deliver position weight matrices (PWMs) that are consistent with the experimental data. To further use the available structural models, the proposed Web server, PiDNA, aims at first constructing reliable PWMs by applying an atomic-level knowledge-based scoring function on numerous in silico mutated complex structures, and then using the PWM constructed by the structure models with small energy changes to predict the interaction between proteins and DNA sequences. With PiDNA, the users can easily predict the relative preference of all the DNA sequences with limited mutations from the native sequence co-crystallized in the model in a single run. More predictions on sequences with unlimited mutations can be realized by additional requests or file uploading. Three types of information can be downloaded after prediction: (i) the ranked list of mutated sequences, (ii) the PWM constructed by the favourable mutated structures, and (iii) any mutated protein–DNA complex structure models specified by the user. This study first shows that the constructed PWMs are similar to the annotated PWMs collected from databases or literature. Second, the prediction accuracy of PiDNA in detecting relatively high-specificity sites is evaluated by comparing the ranked lists against in vitro experiments from protein-binding microarrays. Finally, PiDNA is shown to be able to select the experimentally validated binding sites from 10 000 random sites with high accuracy. With PiDNA, the users can design biological experiments based on the predicted sequence specificity and/or request mutated structure models for further protein design. As well, it is expected that PiDNA can be incorporated with chromatin immunoprecipitation data to refine large-scale inference of in vivo protein–DNA interactions. PiDNA is available at: http://dna.bime.ntu.edu.tw/pidna.
Collapse
Affiliation(s)
- Chih-Kang Lin
- Center for Systems Biology, National Taiwan University, Taipei 106, Taiwan
| | | |
Collapse
|
24
|
Yokoyama KD, Pollock DD. SP transcription factor paralogs and DNA-binding sites coevolve and adaptively converge in mammals and birds. Genome Biol Evol 2013; 4:1102-17. [PMID: 23019068 PMCID: PMC3514965 DOI: 10.1093/gbe/evs085] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
Abstract
Functional modification of regulatory proteins can affect hundreds of genes throughout the genome, and is therefore thought to be almost universally deleterious. This belief, however, has recently been challenged. A potential example comes from transcription factor SP1, for which statistical evidence indicates that motif preferences were altered in eutherian mammals. Here, we set out to discover possible structural and theoretical explanations, evaluate the role of selection in SP1 evolution, and discover effects on coregulatory proteins. We show that SP1 motif preferences were convergently altered in birds as well as mammals, inducing coevolutionary changes in over 800 regulatory regions. Structural and phylogenic evidence implicates a single causative amino acid replacement at the same SP1 position along both lineages. Furthermore, paralogs SP3 and SP4, which coregulate SP1 target genes through competitive binding to the same sites, have accumulated convergent replacements at the homologous position multiple times during eutherian and bird evolution, presumably to preserve competitive binding. To determine plausibility, we developed and implemented a simple model of transcription factor and binding site coevolution. This model predicts that, in contrast to prevailing beliefs, even small selective benefits per locus can drive concurrent fixation of transcription factor and binding site mutants under a broad range of conditions. Novel binding sites tend to arise de novo, rather than by mutation from ancestral sites, a prediction substantiated by SP1-binding site alignments. Thus, multiple lines of evidence indicate that selection has driven convergent evolution of transcription factors along with their binding sites and coregulatory proteins.
Collapse
Affiliation(s)
- Ken Daigoro Yokoyama
- Department of Biochemistry and Molecular Genetics, University of Colorado, Denver School of Medicine, USA
| | | |
Collapse
|
25
|
Takeda T, Corona RI, Guo JT. A knowledge-based orientation potential for transcription factor-DNA docking. Bioinformatics 2012; 29:322-30. [DOI: 10.1093/bioinformatics/bts699] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022] Open
|
26
|
Turner D, Kim R, Guo JT. TFinDit: transcription factor-DNA interaction data depository. BMC Bioinformatics 2012; 13:220. [PMID: 22943312 PMCID: PMC3483241 DOI: 10.1186/1471-2105-13-220] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2012] [Accepted: 08/23/2012] [Indexed: 11/28/2022] Open
Abstract
Background One of the crucial steps in regulation of gene expression is the binding of transcription factor(s) to specific DNA sequences. Knowledge of the binding affinity and specificity at a structural level between transcription factors and their target sites has important implications in our understanding of the mechanism of gene regulation. Due to their unique functions and binding specificity, there is a need for a transcription factor-specific, structure-based database and corresponding web service to facilitate structural bioinformatics studies of transcription factor-DNA interactions, such as development of knowledge-based interaction potential, transcription factor-DNA docking, binding induced conformational changes, and the thermodynamics of protein-DNA interactions. Description TFinDit is a relational database and a web search tool for studying transcription factor-DNA interactions. The database contains annotated transcription factor-DNA complex structures and related data, such as unbound protein structures, thermodynamic data, and binding sequences for the corresponding transcription factors in the complex structures. TFinDit also provides a user-friendly interface and allows users to either query individual entries or generate datasets through culling the database based on one or more search criteria. Conclusions TFinDit is a specialized structural database with annotated transcription factor-DNA complex structures and other preprocessed data. We believe that this database/web service can facilitate the development and testing of TF-DNA interaction potentials and TF-DNA docking algorithms, and the study of protein-DNA recognition mechanisms.
Collapse
Affiliation(s)
- Daniel Turner
- Department of Bioinformatics and Genomics, College of Computing and Informatics, University of North Carolina at Charlotte, Charlotte, NC 28223, USA
| | | | | |
Collapse
|
27
|
Gabdoulline R, Eckweiler D, Kel A, Stegmaier P. 3DTF: a web server for predicting transcription factor PWMs using 3D structure-based energy calculations. Nucleic Acids Res 2012; 40:W180-5. [PMID: 22693215 PMCID: PMC3394331 DOI: 10.1093/nar/gks551] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/02/2022] Open
Abstract
We present the webserver 3D transcription factor (3DTF) to compute position-specific weight matrices (PWMs) of transcription factors using a knowledge-based statistical potential derived from crystallographic data on protein–DNA complexes. Analysis of available structures that can be used to construct PWMs shows that there are hundreds of 3D structures from which PWMs could be derived, as well as thousands of proteins homologous to these. Therefore, we created 3DTF, which delivers binding matrices given the experimental or modeled protein–DNA complex. The webserver can be used by biologists to derive novel PWMs for transcription factors lacking known binding sites and is freely accessible at http://www.gene-regulation.com/pub/programs/3dtf/.
Collapse
Affiliation(s)
- R Gabdoulline
- Heinrich-Heine University of Duesseldorf, Universitaetstr. 1, 40225 Duesseldorf, Germany
| | | | | | | |
Collapse
|
28
|
Thyme SB, Baker D, Bradley P. Improved modeling of side-chain--base interactions and plasticity in protein--DNA interface design. J Mol Biol 2012; 419:255-74. [PMID: 22426128 DOI: 10.1016/j.jmb.2012.03.005] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2011] [Revised: 02/09/2012] [Accepted: 03/09/2012] [Indexed: 12/30/2022]
Abstract
Combinatorial sequence optimization for protein design requires libraries of discrete side-chain conformations. The discreteness of these libraries is problematic, particularly for long, polar side chains, since favorable interactions can be missed. Previously, an approach to loop remodeling where protein backbone movement is directed by side-chain rotamers predicted to form interactions previously observed in native complexes (termed "motifs") was described. Here, we show how such motif libraries can be incorporated into combinatorial sequence optimization protocols and improve native complex recapitulation. Guided by the motif rotamer searches, we made improvements to the underlying energy function, increasing recapitulation of native interactions. To further test the methods, we carried out a comprehensive experimental scan of amino acid preferences in the I-AniI protein-DNA interface and found that many positions tolerated multiple amino acids. This sequence plasticity is not observed in the computational results because of the fixed-backbone approximation of the model. We improved modeling of this diversity by introducing DNA flexibility and reducing the convergence of the simulated annealing algorithm that drives the design process. In addition to serving as a benchmark, this extensive experimental data set provides insight into the types of interactions essential to maintain the function of this potential gene therapy reagent.
Collapse
Affiliation(s)
- Summer B Thyme
- Department of Biochemistry, University of Washington, Seattle, WA 98195, USA.
| | | | | |
Collapse
|
29
|
Benchmarks for flexible and rigid transcription factor-DNA docking. BMC STRUCTURAL BIOLOGY 2011; 11:45. [PMID: 22044637 PMCID: PMC3262759 DOI: 10.1186/1472-6807-11-45] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/09/2011] [Accepted: 11/01/2011] [Indexed: 12/27/2022]
Abstract
BACKGROUND Structural insight from transcription factor-DNA (TF-DNA) complexes is of paramount importance to our understanding of the affinity and specificity of TF-DNA interaction, and to the development of structure-based prediction of TF binding sites. Yet the majority of the TF-DNA complexes remain unsolved despite the considerable experimental efforts being made. Computational docking represents a promising alternative to bridge the gap. To facilitate the study of TF-DNA docking, carefully designed benchmarks are needed for performance evaluation and identification of the strengths and weaknesses of docking algorithms. RESULTS We constructed two benchmarks for flexible and rigid TF-DNA docking respectively using a unified non-redundant set of 38 test cases. The test cases encompass diverse fold families and are classified into easy and hard groups with respect to the degrees of difficulty in TF-DNA docking. The major parameters used to classify expected docking difficulty in flexible docking are the conformational differences between bound and unbound TFs and the interaction strength between TFs and DNA. For rigid docking in which the starting structure is a bound TF conformation, only interaction strength is considered. CONCLUSIONS We believe these benchmarks are important for the development of better interaction potentials and TF-DNA docking algorithms, which bears important implications to structure-based prediction of transcription factor binding sites and drug design.
Collapse
|
30
|
Pantazes RJ, Grisewood MJ, Maranas CD. Recent advances in computational protein design. Curr Opin Struct Biol 2011; 21:467-72. [PMID: 21600758 DOI: 10.1016/j.sbi.2011.04.005] [Citation(s) in RCA: 73] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2011] [Accepted: 04/28/2011] [Indexed: 11/30/2022]
Affiliation(s)
- Robert J Pantazes
- The Pennsylvania State University, Department of Chemical Engineering, 112 Fenske Lab, University Park, PA 16802, USA
| | | | | |
Collapse
|
31
|
|
32
|
Ulge UY, Baker DA, Monnat RJ. Comprehensive computational design of mCreI homing endonuclease cleavage specificity for genome engineering. Nucleic Acids Res 2011; 39:4330-9. [PMID: 21288879 PMCID: PMC3105429 DOI: 10.1093/nar/gkr022] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open
Abstract
Homing endonucleases (HEs) cleave long (∼ 20 bp) DNA target sites with high site specificity to catalyze the lateral transfer of parasitic DNA elements. In order to determine whether comprehensive computational design could be used as a general strategy to engineer new HE target site specificities, we used RosettaDesign (RD) to generate 3200 different variants of the mCreI LAGLIDADG HE towards 16 different base pair positions in the 22 bp mCreI target site. Experimental verification of a range of these designs demonstrated that over 2/3 (24 of 35 designs, 69%) had the intended new site specificity, and that 14 of the 15 attempted specificity shifts (93%) were achieved. These results demonstrate the feasibility of using structure-based computational design to engineer HE variants with novel target site specificities to facilitate genome engineering.
Collapse
Affiliation(s)
- Umut Y Ulge
- Department of Biochemistry, Howard Hughes Medical InstituteUniversity of Washington, Box 357705, Seattle, WA 98195, USA
| | | | | |
Collapse
|
33
|
Boryskina OP, Tkachenko MY, Shestopalova AV. Protein-DNA complexes: specificity and DNA readout mechanisms. ACTA ACUST UNITED AC 2011. [DOI: 10.7124/bc.00007c] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]
Affiliation(s)
- O. P. Boryskina
- O. Ya. Usikov Institute for Radio Physics and Electronics, National Academy of Sciences of Ukraine
| | - M. Yu. Tkachenko
- O. Ya. Usikov Institute for Radio Physics and Electronics, National Academy of Sciences of Ukraine
| | - A. V. Shestopalova
- O. Ya. Usikov Institute for Radio Physics and Electronics, National Academy of Sciences of Ukraine
| |
Collapse
|
34
|
Friedland GD, Kortemme T. Designing ensembles in conformational and sequence space to characterize and engineer proteins. Curr Opin Struct Biol 2010; 20:377-84. [DOI: 10.1016/j.sbi.2010.02.004] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2010] [Accepted: 02/19/2010] [Indexed: 11/16/2022]
|
35
|
Lauck F, Smith CA, Friedland GF, Humphris EL, Kortemme T. RosettaBackrub--a web server for flexible backbone protein structure modeling and design. Nucleic Acids Res 2010; 38:W569-75. [PMID: 20462859 PMCID: PMC2896185 DOI: 10.1093/nar/gkq369] [Citation(s) in RCA: 95] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/02/2022] Open
Abstract
The RosettaBackrub server (http://kortemmelab.ucsf.edu/backrub) implements the Backrub method, derived from observations of alternative conformations in high-resolution protein crystal structures, for flexible backbone protein modeling. Backrub modeling is applied to three related applications using the Rosetta program for structure prediction and design: (I) modeling of structures of point mutations, (II) generating protein conformational ensembles and designing sequences consistent with these conformations and (III) predicting tolerated sequences at protein–protein interfaces. The three protocols have been validated on experimental data. Starting from a user-provided single input protein structure in PDB format, the server generates near-native conformational ensembles. The predicted conformations and sequences can be used for different applications, such as to guide mutagenesis experiments, for ensemble-docking approaches or to generate sequence libraries for protein design.
Collapse
Affiliation(s)
- Florian Lauck
- Department of Bioengineering and Therapeutic Sciences, University of California at San Francisco, 1700 4th Street, San Francisco, CA 94158, USA
| | | | | | | | | |
Collapse
|
36
|
Ashworth J, Taylor GK, Havranek JJ, Quadri SA, Stoddard BL, Baker D. Computational reprogramming of homing endonuclease specificity at multiple adjacent base pairs. Nucleic Acids Res 2010; 38:5601-8. [PMID: 20435674 PMCID: PMC2938204 DOI: 10.1093/nar/gkq283] [Citation(s) in RCA: 77] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/26/2023] Open
Abstract
Site-specific homing endonucleases are capable of inducing gene conversion via homologous recombination. Reprogramming their cleavage specificities allows the targeting of specific biological sites for gene correction or conversion. We used computational protein design to alter the cleavage specificity of I-MsoI for three contiguous base pair substitutions, resulting in an endonuclease whose activity and specificity for its new site rival that of wild-type I-MsoI for the original site. Concerted design for all simultaneous substitutions was more successful than a modular approach against individual substitutions, highlighting the importance of context-dependent redesign and optimization of protein–DNA interactions. We then used computational design based on the crystal structure of the designed complex, which revealed significant unanticipated shifts in DNA conformation, to create an endonuclease that specifically cleaves a site with four contiguous base pair substitutions. Our results demonstrate that specificity switches for multiple concerted base pair substitutions can be computationally designed, and that iteration between design and structure determination provides a route to large scale reprogramming of specificity.
Collapse
Affiliation(s)
- Justin Ashworth
- Department of Biochemistry, University of Washington, Seattle, WA 98195, USA.
| | | | | | | | | | | |
Collapse
|
37
|
Thyme SB, Jarjour J, Takeuchi R, Havranek JJ, Ashworth J, Scharenberg AM, Stoddard BL, Baker D. Exploitation of binding energy for catalysis and design. Nature 2009; 461:1300-4. [PMID: 19865174 PMCID: PMC2771326 DOI: 10.1038/nature08508] [Citation(s) in RCA: 80] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2009] [Accepted: 09/15/2009] [Indexed: 01/10/2023]
Abstract
Enzymes utilize substrate binding energy both to promote ground state association and to selectively lower the energy of the reaction transition state.i The monomeric homing endonuclease I-AniI cleaves with high sequence specificity in the center of a 20 base-pair DNA target site, with the N-terminal domain of the enzyme making extensive binding interactions with the left (−) side of the target site and the similarly structured C-terminal domain interacting with the right (+) side.ii Despite the approximate two-fold symmetry of the enzyme-DNA complex, we find that there is almost complete segregation of interactions responsible for substrate binding to the (−) side of the interface and interactions responsible for transition state stabilization to the (+) side. While single base-pair substitutions throughout the entire DNA target site reduce catalytic efficiency, mutations in the (−) DNA half-site almost exclusively increase KD and KM*, and those in the (+) half-site primarily decrease kcat*. The reduction of activity produced by mutations on the (−) side, but not mutations on the (+) side, can be suppressed by tethering the substrate to the endonuclease displayed on the surface of yeast. This dramatic asymmetry in the utilization of enzyme-substrate binding energy for catalysis has direct relevance to the redesign of endonucleases to cleave genomic target sites for gene therapy and other applications. Computationally redesigned enzymes that achieve new specificities on the (−) side do so by modulating KM*, while redesigns with altered specificities on the (+) side modulate kcat*. Our results illustrate how classical enzymology and modern protein design can each inform the other.
Collapse
Affiliation(s)
- Summer B Thyme
- Department of Biochemistry, University of Washington, Seattle, Washington 98195, USA.
| | | | | | | | | | | | | | | |
Collapse
|