1
|
Hoch SY, Netzer R, Weinstein JY, Krauss L, Hakeny K, Fleishman SJ. GGAssembler: Precise and economical design and synthesis of combinatorial mutation libraries. Protein Sci 2024; 33:e5169. [PMID: 39283039 PMCID: PMC11403590 DOI: 10.1002/pro.5169] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2024] [Revised: 08/21/2024] [Accepted: 08/26/2024] [Indexed: 09/22/2024]
Abstract
Golden Gate assembly (GGA) can seamlessly generate full-length genes from DNA fragments. In principle, GGA could be used to design combinatorial mutation libraries for protein engineering, but creating accurate, complex, and cost-effective libraries has been challenging. We present GGAssembler, a graph-theoretical method for economical design of DNA fragments that assemble a combinatorial library that encodes any desired diversity. We used GGAssembler for one-pot in vitro assembly of camelid antibody libraries comprising >105 variants with DNA costs <0.007$ per variant and dropping significantly with increased library complexity. >93% of the desired variants were present in the assembly product and >99% were represented within the expected order of magnitude as verified by deep sequencing. The GGAssembler workflow is, therefore, an accurate approach for generating complex variant libraries that may drastically reduce costs and accelerate discovery and optimization of antibodies, enzymes and other proteins. The workflow is accessible through a Google Colab notebook at https://github.com/Fleishman-Lab/GGAssembler.
Collapse
Affiliation(s)
- Shlomo Yakir Hoch
- Department of Biomolecular Sciences, Weizmann Institute of Science, Rehovot, Israel
| | - Ravit Netzer
- Department of Biomolecular Sciences, Weizmann Institute of Science, Rehovot, Israel
| | | | - Lucas Krauss
- Department of Biomolecular Sciences, Weizmann Institute of Science, Rehovot, Israel
| | - Karen Hakeny
- Department of Biomolecular Sciences, Weizmann Institute of Science, Rehovot, Israel
| | | |
Collapse
|
2
|
Yang J, Ducharme J, Johnston KE, Li FZ, Yue Y, Arnold FH. DeCOIL: Optimization of Degenerate Codon Libraries for Machine Learning-Assisted Protein Engineering. ACS Synth Biol 2023; 12:2444-2454. [PMID: 37524064 DOI: 10.1021/acssynbio.3c00301] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/02/2023]
Abstract
With advances in machine learning (ML)-assisted protein engineering, models based on data, biophysics, and natural evolution are being used to propose informed libraries of protein variants to explore. Synthesizing these libraries for experimental screens is a major bottleneck, as the cost of obtaining large numbers of exact gene sequences is often prohibitive. Degenerate codon (DC) libraries are a cost-effective alternative for generating combinatorial mutagenesis libraries where mutations are targeted to a handful of amino acid sites. However, existing computational methods to optimize DC libraries to include desired protein variants are not well suited to design libraries for ML-assisted protein engineering. To address these drawbacks, we present DEgenerate Codon Optimization for Informed Libraries (DeCOIL), a generalized method that directly optimizes DC libraries to be useful for protein engineering: to sample protein variants that are likely to have both high fitness and high diversity in the sequence search space. Using computational simulations and wet-lab experiments, we demonstrate that DeCOIL is effective across two specific case studies, with the potential to be applied to many other use cases. DeCOIL offers several advantages over existing methods, as it is direct, easy to use, generalizable, and scalable. With accompanying software (https://github.com/jsunn-y/DeCOIL), DeCOIL can be readily implemented to generate desired informed libraries.
Collapse
Affiliation(s)
- Jason Yang
- Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, California 91125, United States
| | - Julie Ducharme
- Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, California 91125, United States
| | - Kadina E Johnston
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, California 91125, United States
| | - Francesca-Zhoufan Li
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, California 91125, United States
| | - Yisong Yue
- Division of Engineering and Applied Sciences, California Institute of Technology, Pasadena, California 91125, United States
| | - Frances H Arnold
- Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, California 91125, United States
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, California 91125, United States
| |
Collapse
|
3
|
Juračka S, Hrnčířová B, Burýšková B, Georgiev D, Dvořák P. Building the SynBio community in the Czech Republic from the bottom up: You get what you give. BIOTECHNOLOGY NOTES (AMSTERDAM, NETHERLANDS) 2022; 3:124-134. [PMID: 39416447 PMCID: PMC11446354 DOI: 10.1016/j.biotno.2022.11.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/09/2022] [Revised: 11/23/2022] [Accepted: 11/23/2022] [Indexed: 10/19/2024]
Abstract
Given its highly innovative character and potential socioeconomic impact, Synthetic Biology is often ranked among prominent research areas and national research priorities in developed countries. The global evolution of this field is proceeding by leaps and bounds but its development at the level of individual states varies widely. Despite their current satisfactory economic status, the majority of 13, mostly post-communist, countries that entered the European Union family in and after 2004 (EU13) have long overlooked the blossoming of Synthetic Biology. Their prioritized lines of research have been directed elsewhere or "Synthetic Biology" did not become a widely accepted term to encompass their bioengineering and biotechnology domains. The Czech Republic is not an exception. The local SynBio mycelium already exists but is mainly built bottom-up through the activities of several academic labs, iGEM teams, and spin-off companies. In this article, we tell their individual stories and summarize the prerequisites that allowed their emergence in the Czech academic and business environment. In addition, we provide the reader with a brief overview of laboratories, research hubs, and companies that perform biotechnology and bioengineering-oriented research and that may be included in a notional "shadow SynBio community" but have not yet adopted Synthetic Biology as a unifying term for their ventures. We also map the current hindrances for a broader expansion of Synthetic Biology in the Czech Republic and suggest possible steps that should lead to the maturity of this fascinating research field in our country.
Collapse
Affiliation(s)
- Stanislav Juračka
- Department of Experimental Biology (Section of Microbiology, Microbial Bioengineering Laboratory), Faculty of Science, Masaryk University, Brno, Czech Republic
- Member of iGEM team, Brno, Czech Republic
| | - Barbora Hrnčířová
- Department of Experimental Biology (Section of Microbiology, Microbial Bioengineering Laboratory), Faculty of Science, Masaryk University, Brno, Czech Republic
- Member of iGEM team, Brno, Czech Republic
| | - Barbora Burýšková
- Department of Experimental Biology (Section of Microbiology, Microbial Bioengineering Laboratory), Faculty of Science, Masaryk University, Brno, Czech Republic
- Member of iGEM team, Brno, Czech Republic
- Former EUSynBioS Steering Committee Member, Czech Republic
| | - Daniel Georgiev
- Department of Cybernetics, Faculty of Applied Sciences, University of West Bohemia, Pilsen, Czech Republic
- Member of iGEM team, Czech Republic
| | - Pavel Dvořák
- Department of Experimental Biology (Section of Microbiology, Microbial Bioengineering Laboratory), Faculty of Science, Masaryk University, Brno, Czech Republic
- Member of iGEM team, Brno, Czech Republic
- Former EUSynBioS Steering Committee Member, Czech Republic
| |
Collapse
|
4
|
Tretyachenko V, Vymětal J, Neuwirthová T, Vondrášek J, Fujishima K, Hlouchová K. Modern and prebiotic amino acids support distinct structural profiles in proteins. Open Biol 2022; 12:220040. [PMID: 35728622 PMCID: PMC9213115 DOI: 10.1098/rsob.220040] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023] Open
Abstract
The earliest proteins had to rely on amino acids available on early Earth before the biosynthetic pathways for more complex amino acids evolved. In extant proteins, a significant fraction of the 'late' amino acids (such as Arg, Lys, His, Cys, Trp and Tyr) belong to essential catalytic and structure-stabilizing residues. How (or if) early proteins could sustain an early biosphere has been a major puzzle. Here, we analysed two combinatorial protein libraries representing proxies of the available sequence space at two different evolutionary stages. The first is composed of the entire alphabet of 20 amino acids while the second one consists of only 10 residues (ASDGLIPTEV) representing a consensus view of plausibly available amino acids through prebiotic chemistry. We show that compact conformations resistant to proteolysis are surprisingly similarly abundant in both libraries. In addition, the early alphabet proteins are inherently more soluble and refoldable, independent of the general Hsp70 chaperone activity. By contrast, chaperones significantly increase the otherwise poor solubility of the modern alphabet proteins suggesting their coevolution with the amino acid repertoire. Our work indicates that while both early and modern amino acids are predisposed to supporting protein structure, they do so with different biophysical properties and via different mechanisms.
Collapse
Affiliation(s)
- Vyacheslav Tretyachenko
- Department of Cell Biology, Faculty of Science, Charles University, Prague 12843, Czech Republic,Department of Biochemistry, Faculty of Science, Charles University, Prague 12843, Czech Republic
| | - Jiří Vymětal
- Institute of Organic Chemistry and Biochemistry, Czech Academy of Sciences, Prague 16610, Czech Republic
| | - Tereza Neuwirthová
- Department of Cell Biology, Faculty of Science, Charles University, Prague 12843, Czech Republic
| | - Jiří Vondrášek
- Institute of Organic Chemistry and Biochemistry, Czech Academy of Sciences, Prague 16610, Czech Republic
| | - Kosuke Fujishima
- Earth-Life Science Institute, Tokyo Institute of Technology, Tokyo 1528550, Japan,Graduate School of Media and Governance, Keio University, Fujisawa 2520882 Japan
| | - Klára Hlouchová
- Department of Cell Biology, Faculty of Science, Charles University, Prague 12843, Czech Republic,Institute of Organic Chemistry and Biochemistry, Czech Academy of Sciences, Prague 16610, Czech Republic
| |
Collapse
|
5
|
Jia TZ, Nishikawa S, Fujishima K. Sequencing the Origins of Life. BBA ADVANCES 2022; 2:100049. [PMID: 37082609 PMCID: PMC10074849 DOI: 10.1016/j.bbadva.2022.100049] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2021] [Revised: 02/27/2022] [Accepted: 03/02/2022] [Indexed: 01/10/2023] Open
Abstract
One goal of origins of life research is to understand how primitive informational and catalytic biopolymers emerged and evolved. Recently, a number of sequencing techniques have been applied to analysis of replicating and evolving primitive biopolymer systems, providing a sequence-specific and high-resolution view of primitive chemical processes. Here, we review application of sequencing techniques to analysis of synthetic and primitive nucleic acids and polypeptides. This includes next-generation sequencing of primitive polymerization and evolution processes, followed by discussion of other novel biochemical techniques that could contribute to sequence analysis of primitive biopolymer driven chemical systems. Further application of sequencing to origins of life research, perhaps as a life detection technology, could provide insight into the origin and evolution of informational and catalytic biopolymers on early Earth or elsewhere.
Collapse
Affiliation(s)
- Tony Z. Jia
- Earth-Life Science Institute, Tokyo Institute of Technology, 2-12-1-IE-1 Ookayama, Meguro-ku, Tokyo 152-8550, Japan
- Blue Marble Space Institute of Science, 600 1st Ave, Floor 1, Seattle, WA 98104, USA
- Corresponding author
| | - Shota Nishikawa
- Earth-Life Science Institute, Tokyo Institute of Technology, 2-12-1-IE-1 Ookayama, Meguro-ku, Tokyo 152-8550, Japan
- School of Life Science and Technology, Tokyo Institute of Technology, Yokohama, Kanagawa 226-8501, Japan
| | - Kosuke Fujishima
- Earth-Life Science Institute, Tokyo Institute of Technology, 2-12-1-IE-1 Ookayama, Meguro-ku, Tokyo 152-8550, Japan
- Graduate School of Media and Governance, Keio University, 5322 Endo, Fujisawa-shi, Kanagawa 252-0882, Japan
| |
Collapse
|