1
|
Xie X, Wang S, Chen Z, Yu Y, Hu X, Ma N, Ji M, Tian Y. Exploring DNA Computers: Advances in Storage, Cryptography and Logic Circuits. Chembiochem 2025; 26:e202400670. [PMID: 39365708 DOI: 10.1002/cbic.202400670] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2024] [Revised: 09/20/2024] [Accepted: 10/04/2024] [Indexed: 10/06/2024]
Abstract
Over the last four decades, research on DNA as a functional material has primarily focused on its predictable conformation and programmable interaction. However, its low energy consumption, high responsiveness and sensitivity also make it ideal for designing specific signaling pathways, and enabling the development of molecular computers. This review mainly discusses recent advancements in the utilization of DNA nanotechnology for molecular computer, encompassing applications in storage, cryptography and logic circuits. It elucidates the challenges encountered in the application process and presents solutions exemplified by representative works. Lastly, it delineates the challenges and opportunities within this filed.
Collapse
Affiliation(s)
- Xiaolin Xie
- College of Engineering and Applied Sciences, State Key Laboratory of Analytical Chemistry for Life Science, National Laboratory of Solid State Microstructures, Jiangsu Key Laboratory of Artificial Functional Materials, Chemistry and Biomedicine Innovation Center (ChemBIC), ChemBioMed Interdisciplinary Research Center at Nanjing University, Collaborative Innovation Center of Advanced Microstructures, Nanjing University, Nanjing, 210023, China
| | - Shuang Wang
- State Key Laboratory of Marine Food Processing & Safety Control, College of Food Science and Engineering, Ocean University of China, Qingdao, 266404, China
| | - Zhi Chen
- College of Engineering and Applied Sciences, State Key Laboratory of Analytical Chemistry for Life Science, National Laboratory of Solid State Microstructures, Jiangsu Key Laboratory of Artificial Functional Materials, Chemistry and Biomedicine Innovation Center (ChemBIC), ChemBioMed Interdisciplinary Research Center at Nanjing University, Collaborative Innovation Center of Advanced Microstructures, Nanjing University, Nanjing, 210023, China
| | - Yifan Yu
- College of Engineering and Applied Sciences, State Key Laboratory of Analytical Chemistry for Life Science, National Laboratory of Solid State Microstructures, Jiangsu Key Laboratory of Artificial Functional Materials, Chemistry and Biomedicine Innovation Center (ChemBIC), ChemBioMed Interdisciplinary Research Center at Nanjing University, Collaborative Innovation Center of Advanced Microstructures, Nanjing University, Nanjing, 210023, China
| | - Xiaoxue Hu
- College of Engineering and Applied Sciences, State Key Laboratory of Analytical Chemistry for Life Science, National Laboratory of Solid State Microstructures, Jiangsu Key Laboratory of Artificial Functional Materials, Chemistry and Biomedicine Innovation Center (ChemBIC), ChemBioMed Interdisciplinary Research Center at Nanjing University, Collaborative Innovation Center of Advanced Microstructures, Nanjing University, Nanjing, 210023, China
| | - Ningning Ma
- College of Engineering and Applied Sciences, State Key Laboratory of Analytical Chemistry for Life Science, National Laboratory of Solid State Microstructures, Jiangsu Key Laboratory of Artificial Functional Materials, Chemistry and Biomedicine Innovation Center (ChemBIC), ChemBioMed Interdisciplinary Research Center at Nanjing University, Collaborative Innovation Center of Advanced Microstructures, Nanjing University, Nanjing, 210023, China
| | - Min Ji
- College of Engineering and Applied Sciences, State Key Laboratory of Analytical Chemistry for Life Science, National Laboratory of Solid State Microstructures, Jiangsu Key Laboratory of Artificial Functional Materials, Chemistry and Biomedicine Innovation Center (ChemBIC), ChemBioMed Interdisciplinary Research Center at Nanjing University, Collaborative Innovation Center of Advanced Microstructures, Nanjing University, Nanjing, 210023, China
| | - Ye Tian
- College of Engineering and Applied Sciences, State Key Laboratory of Analytical Chemistry for Life Science, National Laboratory of Solid State Microstructures, Jiangsu Key Laboratory of Artificial Functional Materials, Chemistry and Biomedicine Innovation Center (ChemBIC), ChemBioMed Interdisciplinary Research Center at Nanjing University, Collaborative Innovation Center of Advanced Microstructures, Nanjing University, Nanjing, 210023, China
| |
Collapse
|
2
|
Yang S, Bögels BWA, Wang F, Xu C, Dou H, Mann S, Fan C, de Greef TFA. DNA as a universal chemical substrate for computing and data storage. Nat Rev Chem 2024; 8:179-194. [PMID: 38337008 DOI: 10.1038/s41570-024-00576-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 01/10/2024] [Indexed: 02/12/2024]
Abstract
DNA computing and DNA data storage are emerging fields that are unlocking new possibilities in information technology and diagnostics. These approaches use DNA molecules as a computing substrate or a storage medium, offering nanoscale compactness and operation in unconventional media (including aqueous solutions, water-in-oil microemulsions and self-assembled membranized compartments) for applications beyond traditional silicon-based computing systems. To build a functional DNA computer that can process and store molecular information necessitates the continued development of strategies for computing and data storage, as well as bridging the gap between these fields. In this Review, we explore how DNA can be leveraged in the context of DNA computing with a focus on neural networks and compartmentalized DNA circuits. We also discuss emerging approaches to the storage of data in DNA and associated topics such as the writing, reading, retrieval and post-synthesis editing of DNA-encoded data. Finally, we provide insights into how DNA computing can be integrated with DNA data storage and explore the use of DNA for near-memory computing for future information technology and health analysis applications.
Collapse
Affiliation(s)
- Shuo Yang
- State Key Laboratory of Metal Matrix Composites, School of Materials Science and Engineering, Shanghai Jiao Tong University, Shanghai, China
- Zhangjiang Institute for Advanced Study (ZIAS), Shanghai Jiao Tong University, Shanghai, China
| | - Bas W A Bögels
- Laboratory of Chemical Biology, Department of Biomedical Engineering, Eindhoven University of Technology, Eindhoven, The Netherlands
- Institute for Complex Molecular Systems (ICMS), Eindhoven University of Technology, Eindhoven, The Netherlands
- Computational Biology Group, Department of Biomedical Engineering, Eindhoven University of Technology, Eindhoven, The Netherlands
| | - Fei Wang
- School of Chemistry and Chemical Engineering, New Cornerstone Science Laboratory, Frontiers Science Center for Transformative Molecules and National Center for Translational Medicine, Shanghai Jiao Tong University, Shanghai, China
| | - Can Xu
- State Key Laboratory of Metal Matrix Composites, School of Materials Science and Engineering, Shanghai Jiao Tong University, Shanghai, China
- Zhangjiang Institute for Advanced Study (ZIAS), Shanghai Jiao Tong University, Shanghai, China
| | - Hongjing Dou
- State Key Laboratory of Metal Matrix Composites, School of Materials Science and Engineering, Shanghai Jiao Tong University, Shanghai, China
- Zhangjiang Institute for Advanced Study (ZIAS), Shanghai Jiao Tong University, Shanghai, China
| | - Stephen Mann
- State Key Laboratory of Metal Matrix Composites, School of Materials Science and Engineering, Shanghai Jiao Tong University, Shanghai, China.
- Zhangjiang Institute for Advanced Study (ZIAS), Shanghai Jiao Tong University, Shanghai, China.
- Centre for Protolife Research and Centre for Organized Matter Chemistry, School of Chemistry, University of Bristol, Bristol, UK.
- Max Planck-Bristol Centre for Minimal Biology, School of Chemistry, University of Bristol, Bristol, UK.
| | - Chunhai Fan
- School of Chemistry and Chemical Engineering, New Cornerstone Science Laboratory, Frontiers Science Center for Transformative Molecules and National Center for Translational Medicine, Shanghai Jiao Tong University, Shanghai, China.
- Institute of Molecular Medicine, Shanghai Key Laboratory for Nucleic Acids Chemistry and Nanomedicine, Renji Hospital, School of Medicine, Shanghai Jiao Tong University, Shanghai, China.
| | - Tom F A de Greef
- Laboratory of Chemical Biology, Department of Biomedical Engineering, Eindhoven University of Technology, Eindhoven, The Netherlands.
- Institute for Complex Molecular Systems (ICMS), Eindhoven University of Technology, Eindhoven, The Netherlands.
- Computational Biology Group, Department of Biomedical Engineering, Eindhoven University of Technology, Eindhoven, The Netherlands.
- Institute for Molecules and Materials, Radboud University, Nijmegen, The Netherlands.
- Center for Living Technologies, Eindhoven-Wageningen-Utrecht Alliance, Utrecht, The Netherlands.
| |
Collapse
|
3
|
Li Y, Abraham C, Suslov O, Yaren O, Shaw RW, Kim MJ, Wan S, Marliere P, Benner SA. Synthetic Biology Pathway to Nucleoside Triphosphates for Expanded Genetic Alphabets. ACS Synth Biol 2023; 12:1772-1781. [PMID: 37227319 PMCID: PMC10911313 DOI: 10.1021/acssynbio.3c00060] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/26/2023]
Abstract
One horizon in synthetic biology seeks alternative forms of DNA that store, transcribe, and support the evolution of biological information. Here, hydrogen bond donor and acceptor groups are rearranged within a Watson-Crick geometry to get 12 nucleotides that form 6 independently replicating pairs. Such artificially expanded genetic information systems (AEGIS) support Darwinian evolution in vitro. To move AEGIS into living cells, metabolic pathways are next required to make AEGIS triphosphates economically from their nucleosides, eliminating the need to feed these expensive compounds in growth media. We report that "polyphosphate kinases" can be recruited for such pathways, working with natural diphosphate kinases and engineered nucleoside kinases. This pathway in vitro makes AEGIS triphosphates, including third-generation triphosphates having improved ability to survive in living bacterial cells. In α-32P-labeled forms, produced here for the first time, they were used to study DNA polymerases, finding cases where third-generation AEGIS triphosphates perform better with natural enzymes than second-generation AEGIS triphosphates.
Collapse
Affiliation(s)
- Yubing Li
- Foundation for Applied Molecular Evolution, 13709 Progress Blvd., Alachua, Florida 32615 United States
| | - Clay Abraham
- Foundation for Applied Molecular Evolution, 13709 Progress Blvd., Alachua, Florida 32615 United States
| | - Oleg Suslov
- Foundation for Applied Molecular Evolution, 13709 Progress Blvd., Alachua, Florida 32615 United States
| | - Ozlem Yaren
- Foundation for Applied Molecular Evolution, 13709 Progress Blvd., Alachua, Florida 32615 United States
| | - Ryan W. Shaw
- Foundation for Applied Molecular Evolution, 13709 Progress Blvd., Alachua, Florida 32615 United States
| | - Myong-Jung Kim
- Firebird Biomolecular Sciences LLC, 13709 Progress Blvd., Alachua, Florida 32615 United States
| | - Shuo Wan
- Foundation for Applied Molecular Evolution, 13709 Progress Blvd., Alachua, Florida 32615 United States
| | - Philippe Marliere
- Institute of Systems & Synthetic Biology, Génopole, 5 rue Desbruères, 91030 Evry Cedex France
| | - Steven A. Benner
- Foundation for Applied Molecular Evolution, 13709 Progress Blvd., Alachua, Florida 32615 United States
- Firebird Biomolecular Sciences LLC, 13709 Progress Blvd., Alachua, Florida 32615 United States
| |
Collapse
|
4
|
Lau B, Chandak S, Roy S, Tatwawadi K, Wootters M, Weissman T, Ji HP. Magnetic DNA random access memory with nanopore readouts and exponentially-scaled combinatorial addressing. Sci Rep 2023; 13:8514. [PMID: 37231057 PMCID: PMC10213054 DOI: 10.1038/s41598-023-29575-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2022] [Accepted: 02/07/2023] [Indexed: 05/27/2023] Open
Abstract
The storage of data in DNA typically involves encoding and synthesizing data into short oligonucleotides, followed by reading with a sequencing instrument. Major challenges include the molecular consumption of synthesized DNA, basecalling errors, and limitations with scaling up read operations for individual data elements. Addressing these challenges, we describe a DNA storage system called MDRAM (Magnetic DNA-based Random Access Memory) that enables repetitive and efficient readouts of targeted files with nanopore-based sequencing. By conjugating synthesized DNA to magnetic agarose beads, we enabled repeated data readouts while preserving the original DNA analyte and maintaining data readout quality. MDRAM utilizes an efficient convolutional coding scheme that leverages soft information in raw nanopore sequencing signals to achieve information reading costs comparable to Illumina sequencing despite higher error rates. Finally, we demonstrate a proof-of-concept DNA-based proto-filesystem that enables an exponentially-scalable data address space using only small numbers of targeting primers for assembly and readout.
Collapse
Affiliation(s)
- Billy Lau
- Division of Oncology, Department of Medicine, Stanford University School of Medicine, Stanford, CA, 94305, USA
- Stanford Genome Technology Center, Stanford University, Palo Alto, CA, 94304, USA
| | - Shubham Chandak
- Department of Electrical Engineering, Stanford University, Stanford, CA, 94305, USA
| | - Sharmili Roy
- Division of Oncology, Department of Medicine, Stanford University School of Medicine, Stanford, CA, 94305, USA
| | - Kedar Tatwawadi
- Department of Electrical Engineering, Stanford University, Stanford, CA, 94305, USA
| | - Mary Wootters
- Department of Electrical Engineering, Stanford University, Stanford, CA, 94305, USA
| | - Tsachy Weissman
- Department of Electrical Engineering, Stanford University, Stanford, CA, 94305, USA.
| | - Hanlee P Ji
- Division of Oncology, Department of Medicine, Stanford University School of Medicine, Stanford, CA, 94305, USA.
- Stanford Genome Technology Center, Stanford University, Palo Alto, CA, 94304, USA.
| |
Collapse
|
5
|
Talbot H, Halvorsen K, Chandrasekaran AR. Encoding, Decoding, and Rendering Information in DNA Nanoswitch Libraries. ACS Synth Biol 2023; 12:978-983. [PMID: 36541933 PMCID: PMC10121895 DOI: 10.1021/acssynbio.2c00649] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Abstract
DNA-based construction allows the creation of molecular devices that are useful in information storage and processing. Here, we combine the programmability of DNA nanoswitches and stimuli-responsive conformational changes to demonstrate information encoding and graphical readout using gel electrophoresis. We encoded information as 5-bit binary codes for alphanumeric characters using a combination of DNA and RNA inputs that can be decoded using molecular stimuli such as a ribonuclease. We also show that a similar strategy can be used for graphical visual readout of alphabets on an agarose gel, information that is encoded by nucleic acids and decoded by a ribonuclease. Our method of information encoding and processing could be combined with DNA actuation for molecular computation and diagnostics that require a nonarbitrary visual readout.
Collapse
Affiliation(s)
- Hannah Talbot
- The RNA Institute, University at Albany, State University of New York, Albany, New York 12203, United States
| | - Ken Halvorsen
- The RNA Institute, University at Albany, State University of New York, Albany, New York 12203, United States
| | - Arun Richard Chandrasekaran
- The RNA Institute, University at Albany, State University of New York, Albany, New York 12203, United States
| |
Collapse
|
6
|
Ezekannagha C, Welzel M, Heider D, Hattab G. DNAsmart: Multiple attribute ranking tool for DNA data storage systems. Comput Struct Biotechnol J 2023; 21:1448-1460. [PMID: 36851917 PMCID: PMC9957737 DOI: 10.1016/j.csbj.2023.02.016] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2022] [Revised: 02/07/2023] [Accepted: 02/07/2023] [Indexed: 02/12/2023] Open
Abstract
In an ever-growing need for data storage capacity, the Deoxyribonucleic Acid (DNA) molecule gains traction as a new storage medium with a larger capacity, higher density, and a longer lifespan over conventional storage media. To effectively use DNA for data storage, it is important to understand the different methods of encoding information in DNA and compare their effectiveness. This requires evaluating which decoded DNA sequences carry the most encoded information based on various attributes. However, navigating the field of coding theory requires years of experience and domain expertise. For instance, domain experts rely on various mathematical functions and attributes to score and evaluate their encodings. To enable such analytical tasks, we provide an interactive and visual analytical framework for multi-attribute ranking in DNA storage systems. Our framework follows a three-step view with user-settable parameters. It enables users to find the optimal en-/de-coding approaches by setting different weights and combining multiple attributes. We assess the validity of our work through a task-specific user study on domain experts by relying on three tasks. Results indicate that all participants completed their tasks successfully under two minutes, then rated the framework for design choices, perceived usefulness, and intuitiveness. In addition, two real-world use cases are shared and analyzed as direct applications of the proposed tool. DNAsmart enables the ranking of decoded sequences based on multiple attributes. In sum, this work unveils the evaluation of en-/de-coding approaches accessible and tractable through visualization and interactivity to solve comparison and ranking tasks.
Collapse
Affiliation(s)
- Chisom Ezekannagha
- Department of Mathematics and Computer Science, Philipps-Universität, Hans-Meerwein-Str. 6, Marburg D-35043, Germany
- Center for Synthetic Microbiology (SYNMIKRO), Philipps-Universität Marburg, Karl-von-Frisch-Str. 14, Marburg D-35043, Germany
| | - Marius Welzel
- Department of Mathematics and Computer Science, Philipps-Universität, Hans-Meerwein-Str. 6, Marburg D-35043, Germany
- Center for Synthetic Microbiology (SYNMIKRO), Philipps-Universität Marburg, Karl-von-Frisch-Str. 14, Marburg D-35043, Germany
| | - Dominik Heider
- Department of Mathematics and Computer Science, Philipps-Universität, Hans-Meerwein-Str. 6, Marburg D-35043, Germany
- Center for Synthetic Microbiology (SYNMIKRO), Philipps-Universität Marburg, Karl-von-Frisch-Str. 14, Marburg D-35043, Germany
| | - Georges Hattab
- Department of Mathematics and Computer Science, Philipps-Universität, Hans-Meerwein-Str. 6, Marburg D-35043, Germany
- Center for Synthetic Microbiology (SYNMIKRO), Philipps-Universität Marburg, Karl-von-Frisch-Str. 14, Marburg D-35043, Germany
| |
Collapse
|
7
|
Yin Q, Zheng Y, Wang B, Zhang Q. Design of Constraint Coding Sets for Archive DNA Storage. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2022; 19:3384-3394. [PMID: 34762590 DOI: 10.1109/tcbb.2021.3127271] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
With the advent of the era of massive data, the increase of storage demand has far exceeded current storage capacity. DNA molecules provide a reliable solution for big data storage by virtue of their large capacity, high density, and long-term stability. To reduce errors in storing procedures, constructing a sufficient set of constraint encoding is critical for achieving DNA storage. A new version of the Marine Predator algorithm (called QRSS-MPA) is proposed in this paper to increase the lower bound of the coding set while satisfying the specific combination of constraints. In order to demonstrate the effectiveness of the improvement, the classical CEC-05 test function is used to test and compare the mean, variance, scalability, and significance. In terms of storage, the lower bound of construction is compared with previous works, and the result is found to be significantly improved. In order to prevent the emergence of a secondary structure that leads to sequencing failure, we give a more stringent lower bound for the constraint coding set, which is of great significance for reducing the error rate of DNA storage amidst its rapid development.
Collapse
|
8
|
Ren Y, Zhang Y, Liu Y, Wu Q, Su J, Wang F, Chen D, Fan C, Liu K, Zhang H. DNA-Based Concatenated Encoding System for High-Reliability and High-Density Data Storage. SMALL METHODS 2022; 6:e2101335. [PMID: 35146964 DOI: 10.1002/smtd.202101335] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/25/2021] [Revised: 01/05/2022] [Indexed: 05/25/2023]
Abstract
Information storage based on DNA molecules provides a promising solution with advantages of low-energy consumption, high storage efficiency, and long lifespan. However, there are only four natural nucleotides and DNA storage is thus limited by 2 bits per nucleotide. Here, artificial nucleotides into DNA data storage to achieve higher coding efficiency than 2 bits per nucleotide is introduced. To accommodate the characteristics of DNA synthesis and sequencing, two high-reliability encoding systems suitable for four, six, and eight nucleotides, i.e., the RaptorQ-Arithmetic-LZW-RS (RALR) and RaptorQ-Arithmetic-Base64-RS (RABR) systems, are developed. The two concatenated encoding systems realize the advantages of correcting DNA sequence losses, correcting errors within DNA sequences, reducing homopolymers, and controlling specific nucleotide contents. The average coding efficiencies with error correction and without arithmetic compression by the RALR system using four, six, and eight nucleotides reach 1.27, 1.61, and 1.85 bits per nucleotide, respectively. While the average coding efficiencies by the RABR system are up to 1.50, 2.00, and 2.35 bits per nucleotide, respectively. The coding efficiency, versatility, and tunability of the developed artificial DNA systems might provide significant guidance for high-reliability and high-density data storage.
Collapse
Affiliation(s)
- Yubin Ren
- Department of Chemistry, Tsinghua University, Beijing, 100084, China
| | - Yi Zhang
- State Key Laboratory of Rare Earth Resource Utilization, Changchun Institute of Applied Chemistry, Chinese Academy of Sciences, Changchun, Jilin, 130022, China
| | - Yawei Liu
- State Key Laboratory of Rare Earth Resource Utilization, Changchun Institute of Applied Chemistry, Chinese Academy of Sciences, Changchun, Jilin, 130022, China
| | - Qinglin Wu
- Institute of Process Equipment, College of Energy Engineering and State Key Laboratory of Fluid Power and Mechatronic Systems, Zhejiang University, Hangzhou, Zhejiang, 310027, China
| | - Juanjuan Su
- College of Materials Science and Opto-Electronic Technology, University of Chinese Academy of Sciences, Beijing, 100049, China
| | - Fan Wang
- State Key Laboratory of Rare Earth Resource Utilization, Changchun Institute of Applied Chemistry, Chinese Academy of Sciences, Changchun, Jilin, 130022, China
| | - Dong Chen
- Institute of Process Equipment, College of Energy Engineering and State Key Laboratory of Fluid Power and Mechatronic Systems, Zhejiang University, Hangzhou, Zhejiang, 310027, China
| | - Chunhai Fan
- Frontiers Science Center for Transformative Molecules, School of Chemistry and Chemical Engineering, and Institute of Molecular Medicine, Renji Hospital, School of Medicine, Shanghai Jiao Tong University, Shanghai, 200240, China
| | - Kai Liu
- Department of Chemistry, Tsinghua University, Beijing, 100084, China
| | - Hongjie Zhang
- Department of Chemistry, Tsinghua University, Beijing, 100084, China
| |
Collapse
|
9
|
Barbosa N, Sagresti L, Brancato G. Photoinduced azobenzene-modified DNA dehybridization: insights into local and cooperativity effects from a molecular dynamics study. Phys Chem Chem Phys 2021; 23:25170-25179. [PMID: 34730143 DOI: 10.1039/d1cp04032d] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
Photoresponsive azobenzene-modified DNA (RNA) has become a very fruitful material for nanotechnology due to the capability of switching on and off hybridization (i.e., duplex formation) in smart nanostructures. This nanomaterial exploits the well-known azobenzene trans/cis photo-isomerization. In fact, it has been found that DNA tethered with trans-azobenzene shows normal nucleic acid recognition and hybridization, while the cis form destabilizes the duplex configuration, eventually leading to DNA unzipping. However, while the working principle of the light-triggered DNA dehybridization is apparent, specific details of this mechanism still remain elusive to experiments. Previous in silico studies successfully addressed some aspects (e.g., local structural effects, thermal stability, and early events of azobenzene photoisomerization) of this challenging molecular process characterized by timescales spanning several orders of magnitude, from picoseconds (i.e., azobenzene photoisomerization) to micro- and milli-seconds (i.e., complete strand detachment). In this work, inspired by a recent report by Asanuma and coworkers, we focus on the local and cooperativity effects played by multiple azobenzene units on a 10-mer azobenzene-modified DNA duplex. Using molecular dynamics (MD) simulations, we investigated nine systems equipped with a variable number (from 1 to 7) of photoswitch units and different configurations, focusing our analysis on the initial events (from few ps to hundreds of ns) characterizing DNA destabilization upon trans-to-cis isomerization, such as hydrogen bonding breakage and base pair misalignment. Results highlight, on one hand, the local effects of single azobenzene units on DNA duplex structure and, on the other hand, the cooperative role that multiple photoswitches show in enhancing and accelerating DNA dehybridization following trans-to-cis conversion, in agreement with previously reported data and observations.
Collapse
Affiliation(s)
- Nuno Barbosa
- Scuola Normale Superiore and CSGI, Piazza dei Cavalieri 7, I-56126 Pisa, Italy.
| | - Luca Sagresti
- Scuola Normale Superiore and CSGI, Piazza dei Cavalieri 7, I-56126 Pisa, Italy. .,Istituto Nazionale di Fisica Nucleare, Largo Pontecorvo 3, I-56100 Pisa, Italy
| | - Giuseppe Brancato
- Scuola Normale Superiore and CSGI, Piazza dei Cavalieri 7, I-56126 Pisa, Italy. .,Istituto Nazionale di Fisica Nucleare, Largo Pontecorvo 3, I-56100 Pisa, Italy
| |
Collapse
|
10
|
Anžel A, Heider D, Hattab G. The visual story of data storage: From storage properties to user interfaces. Comput Struct Biotechnol J 2021; 19:4904-4918. [PMID: 34527195 PMCID: PMC8430386 DOI: 10.1016/j.csbj.2021.08.031] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2021] [Revised: 08/19/2021] [Accepted: 08/19/2021] [Indexed: 12/15/2022] Open
Abstract
About fifty times more data has been created than there are stars in the observable universe. Current trends in data creation and consumption mean that the devices and storage media we use will require more physical space. Novel data storage media such as DNA are considered a viable alternative. Yet, the introduction of new storage technologies should be accompanied by an evaluation of user requirements. To assess such needs, we designed and conducted a survey to rank different storage properties adapted for visualization. That is, accessibility, capacity, usage, mutability, lifespan, addressability, and typology. Withal, we reported different storage devices over time while ranking them by their properties. Our results indicated a timeline of three distinct periods: magnetic, optical and electronic, and alternative media. Moreover, by investigating user interfaces across different operating systems, we observed a predominant presence of bar charts and tree maps for the usage of a medium and its file directory hierarchy, respectively. Taken together with the results of our survey, this allowed us to create a customized user interface that includes data visualizations that can be toggled for both user groups: Experts and Public.
Collapse
Affiliation(s)
- Aleksandar Anžel
- University of Marburg, Department of Mathematics and Computer Science, Marburg 35043, Germany
| | - Dominik Heider
- University of Marburg, Department of Mathematics and Computer Science, Marburg 35043, Germany
| | - Georges Hattab
- University of Marburg, Department of Mathematics and Computer Science, Marburg 35043, Germany
| |
Collapse
|
11
|
Tomek KJ, Volkel K, Indermaur EW, Tuck JM, Keung AJ. Promiscuous molecules for smarter file operations in DNA-based data storage. Nat Commun 2021; 12:3518. [PMID: 34112775 PMCID: PMC8192770 DOI: 10.1038/s41467-021-23669-w] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2021] [Accepted: 05/12/2021] [Indexed: 11/22/2022] Open
Abstract
DNA holds significant promise as a data storage medium due to its density, longevity, and resource and energy conservation. These advantages arise from the inherent biomolecular structure of DNA which differentiates it from conventional storage media. The unique molecular architecture of DNA storage also prompts important discussions on how data should be organized, accessed, and manipulated and what practical functionalities may be possible. Here we leverage thermodynamic tuning of biomolecular interactions to implement useful data access and organizational features. Specific sets of environmental conditions including distinct DNA concentrations and temperatures were screened for their ability to switchably access either all DNA strands encoding full image files from a GB-sized background database or subsets of those strands encoding low resolution, File Preview, versions. We demonstrate File Preview with four JPEG images and provide an argument for the substantial and practical economic benefit of this generalizable strategy to organize data.
Collapse
Affiliation(s)
- Kyle J Tomek
- Department of Chemical and Biomolecular Engineering, North Carolina State University, Raleigh, NC, USA
| | - Kevin Volkel
- Department of Electrical and Computer Engineering, North Carolina State University, Raleigh, NC, USA
| | - Elaine W Indermaur
- Department of Chemical and Biomolecular Engineering, North Carolina State University, Raleigh, NC, USA
| | - James M Tuck
- Department of Electrical and Computer Engineering, North Carolina State University, Raleigh, NC, USA.
| | - Albert J Keung
- Department of Chemical and Biomolecular Engineering, North Carolina State University, Raleigh, NC, USA.
| |
Collapse
|
12
|
Yoo E, Choe D, Shin J, Cho S, Cho BK. Mini review: Enzyme-based DNA synthesis and selective retrieval for data storage. Comput Struct Biotechnol J 2021; 19:2468-2476. [PMID: 34025937 PMCID: PMC8113751 DOI: 10.1016/j.csbj.2021.04.057] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2021] [Revised: 04/20/2021] [Accepted: 04/22/2021] [Indexed: 11/26/2022] Open
Abstract
The market for using and storing digital data is growing, with DNA synthesis emerging as an efficient way to store massive amounts of data. Storing information in DNA mainly consists of two steps: data writing and reading. The writing step requires encoding data in DNA, building one nucleotide at a time as a form of single-stranded DNA (ssDNA). Once the data needs to be read, the target DNA is selectively retrieved and sequenced, which will also be in the form of an ssDNA. Recently, enzyme-based DNA synthesis is emerging as a new method to be a breakthrough on behalf of decades-old chemical synthesis. A few enzymatic methods have been presented for data memory, including the use of terminal deoxynucleotidyl transferase. Besides, enzyme-based amplification or denaturation of the target strand into ssDNA provides selective access to the desired dataset. In this review, we summarize diverse enzymatic methods for either synthesizing ssDNA or retrieving the data-containing DNA.
Collapse
Affiliation(s)
- Eojin Yoo
- Department of Biological Sciences, Korea Advanced Institute of Science and Technology, Daejeon 34141, Republic of Korea
| | - Donghui Choe
- Department of Biological Sciences, Korea Advanced Institute of Science and Technology, Daejeon 34141, Republic of Korea
| | - Jongoh Shin
- Department of Biological Sciences, Korea Advanced Institute of Science and Technology, Daejeon 34141, Republic of Korea
| | - Suhyung Cho
- Department of Biological Sciences, Korea Advanced Institute of Science and Technology, Daejeon 34141, Republic of Korea.,Innovative Biomaterials Research Center, KI for the BioCentury, Korea Advanced Institute of Science and Technology, Daejeon 34141, Republic of Korea
| | - Byung-Kwan Cho
- Department of Biological Sciences, Korea Advanced Institute of Science and Technology, Daejeon 34141, Republic of Korea.,Innovative Biomaterials Research Center, KI for the BioCentury, Korea Advanced Institute of Science and Technology, Daejeon 34141, Republic of Korea
| |
Collapse
|
13
|
Yim SS, McBee RM, Song AM, Huang Y, Sheth RU, Wang HH. Robust direct digital-to-biological data storage in living cells. Nat Chem Biol 2021; 17:246-253. [PMID: 33432236 PMCID: PMC7904632 DOI: 10.1038/s41589-020-00711-4] [Citation(s) in RCA: 46] [Impact Index Per Article: 11.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2020] [Revised: 10/30/2020] [Accepted: 11/12/2020] [Indexed: 02/06/2023]
Abstract
DNA has been the predominant information storage medium for biology and holds great promise as a next-generation high-density data medium in the digital era. Currently, the vast majority of DNA-based data storage approaches rely on in vitro DNA synthesis. As such, there are limited methods to encode digital data into the chromosomes of living cells in a single step. Here, we describe a new electrogenetic framework for direct storage of digital data in living cells. Using an engineered redox-responsive CRISPR adaptation system, we encoded binary data in 3-bit units into CRISPR arrays of bacterial cells by electrical stimulation. We demonstrate multiplex data encoding into barcoded cell populations to yield meaningful information storage and capacity up to 72 bits, which can be maintained over many generations in natural open environments. This work establishes a direct digital-to-biological data storage framework and advances our capacity for information exchange between silicon- and carbon-based entities.
Collapse
Affiliation(s)
- Sung Sun Yim
- Department of Systems Biology, Columbia University, New York, NY, USA
| | - Ross M McBee
- Department of Systems Biology, Columbia University, New York, NY, USA
- Department of Biological Sciences, Columbia University, New York, NY, USA
| | - Alan M Song
- Department of Systems Biology, Columbia University, New York, NY, USA
| | - Yiming Huang
- Department of Systems Biology, Columbia University, New York, NY, USA
- Integrated Program in Cellular, Molecular, and Biomedical Studies, Columbia University, New York, NY, USA
| | - Ravi U Sheth
- Department of Systems Biology, Columbia University, New York, NY, USA
- Integrated Program in Cellular, Molecular, and Biomedical Studies, Columbia University, New York, NY, USA
| | - Harris H Wang
- Department of Systems Biology, Columbia University, New York, NY, USA.
- Department of Pathology and Cell Biology, Columbia University, New York, NY, USA.
| |
Collapse
|
14
|
New insight into G-quadruplexes; diagnosis application in cancer. Anal Biochem 2021; 620:114149. [PMID: 33636157 DOI: 10.1016/j.ab.2021.114149] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2020] [Revised: 02/01/2021] [Accepted: 02/18/2021] [Indexed: 02/06/2023]
Abstract
Biochemical properties and flexibility of nitrogenous bases allow DNA to fold into higher-order structures. Among different DNA secondary structure, G-quadruplexes (tetrapelexes-G4) - which are formed in guanine rich sequences - have gained more attention because of their biological significance, therapeutic intervention, and application in molecular device and biosensor. G4-quadruplex studies categorize into three main fields, in vivo, in vitro, and in silico. The in vitro field includes G4 synthetic oligonucleotides. This review focuses on the G-quadruplex synthetic aptamers structure features and considers the applicability of G4-aptamers for cancer biomarkers detection. Various biosensing methods will be reviewed based on G-quadruplex aptamers for cancer detection.
Collapse
|
15
|
Peng W, Cui S, Song C. One-time-pad cipher algorithm based on confusion mapping and DNA storage technology. PLoS One 2021; 16:e0245506. [PMID: 33471849 PMCID: PMC7817086 DOI: 10.1371/journal.pone.0245506] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2020] [Accepted: 12/31/2020] [Indexed: 11/21/2022] Open
Abstract
In order to solve the problems of low computational security in the encoding mapping and difficulty in practical operation of biological experiments in DNA-based one-time-pad cryptography, we proposed a one-time-pad cipher algorithm based on confusion mapping and DNA storage technology. In our constructed algorithm, the confusion mapping methods such as chaos map, encoding mapping, confusion encoding table and simulating biological operation process are used to increase the key space. Among them, the encoding mapping and the confusion encoding table provide the realization conditions for the transition of data and biological information. By selecting security parameters and confounding parameters, the algorithm realizes a more random dynamic encryption and decryption process than similar algorithms. In addition, the use of DNA storage technologies including DNA synthesis and high-throughput sequencing ensures a viable biological encryption process. Theoretical analysis and simulation experiments show that the algorithm provides both mathematical and biological security, which not only has the difficult advantage of cracking DNA biological experiments, but also provides relatively high computational security.
Collapse
Affiliation(s)
- Weiping Peng
- School of Computer Science and Technology, Henan Polytechnic University, Jiaozuo, Henan, China
| | - Shuang Cui
- School of Computer Science and Technology, Henan Polytechnic University, Jiaozuo, Henan, China
- * E-mail:
| | - Cheng Song
- School of Computer Science and Technology, Henan Polytechnic University, Jiaozuo, Henan, China
| |
Collapse
|
16
|
Wang C, Liu H, Wang H, Tao J, Yang T, Chen H, An R, Wang J, Huang N, Gong X, Song Z, Komiyama M, Liang X. Robust Storage of Chinese Language in a Pool of Small Single-Stranded DNA Rings and Its Facile Reading-Out. BULLETIN OF THE CHEMICAL SOCIETY OF JAPAN 2021. [DOI: 10.1246/bcsj.20200201] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Affiliation(s)
- Chenru Wang
- College of Food Science and Engineering, Ocean University of China, Qingdao 266003, P. R. China
| | - Hongfang Liu
- College of Food Science and Engineering, Ocean University of China, Qingdao 266003, P. R. China
| | - Hongyu Wang
- College of Food Science and Engineering, Ocean University of China, Qingdao 266003, P. R. China
| | - Jiaojiao Tao
- College of Food Science and Engineering, Ocean University of China, Qingdao 266003, P. R. China
| | - Taiwei Yang
- College of Food Science and Engineering, Ocean University of China, Qingdao 266003, P. R. China
| | - Hui Chen
- College of Food Science and Engineering, Ocean University of China, Qingdao 266003, P. R. China
| | - Ran An
- College of Food Science and Engineering, Ocean University of China, Qingdao 266003, P. R. China
- Laboratory for Marine Drugs and Bioproducts, Qingdao National Laboratory for Marine Science and Technology, Qingdao 266235, P. R. China
| | - Jing Wang
- College of Food Science and Engineering, Ocean University of China, Qingdao 266003, P. R. China
| | - Ning Huang
- Globt Institute for Biotechnology Research, Qingdao 266109, P. R. China
| | - Xiangyu Gong
- Globt Institute for Biotechnology Research, Qingdao 266109, P. R. China
| | - Zhihao Song
- Globt Institute for Biotechnology Research, Qingdao 266109, P. R. China
| | - Makoto Komiyama
- College of Food Science and Engineering, Ocean University of China, Qingdao 266003, P. R. China
| | - Xingguo Liang
- College of Food Science and Engineering, Ocean University of China, Qingdao 266003, P. R. China
- Laboratory for Marine Drugs and Bioproducts, Qingdao National Laboratory for Marine Science and Technology, Qingdao 266235, P. R. China
| |
Collapse
|
17
|
Wang Y, Noor-A-Rahim M, Zhang J, Gunawan E, Guan YL, Poh CL. Oligo Design with Single Primer Binding Site for High Capacity DNA-Based Data Storage. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2020; 17:2176-2182. [PMID: 31514147 DOI: 10.1109/tcbb.2019.2940177] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
DNA has become an attractive medium for long-term data archiving due to its extremely high storage density and longevity. Short single-stranded DNAs, called oligonucleotides (oligos), have been designed and synthesized to store digital data. Previous works designed the oligos with a pair of primer binding sites (PBSs) (each with a length of around 200) attached at the two ends of each basic readable data block. The addition of PBSs decreases the data density significantly because in the current DNA synthesis, the maximum length of a synthesized oligo in good quality is around 200. Furthermore, the maximum homopolymer run allowed by the existing experiments has been reported to be three nucleotides. In this work, to increase the data density, we have devised and tested an oligo design for DNA-based storage with the basic readable data block appended by a single PBS at one end only, while allowing the maximum homopolymer run to be increased to 4. We also present an oligo assembly algorithm that can reconstruct oligos with a single PBS from the error-prone raw readouts obtained from the sequencing process. We have conducted a wet lab experiment to validate the proposed design, where we tested with 398KB of data stored into 10,750 oligos. The experimental results show that it is possible to recover over 99 percent of the oligo sequences without error, which proves that one PBS is sufficient for implementing a DNA-based data storage system with maximum homopolymer run relaxed to 4. The use of single PBS leads to a significant data density gain from 14.3 to 140.2 percent over the existing short-strand DNA data storage schemes by reserving more nucleotides for storing information bits.
Collapse
|
18
|
Recent Advances in Noninvasive Biosensors for Forensics, Biometrics, and Cybersecurity. SENSORS 2020; 20:s20215974. [PMID: 33105602 PMCID: PMC7659947 DOI: 10.3390/s20215974] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/27/2020] [Revised: 10/17/2020] [Accepted: 10/20/2020] [Indexed: 02/07/2023]
Abstract
Recently, biosensors have been used in an increasing number of different fields and disciplines due to their wide applicability, reproducibility, and selectivity. Three large disciplines in which this has become relevant has been the forensic, biometric, and cybersecurity fields. The call for novel noninvasive biosensors for these three applications has been a focus of research in these fields. Recent advances in these three areas has relied on the use of biosensors based on primarily colorimetric assays based on bioaffinity interactions utilizing enzymatic assays. In forensics, the use of different bodily fluids for metabolite analysis provides an alternative to the use of DNA to avoid the backlog that is currently the main issue with DNA analysis by providing worthwhile information about the originator. In biometrics, the use of sweat-based systems for user authentication has been developed as a proof-of-concept design utilizing the levels of different metabolites found in sweat. Lastly, biosensor assays have been developed as a proof-of-concept for combination with cybersecurity, primarily cryptography, for the encryption and protection of data and messages.
Collapse
|
19
|
Chen YJ, Takahashi CN, Organick L, Bee C, Ang SD, Weiss P, Peck B, Seelig G, Ceze L, Strauss K. Quantifying molecular bias in DNA data storage. Nat Commun 2020; 11:3264. [PMID: 32601272 PMCID: PMC7324401 DOI: 10.1038/s41467-020-16958-3] [Citation(s) in RCA: 41] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/08/2019] [Accepted: 05/19/2020] [Indexed: 11/10/2022] Open
Abstract
DNA has recently emerged as an attractive medium for archival data storage. Recent work has demonstrated proof-of-principle prototype systems; however, very uneven (biased) sequencing coverage has been reported, which indicates inefficiencies in the storage process. Deviations from the average coverage in the sequence copy distribution can either cause wasteful provisioning in sequencing or excessive number of missing sequences. Here, we use millions of unique sequences from a DNA-based digital data archival system to study the oligonucleotide copy unevenness problem and show that the two paramount sources of bias are the synthesis and amplification (PCR) processes. Based on these findings, we develop a statistical model for each molecular process as well as the overall process. We further use our model to explore the trade-offs between synthesis bias, storage physical density, logical redundancy, and sequencing redundancy, providing insights for engineering efficient, robust DNA data storage systems.
Collapse
Affiliation(s)
| | - Christopher N Takahashi
- Paul G. Allen School of Computer Science and Engineering, University of Washington, Seattle, Washington, 98195, USA
| | - Lee Organick
- Paul G. Allen School of Computer Science and Engineering, University of Washington, Seattle, Washington, 98195, USA
| | - Callista Bee
- Paul G. Allen School of Computer Science and Engineering, University of Washington, Seattle, Washington, 98195, USA
| | | | - Patrick Weiss
- Twist Bioscience, San Francisco, California, 94158, USA
| | - Bill Peck
- Twist Bioscience, San Francisco, California, 94158, USA
| | - Georg Seelig
- Paul G. Allen School of Computer Science and Engineering, University of Washington, Seattle, Washington, 98195, USA.,Department of Electrical and Computer Engineering, University of Washington, Seattle, Washington, 98195, USA
| | - Luis Ceze
- Paul G. Allen School of Computer Science and Engineering, University of Washington, Seattle, Washington, 98195, USA.
| | - Karin Strauss
- Microsoft Research, Redmond, Washington, 98052, USA.
| |
Collapse
|
20
|
Lin KN, Volkel K, Tuck JM, Keung AJ. Dynamic and scalable DNA-based information storage. Nat Commun 2020; 11:2981. [PMID: 32532979 PMCID: PMC7293219 DOI: 10.1038/s41467-020-16797-2] [Citation(s) in RCA: 41] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2019] [Accepted: 05/20/2020] [Indexed: 11/11/2022] Open
Abstract
The physical architectures of information storage systems often dictate how information is encoded, databases are organized, and files are accessed. Here we show that a simple architecture comprised of a T7 promoter and a single-stranded overhang domain (ss-dsDNA), can unlock dynamic DNA-based information storage with powerful capabilities and advantages. The overhang provides a physical address for accessing specific DNA strands as well as implementing a range of in-storage file operations. It increases theoretical storage densities and capacities by expanding the encodable sequence space and simplifies the computational burden in designing sets of orthogonal file addresses. Meanwhile, the T7 promoter enables repeatable information access by transcribing information from DNA without destroying it. Furthermore, saturation mutagenesis around the T7 promoter and systematic analyses of environmental conditions reveal design criteria that can be used to optimize information access. This simple but powerful ss-dsDNA architecture lays the foundation for information storage with versatile capabilities. The physical architectures of information storage dictate how data is encoded, organised and accessed. Here the authors use DNA with a single-strand overhang as a physical address to access specific data and do in-storage file operations in a scalable and reusuable manner.
Collapse
Affiliation(s)
- Kevin N Lin
- Department of Chemical and Biomolecular Engineering, North Carolina State University, Campus Box 7905, Raleigh, NC, 27695-7905, USA
| | - Kevin Volkel
- Department of Electrical and Computer Engineering, North Carolina State University, Campus Box 7911, Raleigh, NC, 27695-7911, USA
| | - James M Tuck
- Department of Electrical and Computer Engineering, North Carolina State University, Campus Box 7911, Raleigh, NC, 27695-7911, USA.
| | - Albert J Keung
- Department of Chemical and Biomolecular Engineering, North Carolina State University, Campus Box 7905, Raleigh, NC, 27695-7905, USA.
| |
Collapse
|
21
|
Surface Sensitive Analysis Device using Model Membrane and Challenges for Biosensor-chip. BIOCHIP JOURNAL 2020. [DOI: 10.1007/s13206-019-4110-x] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
|
22
|
Nagy L, Kuki Á, Nagy T, Vadkerti B, Erdélyi Z, Kárpáti L, Zsuga M, Kéki S. Encoding Information into Polyethylene Glycol Using an Alcohol-Isocyanate "Click" Reaction. Int J Mol Sci 2020; 21:ijms21041318. [PMID: 32075293 PMCID: PMC7072859 DOI: 10.3390/ijms21041318] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2020] [Revised: 02/07/2020] [Accepted: 02/12/2020] [Indexed: 11/29/2022] Open
Abstract
In this article, the capability of encoding information using a homologous series of monodisperse monomethoxypolyethylene glycols (mPEG), with a number of ethylene oxide units ranging from nEO = 5 to 8, and monodisperse linear aliphatic isocyanates containing a number of CH2 units from 3 to 7, is demonstrated. The “click” reaction of the two corresponding homologous series yielded 20 different isocyanate end-capped polyethylene glycol derivatives (mPEG-OCONHR) whose sodiated adduct ion’s nominal m/z values spanned from 360 to 548, providing an average ca. 8 m/z unit for the storage of one-bit information. These mPEG-OCONHR oligomers were then used to encode information in binary sequences using a 384-well MALDI sample plate and employing the common dried-droplet sample preparation method capable of encoding 20 bit, i.e., 2.5 byte information in one spot, was employed. The information stored in the spots was read by MALDI-TOF MS using the m/z value of the corresponding mPEG-OCONHR oligomers. The capability of the method to store data was demonstrated by writing and reading a text file, visualizing a small picture and capturing a short audio file written in Musical Instrument Digital Interface (MIDI) sequence. Due to the very large similarities in the chemical structures of the encoding oligomers and their “easy to be ionized” property, as well as their very similar ionization efficiencies, the MALDI-TOF MS signal intensities from each compound was so strong and unambiguous that complete decoding could be performed in each case. In addition, the set of the proposed encoding oligomers can be further extended to attain higher bit “densities”.
Collapse
Affiliation(s)
- Lajos Nagy
- Department of Applied Chemistry, Faculty of Science and Technology, University of Debrecen, H-4032 Debrecen, Hungary; (L.N.); (Á.K.); (T.N.); (B.V.); (M.Z.)
| | - Ákos Kuki
- Department of Applied Chemistry, Faculty of Science and Technology, University of Debrecen, H-4032 Debrecen, Hungary; (L.N.); (Á.K.); (T.N.); (B.V.); (M.Z.)
| | - Tibor Nagy
- Department of Applied Chemistry, Faculty of Science and Technology, University of Debrecen, H-4032 Debrecen, Hungary; (L.N.); (Á.K.); (T.N.); (B.V.); (M.Z.)
| | - Bence Vadkerti
- Department of Applied Chemistry, Faculty of Science and Technology, University of Debrecen, H-4032 Debrecen, Hungary; (L.N.); (Á.K.); (T.N.); (B.V.); (M.Z.)
- Doctoral School of Chemistry, University of Debrecen, H-4032 Debrecen, Hungary
| | - Zoltán Erdélyi
- Department of Solid State Physics, Faculty of Science and Technology, University of Debrecen, H-4002 Debrecen, Hungary;
| | - Levente Kárpáti
- Department of Organic Chemistry, Faculty of Pharmacy, Semmelweis University, H-1092 Budapest, Hungary;
| | - Miklós Zsuga
- Department of Applied Chemistry, Faculty of Science and Technology, University of Debrecen, H-4032 Debrecen, Hungary; (L.N.); (Á.K.); (T.N.); (B.V.); (M.Z.)
| | - Sándor Kéki
- Department of Applied Chemistry, Faculty of Science and Technology, University of Debrecen, H-4032 Debrecen, Hungary; (L.N.); (Á.K.); (T.N.); (B.V.); (M.Z.)
- Correspondence:
| |
Collapse
|
23
|
Zhang Y, Li F, Li M, Mao X, Jing X, Liu X, Li Q, Li J, Wang L, Fan C, Zuo X. Encoding Carbon Nanotubes with Tubular Nucleic Acids for Information Storage. J Am Chem Soc 2019; 141:17861-17866. [PMID: 31603326 DOI: 10.1021/jacs.9b09116] [Citation(s) in RCA: 31] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
DNA has evolved to be a type of unparalleled material for storing and transmitting genetic information. Much recent attention has been drawn to translate the natural specificity of DNA hybridization reactions for information storage in vitro. In this work, we developed a new type of tubular nucleic acid (TNA) by condensing DNA chains on the surface of one-dimensional carbon nanotubes (CNTs). We find that DNA interacts with CNTs in a sequence-specific manner, resulting in different conformations including helix, i-motif, and G-quadruplex. Atomic force microscopic (AFM) imaging revealed that TNAs exhibit distinct patterns with characteristic height and distance that can be exploited for two-dimensional encoding on CNTs. We further demonstrate the use of TNA-CNT for information storage with visual output. This noncanonical, DNA hybridization-free strategy provides a new route to DNA-based data storage.
Collapse
Affiliation(s)
- Yueyue Zhang
- Institute of Molecular Medicine, State Key Laboratory of Oncogenes and Related Genes, Renji Hospital, School of Medicine and School of Chemistry and Chemical Engineering , Shanghai Jiao Tong University , Shanghai 200127 , China.,Division of Physical Biology and Bioimaging Center, CAS Key Laboratory of Interfacial Physics and Technology, Shanghai Synchrotron Radiation Facility , Shanghai Institute of Applied Physics, Chinese Academy of Sciences, University of Chinese Academy of Sciences , Shanghai 201800 , China
| | - Fan Li
- Institute of Molecular Medicine, State Key Laboratory of Oncogenes and Related Genes, Renji Hospital, School of Medicine and School of Chemistry and Chemical Engineering , Shanghai Jiao Tong University , Shanghai 200127 , China
| | - Min Li
- Institute of Molecular Medicine, State Key Laboratory of Oncogenes and Related Genes, Renji Hospital, School of Medicine and School of Chemistry and Chemical Engineering , Shanghai Jiao Tong University , Shanghai 200127 , China
| | - Xiuhai Mao
- Institute of Molecular Medicine, State Key Laboratory of Oncogenes and Related Genes, Renji Hospital, School of Medicine and School of Chemistry and Chemical Engineering , Shanghai Jiao Tong University , Shanghai 200127 , China
| | - Xinxin Jing
- Institute of Molecular Medicine, State Key Laboratory of Oncogenes and Related Genes, Renji Hospital, School of Medicine and School of Chemistry and Chemical Engineering , Shanghai Jiao Tong University , Shanghai 200127 , China
| | - Xiaoguo Liu
- Institute of Molecular Medicine, State Key Laboratory of Oncogenes and Related Genes, Renji Hospital, School of Medicine and School of Chemistry and Chemical Engineering , Shanghai Jiao Tong University , Shanghai 200127 , China
| | - Qian Li
- Institute of Molecular Medicine, State Key Laboratory of Oncogenes and Related Genes, Renji Hospital, School of Medicine and School of Chemistry and Chemical Engineering , Shanghai Jiao Tong University , Shanghai 200127 , China
| | - Jiang Li
- Division of Physical Biology and Bioimaging Center, CAS Key Laboratory of Interfacial Physics and Technology, Shanghai Synchrotron Radiation Facility , Shanghai Institute of Applied Physics, Chinese Academy of Sciences, University of Chinese Academy of Sciences , Shanghai 201800 , China
| | - Lihua Wang
- Division of Physical Biology and Bioimaging Center, CAS Key Laboratory of Interfacial Physics and Technology, Shanghai Synchrotron Radiation Facility , Shanghai Institute of Applied Physics, Chinese Academy of Sciences, University of Chinese Academy of Sciences , Shanghai 201800 , China
| | - Chunhai Fan
- Institute of Molecular Medicine, State Key Laboratory of Oncogenes and Related Genes, Renji Hospital, School of Medicine and School of Chemistry and Chemical Engineering , Shanghai Jiao Tong University , Shanghai 200127 , China
| | - Xiaolei Zuo
- Institute of Molecular Medicine, State Key Laboratory of Oncogenes and Related Genes, Renji Hospital, School of Medicine and School of Chemistry and Chemical Engineering , Shanghai Jiao Tong University , Shanghai 200127 , China
| |
Collapse
|
24
|
Data storage in DNA with fewer synthesis cycles using composite DNA letters. Nat Biotechnol 2019; 37:1229-1236. [PMID: 31501560 DOI: 10.1038/s41587-019-0240-x] [Citation(s) in RCA: 76] [Impact Index Per Article: 12.7] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2018] [Accepted: 07/25/2019] [Indexed: 12/24/2022]
Abstract
The density and long-term stability of DNA make it an appealing storage medium, particularly for long-term data archiving. Existing DNA storage technologies involve the synthesis and sequencing of multiple nominally identical molecules in parallel, resulting in information redundancy. We report the development of encoding and decoding methods that exploit this redundancy using composite DNA letters. A composite DNA letter is a representation of a position in a sequence that consists of a mixture of all four DNA nucleotides in a predetermined ratio. Our methods encode data using fewer synthesis cycles. We encode 6.4 MB into composite DNA, with distinguishable composition medians, using 20% fewer synthesis cycles per unit of data, as compared to previous reports. We also simulate encoding with larger composite alphabets, with distinguishable composition deciles, to show that 75% fewer synthesis cycles are potentially sufficient. We describe applicable error-correcting codes and inference methods, and investigate error patterns in the context of composite DNA letters.
Collapse
|
25
|
Tomek KJ, Volkel K, Simpson A, Hass AG, Indermaur EW, Tuck JM, Keung AJ. Driving the Scalability of DNA-Based Information Storage Systems. ACS Synth Biol 2019; 8:1241-1248. [PMID: 31117362 DOI: 10.1021/acssynbio.9b00100] [Citation(s) in RCA: 38] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023]
Abstract
The extreme density of DNA presents a compelling advantage over current storage media; however, to reach practical capacities, new systems for organizing and accessing information are needed. Here, we use chemical handles to selectively extract unique files from a complex database of DNA mimicking 5 TB of data and design and implement a nested file address system that increases the theoretical maximum capacity of DNA storage systems by five orders of magnitude. These advancements enable the development and future scaling of DNA-based data storage systems with modern capacities and file access capabilities.
Collapse
Affiliation(s)
- Kyle J. Tomek
- Department of Chemical and Biomolecular Engineering, North Carolina State University, Raleigh, North Carolina 27695, United States
| | - Kevin Volkel
- Department of Electrical and Computer Engineering, North Carolina State University, Raleigh, North Carolina 27695, United States
| | - Alexander Simpson
- Department of Electrical and Computer Engineering, North Carolina State University, Raleigh, North Carolina 27695, United States
| | - Austin G. Hass
- Department of Chemical and Biomolecular Engineering, North Carolina State University, Raleigh, North Carolina 27695, United States
- Department of Structural and Molecular Biochemistry, North Carolina State University, Raleigh, North Carolina 27695, United States
| | - Elaine W. Indermaur
- Department of Chemical and Biomolecular Engineering, North Carolina State University, Raleigh, North Carolina 27695, United States
| | - James M. Tuck
- Department of Electrical and Computer Engineering, North Carolina State University, Raleigh, North Carolina 27695, United States
| | - Albert J. Keung
- Department of Chemical and Biomolecular Engineering, North Carolina State University, Raleigh, North Carolina 27695, United States
| |
Collapse
|
26
|
Chen K, Kong J, Zhu J, Ermann N, Predki P, Keyser UF. Digital Data Storage Using DNA Nanostructures and Solid-State Nanopores. NANO LETTERS 2019; 19:1210-1215. [PMID: 30585490 DOI: 10.1021/acs.nanolett.8b04715] [Citation(s) in RCA: 94] [Impact Index Per Article: 15.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/21/2023]
Abstract
Solid-state nanopores are powerful tools for reading the three-dimensional shape of molecules, allowing for the translation of molecular structure information into electric signals. Here, we show a high-resolution integrated nanopore system for identifying DNA nanostructures that has the capability of distinguishing attached short DNA hairpins with only a stem length difference of 8 bp along a DNA double strand named the DNA carrier. Using our platform, we can read up to 112 DNA hairpins with a separating distance of 114 bp attached on a DNA carrier that carries digital information. Our encoding strategy allows for the creation of a library of molecules with a size of up to 5 × 1033 (2112) that is only built from a few hundred types of base molecules for data storage and has the potential to be extended by linking multiple DNA carriers. Our platform provides a nanopore- and DNA nanostructure-based data storage method with convenient access and the potential for miniature-scale integration.
Collapse
Affiliation(s)
- Kaikai Chen
- Cavendish Laboratory , University of Cambridge , JJ Thomson Avenue , Cambridge CB3 0HE , United Kingdom
| | - Jinglin Kong
- Cavendish Laboratory , University of Cambridge , JJ Thomson Avenue , Cambridge CB3 0HE , United Kingdom
| | - Jinbo Zhu
- Cavendish Laboratory , University of Cambridge , JJ Thomson Avenue , Cambridge CB3 0HE , United Kingdom
| | - Niklas Ermann
- Cavendish Laboratory , University of Cambridge , JJ Thomson Avenue , Cambridge CB3 0HE , United Kingdom
| | - Paul Predki
- Iridia Incorporated , 3156 Lionshead Avenue , Suite 1, Carlsbad , California 92010 , United States
| | - Ulrich F Keyser
- Cavendish Laboratory , University of Cambridge , JJ Thomson Avenue , Cambridge CB3 0HE , United Kingdom
| |
Collapse
|
27
|
|
28
|
Abstract
The very high information density of DNA has prompted speculations on its use for information storage. The high costs of DNA synthesis and sequencing made this highly unpractical; however recent progress (notably array oligonucleotide synthesis) is changing the situation. A recent paper shows encoding and decoding of significant amounts of data (200 MB) with random access to individual files and faithful retrieval of content, at a cost that is still high but not extreme. Much progress remains to be achieved, but this use of DNA in now technically achievable and may eventually become practical.
Collapse
Affiliation(s)
- Bertrand Jordan
- UMR 7268 ADÉS, Aix-Marseille, Université/EFS/ CNRS ; CoReBio PACA,case 901, Parc scientifique de Luminy, 13288 Marseille Cedex 09, France
| |
Collapse
|
29
|
Organick L, Ang SD, Chen YJ, Lopez R, Yekhanin S, Makarychev K, Racz MZ, Kamath G, Gopalan P, Nguyen B, Takahashi CN, Newman S, Parker HY, Rashtchian C, Stewart K, Gupta G, Carlson R, Mulligan J, Carmean D, Seelig G, Ceze L, Strauss K. Random access in large-scale DNA data storage. Nat Biotechnol 2018; 36:242-248. [PMID: 29457795 DOI: 10.1038/nbt.4079] [Citation(s) in RCA: 240] [Impact Index Per Article: 34.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2017] [Accepted: 01/11/2018] [Indexed: 01/02/2023]
Abstract
Synthetic DNA is durable and can encode digital data with high density, making it an attractive medium for data storage. However, recovering stored data on a large-scale currently requires all the DNA in a pool to be sequenced, even if only a subset of the information needs to be extracted. Here, we encode and store 35 distinct files (over 200 MB of data), in more than 13 million DNA oligonucleotides, and show that we can recover each file individually and with no errors, using a random access approach. We design and validate a large library of primers that enable individual recovery of all files stored within the DNA. We also develop an algorithm that greatly reduces the sequencing read coverage required for error-free decoding by maximizing information from all sequence reads. These advances demonstrate a viable, large-scale system for DNA data storage and retrieval.
Collapse
Affiliation(s)
- Lee Organick
- Paul G. Allen School of Computer Science and Engineering, University of Washington, Seattle, Washington, USA
| | | | | | - Randolph Lopez
- Department of Bioengineering Department, University of Washington, Seattle, Washington, USA
| | | | | | | | | | | | | | - Christopher N Takahashi
- Paul G. Allen School of Computer Science and Engineering, University of Washington, Seattle, Washington, USA
| | - Sharon Newman
- Paul G. Allen School of Computer Science and Engineering, University of Washington, Seattle, Washington, USA
| | | | | | - Kendall Stewart
- Paul G. Allen School of Computer Science and Engineering, University of Washington, Seattle, Washington, USA
| | - Gagan Gupta
- Microsoft Research, Redmond, Washington, USA
| | | | | | | | - Georg Seelig
- Paul G. Allen School of Computer Science and Engineering, University of Washington, Seattle, Washington, USA.,Department of Electrical Engineering, University of Washington, Seattle, Washington, USA
| | - Luis Ceze
- Paul G. Allen School of Computer Science and Engineering, University of Washington, Seattle, Washington, USA
| | | |
Collapse
|
30
|
Long-Term Stability and Integrity of Plasmid-Based DNA Data Storage. Polymers (Basel) 2018; 10:polym10010028. [PMID: 30966073 PMCID: PMC6415062 DOI: 10.3390/polym10010028] [Citation(s) in RCA: 29] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2017] [Revised: 12/02/2017] [Accepted: 12/08/2017] [Indexed: 01/14/2023] Open
Abstract
Validation of long-term DNA stability and integrity are essential for the use of DNA in data storage applications. Because of this, we evaluated the plasmid-based DNA data storage in a manner that preserves DNA stability and integrity. A document consisting of 2046 words was encoded with DNA sequences using Perl script, and the encoded DNA sequences were synthesized for information storage. The DNA comprised a total of 22 chemically synthesized DNA fragments with 400 nucleotides each, which were incorporated into a plasmid vector. A long-term DNA stability study demonstrated that 3-year stored plasmid containing text information showed DNA stability at controlled conditions of −20 °C. The plasmid DNA under accelerated aging conditions (AAC) up to 65 °C for 20 days, which corresponds to approximately 20 years of storage at −20 °C, also exhibited no significant differences in DNA stability compared to newly produced plasmid. Also, the 3-year old plasmid stored at −20 °C and the AAC-tested plasmid stored up to 65 °C for 20 days had functional integrity and nucleotide integrity comparable to control sample, thereby allowing for retrieval of the original error-free text data. Finally, the nucleotides were sequenced, and then decoded to retrieve the original data, thereby allowing us to read the text with 100% accuracy, and amplify the DNA with a simple and quick bacterial transformation. To the best of our knowledge, this is the first report on examining the long-term stability and integrity of plasmid-based DNA data storage. Taken together, our results indicate that plasmid DNA data storage can be useful for long-term archival storage to recover the source text in a reproducible and accountable manner.
Collapse
|
31
|
Niu Y, Zhang X, Han F. Image Encryption Algorithm Based on Hyperchaotic Maps and Nucleotide Sequences Database. COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE 2017; 2017:4079793. [PMID: 28392799 PMCID: PMC5368421 DOI: 10.1155/2017/4079793] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/08/2017] [Accepted: 02/27/2017] [Indexed: 12/03/2022]
Abstract
Image encryption technology is one of the main means to ensure the safety of image information. Using the characteristics of chaos, such as randomness, regularity, ergodicity, and initial value sensitiveness, combined with the unique space conformation of DNA molecules and their unique information storage and processing ability, an efficient method for image encryption based on the chaos theory and a DNA sequence database is proposed. In this paper, digital image encryption employs a process of transforming the image pixel gray value by using chaotic sequence scrambling image pixel location and establishing superchaotic mapping, which maps quaternary sequences and DNA sequences, and by combining with the logic of the transformation between DNA sequences. The bases are replaced under the displaced rules by using DNA coding in a certain number of iterations that are based on the enhanced quaternary hyperchaotic sequence; the sequence is generated by Chen chaos. The cipher feedback mode and chaos iteration are employed in the encryption process to enhance the confusion and diffusion properties of the algorithm. Theoretical analysis and experimental results show that the proposed scheme not only demonstrates excellent encryption but also effectively resists chosen-plaintext attack, statistical attack, and differential attack.
Collapse
Affiliation(s)
- Ying Niu
- College of Electric Information Engineering, Zhengzhou University of Light Industry, Zhengzhou 450002, China
| | - Xuncai Zhang
- College of Electric Information Engineering, Zhengzhou University of Light Industry, Zhengzhou 450002, China
| | - Feng Han
- College of Electric Information Engineering, Zhengzhou University of Light Industry, Zhengzhou 450002, China
| |
Collapse
|
32
|
Gooch J, Daniel B, Abbate V, Frascione N. Taggant materials in forensic science: A review. Trends Analyt Chem 2016. [DOI: 10.1016/j.trac.2016.08.003] [Citation(s) in RCA: 42] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|
33
|
Mayer C, McInroy GR, Murat P, Van Delft P, Balasubramanian S. An Epigenetics-Inspired DNA-Based Data Storage System. Angew Chem Int Ed Engl 2016; 55:11144-8. [PMID: 27440712 PMCID: PMC5113786 DOI: 10.1002/anie.201605531] [Citation(s) in RCA: 28] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2016] [Indexed: 12/22/2022]
Abstract
Biopolymers are an attractive alternative to store and circulate information. DNA, for example, combines remarkable longevity with high data storage densities and has been demonstrated as a means for preserving digital information. Inspired by the dynamic, biological regulation of (epi)genetic information, we herein present how binary data can undergo controlled changes when encoded in synthetic DNA strands. By exploiting differential kinetics of hydrolytic deamination reactions of cytosine and its naturally occurring derivatives, we demonstrate how multiple layers of information can be stored in a single DNA template. Moreover, we show that controlled redox reactions allow for interconversion of these DNA-encoded layers of information. Overall, such interlacing of multiple messages on synthetic DNA libraries showcases the potential of chemical reactions to manipulate digital information on (bio)polymers.
Collapse
Affiliation(s)
- Clemens Mayer
- Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge, CB2 1EW, UK
| | - Gordon R McInroy
- Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge, CB2 1EW, UK
| | - Pierre Murat
- Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge, CB2 1EW, UK
| | - Pieter Van Delft
- Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge, CB2 1EW, UK
| | - Shankar Balasubramanian
- Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge, CB2 1EW, UK.
- Cancer Research, UK, Cambridge Institute, Li Ka Shing Centre, University of Cambridge, Robinson Way, Cambridge, CB2 0RE, UK.
- School of Clinical Medicine, University of Cambridge, Cambridge, CB2 0SP, UK.
| |
Collapse
|
34
|
New Trends of Digital Data Storage in DNA. BIOMED RESEARCH INTERNATIONAL 2016; 2016:8072463. [PMID: 27689089 PMCID: PMC5027317 DOI: 10.1155/2016/8072463] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/15/2016] [Revised: 07/21/2016] [Accepted: 08/02/2016] [Indexed: 11/17/2022]
Abstract
With the exponential growth in the capacity of information generated and the emerging need for data to be stored for prolonged period of time, there emerges a need for a storage medium with high capacity, high storage density, and possibility to withstand extreme environmental conditions. DNA emerges as the prospective medium for data storage with its striking features. Diverse encoding models for reading and writing data onto DNA, codes for encrypting data which addresses issues of error generation, and approaches for developing codons and storage styles have been developed over the recent past. DNA has been identified as a potential medium for secret writing, which achieves the way towards DNA cryptography and stenography. DNA utilized as an organic memory device along with big data storage and analytics in DNA has paved the way towards DNA computing for solving computational problems. This paper critically analyzes the various methods used for encoding and encrypting data onto DNA while identifying the advantages and capability of every scheme to overcome the drawbacks identified priorly. Cryptography and stenography techniques have been analyzed in a critical approach while identifying the limitations of each method. This paper also identifies the advantages and limitations of DNA as a memory device and memory applications.
Collapse
|
35
|
Mayer C, McInroy GR, Murat P, Van Delft P, Balasubramanian S. An Epigenetics‐Inspired DNA‐Based Data Storage System. Angew Chem Int Ed Engl 2016. [DOI: 10.1002/ange.201605531] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Affiliation(s)
- Clemens Mayer
- Department of Chemistry University of Cambridge Lensfield Road Cambridge CB2 1EW UK
| | - Gordon R. McInroy
- Department of Chemistry University of Cambridge Lensfield Road Cambridge CB2 1EW UK
| | - Pierre Murat
- Department of Chemistry University of Cambridge Lensfield Road Cambridge CB2 1EW UK
| | - Pieter Van Delft
- Department of Chemistry University of Cambridge Lensfield Road Cambridge CB2 1EW UK
| | - Shankar Balasubramanian
- Department of Chemistry University of Cambridge Lensfield Road Cambridge CB2 1EW UK
- Cancer Research, UK, Cambridge Institute, Li Ka Shing Centre University of Cambridge Robinson Way Cambridge CB2 0RE UK
- School of Clinical Medicine University of Cambridge Cambridge CB2 0SP UK
| |
Collapse
|
36
|
Zakeri B, Carr PA, Lu TK. Multiplexed Sequence Encoding: A Framework for DNA Communication. PLoS One 2016; 11:e0152774. [PMID: 27050646 PMCID: PMC4822886 DOI: 10.1371/journal.pone.0152774] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2015] [Accepted: 02/28/2016] [Indexed: 11/19/2022] Open
Abstract
Synthetic DNA has great propensity for efficiently and stably storing non-biological information. With DNA writing and reading technologies rapidly advancing, new applications for synthetic DNA are emerging in data storage and communication. Traditionally, DNA communication has focused on the encoding and transfer of complete sets of information. Here, we explore the use of DNA for the communication of short messages that are fragmented across multiple distinct DNA molecules. We identified three pivotal points in a communication-data encoding, data transfer & data extraction-and developed novel tools to enable communication via molecules of DNA. To address data encoding, we designed DNA-based individualized keyboards (iKeys) to convert plaintext into DNA, while reducing the occurrence of DNA homopolymers to improve synthesis and sequencing processes. To address data transfer, we implemented a secret-sharing system-Multiplexed Sequence Encoding (MuSE)-that conceals messages between multiple distinct DNA molecules, requiring a combination key to reveal messages. To address data extraction, we achieved the first instance of chromatogram patterning through multiplexed sequencing, thereby enabling a new method for data extraction. We envision these approaches will enable more widespread communication of information via DNA.
Collapse
Affiliation(s)
- Bijan Zakeri
- Department of Electrical Engineering and Computer Science, Department of Biological Engineering, Research Laboratory of Electronics, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, MA 02139, United States of America
- MIT Synthetic Biology Center, 500 Technology Square, Cambridge, MA 02139, United States of America
| | - Peter A. Carr
- MIT Synthetic Biology Center, 500 Technology Square, Cambridge, MA 02139, United States of America
- MIT Lincoln Laboratory, 244 Wood Street, Lexington, MA 02420, United States of America
| | - Timothy K. Lu
- Department of Electrical Engineering and Computer Science, Department of Biological Engineering, Research Laboratory of Electronics, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, MA 02139, United States of America
- MIT Synthetic Biology Center, 500 Technology Square, Cambridge, MA 02139, United States of America
| |
Collapse
|
37
|
Brunet TDP. Aims and methods of biosteganography. J Biotechnol 2016; 226:56-64. [PMID: 27021958 DOI: 10.1016/j.jbiotec.2016.03.044] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2016] [Revised: 03/21/2016] [Accepted: 03/23/2016] [Indexed: 12/31/2022]
Abstract
Applications of biotechnology to information security are now possible and have potentially far reaching political and technological implications. This change in information security practices, initiated by advancements in molecular biological and biotechnology, warrants reasonable and widespread consideration by biologists, biotechnologists and philosophers. I offer an explication of the landmark contributions, developments and current possibilities of biosteganography-the process of transmitting secure messages via biological mediums. I address, (i) how information can be stored and encoded in biological mediums, (ii) how biological mediums (e.g. DNA, RNA, protein) and storage systems (e.g. cells, biofilms, organisms) influence the nature of information security, and (iii) what constitutes a viable application of such biotechnologies.
Collapse
Affiliation(s)
- Tyler D P Brunet
- Computational Biology and Bioinformatics, Dalhousie University, 6050 University Avenue, Halifax, NS, Canada B3H 1W5.
| |
Collapse
|
38
|
Zakeri B, Lu TK. DNA nanotechnology: new adventures for an old warhorse. Curr Opin Chem Biol 2015; 28:9-14. [PMID: 26056949 PMCID: PMC4818966 DOI: 10.1016/j.cbpa.2015.05.020] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2015] [Revised: 05/11/2015] [Accepted: 05/14/2015] [Indexed: 10/23/2022]
Abstract
As the blueprint of life, the natural exploits of DNA are admirable. However, DNA should not only be viewed within a biological context. It is an elegantly simple yet functionally complex chemical polymer with properties that make it an ideal platform for engineering new nanotechnologies. Rapidly advancing synthesis and sequencing technologies are enabling novel unnatural applications for DNA beyond the realm of genetics. Here we explore the chemical biology of DNA nanotechnology for emerging applications in communication and digital data storage. Early studies of DNA as an alternative to magnetic and optical storage mediums have not only been promising, but have demonstrated the potential of DNA to revolutionize the way we interact with digital data in the future.
Collapse
Affiliation(s)
- Bijan Zakeri
- Department of Electrical Engineering and Computer Science, Department of Biological Engineering, Research Laboratory of Electronics, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, MA 02139, USA; MIT Synthetic Biology Center, 500 Technology Square, Cambridge, MA 02139, USA.
| | - Timothy K Lu
- Department of Electrical Engineering and Computer Science, Department of Biological Engineering, Research Laboratory of Electronics, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, MA 02139, USA; MIT Synthetic Biology Center, 500 Technology Square, Cambridge, MA 02139, USA.
| |
Collapse
|
39
|
|
40
|
Goldman N, Bertone P, Chen S, Dessimoz C, LeProust EM, Sipos B, Birney E. Towards practical, high-capacity, low-maintenance information storage in synthesized DNA. Nature 2013; 494:77-80. [PMID: 23354052 DOI: 10.1038/nature11875] [Citation(s) in RCA: 409] [Impact Index Per Article: 34.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2012] [Accepted: 12/12/2012] [Indexed: 11/09/2022]
Abstract
Digital production, transmission and storage have revolutionized how we access and use information but have also made archiving an increasingly complex task that requires active, continuing maintenance of digital media. This challenge has focused some interest on DNA as an attractive target for information storage because of its capacity for high-density information encoding, longevity under easily achieved conditions and proven track record as an information bearer. Previous DNA-based information storage approaches have encoded only trivial amounts of information or were not amenable to scaling-up, and used no robust error-correction and lacked examination of their cost-efficiency for large-scale information archival. Here we describe a scalable method that can reliably store more information than has been handled before. We encoded computer files totalling 739 kilobytes of hard-disk storage and with an estimated Shannon information of 5.2 × 10(6) bits into a DNA code, synthesized this DNA, sequenced it and reconstructed the original files with 100% accuracy. Theoretical analysis indicates that our DNA-based storage scheme could be scaled far beyond current global information volumes and offers a realistic technology for large-scale, long-term and infrequently accessed digital archiving. In fact, current trends in technological advances are reducing DNA synthesis costs at a pace that should make our scheme cost-effective for sub-50-year archiving within a decade.
Collapse
Affiliation(s)
- Nick Goldman
- European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton CB10 1SD, UK.
| | | | | | | | | | | | | |
Collapse
|
41
|
|
42
|
Shoshani S, Piran R, Arava Y, Keinan E. A Molecular Cryptosystem for Images by DNA Computing. Angew Chem Int Ed Engl 2012. [DOI: 10.1002/ange.201107156] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/26/2023]
|
43
|
Shoshani S, Piran R, Arava Y, Keinan E. A Molecular Cryptosystem for Images by DNA Computing. Angew Chem Int Ed Engl 2012; 51:2883-7. [DOI: 10.1002/anie.201107156] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2011] [Revised: 01/09/2012] [Indexed: 01/22/2023]
|
44
|
Oh JM, Park DH, Choy JH. Integrated bio-inorganic hybrid systems for nano-forensics. Chem Soc Rev 2010; 40:583-95. [PMID: 21152667 DOI: 10.1039/c0cs00051e] [Citation(s) in RCA: 48] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
This tutorial review describes a new class of data processing system that applies information theory at the molecular level. We also summarize the recent multidisciplinary advances in biotechnology and nanotechnology that have facilitated the development of reliable nano-level code systems. After a brief introduction of information theory, we present possible ways to adapt this concept to the molecular world. This review explains the requirements and solutions for each step necessary to apply a nano code system to real products. Finally, we introduce a designed nano code system for agricultural products as one example of the many possible applications for nano codes.
Collapse
Affiliation(s)
- Jae-Min Oh
- Department of Chemistry and Medical Chemistry, College of Science and Technology, Yonsei University, Wonju, Gangwondo, Korea
| | | | | |
Collapse
|
45
|
Paek SM, Oh JM, Choy JH. A Lattice-Engineering Route to Heterostructured Functional Nanohybrids. Chem Asian J 2010; 6:324-38. [DOI: 10.1002/asia.201000578] [Citation(s) in RCA: 38] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2010] [Indexed: 01/11/2023]
|
46
|
An improved Huffman coding method for archiving text, images, and music characters in DNA. Biotechniques 2010; 47:747-54. [PMID: 19852760 DOI: 10.2144/000113218] [Citation(s) in RCA: 36] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Abstract
An improved Huffman coding method for information storage in DNA is described. The method entails the utilization of modified unambiguous base assignment that enables efficient coding of characters. A plasmid-based library with efficient and reliable information retrieval and assembly with uniquely designed primers is described. We illustrate our approach by synthesis of DNA that encodes text, images, and music, which could easily be retrieved by DNA sequencing using the specific primers. The method is simple and lends itself to automated information retrieval.
Collapse
|
47
|
Stabilizing synthetic data in the DNA of living organisms. SYSTEMS AND SYNTHETIC BIOLOGY 2008; 2:19-25. [PMID: 19083123 PMCID: PMC2671590 DOI: 10.1007/s11693-008-9020-5] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/19/2008] [Revised: 11/19/2008] [Accepted: 11/25/2008] [Indexed: 12/03/2022]
Abstract
Data-encoding synthetic DNA, inserted into the genome of a living organism, is thought to be more robust than the current media. Because the living genome is duplicated and copied into new generations, one of the merits of using DNA material is long-term data storage within heritable media. A disadvantage of this approach is that encoded data can be unexpectedly broken by mutation, deletion, and insertion of DNA, which occurs naturally during evolution and prolongation, or laboratory experiments. For this reason, several information theory-based approaches have been developed as an error check of broken DNA data in order to achieve data durability. These approaches cannot efficiently recover badly damaged data-encoding DNA. We recently developed a DNA data-storage approach based on the multiple sequence alignment method to achieve a high level of data durability. In this paper, we overview this technology and discuss strategies for optimal application of this approach.
Collapse
|
48
|
Abstract
Biomolecular computing is an emerging field at the interface of computer science, biological science and engineering. It uses DNA and other biological materials as the building blocks for construction of living computational machines to solve difficult combinatorial problems. In this article, notable advances in the biomolecular computing are reviewed and challenges associated with this multidisciplinary research are addressed. Finally, several perspectives are given based on the review of biomolecular computing.
Collapse
Affiliation(s)
- Pengcheng Fu
- Department of Molecular Biosciences and Bioengineering, University of Hawaii at Manoa, Honolulu, HI 96822, USA.
| |
Collapse
|
49
|
Tanaka K, Okamoto A, Saito I. Public-key system using DNA as a one-way function for key distribution. Biosystems 2005; 81:25-9. [PMID: 15917125 DOI: 10.1016/j.biosystems.2005.01.004] [Citation(s) in RCA: 64] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2004] [Revised: 01/26/2005] [Accepted: 01/28/2005] [Indexed: 11/22/2022]
Abstract
Novel public-key system using DNA has been developed. To solve key distribution problem, the public-key cryptography system based on the one-way function has been developed. The message-encoded DNA hidden in dummies can be restored by PCR amplification, followed by sequencing. We used these operations as a one-way function, and constituted a novel method for the key distribution based on the public-key system using DNA. We will show the way of holding a key in common just between specific two persons.
Collapse
Affiliation(s)
- Kazuo Tanaka
- Department of Synthetic Chemistry and Biological Chemistry, Faculty of Engineering, Kyoto University, Kyotodaigakukatsura, Nishigyo-ku, Kyoto 615-8510, Japan.
| | | | | |
Collapse
|