WO2004088585A2 - A method for storing information in dna - Google Patents

A method for storing information in dna Download PDF

Info

Publication number
WO2004088585A2
WO2004088585A2 PCT/IB2004/000975 IB2004000975W WO2004088585A2 WO 2004088585 A2 WO2004088585 A2 WO 2004088585A2 IB 2004000975 W IB2004000975 W IB 2004000975W WO 2004088585 A2 WO2004088585 A2 WO 2004088585A2
Authority
WO
WIPO (PCT)
Prior art keywords
dna
encrypted
information
sequence
segment
Prior art date
Application number
PCT/IB2004/000975
Other languages
French (fr)
Other versions
WO2004088585A3 (en
WO2004088585A8 (en
Inventor
Lalit M. Bharadwaj
Awdesh Kumar Shukla
Amit P. Bhondekar
Rakesh Kumar
Ram Prakash Bajpai
Original Assignee
Council Of Scientific And Industrial Research
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Council Of Scientific And Industrial Research filed Critical Council Of Scientific And Industrial Research
Priority to JP2006506420A priority Critical patent/JP2006522356A/en
Priority to GB0521163A priority patent/GB2416232A/en
Publication of WO2004088585A2 publication Critical patent/WO2004088585A2/en
Publication of WO2004088585A8 publication Critical patent/WO2004088585A8/en
Publication of WO2004088585A3 publication Critical patent/WO2004088585A3/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/12Computing arrangements based on biological models using genetic models
    • G06N3/123DNA computing
    • BPERFORMING OPERATIONS; TRANSPORTING
    • B82NANOTECHNOLOGY
    • B82YSPECIFIC USES OR APPLICATIONS OF NANOSTRUCTURES; MEASUREMENT OR ANALYSIS OF NANOSTRUCTURES; MANUFACTURE OR TREATMENT OF NANOSTRUCTURES
    • B82Y10/00Nanotechnology for information processing, storage or transmission, e.g. quantum computing or single electron logic
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11CSTATIC STORES
    • G11C13/00Digital stores characterised by the use of storage elements not covered by groups G11C11/00, G11C23/00, or G11C25/00
    • G11C13/0002Digital stores characterised by the use of storage elements not covered by groups G11C11/00, G11C23/00, or G11C25/00 using resistive RAM [RRAM] elements
    • G11C13/0009RRAM elements whose operation depends upon chemical change
    • G11C13/0014RRAM elements whose operation depends upon chemical change comprising cells based on organic memory material
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11CSTATIC STORES
    • G11C13/00Digital stores characterised by the use of storage elements not covered by groups G11C11/00, G11C23/00, or G11C25/00
    • G11C13/0002Digital stores characterised by the use of storage elements not covered by groups G11C11/00, G11C23/00, or G11C25/00 using resistive RAM [RRAM] elements
    • G11C13/0009RRAM elements whose operation depends upon chemical change
    • G11C13/0014RRAM elements whose operation depends upon chemical change comprising cells based on organic memory material
    • G11C13/0019RRAM elements whose operation depends upon chemical change comprising cells based on organic memory material comprising bio-molecules
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16BBIOINFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR GENETIC OR PROTEIN-RELATED DATA PROCESSING IN COMPUTATIONAL MOLECULAR BIOLOGY
    • G16B30/00ICT specially adapted for sequence analysis involving nucleotides or amino acids
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B20/00Signal processing not specific to the method of recording or reproducing; Circuits therefor
    • G11B20/00086Circuits for prevention of unauthorised reproduction or copying, e.g. piracy
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B20/00Signal processing not specific to the method of recording or reproducing; Circuits therefor
    • G11B20/00086Circuits for prevention of unauthorised reproduction or copying, e.g. piracy
    • G11B20/0021Circuits for prevention of unauthorised reproduction or copying, e.g. piracy involving encryption or decryption of contents recorded on or reproduced from a record carrier

Definitions

  • the present invention relates to a method for storing information in DNA.
  • the method of invention comprises storing information in DNA.
  • the present invention addresses storage for all kind of digital information whether it is a text file, an image file or an audio file. Large sequences are divided into multiple segments. Background of the invention
  • DNA is the best molecular electronic device ever produced on the earth because DNA can store, process and provide information for growth and maintenance of living system. All living species are as a result of single cell produced during reproduction. In most of the cases this single cell does not have most of the materials required for fabricating a living system but contains all the information and processing capability to fabricate living spaces by taking materials from environment, for example, fabrication of baby from Zygote which contains rearranged DNA sequences of parents.
  • DNA is ready to use nanowire of 2 nm and can be synthesized in any sequence of four bases i.e. ATGC.
  • DNA of every living organism consist of large number of DNA segments where each segment represents a processor to execute a particular biological process for growth and maintaining life.
  • DNA the building block of life, can store information for billion of years, .
  • the tremendous information storage capacity of DNA can be imagined from the fact that 1 gram of DNA contains as much information as 1 trillion CD's 1 ' four bases (A,T,G,C) instead of 0 and 1, extremely energy efficient (10 19 operations per joule), synthesis of any imaginable sequence is possible and semiconductor are approaching limit.
  • the main object of the present invention is to develop a comprehensive DNA based information storage technique.
  • Another object of the present invention is to encrypt complete extended ASCII character set in terms of minimum number of DNA bases.
  • Another object of the present invention is to develop software to encrypt/decrypt data in terms DNA bases.
  • Yet another object of the present invention is to design suitable primers to be flanked at both ends of the encrypted and synthesized information. Summary of the invention
  • the present invention provides a method for storing information in DNA.
  • the method of invention comprises storing information in DNA.
  • the present invention addresses storage for all kind of digital information whether it is a text file, an image file or an audio file. Large sequences are divided into multiple segments Brief description of the accompanying drawings
  • Fig. la Information storage in DNA. Structure of prototypical single segment information storage in DNA strand.
  • Fig. lb Information storage in DNA. Structure of prototypical multi segment information storage in DNA strand.
  • the present invention provides a method for storing information in DNA.
  • the method of invention comprises storing information in DNA.
  • the present invention addresses storage for all kind of digital information whether it is a text file, an image file or an audio file. Large sequences are divided into multiple segments
  • the method enables the storage of information in DNA.
  • a software based on the above method enables all 256 Extended ASCII characters to be defined in terms of DNA sequences.
  • Software named as "DNASTORE” has been developed in Visual Basic 6.0 for encryption and decryption of digital information in terms of DNA bases.
  • DNASTORE complete extended ASCII character set can be encoded 256 different ways.
  • plain text/image or any digital information is encrypted in terms of DNA sequences using encryption key (software). If the information overflows the limits i.e. it cannot be synthesized in a single piece then it is encrypted and fragmented in a number of segments. Synthesis of encrypted sequence(s) is carried out using DNA synthesizer.
  • a fixed number of different DNA primers sequence have been designed and assigned a number, which resembles the segment position it represents e.g. segment 1, segment 2 segment n. These are called as header primers.
  • Two tail primers have also been designed one resembles continuation and other resembles termination segment.
  • the DNA segment(s) is/are flanked by known PCR primers [as described earlier] at both the ends i.e. header primers are attached at the beginning of segment and tail primers are attached at the end of the segment. If there is only one segment, at the beginning it is flanked by header primer number 1 and at the end it is flanked by termination tail primer. However, if there are more than one segments, each segment would be attached with header primers numbered as 1, 2,3... n respectively, at the end these would be attached with a continuation tail primer except for last segment which would be attached with a termination tail primer.
  • the SM DNA is then mixed with the enormous complex denatured DNA strands of genomic DNA of human or other organism.
  • genomic DNA As the human genome contains about 3x10 9 nucleotide pairs, fragmented & denatured human DNA provides a very complex background for storing the encrypted DNA.
  • the DNA can be stored and transported on paper, cloths, buttons etc.
  • Example 1 Encryption and decryption of a textual message "CSIR" in terms of DNA bases may be defined as a) Generation of an array of 256 elements (unique 4-base per character i.e. ATGC, ATGA, ATGT, ATGG). These elements represent complete extended ASCII character set values. b) The input information is then encrypted character-by-character using array generated in step 1. The basis is ASCII values of each character is matched with the element no. of the array of step 1.
  • Encryption of the text "CSIR" in terms of DNA bases may be: TATGTTTCTATTTTAC where
  • C is represented by DNA sequence TATG
  • S is represented by DNA sequence TTTC
  • R is represented by DNA sequence TTAC c) If the information overflows the limits i.e. it cannot be synthesized in a single piece or because of any other problem, then the encrypted sequence is fragmented in a number segments. d) Encrypted segment(s) is/are then flanked on each side with header and tail primers. e) Synthesis of encrypted sequence(s) is then carried out using DNA synthesizer. f) The synthesized DNA segment(s) is/are then be kept separately or can be mixed up with the enormous complex denatured DNA strands of genomic
  • DNA of human or other organism As the human genome contains about 3x10 9 nucleotide pairs, fragmented & denatured human DNA provides a very complex background for storing encrypted DNA. g) The encrypted DNA can then be transported on paper, cloths, buttons or through any other medium.
  • TATGTTTCTATTTTAC Isolation decryption of above encrypted DNA sequence TATGTTTCTATTTTAC : a) Isolation and amplification of encrypted DNA is done using known primers flanked at each end by PCR method. b) Retrieved SM DNA is sequenced using DNA sequencer. c) Obtained sequence is interpreted (integrated if multi-segment before interpretation) using DNASTORE software. The basis for retrieval is a string of 4-bases each at a time is taken and matched with array as generated in step 1 of encryption and storage. The element number of matching value is taken and converted to its ASCII equivalent. If the retrieved sequence is TATGTTTCTATTTTAC. The Decryption would be: first 4-bases i.e.

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Theoretical Computer Science (AREA)
  • Biophysics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Health & Medical Sciences (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Evolutionary Biology (AREA)
  • Chemical & Material Sciences (AREA)
  • Mathematical Physics (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Nanotechnology (AREA)
  • General Health & Medical Sciences (AREA)
  • Genetics & Genomics (AREA)
  • Computing Systems (AREA)
  • Biotechnology (AREA)
  • Artificial Intelligence (AREA)
  • Biomedical Technology (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Evolutionary Computation (AREA)
  • Molecular Biology (AREA)
  • Medical Informatics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Proteomics, Peptides & Aminoacids (AREA)
  • Software Systems (AREA)
  • Crystallography & Structural Chemistry (AREA)
  • Analytical Chemistry (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
  • Storage Device Security (AREA)

Abstract

DNA is a natural molecular level storage device. Molecular storage devices use each molecule or part of it for storing a character. Thus it is possible to store information million of times than presently used storage devices. For example a JPEG image (i.e. flag of India) having file size of 1981 Bytes can be encrypted using 7924 DNA bases which occupies about 2694.16 nanometers. In other words flag of India can be encrypted 8.07x105 times in human genome which comprises 6.4x109 DNA bases and occupy a tiny volume of about 0.02 µm3. A method for storing information in DNA has been developed which includes software and a set of schemes to encrypt, store and decrypt information in terms of DNA bases. The main advantages of the present method over exiting art is that it addresses complete set of extended ASCII characters set and thereby, encryption of all kind of digital information (text, image, audio etc.). First of all, information is encrypted along with carefully designed sequences known as header and tail primers at both the ends of actual encrypted information. This encrypted sequence is then synthesized and mixed up with the enormous complex denatured DNA strands of genomic DNA of human or other organism.

Description

A METHOD FOR STORING INFORMATION IN DNA Field of the invention
The present invention relates to a method for storing information in DNA. The method of invention comprises storing information in DNA. The present invention addresses storage for all kind of digital information whether it is a text file, an image file or an audio file. Large sequences are divided into multiple segments. Background of the invention
DNA is the best molecular electronic device ever produced on the earth because DNA can store, process and provide information for growth and maintenance of living system. All living species are as a result of single cell produced during reproduction. In most of the cases this single cell does not have most of the materials required for fabricating a living system but contains all the information and processing capability to fabricate living spaces by taking materials from environment, for example, fabrication of baby from Zygote which contains rearranged DNA sequences of parents. DNA is ready to use nanowire of 2 nm and can be synthesized in any sequence of four bases i.e. ATGC. DNA of every living organism (micro/macro) consist of large number of DNA segments where each segment represents a processor to execute a particular biological process for growth and maintaining life. Other important characteristics of DNA which makes it material of choice for future molecular devices are: DNA the building block of life, can store information for billion of years, . The tremendous information storage capacity of DNA can be imagined from the fact that 1 gram of DNA contains as much information as 1 trillion CD's1' four bases (A,T,G,C) instead of 0 and 1, extremely energy efficient (1019 operations per joule), synthesis of any imaginable sequence is possible and semiconductor are approaching limit.
Clelland et al., 1999[2], and Bancroft, et al. 2001 [3] [U.S. patent no. 6,312,911], have developed the DNA based steganographic technique for sending the secret messages. Although their prime objective was steganography (the art of information hiding), they used DNA as storage an transmission device for secret message. They encrypted the plaintext message into the DNA sequences and retrieved the message using the encryption/decryption key. They used three DNA bases for representing a single alphanumeric character, as DNA has 4 bases (A, T, C, G) so a maximum of 64 (4x4x4) ASCII character can be formed using this scheme. Whereas, a total of 256 extended ASCII characters are required to represent complete set of digital information. Hence, Clelland' s scheme cannot be used to address complete set of digital information and has limited scope. Objects of the invention
The main object of the present invention is to develop a comprehensive DNA based information storage technique.
Another object of the present invention is to encrypt complete extended ASCII character set in terms of minimum number of DNA bases.
Another object of the present invention is to develop software to encrypt/decrypt data in terms DNA bases.
Yet another object of the present invention is to design suitable primers to be flanked at both ends of the encrypted and synthesized information. Summary of the invention
The present invention provides a method for storing information in DNA. The method of invention comprises storing information in DNA. The present invention addresses storage for all kind of digital information whether it is a text file, an image file or an audio file. Large sequences are divided into multiple segments Brief description of the accompanying drawings
Fig. la. Information storage in DNA. Structure of prototypical single segment information storage in DNA strand. Fig. lb. Information storage in DNA. Structure of prototypical multi segment information storage in DNA strand. Fig. 2. Encryption of extended ASCII character set in terms of DNA bases
Fig.3. Encryption Key. Extended ASCII characters in terms of DNA strands
Fig.4. Process sheet for encryption & storage
Fig.5. Process summary
Detailed description of the invention The present invention provides a method for storing information in DNA. The method of invention comprises storing information in DNA. The present invention addresses storage for all kind of digital information whether it is a text file, an image file or an audio file. Large sequences are divided into multiple segments
The method enables the storage of information in DNA. In another embodiment a software based on the above method enables all 256 Extended ASCII characters to be defined in terms of DNA sequences. The basic concept used is to take minimum number of bases to define each Extended ASCII character. With simple permutation we have 4 sequences combinations with one base i.e. A, T, G, C. Similarly, with 2 bases we have 4x4=16 different sequences, with three bases we get 4x4x4=64 distinct sequences and four bases give 4x4x4x4=256 distinct sequences. Therefore, with a set of 4 bases, complete extended ASCII set has been encoded. Software named as "DNASTORE" has been developed in Visual Basic 6.0 for encryption and decryption of digital information in terms of DNA bases. Using DNASTORE complete extended ASCII character set can be encoded 256 different ways. In yet another embodiment in our scheme, plain text/image or any digital information is encrypted in terms of DNA sequences using encryption key (software). If the information overflows the limits i.e. it cannot be synthesized in a single piece then it is encrypted and fragmented in a number of segments. Synthesis of encrypted sequence(s) is carried out using DNA synthesizer. In yet another embodiment a fixed number of different DNA primers sequence have been designed and assigned a number, which resembles the segment position it represents e.g. segment 1, segment 2 segment n. These are called as header primers. Two tail primers have also been designed one resembles continuation and other resembles termination segment. In yet another embodiment the DNA segment(s) is/are flanked by known PCR primers [as described earlier] at both the ends i.e. header primers are attached at the beginning of segment and tail primers are attached at the end of the segment. If there is only one segment, at the beginning it is flanked by header primer number 1 and at the end it is flanked by termination tail primer. However, if there are more than one segments, each segment would be attached with header primers numbered as 1, 2,3... n respectively, at the end these would be attached with a continuation tail primer except for last segment which would be attached with a termination tail primer.
The SM DNA is then mixed with the enormous complex denatured DNA strands of genomic DNA of human or other organism. As the human genome contains about 3x109 nucleotide pairs, fragmented & denatured human DNA provides a very complex background for storing the encrypted DNA. The DNA can be stored and transported on paper, cloths, buttons etc.
In still another embodiment only a recipient knowing the sequences of both the primers [starting and tail] would be able to extract the message, using PCR to isolate & amplify the encrypted DNA strand. Isolated and amphfied DNA can then be sequenced using automated DNA sequencer. The DNA sequence obtained can then be converted into digital message using encryption/decryption key (software key). In yet another embodiment the key is helpful in the secret & secure transfer of information particularly for spying and military purposes. It may also be helpful in anti-theft, anti- counterfeiting, product authentication, copyright infringements etc. Table 1. Comparison of present art with existing art
Figure imgf000006_0001
Example 1. Encryption and decryption of a textual message "CSIR" in terms of DNA bases may be defined as a) Generation of an array of 256 elements (unique 4-base per character i.e. ATGC, ATGA, ATGT, ATGG). These elements represent complete extended ASCII character set values. b) The input information is then encrypted character-by-character using array generated in step 1. The basis is ASCII values of each character is matched with the element no. of the array of step 1.
Encryption of the text "CSIR" in terms of DNA bases may be: TATGTTTCTATTTTAC where
C is represented by DNA sequence TATG
S is represented by DNA sequence TTTC
I is represented by DNA sequence TATT
R is represented by DNA sequence TTAC c) If the information overflows the limits i.e. it cannot be synthesized in a single piece or because of any other problem, then the encrypted sequence is fragmented in a number segments. d) Encrypted segment(s) is/are then flanked on each side with header and tail primers. e) Synthesis of encrypted sequence(s) is then carried out using DNA synthesizer. f) The synthesized DNA segment(s) is/are then be kept separately or can be mixed up with the enormous complex denatured DNA strands of genomic
DNA of human or other organism. As the human genome contains about 3x109 nucleotide pairs, fragmented & denatured human DNA provides a very complex background for storing encrypted DNA. g) The encrypted DNA can then be transported on paper, cloths, buttons or through any other medium.
Isolation decryption of above encrypted DNA sequence TATGTTTCTATTTTAC : a) Isolation and amplification of encrypted DNA is done using known primers flanked at each end by PCR method. b) Retrieved SM DNA is sequenced using DNA sequencer. c) Obtained sequence is interpreted (integrated if multi-segment before interpretation) using DNASTORE software. The basis for retrieval is a string of 4-bases each at a time is taken and matched with array as generated in step 1 of encryption and storage. The element number of matching value is taken and converted to its ASCII equivalent. If the retrieved sequence is TATGTTTCTATTTTAC. The Decryption would be: first 4-bases i.e. "TATG" would be in the array storage and encryption 67 = C next 4-bases i.e. "TTTC" would be in the array of storage and encryption 83 = S next 4-bases i.e. "TATT" would be in the array storage and encryption 73 = I next 4-bases i.e. "TTAC" would be in the array of encryption 67 = R Integration of above decrypted values in the same sequence as retrieved is "CSIR". Example 2. Some examples of DNA encryption for textual data
Digital Information Encrypted DNA sequence
WELCOME TTAGTACATAGCTATGTACCTAACTACA
WORLD PEACE TTAGTACCTTACTAGCTATAAGCTTTCCTACATAGG TATGTACA
INDIA TATTTATCTATATATTTAGG
CSIR TATGTTTCTATTTTAC
CSIO TATGTTTCTATTTACC
Example 3. A JPEG image encrypted in term of DNA bases
Digital Information Encrypted DNA sequence
TAAATATTTAGAAAACAATCTCGTGGCGATCGCGC
CATCGGCTAACCTATCGATCGCTGGTCGCGTATCAA
CAATCGTCGGTCGGTCGCGCCCTACGGGCTCTTCGA
ACCCCGTAGGCGACACGGCGCGGCGGATGATTGTC
GCCTTGCTACCCGTGGTGCGCCCAGACCTTCGACGC
TCCTGGTACCTGCGCCTCATCGTTATCTTTGTTGGA
GTGCAAGATGGAGAGTTTCCCGGACGGGTAGCAAG
CCTGCGTAATATCTCCAAATGTCCAAAGCTTATTGT
TTTCAATAACGTGATCCTTTACCTGCACATTAGTAT
TATCACCAGCGTGCACCCATGCGGGCGCCAACCTT
GCTGGACTTCGACGCCGCTGTCGTTGCCCTCTGAGT
GAATGATTGTGCCCACTGTGGTGGGGCGCCTAGTC
GGTCGGTCGAGGTGTTCATTAATGGATCGATCGAC CTATCGAGGAATCGATCGATCGATCGGGCGATCGC
GCCATCGATCGATCAGTCGTCCTACGCCGGCTCTCT
CTGCATTTCAGCTCGCTTATCGAGAGGCCTGTGCAA
GGAGCCCTGTTACATTGGGCTATCTAAGACATGGG
GACAGTCGGCCGACAGAGTATAATAGGAACCACGC
Figure imgf000009_0001
CTAATGGATAACAGCTTTCGAAACCCACTCCAGAG
CCTGTTTACTCTAATTGGCTCCGGGGCTGATGGTGA
GGGCTGTGAACCCGGACTCCCAGCCTAGGGAGTAC
AGACCATGATCCCTATGCCGGATTAGCCCTAGGCT
GTCACACTAAGCTATCCTCAGCGTGAGCGTGTCCG
GACTTCGCAGGCTGTGCGTCTTGAGTGCGCGAGTG
GACGGGCGTGCGGATCCGCGCACGAACGCTTCGTC
GTTCGGTCGTCTTCACGACCGCCCAACTTTCCAGCC
ATCCAGGTAGCCACGCAAGCACATACACATACAGA
CATTTTATAATCCACTCTATTATCCAATCTTTCTGCT
GATCTGTCTACCTCGTAGGCTCCCTGGCTTAAGTGC
TAACTCACCAAAGTCCCGACCTACCAACCCTCCGTC
TTACCACCCTCCTCGCCGCCCGGCTGCCCTGCCCGC
TATGCGGGCAGCATTGCTAGCCACACAGCAAGCAT
CAGGGCCTGCGTCAACGCACGCTCCGTCGGCCGGG
CCGCTGGTCGGTGCGGAGGGGGGAGCGAGGGTAG
GCATGTGGGGTGGATCGCGCTTGGACTCCTCGGCT
GATTTGCTGACCGAGCCGTAGAATGATGCTCAGAA
GGAGATCGAGATAGACACGATACTTATCAGTCTGT
GTGTATGTACGTTCGTCCGTGCGTGGGTAGGTTGGT
CGATCGATTGATCTACGTTAATCCCACTCTGCGGCG
TGACATAATGAATTACCCGCCGCCCACTGTGCTGCG
AAACCCAGTTTACTCAGTTAATCCGACTATGCCACG
GTACAAAATATCCGGGGTGCATCCGACTTTGCAAA
TGAATCTAAAGCGCTACGTTATTGTAAAGATCGTA
ATTAACGAAGCGGTCGTTAATTAATCTGAGGTGCA
GATGAATACATTTAAACCATGCAGTTATTCATCAGT
CGCATCGCAAACTTGTAGACGCTGAATATTAGGTA
TGATTAATGATACGCGTGATGACAATTACGTGTTTA AGCGCAATTAATTCTGGTAGCGTTATGCCTGTCAAG GCGGTCCTACAACTAGGTTCGATCCTTACGACTGGA AGATGGCTCTACACACGGACCCCCCAAACCAATTA TAGTTACCTAGTCCTTAAAAACCATACTAGTTTGGC TTTATTGATACTAAGACTAAGCTTACGTCCTGACTC GCGATTAATGGACACACGTTTCCTGACAAGCTCCTC GGGGGCCATATATATGCCTGACGCCAGAAACTGGT CTCATTCTCGATATGAAGCGACCCAAAGCGCGGTG TATCGTTGTCGAATCCAACTAAGATGCATCGCGCGC GGCGGATCAATCTTACGAGACTCAGGTACTAGTGG TATCGTGGCTGCCTTGTGACGCTTAAATCGTACTTC GTCGCGATTGATTGTATTATAAACAATCAGCAAATT AAATCGATGGCGGACTTTATAAAGCTAAACTACGC CTTTAAGTTACGCGCTGTGAGCAGCTGAGGCCGGTT CCTTAAGTTCCATACATTCTATCAATAGCGCTTCCT GCCTAGGTATGGGCTCTAGGGCTATCTTGCTAAAGT TGACTCAGAGAGAATTACCTCGGAATAAAACAACA CGCGGCAGTCAGATTTTGTCACTATTTTTACGTAAC TAGGGTGATCTCCGGAATGTCAACTCCGGGCCCCC ACACGATGGTGGAGATCTCCTCGCCCGTGGGCTTCT GGACTAGACGTTAGGGCATGCACATACGTTGACGA AATTGTTACGCGGAGACGATAGAATTTATAACCTTT CCACCATCTAGTATGAGGGATTCATACGCTGCCCTT CTCCTAATAGGAACGTACACTAAATTAATTGCCGTG CTACCAATGCGACTACTTTGGGATAACGGCCTGCG GTTGTCGTCGGGTGAACTATCCTATCGTTCGACTCT ATAGCAAGGCTTATCGTGCTAACTAATTTACATAGT AGGACTATCGCCACACGGGATGCACATACCCGACT ATCGGGTCCCAGAGACTACGTTGAGGAAAGCCAGG CTTAGTTTTACACATTAACCGATGGCGTGACGGGG ACTTTGTCGTCGGTACATAATCGTCAGGTCATCAAT TCCTGCTGATATGGCGAAATTGCTGAGTATCTCTAT GGACTAACAACTGCTAGGTGCTCTGGAGCCGACCG CCGCGACATACAAGATAGACACGTCTAAACAGCTC GTTTTCATCAACACCATCGTGCATGCCGATCGACGT
GGCACAAACAAATTGAATAGAAGGCATACTATATC
GTCTACTTGGTATGGGGCACCTTGCCGTCCAAAACC
GTTCGAAAAAAGATCTGTTTCTAATTCATCGTCAGT
CGATTTGAAATTCTCTCCCCATACGCATGGACGCAA
TAAGTATCGATTGGACACCTCCTCCCAGGTTCAATG
TGAAGTGACATCGCAACATGAACCCCGCGGGGACA
GAATGCAGTCTTCCCTGCTTAATCTCGTTGGGTACA
GCTGAAATGCAGTCAGGCGCGGATGGGGGCCCCTC
ACGGGATATGGTGATAATGTTTACTAGCTTTACACG
TTTCTAGCAGAATTGCGAAATGACGATAGCCTTCCA
CGCATATGTCCTTGCCTCTCACATCCGAATTGGCGA
TGGATGTCTCTAAATGAATTCTTATGGTCGCGACTT
TAACGCTTCCAAGATAACAACAGATGGTGCTCCTG
AATCACATCTCCTTTGATCTTGACATGGTTCCACCC
TGTTCCCCGGGCCAACCCGTTAAGCCTTACTATGTG
ATTCGACCTAATATGGATAGTCCATCCGGCCATCCG
TGTACAATAATCCACAGACTCTGTAATTTAGAATTA
CATGCACTCCTCTCATCGTATCGGCCTAATGCTAGG
ATCGGGTGCGCGATTATACGGCAACTCTGTCGATG
GCCTAGGTTGAAGGGGGATCAACACGGTGTACATA
GGCCCTACAGCTGACGTTCACGTATGATGAATGCTT
CCTCAATGTAATGCTCGAATCGAGAATTCTCAGTCT
TAAGGGCAGCCATCGGAGCACGTGGCGCGGCAATA
TTGATTATGACAGAGCTATACAGCCCACTCGGGCG
ATAGACTGCTGAGACGCAAACGTGATATTAATTAC
GATGGCTAGCATTCGACATATCATAATCAGATATTG
GGTTTAGGACCTTTATCGCAGTATTAGTACGATTTG
GTGCTGTGCGAAATCTTATGTGCGCGTGCGAAACA
ATATATTGTTCGAAGTGATATGGGATAGGTCAGTGT
CATATAATGTAAATCGGTTCGTCTGACGCGATTTAA
GGCTCACATTGTTATCGCTAATCGGGATGAACGGCT
CAAGTGCAGCATGGCACCAAGATTCCGAGGGCAAA
CGCCGCACAGTGAGGTTTGGCTCTCCCCTCTAATAT CTTACACGTTTGTGGGATTATAGGGATCACATGGCC
ACGGCCTGTAATATTGTCATGTAGCCCGGATGATAC
CGGAATACTAAAATTGGAGGGGTTCTAGGTCATGC
TAACTGCTCGGGGCTCATGGAGTTGTAGAGTTATCA
ACAGGATCTCGGAATTCCCGTAAGCGGGATCTCCTT
GCCGATAAGTTTGTGCTGCTGCCCGTCTTCGCGCCG
GAACGCGCTTCCAAATTCTCCCTACTAACGCATGCT
GATGCACCATTGGAGCATTCTGGGATGGGCGTTTAT
CGAAACGAGTGTTTGTCTATAATGCATGACGAGGT
CTCTGCTGGGTAGAATTGGTGATTTGGAAGCGATA
CGGGTTATAGTCTCACGTACTGATGGACTAGTATGC
GTGAAGGAATCGAATACTTCGACACGATGACGTAG
GGAGCCACGCGATCAAGGACTGCCCAGTGGTCTAC
TATCTATCTTCAACAGATTGAGGGGGAGCGGTGCC
GCTGATTTAATTTTAGCATCGGTCGCTGGTTAACTT
TTAGTATCGCGCCTTTAAAGAATCTAATCTCCGTTA
GTGTCGGGTTGATTTTCTGCGAAATAGAACTAATTC
AATTGCTTATCTGCTTGATCGATTCGGAAGCCAGGG
TGGGTAGGGTAGTTACGTACGCCTGAATCTGAACC
ATCAGTCGTAATGAATTACTGAAGACGCGCGATGC
CTGGATAAAATTATCGCCTATGTCCCAACTAATGGC
ACGACAGGCTCAGAGCATGCTACTGTGTAGTGAGA
TCCGCTTATCGCCCCATTCGTGGTCGCGTTATGCCA
CTGAGTAACAAGTGATGTCCAGTGTCTAATACGAC
CGCTCGGGTCGATGGTCAAGCGGCACAGTGACATT
AACTTTTGCTTTCACATTGAACAAATTCTCCCACTT
CAGCACATGTACCCCCTGCTGCATACAGACCAGGT
CTTTTGTCCACACCTTGCACGGGTGCCTGAATGCCT
TTCCGCTGGCCTAAGCCAGTGACGTGAATGTAAAG
AGCGCTCGCACTGTAGTCATGGAGAATTATAATCG
ATAGATAAATACGTGGCGCACCACCCCAACATCCT
CGCGGGCTGTTACTAGAAATTGTGTATACCGTGGG
GGTGATTAAAAAATGGTGAGACGTGCTGTATGGTC
TTTGTGATCTCTGCTACTATTGGGTGCTGCATAAAT CGTACCTCCAACTTGAGGCATCATAGCTACGGAAC
CCGTAAAATTGGTCATATACGCAAACACAACAGTA
AGTAGGTGGAGCCGAAGTGCTCTCGTGGCCGAAGA
CAACAACCTTTGCCCATGCCTTAAAGACTGCGTGAT
AACCGTCTTCCCATCAGGAGGTGAAGGCGATATGG
TAATCTATAGGTATTGATGGCAAGAGGTCGGAACC
CAGCTTACTCGATAGCGTTGTCGATCGCGCTTCCTG
TGCTCCTTCCTACAAAGTGGGATAGCATCATAGAC
AGGCATCCGGGTCCAATCGCCGAACGCGTCACGCA
TCGCATGATTAATTACAGTGTCGCATTACATCTAGT
ATGTATTAGGTGGGCACCGCGGTACAGCATGGACA
GGCGCTCACGGACACAAAAACGCGTCAACAAAAGT
TAGGTATGGGTGGCGCCAGGTGAAAACGCCAGCTC
TGCTATGGTCCTAAGTAATTGCAGCATGTCTTGAGA
TCTCATAGCTACCGTCTTCAGAACGATATTAGCTAA
CTTTCCCTTCCGTCTCATTACTTATGCGGGCTTCATC
GCGGTTACCGGCTGGTAAGATACGTAAGCTACACT
AGTAAGCATACTGCAGGTATGAGCCGATCCTGCAA
TTACCCATATTGGTTTTTGTATTTACACGTATGGCG
ATTACACTTCTTAAACTAGAACTCGTTTACTAATTC
TTCGTTCATACTCATGGCAATAGCATGATCTCGTAT
TACCATGTTATACGTAGTCATAGTGTGCCAACAGTA
CGTTAACCTACAATGCTCCACGCCGACCTTGTAGAA
CAGCATGATACTATATACCCGGGCATCGCGCACCG
ATAACTGCAGATCATGGAATGACCGCTCTACGTGG
ATTTAACTCGGGTGGCCCTATAGATAAATATTCTTA
CCACCGCCCTGGGATATATAGGCCGTCAGCACGTTT
ATGTCCTAGTACGCAGTACGCGCCTATTAATATAAC
AGCTGTCAGTAAGGGTCCAGAATTCTAGGGCCGAT
GAATTACAAGCAGGTGAATAGATACGATTGGGATA
TTATCACAACAACTCGCGAATGGATTATCAGTACG
AGCCACGGCCCAGCACATTATTCACCAACGGGATT
AGGTGACGCCAGTGCGTGCTGCTACTACAATGCAT
CGCGGGTGTTGACGGTTAAGGTAGCTCGGGCGCGA TAGATGATACTGGCCCGAGACCAGTTTCTCTATATT
AACCTAGTAAGACAGGCCTGGCCCGGAAACCGTTT
CTGTACCCCGACCTAGTATAAGACTACTGGGCCGCT
AGCGGACTATTGACAAATCGCGCGTAGAAAATGCC
TGGGCCGTCTGCCGTCGGTTTCTTTAGCTATACCTT
GTAATTAAATACTGGACCAACCACAGTTTCTTCAGA
GTAACCTTGTACTTTAGGCCTTTACATCGTCCTCCTT
CTCCAACACGACCTTGTAGCTCACTACTGGTCCACA
GGCAGTTTCTTCAGCACCAGCTTGTATCTGATGCCT
GGTCCATTGTCCCCTTCTCCAATCGTAGCTTGTTCC
CGAATACTGGTGCTATGCCTAATTCTAGTAGATAAC
CTCGTTACCAAGCTCGTTTGCTTCAAAAGTCTCTTG
TTCCCGACGACGTAGCCAATAGCGGGCGCTCGTTC
AGTCTCTCGAGCTCTCCAGCGTTGGCCATGCCTTTC
GCTAGTCCGCCCTCTGGTCCTATACCTGGTTCCCCC
GAGCGGGGGCCAACACACACGCTGCTCTCAAAGCT
GGTTCAGGAGCGCTGGACCCTTCCAAGTCTCTAATG
CAGTCTCTAGTTGAGATTTACTGGAGCCATGCTCCC
CTCTTATGACAACTGAGGTTATGTTAGCCTGGAGCT
TAGATACCCTCTCACGCGCCCTGACGTTCTATTGTA
GTGGAACTACATTCCCGTCCCACGATAACTGACGTC
GTACTCGCGTGGAACACTAGTACCGTCCGACACCG
GCGGATGTCTTAGTTTAGTGGTACTTGTCGCCCTTC
CAACAAAAGAAGACGTCTCAATAGCGTGGTACCGT
TTTTCCGTCCTACTCTCACGGAGATCACTATGTAGT
TTCAGCGTCAGGGTGTCCTTTAAAACATAGAATCCG
TTAGGAGGTTTAGGGGCCCCCCGTCCCTCTCACGAC
GAAATAATAAATAGGGGGGAGCTCGGACCCGTCCG
TCATACCAGAGAATCTAAGGGCTGGGGGAGGATTA
GACCGTCCATCCTGTCAAAGGATGCACGTGCAGAG
GAAGAGTACACCCATCCCAGCGAAAAGTCTATCCT
CATCCTGGGGGTCCTGAAAACCATCCTCTGTCTGAG
AGTATGTTGAGGAGCGGGATGATGGCGACCCTCCC
CAACCGGGGCCCTCTGGTCCGCCTATAGTTTCAGAG ATGAATTAGCTAAGGTTGTAGCTTATTTTCCATAGG
GTTTTGCTCCGGACCATCCGGTCGTGTAGCGCGATT
GACTTGCCGGGTTGTGTCCCCGTATCCAGGTCACGA
CCTCATGGGGAACTAGTGGCTGTCCGGCAGTATCCT
GGTACGCACCTCATGTGGTATGCGTGGCTGTTGGTC
CGTATATGGACCTATATATGGATCGAAGC
JPEG image of Indian Flag File Size = 1981 Bytes
DNA bases = 7924
In example. 2, a JPEG image if Indian Flag having file size of 1981 Bytes have been encrypted in terms of DNA bases. A total of 7924 DNA bases (4-base/Byte) are required to encrypt the complete image. Since the sequence is large, fragmenting the sequence into smaller segments is required. REFERENCES
1. Lalit M Bharadwaj*, Amol P Bhondekar, Awdhesh K. Shukla, Vijayender Bhalla and R P Bajpai. DNA-Based High-Density Memory Devices And Biomolecular Electronics At CSIO. Proc. SPIE: vol.4937, pp 319-325 (2002).
1. Clelland, C.T., Risea, V. & Bancroft, C. Hiding messages in DNA microdots. Nature. 399, 533-534 (1999).
2. Bancroft, et al. DNA-based steganography. U.S.Patent no. 6,312,911, November 2001.

Claims

CLAIMS:
1. A method for storing information in DNA using a unique sequence of 4-DNA bases for representing each character of extended ASCII character set comprising:
(a) producing a synthetic DNA molecule comprising encrypted digital information that can be decoded with the use of an encryption key, flanked on each side by a primer sequence; and
(b) storing the DNA molecule in a storage DNA, which consists of a mixture of homogenous/heterogeneous DNA.
2. The method of claim 1 wherein the storage DNA is genomic DNA.
3. The method of claim 2 wherein the storage DNA is human DNA or any other organism' s DNA.
4. The method of claim 1 wherein the storage DNA is synthetic.
5. The method of claim 1 wherein a software is provided to enable all 256 Extended ASCII characters to be defined in terms of DNA sequences.
6. The method of claim 1 wherein a minimum number of bases define each extended ASCII character.
7. The method of claim 1 wherein 4 sequences combinations result from one base A, T, G, C.
8. The method of claim 1 wherein with 2 bases 16 (4x4) different sequences are , obtained.
9. The method of claim 1 wherein with three bases 64 (4x4x4) distinct sequences are obtained.
10. The method of claim 1 wherein with four bases 256 (4x4x4x4) distinct sequences are obtained.
11. The method of claim 1 wherein plain text/image or any digital information is encrypted in terms of DNA sequences using an encryption key software.
12. The method of claim 11 wherein the information is encrypted and fragmented in a number of segments if the information overflows the limits and cannot be synthesized in a single piece.
13. The method of claim 1 wherein synthesis of encrypted sequence(s) is carried out using DNA synthesizer.
14. The method of claim 1 wherein with a fixed number of different DNA primers sequence assigned a number, which resembles the segment position they represent.
15. The method of claim 1 wherein two tail primers are also provided, one of which resembles a continuation and other resembles termination segment.
16. The method of claim 1 wherein the DNA segment(s) is/are flanked by PCR primers at both ends with the header primers being attached at the beginning of segment and tail primers being attached at the end of the segment.
17. The method of claim 1 wherein SM DNA is mixed with complex denatured DNA strands of genomic DNA of human or other organism.
18. The method of claim 1 wherein a recipient knowing the sequences of both the primers [starting and tail] extracts the message, using PCR to isolate and amplify the encrypted DNA strand, followed by isolation and amplification of the DNA and sequencing using automated DNA sequencer, thereafter conversion of the DNA sequence obtained into digital message using encryption/decryption key.
19. A DNA molecule comprising an encrypted DNA sequence that can be decoded with the use of an encryption key, flanked on each side by polymerase chain reaction primer sequences wherein amplification of the DNA molecule and determination of the secret message DNA sequence and use of an encryption key, results in a decryption of the message,
20. A method as claimed in claim 1 where the method of encryption comprises: a) encryption of a plain text/image or any digital information in terms of DNA sequences using encryption key, which first generates an array of 256 elements (unique 4-base per character), representing complete extended ASCII character set values; b) encrypting of input information character-by-character using an array by matching the ASCII values of each character with the element number of the array; c) fragmenting the encrypted sequence into a number of segments if the information overflows the limits and cannot be synthesized in a single DNA length; d) flanking of the encrypted segment(s) on each side with header and tail primers; e) synthesising of encrypted sequence(s) using DNA synthesizer; f) mixing the synthesized DNA segment(s) with complex denatured DNA strands of genomic DNA of human or other organism; g) transporting the encrypted DNA. h) Decrypting the encrypting DNA at the recipient end.
21. A method as claimed in claim 20 where the method of decryption comprises: a) Isolation and amplification of encrypted DNA using known primers flanked at each end by PCR method; b) sequencing of the retrieved encrypted DNA using DNA sequencer; c) interpreting the obtained sequence after integration of multi-segment, if required using a pre-determined encryption key;
PCT/IB2004/000975 2003-03-31 2004-03-31 A method for storing information in dna WO2004088585A2 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
JP2006506420A JP2006522356A (en) 2003-03-31 2004-03-31 Method for storing information in DNA
GB0521163A GB2416232A (en) 2003-03-31 2004-03-31 A method for storing information in DNA

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US45914003P 2003-03-31 2003-03-31
US60/459,140 2003-03-31

Publications (3)

Publication Number Publication Date
WO2004088585A2 true WO2004088585A2 (en) 2004-10-14
WO2004088585A8 WO2004088585A8 (en) 2005-04-28
WO2004088585A3 WO2004088585A3 (en) 2005-06-02

Family

ID=33131865

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/IB2004/000975 WO2004088585A2 (en) 2003-03-31 2004-03-31 A method for storing information in dna

Country Status (4)

Country Link
US (1) US20050053968A1 (en)
JP (1) JP2006522356A (en)
GB (1) GB2416232A (en)
WO (1) WO2004088585A2 (en)

Cited By (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE102007057802B3 (en) * 2007-11-30 2009-06-10 Geneart Ag Steganographic embedding of information in coding genes
CN103310157A (en) * 2012-03-14 2013-09-18 大连大学 Reverse transcriptase-deoxyribose nucleic acid (RT-DNA) cellular automaton-based image encryption method
CN102025482B (en) * 2009-09-10 2014-02-12 中国科学院华南植物园 Construction method of virtual genome-based cryptosystem (VGC)
WO2014014991A3 (en) * 2012-07-19 2014-03-27 President And Fellows Of Harvard College Methods of storing information using nucleic acids
CN104050625A (en) * 2014-07-03 2014-09-17 东华大学 Compound chaos image encryption method with initial secret keys built through plaintext
CN105701755A (en) * 2015-08-20 2016-06-22 大连大学 Image encryption method based on fractal and DNA sequence operation
EP3098742A1 (en) * 2015-05-26 2016-11-30 Thomson Licensing Method and apparatus for creating a plurality of oligos with a targeted distribution of nucleotide types
US9928869B2 (en) 2015-07-13 2018-03-27 President And Fellows Of Harvard College Methods for retrievable information storage using nucleic acids
WO2018081745A1 (en) 2016-10-31 2018-05-03 Dodo Omnidata, Inc. Methods, compositions, and devices for information storage
WO2020005598A1 (en) * 2018-06-29 2020-01-02 Microsoft Technology Licensing, Llc Whole pool amplification and in-sequencer random-access of data encoded by polynucleotides
US10640822B2 (en) 2016-02-29 2020-05-05 Iridia, Inc. Systems and methods for writing, reading, and controlling data stored in a polymer
US10859562B2 (en) 2016-02-29 2020-12-08 Iridia, Inc. Methods, compositions, and devices for information storage
US11017170B2 (en) 2018-09-27 2021-05-25 At&T Intellectual Property I, L.P. Encoding and storing text using DNA sequences
CN113604546A (en) * 2015-09-22 2021-11-05 特韦斯特生物科学公司 Flexible substrates for nucleic acid synthesis
AU2020202857B2 (en) * 2012-06-01 2022-03-17 European Molecular Biology Laboratory High-Capacity Storage of Digital Information in DNA
US11785004B2 (en) 2014-09-03 2023-10-10 Nanthealth, Inc. Synthetic genomic variant-based secure transaction devices, systems and methods
US11837302B1 (en) 2020-08-07 2023-12-05 Iridia, Inc. Systems and methods for writing and reading data stored in a polymer using nano-channels

Families Citing this family (77)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TWI326308B (en) * 2003-08-06 2010-06-21 Applied Dna Sciences Inc British Virgin Island Apdn B V I Inc Method for concealing a secret information carried within a dna molecule and method for decoding thereof
US9101279B2 (en) 2006-02-15 2015-08-11 Virtual Video Reality By Ritchey, Llc Mobile user borne brain activity data and surrounding environment data correlation system
US10741034B2 (en) 2006-05-19 2020-08-11 Apdn (B.V.I.) Inc. Security system and method of marking an inventory item and/or person in the vicinity
US8769491B1 (en) * 2007-11-08 2014-07-01 The Mathworks, Inc. Annotations for dynamic dispatch of threads from scripting language code
KR101420683B1 (en) * 2007-12-24 2014-07-17 삼성전자주식회사 Method and System of Encrypting/Deciphering Information of Microarray
US20090325212A1 (en) * 2008-06-27 2009-12-31 Microsoft Corporation Data standard for biomaterials
US8053744B2 (en) * 2009-04-13 2011-11-08 Src, Inc. Location analysis using nucleic acid-labeled tags
US8703493B2 (en) 2010-06-15 2014-04-22 Src, Inc. Location analysis using fire retardant-protected nucleic acid-labeled tags
US8412462B1 (en) 2010-06-25 2013-04-02 Annai Systems, Inc. Methods and systems for processing genomic data
US8716027B2 (en) 2010-08-03 2014-05-06 Src, Inc. Nucleic acid-labeled tags associated with odorant
WO2012031029A2 (en) * 2010-08-31 2012-03-08 Lawrence Ganeshalingam Method and systems for processing polymeric sequence data and related information
US20120230338A1 (en) * 2011-03-09 2012-09-13 Annai Systems, Inc. Biological data networks and methods therefor
CA2872017A1 (en) * 2012-05-09 2013-11-14 Apdn (B.V.I.) Inc. Verification of physical encryption taggants using digital representatives and authentications thereof
AU2013277986B2 (en) 2012-06-22 2016-12-01 Annai Systems Inc. System and method for secure, high-speed transfer of very large files
US9963740B2 (en) 2013-03-07 2018-05-08 APDN (B.V.I.), Inc. Method and device for marking articles
ES2943498T3 (en) 2013-08-05 2023-06-13 Twist Bioscience Corp de novo synthesized libraries
US9904734B2 (en) 2013-10-07 2018-02-27 Apdn (B.V.I.) Inc. Multimode image and spectral reader
CA2940655C (en) 2014-03-18 2020-07-07 Apdn (B.V.I.) Inc. Encrypted optical markers for security applications
US10745825B2 (en) 2014-03-18 2020-08-18 Apdn (B.V.I.) Inc. Encrypted optical markers for security applications
US10020826B2 (en) * 2014-04-02 2018-07-10 International Business Machines Corporation Generating molecular encoding information for data storage
US10628245B2 (en) 2014-04-02 2020-04-21 Pure Storage, Inc. Monitoring of storage units in a dispersed storage network
US11347590B1 (en) 2014-04-02 2022-05-31 Pure Storage, Inc. Rebuilding data in a distributed storage network
CA2975852A1 (en) 2015-02-04 2016-08-11 Twist Bioscience Corporation Methods and devices for de novo oligonucleic acid assembly
WO2016126987A1 (en) 2015-02-04 2016-08-11 Twist Bioscience Corporation Compositions and methods for synthetic gene assembly
EP3067809A1 (en) 2015-03-13 2016-09-14 Thomson Licensing Method and apparatus for storing and selectively retrieving data encoded in nucleic acid molecules
US11164661B2 (en) 2015-04-10 2021-11-02 University Of Washington Integrated system for nucleic acid-based storage and retrieval of digital data using keys
US9981239B2 (en) 2015-04-21 2018-05-29 Twist Bioscience Corporation Devices and methods for oligonucleic acid library synthesis
US10423341B1 (en) * 2015-06-12 2019-09-24 Bahram Ghaffarzadeh Kermani Accurate and efficient DNA-based storage of electronic data
US9898579B2 (en) 2015-06-16 2018-02-20 Microsoft Technology Licensing, Llc Relational DNA operations
IL258164B (en) 2015-09-18 2022-09-01 Twist Bioscience Corp Methods for modulating protein and cellular activity and method for nucleic acid synthesis
US10566077B1 (en) 2015-11-19 2020-02-18 The Board Of Trustees Of The University Of Illinois Re-writable DNA-based digital storage with random access
EP3384077A4 (en) 2015-12-01 2019-05-08 Twist Bioscience Corporation Functionalized surfaces and preparation thereof
WO2017180302A1 (en) 2016-04-11 2017-10-19 Apdn (B.V.I.) Inc. Method of marking cellulosic products
CN109074424B (en) * 2016-05-04 2022-03-11 深圳华大生命科学研究院 Method for storing text information by using DNA, decoding method and application thereof
CN106100841B (en) * 2016-05-31 2019-02-15 江苏环亚医用科技集团股份有限公司 A kind of data encryption system and method based on molecule encryption technology
ES2817973T3 (en) 2016-07-01 2021-04-08 Microsoft Technology Licensing Llc Storage through iterative DNA editing
US11359234B2 (en) 2016-07-01 2022-06-14 Microsoft Technology Licensing, Llc Barcoding sequences for identification of gene expression
US10892034B2 (en) 2016-07-01 2021-01-12 Microsoft Technology Licensing, Llc Use of homology direct repair to record timing of a molecular event
AU2017315294B2 (en) 2016-08-22 2023-12-21 Twist Bioscience Corporation De novo synthesized nucleic acid libraries
US11177019B2 (en) * 2016-08-30 2021-11-16 Tsinghua University Method for biologically storing and restoring data
US10417457B2 (en) 2016-09-21 2019-09-17 Twist Bioscience Corporation Nucleic acid based data storage
US10995371B2 (en) 2016-10-13 2021-05-04 Apdn (B.V.I.) Inc. Composition and method of DNA marking elastomeric material
US20180108019A1 (en) * 2016-10-15 2018-04-19 Systems Imagination, Inc. Secure Encryption Using Genomic Information
US10370246B1 (en) * 2016-10-20 2019-08-06 The Board Of Trustees Of The University Of Illinois Portable and low-error DNA-based data storage
CN110024037B (en) 2016-11-30 2023-06-27 微软技术许可有限责任公司 DNA random access memory system via connection
EA201991262A1 (en) 2016-12-16 2020-04-07 Твист Байосайенс Корпорейшн LIBRARIES OF OPTIONS OF IMMUNOLOGICAL SYNAPSIS AND THEIR SYNTHESIS
US10839948B2 (en) 2016-12-29 2020-11-17 Intel Corporation Microfluidic information-encoding polymer data storage
US10793897B2 (en) 2017-02-08 2020-10-06 Microsoft Technology Licensing, Llc Primer and payload design for retrieval of stored polynucleotides
US10920274B2 (en) 2017-02-21 2021-02-16 Apdn (B.V.I.) Inc. Nucleic acid coated submicron particles for authentication
EP3586255A4 (en) 2017-02-22 2021-03-31 Twist Bioscience Corporation Nucleic acid based data storage
EP3595674A4 (en) 2017-03-15 2020-12-16 Twist Bioscience Corporation Variant libraries of the immunological synapse and synthesis thereof
US10774379B2 (en) 2017-03-15 2020-09-15 Microsoft Technology Licensing, Llc Random access of data encoded by polynucleotides
JP7169999B2 (en) 2017-06-12 2022-11-11 ツイスト バイオサイエンス コーポレーション Methods for Seamless Nucleic Acid Assembly
WO2018231864A1 (en) 2017-06-12 2018-12-20 Twist Bioscience Corporation Methods for seamless nucleic acid assembly
RU2659025C1 (en) * 2017-06-14 2018-06-26 Общество с ограниченной ответственностью "ЛЭНДИГРАД" Methods of encoding and decoding information
CN109300508B (en) * 2017-07-25 2020-08-11 南京金斯瑞生物科技有限公司 DNA data storage coding decoding method
GB2581620A (en) 2017-09-11 2020-08-26 Twist Bioscience Corp GPCR binding proteins and synthesis thereof
US11538554B1 (en) 2017-09-19 2022-12-27 The Board Of Trustees Of The Univ Of Illinois Nick-based data storage in native nucleic acids
JP7066840B2 (en) 2017-10-20 2022-05-13 ツイスト バイオサイエンス コーポレーション Heated nanowells for polynucleotide synthesis
EP3506135A1 (en) * 2017-12-29 2019-07-03 IMEC vzw Method for writing data
AU2019205269A1 (en) 2018-01-04 2020-07-30 Twist Bioscience Corporation DNA-based digital information storage
KR102138864B1 (en) * 2018-04-11 2020-07-28 경희대학교 산학협력단 Dna digital data storage device and method, and decoding method of dna digital data storage device
US11302421B2 (en) * 2018-04-13 2022-04-12 The Hong Kong Polytechnic University Data storage using peptides
EP3776557A4 (en) * 2018-04-13 2021-12-15 The Hong Kong Polytechnic University Data storage using peptides
US11106633B2 (en) * 2018-04-24 2021-08-31 EMC IP Holding Company, LLC DNA-based data center with deduplication capability
CA3100739A1 (en) 2018-05-18 2019-11-21 Twist Bioscience Corporation Polynucleotides, reagents, and methods for nucleic acid hybridization
US11515012B1 (en) 2018-09-22 2022-11-29 Mark Gordon Arnold Method and apparatus for a pipelined DNA memory hierarchy
CA3131691A1 (en) 2019-02-26 2020-09-03 Twist Bioscience Corporation Variant nucleic acid libraries for antibody optimization
KR20210143766A (en) 2019-02-26 2021-11-29 트위스트 바이오사이언스 코포레이션 Variant Nucleic Acid Libraries for the GLP1 Receptor
US11655465B1 (en) 2019-05-02 2023-05-23 Iridia, Inc. Enzymes and systems for synthesizing DNA
EP3987019A4 (en) 2019-06-21 2023-04-19 Twist Bioscience Corporation Barcode-based nucleic acid sequence assembly
US11755640B2 (en) 2019-12-20 2023-09-12 The Board Of Trustees Of The University Of Illinois DNA-based image storage and retrieval
CN111680797B (en) * 2020-05-08 2023-06-06 中国科学院计算技术研究所 DNA type printer, DNA-based data storage device and method
US11823064B2 (en) 2021-05-28 2023-11-21 Bank Of America Corporation Enterprise market volatility prediction through synthetic DNA and mutant nucleotides
US11823065B2 (en) 2021-05-28 2023-11-21 Bank Of America Corporation Enterprise market volatility predictions through synthetic DNA and mutant nucleotides
US11823066B2 (en) 2021-05-28 2023-11-21 Bank Of America Corporation Enterprise market volatility predictions through synthetic DNA and mutant nucleotides
CN115312128A (en) * 2022-03-14 2022-11-08 深圳先进技术研究院 DNA encoding method, decoding method, apparatus, terminal device and medium

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0795862A2 (en) * 1996-03-12 1997-09-17 International Business Machines Corporation Tape pre-formatting
WO2000059917A2 (en) * 1999-03-31 2000-10-12 Hilmar Rauhe Information-carrying and information-processing polymers
US6312911B1 (en) * 1999-05-06 2001-11-06 Frank Carter Bancroft DNA-based steganography
GB2376686A (en) * 2001-02-10 2002-12-24 Nat Inst Of Agricultural Botan Storage of encoded information within biological macromolecules

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6422476B1 (en) * 1993-11-05 2002-07-23 Intermec Ip Corp. Method, apparatus and character set for encoding and decoding data characters in data carriers, such as RFID tags

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0795862A2 (en) * 1996-03-12 1997-09-17 International Business Machines Corporation Tape pre-formatting
WO2000059917A2 (en) * 1999-03-31 2000-10-12 Hilmar Rauhe Information-carrying and information-processing polymers
US6312911B1 (en) * 1999-05-06 2001-11-06 Frank Carter Bancroft DNA-based steganography
GB2376686A (en) * 2001-02-10 2002-12-24 Nat Inst Of Agricultural Botan Storage of encoded information within biological macromolecules

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
BHARADWAJ L M ET AL: "DNA-BASED HIGH-DENSITY MEMORY DEVICES AND BIOMOLECULAR ELECTRONICS AT CSIO" PROCEEDINGS OF THE SPIE, SPIE, BELLINGHAM, VA, US, vol. 4937, 2002, pages 319-325, XP008041298 ISSN: 0277-786X cited in the application *

Cited By (35)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE102007057802B3 (en) * 2007-11-30 2009-06-10 Geneart Ag Steganographic embedding of information in coding genes
CN102025482B (en) * 2009-09-10 2014-02-12 中国科学院华南植物园 Construction method of virtual genome-based cryptosystem (VGC)
CN103310157A (en) * 2012-03-14 2013-09-18 大连大学 Reverse transcriptase-deoxyribose nucleic acid (RT-DNA) cellular automaton-based image encryption method
AU2020202857B2 (en) * 2012-06-01 2022-03-17 European Molecular Biology Laboratory High-Capacity Storage of Digital Information in DNA
US11892945B2 (en) 2012-06-01 2024-02-06 European Molecular Biology Laboratory High-capacity storage of digital information in DNA
WO2014014991A3 (en) * 2012-07-19 2014-03-27 President And Fellows Of Harvard College Methods of storing information using nucleic acids
CN104662544A (en) * 2012-07-19 2015-05-27 哈佛大学校长及研究员协会 Methods of storing information using nucleic acids
US9384320B2 (en) 2012-07-19 2016-07-05 President And Fellows Of Harvard College Methods of storing information using nucleic acids
US10460220B2 (en) 2012-07-19 2019-10-29 President And Fellows Of Harvard College Methods of storing information using nucleic acids
KR101743846B1 (en) 2012-07-19 2017-06-05 프레지던트 앤드 펠로우즈 오브 하바드 칼리지 Methods of storing information using nucleic acids
US11900191B2 (en) 2012-07-19 2024-02-13 President And Fellows Of Harvard College Methods of storing information using nucleic acids
US9996778B2 (en) 2012-07-19 2018-06-12 President And Fellows Of Harvard College Methods of storing information using nucleic acids
CN104050625A (en) * 2014-07-03 2014-09-17 东华大学 Compound chaos image encryption method with initial secret keys built through plaintext
US11785004B2 (en) 2014-09-03 2023-10-10 Nanthealth, Inc. Synthetic genomic variant-based secure transaction devices, systems and methods
US11785002B2 (en) 2014-09-03 2023-10-10 Nanthealth, Inc. Synthetic genomic variant-based secure transaction devices, systems and methods
EP3098742A1 (en) * 2015-05-26 2016-11-30 Thomson Licensing Method and apparatus for creating a plurality of oligos with a targeted distribution of nucleotide types
US9928869B2 (en) 2015-07-13 2018-03-27 President And Fellows Of Harvard College Methods for retrievable information storage using nucleic acids
US10289801B2 (en) 2015-07-13 2019-05-14 President And Fellows Of Harvard College Methods for retrievable information storage using nucleic acids
CN108026557A (en) * 2015-07-13 2018-05-11 哈佛学院董事及会员团体 It is used for the method for retrievable information storage using nucleic acid
US11532380B2 (en) 2015-07-13 2022-12-20 President And Fellows Of Harvard College Methods for using nucleic acids to store, retrieve and access information comprising a text, image, video or audio format
CN105701755B (en) * 2015-08-20 2018-10-23 大连大学 It is a kind of based on the image encryption method for dividing shape and DNA sequence dna to operate
CN105701755A (en) * 2015-08-20 2016-06-22 大连大学 Image encryption method based on fractal and DNA sequence operation
CN113604546A (en) * 2015-09-22 2021-11-05 特韦斯特生物科学公司 Flexible substrates for nucleic acid synthesis
US10995373B2 (en) 2016-02-29 2021-05-04 Iridia, Inc. Systems and methods for writing, reading, and controlling data stored in a polymer
US10859562B2 (en) 2016-02-29 2020-12-08 Iridia, Inc. Methods, compositions, and devices for information storage
US11505825B2 (en) 2016-02-29 2022-11-22 Iridia, Inc. Methods of synthesizing DNA
US10714178B2 (en) 2016-02-29 2020-07-14 Iridia, Inc. Methods, compositions, and devices for information storage
US11549140B2 (en) 2016-02-29 2023-01-10 Iridia, Inc. Systems and methods for writing, reading, and controlling data stored in a polymer
US10640822B2 (en) 2016-02-29 2020-05-05 Iridia, Inc. Systems and methods for writing, reading, and controlling data stored in a polymer
US10438662B2 (en) 2016-02-29 2019-10-08 Iridia, Inc. Methods, compositions, and devices for information storage
WO2018081745A1 (en) 2016-10-31 2018-05-03 Dodo Omnidata, Inc. Methods, compositions, and devices for information storage
US11651836B2 (en) 2018-06-29 2023-05-16 Microsoft Technology Licensing, Llc Whole pool amplification and in-sequencer random-access of data encoded by polynucleotides
WO2020005598A1 (en) * 2018-06-29 2020-01-02 Microsoft Technology Licensing, Llc Whole pool amplification and in-sequencer random-access of data encoded by polynucleotides
US11017170B2 (en) 2018-09-27 2021-05-25 At&T Intellectual Property I, L.P. Encoding and storing text using DNA sequences
US11837302B1 (en) 2020-08-07 2023-12-05 Iridia, Inc. Systems and methods for writing and reading data stored in a polymer using nano-channels

Also Published As

Publication number Publication date
GB0521163D0 (en) 2005-11-23
JP2006522356A (en) 2006-09-28
GB2416232A (en) 2006-01-18
WO2004088585A3 (en) 2005-06-02
US20050053968A1 (en) 2005-03-10
WO2004088585A8 (en) 2005-04-28

Similar Documents

Publication Publication Date Title
WO2004088585A2 (en) A method for storing information in dna
Zhang et al. DNA origami cryptography for secure communication
De Silva et al. New trends of digital data storage in DNA
Lai et al. Asymmetric encryption and signature method with DNA technology
Ning A pseudo DNA cryptography method
Halvorsen et al. Binary DNA nanostructures for data encryption
Namasudra et al. Introduction of DNA computing in cryptography
Khalifa et al. Secure blind data hiding into pseudo DNA sequences using playfair ciphering and generic complementary substitution
Hamed et al. Comparative study for various DNA based steganography techniques with the essential conclusions about the future research
Hamed et al. Hybrid technique for steganography-based on DNA with n-bits binary coding rule
Popovici Aspects of DNA cryptography
Sreeja et al. DNA for information security: A Survey on DNA computing and a pseudo DNA method based on central dogma of molecular biology
Zhang et al. A DNA‐Based Encryption Method Based on Two Biological Axioms of DNA Chip and Polymerase Chain Reaction (PCR) Amplification Techniques
Abduallah et al. New Data hiding method based on DNA and Vigenere Autokey
EP1169330A2 (en) Information-carrying and information-processing polymers
JP6175453B2 (en) Encryption and decryption method using nucleic acid
Beck et al. Finding data in DNA: computer forensic investigations of living organisms
Gao A few DNA-based security techniques
Khalifa et al. Hiding secret Information in DNA sequences using silent mutations
CN109617673A (en) It is a kind of to provide the safety communicating method of multiple information protection based on DNA paper folding
Shyam et al. A novel encryption scheme based on DNA computing
Kumar et al. Introduction to DNA computing
Singh et al. Information hiding techniques based on DNA inconsistency: An overview
Vishwakarma et al. High density data storage in DNA using an efficient message encoding scheme
Saju et al. DNA cryptography: New field of cryptography

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BW BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE EG ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NA NI NO NZ OM PG PH PL PT RO RU SC SD SE SG SK SL SY TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): BW GH GM KE LS MW MZ SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IT LU MC NL PL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
DPEN Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed from 20040101)
CFP Corrected version of a pamphlet front page
CR1 Correction of entry in section i

Free format text: IN PCT GAZETTE 42/2004 ADD "DECLARATION UNDER RULE 4.17: - AS TO THE APPLICANT'S ENTITLEMENT TO CLAIM THE PRIORITY OF THE EARLIER APPLICATION (RULE 4.17(III)) FOR ALL DESIGNATIONS."

WWE Wipo information: entry into national phase

Ref document number: 3822/DELNP/2005

Country of ref document: IN

WWE Wipo information: entry into national phase

Ref document number: 171176

Country of ref document: IL

Ref document number: 2006506420

Country of ref document: JP

WWE Wipo information: entry into national phase

Ref document number: 0521163

Country of ref document: GB

122 Ep: pct application non-entry in european phase