US 11,810,651 B2
Multi-dimensional mapping of binary data to DNA sequences
Mehmet Fatih Erden, St. Louis Park, MN (US)
Assigned to SEAGATE TECHNOLOGY LLC, Fremont, CA (US)
Filed by SEAGATE TECHNOLOGY LLC, Cupertino, CA (US)
Filed on Aug. 30, 2018, as Appl. No. 16/117,742.
Claims priority of provisional application 62/553,496, filed on Sep. 1, 2017.
Prior Publication US 2019/0073446 A1, Mar. 7, 2019
Int. Cl. G16B 30/00 (2019.01); G16B 50/30 (2019.01); G16B 50/40 (2019.01); G16B 50/00 (2019.01); G16B 5/00 (2019.01); G11C 13/02 (2006.01); B82Y 10/00 (2011.01); B82Y 15/00 (2011.01); C12Q 1/6874 (2018.01); C12Q 1/6869 (2018.01)
CPC G16B 50/00 (2019.02) [G11C 13/02 (2013.01); G16B 5/00 (2019.02); G16B 30/00 (2019.02); G16B 50/30 (2019.02); G16B 50/40 (2019.02); B82Y 10/00 (2013.01); B82Y 15/00 (2013.01); C12Q 1/6869 (2013.01); C12Q 1/6874 (2013.01)] 20 Claims
 
1. A deoxyribonucleic acid (DNA) data storage system comprising:
a DNA reader for reading sequences of DNA bases formed without homopolymers in adjacent pairs of DNA bases, the DNA reader configured to measure respective current levels indicative of respective DNA bases from a sequence of DNA bases, based, at least in part, on a read process performed on the sequence of DNA bases, and to output a sequence of current levels of such determined respective current levels as current level data;
a memory storing processor-implementable instructions, and further storing a signal constellation diagram including a plurality of nodes that multi-dimensionally represent a digital bit to DNA base ratio approaching a theoretical limit of 1.585, wherein each node corresponds to a fixed-length, particular ordered set of adjacent DNA base types and an associated multi-digit, binary number combination; and
a processor configured to decode binary data from the current level data by instantiation of the processor-implementable instructions which instruct the processor to:
receive the current level data from the DNA reader;
associate respective DNA base types corresponding to each respective measured current level in the sequence of current levels;
map sets of adjacent DNA base types determined in the sequence of current levels to corresponding nodes of the signal constellation diagram; and
construct decoded binary data from the multi-digit, binary number combinations associated with each node of the signal constellation diagram, with a maximum of one bit difference between the multi-digit, binary number combinations in any adjacent node, thereby optimizing a bit error rate of the decoded binary data and a signal to noise ratio of the current level data; and
a storage device in communication with the processor that receives and stores the decoded binary data.