US 11,809,414 B2
Serialization of data in a concurrent transaction processing distributed database
Khaled Yagoub, Fremont, CA (US); Wumengjian Zhu, Cupertino, CA (US); Benoit Dageville, San Mateo, CA (US); and William Waddington, Stateline, NV (US)
Assigned to Snowflake Inc., Bozeman, MT (US)
Filed by Snowflake Inc., Bozeman, MT (US)
Filed on Nov. 30, 2021, as Appl. No. 17/538,818.
Prior Publication US 2023/0169068 A1, Jun. 1, 2023
Int. Cl. G06F 16/20 (2019.01); G06F 16/23 (2019.01); G06F 16/28 (2019.01); G06F 11/14 (2006.01); G06F 16/22 (2019.01)
CPC G06F 16/2379 (2019.01) [G06F 11/1458 (2013.01); G06F 16/221 (2019.01); G06F 16/283 (2019.01)] 20 Claims
OG exemplary drawing
 
1. A method comprising:
identifying, in a hybrid database system, column-organized data that comprises primary key data and non-primary key data, the hybrid database system comprising a columnar database system that processes data in a columnar format and a key-value database system that processes key-value data in a key-value format;
serializing, by at least one or more hardware processor, the non-primary key data of the column-organized data into compressed serialized value data that is in a row-organized sequence, the compressed serialized value data compressed using one or more bitmaps, the non-primary key data comprising a schema identifier;
storing the compressed serialized value data in a key-value data store of the key-value database system;
receiving a query by the columnar database system;
deserializing a portion of the compressed serialized value data that corresponds to the query; and
processing the query using the columnar database system.