Computers (Mar 2024)

Lite<sup>2</sup>: A Schemaless Zero-Copy Serialization Format

  • Tianyi Chen,
  • Xiaotong Guan,
  • Shi Shuai,
  • Cuiting Huang,
  • Michal Aibin

DOI
https://doi.org/10.3390/computers13040089
Journal volume & issue
Vol. 13, no. 4
p. 89

Abstract

Read online

In the field of data transmission and storage, serialization formats play a crucial role by converting complex data structures into a byte stream that can be easily stored, transmitted, and reconstructed. Despite the myriad available serialization formats, ranging from JSON to Protobuf, each has limitations, particularly in balancing schema flexibility, performance, and data copying overhead. This paper introduces Lite2, a novel data serialization format that addresses these challenges by combining schemaless flexibility with the efficiency of zero-copy operations for flat or key–value pair data types. Unlike traditional formats that often require a predefined schema and involve significant data copying during serialization and deserialization, Lite2 offers a dynamic schemaless approach that eliminates unnecessary data copying, optimizing system performance and efficiency. Built upon a contiguously stored B-tree structure, Lite2 enables efficient data lookup and modification without deserialization, thereby achieving zero-copy operations.

Keywords