IEEE Access (Jan 2023)

Analyzing I/O Characteristics of Time-Series Data Using High Performance Storage Devices

  • Sangmyung Lee,
  • Yongseok Son,
  • Sunggon Kim

DOI
https://doi.org/10.1109/ACCESS.2023.3329474
Journal volume & issue
Vol. 11
pp. 128998 – 129008

Abstract

Read online

As the importance of data increases, data is continuously collected from diverse sources such as sensors, IoT devices, and edge computing devices. To manage these continuously monitored data, it is often organized chronologically with time which is referred as time-series data. By managing the data using time, data from different streams can be analyzed in a comprehensive manner with an identical index which is time. However, due to the unique characteristics of time-series data, it is essential for the underlying database systems to understand the characteristics of the time-series data. To handle this, time-series database systems, which specially target time-series data, are emerging. These database systems have different performance characteristics due to the unique characteristics of the data which should be investigated to efficiently store and analyze the data. In this paper, we analyze the time-series database from the perspective of I/O using various storage devices from HDD, SATA and NVMe SSD. First, we analyze the I/O characteristics such as runtime, throughput and size of total requests using various storage devices. In addition, we analyze the effect of unique time-series database features such as data chunk interval, compression and number of workers. Our analysis results show that adapting high-performance devices can greatly improve the performance of the database by up to $33.22\times $ .

Keywords