Applied Sciences (Jan 2024)

Text Semantics-Driven Data Classification Storage Optimization

  • Zhu Yuan,
  • Xueqiang Lv,
  • Yunchao Gong,
  • Boshan Liu,
  • Haixiang Yang,
  • Xindong You

DOI
https://doi.org/10.3390/app14031159
Journal volume & issue
Vol. 14, no. 3
p. 1159

Abstract

Read online

Data classification storage has emerged as an effective strategy, harnessing the diverse performance attributes of storage devices to orchestrate a harmonious equilibrium between energy consumption, cost considerations, and user accessibility. The traditional strategy of solely relying on access frequency for data classification is no longer suitable for today’s complex storage environment. Diverging from conventional methods, we explore from the perspective of text semantics to address this issue and propose an effective data classification storage method using text semantic similarity to extract seasonal features. First, we adopt a dual-layer strategy based on semantic similarity to extract seasonal features. Second, we put forward a cost-effective data classification storage framework based on text seasonal features. We compare our work with the data classification approach AS-H, which runs at full high performance. In addition, we also compare it with K-ear, which adopts K-means as the classification algorithm. The experimental results show that compared with AS-H and K-ear, our method reduces energy consumption by 9.51–13.35% and operating costs by 13.20–22.17%.

Keywords