A New Cache Update Scheme Using Reinforcement Learning for Coded Video Streaming Systems

Yu-Sin Kim; Jeong-Min Lee; Jong-Yeol Ryu; Tae-Won Ban

doi:10.3390/s21082867

Sensors (Apr 2021)

Affiliations

Yu-Sin Kim: Algorithm Team, Carvi, Seoul 08513, Korea
Jeong-Min Lee: Department of Information and Communication Engineering, Gyeongsang National University, Gyeongnam 53064, Korea
Jong-Yeol Ryu: Department of Information and Communication Engineering, Gyeongsang National University, Gyeongnam 53064, Korea
Tae-Won Ban: Department of Information and Communication Engineering, Gyeongsang National University, Gyeongnam 53064, Korea

DOI: https://doi.org/10.3390/s21082867
Journal volume & issue: Vol. 21, no. 8, p. 2867

Abstract

As demand for video streaming has increased rapidly, technologies that improve streaming efficiency have attracted much attention. In this paper, we investigate how clients’ cache storage can be used to improve the efficiency of video streaming. We consider exclusive OR (XOR) coding-based video streaming, in which multiple different video contents can be sent simultaneously in one transmission as long as prerequisite conditions are satisfied, so that the efficiency of video streaming can be significantly enhanced. We also propose a new cache update scheme using reinforcement learning. The proposed scheme uses a K-actor-critic (K-AC) network that mitigates the disadvantage of actor-critic networks by yielding K candidate outputs and selecting the candidate with the highest value as the final output. A K-AC network resides in each client, and each client trains it using only locally available information, without any feedback or signaling, so the proposed cache update scheme is completely decentralized. The performance of the proposed scheme was analyzed in terms of the average number of transmissions for XOR coding-based video streaming and compared with that of conventional cache update schemes. Our numerical results show that the proposed scheme can reduce the number of transmissions by up to 24% when the number of videos is 100, the number of clients is 50, and the cache size is 5.
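The XOR coding idea mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not spell out the prerequisite conditions, so the check below assumes a commonly used one: every client in a group already caches all videos requested by the other clients in that group. The function names, client labels, and byte strings are hypothetical and are not taken from the paper.

```python
from typing import Dict, Set


def xor_bytes(a: bytes, b: bytes) -> bytes:
    """Bitwise XOR of two equal-length byte strings."""
    if len(a) != len(b):
        raise ValueError("segments must have equal length")
    return bytes(x ^ y for x, y in zip(a, b))


def can_code_together(requests: Dict[str, int],
                      caches: Dict[str, Set[int]]) -> bool:
    """Assumed prerequisite (not stated in the abstract): each client
    caches every video requested by the other clients in the group."""
    wanted = set(requests.values())
    return all((wanted - {video}) <= caches[client]
               for client, video in requests.items())


# Hypothetical two-client example: c1 wants video 1 and caches video 2,
# while c2 wants video 2 and caches video 1.
videos = {1: b"AAAA", 2: b"BBBB"}
requests = {"c1": 1, "c2": 2}
caches = {"c1": {2}, "c2": {1}}

# One coded transmission serves both clients.
coded = xor_bytes(videos[1], videos[2])

# Each client removes its cached video from the coded packet.
decoded_c1 = xor_bytes(coded, videos[2])
decoded_c2 = xor_bytes(coded, videos[1])
```

Under this assumption, one coded transmission replaces two uncoded ones, which is why the contents of each client's cache affect the average number of transmissions.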

Published in Sensors

ISSN: 1424-8220 (Online)
Publisher: MDPI AG
Country of publisher: Switzerland
LCC subjects: Technology: Chemical technology
Website: http://www.mdpi.com/journal/sensors