TY - GEN
T1 - Dynamic Caching and Rate Control System for Short Videos using Reinforcement Learning
AU - Li, Jianke
AU - Sun, Yanming
AU - Hu, Dihao
AU - He, Chao
AU - Luan, Tom H.
N1 - Publisher Copyright:
© 2023 IEEE.
PY - 2023
Y1 - 2023
N2 - In this paper, we present a novel strategy for short video caching in cache queues within short video apps. Our aim is to minimize the number of user jams and reduce user bandwidth wastage. To achieve this goal, we propose a dynamic caching and rate control system based on reinforcement learning. Our system is designed to cache fewer video blocks during periods of good bandwidth to avoid wasting network resources. Conversely, during periods of poor bandwidth, our system ensures that there are more video blocks in the cache and reduces the playback rate of the next short video. We also aim to maintain a high short video cache ratio to reduce the probability of user lag. To achieve these objectives, we introduce a delay-based bandwidth prediction algorithm in WebRTC to predict the network bandwidth. We also design a block-based short video caching policy. Finally, we used the Actor-Critic algorithm to decide three actions: whether to cache video blocks in this time slot, which video block to cache, and the playback rate of the next time slot. Compared to traditional algorithms that aim to fill the cache as much as possible and TikTok-like caching algorithms, our system significantly reduces the number of video jams and network bandwidth waste, thereby enhancing the user's viewing experience and improving the efficiency of network resource utilization.
AB - In this paper, we present a novel strategy for short video caching in cache queues within short video apps. Our aim is to minimize the number of user jams and reduce user bandwidth wastage. To achieve this goal, we propose a dynamic caching and rate control system based on reinforcement learning. Our system is designed to cache fewer video blocks during periods of good bandwidth to avoid wasting network resources. Conversely, during periods of poor bandwidth, our system ensures that there are more video blocks in the cache and reduces the playback rate of the next short video. We also aim to maintain a high short video cache ratio to reduce the probability of user lag. To achieve these objectives, we introduce a delay-based bandwidth prediction algorithm in WebRTC to predict the network bandwidth. We also design a block-based short video caching policy. Finally, we used the Actor-Critic algorithm to decide three actions: whether to cache video blocks in this time slot, which video block to cache, and the playback rate of the next time slot. Compared to traditional algorithms that aim to fill the cache as much as possible and TikTok-like caching algorithms, our system significantly reduces the number of video jams and network bandwidth waste, thereby enhancing the user's viewing experience and improving the efficiency of network resource utilization.
KW - dynamic caching
KW - rate control
KW - reinforcement learning
KW - short video
UR - https://www.scopus.com/pages/publications/85173020624
U2 - 10.1109/ICCC57788.2023.10233576
DO - 10.1109/ICCC57788.2023.10233576
M3 - 会议稿件
AN - SCOPUS:85173020624
T3 - 2023 IEEE/CIC International Conference on Communications in China, ICCC 2023
BT - 2023 IEEE/CIC International Conference on Communications in China, ICCC 2023
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2023 IEEE/CIC International Conference on Communications in China, ICCC 2023
Y2 - 10 August 2023 through 12 August 2023
ER -