TY - GEN
T1 - A contextual multi-armed bandit approach to caching in wireless small cell network
AU - Zhang, Chenxi
AU - Ren, Pinyi
AU - Du, Qinghe
N1 - Publisher Copyright:
© 2017 IEEE.
PY - 2017/12/7
Y1 - 2017/12/7
N2 - In this paper, we study the problem of cache content placement in the wireless small cell networks. We consider the small base station (SBS) has a cache unit and limited backhaul capacity, and the SBS cache the popular content to serve local users request while reducing the traffic load on the backhaul link. The goal of the cache unit in SBS is to maximum the traffic offloading from backhaul link, this problem can be seen as a knapsack problem when the content popularity has known in advance. However, content popularity is an index which is constantly changing, and it is difficult to obtain. Hence, we model the content popularity as a linear model based on the context information of the system, the problem becomes a contextual multi-armed bandit (CMAB) problem. We present an online learning algorithm, in which the SBS can learn content popularity by maintain a credible linear model and refresh the cache content over time. We give the regret bound of our algorithm, which prove that our algorithm can converges to the optimal SBS caching strategy. Our simulation results show that our algorithm could quickly learn the content popularity and outperform the reference algorithms.
AB - In this paper, we study the problem of cache content placement in the wireless small cell networks. We consider the small base station (SBS) has a cache unit and limited backhaul capacity, and the SBS cache the popular content to serve local users request while reducing the traffic load on the backhaul link. The goal of the cache unit in SBS is to maximum the traffic offloading from backhaul link, this problem can be seen as a knapsack problem when the content popularity has known in advance. However, content popularity is an index which is constantly changing, and it is difficult to obtain. Hence, we model the content popularity as a linear model based on the context information of the system, the problem becomes a contextual multi-armed bandit (CMAB) problem. We present an online learning algorithm, in which the SBS can learn content popularity by maintain a credible linear model and refresh the cache content over time. We give the regret bound of our algorithm, which prove that our algorithm can converges to the optimal SBS caching strategy. Our simulation results show that our algorithm could quickly learn the content popularity and outperform the reference algorithms.
KW - Caching
KW - Contextual bandits
KW - Online Learning
KW - Wireless Networks
UR - https://www.scopus.com/pages/publications/85046404602
U2 - 10.1109/WCSP.2017.8171043
DO - 10.1109/WCSP.2017.8171043
M3 - 会议稿件
AN - SCOPUS:85046404602
T3 - 2017 9th International Conference on Wireless Communications and Signal Processing, WCSP 2017 - Proceedings
SP - 1
EP - 6
BT - 2017 9th International Conference on Wireless Communications and Signal Processing, WCSP 2017 - Proceedings
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 9th International Conference on Wireless Communications and Signal Processing, WCSP 2017
Y2 - 11 October 2017 through 13 October 2017
ER -