TY - JOUR
T1 - Reinforcement Learning Based Multi-Access Control with Energy Harvesting
AU - Chu, Man
AU - Li, Hang
AU - Liao, Xuewen
AU - Cui, Shuguang
N1 - Publisher Copyright:
© 2018 IEEE.
PY - 2018
Y1 - 2018
N2 - In this paper, we study an uplink wireless system including N energy harvesting (EH) user equipments (UEs) and one base station (BS) with limited access channels. Each UE has a rechargeable battery with finite capacity. The system control is modeled as a Markov decision process without complete prior knowledge assumed at the BS, which also deals with large sizes in both state and action spaces. To handle such an access control problem, we propose a scheduling algorithm that maximizes the expected sum discounted uplink transmission rate based on reinforcement learning (RL) with deep Q-network (DQN) enhancement. Different from the traditional access control solutions that usually assume strong model knowledges, our goal is to achieve a more stable and balanced transmission over a long time horizon in a data-driven fashion. Finally, experiment results show that the proposed RL algorithm can achieve better performances compared with existing benchmarks.
AB - In this paper, we study an uplink wireless system including N energy harvesting (EH) user equipments (UEs) and one base station (BS) with limited access channels. Each UE has a rechargeable battery with finite capacity. The system control is modeled as a Markov decision process without complete prior knowledge assumed at the BS, which also deals with large sizes in both state and action spaces. To handle such an access control problem, we propose a scheduling algorithm that maximizes the expected sum discounted uplink transmission rate based on reinforcement learning (RL) with deep Q-network (DQN) enhancement. Different from the traditional access control solutions that usually assume strong model knowledges, our goal is to achieve a more stable and balanced transmission over a long time horizon in a data-driven fashion. Finally, experiment results show that the proposed RL algorithm can achieve better performances compared with existing benchmarks.
UR - https://www.scopus.com/pages/publications/85063423295
U2 - 10.1109/GLOCOM.2018.8647438
DO - 10.1109/GLOCOM.2018.8647438
M3 - 会议文章
AN - SCOPUS:85063423295
SN - 2334-0983
JO - Proceedings - IEEE Global Communications Conference, GLOBECOM
JF - Proceedings - IEEE Global Communications Conference, GLOBECOM
M1 - 8647438
T2 - 2018 IEEE Global Communications Conference, GLOBECOM 2018
Y2 - 9 December 2018 through 13 December 2018
ER -