Energy Minimization in UAV-Aided Networks: Actor-Critic Learning for Constrained Scheduling Optimization

  • Yaxiong Yuan
  • , Lei Lei
  • , Thang X. Vu
  • , Symeon Chatzinotas
  • , Sumei Sun
  • , Bjorn Ottersten

Research output: Contribution to journalArticlepeer-review

55 Scopus citations

Abstract

In unmanned aerial vehicle (UAV) applications, the UAV's limited energy supply and storage have triggered the development of intelligent energy-conserving scheduling solutions. In this paper, we investigate energy minimization for UAV-aided communication networks by jointly optimizing data-transmission scheduling and UAV hovering time. The formulated problem is combinatorial and non-convex with bilinear constraints. To tackle the problem, firstly, we provide an optimal algorithm (OPT) and a golden section search heuristic algorithm (GSS-HEU). Both solutions are served as offline performance benchmarks which might not be suitable for online operations. Towards this end, from a deep reinforcement learning (DRL) perspective, we propose an actor-critic-based deep stochastic online scheduling (AC-DSOS) algorithm and develop a set of approaches to confine the action space. Compared to conventional RL/DRL, the novelty of AC-DSOS lies in handling two major issues, i.e., exponentially-increased action space and infeasible actions. Numerical results show that AC-DSOS is able to provide feasible solutions, and save around 25-30% energy compared to two conventional deep AC-DRL algorithms. Compared to the developed GSS-HEU, AC-DSOS consumes around 10% higher energy but reduces the computational time from second-level to millisecond-level.

Original languageEnglish
Article number9416816
Pages (from-to)5028-5042
Number of pages15
JournalIEEE Transactions on Vehicular Technology
Volume70
Issue number5
DOIs
StatePublished - May 2021
Externally publishedYes

Keywords

  • UAV
  • actor-critic
  • deep reinforcement learning
  • energy optimization
  • hovering time allocation
  • user scheduling

Fingerprint

Dive into the research topics of 'Energy Minimization in UAV-Aided Networks: Actor-Critic Learning for Constrained Scheduling Optimization'. Together they form a unique fingerprint.

Cite this