Deep Reinforcement Learning Based Resource Management for DNN Inference in IIoT

  • Weiting Zhang
  • , Dong Yang
  • , Haixia Peng
  • , Wen Wu
  • , Wei Quan
  • , Hongke Zhang
  • , Xuemin Sherman Shen

Research output: Contribution to journalConference articlepeer-review

2 Scopus citations

Abstract

In this paper, we investigate the joint task assignment and resource allocation for deep neural network (DNN) inference in the device-edge-cloud based industrial Internet of things (IIoT) networks. To efficiently orchestrate the limited spectrum and computing resources in IIoT networks for massive DNN inference tasks, a resource management problem is formulated with the objective of maximizing the average inference accuracy while satisfying the quality-of-service of DNN inference tasks. Considering the strict delay requirements of inference tasks, we transform the formulated problem into a Markov decision process, and propose a deep deterministic policy gradient based learning algorithm to obtain the solution rapidly. Simulation results show that the proposed algorithm can achieve high average inference accuracy.

Original languageEnglish
Article number9322223
JournalProceedings - IEEE Global Communications Conference, GLOBECOM
DOIs
StatePublished - 2020
Event2020 IEEE Global Communications Conference, GLOBECOM 2020 - Virtual, Taipei, Taiwan, Province of China
Duration: 7 Dec 202011 Dec 2020

Keywords

  • deep deterministic policy gradient
  • DNN inference
  • IIoT
  • resource management

Fingerprint

Dive into the research topics of 'Deep Reinforcement Learning Based Resource Management for DNN Inference in IIoT'. Together they form a unique fingerprint.

Cite this