Skip to main navigation Skip to search Skip to main content

A design of reward function in multi-target trajectory recovery with deep reinforcement learning

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

3 Scopus citations

Abstract

It attracts a lot of attention in the field of object trajectory detection that detectors always receive several geographical locations without any information about the targets, and furthermore it comes into a problem to use the geographical location information received by the sensors to reconstruct the trajectory of each targets as well as to distinguish the targets in each frame, which is called multi-target trajectory recovery and can be solved by the Deep Reinforcement Learning (DRL). A mathematically model of the direction and curvature of the target trajectory according to the peculiarity of trajectories is proposed. Then, a reward function based on Trajectory Osculating Circle (TOC) is designed based on this mathematical model. Firstly, the issue of the recovery of multi-target trajectories is introduced and it can be switched into a model which can be implemented by DRL. Secondly, a structure of DRL on this issue is come up with, and is tested with the proposed reward function. Finally, a mathematical derivation and physical interpretation of the proposed reward function is implemented. The experimental result shows that with the guidance of the TOC reward function, DRL can reverse the trajectory more effectively than the state-of-the-art clustering method, and the trace is corresponding with the actual trajectory.

Original languageEnglish
Title of host publicationProceedings of 2019 IEEE 8th Joint International Information Technology and Artificial Intelligence Conference, ITAIC 2019
EditorsBing Xu
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages286-293
Number of pages8
ISBN (Electronic)9781538681787
DOIs
StatePublished - May 2019
Event8th IEEE Joint International Information Technology and Artificial Intelligence Conference, ITAIC 2019 - Chongqing, China
Duration: 24 May 201926 May 2019

Publication series

NameProceedings of 2019 IEEE 8th Joint International Information Technology and Artificial Intelligence Conference, ITAIC 2019

Conference

Conference8th IEEE Joint International Information Technology and Artificial Intelligence Conference, ITAIC 2019
Country/TerritoryChina
CityChongqing
Period24/05/1926/05/19

Keywords

  • Deep reinforcement learning
  • Q-function
  • Sequential decision
  • Trajectory osculating circle

Fingerprint

Dive into the research topics of 'A design of reward function in multi-target trajectory recovery with deep reinforcement learning'. Together they form a unique fingerprint.

Cite this