Skip to main navigation Skip to search Skip to main content

Tracking beyond Detection: Learning a Global Response Map for End-To-End Multi-Object Tracking

  • Xi'an Jiaotong University

Research output: Contribution to journalArticlepeer-review

20 Scopus citations

Abstract

Most of the existing Multi-Object Tracking (MOT) approaches follow the Tracking-by-Detection and Data Association paradigm, in which objects are firstly detected and then associated in the tracking process. In recent years, deep neural network has been utilized to obtain more discriminative appearance features for cross-frame association, and noticeable performance improvement has been reported. On the other hand, the Tracking-by-Detection framework is yet not completely end-To-end, which leads to huge computation and limited performance especially in the inference (tracking) process. To address this problem, we present an effective end-To-end deep learning framework which can directly take image-sequence/video as input and output the located and tracked objects of learned types. Specifically, a novel global response network is learned to project multiple objects in the image-sequence/video into a continuous response map, and the trajectory of each tracked object can then be easily picked out. The overall process is similar to how a detector inputs an image and outputs the bounding boxes of each detected object. Experimental results based on the MOT16 and MOT17 benchmarks show that our proposed on-line tracker achieves state-of-The-Art performance on several tracking metrics.

Original languageEnglish
Pages (from-to)8222-8235
Number of pages14
JournalIEEE Transactions on Image Processing
Volume30
DOIs
StatePublished - 2021

Keywords

  • Multi-object tracking
  • deep neural network
  • global response map

Fingerprint

Dive into the research topics of 'Tracking beyond Detection: Learning a Global Response Map for End-To-End Multi-Object Tracking'. Together they form a unique fingerprint.

Cite this