Hierarchical model for joint detection and tracking of multi-target

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

We present a hierarchical and compositional model based on an And-or graph for joint detecting and tracking of multiple targets in video. In the graph, an And-node for the joint state of all targets is decomposed into multiple Or-nodes. Each Or-node represents an individual target's state that includes position, appearance, and scale of the target. Leaf nodes are trained detectors. Measurements that supplied by the predictions of the tracker and leaf nodes are shared among Or-nodes.There are two kinds of production rules respectively designed for the problems of varying number and occlusions. One is association relations that distributes measurements to targets, and the other is semantic relations that represent occlusion between targets. The inference algorithm for the graph consists of three processing channels: (1) a bottom-up channel, which provides informative measurements by using learned detectors; (2) a top-down channel, which estimates the individual target state with joint probabilistic data association; (3) a context sensitive reasoning channel, which finalizes the estimation of the joint state with belief propagation. Additionally, an interaction mechanism between detection and tracking is implemented by a hybrid measurement process. The algorithm is validated widely by tracking peoples in several complex scenarios. Empirical results show that our tracker can reliably track multi-target without any prior knowledge about the number of targets and the targets may appear or disappear anywhere in the image frame and at any time in all these test videos.

Original languageEnglish
Title of host publicationComputer Vision, ACCV 2009 - 9th Asian Conference on Computer Vision, Revised Selected Papers
Pages160-171
Number of pages12
EditionPART 2
DOIs
StatePublished - 2009
Event9th Asian Conference on Computer Vision, ACCV 2009 - Xi'an, China
Duration: 23 Sep 200927 Sep 2009

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
NumberPART 2
Volume5995 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference9th Asian Conference on Computer Vision, ACCV 2009
Country/TerritoryChina
CityXi'an
Period23/09/0927/09/09

Fingerprint

Dive into the research topics of 'Hierarchical model for joint detection and tracking of multi-target'. Together they form a unique fingerprint.

Cite this