Progressive Latent Models for Self-Learning Scene-Specific Pedestrian Detectors

  • Qixiang Ye
  • , Tianliang Zhang
  • , Wei Ke

Research output: Contribution to journalArticlepeer-review

3 Scopus citations

Abstract

The performance of offline learned pedestrian detectors significantly drops when they are applied to video scenes of various camera views, occlusions, and background structures. Learning a detector for each video scene can avoid the performance drop but it requires repetitive human effort on data annotation. In this paper, a self-learning approach is proposed, toward specifying a pedestrian detector for each video scene without any human annotation involved. Object locations in video frames are treated as latent variables and a progressive latent model (PLM) is proposed to solve such latent variables. The PLM is deployed as components of object discovery, object enforcement, and label propagation, which are used to learn the object locations in a progressive manner. With the difference of convex (DC) objective functions, PLM is optimized by a concave-convex programming algorithm. With specified network branches and loss functions, PLM is integrated with deep feature learning and optimized in an end-to-end manner. From the perspectives of convex regularization and error rate estimation, detailed optimization analysis and learning stability analysis of the proposed PLM are provided. The extensive experiments demonstrate that even without annotation involved the proposed self-learning approach outperforms weakly supervised learning approaches, while achieving comparable performance with transfer learning approaches.

Original languageEnglish
Article number8701617
Pages (from-to)1415-1426
Number of pages12
JournalIEEE Transactions on Intelligent Transportation Systems
Volume21
Issue number4
DOIs
StatePublished - Apr 2020
Externally publishedYes

Keywords

  • Pedestrian detection
  • difference of convex
  • progressive latent model
  • self-learning

Fingerprint

Dive into the research topics of 'Progressive Latent Models for Self-Learning Scene-Specific Pedestrian Detectors'. Together they form a unique fingerprint.

Cite this