摘要
In this paper, we treat tracking as a learning problem of estimating the location and the scale of an object given its previous location, scale, as well as current and previous image frames. Given a set of examples, we train convolutional neural networks (CNNs) to perform the above estimation task. Different from other learning methods, the CNNs learn both spatial and temporal features jointly from image pairs of two adjacent frames. We introduce multiple path ways in CNN to better fuse local and global information. A creative shift-variant CNN architecture is designed so as to alleviate the drift problem when the distracting objects are similar to the target in cluttered environment. Furthermore, we employ CNNs to estimate the scale through the accurate localization of some key points. These techniques are object-independent so that the proposed method can be applied to track other types of object. The capability of the tracker of handling complex situations is demonstrated in many testing sequences.
| 源语言 | 英语 |
|---|---|
| 文章编号 | 5559504 |
| 页(从-至) | 1610-1623 |
| 页数 | 14 |
| 期刊 | IEEE Transactions on Neural Networks |
| 卷 | 21 |
| 期 | 10 |
| DOI | |
| 出版状态 | 已出版 - 10月 2010 |
| 已对外发布 | 是 |
学术指纹
探究 'Human tracking using convolutional neural networks' 的科研主题。它们共同构成独一无二的指纹。引用此
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver