跳到主要导航 跳到搜索 跳到主要内容

Towards efficient learning of optimal spatial Bag-of-Words representations

  • Lu Jiang
  • , Wei Tong
  • , Deyu Meng
  • , Alexander G. Hauptmann
  • Carnegie Mellon University

科研成果: 书/报告/会议事项章节会议稿件同行评审

13 引用 (Scopus)

摘要

Spatial Pyramid Matching (SPM) assumes that the spatial Bag-of-Words (BoW) representation is independent of data. However, evidence has shown that the assumption usually leads to a suboptimal representation. In this paper, we propose a novel method called Jensen-Shannon (JS) Tiling to learn the BoW representation from data directly at the BoW level. The proposed JS Tiling is especially appropriate for large-scale datasets as it is orders of magnitude faster than existing methods, but with comparable or even better classification precision. Experimental results on four benchmarks including two TRECVID12 datasets validate that JS Tiling outperforms the SPM and the state-of-the-art methods. The runtime comparison demonstrates that selecting BoW representations by JS Tiling is more than 1,000 times faster than running classifiers. Besides, JS Tiling is an important component contributing to CMU Teams' final submission in TRECVID 2012 Multimedia Event Detection.

源语言英语
主期刊名ICMR 2014 - Proceedings of the ACM International Conference on Multimedia Retrieval 2014
出版商Association for Computing Machinery
121-128
页数8
ISBN(印刷版)1595930361, 9781595930361
DOI
出版状态已出版 - 1 4月 2014
活动2014 4th ACM International Conference on Multimedia Retrieval, ICMR 2014 - Glasgow, 英国
期限: 1 4月 20144 4月 2014

出版系列

姓名ICMR 2014 - Proceedings of the ACM International Conference on Multimedia Retrieval 2014

会议

会议2014 4th ACM International Conference on Multimedia Retrieval, ICMR 2014
国家/地区英国
Glasgow
时期1/04/144/04/14

学术指纹

探究 'Towards efficient learning of optimal spatial Bag-of-Words representations' 的科研主题。它们共同构成独一无二的指纹。

引用此