Video search via ranking network with very few query exemplars

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

This paper addresses the challenge of video search with only a handful query exemplars by proposing a triplet ranking network-based method. Based on the typical scenario for video search system, a user begins the query process by first utilizing the metadata-based text-to- video search module to find an initial set of videos of interest in the video repository. As bridging the semantic gap between text and video is very challenging, usually only a handful relevant videos appear in the initial retrieved results. The user now can use the video-to-video search module to train a new classifier to search more relevant videos. However, since we found that statistically only fewer than 5 videos are initially relevant, training a complex event classifier with a handful of examples is extremely challenging. Therefore, it is necessary to improve video retrieval method that works for a handful of positive training example videos. The proposed triplet ranking network is mainly designed for this situation and has the following properties: (1) This ranking network can learn an off-line similarity matching projection, which is event independent, from other previous video search tasks or datasets. Such that even with only one query video, we can search its relative videos. Then this method can transfer previous knowledge to the specific video retrieval tasks as more and more relative videos being retrieved, to further improve the retrieval performance; (2) It casts the video search task as a ranking problem, and can exploit partial ordering information in the dataset; (3) Based on the above two merits, this method is suitable for the case where only a handful of positive examples exploit. Experimental results show the effectiveness of our proposed method on video retrieval with only a handful of positive exemplars.

Original languageEnglish
Title of host publicationMultiMedia Modeling - 23rd International Conference, MMM 2017, Proceedings
EditorsCathal Gurrin, Björn Thór Jónsson, Laurent Amsaleg, Shin’ichi Satoh, Gylfi Thór Gudmundsson
PublisherSpringer Verlag
Pages419-430
Number of pages12
ISBN (Print)9783319518138
DOIs
StatePublished - 2017

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume10133 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Keywords

  • Few positives
  • Knowledge adaptation
  • Partially ordered
  • Ranking network
  • Video search

Fingerprint

Dive into the research topics of 'Video search via ranking network with very few query exemplars'. Together they form a unique fingerprint.

Cite this