Leveraging multi-modal prior knowledge for large-scale concept learning in noisy web data

  • Junwei Liang
  • , Lu Jiang
  • , Deyu Meng
  • , Alexander Hauptmann

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

6 Scopus citations

Abstract

Learning video concept detectors automatically from the big but noisy web data with no additional manual annotations is a novel but challenging area in the multimedia and the machine learning community. A considerable amount of videos on the web is associated with rich but noisy contextual information, such as the title and other multi-modal information, which provides weak annotations or labels about the video content. To tackle the problem of large-scale noisy learning,We propose a novel method called Multimodal WEbly-Labeled Learning (WELL-MM), which is established on the state-of-the-art machine learning algorithm inspired by the learning process of human. WELL-MM introduces a novel multimodal approach to incorporate meaningful prior knowledge called curriculum from the noisy web videos. We empirically study the curriculum constructed from the multi-modal features of the Internet videos and images. The comprehensive experimental results on FCVID and YFCC100M demonstrate that WELL-MM outperforms state-of-the-art studies by a statically significant margin on learning concepts from noisy web video data. In addition, the results also verify that WELL-MM is robust to the level of noisiness in the video data. Notably, WELL-MM trained on sufficient noisy web labels is able to achieve a be.er accuracy to supervised learning methods trained on the clean manually labeled data.

Original languageEnglish
Title of host publicationICMR 2017 - Proceedings of the 2017 ACM International Conference on Multimedia Retrieval
PublisherAssociation for Computing Machinery, Inc
Pages32-40
Number of pages9
ISBN (Electronic)9781450347013
DOIs
StatePublished - 6 Jun 2017
Event17th ACM International Conference on Multimedia Retrieval, ICMR 2017 - Bucharest, Romania
Duration: 6 Jun 20179 Jun 2017

Publication series

NameICMR 2017 - Proceedings of the 2017 ACM International Conference on Multimedia Retrieval

Conference

Conference17th ACM International Conference on Multimedia Retrieval, ICMR 2017
Country/TerritoryRomania
CityBucharest
Period6/06/179/06/17

Keywords

  • Big data
  • Concept detection
  • Noisy data
  • Prior knowledge
  • Video understanding
  • Web label
  • Weblysupervised learning

Fingerprint

Dive into the research topics of 'Leveraging multi-modal prior knowledge for large-scale concept learning in noisy web data'. Together they form a unique fingerprint.

Cite this