Visual relationship detection based on bidirectional recurrent neural network

Research output: Contribution to journalArticlepeer-review

8 Scopus citations

Abstract

Visual relationship detection is a task aiming at mining the information of interactions between the paired objects in the image, describing the image in the form of (subject − predicate − object). Most of the previous works regard it as a pure classification problem by taking the integrated triplets as the label of the image; however, the numerous combinations of objects and the diversity of predicates are the tough challenges for these studies. Hence, we propose a deep model based on a modified bidirectional recurrent neural network (BRNN) to classify object and predict predicate simultaneously. By using the BRNN, the hidden information of the relationship in the image is extracted and a feature-infusion method is proposed. Additionally, we improve the existing works by introducing a paired non-maximum suppression method. The experiments show that our approach is competitive with the state-of-the-art works.

Original languageEnglish
Pages (from-to)35297-35313
Number of pages17
JournalMultimedia Tools and Applications
Volume79
Issue number47-48
DOIs
StatePublished - Dec 2020
Externally publishedYes

Keywords

  • Detection
  • NMS
  • RNN
  • Visual relationship

Fingerprint

Dive into the research topics of 'Visual relationship detection based on bidirectional recurrent neural network'. Together they form a unique fingerprint.

Cite this