Skip to main navigation Skip to search Skip to main content

Time-Dependent Body Gesture Representation for Video Emotion Recognition

  • Xi'an Jiaotong University

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

6 Scopus citations

Abstract

Video emotion recognition has recently become a research hotspot in the field of affective computing. Although large parts of studies focus on facial cues, body gestures are the only available cues in some scenes such as video monitoring systems. In this paper, we propose a body gesture representation method based on body joint movements. To reduce the model complexity and promote the understanding of video emotion, this method uses body joint information to represent body gestures and captures time-dependent relationship of body joints. Furthermore, we propose an attention-based channelwise convolutional neural network (ACCNN) to retain the independent characteristics of each body joint and learn key body gesture features. Experimental results on the multimodal database of Emotional Speech, Video and Gestures (ESVG) demonstrate the effectiveness of the proposed method, and the accuracy of body gesture features is comparable with that of facial features.

Original languageEnglish
Title of host publicationMultiMedia Modeling - 27th International Conference, MMM 2021, Proceedings
EditorsJakub Lokoc, Tomáš Skopal, Klaus Schoeffmann, Vasileios Mezaris, Xirong Li, Stefanos Vrochidis, Ioannis Patras
PublisherSpringer Science and Business Media Deutschland GmbH
Pages403-416
Number of pages14
ISBN (Print)9783030678319
DOIs
StatePublished - 2021
Event27th International Conference on MultiMedia Modeling, MMM 2021 - Prague, Czech Republic
Duration: 22 Jun 202124 Jun 2021

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume12572 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference27th International Conference on MultiMedia Modeling, MMM 2021
Country/TerritoryCzech Republic
CityPrague
Period22/06/2124/06/21

Keywords

  • Body joints
  • Channelwise convolution
  • Gesture representation
  • Video emotion recognition

Fingerprint

Dive into the research topics of 'Time-Dependent Body Gesture Representation for Video Emotion Recognition'. Together they form a unique fingerprint.

Cite this