Emotion recognition with multimodal features and temporal models

Shuai Wang; Wenxuan Wang; Jinming Zhao; Shizhe Chen; Qin Jin; Shilei Zhang; Yong Qin

doi:10.1145/3136755.3143016

ICMI 2017

Conference paper

03 Nov 2017

Emotion recognition with multimodal features and temporal models

View publication

Abstract

This paper presents our methods to the Audio-Video Based Emotion Recognition subtask in the 2017 Emotion Recognition in the Wild (EmotiW) Challenge. The task aims to predict one of the seven basic emotions for short video segments. We extract different features from audio and facial expression modalities. We also explore the temporal LSTM model with the input of frame facial features, which improves the performance of the non-Temporal model. The fusion of different modality features and the temporal model lead us to achieve a 58.5% accuracy on the testing set, which shows the effectiveness of our methods.

Conference paper

Pooling acoustic and lexical features for the prediction of valence

Zakaria Aldeneh, Soheil Khorram, et al.

ICMI 2017

View all publications

Abstract

Related

Pooling acoustic and lexical features for the prediction of valence