Improved adversarial image captioning

Pierre Dognin; Igor Melnyk; Youssef Mroueh; Jerret Ross; Tom Sercu

DGS@ICLR Workshop 2019

Conference paper

06 May 2019

Improved adversarial image captioning

Abstract

In this paper we study image captioning as a conditional GAN training, proposing both a context-aware LSTM captioner and co-attentive discriminator, which enforces semantic alignment between images and captions. We investigate the viability of two discrete GAN training methods: Self-critical Sequence Training (SCST) and Gumbel Straight-Through (ST) and demonstrate that SCST shows more stable gradient behavior and improved results over Gumbel ST.

Conference paper