A coarse-grained model for optimal coupling of ASR and SMT systems for Speech translation

Gaurav Kumar; Graeme Blackwood; Jan Trmal; Daniel Povey; Sanjeev Khudanpur

doi:10.18653/v1/d15-1218

EMNLP 2015

Conference paper

17 Sep 2015

A coarse-grained model for optimal coupling of ASR and SMT systems for Speech translation

View publication

Abstract

Speech translation is conventionally carried out by cascading an automatic speech recognition (ASR) and a statistical machine translation (SMT) system. The hypotheses chosen for translation are based on the ASR system's acoustic and language model scores, and typically optimized for word error rate, ignoring the intended downstream use: automatic translation. In this paper, we present a coarseto-fine model that uses features from the ASR and SMT systems to optimize this coupling. We demonstrate that several standard features utilized by ASR and SMT systems can be used in such a model at the speech-translation interface, and we provide empirical results on the Fisher Spanish-English speech translation corpus.

Paper