Stanley F. Chen, Brian Kingsbury, et al.
IEEE Transactions on Audio, Speech and Language Processing
This paper exploits the fact that when GMM and SVM classifiers with roughly the same level of performance exhibit uncorrelated errors, they can be combined to produce a better classifier. The gain comes from combining the descriptive strength of GMMs with the discriminative power of SVM classifiers. This idea, first exploited in the context of speaker recognition [1, 2], is applied to speech recognition, specifically to a digit recognition task in a noisy environment, with significant gains in performance.
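The intuition behind combining classifiers with uncorrelated errors can be illustrated with a small simulation. The sketch below is not the paper's system: the two score streams are synthetic stand-ins for GMM and SVM outputs, the independent Gaussian noise model is an assumption, and the names `simulate` and `error_rate` are hypothetical. It only shows that averaging two equally accurate scores with independent errors tends to lower the error rate.

```python
import random


def error_rate(scores, labels):
    """Fraction of samples whose score sign disagrees with the +1/-1 label."""
    wrong = sum(1 for s, y in zip(scores, labels) if (s >= 0) != (y > 0))
    return wrong / len(labels)


def simulate(n=20000, noise=1.0, seed=0):
    """Return error rates of two synthetic classifiers and of their average.

    Assumption: each classifier emits the true label plus independent
    Gaussian noise, so both have similar accuracy and uncorrelated errors.
    """
    rng = random.Random(seed)
    labels = [rng.choice((-1, 1)) for _ in range(n)]
    # Hypothetical stand-ins for per-sample GMM and SVM scores.
    scores_a = [y + rng.gauss(0.0, noise) for y in labels]
    scores_b = [y + rng.gauss(0.0, noise) for y in labels]
    # Score-level fusion: a plain average of the two scores.
    fused = [(a + b) / 2.0 for a, b in zip(scores_a, scores_b)]
    return (
        error_rate(scores_a, labels),
        error_rate(scores_b, labels),
        error_rate(fused, labels),
    )


if __name__ == "__main__":
    err_a, err_b, err_fused = simulate()
    print(f"classifier A: {err_a:.3f}")
    print(f"classifier B: {err_b:.3f}")
    print(f"fused:        {err_fused:.3f}")
```

Under this noise model, averaging halves the noise variance, which is why the fused error is lower. If the two noise terms were strongly correlated, the average would offer little or no gain.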
George Saon
ICASSP 2006
Tara N. Sainath, I-Hsin Chung, et al.
INTERSPEECH 2014
Gakuto Kurata, Bhuvana Ramabhadran, et al.
ASRU 2017