Inter dataset variability compensation for speaker recognition

Hagai Aronowitz

doi:10.1109/ICASSP.2014.6854353

ICASSP 2014

Conference paper

04 May 2014

Inter dataset variability compensation for speaker recognition

View publication

Abstract

Recently satisfactory results have been obtained in NIST speaker recognition evaluations. These results are mainly due to accurate modeling of a very large development dataset provided by LDC. However, for many realistic scenarios the use of this development dataset is limited due to a dataset mismatch. In such cases, collection of a large enough dataset is infeasible. In this work we analyze the sources of degradation for a particular setup in the context of an i-vector PLDA system and conclude that the main source for degradation is an i-vector dataset shift. As a remedy, we introduce inter dataset variability compensation (IDVC) to explicitly compensate for dataset shift in the i-vector space. This is done using the nuisance attribute projection (NAP) method. Using IDVC we managed to reduce error dramatically by more than 50% for the domain mismatch setup. © 2014 IEEE.

Conference paper