Audio enhancing with DNN autoencoder for speaker recognition
Oldřich Plchot, Lukáš Burget, et al.
ICASSP 2016
Techniques for efficient speaker recognition are presented. These techniques are based on approximating Gaussian mixture modeling (GMM) likelihood scoring using approximated cross entropy (ACE). Gaussian mixture modeling is used for representing both training and test sessions and is shown to perform speaker recognition and retrieval extremely efficiently without any notable degradation in accuracy compared to classic GMM-based recognition. In addition, a GMM compression algorithm is presented. This algorithm decreases considerably the storage needed for speaker retrieval. © 2006 IEEE.
Oldřich Plchot, Lukáš Burget, et al.
ICASSP 2016
Oren Barkan, Hagai Aronowitz
ICASSP 2013
Alexander Sorin, Hagai Aronowitz, et al.
ICASSP 2011
Zvi Kons, Hagai Aronowitz, et al.
INTERSPEECH 2022