Global RNN Transducer Models For Multi-dialect Speech RecognitionTakashi FukudaSamuel Thomaset al.2022INTERSPEECH 2022
Improving ASR Robustness in Noisy Condition Through VAD IntegrationSashi NovitasariTakashi Fukudaet al.2022INTERSPEECH 2022
Towards Reducing the Need for Speech Training Data To Build Spoken Language Understanding SystemsSamuel ThomasJeff Kuoet al.2022ICASSP 2022
Towards End-to-end Integration of Dialog History For Improved Spoken Language UnderstandingVishal SunderSamuel Thomaset al.2022ICASSP 2022
Speech Emotion Recognition Using Self-Supervised FeaturesEdmilson MoraisRon Hooryet al.2022ICASSP 2022
SpeechSplit2.0: Unsupervised Speech Disentanglement for Voice Conversion without Tuning Autoencoder BottlenecksChak Ho ChanKaizhi Qianet al.2022ICASSP 2022
Integrating Text Inputs For Training and Adapting RNN Transducer ASR ModelsSamuel ThomasBrian Kingsburyet al.2022ICASSP 2022
Speech Recognition using Biologically-Inspired Neural NetworksThomas BohnstinglAyush Garget al.2022ICASSP 2022
Decentralized Bilevel Optimization for Personalized Client LearningSongtao LuXiaodong Cuiet al.2022ICASSP 2022