SKIP-SALSA: Skip Synchronous Fusion of ASR LLM Decoders. Ashish Mittal, Darshan Prabhu, et al. 2025. INTERSPEECH 2025. Conference paper.
Voice Activity-based Text Segmentation for ASR Text Denormalization. Sashi Novitasari, Takashi Fukuda, et al. 2025. INTERSPEECH 2025. Conference paper.
Improving End-to-end Mixed-case ASR with Knowledge Distillation and Integration of Voice Activity Cues. Sashi Novitasari, Takashi Fukuda, et al. 2025. INTERSPEECH 2025. Conference paper.
Spoken question answering for visual queries. Nimrod Shabtay, Zvi Kons, et al. 2025. INTERSPEECH 2025. Conference paper.
Exploring the Limits of Conformer CTC-Encoder for Speech Emotion Recognition using Large Language Models. Edmilson Da Silva Morais, Hagai Aronowitz, et al. 2025. INTERSPEECH 2025. Conference paper.