Using Large Language Models to Understand Suicidality in a Social Media–Based Taxonomy of Mental Health Disorders: Linguistic Analysis of Reddit PostsBrian BauerRaquel Norelet al.2024JMIR Mental Health
Large Language Models are Efficient Learners of Noise-Robust Speech RecognitionYuchen HuChen Chenet al.2024ICLR 2024
It's Never Too Late: Fusing Acoustic Information into Large Language Models for Automatic Speech RecognitionChen ChenRuizhe Liet al.2024ICLR 2024
MULTIPLE REPRESENTATION TRANSFER FROM LARGE LANGUAGE MODELS TO END-TO-END ASR SYSTEMSTakuma UdagawaMasayuki Suzukiet al.2024ICASSP 2024
Joint Unsupervised and Supervised Training for Automatic Speech Recognition via Bilevel OptimizationA F M SaifXiaodong Cuiet al.2024ICASSP 2024
Speak While You Think: Streaming Speech Synthesis During Text GenerationAvihu DekelSlava Shechtmanet al.2024ICASSP 2024
A Framework for Mining Speech-to-Text Transcripts of the Customer for Automated Problem RemediationPrateeti MohapatraGargi Dasgupta2024IAAI 2024
Creating an African American-Sounding TTS: Guidelines, Technical Challenges, and Surprising EvaluationsClaudio Santos PinhanezRaul Fernandezet al.2024IUI 2024