A Text-based Safety Benchmark for Reinforcement Learning ProblemsNgoc Lan HoangNicolas Galichetet al.2022NeurIPS 2022
Nonconvex Min-Max Bilevel Optimization for Task Robust Meta LearningAlex GuSongtao Luet al.2021ICML 2021
Q-function Decomposition with Intervention Semantics for Factored Action SpacesJunkyu LeeTian Gaoet al.2025AAAI 2025
How Do Nonlinear Transformers Acquire Generalization-Guaranteed CoT Ability?Hongkang LiMeng Wenget al.2024ICML 2024
Adversarial Data Augmentation Improves Unsupervised Machine LearningChia-Yi HsuPin-Yu Chenet al.2021ICLR 2021
Conditional Moment Alignment for Improved Generalization in Federated LearningJayanth RegattiSongtao Luet al.2022NeurIPS 2022
M2 ASR: Multilingual Multi-task Automatic Speech Recognition via Multi-objective OptimizationA SaifLisha Chenet al.2024INTERSPEECH 2024
How Can Personalized Context Help? Exploring Joint Retrieval of Passage and Personalized ContextHui WanHongkang Liet al.2024ICASSP 2024
Joint Unsupervised and Supervised Training for Automatic Speech Recognition via Bilevel OptimizationA F M SaifXiaodong Cuiet al.2024ICASSP 2024