Self-Taught Recognizer: Toward Unsupervised Adaptation for Speech Foundation ModelsYuchen HuChen Chenet al.2024NeurIPS 2024
Gradient Cuff: Detecting Jailbreak Attacks on Large Language Models by Exploring Refusal Loss LandscapesXiaomeng XuPin-Yu Chenet al.2024NeurIPS 2024
Safe LoRA: the Silver Lining of Reducing Safety Risks when Fine-tuning Large Language ModelsChia-yi HsuYu-Lin Tsaiet al.2024NeurIPS 2024
Navigating the Safety Landscape: Measuring Risks in Finetuning Large Language ModelsShengyun PengPin-Yu Chenet al.2024NeurIPS 2024
Neural Network Reparametrization for Accelerated Optimization in Molecular SimulationsNima DehmamyCsaba Bothet al.2024NeurIPS 2024
Trans-LoRA: towards data-free Transferable Parameter Efficient FinetuningRunqian WangSoumya Ghoshet al.2024NeurIPS 2024
Graph-based Uncertainty Metrics for Long-form Language Model GenerationsMingjian JiangYangjun Yangjunet al.2024NeurIPS 2024
Dense Associative Memory Through the Lens of Random FeaturesBenjamin HooverDuen Horng Chauet al.2024NeurIPS 2024