Gradient Cuff: Detecting Jailbreak Attacks on Large Language Models by Exploring Refusal Loss Landscapes. Xiaomeng Xu, Pin-Yu Chen, et al. NeurIPS 2024.
Safe LoRA: the Silver Lining of Reducing Safety Risks when Fine-tuning Large Language Models. Chia-yi Hsu, Yu-Lin Tsai, et al. NeurIPS 2024.
Navigating the Safety Landscape: Measuring Risks in Finetuning Large Language Models. Shengyun Peng, Pin-Yu Chen, et al. NeurIPS 2024.
Trans-LoRA: towards data-free Transferable Parameter Efficient Finetuning. Runqian Wang, Soumya Ghosh, et al. NeurIPS 2024.
GREAT Score: Global Robustness Evaluation of Adversarial Perturbation using Generative Models. Zhaitang Li, Pin-Yu Chen, et al. NeurIPS 2024.
Unraveling Molecular Structure: A Multimodal Spectroscopic Dataset for Chemistry. Marvin Alberts, Oliver Schilter, et al. NeurIPS 2024.
ConMe: Rethinking Evaluation of Compositional Reasoning for Modern VLMs. Irene Huang, Wei Lin, et al. NeurIPS 2024.
Optimizing GPU Multiplexing for Efficient and Cost-Effective Access to Diverse Large Language Models in GPU Clusters. Yue Zhu, Chen Wang, et al. MASCOTS 2024.