Fine-tuning for Extreme Event Prediction: Are Ensemble Methods All You Need?Imran NasimJoao Lucas de Sousa Almeida2025KDD 2025
ConCodeEval: Evaluating Large Language Models for Code Constraints in Domain-Specific LanguagesMehant KammakomatiSameer Pimparkhedeet al.2025ACL 2025
Combining Domain and Alignment Vectors Provides Better Knowledge-Safety Trade-offs in LLMsMegh ThakkarQuentin Fournieret al.2025ACL 2025
Multi-Level Explanations for Generative Language ModelsLucas Monteiro PaesDennis Weiet al.2025ACL 2025
NGQA: A Nutritional Graph Question Answering Benchmark for Personalized Health-aware Nutritional ReasoningZheyuan ZhangYiyang Liet al.2025ACL 2025
Conceptual Diagnostics for Knowledge Graphs and Large Language ModelsRosario Uceda-SosaMaria Changet al.2025ACL 2025
EpMAN: Episodic Memory AttentioN for Generalizing to Longer ContextsSUBHAJIT CHAUDHURYPayel Daset al.2025ACL 2025
Defensive Prompt Patch: A Robust and Generalizable Defense of Large Language Models against Jailbreak AttacksChen XiongXiangyu Qiet al.2025ACL 2025
PLAY2PROMPT: Zero-shot Tool Instruction Optimization for LLM Agents via Tool PlayWei FangYang Zhanget al.2025ACL 2025
Protecting Users From Themselves: Safeguarding Contextual Privacy in Interactions with Conversational AgentsIvoline NgongSwanand Ravindra Kadheet al.2025ACL 2025