Hierarchical Bias-Driven Stratification for Interpretable Causal Effect EstimationLucile Ter-MinassianLiran Szlaket al.2025AISTATS 2025
InspectorRAGet: An Introspection Platform for RAG EvaluationBenjamin SznajderKshitij Fadniset al.2025NAACL 2025
Designing and implementing LLM guardrails components in production environmentsMateus Do Amor Devino PereiraEvaline Juet al.2025CAIN 2025
Field Trials of Autonomous Navigation Robot for Visually Impaired PeopleHironobu TakagiKakuya Naitoet al.2025CHI 2025
A new framework for evaluating model out-of-distribution generalisation for the biochemical domainRaúl Fernández DíazLam Thanh Hoanget al.2025ICLR 2025
SafeMERGE: Preserving Safety Alignment in Fine-Tuned Large Language Models via Selective Layer-Wise Model MergingAladin DjuheraSwanand Ravindra Kadheet al.2025ICLR 2025
Cloud Native Communities in Action: How Japan Shaped Its Path to KubeConNoriaki FukuyasuYuichi Nakamuraet al.2025KubeCon EU 2025
A new framework for evaluating machine learning in biochemistry and its application for small molecules and peptidesRaúl Fernández DíazLam Thanh Hoanget al.2025IRB-AI-DD 2025