Protecting Users From Themselves: Safeguarding Contextual Privacy in Interactions with Conversational AgentsIvoline NgongSwanand Ravindra Kadheet al.2024NeurIPS 2024
Final-Model-Only Data Attribution with a Unifying View of Gradient-Based MethodsDennis WeiInkit Padhiet al.2024NeurIPS 2024
Causal Markov Blanket Representation Learning for Domain Generalization PredictionNaiyu YinHanjing Wanget al.2024ECCV 2024
NeuroPrune: A Neuro-inspired Topological Sparse Training Algorithm for Large Language ModelsAmit DhurandharTejaswini Pedapatiet al.2024ACL 2024
Trust Regions for Explanations via Black-Box Probabilistic CertificationAmit DhurandharSwagatam Haldaret al.2024ICML 2024
Fusion of biomedical imaging studies for increased sample size and diversity: a case study of brain MRIMatias AiskovichEduardo Castroet al.2024Frontiers in Radiology
Model Agnostic Contrastive Explanations for Classification ModelsAmit DhurandharTejaswini Pedapatiet al.2024IEEE JESTCS
Locally Invariant Explanations: Towards Stable and Unidirectional Explanations through Local Invariant LearningAmit DhurandharKarthikeyan Natesan Ramamurthyet al.2023NeurIPS 2023