Certified Interpretability Robustness for Class Activation MappingAlex GuTsui-Wei Wenget al.2020NeurIPS 2020
Reprogramming Language Models for Molecular Representation LearningRia VinodPin-Yu Chenet al.2020NeurIPS 2020
Elevating Defenses: Bridging Adversarial Training and Watermarking for Model ResilienceJanvi ThakkarGiulio Zizzoet al.2024AAAI 2024
Differentially Private and Adversarially Robust Machine Learning: An Empirical EvaluationJanvi ThakkarGiulio Zizzoet al.2024AAAI 2024
Forcing Generative Models to Degenerate Ones: The Power of Data Poisoning AttacksShuli JiangSwanand Ravindra Kadheet al.2023NeurIPS 2023
On Robustness-Accuracy Characterization of Large Language Models using Synthetic DatasetsChing-yun KoPin-Yu Chenet al.2023ICML 2023
c-MBA: Adversarial Attack for Cooperative MARL Using Learned Dynamics ModelNhan PhamLam Nguyenet al.2022NeurIPS 2022