MAD-MAX: Modular And Diverse Malicious AttackMiXtures for Automated LLM Red TeamingStefan SchoepfMuhammad Zaid Hameedet al.2025ICML 2025
Attack Atlas: A Practitioner's Perspective on Challenges and Pitfalls in Red Teaming GenAIAmbrish RawatStefan Schoepfet al.2024NeurIPS 2024
MoJE: Mixture of Jailbreak Experts, Naive Tabular Classifiers as Guard for Prompt AttacksGiandomenico CornacchiaKieran Fraseret al.2024AIES 2024
Domain Adaptation for Time series Transformers using One-step fine-tuningSubina KhanalSeshu Tirupathiet al.2024AAAI 2024
Pruning Federated Learning Models for Anomaly Detection in Resource-Constrained EnvironmentsSimone MagnaniStefano Braghinet al.2023Big Data 2023
FairSISA: Ensemble Post-Processing to Improve Fairness of Unlearning in LLMsSwanand Ravindra KadheAnisa Halimiet al.2023NeurIPS 2023
Machine Learning Platform for Extreme Scale Computing on Compressed IoT DataSeshu TirupathiDhaval Salwalaet al.2022Big Data 2022
Federated Continual Learning with Differentially Private Data SharingGiulio ZizzoAmbrish Rawatet al.2022NeurIPS 2022