Publications

19 results for Ambrish Rawat

MAD-MAX: Modular And Diverse Malicious AttackMiXtures for Automated LLM Red Teaming
- - Stefan Schoepf
  - Muhammad Zaid Hameed
  - et al.
- 2025
- ICML 2025
Granite Guardian: Comprehensive LLM Safeguarding
- - Inkit Padhi
  - Manish Nagireddy
  - et al.
- 2025
- NAACL 2025
Attack Atlas: A Practitioner's Perspective on Challenges and Pitfalls in Red Teaming GenAI
- - Ambrish Rawat
  - Stefan Schoepf
  - et al.
- 2024
- NeurIPS 2024
MoJE: Mixture of Jailbreak Experts, Naive Tabular Classifiers as Guard for Prompt Attacks
- - Giandomenico Cornacchia
  - Kieran Fraser
  - et al.
- 2024
- AIES 2024
Data Forging Is Harder Than You Think
- - Mohamed Suliman
  - Swanand Ravindra Kadhe
  - et al.
- 2024
- ICLR 2024
Domain Adaptation for Time series Transformers using One-step fine-tuning
- - Subina Khanal
  - Seshu Tirupathi
  - et al.
- 2024
- AAAI 2024
Pruning Federated Learning Models for Anomaly Detection in Resource-Constrained Environments
- - Simone Magnani
  - Stefano Braghin
  - et al.
- 2023
- Big Data 2023
FairSISA: Ensemble Post-Processing to Improve Fairness of Unlearning in LLMs
- - Swanand Ravindra Kadhe
  - Anisa Halimi
  - et al.
- 2023
- NeurIPS 2023
Machine Learning Platform for Extreme Scale Computing on Compressed IoT Data
- - Seshu Tirupathi
  - Dhaval Salwala
  - et al.
- 2022
- Big Data 2022
Federated Continual Learning with Differentially Private Data Sharing
- - Giulio Zizzo
  - Ambrish Rawat
  - et al.
- 2022
- NeurIPS 2022