DARE to Diversify: DAta Driven and Diverse LLM REd TeamingManish NagireddyBernat Guillen Pegueroleset al.2024KDD 2024
Prompt Templates: A Methodology for Improving Manual Red Teaming PerformanceBrandon DominiqueDavid Piorkowskiet al.2024CHI 2024
Simulating Iterative Human-AI Interaction in Programming with LLMsHussein MozannarValerie Chenet al.2023NeurIPS 2023
Influence Based Approaches to Algorithmic Fairness: A Closer LookSoumya GhoshPrasanna Sattigeriet al.2023NeurIPS 2023