Publications

11 results for Erik Miehling

Who Sees the Risk? Stakeholder Conflicts and Explanatory Policies in LLM-based Risk Assessment
- - Srishti Yadav
  - Jasmina Gajcin
  - et al.
- 2026
- AAAI 2026
Workshop paper
Foundations of Agentic Systems Theory
- - Erik Miehling
  - Chenchen Ye
  - et al.
- 2026
- AAAI 2026
Workshop
Learning to Steer Large Language Models
- - Erik Miehling
  - Irene Ko
  - et al.
- 2026
- AAAI 2026
Tutorial
Synthetic Data for Evaluation: Supporting LLM-as-a-Judge Workflows with EvalAssist
- - Elizabeth Daly
  - Erik Miehling
  - et al.
- 2025
- EMNLP 2025
Demo paper
Localizing Persona Representations in LLMs
- - Celia Cintas
  - Miriam Rateike
  - et al.
- 2025
- AIES 2025
Conference paper
Localizing Persona Representations in LLMs
- - Celia Cintas
  - Miriam Rateike
  - et al.
- 2025
- COLM 2025
Workshop paper
Granite Guardian: Comprehensive LLM Safeguarding
- - Inkit Padhi
  - Manish Nagireddy
  - et al.
- 2025
- NAACL 2025
Conference paper
Programming Refusal with Conditional Activation Steering
- - Bruce Lee
  - Inkit Padhi
  - et al.
- 2025
- ICLR 2025
Conference paper
Attack Atlas: A Practitioner's Perspective on Challenges and Pitfalls in Red Teaming GenAI
- - Ambrish Rawat
  - Stefan Schoepf
  - et al.
- 2024
- NeurIPS 2024
Workshop
Language Models in Dialogue: Conversational Maxims for Human-AI Interactions
- - Erik Miehling
  - Manish Nagireddy
  - et al.
- 2024
- EMNLP 2024
Paper