Publications

Evaluating LLM-based Agents: Foundations, Best Practices and Open Challenges