MTRAG: A Multi-Turn Conversational Benchmark for Evaluating Retrieval-Augmented Generation Systems. Yannis Katsis, Sara Rosenthal, et al. 2025. ACL 2025.
How do Categorical Duplicates Affect ML? A New Benchmark and Empirical Analyses. Vraj Shah, Thomas Parashos, et al. 2024. VLDB 2024.