mt-rag-benchmark

MTRAGEval

🎉 Welcome to MTRAGEval! MTRAGEval is a task for Evaluating Multi-Turn RAG Conversations at SemEval 2026. 🎉

Join Our Mailing List!

MTRAGEval Mailing List

Training and Trial Data

The MTRAG Benchmark is released as the trial and training data for MTRAGEval. You can access the full dataset here.

📋 Tasks

Read more about our tasks in our proposal.

Evaluation Scripts

Stay Tuned! Coming Soon!

📆 Timeline (Tentative)

Task Organizers

Sara Rosenthal ✉️
Yannis Katsis ✉️
Vraj Shah ✉️
Marina Danilevsky ✉️