Yuya Jeremy Ong, Jay Pankaj Gala, et al.
IEEE CISOSE 2024
We present MTRAG-UN, a benchmark for exploring open challenges in multi-turn retrieval augment generation, a popular use of large language models. We release a benchmark of 666 tasks from 666 conversations containing over 2,800 conversation turns across 6 domains with accompanying corpora. Our experiments show that retrieval and generation models continue to struggle on conversations with UNanswerable, UNderspecified, and NONstandalone questions and UNclear responses.
Yuya Jeremy Ong, Jay Pankaj Gala, et al.
IEEE CISOSE 2024
Eyal Shnarch, Alon Halfon, et al.
EMNLP 2022
Avi Sil, Jaydeep Sen, et al.
ACL 2023
Elron Bandel, Asaf Yehudai, et al.
ICML 2026