LLMs are Brittle to Simple Code Transformations: Introducing CETBench – A Benchmark for Code-Equivalence CheckingNeeva OzaIshaan Govilet al.2026ACL 2026Paper
PO-KGQA: Preference Optimization for Low-Resource Complex Knowledge Graph Question AnsweringPrerna AgarwalAyushman Singhet al.2026ACL 2026Paper
STaD: Scaffolded Task Design for Identifying Compositional Skill Gaps in LLMsSungeun AnSwanand Ravindra Kadheet al.2026ACL 2026Paper
AutoForest: Automatically Generating Forest Plots from Biomedical Studies with End-to-End Evidence Extraction and SynthesisMassimiliano PronestiAngelo Miculescuet al.2026ACL 2026Demo paper
SemEval-2026 Task 7: Everyday Knowledge Across Diverse Languages and CulturesNedjma OusidhoumJunho Myunget al.2026ACL 2026Workshop paper
SemEval-2026 Task 8: MTRAGEval: Evaluating Multi-Turn RAG ConversationsSara RosenthalYannis Katsiset al.2026ACL 2026Workshop