LLMs are Brittle to Simple Code Transformations: Introducing CETBench – A Benchmark for Code-Equivalence Checking. Neeva Oza, Ishaan Govil, et al. 2026. ACL 2026. Paper.
FILL IN THE BLANK: EXPLORING AND ENHANCING LLM CAPABILITIES FOR BACKWARD REASONING IN MATH WORD PROBLEMS. Aniruddha Deb, Neeva Oza, et al. 2024. ACL 2024. Workshop paper.