Towards Trustworthy and Socially Responsible Generative Foundation ModelsYue HuangZhenhong Zhouet al.2026AAAI 2026Tutorial
BenchmarkCards: Standardized Documentation for Large Language Model BenchmarksAnna SokolElizabeth Dalyet al.2025NeurIPS 2025Conference paper
Adaptive Distraction: Probing LLM Contextual Robustness with Automated Tree SearchYanbo WangZixiang Xuet al.2025NeurIPS 2025Conference paper