AutoTuneX: Interactive Automated Fine-Tuning for Large Language Models. Daniel Karl I. Weidele, Priyanshu Rai, et al. 2026. AAAI 2026. Demo paper.
ToolSmith: A Multi-Agent Framework for Enterprise Tool Creation. Purna Chandra Sekhar Vakudavathu, Kushal Mukherjee, et al. 2026. AAAI 2026. Demo paper.
Risk Atlas Nexus: A System for Managing AI Risks. Inge Vejsbjerg, Rahul Nair, et al. 2026. AAAI 2026. Demo paper.
Auto-BenchmarkCard: Automated Synthesis of Benchmark Documentation. Aris Hofmann, Inge Vejsbjerg, et al. 2026. AAAI 2026. Demo paper.
DFAgent: From Natural Language Data Interactions to Reusable Agent-Ready Tools. Neelamadhav Gantayat, Renuka Sindhgatta, et al. 2026. AAAI 2026. Demo paper.
QueryGym: Step-by-Step Interaction with Relational Databases. Haritha Ananthakrishnan, Harsha Kokel, et al. 2026. AAAI 2026. Demo paper.
AssetOpsBench-Live: Privacy-Aware Online Evaluation of Multi-Agent Performance in Industrial Operations. Dhaval Patel, Nianjun Zhou, et al. 2026. AAAI 2026. Demo paper.
CLEAR: Error Analysis via LLM-as-a-Judge Made Easy. Asaf Yehudai, Lilach Edelstein, et al. 2026. AAAI 2026. Demo paper.
Towards Generalist Large Language Models for Molecular Property Prediction: Distilling Knowledge from Specialist Models. Huy Khiem Le, Sreejata Dey, et al. 2026. AAAI 2026. Workshop paper.
A Multi-Agent Framework for Enterprise Tool Creation. Purna Chandra Sekhar Vakudavathu, Kushal Mukherjee, et al. 2026. AAAI 2026. Workshop paper.