Lars Graf, Thomas Bohnstingl, et al.
NeurIPS 2025
State-of-the-art neural language models can now be used to solve ad-hoc language tasks through zero-shot without the need for supervised training. This approach has gained popularity in recent years, and researchers have demonstrated prompts that achieve strong accuracy on specific NLP tasks. However, finding a prompt for new tasks requires experimentation. Different prompt templates with different wording choices lead to significant accuracy differences. PromptIDE allows users to experiment with prompt variations, visualize prompt performance, and iteratively optimize prompts. We developed a workflow that allows users to first focus on model feedback using small data before moving on to a large data regime that allows empirical grounding of promising prompts using quantitative measures of the task. The tool then allows easy deployment of the newly created ad-hoc models. We demonstrate the utility of PromptIDE (demo: ) and our workflow using several real-world use cases.
Lars Graf, Thomas Bohnstingl, et al.
NeurIPS 2025
Yuya Jeremy Ong, Jay Pankaj Gala, et al.
IEEE CISOSE 2024
Michael Glass, Gaetano Rossiello, et al.
NAACL 2022
Viviane T. Silva, Breno William Santos Rezende de Carvalho, et al.
ACS Fall 2023