Compress then Serve: Serving Thousands of LoRA Adapters with Little Overhead. Rickard Gabrielsson, Jiacheng Zhu, et al. 2025. ICML 2025. Conference paper.
Efficient multi-prompt evaluation of LLMs. Felipe Maia Polo, Ronald Xu, et al. 2024. NeurIPS 2024. Conference paper.
Knowledge-Based News Event Analysis & Forecasting Toolkit. Oktie Hassanzadeh, Parul Awasthy, et al. 2022. IJCAI 2022. Demo paper.