Portable High‑Performance LLM Serving: A Triton Backend for vLLMBurkhard RingleinJan van Lunteren2026PyTorchEU 2026Talk
vllm-triton-backend: How to get state-of-the-art performance on NVIDIA and AMD with just tritonBurkhard RingleinThomas Parnellet al.2025PyTorch Conference 2025Talk
The Anatomy of a Triton Attention BackendBurkhard RingleinJan van Lunterenet al.2025Triton Developer Conference 2025Poster
Accelerating Decision-Tree-based Inference through Adaptive ParallelizationJan van Lunteren2023PACT 2023Conference paper