Vela: A Virtualized LLM Training System with GPU Direct and RoCEApoorve MohanRobert Walkupet al.2025ASPLOS 2025Conference paper
Enabling Scalability in the Cloud for Scientific Workflows: An Earth Science Use CasePaula OlayaJakob Luettgauet al.2023CLOUD 2023Conference paper
Best Practices for HPC Workloads on Public Cloud Platforms A guide for computational scientists to use public cloud for HPC workloadsRobert WalkupSeetharami R. Seelamet al.2022ICPE 2022Conference paper
Parallelism-Centric optimization and performance study of a finance aggregation engine on modern NUMA systemsGuojing CongSophia Wenet al.2015WHPCF 2015Conference paper