"Is Finer Better? The Limits of Microscaling Formats in Large Language Models." Andrea Fasoli, Monodeep Kar, et al. ICLR 2026. Conference paper.
"Eliminating Redundancy: Ultra-compact Code Generation for Programmable Dataflow Accelerators." Prasanth Chatarasi, Alex Gatea, et al. CGO 2026. Conference paper.
"Spyre: An inference-optimized scalable AI accelerator for enterprise workloads." Matt Cohen, Monodeep Kar, et al. ISSCC 2026. Conference paper.
"Enabling Spill-Free Compilation via Affine-Based Live Range Reduction Optimization." Prasanth Chatarasi, Alex Gatea, et al. CGO 2026. Conference paper.
"Breaking the HBM Bit Cost Barrier: Domain-Specific ECC for AI Inference Infrastructure." Rui Xie, Asad Ul Haq, et al. IEEE Computer Architecture Letters, 2025. Paper.
"MixTrain: accelerating DNN training via input mixing." Sarada Krithivasan, Sanchari Sen, et al. Frontiers in Artificial Intelligence, 2024. Paper.
"A Software-Assisted Peak Current Regulation Scheme to Improve Power-Limited Inference Performance in a 5nm AI SoC." Monodeep Kar, Joel Silberman, et al. ISSCC 2024. Conference paper.
"DNNDaSher: A Compiler Framework for Dataflow Compatible End-to-End Acceleration on IBM AIU." Sanchari Sen, Shubham Jain, et al. IEEE Micro, 2024. Paper.
"Power-Limited Inference Performance Optimization Using a Software-Assisted Peak Current Regulation Scheme in a 5-nm AI SoC." Monodeep Kar, Joel Silberman, et al. IEEE Journal of Solid-State Circuits, 2024. Paper.
"Deep Compression of Pre-trained Transformer Models." Naigang Wang, Chi-Chun Liu, et al. NeurIPS 2022. Conference paper.
US11016840 (24 May 2021): "Low-overhead Error Prediction And Preemption In Deep Neural Network Using Apriori Network Statistics."
US10838868 (16 Nov 2020): "Programmable Data Delivery By Load And Store Agents On A Processing Chip Interfacing With On-chip Memory Components And Directing Data To External Memory Components."
US10565285 (17 Feb 2020): "Processor And Memory Transparent Convolutional Lowering And Auto Zero Padding For Deep Neural Network Implementations."