Swagath Venkataramani

Title

Principal Research Scientist, AIU Architecture and Compilers

Publications

Is Finer Better? The Limits of Microscaling Formats in Large Language Models
- - Andrea Fasoli
  - Monodeep Kar
  - et al.
- 2026
- ICLR 2026
Conference paper
Eliminating Redundancy: Ultra-compact Code Generation for Programmable Dataflow Accelerators
- - Prasanth Chatarasi
  - Alex Gatea
  - et al.
- 2026
- CGO 2026
Conference paper
Spyre: An inference-optimized scalable AI accelerator for enterprise workloads
- - Matt Cohen
  - Monodeep Kar
  - et al.
- 2026
- ISSCC 2026
Conference paper
Enabling Spill-Free Compilation via Affine-Based Live Range Reduction Optimization
- - Prasanth Chatarasi
  - Alex Gatea
  - et al.
- 2026
- CGO 2026
Conference paper
DeepTools: A Full-Stack Machine Learning Compiler for the IBM Spyre Accelerator
- - Prasanth Chatarasi
  - Shubham Jain
  - et al.
- 2026
- CGO 2026
Workshop paper
Breaking the HBM Bit Cost Barrier: Domain-Specific ECC for AI Inference Infrastructure
- - Rui Xie
  - Asad Ul Haq
  - et al.
- 2025
- IEEE Computer Architecture Letters
Paper
MixTrain: accelerating DNN training via input mixing
- - Sarada Krithivasan
  - Sanchari Sen
  - et al.
- 2024
- Frontiers in Artificial Intelligence
Paper
A Software-Assisted Peak Current Regulation Scheme to Improve Power-Limited Inference Performance in a 5nm AI SoC
- - Monodeep Kar
  - Joel Silberman
  - et al.
- 2024
- ISSCC 2024
Conference paper
DNNDaSher: A Compiler Framework for Dataflow Compatible End-to-End Acceleration on IBM AIU
- - Sanchari Sen
  - Shubham Jain
  - et al.
- 2024
- IEEE Micro
Paper
Power-Limited Inference Performance Optimization Using a Software-Assisted Peak Current Regulation Scheme in a 5-nm AI SoC
- - Monodeep Kar
  - Joel Silberman
  - et al.
- 2024
- IEEE Journal of Solid-State Circuits
Paper

Top collaborators

Alberto Mannari

Software Developer

Prasanth Chatarasi

Staff Research Scientist, AIU Accelerator Compilers and Architecture

Matthew Ziegler

Principal Research Scientist

Paul G Crumley

STSM, AI & Hybrid Cloud Infrastructure

Swagath Venkataramani

Title

Publications

Is Finer Better? The Limits of Microscaling Formats in Large Language Models

Eliminating Redundancy: Ultra-compact Code Generation for Programmable Dataflow Accelerators

Spyre: An inference-optimized scalable AI accelerator for enterprise workloads

Enabling Spill-Free Compilation via Affine-Based Live Range Reduction Optimization

DeepTools: A Full-Stack Machine Learning Compiler for the IBM Spyre Accelerator

Breaking the HBM Bit Cost Barrier: Domain-Specific ECC for AI Inference Infrastructure

MixTrain: accelerating DNN training via input mixing

A Software-Assisted Peak Current Regulation Scheme to Improve Power-Limited Inference Performance in a 5nm AI SoC

DNNDaSher: A Compiler Framework for Dataflow Compatible End-to-End Acceleration on IBM AIU

Power-Limited Inference Performance Optimization Using a Software-Assisted Peak Current Regulation Scheme in a 5-nm AI SoC

Patents

Reformatting Of Tensors To Provide Sub-tensors

System-aware Selective Quantization For Performance Optimized Distributed Deep Learning

Hybrid Data-model Parallelism For Efficient Deep Learning

Stickification Using Anywhere Padding To Accelerate Data Manipulation

Training Convolution Neural Network On Analog Resistive Processing Unit System

Hybrid Data-model Parallelism For Efficient Deep Learning

Padding Input Data For Artificial Intelligence Accelerators

Multichannel Memory To Augment Local Memory

Multichannel Memory To Augment Local Memory

Multichannel Memory To Augment Local Memory

Top collaborators

Alberto Mannari

Prasanth Chatarasi

Matthew Ziegler

Paul G Crumley