SparCE: Sparsity Aware General-Purpose Core Extensions to Accelerate Deep Neural Networks. Sanchari Sen, Shubham Jain, et al. 2019. IEEE TC.
A Compiler for Deep Neural Network Accelerators to Generate Optimized Code for a Wide Range of Data Parameters from a Hand-crafted Computation Kernel. Eri Ogawa, Kazuaki Ishizaki, et al. 2019. COOL CHIPS 2019.
Data Subsetting: A Data-Centric Approach to Approximate Computing. Younghoon Kim, Swagath Venkataramani, et al. 2019. DATE 2019.
A Scalable Multi-TeraOPS Core for AI Training and Inference. Sunil Shukla, Bruce Fleischer, et al. 2018. IEEE SSC-L.
A Scalable Multi-TeraOPS Deep Learning Processor Core for AI Training and Inference. Bruce Fleischer, Sunil Shukla, et al. 2018. VLSI Circuits 2018.
DyHard-DNN: Even More DNN Acceleration with Dynamic Hardware Reconfiguration. Mateja Putic, Alper Buyuktosunoglu, et al. 2018. DAC 2018.
Compensated-DNN: Energy Efficient Low-Precision Deep Neural Networks by Compensating Quantization Errors. Shubham Jain, Swagath Venkataramani, et al. 2018. DAC 2018.
Exploiting Approximate Computing for Deep Learning Acceleration. Chia-Yu Chen, Jungwook Choi, et al. 2018. DATE 2018.
POSTER: Design Space Exploration for Performance Optimization of Deep Neural Networks on Shared Memory Accelerators. Swagath Venkataramani, Jungwook Choi, et al. 2017. PACT 2017.
Low-overhead Error Prediction and Preemption in Deep Neural Network Using Apriori Network Statistics. 24 May 2021. US11016840.
Programmable Data Delivery by Load and Store Agents on a Processing Chip Interfacing with On-chip Memory Components and Directing Data to External Memory Components. 16 Nov 2020. US10838868.
Processor and Memory Transparent Convolutional Lowering and Auto Zero Padding for Deep Neural Network Implementations. 17 Feb 2020. US10565285.
Mori Ohara. Deputy Director, IBM Research Tokyo; Distinguished Engineer; Chief SW Engineer for Hybrid Cloud on IBM HW.