Swagath Venkataramani

Title

Principal Research Scientist, AIU Architecture and Compilers

Publications

4-bit quantization of LSTM-based speech recognition models
- - Andrea Fasoli
  - Chia-Yu Chen
  - et al.
- 2021
- INTERSPEECH 2021
Efficacy of Pruning in Ultra-Low Precision DNNs
- - Sanchari Sen
  - Swagath Venkataramani
  - et al.
- 2021
- ISLPED 2021
RaPiD: AI Accelerator for Ultra-Low Precision Training and Inference
- - Swagath Venkataramani
  - Vijayalakshmi Srinivasan
  - et al.
- 2021
- ISCA 2021
Efficient Management of Scratch-Pad Memories in Deep Learning Accelerators
- - Subhankar Pal
  - Swagath Venkataramani
  - et al.
- 2021
- ISPASS 2021
A 7nm 4-Core AI Chip with 25.6TFLOPS Hybrid FP8 Training, 102.4TOPS INT4 Inference and Workload-Aware Throttling
- - Ankur Agrawal
  - Saekyu Lee
  - et al.
- 2021
- ISSCC 2021
Value Similarity Extensions for Approximate Computing in General-Purpose Processors
- - Younghoon Kim
  - Swagath Venkataramani
  - et al.
- 2021
- DATE 2021
Ultra-Low Precision 4-bit Training of Deep Neural Networks
- - Xiao Sun
  - Naigang Wang
  - et al.
- 2020
- NeurIPS 2020
ScaleCom: Scalable Sparsified Gradient Compression for Communication-Efficient Distributed Training
- - Chia-Yu Chen
  - Jiamin Ni
  - et al.
- 2020
- NeurIPS 2020
Efficient AI System Design with Cross-Layer Approximate Computing
- - Swagath Venkataramani
  - Xiao Sun
  - et al.
- 2020
- Proceedings of the IEEE
A 3.0 TFLOPS 0.62V Scalable Processor Core for High Compute Utilization AI Training and Inference
- - Jinwook Oh
  - Sae Kyu Lee
  - et al.
- 2020
- VLSI Circuits 2020

Patents

- 25 Feb 2025
- CZ
- 4356236
Reformatting Of Tensors To Provide Sub-tensors
- 25 Feb 2025
- HU
- 4356236
Reformatting Of Tensors To Provide Sub-tensors
- 25 Feb 2025
- CH
- 4356236
Reformatting Of Tensors To Provide Sub-tensors
- 24 Feb 2025
- US
- 12236338
Single Function To Perform Combined Matrix Multiplication And Bias Add Operations
- 11 Nov 2024
- US
- 12141513
Method To Map Convolutional Layers Of Deep Neural Network On A Plurality Of Processing Elements With Simd Execution Units, Private Memories, And Connected As A 2d Systolic Processor Array
- 15 Oct 2024
- GB
- 2604060
Hybrid Data-model Parallelism For Efficient Deep Learning
- 16 Sep 2024
- US
- 12094525
Multichannel Memory To Augment Local Memory
- 05 Aug 2024
- US
- 12056594
Low Precision Deep Neural Network Enabled By Compensation Instructions
- 08 Jul 2024
- CN
- ZL201980032566.3
Low Precision Deep Neural Network Enabled By Compensation Instructions
- 02 Jun 2024
- JP
- 7497946
Hybrid Data-model Parallelism For Efficient Deep Learning

Top collaborators

MZ

Matthew Ziegler

Matthew Ziegler

Principal Research Scientist

AM

Alberto Mannari

Alberto Mannari

Software Developer

XC

Xiaodong Cui

Xiaodong Cui

Principal Research Scientist

KE

Kaoutar El Maghraoui

Kaoutar El Maghraoui

Principal Research Scientist and Manager, AIU Spyre Model Enablement, AI Hardware Center