A journey to enable generative AI on a new hardware platform with PyTorch 2.0Kazuaki Ishizaki2023PyTorch Conference 2023
RaPiD: AI Accelerator for Ultra-Low Precision Training and InferenceSwagath VenkataramaniVijayalakshmi Srinivasanet al.2021ISCA 2021
Efficient AI System Design with Cross-Layer Approximate ComputingSwagath VenkataramaniXiao Sunet al.2020Proceedings of the IEEE
DeepTools: Compiler and Execution Runtime Extensions for RaPiD AI AcceleratorSwagath VenkataramaniJungwook Choiet al.2019IEEE Micro
A Compiler for Deep Neural Network Accelerators to Generate Optimized Code for a Wide Range of Data Parameters from a Hand-crafted Computation KernelEri OgawaKazuaki Ishizakiet al.2019COOL CHIPS 2019
Analyzing and optimizing Java code generation for apache spark query planKazuaki Ishizaki2019ICPE 2019
Identifying the potential of near data processing for Apache SparkAhsan Javed AwanEduard Ayguadéet al.2017MEMSYS 2017
Accelerating Spark Datasets by Inlining DeserializationJan WroblewskiKazuaki Ishizakiet al.2017IPDPS 2017
Compiling and Optimizing Java 8 Programs for GPU ExecutionKazuaki IshizakiAkihiro Hayashiet al.2015PACT 2015