Donato Francesco Falcone, Stephan Menzel, et al.
IMW 2023
Analogue in-memory computing (AIMC) is an emerging computational paradigm that can efficiently accelerate the key operations in deep learning (DL) inference workloads. Heterogeneous architectures, which integrate both AIMC tiles and digital processing units, have been proposed to enable the end-to-end execution of various deep neural network models. However, developing a software stack for these architectures is challenging, owing to their distinct characteristics — such as the need for extensive or complete weight stationarity and pipelined execution across layers, if maximum performance is to be achieved. Moreover, AIMC tiles are inherently stochastic and hence introduce a combination of stochastic and deterministic noise, which adversely affects accuracy. As a result, existing tools for software stack development are not directly applicable. In this Perspective, we give an overview of the key attributes of DL software stacks and AIMC-based accelerators, outline the challenges associated with designing DL software stacks for AIMC-based accelerators and present opportunities for future research.
Donato Francesco Falcone, Stephan Menzel, et al.
IMW 2023
Apoorve Mohan, Robert Walkup, et al.
ASPLOS 2025
Wei Ren, Sandhya Koteshwara, et al.
CLOUD 2024
Martin Cochet, Karthik Swaminathan, et al.
ISCA 2024