Reducing Data Motion to Accelerate the Training of Deep Neural Networks. Sicong Zhuang, Cristiano Malossi, et al. 2020. arXiv.
XwattPilot: A Full-stack Cloud System Enabling Agile Development of Transprecision Software for Low-power SoCs. Dionysios Diamantopoulos, Florian Scheidegger, et al. 2020. COOL CHIPS 2020.
Constrained deep neural network architecture search for IoT devices accounting for hardware calibration. Florian Scheidegger, Luca Benini, et al. 2019. NeurIPS 2019.
FloatX: A C++ library for customized floating-point arithmetic. Goran Flegar, Florian Scheidegger, et al. 2019. ACM TOMS.
TAPAS: Train-less accuracy predictor for architecture search. Roxana Istrate, Florian Scheidegger, et al. 2019. AAAI 2019.
NeuNetS: An Automated Synthesis Engine for Neural Network Design. Atin Sood, Benjamin Elder, et al. 2019. arXiv.
A scalable iterative dense linear system solver for multiple right-hand sides in data analytics. Vassilis Kalantzis, Cristiano Malossi, et al. 2018. Parallel Computing.
The transprecision computing paradigm: Concept, design, and applications. Cristiano Malossi, Michael Schaffner, et al. 2018. DATE 2018.