Efficient image dataset classification difficulty estimation for predicting deep-learning accuracy. Florian Scheidegger, Roxana Istrate, et al. 2020. Visual Computer. Paper.
torcpy: Supporting task parallelism in Python. Panagiotis Hadjidoukas, A. Bartezzaghi, et al. 2020. SoftwareX. Paper.
Reducing Data Motion to Accelerate the Training of Deep Neural Networks. Sicong Zhuang, Cristiano Malossi, et al. 2020. arXiv. Paper.
XwattPilot: A Full-stack Cloud System Enabling Agile Development of Transprecision Software for Low-power SoCs. Dionysios Diamantopoulos, Florian Scheidegger, et al. 2020. COOL CHIPS 2020. Conference paper.
Constrained deep neural network architecture search for IoT devices accounting for hardware calibration. Florian Scheidegger, Luca Benini, et al. 2019. NeurIPS 2019. Conference paper.
FloatX: A C++ library for customized floating-point arithmetic. Goran Flegar, Florian Scheidegger, et al. 2019. ACM TOMS. Paper.
TAPAS: Train-less accuracy predictor for architecture search. Roxana Istrate, Florian Scheidegger, et al. 2019. AAAI 2019. Conference paper.
NeuNetS: An Automated Synthesis Engine for Neural Network Design. Atin Sood, Benjamin Elder, et al. 2019. arXiv. Paper.
A scalable iterative dense linear system solver for multiple right-hand sides in data analytics. Vassilis Kalantzis, Cristiano Malossi, et al. 2018. Parallel Computing. Paper.
The transprecision computing paradigm: Concept, design, and applications. Cristiano Malossi, Michael Schaffner, et al. 2018. DATE 2018. Conference paper.