ScaleCom: Scalable Sparsified Gradient Compression for Communication-Efficient Distributed Training Chia-Yu ChenJiamin Niet al.2020NeurIPS 2020
Efficient AI System Design with Cross-Layer Approximate ComputingSwagath VenkataramaniXiao Sunet al.2020Proceedings of the IEEE
A 3.0 TFLOPS 0.62V Scalable Processor Core for High Compute Utilization AI Training and InferenceJinwook OhSae Kyu Leeet al.2020VLSI Circuits 2020
Hybrid 8-bit floating point (HFP8) training and inference for deep neural networksXiao SunJungwook Choiet al.2019NeurIPS 2019
DLFloat: A 16-b Floating Point Format Designed for Deep Learning Training and InferenceAnkur AgrawalBruce Fleischeret al.2019ARITH 2019
Accumulation bit-width scaling for ultra-low precision training of deep networksCharbel SakrNaigang Wanget al.2019ICLR 2019
Innovate Practices on CyberSecurity of Hardware Semiconductor DevicesAlfred L. CrouchPeter Levinet al.2019VTS 2019
Training deep neural networks with 8-bit floating point numbersNaigang WangJungwook Choiet al.2018NeurIPS 2018
A Scalable Multi-TeraOPS Core for AI Training and InferenceSunil ShuklaBruce Fleischeret al.2018IEEE SSC-L
26 Aug 2019US10396665On-chip Dc-dc Power Converters With Fully Integrated Gan Power Switches, Silicon Cmos Transistors And Magnetic Inductors
26 Aug 2019US10396144Magnetic Inductor Stack Including Magnetic Materials Having Multiple Permeabilities
19 Aug 2019US10389356Resonant Virtual Supply Booster For Synchronous Logic Circuits And Other Circuits With Use Of On-chip Integrated Magnetic Inductor
15 Jul 2019US10355070Magnetic Inductor Stack Including Magnetic Materials Having Multiple Permeabilities
06 May 2019US10283249Magnetic Material Stack And Magnetic Inductor Structure Fabricated With Surface Roughness Control
22 Apr 2019US10270443Resonant Virtual Supply Booster For Synchronous Logic Circuits And Other Circuits With Use Of On-chip Integrated Magnetic Inductor
KEKaoutar El MaghraouiPrincipal Research Scientist and Manager, AIU Spyre Model Enablement, AI Hardware Center