Is Finer Better? The Limits of Microscaling Formats in Large Language ModelsAndrea FasoliMonodeep Karet al.2026ICLR 2026Conference paper
Spyre: An inference-optimized scalable AI accelerator for enterprise workloadsMatt CohenMonodeep Karet al.2026ISSCC 2026Conference paper