Why a decades old architecture decision is impeding the power of AI computing | Explainer | Peter Hess | 10 Feb 2025 | AI, AI Hardware, Hardware Technology, Semiconductors
Improving Hugging Face training efficiency through packing with flash attention | Technical note | Rhui Dih Lee, Arthur Zucker, Achintya Kundu, Laura Wynter, Raghu Ganti, and Mayank Mishra | 28 Aug 2024 | AI