Effective cluster management for large scale AI and GPUs: Challenges and opportunitiesClaudia MisaleDavid Grove2025Cloud Native + Kubernetes AI Day 2025Talk
Fit-to-Serve: How a New DRA Capability for Dynamic Device Sharing Fits into Distributed LLM ServingSunyanan ChoochotkaewTatsuhiro Chiba2025Cloud Native + Kubernetes AI Day 2025Talk