Unleashing the Power of DRA (Dynamic Resource Allocation) for Just-in-Time GPU Slicing. Abhishek Malvankar, Olivier Tardieu. 2024. KubeCon EU 2024
Timely and Efficient AI Insights on EHR: System Design. Partha Suryanarayanan, Edward A. Epstein, et al. 2020. AMIA Annual Symposium 2020
Incremental GPU Slicing in Action. Abhishek Malvankar, Olivier Tardieu. 2024. CNCF-hosted Co-located Events North America 2024
Training Foundation Model Workloads on Kubernetes at Scale With MCAD. Olivier Tardieu, Abhishek Malvankar. 2023. K8SAIHPCDAY 2023