Cloud-native Workflow Scheduling using a Hybrid Priority Rule, Dynamic Resource Allocation, and Dynamic Task PartitionJungeun ShinDiana Arroyoet al.2024SoCC 2024
Towards Pareto Optimal Throughput in Small Language Model ServingPol G. RecasensYue Zhuet al.2024EuroSys 2024
AWARE: Automate Workload Autoscaling with Reinforcement Learning in Production Cloud SystemsHaoran QiuWeichao Maoet al.2023USENIX ATC 2023
A Carbon-aware Workload Dispatcher in Cloud Computing SystemsTayebeh BahreiniAsser Tantawiet al.2023CLOUD 2023
Cloud-native Workflow Scheduling using a Hybrid Priority Rule and Dynamic Task ParallelismJungeun ShinDiana Arroyoet al.2022SoCC 2022
Floki: A Proactive Data Forwarding System for Direct Inter-Function Communication for Serverless WorkflowsAnna Maria NestorovJosep Berralet al.2022Middleware 2022
Proactive Container Auto-scaling for Cloud Native Machine Learning ServicesDavid BuchacaJosep Berralet al.2020CLOUD 2020
AI4DL: Mining Behaviors of Deep Learning Workloads for Resource ManagementJosep BerralChen Wanget al.2020HotCloud 2020
Resource Profile Advisor for Containers in Cognitive PlatformMehmet F. AktasChen Wanget al.2018SoCC 2018