Towards Pareto Optimal Throughput in Small Language Model Serving
- Pol G. Recasens
- Yue Zhu
- et al.
- 2024
- EuroSys 2024
Alaa Youssef is a Senior Manager and Master Inventor at IBM T.J. Watson Research Center. He leads the cloud native AI platform research team, contributing to IBM’s Watsonx and OpenShift AI platform. His research interests are in hybrid cloud, cloud native AI and HPC, resource management, sustainable and trusted distributed cloud computing. Dr Youssef has held multiple technical and management positions in IBM Research, and in IBM Software Services in multiple geographies. He received his PhD in Computer Science from Old Dominion University, Virginia, and his MSc in Computer Engineering from Alexandria University, Egypt.