Principal Platform Engineer. Building cost- and SLO-aware routing for heterogeneous LLM inference (llm-d, vLLM, Kubernetes).
- Microsoft
- Seattle
- (UTC -07:00)
- https://sorayapartow.vercel.app/
- in/soraya-partow
Popular repositories
distributed-ml-training (Public)
Web-based distributed machine learning training system with real-time visualization and multi-GPU support
Python
