cotalks.dev

Effortless Scalability: Orchestrating Large Language Model Inference with Kubernetes

(link)