cotalks.dev
Login
Lightning Talk: Decoding and Taming the Costs of Serving Large Language Models - Yuan Chen, NVIDIA
(link)
Event:
KubeCon + CloudNativeCon Europe 2024
Channel:
CNCF [Cloud Native Computing Foundation]
unsorted
todo
resolved
completed
canceled
submit