cotalks.dev

Code Live with Google Cloud - Episode 2: Serving a really large model with Leader Worker Set (LWS)

(link)
note