cotalks.dev

Streaming Attention Approximation via Discrepancy Theory

(link)
note