SWA
KV Cache Matrix
\( S_{t-1} \)
Update
INPUT
t=1
\( U_t \)