The Forgetting Wall in Video and World Models
To stay consistent over a long video, a model must remember its own past. In a video diffusion model, that memory is the KV cache, which balloons far faster than it does for text. We call this the Forgetting Wall. We survey the partial answers and point to one thing we tried: QuantVideoGen.
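To give a feel for why the cache grows so much faster for video, here is a back-of-the-envelope sketch. It uses the standard KV cache size formula (keys plus values, per layer, per token). The model shape and the tokens-per-frame figure are illustrative assumptions, not measurements from any particular model.

```python
def kv_cache_bytes(num_tokens, num_layers, num_kv_heads, head_dim, bytes_per_value=2):
    """Bytes needed to cache keys and values for one sequence of num_tokens tokens."""
    # Factor of 2: one key tensor and one value tensor per layer.
    return 2 * num_layers * num_kv_heads * head_dim * num_tokens * bytes_per_value


# Hypothetical transformer shape, assumed only for illustration.
LAYERS, KV_HEADS, HEAD_DIM = 32, 8, 128

# Assumption: a text model adds 1 token per step,
# while a video model adds about 1024 latent tokens per frame.
TEXT_TOKENS_PER_STEP = 1
VIDEO_TOKENS_PER_FRAME = 1024

STEPS = 1000  # 1000 text tokens vs. 1000 video frames

text_bytes = kv_cache_bytes(STEPS * TEXT_TOKENS_PER_STEP, LAYERS, KV_HEADS, HEAD_DIM)
video_bytes = kv_cache_bytes(STEPS * VIDEO_TOKENS_PER_FRAME, LAYERS, KV_HEADS, HEAD_DIM)

print(f"text cache:  {text_bytes / 2**20:.1f} MiB")
print(f"video cache: {video_bytes / 2**30:.1f} GiB")
print(f"ratio: {video_bytes // text_bytes}x")
```

Under these assumptions the cache grows linearly in both cases, but each video step adds as much memory as a thousand text steps. Storing cached values in fewer bits lowers `bytes_per_value`, which is one of the levers available for shrinking the cache.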