Efficient Long-Horizon GUI Agents via Training-Free KV Cache Compression
The paper proposes ST-Lite, a training-free KV cache compression framework that leverages the uniform high-sparsity of GUI attention patterns through a dual-branch scoring policy of spatial saliency and trajectory-aware semantic gating, achieving significant decoding acceleration with minimal performance loss in long-horizon GUI agents.