What Layers When: Learning to Skip Compute in LLMs with Residual Gates • Paper • 2510.13876 • Published 27 days ago • 11
KV Cache Steering for Inducing Reasoning in Small Language Models • Paper • 2507.08799 • Published Jul 11 • 40