Kimi Linear: An Expressive, Efficient Attention Architecture Paper • 2510.26692 • Published 21 days ago • 107
The Strong Lottery Ticket Hypothesis for Multi-Head Attention Mechanisms Paper • 2511.04217 • Published 14 days ago • 15