Kimi Linear: An Expressive, Efficient Attention Architecture Paper • 2510.26692 • Published 14 days ago • 102
Hyperband: A Novel Bandit-Based Approach to Hyperparameter Optimization Paper • 1603.06560 • Published Mar 21, 2016 • 1
A Probabilistic Inference Approach to Inference-Time Scaling of LLMs using Particle-Based Monte Carlo Methods Paper • 2502.01618 • Published Feb 3 • 10