The End of Manual Decoding: Towards Truly End-to-End Language Models Paper • 2510.26697 • Published 9 days ago • 113
Video-Thinker: Sparking "Thinking with Videos" via Reinforcement Learning Paper • 2510.23473 • Published 12 days ago • 83
view article Article Granite 4.0 Nano: Just how small can you go? By ibm-granite and 1 other • 11 days ago • 106
ReCode: Unify Plan and Action for Universal Granularity Control Paper • 2510.23564 • Published 11 days ago • 118
Reasoning with Sampling: Your Base Model is Smarter Than You Think Paper • 2510.14901 • Published 22 days ago • 45
Every Attention Matters: An Efficient Hybrid Architecture for Long-Context Reasoning Paper • 2510.19338 • Published 17 days ago • 110
Unified Reinforcement and Imitation Learning for Vision-Language Models Paper • 2510.19307 • Published 17 days ago • 26
LoongRL:Reinforcement Learning for Advanced Reasoning over Long Contexts Paper • 2510.19363 • Published 17 days ago • 59
view article Article Google Cloud C4 Brings a 70% TCO improvement on GPT OSS with Intel and Hugging Face 23 days ago • 15
Attention Is All You Need for KV Cache in Diffusion LLMs Paper • 2510.14973 • Published 22 days ago • 37
Information Gain-based Policy Optimization: A Simple and Effective Approach for Multi-Turn LLM Agents Paper • 2510.14967 • Published 22 days ago • 32