The Era of Agentic Organization: Learning to Organize with Language Models Paper • 2510.26658 • Published 26 days ago • 26
Supervised Reinforcement Learning: From Expert Trajectories to Step-wise Reasoning Paper • 2510.25992 • Published 26 days ago • 43
The End of Manual Decoding: Towards Truly End-to-End Language Models Paper • 2510.26697 • Published 26 days ago • 113