Scaling Language-Centric Omnimodal Representation Learning Paper • 2510.11693 • Published about 1 month ago • 97
Discrete Diffusion for Reflective Vision-Language-Action Models in Autonomous Driving Paper • 2509.20109 • Published Sep 24 • 3
StyleBench: Evaluating thinking styles in Large Language Models Paper • 2509.20868 • Published Sep 25 • 3
When Judgment Becomes Noise: How Design Failures in LLM Judge Benchmarks Silently Undermine Validity Paper • 2509.20293 • Published Sep 24 • 7
Understanding the Thinking Process of Reasoning Models: A Perspective from Schoenfeld's Episode Theory Paper • 2509.14662 • Published Sep 18 • 13
CE-GPPO: Controlling Entropy via Gradient-Preserving Clipping Policy Optimization in Reinforcement Learning Paper • 2509.20712 • Published Sep 25 • 19
TrustJudge: Inconsistencies of LLM-as-a-Judge and How to Alleviate Them Paper • 2509.21117 • Published Sep 25 • 29
MMR1: Enhancing Multimodal Reasoning with Variance-Aware Sampling and Open Resources Paper • 2509.21268 • Published Sep 25 • 101