Asking like Socrates: Socrates helps VLMs understand remote sensing images Paper • 2511.22396 • Published 14 days ago • 4
Entropy Ratio Clipping as a Soft Global Constraint for Stable Reinforcement Learning Paper • 2512.05591 • Published 6 days ago • 16
RealGen: Photorealistic Text-to-Image Generation via Detector-Guided Rewards Paper • 2512.00473 • Published 12 days ago • 19
SPARK: Stepwise Process-Aware Rewards for Reference-Free Reinforcement Learning Paper • 2512.03244 • Published 9 days ago • 14