DRIVE: Data Curation Best Practices for Reinforcement Learning with Verifiable Reward in Competitive Code Generation Paper • 2511.06307 • Published 7 days ago • 48
MANZANO: A Simple and Scalable Unified Multimodal Model with a Hybrid Vision Tokenizer Paper • 2509.16197 • Published Sep 19 • 54
naver-hyperclovax/HyperCLOVAX-SEED-Think-14B Text Generation • 15B • Updated Aug 27 • 60.5k • 93
Promptomatix: An Automatic Prompt Optimization Framework for Large Language Models Paper • 2507.14241 • Published Jul 17 • 17
VisionThink: Smart and Efficient Vision Language Model via Reinforcement Learning Paper • 2507.13348 • Published Jul 17 • 75
SynLogic Collection Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond • 5 items • Updated Jun 3 • 13
MiniMax-M1 Collection MiniMax-M1, the world's first open-weight, large-scale hybrid-attention reasoning model. • 6 items • Updated 26 days ago • 116