bigcode/the-stack-v2 Viewer • Updated Apr 23, 2024 • 5.45B • 6.02k • 430
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published Jan 22 • 429
Qwen/Qwen3-Coder-480B-A35B-Instruct Text Generation • 480B • Updated Aug 21 • 196k • 1.25k