nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-FP8 Text Generation • 32B • Updated about 21 hours ago • 464k • 214
nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-Base-BF16 Text Generation • 32B • Updated 17 days ago • 7.33k • 82
Front-Loading Reasoning: The Synergy between Pretraining and Post-Training Data Paper • 2510.03264 • Published Sep 26, 2025 • 23