deepseek-ai/DeepSeek-V3.2-Speciale Text Generation • 685B • Updated about 1 month ago • 25.4k • 627
view article Article Rank-Stabilized LoRA: Unlocking the Potential of LoRA Fine-Tuning Feb 20, 2024 • 30
HuggingFaceH4/zephyr-7b-alpha Text Generation • 7B • Updated Oct 16, 2024 • 1.83k • • 1.12k
Running on CPU Upgrade Featured 2.75k The Smol Training Playbook 📚 2.75k The secrets to building world-class LLMs
mattshumer/Reflection-Llama-3.1-70B Text Generation • 71B • Updated Sep 24, 2024 • 387 • 1.71k
nvidia/OpenReasoning-Nemotron-32B Text Generation • 33B • Updated Sep 16, 2025 • 308 • • 121
view article Article OpenReasoning-Nemotron: A Family of State-of-the-Art Distilled Reasoning Models Jul 18, 2025 • 50