Cerebras REAP Collection Sparse MoE models compressed using the REAP (Router-weighted Expert Activation Pruning) method • 14 items • Updated 3 days ago • 35
NovaFlow: Zero-Shot Manipulation via Actionable Flow from Generated Videos Paper • 2510.08568 • Published about 1 month ago • 1
Ming-UniVision: Joint Image Understanding and Generation with a Unified Continuous Tokenizer Paper • 2510.06590 • Published Oct 8 • 70
Article 5 Things You Need to Know About Moonshot AI and Kimi K2, the New #1 model on the Hub By fdaudens and 1 other • Jul 15 • 24
high-quality Chinese training datasets Collection A suite of high-quality Chinese datasets used for pretraining, fine-tuning, or preference alignment, along with the models trained on them. • 13 items • Updated May 22 • 22
Unsloth 4-bit Dynamic Quants Collection Unsloth's Dynamic 4-bit Quants selectively skip quantizing certain parameters, greatly improving accuracy while only using <10% more VRAM than BnB 4-bit • 28 items • Updated 9 days ago • 89
Qwen3 Collection Qwen's new Qwen3 models. In Unsloth Dynamic 2.0, GGUF, 4-bit and 16-bit Safetensor formats. Includes 128K Context Length variants. • 79 items • Updated 9 days ago • 228
SmolVLM: Redefining small and efficient multimodal models Paper • 2504.05299 • Published Apr 7 • 200
Qwen2.5-Math Collection Math-specific model series based on Qwen2.5 • 11 items • Updated Jul 21 • 87