Running on CPU Upgrade 1.32k 1.32k The Smol Training Playbook: The Secrets to Building World-Class LLMs 📝
view article Article Granite 4.0 Nano: Just how small can you go? By ibm-granite and 1 other • 7 days ago • 96
gpt-oss-safeguard Collection gpt-oss-safeguard-120b and gpt-oss-safeguard-20b are safety reasoning models built-upon gpt-oss • 2 items • Updated 6 days ago • 55
SPLADE-Tiny-MSMARCO Collection SPLADE sparse retrieval models based on BERT-Tiny (4M) and BERT-Mini (11M) distilled from a Cross-Encoder on the MSMARCO dataset • 6 items • Updated 13 days ago • 1
PaddleOCR-VL: Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model Paper • 2510.14528 • Published 19 days ago • 80