Running on CPU Upgrade 1.67k The Smol Training Playbook: The Secrets to Building World-Class LLMs
Article Granite 4.0 Nano: Just how small can you go? By ibm-granite and 1 other • 10 days ago • 106
MobileLLM-R1 Collection MobileLLM-R1, a series of sub-billion parameter reasoning models • 7 items • Updated 25 days ago • 21
Discrete-Time Hybrid Automata Learning: Legged Locomotion Meets Skateboarding Paper • 2503.01842 • Published Mar 3 • 3
Article You could have designed state of the art positional encoding Nov 25, 2024 • 390
Running 3.45k The Ultra-Scale Playbook • The ultimate guide to training LLM on large GPU Clusters
microsoft/Phi-3.5-vision-instruct Image-Text-to-Text • 4B • Updated Sep 26, 2024 • 379k • 709
VITA: Towards Open-Source Interactive Omni Multimodal LLM Paper • 2408.05211 • Published Aug 9, 2024 • 50