Less is More: Recursive Reasoning with Tiny Networks Paper • 2510.04871 • Published Oct 6 • 470
Cache-to-Cache: Direct Semantic Communication Between Large Language Models Paper • 2510.03215 • Published Oct 3 • 96
Running on Zero • 18 • Moondream3 Preview • Process images and text to answer questions, caption, detect objects, and find points
Running • 3.46k • The Ultra-Scale Playbook • The ultimate guide to training LLMs on large GPU clusters