The Station: An Open-World Environment for AI-Driven Discovery Paper • 2511.06309 • Published 3 days ago • 31
OmniFlatten: An End-to-end GPT Model for Seamless Voice Conversation Paper • 2410.17799 • Published Oct 23, 2024 • 5
LlamaFactory: Unified Efficient Fine-Tuning of 100+ Language Models Paper • 2403.13372 • Published Mar 20, 2024 • 168
view article Article Unlocking Healthcare AI: I'm Releasing State-of-the-Art Medical Models for Free. Forever. By MaziyarPanahi • Jul 16 • 144
Running on CPU Upgrade 2.06k 2.06k The Smol Training Playbook: The Secrets to Building World-Class LLMs 📝 Display loss curves for training LLMs
Agent Lightning: Train ANY AI Agents with Reinforcement Learning Paper • 2508.03680 • Published Aug 5 • 114
The Era of Agentic Organization: Learning to Organize with Language Models Paper • 2510.26658 • Published 13 days ago • 23
Exploring Conditions for Diffusion models in Robotic Control Paper • 2510.15510 • Published 26 days ago • 39
AgentFold: Long-Horizon Web Agents with Proactive Context Management Paper • 2510.24699 • Published 15 days ago • 65
Multimodal Spatial Reasoning in the Large Model Era: A Survey and Benchmarks Paper • 2510.25760 • Published 14 days ago • 16
ODesign: A World Model for Biomolecular Interaction Design Paper • 2510.22304 • Published 18 days ago • 22
The Tool Decathlon: Benchmarking Language Agents for Diverse, Realistic, and Long-Horizon Task Execution Paper • 2510.25726 • Published 14 days ago • 44
Video-Thinker: Sparking "Thinking with Videos" via Reinforcement Learning Paper • 2510.23473 • Published 16 days ago • 83
JanusCoder: Towards a Foundational Visual-Programmatic Interface for Code Intelligence Paper • 2510.23538 • Published 16 days ago • 95
Kimi Linear: An Expressive, Efficient Attention Architecture Paper • 2510.26692 • Published 13 days ago • 102