view article Article Transformers v5: Simple model definitions powering the AI ecosystem +2 Dec 1, 2025 • 260
view article Article DeepMath: A lightweight math reasoning Agent with smolagents +1 28 days ago • 31
Stronger Together: On-Policy Reinforcement Learning for Collaborative LLMs Paper • 2510.11062 • Published Oct 13, 2025 • 28
Reflect, Retry, Reward: Self-Improving LLMs via Reinforcement Learning Paper • 2505.24726 • Published May 30, 2025 • 277
view article Article Building Your Own AI Document Dream Team: A Generic Multi-Agent System Apr 8, 2025 • 6
view article Article Fine-Tune Meta Llama 3.2-Vision-Instruct Multimodal LLM on Intel Accelerators Jan 28, 2025 • 8
view article Article Model Card Generator Interface: Crafting Clear Insights into AI Models Sep 27, 2024 • 4
view article Article Fine Tuning a LLM Using Kubernetes with Intel® Gaudi® Accelerator Sep 9, 2024 • 7
view article Article Llama-3.1-Storm-8B: Improved SLM with Self-Curation + Model Merging Aug 19, 2024 • 79
view article Article Fine Tuning a LLM Using Kubernetes with Intel® Xeon® Scalable Processors Apr 24, 2024 • 7