view article Article Fine-Tuning Your First Large Language Model (LLM) with PyTorch and Hugging Face Feb 11 • 93
view article Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM +2 Mar 12 • 479
view article Article From PyTorch DDP to Accelerate to Trainer, mastery of distributed training with ease Oct 21, 2022 • 42
view article Article Efficient LLM Pretraining: Packed Sequences and Masked Attention Oct 7, 2024 • 63
view article Article DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge Feb 7 • 263