view article Article Transformers v5: Simple model definitions powering the AI ecosystem +2 11 days ago • 234
Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer Paper • 2511.22699 • Published 14 days ago • 183
PixelDiT: Pixel Diffusion Transformers for Image Generation Paper • 2511.20645 • Published 16 days ago • 26
RELIC: Interactive Video World Model with Long-Horizon Memory Paper • 2512.04040 • Published 8 days ago • 23
Rethinking JEPA: Compute-Efficient Video SSL with Frozen Teachers Paper • 2509.24317 • Published Sep 29 • 10
UniAnimate-DiT: Human Image Animation with Large-Scale Video Diffusion Transformer Paper • 2504.11289 • Published Apr 15 • 2
VideoJAM: Joint Appearance-Motion Representations for Enhanced Motion Generation in Video Models Paper • 2502.02492 • Published Feb 4 • 66
Llama 3.2 Collection This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated Dec 6, 2024 • 647