Adv-GRPO - a benzweijia Collection

benzweijia 's Collections

Adv-GRPO

updated 8 days ago

The Image as Its Own Reward: Reinforcement Learning with Adversarial Reward for Image Generation

benzweijia/Adv-GRPO

Updated 13 days ago • 24 • 3
The Image as Its Own Reward: Reinforcement Learning with Adversarial Reward for Image Generation

Paper • 2511.20256 • Published 10 days ago • 26
Runtime error

Adv GRPO

📊

an RL method using adversarial reward models
Running on Zero

Adv-GRPO DINO

👁

Generate images from text prompts
benzweijia/QWen_Image_PickScore

Viewer • Updated 8 days ago • 128 • 118