benzweijia/Adv-GRPO
Updated
•
24
•
3
The Image as Its Own Reward: Reinforcement Learning with Adversarial Reward for Image Generation
an RL method using adversarial reward models
Generate images from text prompts