Ruize Zhang's picture

7 3

Ruize Zhang

Ruize-Zhang

·

zrz-sh

AI & ML interests

Interested in RL

Recent Activity

liked a dataset 3 days ago

gaia-benchmark/GAIA

upvoted a paper 12 days ago

π_RL: Online RL Fine-tuning for Flow-based Vision-Language-Action Models

authored a paper about 1 month ago

JuggleRL: Mastering Ball Juggling with a Quadrotor via Deep Reinforcement Learning

View all activity

Organizations

None yet

liked a dataset 3 days ago

gaia-benchmark/GAIA

Viewer • Updated 18 days ago • 932 • 9.36k • 483

upvoted a paper 12 days ago

π_RL: Online RL Fine-tuning for Flow-based Vision-Language-Action Models

Paper • 2510.25889 • Published 17 days ago • 60

authored 4 papers about 1 month ago

JuggleRL: Mastering Ball Juggling with a Quadrotor via Deep Reinforcement Learning

Paper • 2509.24892 • Published Sep 29

Mastering Multi-Drone Volleyball through Hierarchical Co-Self-Play Reinforcement Learning

Paper • 2505.04317 • Published May 7 • 1

VolleyBots: A Testbed for Multi-Drone Volleyball Game Combining Motion Control and Strategic Play

Paper • 2502.01932 • Published Feb 4

OmniDrones: An Efficient and Flexible Platform for Reinforcement Learning in Drone Control

Paper • 2309.12825 • Published Sep 22, 2023

upvoted 3 papers about 1 month ago

Mastering Multi-Drone Volleyball through Hierarchical Co-Self-Play Reinforcement Learning

Paper • 2505.04317 • Published May 7 • 1

Cache-to-Cache: Direct Semantic Communication Between Large Language Models

Paper • 2510.03215 • Published Oct 3 • 96

RLinf-VLA: A Unified and Efficient Framework for VLA+RL Training

Paper • 2510.06710 • Published Oct 8 • 38

liked 2 datasets 4 months ago

walledai/HarmBench

Viewer • Updated Jul 31, 2024 • 400 • 3.2k • 23

THU-KEG/IFBench

Viewer • Updated Mar 7 • 444 • 224 • 9

upvoted a paper 5 months ago

Multiverse: Your Language Models Secretly Decide How to Parallelize and Merge Generation

Paper • 2506.09991 • Published Jun 11 • 55

authored a paper 5 months ago

A Survey on Self-play Methods in Reinforcement Learning

Paper • 2408.01072 • Published Aug 2, 2024 • 2

upvoted 2 papers 5 months ago

A Survey on Self-play Methods in Reinforcement Learning

Paper • 2408.01072 • Published Aug 2, 2024 • 2

VS-Bench: Evaluating VLMs for Strategic Reasoning and Decision-Making in Multi-Agent Environments

Paper • 2506.02387 • Published Jun 3 • 58