arxiv:2509.25779
Yanbin Jiang
jybsuper
AI & ML interests
None yet
Recent Activity
authored
a paper
about 1 month ago
Planner-R1: Reward Shaping Enables Efficient Agentic RL with Smaller
LLMs
upvoted
a
paper
about 1 month ago
Planner-R1: Reward Shaping Enables Efficient Agentic RL with Smaller
LLMs
updated
a dataset
4 months ago
jybsuper/hermes-function-calling-thinking-V1-openai-format
Organizations
None yet