Rin's picture

18 1

Rin

hu5enpai

·

AI & ML interests

None yet

Recent Activity

new activity 12 days ago

PaddlePaddle/PaddleOCR-VL:ms-swift has supported inference, deployment, and fine-tuning of the PaddleOCR-VL model.

upvoted a paper about 2 months ago

Parallel-R1: Towards Parallel Thinking via Reinforcement Learning

commented on a paper about 2 months ago

On-Policy RL Meets Off-Policy Experts: Harmonizing Supervised Fine-Tuning and Reinforcement Learning via Dynamic Weighting

View all activity

Organizations

New activity in PaddlePaddle/PaddleOCR-VL 12 days ago

ms-swift has supported inference, deployment, and fine-tuning of the PaddleOCR-VL model.

#42 opened 12 days ago by

commented a paper about 2 months ago

On-Policy RL Meets Off-Policy Experts: Harmonizing Supervised Fine-Tuning and Reinforcement Learning via Dynamic Weighting

Paper • 2508.11408 • Published Aug 15 • 8 •

commented 3 papers 3 months ago

On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification

Paper • 2508.05629 • Published Aug 7 • 178 •

DeepEyes: Incentivizing "Thinking with Images" via Reinforcement Learning

Paper • 2505.14362 • Published May 20 • 2 •

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

Paper • 2506.01939 • Published Jun 2 • 185 •

New activity in Qwen/Qwen3-Coder-480B-A35B-Instruct 3 months ago

👍👍

#19 opened 3 months ago by

New activity in ChenShawn/DeepEyes-Datasets-47k 4 months ago

Unable to load the dataset

#2 opened 4 months ago by

New activity in microsoft/Florence-2-large-ft over 1 year ago

Swift now supports inference, training, and deployment of the Florence models.

#14 opened over 1 year ago by

New activity in microsoft/Florence-2-large over 1 year ago

How to Finetune?

#19 opened over 1 year ago by

Fix incorrect bos_token, eos_token, and pad_token ids in config.json

#17 opened over 1 year ago by

New activity in liuhaotian/LLaVA-Instruct-150K over 1 year ago

Unable to load dataset.

#10 opened almost 2 years ago by

New activity in OpenGVLab/InternVL-Chat-V1-5 over 1 year ago

Swift now supports inference, training of InternVL-Chat-V1-5

#11 opened over 1 year ago by