2 63 169

wangrui

varuy322

varuy322

AI & ML interests

None yet

Recent Activity

upvoted an article about 8 hours ago

The 1 Billion Token Challenge: Finding the Perfect Pre-training Mix

liked a dataset 1 day ago

nvidia/Nemotron-VLM-Dataset-v2

liked a dataset 10 days ago

open-r1/codeforces-cots

View all activity

Organizations

None yet

upvoted an article about 8 hours ago

Article

The 1 Billion Token Challenge: Finding the Perfect Pre-training Mix

•

4 days ago

• 28

liked a dataset 1 day ago

nvidia/Nemotron-VLM-Dataset-v2

Viewer • Updated 2 days ago • 4.58M • 5.61k • 53

liked a dataset 10 days ago

open-r1/codeforces-cots

Viewer • Updated Mar 28 • 254k • 1.22k • 191

upvoted a paper 18 days ago

Robot Learning: A Tutorial

Paper • 2510.12403 • Published 24 days ago • 102

liked a dataset 18 days ago

HuggingFaceFW/finepdfs

Viewer • Updated Sep 8 • 475M • 60.4k • 652

upvoted a collection 22 days ago

Ferret

Collection

A framework for training LLM agents via RL with advanced search capability: https://github.com/Tree-Shu-Zhao/ferret • 7 items • Updated 17 days ago • 1

liked a model 22 days ago

nvidia/omni-embed-nemotron-3b

Feature Extraction • 5B • Updated 29 days ago • 41.2k • 62

liked a dataset 23 days ago

OpenGVLab/InternVL-Chat-V1-2-SFT-Data

Viewer • Updated Sep 20, 2024 • 573k • 777 • 29

liked 2 models 23 days ago

Qwen/Qwen3-VL-4B-Instruct

Image-Text-to-Text • 4B • Updated 23 days ago • 488k • 221

internlm/CapRL-3B

Image-Text-to-Text • 4B • Updated 16 days ago • 1.25k • 43

liked a dataset 25 days ago

zai-org/DeepDive

Viewer • Updated Sep 17 • 4.11k • 920 • 13

liked a model about 1 month ago

Qwen/Qwen3-Omni-30B-A3B-Instruct

Any-to-Any • 35B • Updated Sep 22 • 298k • 703

liked a dataset about 1 month ago

google/simpleqa-verified

Viewer • Updated Sep 22 • 1k • 1.13k • 19

upvoted 2 papers about 1 month ago

MiniCPM-V 4.5: Cooking Efficient MLLMs via Architecture, Data, and Training Recipe

Paper • 2509.18154 • Published Sep 16 • 49

ARE: Scaling Up Agent Environments and Evaluations

Paper • 2509.17158 • Published Sep 21 • 34

upvoted a collection about 2 months ago

ZeroSearch_Policy_Google_V2

Collection

6 items • Updated Sep 7 • 5

liked a dataset about 2 months ago

openbmb/RLAIF-V-Dataset

Preview • Updated 24 days ago • 1.28k • 194

liked a model about 2 months ago

Alibaba-NLP/Tongyi-DeepResearch-30B-A3B

Text Generation • 31B • Updated 28 days ago • 12.4k • 752

upvoted a paper about 2 months ago

Pref-GRPO: Pairwise Preference Reward-based GRPO for Stable Text-to-Image Reinforcement Learning

Paper • 2508.20751 • Published Aug 28 • 89

liked a dataset 2 months ago

jupyter-agent/jupyter-agent-dataset

Viewer • Updated Sep 10 • 95.8k • 1.57k • 149

wangrui

AI & ML interests

Recent Activity

Organizations

varuy322's activity

The 1 Billion Token Challenge: Finding the Perfect Pre-training Mix