wangrui's picture

wangrui

varuy322

·

varuy322

AI & ML interests

None yet

Recent Activity

upvoted an article about 21 hours ago

The 1 Billion Token Challenge: Finding the Perfect Pre-training Mix

liked a dataset 2 days ago

nvidia/Nemotron-VLM-Dataset-v2

liked a dataset 11 days ago

open-r1/codeforces-cots

View all activity

Organizations

None yet

upvoted an article about 21 hours ago

Article

The 1 Billion Token Challenge: Finding the Perfect Pre-training Mix

By

•

5 days ago

• 29

upvoted a paper 19 days ago

Robot Learning: A Tutorial

Paper • 2510.12403 • Published 25 days ago • 103

upvoted a collection 23 days ago

Ferret

A framework for training LLM agents via RL with advanced search capability: https://github.com/Tree-Shu-Zhao/ferret • 7 items • Updated 18 days ago • 1

upvoted 2 papers about 1 month ago

MiniCPM-V 4.5: Cooking Efficient MLLMs via Architecture, Data, and Training Recipe

Paper • 2509.18154 • Published Sep 16 • 49

ARE: Scaling Up Agent Environments and Evaluations

Paper • 2509.17158 • Published Sep 21 • 34

upvoted a collection about 2 months ago

ZeroSearch_Policy_Google_V2

6 items • Updated Sep 7 • 5

upvoted a paper about 2 months ago

Pref-GRPO: Pairwise Preference Reward-based GRPO for Stable Text-to-Image Reinforcement Learning

Paper • 2508.20751 • Published Aug 28 • 89

upvoted an article 2 months ago

Article

nanoVLM: The simplest repository to train your VLM in pure PyTorch

May 21

• 228

upvoted a paper 2 months ago

UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning

Paper • 2509.02544 • Published Sep 2 • 123

upvoted a collection 2 months ago

InternVL3.5

This collection includes all released checkpoints of InternVL3.5, covering different training stages (e.g., Pretraining, SFT, MPO, Cascade RL). • 54 items • Updated Sep 28 • 102

upvoted a paper 2 months ago

Perception, Reason, Think, and Plan: A Survey on Large Multimodal Reasoning Models

Paper • 2505.04921 • Published May 8 • 185

upvoted a collection 2 months ago

Seed-X

A powerful open-source multilingual translation language model series, including instruction and reasoning models. • 8 items • Updated Aug 22 • 65

upvoted a paper 3 months ago

Intern-S1: A Scientific Multimodal Foundation Model

Paper • 2508.15763 • Published Aug 21 • 255

upvoted 2 collections 3 months ago

Intern-S1

7 items • Updated Aug 22 • 25

agent

210 items • Updated 1 day ago • 15

upvoted 5 papers 3 months ago

Large Language Model Agent: A Survey on Methodology, Applications and Challenges

Paper • 2503.21460 • Published Mar 27 • 83

Agent KB: Leveraging Cross-Domain Experience for Agentic Problem Solving

Paper • 2507.06229 • Published Jul 8 • 75

Cognitive Kernel-Pro: A Framework for Deep Research Agents and Agent Foundation Models Training

Paper • 2508.00414 • Published Aug 1 • 91

A Survey of Context Engineering for Large Language Models

Paper • 2507.13334 • Published Jul 17 • 258

Agentic Reinforced Policy Optimization

Paper • 2507.19849 • Published Jul 26 • 156