35 22 6

yubo

ubowang

AI & ML interests

None yet

Recent Activity

upvoted an article 10 days ago

Aligning to What? Rethinking Agent Generalization in MiniMax M2

upvoted a paper 12 days ago

VisCoder2: Building Multi-Language Visualization Coding Agents

updated a dataset 15 days ago

TIGER-Lab/MMLU-Pro

View all activity

Organizations

upvoted an article 10 days ago

Article

Aligning to What? Rethinking Agent Generalization in MiniMax M2

•

10 days ago

• 22

upvoted a paper 12 days ago

VisCoder2: Building Multi-Language Visualization Coding Agents

Paper • 2510.23642 • Published 16 days ago • 21

updated a dataset 15 days ago

TIGER-Lab/MMLU-Pro

Viewer • Updated 15 days ago • 12.1k • 58.6k • 392

New activity in TIGER-Lab/MMLU-Pro 15 days ago

158 exact duplicate pairs and 111 redundant superset pairs

#33 opened about 2 months ago by

mkieffer

Consolidated note on Health category issues and recent updates (category changes, time-sensitive items, and near-duplicates)

#36 opened 15 days ago by

ubowang

Many issues, duplicates, and problem questions in the Health category

#31 opened about 2 months ago by

mkieffer

New activity in TIGER-Lab/MMLU-Pro 16 days ago

Remove CoTs that refer to Wikipedia and incorrect answer

#34 opened about 1 month ago by

mkieffer

Typo of Question-11783

#35 opened 25 days ago by

Loalii

New activity in TIGER-Lab/MMEB-Leaderboard 26 days ago

Update results of OEmbedding-v1-7B

#70 opened 27 days ago by

lopel233

upvoted a paper 27 days ago

BrowserAgent: Building Web Agents with Human-Inspired Web Browsing Actions

Paper • 2510.10666 • Published 28 days ago • 27

New activity in TIGER-Lab/MMEB-Leaderboard 27 days ago

Add results of IFM-TTE-7B

#69 opened 29 days ago by

haoyubu

upvoted 2 papers about 1 month ago

UniVideo: Unified Understanding, Generation, and Editing for Videos

Paper • 2510.08377 • Published Oct 9 • 70

MM-HELIX: Boosting Multimodal Long-Chain Reflective Reasoning with Holistic Platform and Adaptive Hybrid Policy Optimization

Paper • 2510.08540 • Published Oct 9 • 108

upvoted a paper about 2 months ago

Emergent Hierarchical Reasoning in LLMs through Reinforcement Learning

Paper • 2509.03646 • Published Sep 3 • 30

upvoted a paper 2 months ago

VerlTool: Towards Holistic Agentic Reinforcement Learning with Tool Use

Paper • 2509.01055 • Published Sep 1 • 73

updated a dataset 2 months ago

ubowang/test_data_hy_temp_0902

Updated Sep 2 • 3

published a dataset 2 months ago

ubowang/test_data_hy_temp_0902

Updated Sep 2 • 3

New activity in TIGER-Lab/MMLU-Pro 3 months ago

Question ID 996 Error in options

#28 opened 6 months ago by

maxidl

Question ID 5635 Incosistency in options

#29 opened 6 months ago by

maxidl

Question ID 3983: Inconsistency between 'answer' and 'answer_index'

#30 opened 5 months ago by

Cookie061499

yubo

AI & ML interests

Recent Activity

Organizations

ubowang's activity

Aligning to What? Rethinking Agent Generalization in MiniMax M2

158 exact duplicate pairs and 111 redundant superset pairs

Consolidated note on Health category issues and recent updates (category changes, time-sensitive items, and near-duplicates)

Many issues, duplicates, and problem questions in the Health category

Remove CoTs that refer to Wikipedia and incorrect answer

Typo of Question-11783

Update results of OEmbedding-v1-7B

Add results of IFM-TTE-7B

Question ID 996 Error in options

Question ID 5635 Incosistency in options

Question ID 3983: Inconsistency between 'answer' and 'answer_index'