HappyEval

community

Activity Feed

AI & ML interests

None defined yet.

Recent Activity

wonderwind271 updated a dataset about 12 hours ago

HappyEval/MATH-500-embedding

wonderwind271 published a dataset about 12 hours ago

HappyEval/MATH-500-embedding

wonderwind271 updated a dataset about 12 hours ago

HappyEval/tombench-embedding

View all activity

wonderwind271

updated a dataset about 12 hours ago

HappyEval/MATH-500-embedding

Viewer • Updated about 12 hours ago • 500 • 3

wonderwind271

published a dataset about 12 hours ago

HappyEval/MATH-500-embedding

Viewer • Updated about 12 hours ago • 500 • 3

wonderwind271

updated a dataset about 12 hours ago

HappyEval/tombench-embedding

Viewer • Updated about 12 hours ago • 2.86k • 5

wonderwind271

published a dataset about 12 hours ago

HappyEval/tombench-embedding

Viewer • Updated about 12 hours ago • 2.86k • 5

wonderwind271

updated a dataset about 21 hours ago

HappyEval/Social_i_qa-embedding

Viewer • Updated about 21 hours ago • 33.4k • 3

wonderwind271

published a dataset about 21 hours ago

HappyEval/Social_i_qa-embedding

Viewer • Updated about 21 hours ago • 33.4k • 3

wonderwind271

updated a dataset 1 day ago

HappyEval/Social_i_qa-text

Viewer • Updated 1 day ago • 35.4k • 9

wonderwind271

published a dataset 1 day ago

HappyEval/Social_i_qa-text

Viewer • Updated 1 day ago • 35.4k • 9

wonderwind271

updated a dataset 1 day ago

HappyEval/Hi-ToM-embedding

Viewer • Updated 1 day ago • 1.2k • 4

wonderwind271

published a dataset 1 day ago

HappyEval/Hi-ToM-embedding

Viewer • Updated 1 day ago • 1.2k • 4

marstin

authored 4 papers 21 days ago

AimBot: A Simple Auxiliary Visual Cue to Enhance Spatial Awareness of Visuomotor Policies

Paper • 2508.08113 • Published Aug 11 • 11

From Behavioral Performance to Internal Competence: Interpreting Vision-Language Models with VLM-Lens

Paper • 2510.02292 • Published Oct 2 • 1

Communication and Verification in LLM Agents towards Collaboration under Information Asymmetry

Paper • 2510.25595 • Published 27 days ago

ROVER: Benchmarking Reciprocal Cross-Modal Reasoning for Omnimodal Generation

Paper • 2511.01163 • Published 22 days ago • 31

wonderwind271

authored a paper about 2 months ago

From Behavioral Performance to Internal Competence: Interpreting Vision-Language Models with VLM-Lens

Paper • 2510.02292 • Published Oct 2 • 1

marstin

authored a paper 5 months ago

Do Vision-Language Models Have Internal World Models? Towards an Atomic Evaluation

Paper • 2506.21876 • Published Jun 27 • 28

yuexiang96

authored 4 papers 5 months ago

AI & ML interests

Recent Activity

Team members 7

HappyEval's activity