Datasets with reasoning traces for math and code (Train + Eval)
Maojia Song
OrangeEye
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
4 days ago
LLMs Can't Handle Peer Pressure: Crumbling under Multi-Agent Social
Interactions
liked
a Space
6 days ago
HuggingFaceTB/smol-training-playbook
upvoted
a
paper
about 1 month ago
Demystifying deep search: a holistic evaluation with hint-free multi-hop
questions and factorised metrics