zhangshuo

mcflurryshuoz

·

zsxzs

AI & ML interests

None yet

Recent Activity

authored a paper 2 days ago

GitTaskBench: A Benchmark for Code Agents Solving Real-World Tasks Through Code Repository Leveraging

authored a paper 2 days ago

RepoMaster: Autonomous Exploration and Understanding of GitHub Repositories for Complex Task Solving

authored a paper 2 days ago

KnowMe-Bench: Benchmarking Person Understanding for Lifelong Digital Companions

authored 7 papers 2 days ago

GitTaskBench: A Benchmark for Code Agents Solving Real-World Tasks Through Code Repository Leveraging

Paper • 2508.18993 • Published Aug 26, 2025 • 4

RepoMaster: Autonomous Exploration and Understanding of GitHub Repositories for Complex Task Solving

Paper • 2505.21577 • Published May 27, 2025 • 3

KnowMe-Bench: Benchmarking Person Understanding for Lifelong Digital Companions

Paper • 2601.04745 • Published 9 days ago • 50

Watching, Reasoning, and Searching: A Video Deep Research Benchmark on Open Web for Agentic Video Reasoning

Paper • 2601.06943 • Published 6 days ago • 202

FinVault: Benchmarking Financial Agent Safety in Execution-Grounded Environments

Paper • 2601.07853 • Published 8 days ago • 7

Controlled Self-Evolution for Algorithmic Code Optimization

Paper • 2601.07348 • Published 5 days ago • 105

EvoFSM: Controllable Self-Evolution for Deep Research with Finite State Machines

Paper • 2601.09465 • Published 3 days ago • 38