Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Shi Liu's picture
3 7 2

Shi Liu

CLLBJ16

AI & ML interests

None yet

Organizations

None yet

upvoted a paper 2 months ago

UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning

Paper • 2509.02544 • Published Sep 2 • 123
upvoted a paper 3 months ago

Deep Think with Confidence

Paper • 2508.15260 • Published Aug 21 • 88
upvoted a paper 4 months ago

MMBench-GUI: Hierarchical Multi-Platform Evaluation Framework for GUI Agents

Paper • 2507.19478 • Published Jul 25 • 30
upvoted 2 papers 5 months ago

Vision as a Dialect: Unifying Visual Understanding and Generation via Text-Aligned Representations

Paper • 2506.18898 • Published Jun 23 • 33

CoMemo: LVLMs Need Image Context with Image Memory

Paper • 2506.06279 • Published Jun 6 • 8
upvoted a paper 7 months ago

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

Paper • 2504.10479 • Published Apr 14 • 301
upvoted a paper 12 months ago

Enhancing the Reasoning Ability of Multimodal Large Language Models via Mixed Preference Optimization

Paper • 2411.10442 • Published Nov 15, 2024 • 86
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs