Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
rayruiyang 's Collections
VST
Haplo-VL

VST

updated 21 days ago

A comprehensive framework designed to cultivate VLMs with human-like visuospatial abilities.

Upvote
6

  • rayruiyang/VST-3B-RL

    Image-Text-to-Text • 4B • Updated 22 days ago • 813 • 3

  • rayruiyang/VST-3B-SFT

    Image-Text-to-Text • 4B • Updated 22 days ago • 2.3k

  • rayruiyang/VST-7B-SFT

    Image-Text-to-Text • 8B • Updated 22 days ago • 2.52k

  • rayruiyang/VST-7B-RL

    Image-Text-to-Text • 8B • Updated 22 days ago • 624

  • Visual Spatial Tuning

    Paper • 2511.05491 • Published 26 days ago • 49
Upvote
6
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs