Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
rayruiyang 's Collections
VST
Haplo-VL

VST

updated 17 days ago

A comprehensive framework designed to cultivate VLMs with human-like visuospatial abilities.

Upvote
6

  • rayruiyang/VST-3B-RL

    Image-Text-to-Text • 4B • Updated 18 days ago • 769 • 3

  • rayruiyang/VST-3B-SFT

    Image-Text-to-Text • 4B • Updated 18 days ago • 1.95k

  • rayruiyang/VST-7B-SFT

    Image-Text-to-Text • 8B • Updated 18 days ago • 2.18k

  • rayruiyang/VST-7B-RL

    Image-Text-to-Text • 8B • Updated 18 days ago • 610

  • Visual Spatial Tuning

    Paper • 2511.05491 • Published 22 days ago • 49
Upvote
6
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs