johannhartmann's Collections
Computer Use Models
Updated 1 day ago
ByteDance-Seed/UI-TARS-72B-DPO • Image-Text-to-Text • 73B • Updated Jan 25 • 503 • 146
ByteDance-Seed/UI-TARS-7B-DPO • Image-Text-to-Text • 8B • Updated Jan 25 • 1.34k • 221
microsoft/OmniParser • Image-Text-to-Text • Updated Dec 2, 2024 • 379 • 1.7k
jadechoghari/Ferret-UI-Llama8b • Image-Text-to-Text • 8B • Updated Jan 8 • 226 • 68
microsoft/GUI-Actor-7B-Qwen2.5-VL • Image-Text-to-Text • 8B • Updated Aug 9 • 1.69k • 24
showlab/ShowUI-2B • Updated Mar 11 • 2.67k • 268
Zery/CUA_World_State_Model • Image-Text-to-Text • Updated Aug 7 • 31 • 4
microsoft/Fara-7B • Image-Text-to-Text • 8B • Updated about 20 hours ago • 40 • 166
Qwen/Qwen2.5-Omni-7B • Any-to-Any • 11B • Updated Apr 30 • 126k • 1.82k
Hcompany/Holo2-30B-A3B • Image-Text-to-Text • 31B • Updated 5 days ago • 1.12k • 36
Hcompany/Holo2-4B • Image-Text-to-Text • 4B • Updated 12 days ago • 2.02k • 15
Hcompany/Holo2-8B • Image-Text-to-Text • 9B • Updated 12 days ago • 227 • 15
AskUI/PTA-1 • Image-Text-to-Text • 0.3B • Updated Nov 28, 2024 • 525 • 97
OS-Copilot/OS-Atlas-Base-7B • Image-Text-to-Text • 8B • Updated Nov 19, 2024 • 784 • 42
Qwen/Qwen2-VL-7B-Instruct • Image-Text-to-Text • 8B • Updated Feb 6 • 1.55M • 1.24k
xlangai/OpenCUA-72B • Image-Text-to-Text • 73B • Updated 15 days ago • 63 • 3
xlangai/OpenCUA-32B • Image-Text-to-Text • 33B • Updated Aug 18 • 674 • 24
xlangai/OpenCUA-7B • Image-Text-to-Text • 8B • Updated 13 days ago • 11.3k • 18
xlangai/Jedi-7B-1080p • Image-Text-to-Text • 8B • Updated Jun 18 • 81 • 29
xlangai/Jedi-3B-1080p • Image-Text-to-Text • 4B • Updated Jun 18 • 109 • 17
Qwen/Qwen3-VL-8B-Instruct • Image-Text-to-Text • 9B • Updated Oct 15 • 2.27M • 472
Qwen/Qwen3-VL-8B-Thinking • Image-Text-to-Text • 9B • Updated about 3 hours ago • 220k • 145