stepfun-ai/GOT-OCR2_0
Image-Text-to-Text • 0.7B • Updated • 38.1k • 1.52k
Generate MIDI music from prompts
Segment and track objects in a video
Demo for multimodal understanding and generation