microsoft/trocr-base-handwritten
Image-to-Text • 0.3B • Updated • 209k • 454
Latent Sketchpad: Sketching Visual Thoughts to Elicit Multimodal Reasoning in MLLMs
QueST: Incentivizing LLMs to Generate Difficult Problems