xlalex's Collections
interleaved
ocr
3d
world model
omni
infra
synthesis
perception
survey
RL
critic
speech full duplex
agent
self-paly

synthesis

updated 17 days ago

  • Oasis: One Image is All You Need for Multimodal Instruction Data Synthesis

    Paper • 2503.08741 • Published Mar 11 • 1

  • Math-LLaVA: Bootstrapping Mathematical Reasoning for Multimodal Large Language Models

    Paper • 2406.17294 • Published Jun 25, 2024 • 11

  • SynthRL: Scaling Visual Reasoning with Verifiable Data Synthesis

    Paper • 2506.02096 • Published Jun 2 • 52

  • VisualSphinx: Large-Scale Synthetic Vision Logic Puzzles for RL

    Paper • 2505.23977 • Published May 29 • 10