waltsmorz's Collections
Toread

updated Mar 6, 2024

  • Scaling Rectified Flow Transformers for High-Resolution Image Synthesis

    Paper • 2403.03206 • Published Mar 5, 2024 • 70

  • Feast Your Eyes: Mixture-of-Resolution Adaptation for Multimodal Large Language Models

    Paper • 2403.03003 • Published Mar 5, 2024 • 11

  • MAGID: An Automated Pipeline for Generating Synthetic Multi-modal Datasets

    Paper • 2403.03194 • Published Mar 5, 2024 • 15

  • Finetuned Multimodal Language Models Are High-Quality Image-Text Data Filters

    Paper • 2403.02677 • Published Mar 5, 2024 • 18