patelruskin's Collections

Video

updated Jan 4

  • Mind the Time: Temporally-Controlled Multi-Event Video Generation

    Paper • 2412.05263 • Published Dec 6, 2024 • 11

  • Divot: Diffusion Powers Video Tokenizer for Comprehension and Generation

    Paper • 2412.04432 • Published Dec 5, 2024 • 16

  • MotionShop: Zero-Shot Motion Transfer in Video Diffusion Models with Mixture of Score Guidance

    Paper • 2412.05355 • Published Dec 6, 2024 • 9

  • SynCamMaster: Synchronizing Multi-Camera Video Generation from Diverse Viewpoints

    Paper • 2412.07760 • Published Dec 10, 2024 • 55

  • 2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining

    Paper • 2501.00958 • Published Jan 1, 2025 • 107