Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
OmarAlterkait 's Collections
diffusion
NERF
3D
CV

CV

updated 23 days ago
Upvote
-

  • MaskBit: Embedding-free Image Generation via Bit Tokens

    Paper • 2409.16211 • Published Sep 24, 2024 • 17

  • Goku: Flow Based Video Generative Foundation Models

    Paper • 2502.04896 • Published Feb 7 • 106

  • Discrete Audio Tokens: More Than a Survey!

    Paper • 2506.10274 • Published Jun 12 • 32

  • HiWave: Training-Free High-Resolution Image Generation via Wavelet-Based Diffusion Sampling

    Paper • 2506.20452 • Published Jun 25 • 19

  • Radial Attention: O(nlog n) Sparse Attention with Energy Decay for Long Video Generation

    Paper • 2506.19852 • Published Jun 24 • 41

  • Franca: Nested Matryoshka Clustering for Scalable Visual Representation Learning

    Paper • 2507.14137 • Published Jul 18 • 34

  • InfGen: A Resolution-Agnostic Paradigm for Scalable Image Synthesis

    Paper • 2509.10441 • Published Sep 12 • 30

  • Vision Transformers Don't Need Trained Registers

    Paper • 2506.08010 • Published Jun 9 • 21

  • Ming-UniVision: Joint Image Understanding and Generation with a Unified Continuous Tokenizer

    Paper • 2510.06590 • Published about 1 month ago • 70

  • Advancing End-to-End Pixel Space Generative Modeling via Self-supervised Pre-training

    Paper • 2510.12586 • Published 24 days ago • 107
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs