Collections including paper arxiv:2507.17744

- EVA-CLIP-18B: Scaling CLIP to 18 Billion Parameters
  Paper • 2402.04252 • Published • 29
- Vision Superalignment: Weak-to-Strong Generalization for Vision Foundation Models
  Paper • 2402.03749 • Published • 14
- ScreenAI: A Vision-Language Model for UI and Infographics Understanding
  Paper • 2402.04615 • Published • 44
- EfficientViT-SAM: Accelerated Segment Anything Model Without Performance Loss
  Paper • 2402.05008 • Published • 23

- HunyuanWorld 1.0: Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels
  Paper • 2507.21809 • Published • 132
- Yume: An Interactive World Generation Model
  Paper • 2507.17744 • Published • 85
- Diffuman4D: 4D Consistent Human View Synthesis from Sparse-View Videos with Spatio-Temporal Diffusion Models
  Paper • 2507.13344 • Published • 57
- ComfyUI-Copilot: An Intelligent Assistant for Automated Workflow Development
  Paper • 2506.05010 • Published • 79

- Yume: An Interactive World Generation Model
  Paper • 2507.17744 • Published • 85
- SSRL: Self-Search Reinforcement Learning
  Paper • 2508.10874 • Published • 95
- The Illusion of Thinking: Understanding the Strengths and Limitations of Reasoning Models via the Lens of Problem Complexity
  Paper • 2506.06941 • Published • 15
- Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning
  Paper • 2506.01939 • Published • 185

- A Survey on Long-Video Storytelling Generation: Architectures, Consistency, and Cinematic Quality
  Paper • 2507.07202 • Published • 24
- StreamDiT: Real-Time Streaming Text-to-Video Generation
  Paper • 2507.03745 • Published • 31
- LongAnimation: Long Animation Generation with Dynamic Global-Local Memory
  Paper • 2507.01945 • Published • 78
- TokensGen: Harnessing Condensed Tokens for Long Video Generation
  Paper • 2507.15728 • Published • 7

- Omni-Effects: Unified and Spatially-Controllable Visual Effects Generation
  Paper • 2508.07981 • Published • 58
- CharacterShot: Controllable and Consistent 4D Character Animation
  Paper • 2508.07409 • Published • 39
- ToonComposer: Streamlining Cartoon Production with Generative Post-Keyframing
  Paper • 2508.10881 • Published • 52
- Puppeteer: Rig and Animate Your 3D Models
  Paper • 2508.10898 • Published • 32

- Arbitrary-steps Image Super-resolution via Diffusion Inversion
  Paper • 2412.09013 • Published • 13
- Deep Researcher with Test-Time Diffusion
  Paper • 2507.16075 • Published • 66
- ∇NABLA: Neighborhood Adaptive Block-Level Attention
  Paper • 2507.13546 • Published • 123
- Yume: An Interactive World Generation Model
  Paper • 2507.17744 • Published • 85

- AnimaX: Animating the Inanimate in 3D with Joint Video-Pose Diffusion Models
  Paper • 2506.19851 • Published • 60
- SeqTex: Generate Mesh Textures in Video Sequence
  Paper • 2507.04285 • Published • 9
- Yume: An Interactive World Generation Model
  Paper • 2507.17744 • Published • 85
- STream3R: Scalable Sequential 3D Reconstruction with Causal Transformer
  Paper • 2508.10893 • Published • 31