bb's picture

1 17

bb

bubbleseller

AI & ML interests

None yet

Recent Activity

updated a model 16 days ago

bubbleseller/dino_wm_ckpt

published a model 16 days ago

bubbleseller/dino_wm_ckpt

View all activity

Organizations

None yet

upvoted 6 papers 5 months ago

HunyuanWorld 1.0: Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels

Paper • 2507.21809 • Published Jul 29, 2025 • 136

The Invisible Leash: Why RLVR May Not Escape Its Origin

Paper • 2507.14843 • Published Jul 20, 2025 • 85

Reasoning or Memorization? Unreliable Results of Reinforcement Learning Due to Data Contamination

Paper • 2507.10532 • Published Jul 14, 2025 • 89

Token Bottleneck: One Token to Remember Dynamics

Paper • 2507.06543 • Published Jul 9, 2025 • 20

Geometry Forcing: Marrying Video Diffusion and 3D Representation for Consistent World Modeling

Paper • 2507.07982 • Published Jul 10, 2025 • 33

Critiques of World Models

Paper • 2507.05169 • Published Jul 7, 2025 • 25

upvoted 11 papers 6 months ago

Scaling RL to Long Videos

Paper • 2507.07966 • Published Jul 10, 2025 • 159

Vision-Language-Vision Auto-Encoder: Scalable Knowledge Distillation from Diffusion Models

Paper • 2507.07104 • Published Jul 9, 2025 • 45

KV Cache Steering for Inducing Reasoning in Small Language Models

Paper • 2507.08799 • Published Jul 11, 2025 • 40

OmniGen2: Exploration to Advanced Multimodal Generation

Paper • 2506.18871 • Published Jun 23, 2025 • 78

Matrix-Game: Interactive World Foundation Model

Paper • 2506.18701 • Published Jun 23, 2025 • 72

VMem: Consistent Interactive Video Scene Generation with Surfel-Indexed View Memory

Paper • 2506.18903 • Published Jun 23, 2025 • 22

PAROAttention: Pattern-Aware ReOrdering for Efficient Sparse and Quantized Attention in Visual Generation Models

Paper • 2506.16054 • Published Jun 19, 2025 • 60

Show-o2: Improved Native Unified Multimodal Models

Paper • 2506.15564 • Published Jun 18, 2025 • 29

V-JEPA 2: Self-Supervised Video Models Enable Understanding, Prediction and Planning

Paper • 2506.09985 • Published Jun 11, 2025 • 29

Discrete Diffusion in Large Language and Multimodal Models: A Survey

Paper • 2506.13759 • Published Jun 16, 2025 • 43

Wait, We Don't Need to "Wait"! Removing Thinking Tokens Improves Reasoning Efficiency

Paper • 2506.08343 • Published Jun 10, 2025 • 54