3 50 17

Zhiyuan Ma PRO

ZhiyuanthePony

https://theericma.github.io/

AI & ML interests

3D Generation

Recent Activity

upvoted a paper 6 days ago

FullPart: Generating each 3D Part at Full Resolution

upvoted a paper 6 days ago

Emu3.5: Native Multimodal Models are World Learners

upvoted a paper 21 days ago

FlashWorld: High-quality 3D Scene Generation within Seconds

View all activity

Organizations

None yet

upvoted 2 papers 6 days ago

FullPart: Generating each 3D Part at Full Resolution

Paper • 2510.26140 • Published 7 days ago • 5

Emu3.5: Native Multimodal Models are World Learners

Paper • 2510.26583 • Published 6 days ago • 98

upvoted a paper 21 days ago

FlashWorld: High-quality 3D Scene Generation within Seconds

Paper • 2510.13678 • Published 21 days ago • 70

upvoted a paper 22 days ago

FlashVSR: Towards Real-Time Diffusion-Based Streaming Video Super-Resolution

Paper • 2510.12747 • Published 22 days ago • 36

upvoted a paper 23 days ago

InfiniHuman: Infinite 3D Human Creation with Precise Control

Paper • 2510.11650 • Published 23 days ago • 5

upvoted 2 papers 24 days ago

Thinking with Camera: A Unified Multimodal Model for Camera-Centric Understanding and Generation

Paper • 2510.08673 • Published 27 days ago • 121

Webscale-RL: Automated Data Pipeline for Scaling RL Data to Pretraining Levels

Paper • 2510.06499 • Published 29 days ago • 31

upvoted a paper 27 days ago

VideoCanvas: Unified Video Completion from Arbitrary Spatiotemporal Patches via In-Context Conditioning

Paper • 2510.08555 • Published 27 days ago • 62

upvoted a paper 30 days ago

Triangle Splatting+: Differentiable Rendering with Opaque Triangles

Paper • 2509.25122 • Published Sep 29 • 8

upvoted a paper about 1 month ago

Rolling Forcing: Autoregressive Long Video Diffusion in Real Time

Paper • 2509.25161 • Published Sep 29 • 23

upvoted a paper about 2 months ago

Kling-Avatar: Grounding Multimodal Instructions for Cascaded Long-Duration Avatar Animation Synthesis

Paper • 2509.09595 • Published Sep 11 • 48

upvoted 9 papers 3 months ago

MeshLLM: Empowering Large Language Models to Progressively Understand and Generate 3D Mesh

Paper • 2508.01242 • Published Aug 2 • 10

BANG: Dividing 3D Assets via Generative Exploded Dynamics

Paper • 2507.21493 • Published Jul 29 • 64

ARC-Hunyuan-Video-7B: Structured Video Comprehension of Real-World Shorts

Paper • 2507.20939 • Published Jul 28 • 56

EarthCrafter: Scalable 3D Earth Generation via Dual-Sparse Latent Diffusion

Paper • 2507.16535 • Published Jul 22 • 20

Captain Cinema: Towards Short Movie Generation

Paper • 2507.18634 • Published Jul 24 • 40

Group Sequence Policy Optimization

Paper • 2507.18071 • Published Jul 24 • 307

Zhiyuan Ma PRO

AI & ML interests

Recent Activity

Organizations

ZhiyuanthePony's activity