Dian Zheng's picture

1 9

Dian Zheng

zhengli1013

·

https://zhengdian1.github.io/

zhengdian1

AI & ML interests

generative model

Recent Activity

authored a paper 1 day ago

EditThinker: Unlocking Iterative Reasoning for Any Image Editor

upvoted a paper 1 day ago

EditThinker: Unlocking Iterative Reasoning for Any Image Editor

upvoted a paper 4 days ago

DynamicVerse: A Physically-Aware Multimodal Framework for 4D World Modeling

View all activity

Organizations

upvoted a paper 1 day ago

EditThinker: Unlocking Iterative Reasoning for Any Image Editor

Paper • 2512.05965 • Published 4 days ago • 33

upvoted a paper 4 days ago

DynamicVerse: A Physically-Aware Multimodal Framework for 4D World Modeling

Paper • 2512.03000 • Published 7 days ago • 33

upvoted a paper 5 days ago

OneThinker: All-in-one Reasoning Model for Image and Video

Paper • 2512.03043 • Published 7 days ago • 29

upvoted a paper 6 days ago

Panorama Generation From NFoV Image Done Right

Paper • 2503.18420 • Published Mar 24 • 1

upvoted a paper 8 days ago

Architecture Decoupling Is Not All You Need For Unified Multimodal Model

Paper • 2511.22663 • Published 12 days ago • 28

upvoted 2 papers about 2 months ago

Think with 3D: Geometric Imagination Grounded Spatial Reasoning from Limited Views

Paper • 2510.18632 • Published Oct 21 • 21

Uni-MMMU: A Massive Multi-discipline Multimodal Unified Benchmark

Paper • 2510.13759 • Published Oct 15 • 9

upvoted a paper 4 months ago

LongVie: Multimodal-Guided Controllable Ultra-Long Video Generation

Paper • 2508.03694 • Published Aug 5 • 50

upvoted a paper 9 months ago

VBench-2.0: Advancing Video Generation Benchmark Suite for Intrinsic Faithfulness

Paper • 2503.21755 • Published Mar 27 • 33