Zedong Wang's picture

5 44 1

Zedong Wang

JackyWangAI

·

https://jacky1128.github.io

AI & ML interests

Computer Vision, Multi-task Learning.

Recent Activity

upvoted an article 7 days ago

Gemma 3n fully available in the open-source ecosystem!

upvoted a paper 12 days ago

MergeMix: A Unified Augmentation Paradigm for Visual and Multi-Modal Understanding

upvoted a paper 22 days ago

OmniVinci: Enhancing Architecture and Data for Omni-Modal Understanding LLM

View all activity

Organizations

upvoted an article 7 days ago

Article

Gemma 3n fully available in the open-source ecosystem!

Jun 26

• 120

upvoted a paper 12 days ago

MergeMix: A Unified Augmentation Paradigm for Visual and Multi-Modal Understanding

Paper • 2510.23479 • Published 15 days ago • 14

upvoted a paper 22 days ago

OmniVinci: Enhancing Architecture and Data for Omni-Modal Understanding LLM

Paper • 2510.15870 • Published 24 days ago • 86

upvoted 3 papers about 1 month ago

Self-Forcing++: Towards Minute-Scale High-Quality Video Generation

Paper • 2510.02283 • Published Oct 2 • 92

Less is More: Recursive Reasoning with Tiny Networks

Paper • 2510.04871 • Published Oct 6 • 470

The Dragon Hatchling: The Missing Link between the Transformer and Models of the Brain

Paper • 2509.26507 • Published Sep 30 • 526

upvoted 2 papers 3 months ago

3D-R1: Enhancing Reasoning in 3D VLMs for Unified Scene Understanding

Paper • 2507.23478 • Published Jul 31 • 15

Mix Data or Merge Models? Optimizing for Diverse Multi-Task Learning

Paper • 2410.10801 • Published Oct 14, 2024 • 3

updated a collection 3 months ago

Model Merging

6 items • Updated Aug 3

upvoted 5 papers 3 months ago

BANG: Dividing 3D Assets via Generative Exploded Dynamics

Paper • 2507.21493 • Published Jul 29 • 64

HunyuanWorld 1.0: Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels

Paper • 2507.21809 • Published Jul 29 • 131

AnimalClue: Recognizing Animals by their Traces

Paper • 2507.20240 • Published Jul 27 • 9

Music Arena: Live Evaluation for Text-to-Music

Paper • 2507.20900 • Published Jul 28 • 10

Temporal In-Context Fine-Tuning for Versatile Control of Video Diffusion Models

Paper • 2506.00996 • Published Jun 1 • 38

updated 2 collections 3 months ago

Model Merging

6 items • Updated Aug 3

Multi-Task Learning

18 items • Updated Jul 29