1 9 3

Tong He

tonghe90

http://tonghe90.github.io

AI & ML interests

SII is an institution dedicated to innovation in education and research in the field of AI

Recent Activity

upvoted a paper about 1 month ago

BRIDGE - Building Reinforcement-Learning Depth-to-Image Data Generation Engine for Monocular Depth Estimation

upvoted a paper about 2 months ago

Understand Before You Generate: Self-Guided Training for Autoregressive Image Generation

authored a paper about 2 months ago

ABCNet: Real-time Scene Text Spotting with Adaptive Bezier-Curve Network

View all activity

Organizations

upvoted a paper about 1 month ago

BRIDGE - Building Reinforcement-Learning Depth-to-Image Data Generation Engine for Monocular Depth Estimation

Paper • 2509.25077 • Published Sep 29 • 14

upvoted a paper about 2 months ago

Understand Before You Generate: Self-Guided Training for Autoregressive Image Generation

Paper • 2509.15185 • Published Sep 18 • 29

authored 5 papers about 2 months ago

upvoted a paper about 2 months ago

OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling

Paper • 2509.12201 • Published Sep 15 • 103

liked a dataset about 2 months ago

InternRobotics/OmniWorld

Viewer • Updated 28 days ago • 5.54B • 35.8k • 72

liked a model 2 months ago

facebook/MobileLLM-R1-950M

Text Generation • 0.9B • Updated Sep 30 • 4.05k • 351

upvoted a paper 2 months ago

WinT3R: Window-Based Streaming Reconstruction with Camera Token Pool

Paper • 2509.05296 • Published Sep 5 • 7

upvoted a paper 3 months ago

Seed Diffusion: A Large-Scale Diffusion Language Model with High-Speed Inference

Paper • 2508.02193 • Published Aug 4 • 130

upvoted a paper 4 months ago

Yume: An Interactive World Generation Model

Paper • 2507.17744 • Published Jul 23 • 85

authored 2 papers 4 months ago

Aether: Geometric-Aware Unified World Modeling

Paper • 2503.18945 • Published Mar 24 • 28

$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning

Paper • 2507.13347 • Published Jul 17 • 64

upvoted a paper 4 months ago

π^3: Scalable Permutation-Equivariant Visual Geometry Learning

Paper • 2507.13347 • Published Jul 17 • 64

commented a paper 4 months ago

$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning

Paper • 2507.13347 • Published Jul 17 • 64 •

authored 3 papers 4 months ago

GoalFlow: Goal-Driven Flow Matching for Multimodal Trajectories Generation in End-to-End Autonomous Driving

Paper • 2503.05689 • Published Mar 7 • 3

DICEPTION: A Generalist Diffusion Model for Visual Perceptual Tasks

Paper • 2502.17157 • Published Feb 24 • 52

Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling

Paper • 2412.05271 • Published Dec 6, 2024 • 159

Tong He

AI & ML interests

Recent Activity

Organizations

tonghe90's activity