
Shi Minglei

MingleiShi

AI & ML interests

None yet

Recent Activity

updated a model about 1 month ago

KlingTeam/SVG-T2I

Text-to-Image • Updated Dec 18, 2025 • 39 • 29

upvoted a paper about 1 month ago

Scone: Bridging Composition and Distinction in Subject-Driven Image Generation via Unified Understanding-Generation Modeling

Paper • 2512.12675 • Published Dec 14, 2025 • 41

authored a paper about 1 month ago

SVG-T2I: Scaling Up Text-to-Image Latent Diffusion Model Without Variational Autoencoder

Paper • 2512.11749 • Published Dec 12, 2025 • 39

commented on a paper about 1 month ago

SVG-T2I: Scaling Up Text-to-Image Latent Diffusion Model Without Variational Autoencoder

Paper • 2512.11749 • Published Dec 12, 2025 • 39

upvoted a paper about 1 month ago

SVG-T2I: Scaling Up Text-to-Image Latent Diffusion Model Without Variational Autoencoder

Paper • 2512.11749 • Published Dec 12, 2025 • 39

liked a model about 1 month ago

KlingTeam/SVG-T2I

Text-to-Image • Updated Dec 18, 2025 • 39 • 29

published a model about 1 month ago

KlingTeam/SVG-T2I

Text-to-Image • Updated Dec 18, 2025 • 39 • 29

authored a paper about 1 month ago

Latent Diffusion Model without Variational Autoencoder

Paper • 2510.15301 • Published Oct 17, 2025 • 49

upvoted a paper 3 months ago

Latent Diffusion Model without Variational Autoencoder

Paper • 2510.15301 • Published Oct 17, 2025 • 49

commented on a paper 3 months ago

Latent Diffusion Model without Variational Autoencoder

Paper • 2510.15301 • Published Oct 17, 2025 • 49

liked a dataset 6 months ago

UCSC-VLAA/GPT-Image-Edit-1.5M

Viewer • Updated Aug 21, 2025 • 2.78M • 5.83k • 68

upvoted 2 papers 6 months ago

NoHumansRequired: Autonomous High-Quality Image Editing Triplet Mining

Paper • 2507.14119 • Published Jul 18, 2025 • 60

Zebra-CoT: A Dataset for Interleaved Vision Language Reasoning

Paper • 2507.16746 • Published Jul 22, 2025 • 35

upvoted a paper 7 months ago

UniPre3D: Unified Pre-training of 3D Point Cloud Models with Cross-Modal Gaussian Splatting

Paper • 2506.09952 • Published Jun 11, 2025 • 6

liked a dataset 8 months ago

yandex/alchemist

Viewer • Updated Jun 6, 2025 • 3.35k • 170 • 48

upvoted 3 papers 9 months ago

upvoted 2 papers 10 months ago

Any2Caption: Interpreting Any Condition to Caption for Controllable Video Generation

Paper • 2503.24379 • Published Mar 31, 2025 • 76

Wan: Open and Advanced Large-Scale Video Generative Models

Paper • 2503.20314 • Published Mar 26, 2025 • 56
