Shilong Zhang's picture

4 6 12

Shilong Zhang

shilongz

·

https://jshilong.github.io/

jshilong

AI & ML interests

His research interests are primarily focused on Large Vision-Language Model and Large Vision Generation Model

Organizations

authored 2 papers 10 months ago

FlashVideo:Flowing Fidelity to Detail for Efficient High-Resolution Video Generation

Paper • 2502.05179 • Published Feb 7 • 24

Goku: Flow Based Video Generative Foundation Models

Paper • 2502.04896 • Published Feb 7 • 106

authored 3 papers over 1 year ago

Zero-shot Image Editing with Reference Imitation

Paper • 2406.07547 • Published Jun 11, 2024 • 33

Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation

Paper • 2406.06525 • Published Jun 10, 2024 • 71

FlashFace: Human Image Personalization with High-fidelity Identity Preservation

Paper • 2403.17008 • Published Mar 25, 2024 • 22

authored 2 papers over 2 years ago

GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest

Paper • 2307.03601 • Published Jul 7, 2023 • 12

MultiModal-GPT: A Vision and Language Model for Dialogue with Humans

Paper • 2305.04790 • Published May 8, 2023 • 1