Running on Zero MCP Featured 1.98k Stable Video Diffusion 1.1 📺 Featured 1.98k Generate a video from a single image
Generative Multimodal Models are In-Context Learners Paper • 2312.13286 • Published Dec 20, 2023 • 37
COSMO: COntrastive Streamlined MultimOdal Model with Interleaved Pre-Training Paper • 2401.00849 • Published Jan 1, 2024 • 17
Running on A10G Featured 142 TextDiffuser 2 📚 Featured 142 Generate images from text prompts with layout planning
AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling Paper • 2402.12226 • Published Feb 19, 2024 • 45