IDEFICS2 Playground
Chat with an AI assistant using text and images
Chat with an AI assistant using text and images
Create images with enhanced prompts
Video Editing
Generate relit images with foreground condition
Generate realistic audio from text
Enhance and restore old photos and AI-generated faces
Transfer portrait styles to images and videos
Segment images into parts and maps
Generate 3D views from a single image
Transform images based on text instructions
Train LoRAs with Ease
Upload an image and edit it using segmentation, inpainting, or regeneration
Create an animated video from audio and a reference image
Generate text from images and prompts
Edit images by providing prompts and noise settings
Generate OpenPose-filtered video from input video
Video Dubbing with Open Source Projects
Create videos with FFMPEG + Qwen2.5-Coder
Create video ads from product names
Generate images with your face
Real-Time Image Generation with SDXL Lightning
Clone a voice to speak any text
Generate images from text, existing images, or by inpainting
Transform images using various artistic effects
Generate a video from two images and text prompts
Generate stunning high quality illusion artwork
Generate images from sketches or uploaded images
Transcribe audio or YouTube videos into text
Generate detailed captions for images
4M: Massively Multimodal Masked Modeling
Generate a video from an image
Generate images from text prompts
Generate images from prompts or images
Stable Diffusion 3 with text2img and img2img
Create interactive videos from images with drag-and-draw controls
Advanced Image Generator
Enhance and upscale images with advanced controls
Generate images preserving face identity
Inpaint images using prompts
Gradio demo of CharacterGen (SIGGRAPH 2024)
Edit images using text prompts and masks
DALLE 4K | A RealVisXL_V3, V4 | HI-Res Images Gen.
Generate 360° panorama images from text prompts
Create videos from text prompts
Generate customized realistic photos from face images
Generate images using text and reference images
Remove background from images
Clarity AI Upscaler Reproduction
Generate a video animating a source image to match a given audio
Audio-based Lip Sync for Talking Head Video Editing
Mesclar dois vídeos e verificar GPU e NVENC
Create images of a given character in different poses
Transfers textures from a reference image to a masked region in a source image
Create a video using aligned poses from an image and a dance video
Analyze human behaviors from videos
Generate subtitles and translate audio files
Create virtual outfits by combining images
Erase any object from an image with just a prompt
Audio-Driven Portrait Animations
Generate normal maps from images and videos
Stunning images using stable diffusion.
Vocal and background audio separator
Stunning images using stable diffusion.
Create animated videos from images
Edit images with predefined styles or text prompts
Generate images from text prompts
Generate a 3D mesh model from an image
Segment objects in images by selecting points
Launch a web interface for model interaction
Fast Text 2 Video Generator
Teleport objects into new backgrounds using masks
Aesthetically Controllable Text-Driven Stylization w/o Train
Generate images from text and an image prompt
Easily remove your videos background!
Turn an image into a motion video
Text-to-Video
Animate Your Pictures With Stable VIdeo DIffusion
Try on clothes virtually with images
Generate text based on an image and prompt
Create HD cutouts from any image with just a prompt
Text-to-Video
Quickly edit the expression of a face
Flux-Labs with LoRA
Add a logo to anything
Text-to-3D and Image-to-3D Generation
Generate an edited image based on text and input image
Automatically discover creative knowledge inside diffusion
Fast image relighting using Latent Bridge Matching
Generate styled images from prompts and references
Generate high-resolution images with prompts and masks
Generate 3D video from input images
Generate 3D character models from single images
Convert images of humans to biomechanically accurate 3D skeletons
Transcribe audio files or YouTube videos into text
Enhance facial features in images using a reference face
Generate videos from images and prompts
Infinite-Length Film Generation
Use NVIDIA H100 GPU
Enhance image resolution up to 8x