Generate edited video frames using text prompts
Set up and customize Stable Diffusion WebUI
Generate and convert voice using text and audio inputs