Model Fintune

#3
by BITDDD - opened

Does the model support extending its own tags and training based on its own images?

Google org

@BITDDD you can probably fine-tune it on different images and policies like how you would fine-tune a vision language model.

Use Unsloth for gemma3 and vor visual fine tuning is working great.

Google org

Hi , Apologies for the delay,

Yes, you can extend the model's capabilities to recognize new concepts and tags, but it's done through a process called fine-tuning, not through a fully autonomous, self-training loop.
Gemma 3 (if you're referring to the multimodal variant) can be fine-tuned like other vision-language models (VLMs), but tag extension and image-specific training depend on how your training data is structured and what framework you're using.

Kindly refer this documentation for more information. Thank you

Sign up or log in to comment