Model Fintune

by BITDDD - opened Mar 24

Discussion

BITDDD

Mar 24

Does the model support extending its own tags and training based on its own images?

merve

Google org Mar 25

@BITDDD you can probably fine-tune it on different images and policies like how you would fine-tune a vision language model.

kumanatsu

May 1

Use Unsloth for gemma3 and vor visual fine tuning is working great.

lkv

Google org Sep 2

Hi , Apologies for the delay,

Yes, you can extend the model's capabilities to recognize new concepts and tags, but it's done through a process called fine-tuning, not through a fully autonomous, self-training loop.
Gemma 3 (if you're referring to the multimodal variant) can be fine-tuned like other vision-language models (VLMs), but tag extension and image-specific training depend on how your training data is structured and what framework you're using.

Kindly refer this documentation for more information. Thank you

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment