SigLip2 Math

This version of siglip2 is fine tuned on shiwk24/MathCanvas-Imagen using the code_derived_captions split. I trained for 1 epoch on 4M math images, with a random selection between the tikz code or caption using a batch size of 640.

This is not a classification model, since the loss function was pairwise contrastive loss. Use for embedding or downstream classifier training is recommended.

image

Downloads last month
21
Safetensors
Model size
0.4B params
Tensor type
F32
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for dleemiller/siglip2-math-base-patch16-256

Finetuned
(109)
this model

Dataset used to train dleemiller/siglip2-math-base-patch16-256