euhidaman
/

bitmar-multimodal

Model card Files Files and versions

BitMar 100M Token Model

This model was trained on exactly 100 million tokens as part of the BabyLM challenge.

Training Details

Total tokens: 100,000,000
Epochs completed: 10
Tokens processed: 996,965,763
Cross-modal similarity: 0.4638

Model Architecture

Text encoder: 4 layers, 128 hidden size
Vision encoder: DiNOv2 features compressed to 128
Episodic memory: 32 slots

Usage

from transformers import AutoModel, AutoTokenizer

model = AutoModel.from_pretrained("euhidaman/bitmar-multimodal")
tokenizer = AutoTokenizer.from_pretrained("euhidaman/bitmar-multimodal")

Training Status

Status: Completed
Tokens Processed: 996,965,763
Best Cross-modal Similarity: 0.4638

Downloads last month: 5

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support