october-project-first-sweep-20251007-141824-t00

Multilingual XLM-T (EN/IT/ES) binary classifier
Task: LGBTQ+ reclamation vs non-reclamation on social media text.

Trial timestamp (UTC): 2025-10-07 14:18:24

Configuration (trial hyperparameters)

Hyperparameter	Value
LR	2e-05
EPOCHS	4
MAX_LENGTH	256
USE_BIO	True
USE_LANG_TOKEN	True
GATED_BIO	True
FOCAL_LOSS	True
FOCAL_GAMMA	1.5
USE_SAMPLER	True
R_DROP	True
R_KL_ALPHA	1.0

Dev set results

Metric	Value
trial	0
cfg	{'LR': 2e-05, 'EPOCHS': 4.0, 'MAX_LENGTH': 256.0, 'USE_BIO': 1.0, 'USE_LANG_TOKEN': 1.0, 'GATED_BIO': 1.0, 'FOCAL_LOSS': 1.0, 'FOCAL_GAMMA': 1.5, 'USE_SAMPLER': 1.0, 'R_DROP': 1.0, 'R_KL_ALPHA': 1.0}
f1_macro_dev	0.7784357266222551
best_threshold_dev	0.65
precision_macro_dev	0.7822163874301908
recall_macro_dev	0.7748173701298702

Data

Train/Dev: private dataset (merged train_en.csv, train_it.csv, train_es.csv with 15% stratified Dev).
The dataset id is intentionally not disclosed in metadata.

Usage

from transformers import AutoTokenizer, AutoModelForSequenceClassification
tok = AutoTokenizer.from_pretrained("SimoneAstarita/october-project-first-sweep-20251007-141824-t00")
model = AutoModelForSequenceClassification.from_pretrained("SimoneAstarita/october-project-first-sweep-20251007-141824-t00")

Downloads last month: -; Downloads are not tracked for this model. How to track