vincentoh/Llama-3.2-3B-GuardReasoner-Exp19-HSDPO-Toy
Tags: Text Classification, PEFT, Safetensors, Transformers, English, llama-3.2, llama, guardreasoner, guardrails, content-moderation, safety, dpo, hs-dpo, lora, trl
License: llama3.2
Repository size: 733 MB
Contributors: 1
History: 5 commits
Latest commit: "Update README.md" by vincentoh (7213397, verified), about 2 months ago
Name | Size | Last commit | Updated
checkpoint-16 | - | Upload Llama 3.2 3B GuardReasoner Exp 19: HS-DPO Toy (10% dataset) - License compliant | about 2 months ago
checkpoint-8 | - | Upload Llama 3.2 3B GuardReasoner Exp 19: HS-DPO Toy (10% dataset) - License compliant | about 2 months ago
.gitattributes | 1.7 kB | Upload Llama 3.2 3B GuardReasoner Exp 19: HS-DPO Toy (10% dataset) - License compliant | about 2 months ago
README.md | 5.18 kB | Update README.md | about 2 months ago
adapter_config.json | 1.06 kB | Upload Llama 3.2 3B GuardReasoner Exp 19: HS-DPO Toy (10% dataset) - License compliant | about 2 months ago
adapter_model.safetensors | 97.3 MB (xet) | Upload Llama 3.2 3B GuardReasoner Exp 19: HS-DPO Toy (10% dataset) - License compliant | about 2 months ago
chat_template.jinja | 3.83 kB | Upload Llama 3.2 3B GuardReasoner Exp 19: HS-DPO Toy (10% dataset) - License compliant | about 2 months ago
special_tokens_map.json | 454 Bytes | Upload Llama 3.2 3B GuardReasoner Exp 19: HS-DPO Toy (10% dataset) - License compliant | about 2 months ago
tokenizer.json | 17.2 MB (xet) | Upload Llama 3.2 3B GuardReasoner Exp 19: HS-DPO Toy (10% dataset) - License compliant | about 2 months ago
tokenizer_config.json | 50.6 kB | Upload Llama 3.2 3B GuardReasoner Exp 19: HS-DPO Toy (10% dataset) - License compliant | about 2 months ago
training_args.bin | 6.93 kB (xet) | Upload Llama 3.2 3B GuardReasoner Exp 19: HS-DPO Toy (10% dataset) - License compliant | about 2 months ago