Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
razozang
/
deepseek-r1-dpo-1.5B
like
0
Text Generation
Transformers
Safetensors
qwen2
unsloth
conversational
text-generation-inference
arxiv:
1910.09700
Model card
Files
Files and versions
xet
Community
Deploy
Use this model
main
deepseek-r1-dpo-1.5B
/
.gitattributes
razozang
Trained with Unsloth
2a80a9b
verified
8 months ago
raw
Copy download link
history
blame
contribute
delete
Safe
105 Bytes
tokenizer.json
filter
=lfs
diff
=lfs
merge
=lfs -text
model.safetensors
filter
=lfs
diff
=lfs
merge
=lfs -text