razozang/deepseek-r1-dpo-1.5B
Text Generation
Transformers
Safetensors
qwen2
unsloth
conversational
text-generation-inference
arxiv:1910.09700
main
deepseek-r1-dpo-1.5B/.gitattributes
Commit History
Trained with Unsloth (2a80a9b, verified), razozang committed on Mar 24
Upload tokenizer (26e050f, verified), razozang committed on Mar 24
Delete .gitattributes (4c91a1d, verified), razozang committed on Mar 24
Upload model-00002-of-00002.safetensors (106384f, verified), razozang committed on Mar 19
Upload model-00001-of-00002.safetensors (d7f389e, verified), razozang committed on Mar 19
Upload 11 files (dd6b42c, verified), razozang committed on Mar 17
Delete .gitattributes (7b4c57a, verified), razozang committed on Mar 17
Upload 11 files (c69a94e, verified), razozang committed on Mar 13
initial commit (7d643cd, verified), razozang committed on Mar 13