mradermacher/gpt2-rlhf-anthropic-GGUF
Transformers
GGUF
Anthropic/hh-rlhf
English
rlhf
reinforcement-learning-from-human-feedback
anthropic-hh-rlhf
chatgpt-style-training
ppo
supervised-fine-tuning
human-preferences
ai-alignment
gpt2
License: mit
main
gpt2-rlhf-anthropic-GGUF
Commit History
auto-patch README.md
837eaa8
verified
mradermacher
committed on
Sep 22
auto-patch README.md
1a40b9e
verified
mradermacher
committed on
Sep 22
uploaded from leia
c5dd94b
verified
mradermacher
committed on
Sep 22
uploaded from leia
550efac
verified
mradermacher
committed on
Sep 22
initial commit
77c6a47
verified
mradermacher
committed on
Sep 22