jaredjoss
·
AI & ML interests
None yet
Organizations
None yet
jaredjoss/reward-models
Updated
jaredjoss/pythia-70m-irl-29eps-01-rlhf-model
71M
•
Updated
jaredjoss/pythia-70m-irl-29eps-0035-rlhf-model
71M
•
Updated
jaredjoss/pythia-410m-irl-6eps-15reps-rlhf-model
0.4B
•
Updated
jaredjoss/pythia-70m-dahoas-hh-1-epoch-1000-steps-1e-7-lr-sft
70.4M
•
Updated
jaredjoss/pythia-160m-dahoas-hh-1-epoch-1000-steps-1e-7-lr-sft
0.2B
•
Updated
jaredjoss/pythia-410m-dahoas-hh-1-epoch-10000-steps-sft
0.4B
•
Updated
jaredjoss/pythia-160m-dahoas-hh-1-epoch-10000-steps-sft
0.2B
•
Updated
jaredjoss/pythia-70m-dahoas-hh-1-epoch-10000-steps-sft
70.4M
•
Updated
jaredjoss/pythia-70m-irl-10eps-58reps-rlhf-model
71M
•
Updated
jaredjoss/pythia-410m-roberta-lr_8e7-kl_01-steps_12000-rlhf-model
Text Generation
•
0.4B
•
Updated
•
16
jaredjoss/pythia-410m-roberta-lr_8e7-kl_005-steps_2000-rlhf-model
Text Generation
•
0.4B
•
Updated
jaredjoss/pythia-160m-roberta-lr_1e6-kl_0035-steps_1000-rlhf-model
Text Generation
•
0.2B
•
Updated
jaredjoss/pythia-70m-roberta-lr_3e6-kl_0035-steps_600-rlhf-model
Text Generation
•
71M
•
Updated
jaredjoss/pythia-410m-roberta-rlhf-model
Text Generation
•
0.4B
•
Updated
•
1
•
1
jaredjoss/pythia-160m-roberta-rlhf-model
Text Generation
•
0.2B
•
Updated
jaredjoss/pythia-70m-roberta-rlhf-model
Text Generation
•
71M
•
Updated
jaredjoss/finetuned_toxicity_410_model
Updated
jaredjoss/finetuned_toxicity_70_model
Updated
jaredjoss/pythia-160m-rlhf-pythia-70m-toxicity-model-v2
Text Generation
•
0.2B
•
Updated
jaredjoss/pythia-70m-toxicity-model-pythia-160m-rlhf
Text Generation
•
0.2B
•
Updated
jaredjoss/roberta-toxicity-classifier-pythia-160m-rlhf
Text Generation
•
0.2B
•
Updated
jaredjoss/pythia-160m-rlhf-pythia-70m-toxicity-model
Text Generation
•
0.2B
•
Updated