Models
Datasets
Spaces
Docs
Enterprise
Pricing
Log In
Sign Up

NeoByBy's picture

3 9 4

NeoByBy

NeoByBy

21world's profile picture

0xSojalSec's profile picture

·

AI & ML interests

None yet

Organizations

Collections 1

DPO STAR for math

Self-Training with Direct Preference Optimization Improves Chain-of-Thought Reasoning

Paper • 2407.18248 • Published Jul 25, 2024 • 33

DPO STAR for math

Self-Training with Direct Preference Optimization Improves Chain-of-Thought Reasoning

Paper • 2407.18248 • Published Jul 25, 2024 • 33

models 0

None public yet

datasets 0

None public yet

Company

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs