Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
NeoByBy's picture
3 9 4

NeoByBy

NeoByBy
21world's profile picture 0xSojalSec's profile picture
·

AI & ML interests

None yet

Organizations

ByteDance's profile picture

Collections 1

DPO STAR for math
  • Self-Training with Direct Preference Optimization Improves Chain-of-Thought Reasoning

    Paper • 2407.18248 • Published Jul 25, 2024 • 33
DPO STAR for math
  • Self-Training with Direct Preference Optimization Improves Chain-of-Thought Reasoning

    Paper • 2407.18248 • Published Jul 25, 2024 • 33

models 0

None public yet

datasets 0

None public yet
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs