Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2402.00396
RL/Alignment
Collection by Oct 14
27
  • Moral Foundations of Large Language Models

    Paper • 2310.15337 • Published Oct 23, 2023 • 1
  • Specific versus General Principles for Constitutional AI

    Paper • 2310.13798 • Published Oct 20, 2023 • 3
  • Contrastive Prefence Learning: Learning from Human Feedback without RL

    Paper • 2310.13639 • Published Oct 20, 2023 • 25
  • RLAIF: Scaling Reinforcement Learning from Human Feedback with AI Feedback

    Paper • 2309.00267 • Published Sep 1, 2023 • 51
RL/Alignment
Collection by Oct 14
27
  • Moral Foundations of Large Language Models

    Paper • 2310.15337 • Published Oct 23, 2023 • 1
  • Specific versus General Principles for Constitutional AI

    Paper • 2310.13798 • Published Oct 20, 2023 • 3
  • Contrastive Prefence Learning: Learning from Human Feedback without RL

    Paper • 2310.13639 • Published Oct 20, 2023 • 25
  • RLAIF: Scaling Reinforcement Learning from Human Feedback with AI Feedback

    Paper • 2309.00267 • Published Sep 1, 2023 • 51
  • Previous
  • 1
  • 2
  • Next
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs