trl-internal-testing/hh-rlhf-helpful-base-trl-style Viewer • Updated May 2, 2024 • 46.2k • 2.18k • 13
Compound AI Systems Optimization: A Survey of Methods, Challenges, and Future Directions Paper • 2506.08234 • Published Jun 9 • 9
Group Think: Multiple Concurrent Reasoning Agents Collaborating at Token Level Granularity Paper • 2505.11107 • Published May 16 • 29
Pythia Scaling Suite Collection Pythia is the first LLM suite designed specifically to enable scientific research on LLMs. To learn more see https://github.com/EleutherAI/pythia • 18 items • Updated Feb 26 • 31