Rubric-Based Benchmarking and Reinforcement Learning for Advancing LLM Instruction Following Paper • 2511.10507 • Published 4 days ago • 5
Extrapolative Controlled Sequence Generation via Iterative Refinement Paper • 2303.04562 • Published Mar 8, 2023 • 1
Testing the General Deductive Reasoning Capacity of Large Language Models Using OOD Examples Paper • 2305.15269 • Published May 24, 2023 • 1
SQuALITY: Building a Long-Document Summarization Dataset the Hard Way Paper • 2205.11465 • Published May 23, 2022 • 1
QuALITY: Question Answering with Long Input Texts, Yes! Paper • 2112.08608 • Published Dec 16, 2021 • 3
Extrapolative Controlled Sequence Generation via Iterative Refinement Paper • 2303.04562 • Published Mar 8, 2023 • 1