GuideFlow3D: Optimization-Guided Rectified Flow For Appearance Transfer Paper • 2510.16136 • Published Oct 17 • 3
No Tokens Wasted: Leveraging Long Context in Biomedical Vision-Language Models Paper • 2510.03978 • Published Oct 4 • 2
Big-Math: A Large-Scale, High-Quality Math Dataset for Reinforcement Learning in Language Models Paper • 2502.17387 • Published Feb 24 • 7
RLAD: Training LLMs to Discover Abstractions for Solving Reasoning Problems Paper • 2510.02263 • Published Oct 2 • 8
Detecting Corpus-Level Knowledge Inconsistencies in Wikipedia with Large Language Models Paper • 2509.23233 • Published Sep 27 • 3
CHURRO: Making History Readable with an Open-Weight Large Vision-Language Model for High-Accuracy, Low-Cost Historical Text Recognition Paper • 2509.19768 • Published Sep 24 • 4
Thinking While Listening: Simple Test Time Scaling For Audio Classification Paper • 2509.19676 • Published Sep 24 • 4
BEHAVIOR-1K: A Human-Centered, Embodied AI Benchmark with 1,000 Everyday Activities and Realistic Simulation Paper • 2403.09227 • Published Mar 14, 2024 • 1
BEHAVIOR Robot Suite: Streamlining Real-World Whole-Body Manipulation for Everyday Household Activities Paper • 2503.05652 • Published Mar 7 • 11
BioClinical ModernBERT: A State-of-the-Art Long-Context Encoder for Biomedical and Clinical NLP Paper • 2506.10896 • Published Jun 12 • 4
MIRIAD: Augmenting LLMs with millions of medical query-response pairs Paper • 2506.06091 • Published Jun 6 • 9
Re-thinking Temporal Search for Long-Form Video Understanding Paper • 2504.02259 • Published Apr 3 • 1
Into the Unknown Unknowns: Engaged Human Learning through Participation in Language Model Agent Conversations Paper • 2408.15232 • Published Aug 27, 2024 • 1
LEMONADE: A Large Multilingual Expert-Annotated Abstractive Event Dataset for the Real World Paper • 2506.00980 • Published Jun 1
MedCaseReasoning: Evaluating and learning diagnostic reasoning from clinical case reports Paper • 2505.11733 • Published May 16 • 7
Self-Generated In-Context Examples Improve LLM Agents for Sequential Decision-Making Tasks Paper • 2505.00234 • Published May 1 • 26