QE4PE: Word-level Quality Estimation for Human Post-Editing • Paper • 2503.03044 • Published Mar 4, 2025 • 6
Unsupervised Word-level Quality Estimation for Machine Translation Through the Lens of Annotators (Dis)agreement • Paper • 2505.23183 • Published May 29, 2025 • 1
Can Large Language Models Capture Human Annotator Disagreements? • Paper • 2506.19467 • Published Jun 24, 2025 • 18
COMET-poly: Machine Translation Metric Grounded in Other Candidates • Paper • 2508.18549 • Published Aug 25, 2025
Hearing to Translate: The Effectiveness of Speech Modality Integration into LLMs • Paper • 2512.16378 • Published 22 days ago • 7
Biased Tales: Cultural and Topic Bias in Generating Children's Stories • Paper • 2509.07908 • Published Sep 9, 2025
eth-nlped/Qwen2.5-1.5B-pedagogical-rewardmodel • Text Classification • 2B • Updated Nov 18, 2025 • 30 • 3
From Problem-Solving to Teaching Problem-Solving: Aligning LLMs with Pedagogy using Reinforcement Learning • Paper • 2505.15607 • Published May 21, 2025 • 3