Party Golem

partygolem

AI & ML interests

None yet

Organizations

None yet

upvoted 6 papers 2 months ago

Does Math Reasoning Improve General LLM Capabilities? Understanding Transferability of LLM Reasoning

Paper • 2507.00432 • Published Jul 1 • 79

TiKMiX: Take Data Influence into Dynamic Mixture for Language Model Pre-training

Paper • 2508.17677 • Published Aug 25 • 14

R-4B: Incentivizing General-Purpose Auto-Thinking Capability in MLLMs via Bi-Mode Annealing and Reinforce Learning

Paper • 2508.21113 • Published Aug 28 • 109

Morae: Proactively Pausing UI Agents for User Choices

Paper • 2508.21456 • Published Aug 29 • 5

PVPO: Pre-Estimated Value-Based Policy Optimization for Agentic Reasoning

Paper • 2508.21104 • Published Aug 28 • 35

Meta-Reasoning Improves Tool Use in Large Language Models

Paper • 2411.04535 • Published Nov 7, 2024 • 1