In a Training Loop 🔄

5 51 103

Arthur EDMOND

Shumatsurontek

AI & ML interests

LLM & Computer Vision

Recent Activity

upvoted a paper 2 days ago

Agentic Reasoning for Large Language Models

upvoted a paper 3 days ago

Advances and Frontiers of LLM-based Issue Resolution in Software Engineering: A Comprehensive Survey

liked a model 5 days ago

openbmb/AgentCPM-Explore

View all activity

Organizations

upvoted a paper 2 days ago

Agentic Reasoning for Large Language Models

Paper • 2601.12538 • Published 6 days ago • 163

upvoted a paper 3 days ago

Advances and Frontiers of LLM-based Issue Resolution in Software Engineering: A Comprehensive Survey

Paper • 2601.11655 • Published 9 days ago • 59

upvoted a paper 11 days ago

GlimpRouter: Efficient Collaborative Inference by Glimpsing One Token of Thoughts

Paper • 2601.05110 • Published 16 days ago • 28

upvoted a paper about 1 month ago

Kling-Omni Technical Report

Paper • 2512.16776 • Published Dec 18, 2025 • 168

upvoted an article about 1 month ago

Article

Gotchas in Tokenizer Behavior Every Developer Should Know

Apr 18, 2025

•

upvoted an article about 2 months ago

Article

Welcome EmbeddingGemma, Google's new efficient embedding model

Sep 4, 2025

•

269

upvoted a paper about 2 months ago

From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence

Paper • 2511.18538 • Published Nov 23, 2025 • 294

upvoted 2 papers 2 months ago

MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling

Paper • 2511.11793 • Published Nov 14, 2025 • 186

Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B

Paper • 2511.06221 • Published Nov 9, 2025 • 133

upvoted 2 papers 3 months ago

Every Activation Boosted: Scaling General Reasoner to 1 Trillion Open Language Foundation

Paper • 2510.22115 • Published Oct 25, 2025 • 84

DeepAgent: A General Reasoning Agent with Scalable Toolsets

Paper • 2510.21618 • Published Oct 24, 2025 • 101

upvoted 2 papers 4 months ago

Paper2Video: Automatic Video Generation from Scientific Papers

Paper • 2510.05096 • Published Oct 6, 2025 • 119

Apriel-1.5-15b-Thinker

Paper • 2510.01141 • Published Oct 1, 2025 • 120

upvoted a collection 4 months ago

Granite 4.0 Language Models

Collection

13 items • Updated Nov 17, 2025 • 201

upvoted 6 papers 4 months ago

DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search

Paper • 2509.25454 • Published Sep 29, 2025 • 142

TruthRL: Incentivizing Truthful LLMs via Reinforcement Learning

Paper • 2509.25760 • Published Sep 30, 2025 • 55

EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning

Paper • 2509.22576 • Published Sep 26, 2025 • 135