Agent - a JuanRafap Collection

SuperWriter: Reflection-Driven Long-Form Generation with Large Language Models

Paper • 2506.04180 • Published Jun 4 • 33

AniMaker: Automated Multi-Agent Animated Storytelling with MCTS-Driven Clip Generation

Paper • 2506.10540 • Published Jun 12 • 37

AutoMind: Adaptive Knowledgeable Agent for Automated Data Science

Paper • 2506.10974 • Published Jun 12 • 19

SPAR: Scholar Paper Retrieval with LLM-based Agents for Enhanced Academic Search

Paper • 2507.15245 • Published Jul 21 • 11

GUI-G^2: Gaussian Reward Modeling for GUI Grounding

Paper • 2507.15846 • Published Jul 21 • 132

ScreenCoder: Advancing Visual-to-Code Generation for Front-End Automation via Modular Multimodal Agents

Paper • 2507.22827 • Published Jul 30 • 98

AWorld: Dynamic Multi-Agent System with Stable Maneuvering for Robust GAIA Problem Solving

Paper • 2508.09889 • Published Aug 13 • 32

Seeing, Listening, Remembering, and Reasoning: A Multimodal Agent with Long-Term Memory

Paper • 2508.09736 • Published Aug 13 • 57

A Comprehensive Survey of Self-Evolving AI Agents: A New Paradigm Bridging Foundation Models and Lifelong Agentic Systems

Paper • 2508.07407 • Published Aug 10 • 97

Efficient Agents: Building Effective Agents While Reducing Cost

Paper • 2508.02694 • Published Jul 24 • 85

SSRL: Self-Search Reinforcement Learning

Paper • 2508.10874 • Published Aug 14 • 96

Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL

Paper • 2508.13167 • Published Aug 6 • 127

Mobile-Agent-v3: Foundamental Agents for GUI Automation

Paper • 2508.15144 • Published Aug 21 • 64

AgentFly: Fine-tuning LLM Agents without Fine-tuning LLMs

Paper • 2508.16153 • Published Aug 22 • 155

AgentScope 1.0: A Developer-Centric Framework for Building Agentic Applications

Paper • 2508.16279 • Published Aug 22 • 53

CODA: Coordinating the Cerebrum and Cerebellum for a Dual-Brain Computer Use Agent with Decoupled Reinforcement Learning

Paper • 2508.20096 • Published Aug 27 • 36

rStar2-Agent: Agentic Reasoning Technical Report

Paper • 2508.20722 • Published Aug 28 • 115

AWorld: Orchestrating the Training Recipe for Agentic AI

Paper • 2508.20404 • Published Aug 28 • 38

UItron: Foundational GUI Agent with Advanced Perception and Planning

Paper • 2508.21767 • Published Aug 29 • 12

GTA1: GUI Test-time Scaling Agent

Paper • 2507.05791 • Published Jul 8 • 26

UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning

Paper • 2509.02544 • Published Sep 2 • 123

Morae: Proactively Pausing UI Agents for User Choices

Paper • 2508.21456 • Published Aug 29 • 5

DeepResearch Arena: The First Exam of LLMs' Research Abilities via Seminar-Grounded Tasks

Paper • 2509.01396 • Published Sep 1 • 56

Paper2Agent: Reimagining Research Papers As Interactive and Reliable AI Agents

Paper • 2509.06917 • Published Sep 8 • 41

Scaling up Multi-Turn Off-Policy RL and Multi-Agent Tree Search for LLM Step-Provers

Paper • 2509.06493 • Published Sep 8 • 11

F1: A Vision-Language-Action Model Bridging Understanding and Generation to Actions

Paper • 2509.06951 • Published Sep 8 • 31

EnvX: Agentize Everything with Agentic AI

Paper • 2509.08088 • Published Sep 9 • 8

AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning

Paper • 2509.08755 • Published Sep 10 • 56

Alibaba-NLP/Tongyi-DeepResearch-30B-A3B

Text Generation • 31B • Updated Oct 10 • 13.9k • 775

WebResearcher: Unleashing unbounded reasoning capability in Long-Horizon Agents

Paper • 2509.13309 • Published Sep 16 • 67

Virtual Agent Economies

Paper • 2509.10147 • Published Sep 12 • 26

QuantAgent: Price-Driven Multi-Agent LLMs for High-Frequency Trading

Paper • 2509.09995 • Published Sep 12 • 14

hkust-nlp/WebExplorer-8B

Image-Text-to-Text • 8B • Updated Sep 11 • 287 • 12

VoiceAssistant-Eval: Benchmarking AI Assistants across Listening, Speaking, and Viewing

Paper • 2509.22651 • Published Sep 26 • 22

ACON: Optimizing Context Compression for Long-horizon LLM Agents

Paper • 2510.00615 • Published Oct 1 • 32

StockBench: Can LLM Agents Trade Stocks Profitably In Real-world Markets?

Paper • 2510.02209 • Published Oct 2 • 52

CoDA: Agentic Systems for Collaborative Data Visualization

Paper • 2510.03194 • Published Oct 3 • 28

Agent Learning via Early Experience

Paper • 2510.08558 • Published Oct 9 • 265

Training-Free Group Relative Policy Optimization

Paper • 2510.08191 • Published Oct 9 • 44

CoDA: Coding LM via Diffusion Adaptation

Paper • 2510.03270 • Published Sep 27 • 42

In-the-Flow Agentic System Optimization for Effective Planning and Tool Use

Paper • 2510.05592 • Published Oct 7 • 102

Don't Just Fine-tune the Agent, Tune the Environment

Paper • 2510.10197 • Published Oct 11 • 28

Demystifying Reinforcement Learning in Agentic Reasoning

Paper • 2510.11701 • Published Oct 13 • 31

Agentic Entropy-Balanced Policy Optimization

Paper • 2510.14545 • Published Oct 16 • 103

PokeeAI/pokee_research_7b

Text Generation • 8B • Updated Oct 23 • 109k • 100

MiniMaxAI/MiniMax-M2

Text Generation • 229B • Updated 18 days ago • 395k • 1.37k

moonshotai/Kimi-Linear-48B-A3B-Instruct

Text Generation • 49B • Updated 6 days ago • 410k • 492

HyperClick: Advancing Reliable GUI Grounding via Uncertainty Calibration

Paper • 2510.27266 • Published Oct 31 • 20

IterResearch: Rethinking Long-Horizon Agents via Markovian State Reconstruction

Paper • 2511.07327 • Published 21 days ago • 73

AIonopedia: an LLM agent orchestrating multimodal learning for ionic liquid discovery

Paper • 2511.11257 • Published 17 days ago • 24

CoMAS: Co-Evolving Multi-Agent Systems via Interaction Rewards

Paper • 2510.08529 • Published Oct 9 • 18

MarsRL: Advancing Multi-Agent Reasoning System via Reinforcement Learning with Agentic Pipeline Parallelism

Paper • 2511.11373 • Published 17 days ago • 12

UI2Code^N: A Visual Language Model for Test-Time Scalable Interactive UI-to-Code Generation

Paper • 2511.08195 • Published 20 days ago • 30

cerebras/MiniMax-M2-REAP-162B-A10B

Text Generation • 162B • Updated 17 days ago • 4.19k • 70

Agent0: Unleashing Self-Evolving Agents from Zero Data via Tool-Integrated Reasoning

Paper • 2511.16043 • Published 11 days ago • 97

What Does It Take to Be a Good AI Research Agent? Studying the Role of Ideation Diversity

Paper • 2511.15593 • Published 12 days ago • 54

Tongyi DeepResearch Technical Report

Paper • 2510.24701 • Published Oct 28 • 95

AgentFold: Long-Horizon Web Agents with Proactive Context Management

Paper • 2510.24699 • Published Oct 28 • 67

Search Self-play: Pushing the Frontier of Agent Capability without Supervision

Paper • 2510.18821 • Published Oct 21 • 17

Multi-Agent Deep Research: Training Multi-Agent Systems with M-GRPO

Paper • 2511.13288 • Published 14 days ago • 17

DRAFT-RL: Multi-Agent Chain-of-Draft Reasoning for Reinforcement Learning-Enhanced LLMs

Paper • 2511.20468 • Published 6 days ago

Unlocking the Power of Multi-Agent LLM for Reasoning: From Lazy Agents to Deliberation

Paper • 2511.02303 • Published 27 days ago • 1

AgentRL: Scaling Agentic Reinforcement Learning with a Multi-Turn, Multi-Task Framework

Paper • 2510.04206 • Published Oct 5 • 2

MARS: Reinforcing Multi-Agent Reasoning of LLMs through Self-Play in Strategic Games

Paper • 2510.15414 • Published Oct 17

Multi-Agent Tool-Integrated Policy Optimization

Paper • 2510.04678 • Published Oct 6 • 30

Agentic Learner with Grow-and-Refine Multimodal Semantic Memory

Paper • 2511.21678 • Published 5 days ago • 9