deep research - a dearaj23 Collection

dearaj23 's Collections

memory

RL

LLM

CoT

survey

deep research

updated Oct 20

Scaling Agents via Continual Pre-training

Paper • 2509.13310 • Published Sep 16 • 115
WebSailor-V2: Bridging the Chasm to Proprietary Agents via Synthetic Data and Scalable Reinforcement Learning

Paper • 2509.13305 • Published Sep 16 • 90
In-the-Flow Agentic System Optimization for Effective Planning and Tool Use

Paper • 2510.05592 • Published Oct 7 • 102
Explore to Evolve: Scaling Evolved Aggregation Logic via Proactive Online Exploration for Deep Research Agents

Paper • 2510.14438 • Published Oct 16 • 13