inclusionAI
/

ASearcher-Web-QwQ-V2

Model card Files Files and versions

Update README.md

#1

by samjia2000 - opened Sep 18

base: refs/heads/main

←

from: refs/pr/1

Discussion Files changed

Files changed (1) hide show

README.md +85 -3

README.md CHANGED Viewed

@@ -1,3 +1,85 @@
----
-license: apache-2.0
----

+# ASearcher-Web-QwQ-V2
+## Overview
+**ASearcher-Web-QwQ-V2** is a 32B-scale search agent trained using large-scale reinforcement learning. This model represents an improved version of the ASearcher framework, achieving cutting-edge performance on challenging web search benchmarks through advanced agentic RL training techniques.
+## Key Features
+- 🏆 **Cutting-Edge Performance**: Achieves Avg@4 scores of 58.7, 51.1, and 74.5 on GAIA, xBench-DeepSearch, and Frames benchmarks respectively
+- ⚡ **Fully Asynchronous RL Training**: Enables efficient long-horizon search capabilities with tool calls exceedind 100 rounds
+- 🔁 **Advanced Data Synthesis**: Trained on autonomously generated QA pairs with rigorous multi-stage validation
+- 🌐 **Real Web Search Capabilities**: Designed to interact with live web search tools for up-to-date information retrieval
+## Performance Highlights
+| Benchmark | Avg@4 Score | Pass@4 Score |
+|-----------|------------|-------------|
+| GAIA | 58.7 | 74.7 |
+| xBench-DeepSearch | 51.1 | 75.0 |
+| Frames | 74.5 | 85.5 |
+**Substantial RL Improvements**: Reinforcement learning training brings significant gains:
+- +15.0 improvement on GAIA
+- +22.4 improvement on xBench-DeepSearch
+- +14.6 improvement on Frames
+## Quick Start
+### Evaluation
+To reproduce the benchmark results:
+```bash
+cd evaluation/
+python search_eval_async.py \
+    --model_name_or_path inclusionAI/ASearcher-Web-QwQ-V2 \
+    --data_names GAIA,xbench-deepsearch,Frames \
+    --agent-type asearcher-reasoning \
+    --search-client-type async-web-search-access
+```
+## Training Details
+This model was trained using:
+- **Architecture**: QwQ-32B
+- **Training Method**: Fully asynchronous reinforcement learning
+- **Data**: Synthesized QA pairs with multi-stage validation
+- **Framework**: AReaL
+## Applications
+- Complex web search and information retrieval
+- Multi-step problem solving with tool usage
+- Real-time information gathering and synthesis
+- Long-horizon reasoning tasks
+## Citation
+If you use this model, please cite:
+```bibtex
+@misc{gao2025turnsunlockinglonghorizonagentic,
+      title={Beyond Ten Turns: Unlocking Long-Horizon Agentic Search with Large-Scale Asynchronous RL},
+      author={Jiaxuan Gao and Wei Fu and Minyang Xie and Shusheng Xu and Chuyi He and Zhiyu Mei and Banghua Zhu and Yi Wu},
+      year={2025},
+      eprint={2508.07976},
+      archivePrefix={arXiv},
+      primaryClass={cs.CL},
+      url={https://arxiv.org/abs/2508.07976},
+}
+```
+## License
+---
+license: apache-2.0
+---
+## Contact
+For questions and support, please refer to the [ASearcher GitHub repository](https://github.com/inclusionAI/ASearcher) or open an issue on the project page.