Wenqi Shi's picture

3 9 8

Wenqi Shi

wshi83

·

https://wshi83.github.io

AI & ML interests

LLMs, Generative AI, Data-Centric AI

Recent Activity

upvoted a paper about 1 month ago

AceSearcher: Bootstrapping Reasoning and Search for LLMs via Reinforced Self-Play

liked a dataset about 2 months ago

MedAgentGym/MedAgentGym-Data

liked a model about 2 months ago

MedAgentGym/MedCopilot-7B

View all activity

Organizations

upvoted a paper about 1 month ago

AceSearcher: Bootstrapping Reasoning and Search for LLMs via Reinforced Self-Play

Paper • 2509.24193 • Published Sep 29 • 6

upvoted a paper 3 months ago

CellForge: Agentic Design of Virtual Cell Models

Paper • 2508.02276 • Published Aug 4 • 39

upvoted 2 papers 4 months ago

The Invisible Leash: Why RLVR May Not Escape Its Origin

Paper • 2507.14843 • Published Jul 20 • 84

Agent KB: Leveraging Cross-Domain Experience for Agentic Problem Solving

Paper • 2507.06229 • Published Jul 8 • 75

upvoted a paper 5 months ago

MedAgentGym: Training LLM Agents for Code-Based Medical Reasoning at Scale

Paper • 2506.04405 • Published Jun 4 • 7

upvoted a paper 6 months ago

MLE-Dojo: Interactive Environments for Empowering LLM Agents in Machine Learning Engineering

Paper • 2505.07782 • Published May 12 • 19

upvoted a paper 8 months ago

MedAgentsBench: Benchmarking Thinking Models and Agent Frameworks for Complex Medical Reasoning

Paper • 2503.07459 • Published Mar 10 • 16

upvoted a paper 9 months ago

Hephaestus: Improving Fundamental Agent Capabilities of Large Language Models through Continual Pre-Training

Paper • 2502.06589 • Published Feb 10 • 20

upvoted a paper over 1 year ago

ToolChain: Efficient Action Space Navigation in Large Language Models with A Search

Paper • 2310.13227 • Published Oct 20, 2023 • 14