An Empirical Study of Testing Practices in Open Source AI Agent Frameworks and Agentic Applications Paper • 2509.19185 • Published Sep 23 • 3
Model Context Protocol (MCP) at First Glance: Studying the Security and Maintainability of MCP Servers Paper • 2506.13538 • Published Jun 16 • 1
Agentic Software Engineering: Foundational Pillars and a Research Roadmap Paper • 2509.06216 • Published Sep 7 • 7
From Hugging Face to GitHub: Tracing License Drift in the Open-Source AI Ecosystem Paper • 2509.09873 • Published Sep 11 • 2
On the Use of Agentic Coding: An Empirical Study of Pull Requests on GitHub Paper • 2509.14745 • Published Sep 18 • 4
Developer-LLM Conversations: An Empirical Study of Interactions and Generated Code Quality Paper • 2509.10402 • Published Sep 12 • 5
ZeroBench: An Impossible Visual Benchmark for Contemporary Large Multimodal Models Paper • 2502.09696 • Published Feb 13 • 43