Benchmark self-evolving Agent upon realistic large-scale file workspaces
benchmark dataset autonomous-agents ai-agents large-language-models llm file-dependencies workspace-learning
-
Updated
Jun 25, 2026 - Python