Document DeepSeek agent loop 50 query eval#131

Merged: SonAIengine merged 1 commit into main from docs-deepseek-agent-loop-50 on Jul 2, 2026

@SonAIengine (Contributor)

Summary

  • add the DeepSeek Flash 50-query agent-loop eval artifacts
  • update the public scale report with the 50-query summary row
  • record delayed-discovery hits and high-call miss bottlenecks for next tuning

Results

  • DeepSeek Flash 50-query check: 23/50 reach (46%)
  • zero-tool answers: 0/50
  • duplicate tool calls: 0
  • empty tool calls: 11
  • delayed discovery hits: 178627, 87892, 1090242, 45924, 323998, 333486
  • high-call misses: 54544, 293992, 208145, 14151, 91711, 237373
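
The figures above can be cross-checked with a few lines of Python. This is only a sketch: the constant names are illustrative and not taken from the repository, and the numbers are copied by hand from the list rather than read from the eval artifacts. It derives the reach rate and confirms that no identifier appears in both the delayed-discovery and high-call-miss lists.

```python
# Figures copied from the Results list; names are illustrative assumptions.
QUERIES = 50
REACHED = 23
ZERO_TOOL_ANSWERS = 0
DELAYED_DISCOVERY_HITS = [178627, 87892, 1090242, 45924, 323998, 333486]
HIGH_CALL_MISSES = [54544, 293992, 208145, 14151, 91711, 237373]


def rate(count: int, total: int) -> float:
    """Return count / total as a fraction, guarding against an empty run."""
    if total <= 0:
        raise ValueError("total must be positive")
    return count / total


reach_rate = rate(REACHED, QUERIES)
zero_tool_rate = rate(ZERO_TOOL_ANSWERS, QUERIES)

# An identifier listed as both a hit and a miss would indicate a reporting error.
overlap = set(DELAYED_DISCOVERY_HITS) & set(HIGH_CALL_MISSES)

print(f"reach rate: {reach_rate:.0%}")
print(f"zero-tool answer rate: {zero_tool_rate:.0%}")
print(f"identifiers in both lists: {sorted(overlap)}")
```

The empty-tool-call count is left out on purpose: the list does not say whether 11 counts calls or queries, so no rate is derived from it.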

Verification

  • DeepSeek 50-query live eval with --llm-preset deepseek, --subset 50, --resume
  • rg -n "DEEPSEEK_API_KEY=|sk-" examples/ablation/diagnostics/agent_loop_20260702_201106.md examples/ablation/diagnostics/agent_loop_deepseek_v4_flash_50.jsonl examples/ablation/diagnostics/public_scale_20260702.md || true
  • git diff --check
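
For readers without `rg` installed, the secret scan above can be approximated in Python. This is a minimal sketch, not the project's tooling: `find_secret_like_lines` is a hypothetical helper, and it reuses only the pattern and the three file paths from the `rg` command. Like the original command with `|| true`, it never fails on its own, so the output still has to be read.

```python
import re
from pathlib import Path

# Same alternation as the rg command: an inline key assignment or an "sk-" prefix.
SECRET_PATTERN = re.compile(r"DEEPSEEK_API_KEY=|sk-")


def find_secret_like_lines(paths):
    """Return (path, line number, line) for every line matching SECRET_PATTERN.

    Hypothetical helper for illustration; paths that are not files are skipped.
    """
    hits = []
    for path in paths:
        p = Path(path)
        if not p.is_file():
            continue
        text = p.read_text(encoding="utf-8", errors="replace")
        for lineno, line in enumerate(text.splitlines(), start=1):
            if SECRET_PATTERN.search(line):
                hits.append((str(p), lineno, line.strip()))
    return hits


if __name__ == "__main__":
    artifacts = [
        "examples/ablation/diagnostics/agent_loop_20260702_201106.md",
        "examples/ablation/diagnostics/agent_loop_deepseek_v4_flash_50.jsonl",
        "examples/ablation/diagnostics/public_scale_20260702.md",
    ]
    for path, lineno, line in find_secret_like_lines(artifacts):
        print(f"{path}:{lineno}:{line}")
```

The `sk-` alternative is deliberately broad and will also match ordinary words such as `task-` or `risk-`, so any hit needs a manual look before it is treated as a leaked key.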

@SonAIengine merged commit 009ddb2 into main on Jul 2, 2026
2 checks passed
@SonAIengine deleted the docs-deepseek-agent-loop-50 branch on July 2, 2026, 11:13