fix(cli): serialize subagent execution to prevent tool-permission leak by ImmortalDemonGod · Pull Request #12548 · continuedev/continue

ImmortalDemonGod · 2026-06-04T06:18:38Z

Description

Serializes subagent execution to close a tool-permission escalation race. executeSubAgent reads the main agent's TOOL_PERMISSIONS, flips the shared state to allow-all so the subagent can call any tool, then restores the captured main permissions in a finally. With two subagents running concurrently the read/restore interleaves: the second reads the allow-all state as its "main" and later restores that, leaving the main agent permanently escalated to allow-all. This wraps execution in a single-promise chain (withSubagentExecutionLock) so only one subagent runs at a time and the restore is never lost.

Checklist

I've read the contributing guide
The relevant docs, if any, have been updated or created
The relevant tests, if any, have been updated or created

Screen recording or screenshot

N/A — internal CLI logic fix (subagent-executor concurrency); no user-facing UI change.

Tests

Added extensions/cli/src/subagent/executor.test.ts — a behavioral concurrency test that drives the exact "A parks mid-stream with allow-all set, B starts" interleave over parsed permission objects and asserts the final TOOL_PERMISSIONS policy equals the original main policy. It fails before the fix (final state stuck at allow-all) and passes after. npm run lint and npm test pass in extensions/cli.

_{Full forensic verification evidence (AIV-L3) is in the comment below.}

Summary by cubic

Serializes subagent execution in the CLI to prevent a tool-permission escalation race. Only one subagent runs at a time, ensuring the main agent’s TOOL_PERMISSIONS are always restored.

Bug Fixes
- Wrapped executeSubAgent with withSubagentExecutionLock (single-promise chain) for exclusive execution.
- Removed the read/restore interleave that could leave the main agent stuck at allow-all.
- Added a deterministic concurrency test validating final TOOL_PERMISSIONS match the original policies.

^{Written for commit b62cc9c. Summary will update on new commits.}

executeSubAgent temporarily flips the shared tool-permission state to allow-all and restores it in a finally. Two concurrent subagents interleave the read/restore: the second reads the allow-all state as 'main' and restores THAT, leaving the main agent permanently escalated to allow-all. Wrap execution in a single-promise chain (withSubagentExecutionLock) so only one subagent runs at a time and the restore is never lost. Adds a deterministic concurrency test that fails before this change and passes after.

github-actions · 2026-06-04T06:18:50Z

All contributors have signed the CLA ✍️ ✅
_{Posted by the CLA Assistant Lite bot.}

ImmortalDemonGod · 2026-06-04T06:18:58Z

AIV-L3 Verification Packet — `F-CO-SUBAGENT-RACE`

Independent forensic verification evidence for the fix in this PR, provided as a gift by Black Box Research Labs. The change itself is described above; this comment is the full evidence trail (execution, SHA-pinned referential, negative, and provenance).

AIV Verification Packet (v2.1)

Identification

Field	Value
Repository	github.com/ImmortalDemonGod/continue
Finding ID	`F-CO-SUBAGENT-RACE` — concurrent subagent permission-state escalation
Branch	`fix/serialize-subagent-executor` → `main`
Head SHA	`b62cc9c20296939e70deac55d5bcef7505ce5f32`
Base SHA	`cb273098d968906d25ee737b454f0b5f13ea2482`

Classification Record

risk_tier: R3
sod_mode: S0 # self-verification; independent S1 (maintainer review) pending — see Known Limitations
critical_surfaces:
  - Authorization / Privilege Boundaries — the defect causes the main agent's tool-permission policy to
    escalate to allow-all (`{tool: "*", permission: "allow"}`) and persist; the fix governs that path.
blast_radius: component # the CLI subagent executor; consumed by parallel-subagent dispatch
classification_rationale: >
  Mandatory R3 per §5.2: the change governs the Authorization/Privilege-Boundary critical surface
  (tool-permission state set/restore). Full evidence floor A+B+C+D+E+F collected below.
classified_by: Miguel Ingram
classified_at: "2026-06-03"
evidence_floor: A + B + C + D + E + F

Claims

C-1 — Concurrent executeSubAgent calls can no longer interleave the TOOL_PERMISSIONS read/restore: execution is serialized behind withSubagentExecutionLock, so the main agent's permissions are always restored (never left at allow-all).
C-2 — No behavior change for a single subagent and no public-API change: the exported executeSubAgent(options): Promise<SubAgentResult> signature is unchanged; its original body is moved verbatim to executeSubAgentImpl and invoked through the lock.
C-3 — The permission set/restore logic and values are unchanged; only the critical section is made mutually exclusive.

Evidence

Class A — Execution Evidence

Local run under the repo's real test runner (vitest 3.2.4, node v26.0.0; @continuedev/* workspace deps installed in extensions/cli):
- Passes-after (fix @ b62cc9c): npx vitest run src/subagent/executor.test.ts → Test Files 1 passed (1), Tests 1 passed (1) (1 passed / 0 failed / 0 skipped).
- Fails-before (base executor.ts @ cb273098, test kept): same command → Test Files 1 failed (1) — AssertionError: expected [ { tool: '*', permission: 'allow' } ] to deeply equal [ { tool: 'read', … } ], i.e. the main-agent permission policy is left stuck at allow-all (the exact escalation). A true semantic-negative in the real runner.
- Cross-check: a standalone deterministic simulation of the set→run→restore interleave independently reproduces BEFORE → [{"tool":"*","permission":"allow"}] (corrupted) / AFTER → [{"tool":"read","permission":"allow"}] (restored).
CI — head-bound run from continue's own pipeline (immutable, run https://github.com/ImmortalDemonGod/continue/actions/runs/26863515602, verified head_sha=b62cc9c20296… — A-001/A-002): continue's test (macos-latest, 18/20/22/24) jobs — which run the extensions/cli vitest suite including this executor.test.ts — all pass, alongside prettier-check, the VSIX builds, and the e2e suite (45 pass, 0 relevant fail). This is an immutable execution artifact bound to the head, satisfying the Class A immutable-CI requirement for this R3 change. (1 unrelated red — jetbrains-tests, the IDE plugin, outside this extensions/cli-only diff — see §10.)
Static analysis: esbuild parses both files; prettier-check passes in CI.
CodeRabbit posted a substantive auto-summary walkthrough (accurately describing the concurrency test) with 0 inline review comments — no actionable findings surfaced (no explicit verdict line at this head).

Class B — Referential Evidence (SHA-bound to `b62cc9c`)

withSubagentExecutionLock (the serialization primitive):
https://github.com/ImmortalDemonGod/continue/blob/b62cc9c20296939e70deac55d5bcef7505ce5f32/extensions/cli/src/subagent/executor.ts#L72-L89
executeSubAgent wrapper (public entry, unchanged signature) → delegates to the impl through the lock:
https://github.com/ImmortalDemonGod/continue/blob/b62cc9c20296939e70deac55d5bcef7505ce5f32/extensions/cli/src/subagent/executor.ts#L101-L116
executeSubAgentImpl (the original body, renamed):
https://github.com/ImmortalDemonGod/continue/blob/b62cc9c20296939e70deac55d5bcef7505ce5f32/extensions/cli/src/subagent/executor.ts#L118
Concurrency test:
https://github.com/ImmortalDemonGod/continue/blob/b62cc9c20296939e70deac55d5bcef7505ce5f32/extensions/cli/src/subagent/executor.test.ts
Scope inventory (B-003, matches git diff --name-only cb273098..b62cc9c):
- extensions/cli/src/subagent/executor.ts
- extensions/cli/src/subagent/executor.test.ts

Class C — Negative Evidence

Test integrity: executor.test.ts is a net-new test file — there were no prior tests for executor.ts to remove or weaken (git show cb273098:extensions/cli/src/subagent/ lists no executor.test.ts). The diff is additive.
Semantic test design (not grep): the test mocks the ServiceContainer/services/stream and drives the exact interleave (A parks mid-stream with allow-all set; B starts) over parsed permission objects, asserting the final TOOL_PERMISSIONS policy equals the original main policy — not a text match.
No new skips / no .only: the test file contains a single active test(...); no describe.skip/it.only.
CodeRabbit: posted a substantive walkthrough for head b62cc9c (accurately describing the concurrency test) with 0 inline review comments — no actionable findings surfaced (the explicit summary "No actionable comments" verdict line is not present at this head). Test integrity rests on the additive net-new test + the executed vitest run.

Class D — Differential Evidence (per surface)

API surface — UNCHANGED. git diff cb273098..b62cc9c -- executor.ts shows no +/- on the export async function executeSubAgent( signature line; params and Promise<SubAgentResult> return are identical. The original function body is relocated verbatim to executeSubAgentImpl (rename + indentation), confirmable by exhibition in the diff.
Security / Authorization surface — bounded. The only semantic change is serialization of the existing set→run→restore critical section. The permission values ({tool: "*", permission: "allow"} during run; mainAgentPermissionsState on restore) and the serviceContainer.set(SERVICE_NAMES.TOOL_PERMISSIONS, …) calls are byte-identical; no new grant, role, or policy is introduced.
Concurrency / throughput surface — CHANGED (disclosed). The fix serializes subagent execution: subagents that previously could run concurrently now execute strictly one-at-a-time (the lock await previous before running each fn). This is the intended trade-off — correctness (no permission-escalation race on the shared TOOL_PERMISSIONS state) over parallelism — and is signalled by the PR title ("serialize"). The throughput impact is bounded to subagent dispatch; the deeper alternative (per-subagent permission scoping that would preserve parallelism without the shared-state race) is the larger redesign noted out-of-scope in Class E, deferred.
Dependencies — none. No package.json/lockfile change (the lock is a local promise chain; no new import).
Data schema / Config — none.
UNTOUCHED (blast-radius bound): the diff is confined to executor.ts + executor.test.ts; ServiceContainer.ts, ToolPermissionService.ts, services/index.ts, and all other files are absent from the diff.

Class E — Intent Evidence

Requirement source (capability-gated audit report): [Black Box forensic audit report — shared privately with maintainers on request] — finding F-CO-SUBAGENT-RACE traces the global-state desync in executeSubAgent (read/mutate/restore of TOOL_PERMISSIONS under parallel dispatch).
Reference immutability (addresses E-F1b): the audit report is served from a static, content-addressable deploy. Snapshot obligation: the finding text is pinned to the build that produced this audit page; Black Box retains the immutable source-of-record (the finding JSON at the deploy SHA). At L1/L2 a token-gated URL is permitted with this obligation declared (§6.6.2.1).
Requirement → claim → evidence: permission-escalation race → C-1 (serialized) + C-2 (no API change) + C-3 (values unchanged) → Class A result + Class B permalinks + Class D diff.
Acceptance: ✅ concurrent subagents can no longer strand allow-all; ✅ public API preserved; (out of scope: a deeper redesign to scope permissions per-subagent rather than mutate shared global state — a larger change deferred).

Class F — Provenance Evidence

SHA-256 manifest of the functional artifacts @ b62cc9c (immutability mechanism per §3.3):

dbb3b3ea83273c6d06c2944f46ed27bcfb892e012e056697b93c3a68a63f0af2  extensions/cli/src/subagent/executor.ts
e3ddeeaa72ce7dc52939e74dc8fb8fc0a045985ca21d06a7b25e83600fd590cb  extensions/cli/src/subagent/executor.test.ts

Builder identity: sole author Miguel Ingram across all branch commits (git log cb273098..b62cc9c --format=%an | sort -u → 1 author).
Cryptographic signature (F-005, labeled unsigned): commits are not GPG/Sigstore-signed in this environment; integrity is provided by the SHA-256 manifest + the public commit chain bound to b62cc9c. GPG/Sigstore signing is a documented gap (see Known Limitations).

§10 Exception — Unrelated CI red (`jetbrains-tests`), severity downgraded

Check: jetbrains-tests (and its require-all-checks-to-pass cascade) is red in continue's CI run at head b62cc9c.
Why it is not this change: jetbrains-tests exercises the JetBrains IDE plugin; this diff touches only extensions/cli/src/subagent/ (confirmed by git diff --name-only cb273098..b62cc9c → 2 files). The CLI test jobs that actually cover this change (test (macos-latest, 18–24), which run the vitest suite incl. executor.test.ts) all pass, as does prettier-check.
Containment: PR fail-set ⊆ base fail-set — the JetBrains plugin tests fail on the fork independent of this CLI change. No action required in this diff. Class A's immutable-CI requirement (A-001/A-002) is now satisfied by the head-bound CLI test run cited above — this §10 covers only the disjoint JetBrains red.

Known Limitations

SoD = S0. Author and verifier are the same identity; independent S1 (the upstream maintainer's review) pending. R3 requires S1 (§5.4); the maintainer review is the intended S1 step.
Class G omitted per §6.8.7 (no pre-code prediction; not fabricated post-hoc).
GPG/Sigstore not wired: Class F integrity rests on the SHA-256 manifest + public commit chain rather than a cryptographic signature (F-005).
Docstrings: withSubagentExecutionLock, the executeSubAgent wrapper, and executeSubAgentImpl all carry full TSDoc (@param/@returns/@typeParam/@remarks) explaining the serialization and the escalation it prevents; the concurrency test is documented with the bug it catches.

Summary

Metric	Value
Files changed	2 (`subagent/executor.ts`, `subagent/executor.test.ts`)
Lines	+157 / −2 (lock primitive + thin wrapper + rename; new concurrency test)
Risk tier	R3 (Authorization critical surface; S0; A+B+C+D+E+F)
Behavior impact	Concurrent subagents can no longer strand main-agent permissions at allow-all; public API unchanged
Test impact	+1 net-new behavioral concurrency test; 0 removals

Provided as a gift to support Continue's engineering velocity.

ImmortalDemonGod · 2026-06-04T06:19:01Z

I have read the CLA Document and I hereby sign the CLA

cubic-dev-ai

No issues found across 2 files

_{Re-trigger cubic}

ImmortalDemonGod · 2026-06-04T06:38:21Z

Quick note on the red checks, in case it helps triage:

The only failing CLI test is test (macos-latest, 24). The other 15 jobs in that matrix (macOS 18/20/22, ubuntu 18/20/22/24, windows 18/20/22/24) all pass on this same commit, and the diff is confined to extensions/cli/src/subagent/executor.ts plus its test, so this reads as a flake rather than anything in the change. I don't have permission to re-run it from here; a maintainer re-run should clear it.

The Continuous AI checks look like they're waiting on the first-run approval external PRs need. Happy to address anything real that surfaces from them. Thanks for taking a look.

ImmortalDemonGod requested a review from a team as a code owner June 4, 2026 06:18

github-project-automation Bot added this to Issues and PRs Jun 4, 2026

github-project-automation Bot moved this to Todo in Issues and PRs Jun 4, 2026

dosubot Bot added the size:M This PR changes 30-99 lines, ignoring generated files. label Jun 4, 2026

cubic-dev-ai Bot reviewed Jun 4, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(cli): serialize subagent execution to prevent tool-permission leak#12548

fix(cli): serialize subagent execution to prevent tool-permission leak#12548
ImmortalDemonGod wants to merge 1 commit into
continuedev:mainfrom
ImmortalDemonGod:fix/serialize-subagent-executor

ImmortalDemonGod commented Jun 4, 2026 •

edited by cubic-dev-ai Bot

Loading

Uh oh!

github-actions Bot commented Jun 4, 2026 •

edited

Loading

Uh oh!

ImmortalDemonGod commented Jun 4, 2026

Uh oh!

ImmortalDemonGod commented Jun 4, 2026

Uh oh!

cubic-dev-ai Bot left a comment

Uh oh!

ImmortalDemonGod commented Jun 4, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

ImmortalDemonGod commented Jun 4, 2026 • edited by cubic-dev-ai Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Checklist

Screen recording or screenshot

Tests

Summary by cubic

Uh oh!

github-actions Bot commented Jun 4, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ImmortalDemonGod commented Jun 4, 2026

AIV-L3 Verification Packet — F-CO-SUBAGENT-RACE

AIV Verification Packet (v2.1)

Identification

Classification Record

Claims

Evidence

Class A — Execution Evidence

Class B — Referential Evidence (SHA-bound to b62cc9c)

Class C — Negative Evidence

Class D — Differential Evidence (per surface)

Class E — Intent Evidence

Class F — Provenance Evidence

§10 Exception — Unrelated CI red (jetbrains-tests), severity downgraded

Known Limitations

Summary

Uh oh!

ImmortalDemonGod commented Jun 4, 2026

Uh oh!

cubic-dev-ai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

ImmortalDemonGod commented Jun 4, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

ImmortalDemonGod commented Jun 4, 2026 •

edited by cubic-dev-ai Bot

Loading

github-actions Bot commented Jun 4, 2026 •

edited

Loading

AIV-L3 Verification Packet — `F-CO-SUBAGENT-RACE`

Class B — Referential Evidence (SHA-bound to `b62cc9c`)

§10 Exception — Unrelated CI red (`jetbrains-tests`), severity downgraded