Conversation
…chain docs, README overhaul
… call uses 4096 default
…vation; debug logs to stderr
…nside Claude Code
Based on Claude Code source review:
- Grep/Glob: output relative paths instead of absolute (saves tokens)
- Grep: add --max-columns 500 to prevent base64/minified line bloat

Glob task tokens: 2,191 → 761 (-65%).
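The relative-path conversion could look roughly like this (a minimal sketch; the helper name is hypothetical, and the actual Grep/Glob implementation is not shown in this log):

```typescript
import * as path from "node:path";

// Convert absolute match paths to paths relative to the working
// directory before emitting them to the model. Shorter strings per
// result line is where the token savings come from.
export function toRelativePaths(workingDir: string, absPaths: string[]): string[] {
  // path.relative returns "" when the paths are equal; emit "." instead.
  return absPaths.map((p) => path.relative(workingDir, p) || ".");
}
```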
…cking (v2.5.8) Based on Claude Code source review:
- context.ts: memoize assembleInstructions per workingDir; cap git log at 2KB
- webfetch.ts: 15-min session cache (50 entries); no re-fetching the same URL
- edit.ts: normalize curly quotes before failing (reduces false edit errors)
- read.ts: track partial reads; edit warns when the file was only partially read
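The 15-minute, 50-entry session cache could be sketched like this (an assumed shape; the real webfetch.ts internals are not shown in this log):

```typescript
const TTL_MS = 15 * 60 * 1000; // 15-minute session cache
const MAX_ENTRIES = 50;

interface CacheEntry { body: string; fetchedAt: number; }

export class FetchCache {
  private entries = new Map<string, CacheEntry>();

  get(url: string, now: number = Date.now()): string | undefined {
    const hit = this.entries.get(url);
    if (!hit) return undefined;
    if (now - hit.fetchedAt > TTL_MS) { // expired: drop and report a miss
      this.entries.delete(url);
      return undefined;
    }
    return hit.body;
  }

  set(url: string, body: string, now: number = Date.now()): void {
    if (this.entries.size >= MAX_ENTRIES && !this.entries.has(url)) {
      // Evict the oldest insertion (Map preserves insertion order).
      const oldest = this.entries.keys().next().value;
      if (oldest !== undefined) this.entries.delete(oldest);
    }
    this.entries.set(url, { body, fetchedAt: now });
  }
}
```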
…e fix + e2e tests (v2.5.11)
…ost unit (v2.5.14)
…e summaries (v2.5.16)
…mitted on turn done (v2.5.17)
…fresh, E2E tests (v2.5.18)
- Scrollback: commit full responses to Static immediately; show a last-5-line preview in the dynamic area so the terminal scrollback buffer captures all output
- User-Agent: centralize USER_AGENT in config.ts, apply to the LLM client and WebFetch tool
- Balance: re-fetch after every turn_done so the status bar reflects real spend
- Session cost: live spend shown in the status bar (-$X.XXXX in yellow)
- Type-ahead: queue input while the agent is busy, auto-submit on turn_done
- E2E test suite: 13 tests covering all core tools plus session cost tracking (node:test, no extra deps)
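The type-ahead behavior above could be sketched as follows (names are illustrative, not the actual source):

```typescript
// Input typed while the agent is busy is buffered and auto-submitted
// in order as each turn_done arrives.
export class TypeAheadQueue {
  private pending: string[] = [];
  private busy = false;

  constructor(private submit: (text: string) => void) {}

  onUserInput(text: string): void {
    if (this.busy) {
      this.pending.push(text); // agent busy: queue for later
    } else {
      this.busy = true;
      this.submit(text);
    }
  }

  onTurnDone(): void {
    const next = this.pending.shift();
    if (next !== undefined) {
      this.submit(next); // auto-submit the oldest queued input
    } else {
      this.busy = false; // nothing queued: go idle
    }
  }
}
```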
- Live balance: compute the display balance as fetchedBalance - sessionCost in real time; no stale on-chain lag
- zai/glm-5 pricing: switch from per-token ($1.0/$3.2 per M tokens) to per-call ($0.001/call), matching actual BlockRun billing
- Track the LLM call count per turn, emit it in the usage event, accumulate it in session stats
- Show the call count in the session footer: "109,795 in / 300 calls · $0.30 session"
- estimateCost: add perCall support for flat-rate models
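The perCall branch in estimateCost could look roughly like this (a sketch; the rates are the ones quoted in the changelog, but the pricing-table shape is assumed):

```typescript
interface Pricing {
  perCall?: number;      // flat $ per LLM call (e.g. 0.001 for zai/glm-5)
  inputPerMTok?: number; // $ per million input tokens
  outputPerMTok?: number; // $ per million output tokens
}

export function estimateCost(
  p: Pricing,
  inputTokens: number,
  outputTokens: number,
  calls: number,
): number {
  // Flat-rate models ignore token counts entirely.
  if (p.perCall !== undefined) return calls * p.perCall;
  return (
    (inputTokens * (p.inputPerMTok ?? 0)) / 1e6 +
    (outputTokens * (p.outputPerMTok ?? 0)) / 1e6
  );
}
```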
- Prompt caching: add cache_control ephemeral markers on the system prompt, tools, and penultimate message for anthropic/* models; cuts input token costs ~90% on cache hits
- Add the anthropic-beta: prompt-caching-2024-07-31 header automatically for Anthropic models
- README: comprehensive Solana/Base payment docs: setup, funding sources, chain switching, USDC acquisition
- README: add `runcode base` / `runcode solana` commands to the reference table
- GLM-specific request tuning: temperature=0.8 default per official zai spec
- Thinking mode: auto-enable {"thinking": {"type": "enabled"}} for -thinking- model variants
- Add zai/glm-5-turbo to model picker with shortcut glm-turbo
- Add glm5/glm-turbo shortcuts to model resolver
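The GLM request tuning and shortcut resolution above could be sketched as follows (illustrative function names; the real resolver table is not shown in this log):

```typescript
// Per-model request extras: zai's documented default temperature, plus
// the thinking toggle for -thinking- model variants.
export function buildGlmRequestExtras(model: string): Record<string, unknown> {
  const extras: Record<string, unknown> = { temperature: 0.8 };
  if (model.includes("-thinking-")) {
    extras.thinking = { type: "enabled" }; // auto-enable thinking mode
  }
  return extras;
}

// Shortcut → full model id; unknown ids pass through unchanged.
const MODEL_SHORTCUTS: Record<string, string> = {
  glm5: "zai/glm-5",
  "glm-turbo": "zai/glm-5-turbo",
};

export function resolveModel(idOrShortcut: string): string {
  return MODEL_SHORTCUTS[idOrShortcut] ?? idOrShortcut;
}
```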
…EADME with GLM-5/caching docs (v2.5.23)
…call count (v2.5.24)
…mmand-aware git/test/build filters (v2.5.26)
…ing permission prompt (v2.5.28)
- Updated .gitignore to include VS Code and distribution files.
- Added exports for VS Code integration in package.json.
- Expanded README with installation instructions for the VS Code extension.
- Introduced new functions for plain-text banner lines and footer for non-terminal UIs.
- Enhanced model management commands to support listing models and switching via index or name.
- Updated StreamEvent types to include status updates for UI hosts.
1bcMax
left a comment
Thanks for the PR — good feature choices. /history, /delete, and the numbered model picker are all real user pain points. The headless session API is the right abstraction for multi-host support.
A few things before I can review further:
- Remove `dist/` from the PR. Compiled output shouldn't be committed; it'll be generated by CI. Please also remove `vscode-extension/package-lock.json` and `vscode-extension/out/` if present.
- Split `extension.ts` (782 lines) into smaller modules. Suggestion: `webview-provider.ts`, `session-manager.ts`, and `ui-renderer.ts`. One file doing webview setup + session management + HTML rendering + balance tracking is too much to review or maintain.
- Version bump should be a separate commit (2.5.28 → 2.5.29), not mixed into the feature.
- Can you explain the design decision behind the headless session API (`vscode-session.ts`): why decouple via `getUserInput` + `onEvent` callbacks rather than importing the agent loop directly? Want to make sure we're aligned on the architecture before this ships.
Looking forward to the updated PR.
Add a VS Code sidebar extension (vscode-extension/) that provides the full RunCode agent experience inside VS Code: chat panel, tool execution, model switching, and all slash commands
event using local cost estimation (same approach as the Ink terminal UI), with RPC sync on turn
completion
I/O, enabling any host (VS Code, future web UI, etc.) to drive sessions via getUserInput + onEvent
via console.error in TTY), allowing VS Code and other non-terminal hosts to display the picker
stderr
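The getUserInput + onEvent decoupling described above could look roughly like this (a sketch; interface names beyond `getUserInput`/`onEvent` are hypothetical):

```typescript
export interface StreamEvent { type: string; [k: string]: unknown; }

// A host is anything that can supply prompts and render events:
// the Ink terminal UI, the VS Code webview, a future web UI.
export interface SessionHost {
  getUserInput(): Promise<string>; // host supplies the next prompt
  onEvent(event: StreamEvent): void; // host renders each stream event
}

// One session loop drives any host through the same two callbacks,
// so every front end shares a single agent core.
export async function runSession(host: SessionHost, turns: number): Promise<void> {
  for (let i = 0; i < turns; i++) {
    const input = await host.getUserInput();
    host.onEvent({ type: "status", text: `processing: ${input}` });
    host.onEvent({ type: "turn_done" });
  }
}
```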
Test plan