fix(voice): don't re-issue in-flight tool calls on a new turn by longcw · Pull Request #6227 · livekit/agents

longcw · 2026-06-25T11:48:06Z

When a new generation fires while a tool from a previous turn is still running, the in-flight call has no entry in the turn's chat context, so the model re-issues it and duplicates side effects (e.g. booking a flight twice).

This injects an ephemeral in-progress function_call/function_call_output pair for the running tool calls (from the session-wide _RunningTasks registry, covering both activity- and session-scoped executors) into the copy of the chat context fed to the LLM only. It is recomputed every turn and never persisted or forwarded, so it is superseded as soon as the real output lands and a placeholder can never go stale.
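A minimal sketch of the copy-based injection described above, using stand-in types rather than the real livekit-agents classes. `Item`, `ChatContext`, `with_running_tool_calls`, and the placeholder text are all hypothetical names chosen for illustration, and the `running` mapping stands in for whatever the _RunningTasks registry exposes.

```python
from dataclasses import dataclass, field


@dataclass
class Item:
    # Stand-in for a chat item; NOT the livekit-agents type.
    type: str  # "message", "function_call", or "function_call_output"
    call_id: str = ""
    name: str = ""
    content: str = ""


@dataclass
class ChatContext:
    # Stand-in for a chat context; NOT the livekit-agents type.
    items: list[Item] = field(default_factory=list)

    def copy(self) -> "ChatContext":
        return ChatContext(items=list(self.items))


def with_running_tool_calls(chat_ctx: ChatContext, running: dict[str, str]) -> ChatContext:
    """Return a copy of chat_ctx with one placeholder pair per running call.

    `running` maps call_id -> tool name (a hypothetical shape for the registry).
    The original chat_ctx is left untouched, so nothing is persisted.
    """
    ctx = chat_ctx.copy()
    for call_id, name in running.items():
        ctx.items.append(Item("function_call", call_id, name))
        ctx.items.append(
            Item("function_call_output", call_id, name, "still running; do not call again")
        )
    return ctx
```

Because the copy is rebuilt from the live running set on every turn, a call that has finished simply stops producing a placeholder on the next turn.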

It also stops spawning a tool-response generation for an interrupted turn: that reply would be dropped by the interrupted check anyway and waste an LLM call. The completed tool outputs are committed to the chat context directly instead, so an interrupted tool's result is preserved rather than lost.
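The branch described above can be sketched roughly as follows. `finish_tool_turn` and its plain-list arguments are hypothetical simplifications; the real code works with `fnc_executed_ev._reply_required` and `speech_handle.interrupted`, as shown in the diff in the review below.

```python
def finish_tool_turn(
    chat_ctx: list,
    tool_messages: list,
    reply_required: bool,
    interrupted: bool,
) -> bool:
    """Return True when a tool-response generation should be scheduled.

    Otherwise the completed tool messages are committed to chat_ctx directly,
    so an interrupted tool's result is kept without spending an LLM call.
    """
    if reply_required and not interrupted:
        # Assumption: the reply turn receives tool_messages and commits them itself.
        return True
    chat_ctx.extend(tool_messages)
    return False
```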

devin-ai-integration

Devin Review found 1 potential issue.

devin-ai-integration · 2026-06-25T11:54:58Z


            tool_messages = new_calls + new_fnc_outputs
-            if fnc_executed_ev._reply_required:
+            if fnc_executed_ev._reply_required and not speech_handle.interrupted:


🚩 Interruption guard only applied to the pipeline path, not the realtime path

The new and not speech_handle.interrupted guard at livekit-agents/livekit/agents/voice/agent_activity.py:3130 prevents scheduling a tool response when the speech was interrupted during tool execution. However, the analogous realtime model path at livekit-agents/livekit/agents/voice/agent_activity.py:3781-3804 does not have this guard. The realtime path's fnc_executed_ev._reply_required check at line 3782 does not account for interruption. This is a pre-existing inconsistency (not introduced by this PR), but worth noting since this PR explicitly addresses the pipeline case.

When a new generation fires while a tool from a previous turn is still running, the in-flight call has no entry in the turn's chat context, so the model re-issues it and duplicates side effects.

This injects an in-progress function_call/output pair for the running tool calls (from the session-wide _RunningTasks registry, covering both activity- and session-scoped executors) before the LLM call, so the model leaves the call alone. The pair is flagged and stripped again before the context is forwarded to the tool-reply turn, which re-injects from the live running set, so a placeholder is never persisted and can't go stale. Injecting in place rather than into a copy preserves any edits a custom llm_node makes to the context.

It also stops spawning a tool-response generation for an interrupted turn: that reply would be dropped by the interrupted check anyway and waste an LLM call. The completed tool outputs are committed to the chat context directly instead, so an interrupted tool's result is preserved rather than lost.
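A rough sketch of the flag-and-strip variant described above, again with stand-in types. `Item`, `PLACEHOLDER_FLAG`, and the two helper functions are hypothetical; they only mirror the idea behind `_inject_running_tool_calls` and `_strip_running_tool_calls` and are not their actual implementations.

```python
from dataclasses import dataclass, field


@dataclass
class Item:
    # Stand-in for a chat item; NOT the livekit-agents type.
    type: str
    call_id: str = ""
    content: str = ""
    extra: dict = field(default_factory=dict)


PLACEHOLDER_FLAG = "running_tool_placeholder"  # hypothetical marker key


def inject_running_tool_calls(items: list[Item], running_call_ids: list[str]) -> None:
    """Append a flagged function_call/function_call_output pair per running call, in place."""
    for call_id in running_call_ids:
        items.append(Item("function_call", call_id, extra={PLACEHOLDER_FLAG: True}))
        items.append(
            Item("function_call_output", call_id, "in progress", extra={PLACEHOLDER_FLAG: True})
        )


def strip_running_tool_calls(items: list[Item]) -> None:
    """Remove every flagged placeholder in place, leaving all other items untouched."""
    items[:] = [i for i in items if not i.extra.get(PLACEHOLDER_FLAG)]
```

Mutating the same list, rather than a copy, is what lets edits made by a custom llm_node survive into the forwarded context once the placeholders are stripped.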

gyx09212214-prog

Thanks for tackling the duplicate in-flight tool call case. One edge case I think still needs coverage: _inject_running_tool_calls(chat_ctx, ...) mutates chat_ctx before the LLM call, but _strip_running_tool_calls(chat_ctx) currently only runs inside the _reply_required and not speech_handle.interrupted branch. If a turn is interrupted, or if _reply_required is false, the injected function_call/function_call_output pair can remain in this activity's chat context even though the PR description says placeholders are stripped and never persisted/stale.

Could we move the strip into a finally/common cleanup path after inference, or add a regression test for interrupted / no-reply-required tool completion showing the placeholder is removed from chat_ctx?
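The first option suggested above, stripping in a common cleanup path, might look roughly like this. It is a sketch of the reviewer's proposal only, not of what was merged; `run_turn`, the dict-based items, and the `placeholder` key are hypothetical.

```python
from typing import Callable


def run_turn(items: list[dict], running_call_ids: list[str], llm: Callable[[list[dict]], str]) -> str:
    """Inject placeholders, run inference, and always strip them afterwards."""
    for call_id in running_call_ids:
        items.append({"type": "function_call", "call_id": call_id, "placeholder": True})
        items.append(
            {
                "type": "function_call_output",
                "call_id": call_id,
                "output": "in progress",
                "placeholder": True,
            }
        )
    try:
        return llm(items)
    finally:
        # Runs on normal completion and on exceptions alike, so no placeholder
        # outlives the inference call regardless of how the turn ends.
        items[:] = [i for i in items if not i.get("placeholder")]
```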

longcw · 2026-06-26T06:12:42Z

currently only runs inside the _reply_required and not speech_handle.interrupted branch. If a turn is interrupted, or if _reply_required is false, the injected function_call/function_call_output pair can remain in this activity's chat context even though the PR description says placeholders are stripped and never persisted/stale.

The chat_ctx it modifies is a copy and is discarded after the pipeline generation task. The only case where it is reused is when it is passed to the next tool-reply turn.

longcw requested a review from a team as a code owner June 25, 2026 11:48

devin-ai-integration Bot reviewed Jun 25, 2026

longcw force-pushed the longc/inflight-tool-placeholder branch from 09fd12d to 7e6e03f June 25, 2026 13:25

gyx09212214-prog reviewed Jun 25, 2026

davidzhao approved these changes Jun 26, 2026

Bobronium approved these changes Jun 26, 2026

longcw merged commit 3f95602 into main Jun 26, 2026
26 checks passed

longcw deleted the longc/inflight-tool-placeholder branch June 26, 2026 11:53

rosetta-livekit-bot Bot mentioned this pull request Jun 26, 2026

fix(voice): don't re-issue in-flight tool calls livekit/agents-js#1890

Open
