feat(room-io): add json_format option for timed transcription output#5472
Conversation
Adds `json_format` to `TextOutputOptions` so the transcription stream on the `lk.transcription` topic emits each chunk as a JSON object with `text` and optional `start_time`/`end_time` fields when the chunk is a `TimedString`. This makes it easier for clients to consume TTS-aligned timed transcripts.
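A minimal sketch of what a consumer might see on the `lk.transcription` topic with `json_format` enabled, based on the description above (the exact payloads are illustrative, not taken from the implementation):

```python
import json

# Hypothetical stream contents: plain chunks carry only "text";
# TimedString chunks also carry "start_time"/"end_time" in seconds.
raw_stream = [
    '{"text": "hello ", "start_time": 0.0, "end_time": 0.42}\n',
    '{"text": "world"}\n',  # not a TimedString: no timing keys
]

for line in raw_stream:
    chunk = json.loads(line)
    # Timing fields are optional, so read them defensively.
    start = chunk.get("start_time")
    end = chunk.get("end_time")
    print(chunk["text"], start, end)
```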
chenghao-mou
left a comment
lgtm. one small question.
```python
ts_pb.confidence = text.confidence
if utils.is_given(text.start_time_offset):
    ts_pb.start_time_offset = text.start_time_offset
text = json.dumps(MessageToDict(ts_pb, preserving_proto_field_name=True)) + "\n"
```
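The `+ "\n"` above makes the stream newline-delimited JSON. A sketch of how the receiving side can split such a buffer back into objects (field names mirror the snippet; `parse_ndjson` is a hypothetical helper, not part of the SDK):

```python
import json

def parse_ndjson(buffer: str) -> list[dict]:
    """Split a newline-delimited JSON buffer into one dict per line."""
    return [json.loads(line) for line in buffer.splitlines() if line]

buffer = (
    '{"text": "one", "start_time_offset": 0.1}\n'
    '{"text": "two"}\n'
)
chunks = parse_ndjson(buffer)
# chunks[0]["start_time_offset"] → 0.1; chunks[1] carries only "text"
```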
should we use `always_print_fields_with_no_presence` so keys are always present?
perhaps not; if the text is not a `TimedString`, we may not want `start_time` or `end_time` to be included in the dict.
```diff
 stt=inference.STT("deepgram/nova-3"),
 llm=inference.LLM("google/gemini-2.5-flash"),
-tts=inference.TTS("cartesia/sonic-3"),
+tts=cartesia.TTS(),
```
does inference not support this? if not we should let the team know.
we do have these options to enable timestamps in TTS inference (added in #4949), but it seems no timestamps are returned when they are enabled. will forward to the team.
```python
self._out_ch.send_nowait(
    TimedString(word, end_time=time.time() - self._start_wall_time)
)
```
🔴 `TimedString` `end_time` does not subtract `_paused_duration`, producing incorrect timestamps after pause/resume
In `_main_task`, the newly added `TimedString` objects compute `end_time` as `time.time() - self._start_wall_time`, but fail to subtract `self._paused_duration`. The synchronization delay calculation on synchronizer.py:337 correctly uses `elapsed = time.time() - self._start_wall_time - self._paused_duration`, but the `end_time` written to the output `TimedString` at lines 332 and 367 omits this subtraction. When audio playback is paused and resumed (e.g., during barge-in via `_SyncedAudioOutput.pause()` at synchronizer.py:613-618), the reported `end_time` will be inflated by the total pause duration, producing incorrect timing data for downstream consumers like the JSON format transcription output.
```diff
-self._out_ch.send_nowait(
-    TimedString(word, end_time=time.time() - self._start_wall_time)
-)
+self._out_ch.send_nowait(
+    TimedString(word, end_time=time.time() - self._start_wall_time - self._paused_duration)
+)
```
this is intentional; we should include paused time in the timestamp from the synchronizer, since it's the actual sent time of the transcript.
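The two readings differ by exactly the paused time, which is the crux of the disagreement above: subtracting `_paused_duration` yields a position in the audio, while not subtracting it yields the wall-clock send time. A tiny demo with hypothetical numbers:

```python
# Simplified, hypothetical clock values; not the actual synchronizer code.
start_wall_time = 100.0   # wall clock when playback started
paused_duration = 2.5     # total time playback was paused (e.g. barge-in)
now = 110.0               # current wall clock

# What the synchronizer intentionally reports: when the word was sent.
sent_time = now - start_wall_time                      # 10.0 s

# What the suggested fix would report: position within the audio itself.
media_time = now - start_wall_time - paused_duration   # 7.5 s

assert sent_time - media_time == paused_duration
```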
🤖 This is an automated Claude Code routine created by @toubatbrian. Right now it is in an experimentation stage. This PR looks like a core runtime improvement. Generated by Claude Code
🤖 Port opened: livekit/agents-js#1305. Generated by Claude Code
Summary
- Add `json_format` field to `TextOutputOptions` for the room text output chain
- Each chunk on the `lk.transcription` datastream topic is a JSON object with `text`, and `start_time`/`end_time` if the chunk is a `TimedString`
- needs livekit/protocol#1502