Skip to content

fix(webapp): capture Prisma infra errors and obfuscate leaked messages#3960

Open
d-cs wants to merge 7 commits into
mainfrom
fix/prisma-infra-error-leak
Open

fix(webapp): capture Prisma infra errors and obfuscate leaked messages#3960
d-cs wants to merge 7 commits into
mainfrom
fix/prisma-infra-error-leak

Conversation

@d-cs

@d-cs d-cs commented Jun 15, 2026

Copy link
Copy Markdown
Collaborator

Summary

Prisma infrastructure failures (P1xxx-class: database unreachable, timed out, connection dropped, engine init/panic) carry the database hostname in their .message. This captures them centrally for observability and ensures they never reach API clients verbatim.

Design

A $allOperations client extension on the writer and replica clients logs infrastructure errors with the originating model and operation, then rethrows the original error unchanged — call sites that branch on error.code (unique-violation idempotency, not-found handling) and transaction retries keep working. Only infrastructure errors are logged; routine query/validation errors (P2xxx) are left alone.

$allOperations can't see the transaction boundary ($transaction is a client method, not an operation), so infrastructure errors surfacing from $transaction() without a Prisma code — e.g. PrismaClientInitializationError — are logged separately at the transaction wrapper, where the existing coded-error path would otherwise miss them.

clientSafeErrorMessage() swaps an infrastructure error's message for "Internal Server Error" at the API routes that previously returned error.message raw. Status codes, headers, and every non-infrastructure message are unchanged.

Test plan

  • P2002 / P2025 rethrow with code intact and are not logged
  • Statement errors inside $transaction keep their code (retry logic intact)
  • Raw queries wrapped without crashing on the undefined model
  • A genuine connectivity failure is logged with model/operation/code
  • clientSafeErrorMessage obfuscates infra messages, preserves all others
  • pnpm run typecheck --filter webapp (12/12)

Note

Overlaps with #3391 (Prisma 7 migration) on apps/webapp/app/db.server.ts — coordinate rebasing.

…eir messages

Prisma infrastructure failures (P1xxx-class: DB unreachable/timed out/connection
dropped, engine init/panic) carry the database hostname in their message. Capture
them centrally and ensure they never reach API clients verbatim.

- db.server.ts: a $allOperations extension on the writer and replica clients logs
  infra errors with the model/operation, then rethrows the ORIGINAL error so the
  ~40 call sites that branch on error.code (and transaction retries) keep working.
- transaction boundary: log infra errors that surface from $transaction() without
  a Prisma code (e.g. PrismaClientInitializationError), which the existing coded-
  error callback misses.
- clientSafeErrorMessage(): swap an infra error's message for "Internal Server
  Error" at the API routes that returned it raw, leaving status codes, headers,
  and all non-infra messages unchanged. Applied to the batch trigger routes,
  schedule delete, and the worker continue action.

Adds testcontainer + real-error-instance tests covering message obfuscation,
pass-through of P2xxx codes, transaction-interior firing, and the boundary path.
@changeset-bot

changeset-bot Bot commented Jun 15, 2026

Copy link
Copy Markdown

⚠️ No Changeset found

Latest commit: a068023

Merging this PR will not cause a version bump for any packages. If these changes should not result in a new version, you're good to go. If these changes should result in a version bump, you need to add a changeset.

This PR includes no changesets

When changesets are added to this PR, you'll see the packages that this PR includes changesets for and the associated semver types

Click here to learn what changesets are, and how to add one.

Click here if you're a maintainer who wants to add a changeset to this PR

@coderabbitai

coderabbitai Bot commented Jun 15, 2026

Copy link
Copy Markdown
Contributor

Review Change Stack

No actionable comments were generated in the recent review. 🎉

ℹ️ Recent review info
⚙️ Run configuration

Configuration used: Repository UI

Review profile: CHILL

Plan: Pro

Run ID: 6eb6c21d-fb25-4d6f-a3dc-328b65f7f2f1

📥 Commits

Reviewing files that changed from the base of the PR and between fbbc9f8 and 15ea162.

📒 Files selected for processing (1)
  • apps/webapp/app/routes/api.v1.token.ts
📜 Recent review details
⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (12)
  • GitHub Check: webapp / 🧪 Unit Tests: Webapp (10, 10)
  • GitHub Check: webapp / 🧪 Unit Tests: Webapp (5, 10)
  • GitHub Check: webapp / 🧪 Unit Tests: Webapp (6, 10)
  • GitHub Check: webapp / 🧪 Unit Tests: Webapp (1, 10)
  • GitHub Check: webapp / 🧪 Unit Tests: Webapp (8, 10)
  • GitHub Check: webapp / 🧪 Unit Tests: Webapp (3, 10)
  • GitHub Check: webapp / 🧪 Unit Tests: Webapp (9, 10)
  • GitHub Check: webapp / 🧪 Unit Tests: Webapp (4, 10)
  • GitHub Check: webapp / 🧪 Unit Tests: Webapp (2, 10)
  • GitHub Check: webapp / 🧪 Unit Tests: Webapp (7, 10)
  • GitHub Check: e2e-webapp / 🧪 E2E Tests: Webapp
  • GitHub Check: 🛡️ E2E Auth Tests (full)
🧰 Additional context used
📓 Path-based instructions (6)
**/*.{ts,tsx}

📄 CodeRabbit inference engine (.github/copilot-instructions.md)

**/*.{ts,tsx}: Use types over interfaces for TypeScript
Avoid using enums; prefer string unions or const objects instead

Import from @trigger.dev/sdk when writing Trigger.dev tasks. Never use @trigger.dev/sdk/v3 or deprecated client.defineJob

Files:

  • apps/webapp/app/routes/api.v1.token.ts
{packages/core,apps/webapp}/**/*.{ts,tsx}

📄 CodeRabbit inference engine (.github/copilot-instructions.md)

Use zod for validation in packages/core and apps/webapp

Files:

  • apps/webapp/app/routes/api.v1.token.ts
**/*.{ts,tsx,js,jsx}

📄 CodeRabbit inference engine (.github/copilot-instructions.md)

Use function declarations instead of default exports

**/*.{ts,tsx,js,jsx}: Prefer static imports over dynamic imports. Only use dynamic import() when circular dependencies cannot be resolved, code splitting is needed for performance, or the module must be loaded conditionally at runtime
Import subpaths only from packages/core (@trigger.dev/core), never import from the root

Files:

  • apps/webapp/app/routes/api.v1.token.ts
**/*.ts

📄 CodeRabbit inference engine (.cursor/rules/otel-metrics.mdc)

**/*.ts: When creating or editing OTEL metrics (counters, histograms, gauges), ensure metric attributes have low cardinality by using only enums, booleans, bounded error codes, or bounded shard IDs
Do not use high-cardinality attributes in OTEL metrics such as UUIDs/IDs (envId, userId, runId, projectId, organizationId), unbounded integers (itemCount, batchSize, retryCount), timestamps (createdAt, startTime), or free-form strings (errorMessage, taskName, queueName)
When exporting OTEL metrics via OTLP to Prometheus, be aware that the exporter automatically adds unit suffixes to metric names (e.g., 'my_duration_ms' becomes 'my_duration_ms_milliseconds', 'my_counter' becomes 'my_counter_total'). Account for these transformations when writing Grafana dashboards or Prometheus queries

Files:

  • apps/webapp/app/routes/api.v1.token.ts
apps/webapp/**/*.{ts,tsx}

📄 CodeRabbit inference engine (.cursor/rules/webapp.mdc)

apps/webapp/**/*.{ts,tsx}: Access environment variables through the env export of env.server.ts instead of directly accessing process.env
Use subpath exports from @trigger.dev/core package instead of importing from the root @trigger.dev/core path

Use named constants for sentinel/placeholder values (e.g. const UNSET_VALUE = '__unset__') instead of raw string literals scattered across comparisons

Files:

  • apps/webapp/app/routes/api.v1.token.ts
**/*.{js,ts,tsx,jsx,css,json,md}

📄 CodeRabbit inference engine (AGENTS.md)

Use Prettier for code formatting and run pnpm run format before committing

Files:

  • apps/webapp/app/routes/api.v1.token.ts
🧠 Learnings (8)
📚 Learning: 2026-03-22T13:26:12.060Z
Learnt from: ericallam
Repo: triggerdotdev/trigger.dev PR: 3244
File: apps/webapp/app/components/code/TextEditor.tsx:81-86
Timestamp: 2026-03-22T13:26:12.060Z
Learning: In the triggerdotdev/trigger.dev codebase, do not flag `navigator.clipboard.writeText(...)` calls for `missing-await`/`unhandled-promise` issues. These clipboard writes are intentionally invoked without `await` and without `catch` handlers across the project; keep that behavior consistent when reviewing TypeScript/TSX files (e.g., usages like in `apps/webapp/app/components/code/TextEditor.tsx`).

Applied to files:

  • apps/webapp/app/routes/api.v1.token.ts
📚 Learning: 2026-03-22T19:24:14.403Z
Learnt from: matt-aitken
Repo: triggerdotdev/trigger.dev PR: 3187
File: apps/webapp/app/v3/services/alerts/deliverErrorGroupAlert.server.ts:200-204
Timestamp: 2026-03-22T19:24:14.403Z
Learning: In the triggerdotdev/trigger.dev codebase, webhook URLs are not expected to contain embedded credentials/secrets (e.g., fields like `ProjectAlertWebhookProperties` should only hold credential-free webhook endpoints). During code review, if you see logging or inclusion of raw webhook URLs in error messages, do not automatically treat it as a credential-leak/secrets-in-logs issue by default—first verify the URL does not contain embedded credentials (for example, no username/password in the URL, no obvious secret/token query params or fragments). If the URL is credential-free per this project’s conventions, allow the logging.

Applied to files:

  • apps/webapp/app/routes/api.v1.token.ts
📚 Learning: 2026-05-18T08:21:27.694Z
Learnt from: d-cs
Repo: triggerdotdev/trigger.dev PR: 3632
File: apps/webapp/sentry.server.ts:4-21
Timestamp: 2026-05-18T08:21:27.694Z
Learning: When handling Prisma error P1001 ("Can't reach database server") in TypeScript, don’t assume a single error shape. Prisma can surface P1001 via two different error classes/fields: `PrismaClientKnownRequestError` exposes it as `err.code === "P1001"` (common during mid-query connection drops), while `PrismaClientInitializationError` exposes it as `err.errorCode === "P1001"` (common on client startup failure). Therefore, predicates should use `err.code === "P1001" || err.errorCode === "P1001"`. Do not flag `err.code === "P1001"` as “unreachable/never matches,” as it is expected in production.

Applied to files:

  • apps/webapp/app/routes/api.v1.token.ts
📚 Learning: 2026-05-18T08:21:27.694Z
Learnt from: d-cs
Repo: triggerdotdev/trigger.dev PR: 3632
File: apps/webapp/sentry.server.ts:4-21
Timestamp: 2026-05-18T08:21:27.694Z
Learning: When handling Prisma errors for P1001 ("Can't reach database server"), do not assume it only appears under a single property name. Prisma may surface P1001 via either `PrismaClientKnownRequestError` (`err.code === "P1001"`, e.g., mid-query connection drops) or `PrismaClientInitializationError` (`err.errorCode === "P1001"`, e.g., client startup connection failure). To reliably detect the condition, check `err.code === "P1001" || err.errorCode === "P1001"`, and avoid review rules that would incorrectly flag `err.code === "P1001"` as unreachable/never-matching.

Applied to files:

  • apps/webapp/app/routes/api.v1.token.ts
📚 Learning: 2026-06-13T19:53:13.759Z
Learnt from: ericallam
Repo: triggerdotdev/trigger.dev PR: 3937
File: packages/trigger-sdk/skills/realtime-and-frontend/SKILL.md:258-260
Timestamp: 2026-06-13T19:53:13.759Z
Learning: When reviewing code that uses `trigger.dev/react-hooks`’s `useRealtimeRun`, preserve the call signature where the first argument is the full realtime handle object (not `handle.id`). This is intentional to maintain type-safety and is consistent with the official docs; do not suggest changing the first argument from the handle object to `handle.id`.

Applied to files:

  • apps/webapp/app/routes/api.v1.token.ts
📚 Learning: 2026-05-12T21:04:05.815Z
Learnt from: ericallam
Repo: triggerdotdev/trigger.dev PR: 3542
File: apps/webapp/app/components/sessions/v1/SessionStatus.tsx:1-3
Timestamp: 2026-05-12T21:04:05.815Z
Learning: In this Remix + TypeScript codebase, do not flag a server/client boundary violation when a file imports only types from a module matching `*.server`.

Specifically, it’s safe to import types using `import type { Foo } from "*.server"` or `import { type Foo } from "*.server"` because TypeScript erases type-only imports at compile time and they emit no JavaScript, so they won’t cross the Remix server/client bundle boundary.

Only raise the boundary concern for value imports (e.g., `import { Foo }` without `type`, or `import Foo`), since those produce JavaScript output.

Applied to files:

  • apps/webapp/app/routes/api.v1.token.ts
📚 Learning: 2026-06-04T18:16:35.386Z
Learnt from: nicktrn
Repo: triggerdotdev/trigger.dev PR: 3836
File: apps/supervisor/src/backpressure/backpressureMonitor.ts:3-5
Timestamp: 2026-06-04T18:16:35.386Z
Learning: When reviewing TypeScript in this repo, apply the rule “prefer type aliases over interfaces” only to data/object shapes and union/intersection type modeling. If an interface is being used as a behavioral contract for collaborators to implement (e.g., method-shape interfaces that define required behavior, such as `BackpressureLogger` / `BackpressureSignalSource` in `apps/supervisor/src/backpressure/backpressureMonitor.ts`), keep it as an `interface` and do not flag it as a type-alias-vs-interface violation.

Applied to files:

  • apps/webapp/app/routes/api.v1.token.ts
📚 Learning: 2026-06-09T17:58:04.699Z
Learnt from: 0ski
Repo: triggerdotdev/trigger.dev PR: 3879
File: apps/webapp/app/models/vercelIntegration.server.ts:619-630
Timestamp: 2026-06-09T17:58:04.699Z
Learning: In this codebase, outbound raw `fetch` calls should typically rely on Node/undici’s default request timeout (about ~300s) rather than adding a per-call `AbortController` + `setTimeout` wrapper inside individual functions (e.g. in files like `apps/webapp/app/models/vercelIntegration.server.ts`). During code review, do not flag the absence of a per-call timeout on a single `fetch` as an issue; if per-call timeouts are needed, they should be implemented via a codebase-wide convention (e.g., a shared fetch wrapper or documented pattern) rather than ad-hoc per-function changes.

Applied to files:

  • apps/webapp/app/routes/api.v1.token.ts
🔇 Additional comments (2)
apps/webapp/app/routes/api.v1.token.ts (2)

10-10: LGTM!


49-49: LGTM!


Walkthrough

Three new exported functions are added to apps/webapp/app/utils/prismaErrors.ts: captureInfrastructureErrors (a Prisma client $extends wrapper that intercepts per-operation errors, logs infrastructure failures with code and metadata when available, tags errors for deduplication, and rethrows), logTransactionInfrastructureError (a boundary logger for the $transaction path that skips already-logged and known-error cases), and clientSafeErrorMessage (replaces infrastructure error messages with a generic string while preserving other error messages). A deduplication marker system using a unique symbol prevents duplicate logging of the same infrastructure error. Both Prisma client singletons (prisma, $replica) in db.server.ts are wrapped with captureInfrastructureErrors, and $transaction gains a try/catch that calls logTransactionInfrastructureError. Seven API route handlers are updated to use clientSafeErrorMessage instead of raw error.message in JSON error responses.

🚥 Pre-merge checks | ✅ 3 | ❌ 2

❌ Failed checks (2 warnings)

Check name Status Explanation Resolution
Description check ⚠️ Warning The description provides a comprehensive summary and design explanation, but is missing key template sections: the issue reference, testing verification steps, and the required checklist with actual confirmations. Add the issue reference (Closes #), complete the checklist with checked items, and provide explicit testing steps taken.
Docstring Coverage ⚠️ Warning Docstring coverage is 31.25% which is insufficient. The required threshold is 80.00%. Write docstrings for the functions missing them to satisfy the coverage threshold.
✅ Passed checks (3 passed)
Check name Status Explanation
Title check ✅ Passed The title clearly and concisely summarizes the main change: capturing Prisma infrastructure errors and obfuscating their messages. It directly reflects the core purpose of the changeset.
Linked Issues check ✅ Passed Check skipped because no linked issues were found for this pull request.
Out of Scope Changes check ✅ Passed Check skipped because no linked issues were found for this pull request.

✏️ Tip: You can configure your own custom pre-merge checks in the settings.

✨ Finishing Touches
📝 Generate docstrings
  • Create stacked PR
  • Commit on current branch
🧪 Generate unit tests (beta)
  • Create PR with unit tests
  • Commit unit tests in branch fix/prisma-infra-error-leak

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

Comment @coderabbitai help to get the list of available commands and usage tips.

@d-cs d-cs self-assigned this Jun 15, 2026
coderabbitai[bot]

This comment was marked as resolved.

d-cs added 3 commits June 16, 2026 09:55
- api.v1.runs.$runParam.replay.ts returned a raw error.message; route it
  through clientSafeErrorMessage so infra errors are obfuscated like the
  other patched routes.
- Tag an infra error when the client extension logs it at the statement
  level, and skip it in the $transaction-boundary loggers, so a single
  failure is logged exactly once instead of twice inside a transaction.
…loads

Code review caught that the infraErrorAlreadyLogged guard was added to only
the named-$transaction callback; the anonymous overload still double-logged
statement-level infra errors. Extract one shared boundary callback so the
guard can't drift between the two overloads.

Also harden the dedupe marker: define it non-enumerable (so error-spreads
can't copy the tag onto a different error) and best-effort (so a frozen
error object can't make the assignment throw and mask the original error).
@d-cs d-cs marked this pull request as ready for review June 16, 2026 09:32
@pkg-pr-new

pkg-pr-new Bot commented Jun 16, 2026

Copy link
Copy Markdown

Open in StackBlitz

@trigger.dev/build

npm i https://pkg.pr.new/@trigger.dev/build@0e300c3

trigger.dev

npm i https://pkg.pr.new/trigger.dev@0e300c3

@trigger.dev/core

npm i https://pkg.pr.new/@trigger.dev/core@0e300c3

@trigger.dev/python

npm i https://pkg.pr.new/@trigger.dev/python@0e300c3

@trigger.dev/react-hooks

npm i https://pkg.pr.new/@trigger.dev/react-hooks@0e300c3

@trigger.dev/redis-worker

npm i https://pkg.pr.new/@trigger.dev/redis-worker@0e300c3

@trigger.dev/rsc

npm i https://pkg.pr.new/@trigger.dev/rsc@0e300c3

@trigger.dev/schema-to-json

npm i https://pkg.pr.new/@trigger.dev/schema-to-json@0e300c3

@trigger.dev/sdk

npm i https://pkg.pr.new/@trigger.dev/sdk@0e300c3

commit: 0e300c3

devin-ai-integration[bot]

This comment was marked as resolved.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant