Skip to content
Open
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
2 changes: 1 addition & 1 deletion accounts-billing/billing.mdx
Original file line number Diff line number Diff line change
Expand Up @@ -89,7 +89,7 @@ Each Runpod product has its own pricing structure. See the detailed pricing page
<Card title="Pods pricing" icon="server" href="/pods/pricing">
On-demand and savings plan pricing for GPU Pods.
</Card>
<Card title="Serverless pricing" icon="bolt" href="/serverless/pricing">
<Card title="Serverless pricing" icon="bolt" href="https://www.runpod.io/pricing">
Pay-per-second pricing for flex and active workers.
</Card>
</CardGroup>
Expand Down
5 changes: 4 additions & 1 deletion docs.json
Original file line number Diff line number Diff line change
Expand Up @@ -94,7 +94,6 @@
"serverless/overview",
"serverless/quickstart",
"serverless/sdks",
"serverless/pricing",
{
"group": "Create handler functions",
"pages": [
Expand Down Expand Up @@ -612,6 +611,10 @@
}
},
"redirects": [
{
"source": "/serverless/pricing",
"destination": "/serverless/overview"
},
{
"source": "/public-endpoints/models/seedance-1-pro",
"destination": "/public-endpoints/overview"
Expand Down
6 changes: 3 additions & 3 deletions flash/pricing.mdx
Original file line number Diff line number Diff line change
Expand Up @@ -4,7 +4,7 @@ sidebarTitle: "Pricing"
description: "Understand Flash pricing and optimize your costs."
---

Flash follows the same pricing model as [Runpod Serverless](/serverless/pricing). You pay per second of compute time, with no charges when your code isn't running. Pricing depends on the GPU or CPU type you configure for your endpoints.
Flash follows the same pricing model as [Runpod Serverless](/serverless/overview). You pay per second of compute time, with no charges when your code isn't running. Pricing depends on the GPU or CPU type you configure for your endpoints.

## How pricing works

Expand All @@ -25,7 +25,7 @@ Flash supports both GPU and CPU workers. Pricing varies based on the hardware ty
- **GPU workers**: Use `@Endpoint(gpu=...)` configuration. Pricing depends on the GPU type (e.g., RTX 4090, A100 80GB).
- **CPU workers**: Use `@Endpoint(cpu=...)` configuration. Pricing depends on the CPU instance type.

See the [Serverless pricing page](/serverless/pricing) for current rates by GPU and CPU type.
See the [Runpod pricing page](https://www.runpod.io/pricing) for current rates by GPU and CPU type.

## How to estimate and optimize costs

Expand Down Expand Up @@ -114,5 +114,5 @@ Monitor your usage in the [Runpod console](https://www.runpod.io/console/serverl
## Next steps

- [Create endpoint functions](/flash/create-endpoints) with optimized configurations.
- [View Serverless pricing details](/serverless/pricing) for current rates.
- [View Runpod pricing details](https://www.runpod.io/pricing) for current rates.
- [Configure resources](/flash/configuration/parameters) for your workloads.
2 changes: 1 addition & 1 deletion release-notes.mdx
Original file line number Diff line number Diff line change
Expand Up @@ -261,7 +261,7 @@ Flash now supports deploying endpoints to [multiple datacenters](/flash/configur
- **Self-service worker upgrade**: Rebuild and roll workers from the dashboard without support tickets.
- **Edit template from endpoint page**: Inline edit and redeploy the underlying template directly from the endpoint view.
- **Improved Serverless metrics page**: Refinements to charts and filters for quicker root-cause analysis.
- [Flex and active workers](/serverless/pricing): Always-on "active" workers for baseline load with on-demand "flex" workers for bursts.
- [Flex and active workers](/serverless/overview): Always-on "active" workers for baseline load with on-demand "flex" workers for bursts.
- **Billing explorer**: Inspect costs by resource, region, and time to identify optimization opportunities.

</Update>
Expand Down
6 changes: 5 additions & 1 deletion serverless/overview.mdx
Original file line number Diff line number Diff line change
Expand Up @@ -172,4 +172,8 @@ flowchart TD

linkStyle default stroke-width:2px,stroke:#5F4CFE
```
</div>
</div>

## Pricing

Serverless uses pay-per-second pricing with no upfront costs. You're billed only for the compute time your workers use, from the moment a worker starts until it fully stops, and you're never charged while workers are idle. For current rates by GPU type, see the [Runpod pricing page](https://www.runpod.io/pricing).
66 changes: 0 additions & 66 deletions serverless/pricing.mdx

This file was deleted.

Loading