SwitchIt AI Gateway

SwitchIt is a lightweight, OpenAI-compatible AI gateway. I built it because I needed a simple gateway for my projects running on resource-constrained hardware like Set-Top Boxes (STB).

Features

OpenAI Compatible: Exposes /v1/chat/completions and /v1/models endpoints, making it easy to point my coding tools or OpenAI clients to it.
Gemini Translation: Translates request and response structures (including SSE streams) to Google Gemini API.
OpenAI Pass-Through: Proxies requests to other OpenAI-compatible upstreams (e.g., AgentRouter, LiteLLM) when I want to use other models.
Custom Headers: Can send custom headers (like User-Agent) to upstreams if needed (disabled by default).
Priority & Failover: Orders providers by priority. If a high-priority provider fails (e.g. rate limit 429 or auth error), it automatically falls over to the next provider in the list.
Budget Control: Tracks daily and monthly spend in USD. Blocks requests with a 429 error once budgets are hit to prevent runaway coding agent bills.
SQLite Request Logs: Logs requests in a local SQLite database, automatically pruning entries older than 7 days (configurable) to save space.
TUI Dashboard: A terminal-based monitor (switchit-tui) that displays real-time spend gauges, token usage, and recent request history.
Lightweight: Built in Rust and runs on a single-threaded Tokio event loop to keep memory usage under 15MB at idle.
Hot-Reload: Polls the config file for modification changes every 5 seconds and reloads config dynamically without restarting the server.

Folder Structure

switchit/
├── Cargo.toml                    # Workspace configuration
├── config.example.toml           # Template config file
├── config.toml                   # Local config (gitignored)
├── switchit.service              # systemd unit file for Linux
└── crates/
    ├── switchit-common/          # Shared structs & types
    ├── switchit-daemon/          # Axum gateway & provider handlers
    └── switchit-tui/             # Ratatui terminal dashboard

Routing & Failover

Priority Routing: Providers are sorted by priority. The gateway always uses the highest-priority provider first to hit prompt context caching.
Non-retryable Errors: If the gateway gets a rate limit (429) or authentication error (401/403), it immediately falls over to the next provider.
Retryable Errors: For connection timeouts or 5xx server errors, it retries with exponential backoff before falling over.

Getting Started

1. Build

To compile:

cargo build --workspace

To build a size-optimized production binary:

cargo build --release --workspace

2. Configure

Copy the template config file:

cp config.example.toml config.toml

Edit config.toml to add keys and adjust daily/monthly budgets or logs retention:

[server]
listen    = "127.0.0.1:3000"
log_level = "info"
ctl_port  = 3001

[storage]
backend = "sqlite"
path = "switchit.db"
retention_days = 7

[limits]
daily_budget = 1.00
monthly_budget = 20.00
usage_file = "usage.json"

3. Run the Daemon

export GEMINI_API_KEY="AIzaSy..."
./target/debug/switchit-daemon --config config.toml

4. Run the TUI Dashboard

To monitor stats, token usage, and logs in real-time:

./target/debug/switchit-tui --config config.toml

API Examples

Liveness Check

curl http://127.0.0.1:3000/health
# Response: "ok"

List Models

curl http://127.0.0.1:3000/v1/models -H "Authorization: Bearer sk-local-key"

Chat Completion

curl http://127.0.0.1:3000/v1/chat/completions \
  -H "Authorization: Bearer sk-local-key" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "deepseek-v4-flash",
    "messages": [{"role": "user", "content": "1+1="}]
  }'

Connecting AI Tools & Editors

SwitchIt exposes a standard OpenAI-compatible API at http://<IP>:3000/v1 (offering /v1/chat/completions and /v1/models). This allows you to easily connect it to any AI coding assistant, IDE extension, or CLI tool that supports custom OpenAI endpoints.

1. Kilo Code / Cline (VS Code Extensions)

Kilo Code / Cline:
1. Open settings in the extension panel.
2. Choose OpenAI Compatible (or Custom Provider) as the API provider.
3. Set Base URL to http://10.0.0.15:3000/v1 (replace with your gateway's IP).
4. Set API Key to sk-local-key (or leave blank if auth is disabled in your config.toml).
5. Select or type your desired model (e.g., gemini-2.5-flash).

Continue: Add the following block to your ~/.continue/config.json:

{
  "models": [
    {
      "title": "SwitchIt Gateway",
      "provider": "openai",
      "model": "gemini-2.5-flash",
      "apiBase": "http://10.0.0.15:3000/v1",
      "apiKey": "sk-local-key"
    }
  ]
}

2. Claude Code (Anthropic CLI)

Claude Code (claude) can be configured to point to a custom API proxy or gateway by setting environment variables:

Via environment variables (per-session):

export ANTHROPIC_BASE_URL="http://10.0.0.15:3000/v1"
export ANTHROPIC_AUTH_TOKEN="sk-local-key"  # (if auth is enabled)
claude

Via settings file (persistent) (~/.claude/settings.json):

{
  "env": {
    "ANTHROPIC_BASE_URL": "http://10.0.0.15:3000/v1",
    "ANTHROPIC_AUTH_TOKEN": "sk-local-key"
  }
}

3. Aider (CLI Coding Assistant)

Aider natively supports custom OpenAI-compatible backends:

export OPENAI_API_BASE="http://10.0.0.15:3000/v1"
export OPENAI_API_KEY="sk-local-key"
aider --model openai/gemini-2.5-flash

4. Gemini CLI / Clients

If you are using a client or CLI tool that wraps the standard Google Gemini SDK and respects custom endpoints via environment variables:

export GOOGLE_GEMINI_BASE_URL="http://10.0.0.15:3000/v1"
export GEMINI_API_KEY="sk-local-key"

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
crates		crates
.gitignore		.gitignore
Cargo.lock		Cargo.lock
Cargo.toml		Cargo.toml
README.md		README.md
config.example.toml		config.example.toml
opencode.json		opencode.json
switchit.service		switchit.service

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SwitchIt AI Gateway

Features

Folder Structure

Routing & Failover

Getting Started

1. Build

2. Configure

3. Run the Daemon

4. Run the TUI Dashboard

API Examples

Liveness Check

List Models

Chat Completion

Connecting AI Tools & Editors

1. Kilo Code / Cline (VS Code Extensions)

2. Claude Code (Anthropic CLI)

3. Aider (CLI Coding Assistant)

4. Gemini CLI / Clients

About

Uh oh!

Releases 1

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

SwitchIt AI Gateway

Features

Folder Structure

Routing & Failover

Getting Started

1. Build

2. Configure

3. Run the Daemon

4. Run the TUI Dashboard

API Examples

Liveness Check

List Models

Chat Completion

Connecting AI Tools & Editors

1. Kilo Code / Cline (VS Code Extensions)

2. Claude Code (Anthropic CLI)

3. Aider (CLI Coding Assistant)

4. Gemini CLI / Clients

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages