Local development setup

This guide covers getting a full PhyslibSearch stack running on your machine for development.

Prerequisites

Tool	Minimum version	Install
Python	3.11	python.org
PostgreSQL	14	postgresql.org/download
Node.js	20	nodejs.org
Lean 4 + elan	latest	`curl https://elan.lean-lang.org/elan-init.sh -sSf

1 — Python environment

python -m venv .venv
source .venv/bin/activate       # Windows: .venv\Scripts\activate
pip install -r requirements.txt

2 — PostgreSQL

Create a dedicated database:

createdb physlibsearch

If you want to reset from scratch at any point:

dropdb physlibsearch && createdb physlibsearch

3 — jixia

jixia is the Lean 4 project parser. Build it from source:

git clone https://github.com/frenzymath/jixia.git
cd jixia
lake build          # first build takes ~70 s
cd ..

Toolchain matching — jixia's lean-toolchain must match the toolchain of the project you're indexing. Run cat jixia/lean-toolchain and compare with cat /path/to/physlib/lean-toolchain. If they differ, update one to match the other before building.

4 — Environment variables

cp .env.example .env

Open .env and fill in at minimum:

Variable	What to set
`JIXIA_PATH`	Absolute path to `jixia/.lake/build/bin/jixia`
`LEAN_SYSROOT`	Run `lake env` in PhysLib and copy the `LEAN_SYSROOT` value
`CONNECTION_STRING`	`"dbname=physlibsearch user=YOUR_USER password=YOUR_PASSWORD"`
`GEMINI_API_KEY`	Your key from aistudio.google.com

The remaining variables have sensible defaults.

Using a custom LLM endpoint

If you want to use OpenRouter (or any OpenAI-compatible endpoint) instead of Gemini directly for the fast model, add:

LLM_API_KEY  = "sk-or-..."
LLM_BASE_URL = "https://openrouter.ai/api/v1"
GEMINI_FAST_MODEL = "google/gemini-3-flash-preview"   # use the endpoint's model name

5 — Index a small project (recommended for dev)

For development it's easiest to index a small Lean project rather than all of PhysLib. Set DRY_RUN=true to verify the pipeline runs without spending any API quota:

DRY_RUN=true python -m database jixia /path/to/physlib Physlib
DRY_RUN=true python -m database informal
DRY_RUN=true python -m database vector-db

For a real index of PhysLib:

python -m database jixia /path/to/physlib Physlib,QuantumInfo
python -m database informal
python -m database vector-db

Makefile shortcuts

# Set env vars first, then:
export DBNAME=physlibsearch
export INDEXED_REPO_PATH=/path/to/physlib
export MODULE_NAMES=Physlib,QuantumInfo
export CHROMA_PATH=chroma

make index      # runs reset → jixia → informal → vector-db in sequence

6 — Run the backend

uvicorn server:app --reload --port 8000

The API is now at http://localhost:8000. Interactive docs at http://localhost:8000/docs.

7 — Run the frontend

cd frontend
npm install
npm run dev

Open http://localhost:3000.

The frontend reads NEXT_PUBLIC_API_URL (defaults to http://localhost:8000). To point it at a different backend:

NEXT_PUBLIC_API_URL=http://my-server:8000 npm run dev

Code quality

# Python linting + formatting (ruff)
ruff check .
ruff format .

# Frontend linting
cd frontend && npm run lint

CI runs ruff check and ruff format --check on every push (see .github/workflows/lint.yaml).

Database schema management

The schema is defined in database/create_schema.py. To (re-)apply it:

python -m database schema

This is idempotent — safe to run on an empty database. It creates all tables, types, views, and the physlibsearch operational schema.

CLI search (no frontend needed)

python search.py "conservation of momentum"
python search.py "Schrödinger equation" "Hamiltonian operator" --json
python search.py -n 20 "entropy"

Flags:

-n N — number of results (default 5)
--json — output as JSON

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Local development setup

Prerequisites

1 — Python environment

2 — PostgreSQL

3 — jixia

4 — Environment variables

Using a custom LLM endpoint

5 — Index a small project (recommended for dev)

Makefile shortcuts

6 — Run the backend

7 — Run the frontend

Code quality

Database schema management

CLI search (no frontend needed)

FilesExpand file tree

development.md

Latest commit

History

development.md

File metadata and controls

Local development setup

Prerequisites

1 — Python environment

2 — PostgreSQL

3 — jixia

4 — Environment variables

Using a custom LLM endpoint

5 — Index a small project (recommended for dev)

Makefile shortcuts

6 — Run the backend

7 — Run the frontend

Code quality

Database schema management

CLI search (no frontend needed)