docs(workflow): add complete workflow diagram and reference

nullhack · nullhack · commit 6bf5550ac919 · 2026-04-16T15:48:13.000-04:00
diff --git a/docs/workflow.md b/docs/workflow.md
@@ -0,0 +1,239 @@
+# Development Workflow
+
+This document describes the complete feature lifecycle used to develop software with this framework.
+
+---
+
+## Overview
+
+Features flow through 6 steps with a WIP limit of 1 feature at a time. The filesystem enforces the limit:
+
+```
+docs/features/backlog/<name>.feature      ← waiting
+docs/features/in-progress/<name>.feature  ← exactly one being built
+docs/features/completed/<name>.feature    ← accepted and shipped
+```
+
+Each step has a designated agent and a specific deliverable. No step is skipped.
+
+---
+
+## Full Workflow Diagram
+
+```
+╔══════════════════════════════════════════════════════════════════════╗
+║                    FEATURE LIFECYCLE (WIP = 1)                       ║
+╚══════════════════════════════════════════════════════════════════════╝
+
+  FILESYSTEM ENFORCES WIP:
+  backlog/<name>.feature  →  in-progress/<name>.feature  →  completed/<name>.feature
+
+
+┌─────────────────────────────────────────────────────────────────────┐
+│  STEP 1 — SCOPE                              agent: product-owner   │
+├─────────────────────────────────────────────────────────────────────┤
+│                                                                     │
+│  Phase 1 — Project Discovery (once per project)                     │
+│    PO asks stakeholder 7 questions → silent pre-mortem              │
+│    → baseline → create backlog/<name>.feature stubs                 │
+│                                                                     │
+│  Phase 2 — Feature Discovery (per feature)                          │
+│    PO populates Entities table → generates questions from gaps      │
+│    → interview rounds → stakeholder says "baseline"                 │
+│    → decomposition check (>2 concerns or >8 examples → split)       │
+│                                                                     │
+│  Phase 3 — Stories (PO alone)                                       │
+│    Write Rule: blocks with user story headers (no Examples yet)     │
+│    commit: feat(stories): write user stories for <name>             │
+│                                                                     │
+│  Phase 4 — Criteria (PO alone)                                      │
+│    Write @id-tagged Example: blocks under each Rule:                │
+│    commit: feat(criteria): write acceptance criteria for <name>     │
+│    ★ FROZEN — changes require @deprecated + new Example             │
+│                                                                     │
+└─────────────────────────────────────────────────────────────────────┘
+                              ↓  PO picks feature from backlog
+┌─────────────────────────────────────────────────────────────────────┐
+│  STEP 2 — ARCHITECTURE                           agent: developer   │
+├─────────────────────────────────────────────────────────────────────┤
+│                                                                     │
+│  mv backlog/<name>.feature → in-progress/<name>.feature             │
+│  Read discovery + feature file                                      │
+│  Silent pre-mortem (YAGNI/KISS/DRY/SOLID/OC/patterns)              │
+│  Append Architecture section to feature file description            │
+│    (Module Structure + ADRs + Build Changes)                        │
+│  Architecture contradiction check → PO acknowledges                │
+│  commit: feat(<name>): add architecture                             │
+│                                                                     │
+└─────────────────────────────────────────────────────────────────────┘
+                              ↓
+┌─────────────────────────────────────────────────────────────────────┐
+│  STEP 3 — TEST FIRST                             agent: developer   │
+├─────────────────────────────────────────────────────────────────────┤
+│                                                                     │
+│  uv run task gen-tests   →  creates tests/features/<name>/          │
+│                              one <rule-slug>_test.py per Rule:      │
+│                              test_<rule_slug>_<hex>() per Example   │
+│  Write test bodies (real assertions, not raise NotImplementedError) │
+│  Confirm every test FAILS (ImportError / AssertionError)            │
+│  ★ STOP — reviewer checks test design + semantic alignment          │
+│  ★ WAIT for APPROVED                                                │
+│  commit: test(<name>): write failing tests                          │
+│                                                                     │
+└─────────────────────────────────────────────────────────────────────┘
+                              ↓
+┌─────────────────────────────────────────────────────────────────────┐
+│  STEP 4 — IMPLEMENT                              agent: developer   │
+├─────────────────────────────────────────────────────────────────────┤
+│                                                                     │
+│  For each failing test (one at a time):                             │
+│                                                                     │
+│    RED → GREEN → REFACTOR → SELF-DECLARE ─STOP─ REVIEWER ─WAIT─   │
+│                                                          ↓ APPROVED │
+│                                                       COMMIT        │
+│                                                          ↓          │
+│                                                    next test        │
+│                                                                     │
+│  RED:         confirm test fails                                     │
+│  GREEN:       minimum code to pass (YAGNI + KISS only)              │
+│  REFACTOR:    DRY → SOLID → Object Calisthenics (9 rules)           │
+│               → type hints → docstrings                             │
+│  SELF-DECLARE: write ## Self-Declaration block in TODO.md           │
+│               21-item checklist with file:line evidence             │
+│  REVIEWER:    code-design check only (no lint/pyright/coverage)     │
+│  COMMIT:      feat(<name>): implement <what>                        │
+│                                                                     │
+│  After all tests green:                                             │
+│    lint + static-check + test + timeout run  (all must pass)        │
+│    developer pre-mortem (2-3 sentences)                             │
+│                                                                     │
+└─────────────────────────────────────────────────────────────────────┘
+                              ↓
+┌─────────────────────────────────────────────────────────────────────┐
+│  STEP 5 — VERIFY                                  agent: reviewer   │
+├─────────────────────────────────────────────────────────────────────┤
+│                                                                     │
+│  Default hypothesis: broken despite green checks                    │
+│                                                                     │
+│  1. Read feature file — all @id Examples, interaction model         │
+│  2. Check commit history — one commit per test, clean status        │
+│  3. Production-grade gate:                                          │
+│       app exits cleanly + output changes with input                 │
+│  4. Code review (stop on first failure):                            │
+│       4a Correctness (dead code, DRY, YAGNI)                        │
+│       4b KISS (one thing, nesting, size)                            │
+│       4c SOLID (5-row table)                                        │
+│       4d Object Calisthenics (9-row table)                          │
+│       4e Design Patterns (5 smells)                                 │
+│       4f Tests (docstrings, contracts, @id coverage, naming)        │
+│       4g Code Quality (noqa, type hints, docstrings, coverage)      │
+│  5. Run: gen-tests --orphans → lint → static-check → test           │
+│  6. Interactive verification (if UI involved)                       │
+│  7. Written report: APPROVED or REJECTED                            │
+│                                                                     │
+└─────────────────────────────────────────────────────────────────────┘
+                              ↓ APPROVED
+┌─────────────────────────────────────────────────────────────────────┐
+│  STEP 6 — ACCEPT                             agent: product-owner   │
+├─────────────────────────────────────────────────────────────────────┤
+│                                                                     │
+│  PO runs/observes the feature (real user interaction)               │
+│  Checks against original Rule: user stories                         │
+│                                                                     │
+│  ACCEPTED:                                                          │
+│    mv in-progress/<name>.feature → completed/<name>.feature         │
+│    developer creates PR + tags release                              │
+│                                                                     │
+│  REJECTED:                                                          │
+│    feedback in TODO.md → back to relevant step                      │
+│                                                                     │
+└─────────────────────────────────────────────────────────────────────┘
+```
+
+---
+
+## Supporting Tools
+
+| Command | When | Purpose |
+|---|---|---|
+| `uv run task gen-tests` | Step 3, Step 4 | Reads `.feature` files → creates/syncs test stubs in `tests/features/` |
+| `uv run task gen-tests -- --check` | Before gen-tests | Dry run — preview what would change |
+| `uv run task gen-tests -- --orphans` | Step 5 | List tests with no matching `@id` |
+| `uv run task gen-todo` | Every session | Reads in-progress `.feature` → syncs `TODO.md` |
+| `uv run task gen-id` | Step 1 Phase 4 | Generate 8-char hex `@id` for a new Example |
+| `uv run task test-fast` | Step 4 cycle | Fast test run (no coverage) — used during Red-Green-Refactor |
+| `uv run task test` | Handoff, Step 5 | Full suite with coverage — must reach 100% |
+| `uv run task lint` | Handoff, Step 5 | ruff — must exit 0 |
+| `uv run task static-check` | Handoff, Step 5 | pyright — must exit 0, 0 errors |
+| `timeout 10s uv run task run` | Handoff, Step 5 | App must exit cleanly (exit 124 = hang = fix it) |
+
+---
+
+## Test Layout
+
+```
+tests/
+  features/<feature-name>/
+    <rule-slug>_test.py     ← generated by gen-tests, one per Rule: block
+                              function: test_<rule_slug>_<8char_hex>()
+  unit/
+    <anything>_test.py      ← developer-authored extras, no @id traceability
+                              plain pytest or Hypothesis @given (developer's choice)
+```
+
+---
+
+## TODO.md Structure (Step 4)
+
+```markdown
+# Current Work
+
+Feature: <name>
+Step: 4 (implement)
+Source: docs/features/in-progress/<name>.feature
+
+## Cycle State
+Test: @id:<hex> — <description>
+Phase: RED | GREEN | REFACTOR | SELF-DECLARE | REVIEWER(code-design) | COMMITTED
+
+## Self-Declaration (@id:<hex>)
+- [x] YAGNI-1 … SOLID-D … OC-1…OC-9 … Semantic  (21 items, file:line each)
+
+## Progress
+- [x] @id:<hex>: <done> — reviewer(code-design) APPROVED
+- [~] @id:<hex>: <in progress>
+- [ ] @id:<hex>: <next>
+
+## Next
+<one actionable sentence>
+```
+
+---
+
+## Roles
+
+| Role | Type | Responsibilities |
+|---|---|---|
+| **Stakeholder** | Human | Answers questions, provides domain knowledge, says "baseline" |
+| **Product Owner** | AI agent | Interviews stakeholder, writes `.feature` files, picks features, accepts deliveries |
+| **Developer** | AI agent | Architecture, tests, code, git, releases |
+| **Reviewer** | AI agent | Adversarial verification — defaults to REJECTED until proven correct |
+
+---
+
+## Quality Gates (non-negotiable)
+
+| Gate | Standard |
+|---|---|
+| Test coverage | 100% |
+| Type errors (pyright) | 0 |
+| Lint errors (ruff) | 0 |
+| Function length | ≤ 20 lines |
+| Class length | ≤ 50 lines |
+| Max nesting | 2 levels |
+| Instance variables per class | ≤ 2 |
+| Uncovered `@id` tags | 0 |
+| `noqa` comments | 0 |
+| `type: ignore` comments | 0 |
+| Orphaned tests | 0 |
+| Hypothesis tests missing `@pytest.mark.slow` | 0 |