deterministic-testing

Here are 27 public repositories matching this topic...

justindobbs / Tracecore

Deterministic runtime for agent evaluation

reliability-engineering specification ai-agents benchmarking-framework autogen fastapi langchain observability-platform ai-evaluation-framework agent-testing agent-benchmark deterministic-testing autoresearch

Updated Mar 25, 2026
Python

Tuntii / KayaDB

KayaDB: a distributed key-value storage engine

rust distributed-systems database storage-engine raft embedded-database correctness jepsen lsm-tree deterministic-testing

Updated Jul 19, 2026
Rust

ElliotOne / nl-prompt-versioning-ab-evaluation

Local-first C# project for deterministic prompt versioning, A/B evaluation, and evidence-based promotion using structured scoring.

csharp dotnet ab-testing experiment-design ai-systems prompt-engineering semantic-kernel local-llm llm-evaluation deterministic-testing

Updated Apr 4, 2026
C#

44-99 / Web2DKit

MCP tools, Agent Skills, and a runtime bridge for building, playtesting, and debugging browser-native 2D games with Codex, Claude Code, and other coding agents.

typescript canvas phaser mcp html5-game browser-game developer-tools pixijs codex 2d-games game-testing playwright web-game-development agent-skills claude-code deterministic-testing

Updated Jul 23, 2026
TypeScript

georgejeffers / Wordle-AI-Benchmark

WordleBench — Deterministic AI Wordle benchmark. Compare 34+ LLMs (GPT-5, Claude 4.5, Gemini, Grok, Llama) head-to-head on accuracy, speed, and cost across 50 standardized words.

typescript nextjs gemini wordle language-models claude wordle-solver gpt-5 vercel-ai-sdk llm-leaderboard ai-benchmark llm-benchmark ai-comparison deterministic-testing

Updated Feb 6, 2026
TypeScript

hraness / direct

Deterministic scenarios and verification for real application interfaces.

typescript testing-tools frontend-testing scenario-testing deterministic-testing

Updated Jul 24, 2026
TypeScript

zoobz-io / clockz

Type-safe clock abstractions for Go with zero dependencies

testing go golang time clock deterministic-testing zoobzio fake-clock

Updated Mar 19, 2026
Go

oonyl / constitutional-agent-testbench

A deterministic Python testbench for evaluating structured responses against declared JSON rules.

python json validation ai-agents structured-output policy-as-code ai-evaluation deterministic-testing

Updated Jul 23, 2026
Python

diba7star / DeHeisenbug

The deterministic heap groomer for C/C++ memory debugging.

c debugging cplusplus cpp gdb memory-allocator fuzzing memory-safety valgrind low-level memory-corruption exploit-development use-after-free heisenbug double-free heap-grooming deterministic-testing

Updated Dec 10, 2025
C

kody-w / static-sap-s4hana

Deterministic, zero-dependency static SAP S/4HANA OData and ERP workflow simulator.

simulator erp sap odata api-mock s4hana deterministic-testing

Updated Jul 13, 2026
JavaScript

PPDEGRET / EMPCOAnalyzer

Python CLI for environmental marketing-claim risk review with structured outputs, image support, and offline mock testing.

python cli human-in-the-loop structured-output empco deterministic-testing marketing-compliance environmental-claims

Updated Jul 24, 2026
Python

kody-w / static-servicenow

Deterministic, zero-dependency static ITSM and ServiceNow Table API compatibility simulator.

simulator itsm api-mock servicenow table-api deterministic-testing

Updated Jul 13, 2026
JavaScript

compiledbyutkarsh / quorum

A Raft consensus library built from scratch in Rust, with a deterministic network simulator for testing leader election, log replication, and partition recovery.

rust distributed-systems database simulation raft consensus systems-programming deterministic-testing