White Circle
Runtime safety and alignment infrastructure for AI in the real world.
Repositories
- killbench Public
Benchmark showing all major LLMs exhibit measurable decision biases, worsened by structured outputs that reduce safety refusals.
- circle-guard-bench Public
First-of-its-kind AI benchmark for evaluating the protection capabilities of large language model (LLM) guard systems (guardrails and safeguards).