Skip to content

fix(GgufInsights): correct KV cache size estimate for quantized types

c042c47
Select commit
Loading
Failed to load commit list.
Sign in for the full log view
Open

fix(GgufInsights): correct KV cache VRAM estimate for quantized types #608

fix(GgufInsights): correct KV cache size estimate for quantized types
c042c47
Select commit
Loading
Failed to load commit list.