fix: support quantization ranges for int8/uint8 sentence-transformers… by DebadityaHait · Pull Request #3543 · deepset-ai/haystack-core-integrations

DebadityaHait · 2026-07-03T07:43:03Z

Related Issues

Fixes haystack#9100 (SentenceTransformersTextEmbedder zero embeddings for single text queries when using precision="int8").

Proposed Changes:

With precision="int8" or "uint8", sentence-transformers calibrates the scalar-quantization min/max range from the batch being encoded. For a single text (as in SentenceTransformersTextEmbedder.run) the range is degenerate (min == max), producing meaningless all-equal embeddings (e.g. all zeros).
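The degenerate range can be reproduced with NumPy alone. The following is a minimal sketch, not the library's code: `calibrate_ranges` and `quantize_int8` are hypothetical helpers that follow the min/max scheme described above, the random `corpus` stands in for real embeddings, and the `np.clip` call is an extra safeguard added for this illustration.

```python
import numpy as np


def calibrate_ranges(batch):
    """Per-dimension (min, max) of a batch, stacked to shape (2, dim)."""
    batch = np.asarray(batch, dtype=np.float32)
    return np.vstack((batch.min(axis=0), batch.max(axis=0)))


def quantize_int8(embeddings, ranges):
    """Map each dimension's [min, max] interval onto the int8 value range."""
    embeddings = np.asarray(embeddings, dtype=np.float32)
    starts = ranges[0]
    steps = (ranges[1] - ranges[0]) / 255
    scaled = (embeddings - starts) / steps - 128
    # Clipping is an assumption of this sketch, guarding against overflow.
    return np.clip(scaled, -128, 127).astype(np.int8)


rng = np.random.default_rng(0)
corpus = rng.normal(size=(64, 8)).astype(np.float32)  # stand-in embeddings
query = corpus[:1]  # a single "text", shape (1, 8)

# Calibrating from the single query: min == max in every dimension,
# so the quantization step size would be zero.
degenerate = calibrate_ranges(query)
print(degenerate[1] - degenerate[0])

# Calibrating from a larger batch gives usable ranges for the same query.
ranges = calibrate_ranges(corpus)
quantized_query = quantize_int8(query, ranges)
print(quantized_query)
```

With a zero step size, the scaled values are undefined (0/0), which is consistent with the all-equal output reported in the issue.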

SentenceTransformersTextEmbedder and SentenceTransformersDocumentEmbedder accept a new optional quantization_ranges init parameter — explicit calibration ranges with shape (2, embedding_dim) (mins in first row, maxs in second).
_SentenceTransformersEmbeddingBackend.embed encodes in float32 and forwards the ranges to sentence_transformers.quantize_embeddings when a quantized precision is used with explicit ranges; behavior is unchanged when quantization_ranges is None.
A warning is logged when int8/uint8 is used without ranges, pointing users at the parameter.
quantization_ranges is included in to_dict/from_dict serialization.
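The control flow described above can be sketched roughly as follows. This is an illustration under stated assumptions, not the code in this PR: `embed_sketch`, `quantize_with_ranges`, and `fake_encode` are hypothetical names, the encoder is a stand-in that ignores `precision`, the shape check and clipping are additions of this sketch, and the real backend forwards the ranges to `quantize_embeddings` rather than quantizing by hand.

```python
import logging

import numpy as np

logger = logging.getLogger("embed_sketch")
QUANTIZED_PRECISIONS = ("int8", "uint8")


def quantize_with_ranges(embeddings, ranges, precision):
    """Scalar-quantize float embeddings using fixed per-dimension ranges."""
    starts = ranges[0]
    steps = (ranges[1] - ranges[0]) / 255
    scaled = (embeddings - starts) / steps
    if precision == "uint8":
        return np.clip(scaled, 0, 255).astype(np.uint8)
    return np.clip(scaled - 128, -128, 127).astype(np.int8)


def embed_sketch(encode, texts, precision="float32", quantization_ranges=None):
    """Hypothetical stand-in for the backend's embed step."""
    if precision in QUANTIZED_PRECISIONS and quantization_ranges is not None:
        # Explicit ranges: encode in float32, then quantize separately.
        ranges = np.asarray(quantization_ranges, dtype=np.float32)
        floats = np.asarray(encode(texts, precision="float32"), dtype=np.float32)
        if ranges.shape != (2, floats.shape[1]):
            raise ValueError("quantization_ranges must have shape (2, embedding_dim)")
        return quantize_with_ranges(floats, ranges, precision)
    if precision in QUANTIZED_PRECISIONS:
        logger.warning(
            "precision=%r without quantization_ranges: ranges are calibrated "
            "from the batch, which is degenerate for a single text.",
            precision,
        )
    # No explicit ranges: defer to the encoder, as before.
    return encode(texts, precision=precision)


def fake_encode(texts, precision="float32"):
    """Deterministic toy encoder; ignores precision and returns float32."""
    rng = np.random.default_rng(0)
    return rng.uniform(-1, 1, size=(len(texts), 4)).astype(np.float32)


explicit_ranges = [[-1.0] * 4, [1.0] * 4]  # mins in first row, maxs in second
quantized = embed_sketch(fake_encode, ["a query"], "int8", explicit_ranges)
unquantized = embed_sketch(fake_encode, ["a query"])
print(quantized, unquantized.dtype)
```

Because the ranges are plain nested lists here, they can be stored in a serialized component dictionary and converted back with `np.asarray` at call time.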

How did you test it?

New unit tests: init/serialization, kwarg forwarding from both embedders, backend quantization path (with and without ranges), warning emission.
New integration test: real tiny model (sentence-transformers-testing/stsb-bert-tiny-safetensors) with int8 + ranges produces a non-degenerate integer embedding.
hatch run test:unit (162 passed), hatch run test:integration -k quantization (2 passed), hatch run test:types and hatch run fmt clean.

Notes for the reviewer

This ports the fix originally prepared against haystack core (fix: support quantization ranges for int8 single-text embeddings haystack#11854); maintainers there indicated the sentence-transformers components now live in this repo.
Ranges are converted with np.asarray before being passed to quantize_embeddings.

Checklist

I have read the contributors guidelines and the code of conduct
I have updated the related issue with new insights and changes
I added unit tests and updated the docstrings
I've used one of the conventional commit types for my PR title: fix:

… embeddings

CLAassistant · 2026-07-03T07:43:16Z

All committers have signed the CLA.

github-actions · 2026-07-03T07:43:33Z

Heads-up for maintainers

This PR is from a fork and touches integrations whose integration tests require API keys.
Those tests are skipped in CI because fork PRs don't have access to repo secrets for security reasons.

Affected integrations:

sentence_transformers

Please run the integration tests locally (hatch run test:integration inside each folder) before approving.

fix: support quantization ranges for int8/uint8 sentence-transformers…

3fc4ea3

… embeddings

DebadityaHait requested a review from a team as a code owner July 3, 2026 07:43

DebadityaHait requested review from julian-risch and removed request for a team July 3, 2026 07:43

github-actions Bot added the integration:sentence-transformers label Jul 3, 2026

github-actions Bot added the type:documentation label (Improvements or additions to documentation) Jul 3, 2026

Merge branch 'main' into fix-st-int8-quantization-ranges

98fd3fd

DebadityaHait wants to merge 2 commits into deepset-ai:main from DebadityaHait:fix-st-int8-quantization-ranges
