[Docs]: release/v1.21.6 doc update by abukhoy · Pull Request #1007 · quic/efficient-transformers

abukhoy · 2026-05-25T04:51:49Z

This Pr is created for updating the release docs of the release branch release/v1.21.6.

Signed-off-by: Abukhoyer Shaik <abukhoye@qti.qualcomm.com>

quic-hemagnih · 2026-05-25T08:34:53Z

+# Efficient Transformer Library - 1.21.6 Release Notes
+
+Welcome to the official release of **Efficient Transformer Library v1.21.6**! This targeted release builds on the v1.21 line with multi-resolution Vision Language Model workflows, Qwen3-VL stability fixes, on-device sampling enablement, and compatibility updates for newer model and framework APIs.
+


Also add online serving support for Gemma4 through vLLM

quic-hemagnih · 2026-05-25T08:36:17Z

+## Key Features & Enhancements
+
+- **Multi-specialization vision compilation for Qwen VLMs**
+  - Qwen2.5-VL, Qwen3-VL Dense, and Qwen3-VL-MoE can compile multiple vision resolution and frame configurations in one pass.


WE should remove QWE3-VL MOE model from this list, as this model is not tested by SIT in this release. We should only keep the models which are vetted by SIT.

quic-hemagnih · 2026-05-25T08:37:07Z

+  - Qwen2.5-VL, Qwen3-VL Dense, and Qwen3-VL-MoE can compile multiple vision resolution and frame configurations in one pass.
+  - `height`, `width`, and `num_frames` can be supplied as lists when building specializations.
+  - Runtime generation can select the matching specialization through the multi-frame generation path.
+  - New example scripts are available for [Qwen2.5-VL](https://github.com/quic/efficient-transformers/tree/release/v1.21.6/examples/image_text_to_text/models/qwen2_5_vl), [Qwen3-VL Dense](https://github.com/quic/efficient-transformers/tree/release/v1.21.6/examples/image_text_to_text/models/qwen3vl), and [Qwen3-VL-MoE](https://github.com/quic/efficient-transformers/tree/release/v1.21.6/examples/image_text_to_text/models/qwen3_vl_moe).


remove qwen3-vl moe example script also. Only keep Qwen2.5VL and QWEN3-VL dense models

quic-hemagnih · 2026-05-25T08:37:46Z

+  - Adds regression coverage for large embedding and reranker model export flows.
+
+- **Qwen VLM runtime stability**
+  - Fixes RoPE handling for Qwen3-VL-MoE disaggregated mode.


remove this line

quic-hemagnih · 2026-05-25T08:38:15Z

+
+- **Gemma3 configuration compatibility**
+  - Updates Gemma3 cache handling for the newer `_sliding_window_pattern` config field.
+  - Preserves sliding-window behavior for Gemma3 models using updated Transformers configs.


add online serving support for Gemma3 through vLLM is added

quic-hemagnih · 2026-05-25T08:39:08Z

+  - Accepts `vision_feature_layer` and `vision_feature_select_strategy` forwarded by newer Transformers Llama4 APIs.
+  - Fixes ONNX export failures for Llama4 vision models while remaining backward compatible.
+
+---


Add GPT OSS 120B with BS>1 and GPT OSS 20B BS>2 support is enabled

release/v1.21.6 doc update

d2d18a6

Signed-off-by: Abukhoyer Shaik <abukhoye@qti.qualcomm.com>

quic-hemagnih requested changes May 25, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Docs]: release/v1.21.6 doc update#1007

[Docs]: release/v1.21.6 doc update#1007
abukhoy wants to merge 1 commit into
quic:release/v1.21.6from
abukhoy:doc-update-v1.21.6

abukhoy commented May 25, 2026

Uh oh!

quic-hemagnih May 25, 2026

Uh oh!

quic-hemagnih May 25, 2026

Uh oh!

quic-hemagnih May 25, 2026

Uh oh!

quic-hemagnih May 25, 2026

Uh oh!

quic-hemagnih May 25, 2026

Uh oh!

quic-hemagnih May 25, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

		# Efficient Transformer Library - 1.21.6 Release Notes

		Welcome to the official release of Efficient Transformer Library v1.21.6! This targeted release builds on the v1.21 line with multi-resolution Vision Language Model workflows, Qwen3-VL stability fixes, on-device sampling enablement, and compatibility updates for newer model and framework APIs.

Conversation

abukhoy commented May 25, 2026

Uh oh!

quic-hemagnih May 25, 2026

Choose a reason for hiding this comment

Uh oh!

quic-hemagnih May 25, 2026

Choose a reason for hiding this comment

Uh oh!

quic-hemagnih May 25, 2026

Choose a reason for hiding this comment

Uh oh!

quic-hemagnih May 25, 2026

Choose a reason for hiding this comment

Uh oh!

quic-hemagnih May 25, 2026

Choose a reason for hiding this comment

Uh oh!

quic-hemagnih May 25, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants