Qwen 3.5 MoE: Add --backend metal export path by manuelcandales · Pull Request #18880 · pytorch/executorch

manuelcandales · 2026-04-14T16:25:44Z

Adds Metal backend support to export.py via --backend metal flag:

_prepare_and_quantize_metal: applies source transforms, quantizes
experts to MLX affine INT4, quantizes non-expert layers with fpa4w
(skips shared_expert_gate with N<4 for prefill compatibility)
_export_metal: exports decode + prefill methods via MetalBackend/
MetalPartitioner

CUDA and MLX paths are unchanged.

Authored with Claude.

[ghstack-poisoned]

manuelcandales · 2026-04-14T16:25:46Z

Stack from ghstack (oldest at bottom):

pytorch-bot · 2026-04-14T16:25:48Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/18880

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❗ 1 Active SEVs

There are 1 currently active SEVs. If your PR is affected, please view them below:

Rolling out OSDC (ARC) runners on pull workflow for PyTorch trunk commits

❌ 3 New Failures, 3 Unrelated Failures

As of commit d70d646 with merge base 5707e2a ():

NEW FAILURES - The following jobs have failed:

Cadence Build & Test / cpu-test / test-aot / test-aot (gh)
backends/cadence/aot/tests/test_replace_ops_passes.py::TestReplaceOpsPasses::test_replace_transposed_conv_with_linear_1
pull / unittest-arm-backend-with-no-deps (test_pytest_ops_tosa) / linux-job (gh)
RuntimeError: Command docker exec -t 9c0022e80bc1708dde5860640085cda36db631c5d322685747f97ce8a83535ca /exec failed with exit code 1
pull / unittest-editable / macos / macos-job (gh)
export/tests/test_target_recipes.py::TestTargetRecipes::test_mv3_model

FLAKY - The following job failed but was likely due to flakiness present on trunk:

pull / test-vulkan-models-linux / linux-job (gh) (detected as infra flaky with no log or failing log classifier)

BROKEN TRUNK - The following jobs failed but was present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

pull / unittest / windows / windows-job (gh) (trunk failure)
##[error]The operation was canceled.
pull / unittest-editable / windows / windows-job (gh) (trunk failure)
##[error]The operation was canceled.

This comment was automatically generated by Dr. CI and updates every 15 minutes.

[ghstack-poisoned]

Update

638edaa

[ghstack-poisoned]

manuelcandales requested a review from lucylq as a code owner April 14, 2026 16:25

meta-cla bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Apr 14, 2026

manuelcandales added 2 commits April 14, 2026 18:23

Update

c9ecdde

[ghstack-poisoned]

Update

d70d646

[ghstack-poisoned]

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Qwen 3.5 MoE: Add --backend metal export path#18880

Qwen 3.5 MoE: Add --backend metal export path#18880
manuelcandales wants to merge 3 commits intogh/manuelcandales/174/headfrom
gh/manuelcandales/175/head

manuelcandales commented Apr 14, 2026

Uh oh!

manuelcandales commented Apr 14, 2026 •

edited

Loading

Uh oh!

pytorch-bot bot commented Apr 14, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

manuelcandales commented Apr 14, 2026

Uh oh!

manuelcandales commented Apr 14, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pytorch-bot bot commented Apr 14, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/18880

❗ 1 Active SEVs

❌ 3 New Failures, 3 Unrelated Failures

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

manuelcandales commented Apr 14, 2026 •

edited

Loading

pytorch-bot bot commented Apr 14, 2026 •

edited

Loading