Qualcomm AI Engine Direct - Minimal Inference Runtime Core Requirment by winskuo-quic · Pull Request #18434 · pytorch/executorch

winskuo-quic · 2026-03-24T02:06:30Z

Summary

Removed from_blob tensor creation
Compile and Linking Option optimization
Function visibility optimization
Expose Power Config to user: This is a quick workaround, however, we should expose more configs for user to set. To make it easier for user to set all configs, we have also perform some refactor on Python API to make it easier to achieve this. Python API refactor PR: Qualcomm AI Engine Direct - Python API Refactor #18312

Test plan

add --direct_build_folder build-hexagon/ at end of any TestQNNQuantizedUtils, TestQNNQuantizedModel, TestQNNFloatingPointModel, TestQNNFloatingPointOperator

Author: @haowhsu-quic, @shewu-quic, @winskuo-quic

pytorch-bot · 2026-03-24T02:06:34Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/18434

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❌ 1 New Failure, 2 Unrelated Failures

As of commit 3c9a652 with merge base c7f1d72 ():

NEW FAILURE - The following job has failed:

Cadence Build & Test / cpu-test / test-ops / test-ops (gh)
examples/cadence/operators/test_g3_ops.py::ATenOpTestCases::test_g3_neg_out_4

BROKEN TRUNK - The following jobs failed but were present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

pull / unittest / windows / windows-job (gh) (trunk failure)
##[error]The operation was canceled.
pull / unittest-editable / windows / windows-job (gh) (trunk failure)
##[error]The operation was canceled.

This comment was automatically generated by Dr. CI and updates every 15 minutes.

github-actions · 2026-03-24T02:07:15Z

This PR needs a `release notes:` label

If your change should be included in the release notes (i.e. would users of this library care about this change?), please use a label starting with release notes:. This helps us keep track and include your important work in the next release notes.

To add a label, you can comment to pytorchbot, for example
@pytorchbot label "release notes: none"

For more information, see
https://github.com/pytorch/pytorch/wiki/PyTorch-AutoLabel-Bot#why-categorize-for-release-notes-and-how-does-it-work.

abhinaykukkadapu · 2026-03-26T19:26:36Z

@winskuo-quic can you please rebase.

winskuo-quic · 2026-03-27T09:42:40Z

@winskuo-quic can you please rebase.

I have rebased. Thanks

meta-codesync · 2026-04-03T03:54:52Z

@abhinaykukkadapu has imported this pull request. If you are a Meta employee, you can view this in D99394613.

abhinaykukkadapu · 2026-04-03T03:59:21Z

backends/qualcomm/runtime/QnnManager.cpp

-      auto dump_tensor = executorch::extension::from_blob(
-          QNN_TENSOR_VER_PTR(output_tensor)->clientBuf.data,
-          sizes,
+      std::vector<executorch::aten::StridesType> stride_size(sizes.size(), 0);


@winskuo-quic do we want to init zeros for stride?

Updated to align with from_blob behavior. Thanks

1. Removed from_blob tensor creation 2. Compile and Linking Option optimization 3. Function visibility optimization 4. Expose Power Config to user

winskuo-quic requested review from abhinaykukkadapu, cccclai, kirklandsign and larryliu0820 as code owners March 24, 2026 02:06

meta-cla bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Mar 24, 2026

winskuo-quic force-pushed the dev1/winskuo/reduce_library_size branch from b52b847 to 0f916c6 Compare March 27, 2026 09:42

winskuo-quic marked this pull request as draft March 31, 2026 05:08

winskuo-quic force-pushed the dev1/winskuo/reduce_library_size branch from 0f916c6 to 4883181 Compare March 31, 2026 06:25

winskuo-quic changed the title ~~Qualcomm AI Engine Direct - Minimal Inerence Runtime Core Requirment~~ Qualcomm AI Engine Direct - Minimal Inference Runtime Core Requirment Mar 31, 2026

winskuo-quic marked this pull request as ready for review March 31, 2026 06:27

abhinaykukkadapu reviewed Apr 3, 2026

View reviewed changes

winskuo-quic added 3 commits April 7, 2026 10:34

Qualcomm AI Engine Direct - Minimal Inerence Runtime Core Requirment

83486b9

1. Removed from_blob tensor creation 2. Compile and Linking Option optimization 3. Function visibility optimization 4. Expose Power Config to user

Fix External CI

8abd765

Update stride computation

3c9a652

winskuo-quic force-pushed the dev1/winskuo/reduce_library_size branch from 4883181 to 3c9a652 Compare April 7, 2026 02:48

abhinaykukkadapu approved these changes Apr 7, 2026

View reviewed changes

abhinaykukkadapu merged commit 788be2d into pytorch:main Apr 8, 2026
174 of 180 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Qualcomm AI Engine Direct - Minimal Inference Runtime Core Requirment#18434

Qualcomm AI Engine Direct - Minimal Inference Runtime Core Requirment#18434
abhinaykukkadapu merged 3 commits intopytorch:mainfrom
CodeLinaro:dev1/winskuo/reduce_library_size

winskuo-quic commented Mar 24, 2026 •

edited

Loading

Uh oh!

pytorch-bot bot commented Mar 24, 2026 •

edited

Loading

Uh oh!

github-actions bot commented Mar 24, 2026

Uh oh!

abhinaykukkadapu commented Mar 26, 2026

Uh oh!

winskuo-quic commented Mar 27, 2026

Uh oh!

meta-codesync bot commented Apr 3, 2026

Uh oh!

abhinaykukkadapu Apr 3, 2026

Uh oh!

winskuo-quic Apr 7, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

winskuo-quic commented Mar 24, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Test plan

Uh oh!

pytorch-bot bot commented Mar 24, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/18434

❌ 1 New Failure, 2 Unrelated Failures

Uh oh!

github-actions bot commented Mar 24, 2026

This PR needs a release notes: label

Uh oh!

abhinaykukkadapu commented Mar 26, 2026

Uh oh!

winskuo-quic commented Mar 27, 2026

Uh oh!

meta-codesync bot commented Apr 3, 2026

Uh oh!

abhinaykukkadapu Apr 3, 2026

Choose a reason for hiding this comment

Uh oh!

winskuo-quic Apr 7, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

winskuo-quic commented Mar 24, 2026 •

edited

Loading

pytorch-bot bot commented Mar 24, 2026 •

edited

Loading

This PR needs a `release notes:` label