Title: Add MobileNetV2 pytest for Cortex-M backend #17300

psiddh · 2026-02-08T17:38:11Z

Summary:

Added RemoveNoopPass (from ARM backend) to CortexMPassManager — it checks that
_clone_dim_order has matching input/output dtypes before removing it, so it won't strip any
actual type or dim_order conversions. This eliminates the identity clone from MV2's
flatten/view and unblocks the runtime.

Lowered qtol from 20 to 10 — passes reliably. Didn't go lower since calibration data is
random (torch.randn) so the numerical diff varies across runs. The argmax assertion provides
an additional correctness check independent of tolerance.

Removed xfail from test_dialect_mv2 (stable now), kept it on test_implementation_mv2 with
strict=False for potential flakiness in the serialization/runtime path.

Test plan

pytest -c backends/arm/test/pytest.ini backends/cortex_m/test/models/test_mobilenet_v2.py -v

pytorch-bot · 2026-02-08T17:38:15Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/17300

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❌ 10 New Failures, 1 Cancelled Job, 1 Unrelated Failure

As of commit 882c4c1 with merge base ba2516c ():

NEW FAILURES - The following jobs have failed:

pull / test-moshi-linux / linux-job (gh)
RuntimeError: Command docker exec -t 3fff76b625fb4d30042b99abc387f80686cdaf10e910ff9d8a8da7db396a288c /exec failed with exit code 1
pull / test-samsung-models-linux / linux-job (gh)
RuntimeError: Command docker exec -t 5b03df56700cc11a4ddb8401d6b94bed75500de314d2ab8d03990ecc28e767a6 /exec failed with exit code 127
Test CUDA Builds / test-model-cuda-e2e (openai, whisper-large-v3-turbo, non-quantized) / linux-job (gh)
RuntimeError: Command docker exec -t be16fd91333aa8bd0c089bbf15214810fc3bd35483d0f9b881e23788a944f9fe /exec failed with exit code 1
Test CUDA Builds / test-model-cuda-e2e (openai, whisper-large-v3-turbo, quantized-int4-tile-packed) / linux-job (gh)
RuntimeError: Command docker exec -t bb782a3809c52658a682e5bfbc566ac836e96cb1e4a687d673163639f0420273 /exec failed with exit code 1
Test CUDA Builds / test-model-cuda-e2e (openai, whisper-large-v3-turbo, quantized-int4-weight-only) / linux-job (gh)
RuntimeError: Command docker exec -t 767220b3b3f4254a09374888d3d60df22dae492316bbe5d10cac932946eefbc1 /exec failed with exit code 1
Test CUDA Builds / test-model-cuda-e2e (openai, whisper-small, non-quantized) / linux-job (gh)
RuntimeError: Command docker exec -t 31ba1c274e2cd2f4d8c7f498b00f62849db35178f804a75cf08e8ebd73db384c /exec failed with exit code 1
Test CUDA Builds / test-model-cuda-e2e (openai, whisper-small, quantized-int4-tile-packed) / linux-job (gh)
RuntimeError: Command docker exec -t 0bdefff9fe54ac2f48a00f007bdff7c53f074aafe69fb620397ba3d5f36ac961 /exec failed with exit code 1
Test CUDA Builds / test-model-cuda-e2e (openai, whisper-small, quantized-int4-weight-only) / linux-job (gh)
RuntimeError: Command docker exec -t 6eec09bbc69d996f173e27db9d17576edd62b77b3da4acf0db9eca4b211dc84e /exec failed with exit code 1
Test Metal Backend / test-model-metal-e2e (openai, whisper-large-v3-turbo, non-quantized) / macos-job (gh)
RuntimeError: Command bash /Users/ec2-user/runner/_work/_temp/exec_script failed with exit code 1
Test Metal Backend / test-model-metal-e2e (openai, whisper-small, non-quantized) / macos-job (gh)
RuntimeError: Command bash /Users/ec2-user/runner/_work/_temp/exec_script failed with exit code 1

CANCELLED JOB - The following job was cancelled. Please retry:

pull / test-samsung-quantmodels-linux / linux-job (gh)
##[error]The operation was canceled.

FLAKY - The following job failed but was likely due to flakiness present on trunk:

pull / test-models-linux (linear, xnnpack-quantization-delegation, linux.2xlarge) / linux-job (gh) (detected as infra flaky with no log or failing log classifier)

This comment was automatically generated by Dr. CI and updates every 15 minutes.

github-actions · 2026-02-08T17:38:52Z

This PR needs a `release notes:` label

If your change should be included in the release notes (i.e. would users of this library care about this change?), please use a label starting with release notes:. This helps us keep track and include your important work in the next release notes.

To add a label, you can comment to pytorchbot, for example
@pytorchbot label "release notes: none"

For more information, see
https://github.com/pytorch/pytorch/wiki/PyTorch-AutoLabel-Bot#why-categorize-for-release-notes-and-how-does-it-work.

psiddh · 2026-02-08T17:41:55Z

Note : test_implementation_mv2 is marked XFAIL because MV2's graph contains a dim_order_ops._clone_dim_order.default
op (from the flatten/view before the classifier) that MV3 doesn't have. The runtime doesn't include a kernel for
this op (use_portable_ops=False).

Should we add ARM's RemoveNoopPass to CortexMPassManager to remove identity clone ops.

AdrianLundell · 2026-02-09T09:36:36Z

Yes if it is a noop I think it makes sense to remove, just make sure that it does not do any dim_order/ type conversion first. Also see if you can lower the numerical tolerance compared to mv3 and still get it to pass, would be interesting to know.

rascani · 2026-02-10T19:13:49Z

backends/cortex_m/test/models/test_mobilenet_v2.py

+    xfails={
+        "mobilenet_v2": "MLETORCH-XXX - Investigate mobilenet_v2 flakiness"
+    },


Shouldn't this be on the test_implementation_mv2?

Yes you're right, Removed xfail from test_dialect_mv2 (stable now), kept it on test_implementation_mv2 with
strict=False for potential flakiness in the serialization/runtime path.

Copilot

Pull request overview

Adds a new MobileNetV2 model test for the Cortex-M backend and updates the Cortex-M pass pipeline to remove certain no-op operations during lowering, aiming to unblock the MV2 runtime path.

Changes:

Add a new test_mobilenet_v2.py pytest covering both dialect and implementation paths (with calibration + top-1 argmax sanity check).
Add RemoveNoopPass to CortexMPassManager pass list to eliminate identity _clone_dim_order/related no-op nodes.

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 3 comments.

File	Description
backends/cortex_m/test/models/test_mobilenet_v2.py	Introduces MobileNetV2 Cortex-M tests (dialect + implementation) with op-count assertions and calibration.
backends/cortex_m/passes/cortex_m_pass_manager.py	Inserts `RemoveNoopPass` into the Cortex-M lowering pass sequence to remove no-op ops earlier in the pipeline.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot · 2026-02-11T01:02:30Z

backends/cortex_m/test/models/test_mobilenet_v2.py

+    "test_case",
+    test_cases,
+    xfails={
+        "mobilenet_v2": "MLETORCH-XXX - Investigate mobilenet_v2 flakiness"


The xfail reason uses a placeholder ticket id ("MLETORCH-XXX"), which makes it hard to track and resolve the expected failure. Please replace it with the real issue/task identifier (or a concrete explanation if no tracker exists).

Suggested change

"mobilenet_v2": "MLETORCH-XXX - Investigate mobilenet_v2 flakiness"

"mobilenet_v2": "Flaky on Cortex-M due to nondeterministic quantized kernels; keep xfailed until kernels are made deterministic or flakiness is resolved (no external ticket)."

backends/cortex_m/test/models/test_mobilenet_v2.py

Copilot · 2026-02-11T01:02:31Z

backends/cortex_m/passes/cortex_m_pass_manager.py

    pass_list: list[ExportPass] = [
        # Run before folding so qparams attach to max_pool2d values, not tuple + getitem.
        RemoveGetItemPass,
        FoldAndAnnotateQParamsPass,
+        RemoveNoopPass,
        ReplaceScalarWithTensorArgPass,
        ReplaceQuantNodesPass,
        ActivationFusionPass,


Adding RemoveNoopPass here may be unsafe for Cortex-M graphs: the current implementation in backends/arm/_passes/remove_noop_pass.py removes dim_order_ops._clone_dim_order / _to_dim_order_copy based only on dtype, without checking whether dim_order (layout) actually changes. That can drop real dim-order conversions. It will also likely change existing Cortex-M op-count expectations (e.g., tests that currently expect _clone_dim_order to remain after transforms). Consider either (a) extending/wrapping the pass to verify dim_order equivalence via node meta (similar to backends/transforms/remove_clone_ops.py), or (b) limiting removal to proven-identity cases, and update affected op-count tests accordingly.

Summary: Added RemoveNoopPass (from ARM backend) to CortexMPassManager — it checks that _clone_dim_order has matching input/output dtypes before removing it, so it won't strip any actual type or dim_order conversions. This eliminates the identity clone from MV2's flatten/view and unblocks the runtime. Lowered qtol from 20 to 10 — passes reliably. Didn't go lower since calibration data is random (torch.randn) so the numerical diff varies across runs. The argmax assertion provides an additional correctness check independent of tolerance. Removed xfail from test_dialect_mv2 (stable now), kept it on test_implementation_mv2 with strict=False for potential flakiness in the serialization/runtime path.

… (introduced in #17300) which broke 8 quantization tests. After #17300 merged, the following tests started failing: - `test_shared_qspec_quantizer[input_fork_x_shared]` - `test_shared_qspec_quantizer[input_fork_y_shared]` - `test_shared_qspec_quantizer[surrounded_quantized_op]` - `test_shared_qspec_quantizer[output_fork_shared]` - `test_shared_qspec_quantizer[many_forks]` - `test_dialect_mv2[mobilenet_v2]` (+ 2 more) `RemoveNoopPass` removes `clone_dim_order` operators that are functionally necessary for shared quantization specs. It only checks dtype equality, not dim_order/layout changes, causing it to incorrectly remove clones needed for: - Tensor forking in quantized graphs - Shared quantization parameter propagation - Correct quantization fusion Remove `RemoveNoopPass` from the Cortex-M pass pipeline. The MobileNetV2 `clone_dim_order` issue it was meant to address needs a more targeted solution. ```bash pytest -c backends/arm/test/pytest.ini backends/cortex_m/test/misc/test_quantization.py::test_shared_qspec_quantizer -v pytest -c backends/arm/test/pytest.ini backends/cortex_m/test/models/test_mobilenet_v2.py::test_dialect_mv2 -v

#17407) … (introduced in #17300) which broke 8 quantization tests. After #17300 merged, the following tests started failing: - `test_shared_qspec_quantizer[input_fork_x_shared]` - `test_shared_qspec_quantizer[input_fork_y_shared]` - `test_shared_qspec_quantizer[surrounded_quantized_op]` - `test_shared_qspec_quantizer[output_fork_shared]` - `test_shared_qspec_quantizer[many_forks]` - `test_dialect_mv2[mobilenet_v2]` (+ 2 more) `RemoveNoopPass` removes `clone_dim_order` operators that are functionally necessary for shared quantization specs. It only checks dtype equality, not dim_order/layout changes, causing it to incorrectly remove clones needed for: - Tensor forking in quantized graphs - Shared quantization parameter propagation - Correct quantization fusion Remove `RemoveNoopPass` from the Cortex-M pass pipeline. The MobileNetV2 `clone_dim_order` issue it was meant to address needs a more targeted solution. ```bash pytest -c backends/arm/test/pytest.ini backends/cortex_m/test/misc/test_quantization.py::test_shared_qspec_quantizer -v pytest -c backends/arm/test/pytest.ini backends/cortex_m/test/models/test_mobilenet_v2.py::test_dialect_mv2 -v Co-authored-by: Github Executorch <[email protected]>

meta-cla bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Feb 8, 2026

psiddh requested a review from AdrianLundell February 8, 2026 17:38

psiddh force-pushed the test_mv2 branch from 06fc349 to 481a5b9 Compare February 9, 2026 06:41

psiddh force-pushed the test_mv2 branch from 481a5b9 to 886fed9 Compare February 10, 2026 17:14

psiddh requested a review from rascani February 10, 2026 17:41

psiddh mentioned this pull request Feb 10, 2026

Alif E8 board --> Run MV2 #16628

Open

rascani reviewed Feb 10, 2026

View reviewed changes

psiddh force-pushed the test_mv2 branch from 886fed9 to 59b20c6 Compare February 11, 2026 00:54

psiddh marked this pull request as ready for review February 11, 2026 00:56

Copilot AI review requested due to automatic review settings February 11, 2026 00:56

Copilot started reviewing on behalf of psiddh February 11, 2026 00:57 View session

Copilot AI reviewed Feb 11, 2026

View reviewed changes

psiddh force-pushed the test_mv2 branch from 59b20c6 to 882c4c1 Compare February 11, 2026 02:02

rascani approved these changes Feb 11, 2026

View reviewed changes

psiddh merged commit 9bef7d3 into main Feb 11, 2026
175 of 187 checks passed

psiddh deleted the test_mv2 branch February 11, 2026 18:34

psiddh mentioned this pull request Feb 12, 2026

Reverts the addition of RemoveNoopPass to the Cortex-M pass manager… #17407

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Title: Add MobileNetV2 pytest for Cortex-M backend #17300

Title: Add MobileNetV2 pytest for Cortex-M backend #17300

Uh oh!

psiddh commented Feb 8, 2026 •

edited

Loading

Uh oh!

pytorch-bot bot commented Feb 8, 2026 •

edited

Loading

Uh oh!

github-actions bot commented Feb 8, 2026

Uh oh!

psiddh commented Feb 8, 2026

Uh oh!

AdrianLundell commented Feb 9, 2026

Uh oh!

rascani Feb 10, 2026

Uh oh!

psiddh Feb 11, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI Feb 11, 2026

Uh oh!

Uh oh!

Copilot AI Feb 11, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

	"mobilenet_v2": "MLETORCH-XXX - Investigate mobilenet_v2 flakiness"
	"mobilenet_v2": "Flaky on Cortex-M due to nondeterministic quantized kernels; keep xfailed until kernels are made deterministic or flakiness is resolved (no external ticket)."

Title: Add MobileNetV2 pytest for Cortex-M backend #17300

Title: Add MobileNetV2 pytest for Cortex-M backend #17300

Uh oh!

Conversation

psiddh commented Feb 8, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Test plan

Uh oh!

pytorch-bot bot commented Feb 8, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/17300

❌ 10 New Failures, 1 Cancelled Job, 1 Unrelated Failure

Uh oh!

github-actions bot commented Feb 8, 2026

This PR needs a release notes: label

Uh oh!

psiddh commented Feb 8, 2026

Uh oh!

AdrianLundell commented Feb 9, 2026

Uh oh!

rascani Feb 10, 2026

Choose a reason for hiding this comment

Uh oh!

psiddh Feb 11, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Copilot AI Feb 11, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Copilot AI Feb 11, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

psiddh commented Feb 8, 2026 •

edited

Loading

pytorch-bot bot commented Feb 8, 2026 •

edited

Loading

This PR needs a `release notes:` label