[Backend Tester] Clean up a few test issues #13258

GregoryComer · 2025-08-09T19:09:21Z

There are a few broken tests that need cleaning up. Some are failing due to missing portable kernels. These tests are now skipped if any unsupported portable ops remain post-delegation. I also fixed a few other small issues and bumped the element-wise tolerance to reduce false positives. SNR should hopefully catch most blatant correctness issues. The fp16 and quantized tests can generate occasional high element-wise error but still have decent SNR (~60+).

[ghstack-poisoned]

GregoryComer · 2025-08-09T19:09:22Z

Stack from ghstack (oldest at bottom):

pytorch-bot · 2025-08-09T19:09:24Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/13258

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❌ 3 New Failures

As of commit 5e92884 with merge base 3a02146 ():

NEW FAILURES - The following jobs have failed:

Build documentation / build (buck2) / Build doc (gh)
At least one of the pre-conditions you specified did not hold
pull / unittest-arm-backend-with-no-fvp (test_pytest_models) / linux-job (gh)
backends/arm/test/models/stable_diffusion/test_vae_AutoencoderKL.py::TestAutoencoderKL::test_AutoencoderKL_tosa_MI
pull / unittest-arm-backend-with-no-fvp (test_pytest_ops) / linux-job (gh)
RuntimeError: Command docker exec -t ad298f438cebbaa01bb45711de0affa147c207da998d1eea6d16b84c34333ccf /exec failed with exit code 1

This comment was automatically generated by Dr. CI and updates every 15 minutes.

ghstack-source-id: 0a3c4cc ghstack-comment-id: 3172044664 Pull-Request: #13258

[ghstack-poisoned]

ghstack-source-id: f04ccde ghstack-comment-id: 3172044664 Pull-Request: #13258

[ghstack-poisoned]

ghstack-source-id: 5c06eb7 ghstack-comment-id: 3172044664 Pull-Request: #13258

[ghstack-poisoned]

ghstack-source-id: 8c0ec06 ghstack-comment-id: 3172044664 Pull-Request: #13258

[ghstack-poisoned]

ghstack-source-id: 9ac560d ghstack-comment-id: 3172044664 Pull-Request: #13258

[ghstack-poisoned]

digantdesai · 2025-08-12T11:35:10Z

backends/test/suite/runner.py

@@ -9,6 +9,14 @@

 import torch

+# Set of unsupported ops that should cause tests to be skipped
+UNSUPPORTED_PORTABLE_OPS = {


If we are adding a Portable flow, how would these show up there? Some PTE_FAIL?

digantdesai · 2025-08-12T11:35:59Z

backends/test/suite/runner.py

@@ -142,12 +159,15 @@ def build_result(
            tester.run_method_and_compare_outputs(
                inputs=None if generate_random_test_inputs else inputs,
                statistics_callback=lambda stats: error_statistics.append(stats),
+                atol=1e-1,


atol seems pretty high for a general default, no?

The fp16 and quantized tests can generate occasional high element-wise error but still have decent SNR (~60+).

Do you know how this is tested on PyTorch/PyTorch side? >60 SNR is good but outliers are not great esp for individual ops, if they are expected I would prefer if we set them per test basis which would allow us to reason about the math being done on that specific test warrenting high ATOL/RTOL.

Update

c99c41a

[ghstack-poisoned]

meta-cla bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Aug 9, 2025

GregoryComer added a commit that referenced this pull request Aug 9, 2025

[Backend Tester] Clean up a few test issues

12d5de2

ghstack-source-id: 0a3c4cc ghstack-comment-id: 3172044664 Pull-Request: #13258

GregoryComer requested a review from digantdesai August 9, 2025 19:11

Update

bf57d6c

[ghstack-poisoned]

GregoryComer added a commit that referenced this pull request Aug 9, 2025

[Backend Tester] Clean up a few test issues

b7324ca

ghstack-source-id: f04ccde ghstack-comment-id: 3172044664 Pull-Request: #13258

Update

0e162ab

[ghstack-poisoned]

GregoryComer added a commit that referenced this pull request Aug 11, 2025

[Backend Tester] Clean up a few test issues

f4bc7b9

ghstack-source-id: 5c06eb7 ghstack-comment-id: 3172044664 Pull-Request: #13258

Update

c6bd56b

[ghstack-poisoned]

GregoryComer added a commit that referenced this pull request Aug 11, 2025

[Backend Tester] Clean up a few test issues

248d1bc

ghstack-source-id: 8c0ec06 ghstack-comment-id: 3172044664 Pull-Request: #13258

Update

144a8ae

[ghstack-poisoned]

GregoryComer added a commit that referenced this pull request Aug 12, 2025

[Backend Tester] Clean up a few test issues

d28c9c6

ghstack-source-id: 9ac560d ghstack-comment-id: 3172044664 Pull-Request: #13258

GregoryComer added a commit that referenced this pull request Aug 12, 2025

[Backend Tester] Clean up a few test issues

e485eff

ghstack-source-id: 9ac560d ghstack-comment-id: 3172044664 Pull-Request: #13258

This was referenced Aug 12, 2025

[Backend Tester] Clean up report output #13306

Open

[Backend Tester] Write report progressively #13308

Open

Update

ffaa1c3

[ghstack-poisoned]

GregoryComer mentioned this pull request Aug 12, 2025

Link Vulkan backend with pybinding lib when built #13309

Open

GregoryComer marked this pull request as ready for review August 12, 2025 04:29

GregoryComer requested a review from cccclai as a code owner August 12, 2025 04:29

This was referenced Aug 12, 2025

[Backend Tester] Add subtest index field #13311

Open

[Backend Tester] Reduce log verbosity / spam #13312

Open

Update

5e92884

[ghstack-poisoned]

GregoryComer mentioned this pull request Aug 12, 2025

[Backend Tester] Seed based on test name #13313

Open

digantdesai reviewed Aug 12, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Backend Tester] Clean up a few test issues #13258

[Backend Tester] Clean up a few test issues #13258

GregoryComer commented Aug 9, 2025 •

edited

Loading

Uh oh!

GregoryComer commented Aug 9, 2025 •

edited

Loading

Uh oh!

pytorch-bot bot commented Aug 9, 2025 •

edited

Loading

Uh oh!

digantdesai Aug 12, 2025

Uh oh!

digantdesai Aug 12, 2025 •

edited

Loading

Uh oh!

Uh oh!

[Backend Tester] Clean up a few test issues #13258

Are you sure you want to change the base?

[Backend Tester] Clean up a few test issues #13258

Conversation

GregoryComer commented Aug 9, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

GregoryComer commented Aug 9, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pytorch-bot bot commented Aug 9, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/13258

❌ 3 New Failures

Uh oh!

digantdesai Aug 12, 2025

Choose a reason for hiding this comment

Uh oh!

digantdesai Aug 12, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

GregoryComer commented Aug 9, 2025 •

edited

Loading

GregoryComer commented Aug 9, 2025 •

edited

Loading

pytorch-bot bot commented Aug 9, 2025 •

edited

Loading

digantdesai Aug 12, 2025 •

edited

Loading