Creating lists of intermittently failing tests #6472

adamfarley · 2025-07-31T12:30:16Z

This change lists known unreliable tests in a consistent format.

Resolves #6471

adamfarley · 2025-08-11T09:51:51Z

Requesting reviews from @ShelleyLambert and @sophia-guo please.

smlambert · 2025-08-11T11:38:12Z

Thanks @adamfarley - a heads-up that my active GitHub tag is @smlambert ... but I did see this review request in any event.

smlambert

Thank you for compiling this list @adamfarley. Only openjdk test case failures are handled by the jtreg exclude ProblemList format. The other test groups that are part of AQAvit are excluded via playlist exclusions.

I would have expected the perf and system targets to be listed via playlist files (which do not use a different file for each version as the problem lists do).

I guess it depends on what we intend to do with these lists. If we are adding a feature to exclude unreliable targets on the fly, or post-process failures against a list, we have to consider what format to use, and how to organize them within the directory (as different vendors may eventually have different lists).
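To make the post-processing option concrete, here is a minimal sketch, assuming both the failures and the list have already been reduced to plain test names. The function name `triage_failures` and the sample test names are hypothetical and not part of AQAvit.

```python
def triage_failures(failed_tests, known_unreliable):
    """Split failures into known-intermittent ones and unexpected ones.

    Both arguments are iterables of test names (plain strings).
    Returns two sorted lists: (intermittent, unexpected).
    """
    known = set(known_unreliable)
    intermittent = sorted(t for t in failed_tests if t in known)
    unexpected = sorted(t for t in failed_tests if t not in known)
    return intermittent, unexpected


# Hypothetical test names, for illustration only.
failed = ["java/net/Foo.java", "java/util/Bar.java"]
unreliable = ["java/net/Foo.java", "java/io/Baz.java"]
intermittent, unexpected = triage_failures(failed, unreliable)
```

Nothing in this comparison depends on how the list is stored, so the choice of file format and directory layout stays open.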

adamfarley · 2025-08-11T14:03:19Z

> Thank you for compiling this list @adamfarley. Only openjdk test case failures are handled by the jtreg exclude ProblemList format. The other test groups that are part of AQAvit are excluded via playlist exclusions.

I noticed that. For Perf and Systemtest, I used the jtreg format for consistency rather than direct usability (though a consistent format does make any future parsing easier).

> I would have expected the perf and system targets to be listed via playlist files (which do not use a different file for each version as the problem lists do).

I considered that, but opted not to because I'm unaware of a mechanism for disabling targets via the playlist files only some of the time. As far as I know, for a given vendor, platform, and version, a target is either enabled or disabled. With openjdk tests, by contrast, an extra exclude file can be included dynamically, so we can choose per run whether to exclude a test.
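As a rough illustration of the "extra exclude file" idea, here is a minimal sketch, assuming jtreg's `-exclude:<file>` option can be passed more than once. The helper name `jtreg_exclude_args` and both file names are hypothetical, and this is not how the AQAvit build scripts are actually wired.

```python
def jtreg_exclude_args(base_exclude, unreliable_exclude, skip_unreliable):
    """Build the -exclude arguments for one jtreg invocation.

    base_exclude:       the regular problem list, always applied
    unreliable_exclude: the list of intermittently failing tests
    skip_unreliable:    True to also exclude the unreliable tests in this run
    """
    args = [f"-exclude:{base_exclude}"]
    if skip_unreliable:
        # Only add the second list when this run should skip unreliable tests.
        args.append(f"-exclude:{unreliable_exclude}")
    return args


# Hypothetical file names, for illustration only.
release_args = jtreg_exclude_args("ProblemList.txt", "unreliable.txt", True)
nightly_args = jtreg_exclude_args("ProblemList.txt", "unreliable.txt", False)
```

The same test target can then run with or without the unreliable tests, depending on a single flag chosen per run.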

> I guess it depends on what we intend to do with these lists. If we are adding a feature to exclude unreliable targets on the fly, or post-process failures against a list, we have to consider what format to use, and how to organize them within the directory (as different vendors may eventually have different lists).

My first priority with this task was to provide a unified list of all intermittently failing tests.

The lists should use a single format to allow parsing and easy reading.
The lists should be stored in the repository to allow use at runtime.
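
As an illustration of why a single format helps with parsing, here is a minimal sketch, assuming each entry follows the jtreg ProblemList layout of `<test> <issue reference> <comma-separated platforms>`, with `#` at the start of a line marking a comment. The names `Entry` and `parse_problem_list` and the sample entries are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Entry:
    test: str         # test path, possibly with a "#id" suffix
    issue: str        # issue reference (bug id or URL)
    platforms: tuple  # platform specifiers, e.g. ("linux-all",)


def parse_problem_list(text):
    """Parse ProblemList-style text into a list of Entry records."""
    entries = []
    for raw in text.splitlines():
        line = raw.strip()
        # Skip blank lines and whole-line comments. A '#' inside a test
        # name (e.g. Test.java#id0) is not treated as a comment.
        if not line or line.startswith("#"):
            continue
        fields = line.split()
        if len(fields) < 3:
            raise ValueError(f"malformed entry: {raw!r}")
        test, issue, platforms = fields[0], fields[1], fields[2]
        entries.append(Entry(test, issue, tuple(platforms.split(","))))
    return entries


# Hypothetical entries, for illustration only.
sample = """# intermittent failures
java/net/Foo.java https://example.invalid/issues/1 linux-all,windows-x64

jdk/Bar.java#id0 https://example.invalid/issues/2 generic-all
"""
entries = parse_problem_list(sample)
```

With every list in one layout, a single parser like this could feed both an exclude step and any later reporting.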

Specific uses were a secondary priority. My first thought was that they could be used as exclude files during releases (or during reruns, or reruns during a release), but I didn't want to sidetrack myself looking too much into applications. I figured that if we did pick an application for this data, the best thing I can do right now is to use a consistent format across the board, to make conversion/parsing easier later on.

I'll add a note to the General Retrospective to start a discussion about some of our options here.

smlambert · 2025-08-11T14:25:10Z

openjdk jtreg problemlists are the 'odd man out'... no other test group uses that format

I do not want to store non-openjdk test material in problemlist format as it might lead people to think that is the commonly used format, which it is not.

Agree that we need to have a requirements gathering session and design discussion on this (in terms of formats, and how to structure and where to locate such files). If this is merely a form of documentation, and not a plan to use the lists for debugging or temporary excludes, then likely all could be in a doc folder.

adamfarley · 2025-08-11T14:52:57Z

> openjdk jtreg problemlists are the 'odd man out'... no other test group uses that format

> I do not want to store non-openjdk test material in problemlist format as it might lead people to think that is the commonly used format, which it is not.

Fair. Perhaps I was over-focused on the OpenJDK ProblemList format due to the majority of the intermittent failures being OpenJDK tests.

> Agree that we need to have a requirements gathering session and design discussion on this (in terms of formats, and how to structure and where to locate such files). If this is merely a form of documentation, and not a plan to use the lists for debugging or temporary excludes, then likely all could be in a doc folder.

In my mind, this is a debug/exclude tool first. Mind if I get the ball rolling on Slack?

adamfarley added 11 commits July 31, 2025 13:29

Creating unreliables files and adding first set of infrequent fails

2e3a153

This change lists known unreliable tests in a consistent format. Signed-off-by: Adam Farley <[email protected]>

Adding unreliables for system test and openjdk on jdk8

05969dd

For the July 2025 release. Signed-off-by: Adam Farley <[email protected]>

Unreliables for JDK11 and added unreliable files for perf tests

0c495a3

Signed-off-by: Adam Farley <[email protected]>

Merge remote-tracking branch 'upstream/master' into intermittent_test…

595357c

…_failures_list

Unreliables for JDK17

aad78e5

Signed-off-by: Adam Farley <[email protected]>

Unreliables for JDK21

ea8be0c

Signed-off-by: Adam Farley <[email protected]>

Unreliables for JDK25

04df342

Signed-off-by: Adam Farley <[email protected]>

Relocating unreliable test reference to different target

bfda753

Signed-off-by: Adam Farley <[email protected]>

Consolidating any duplicate unreliables

0bbaffa

Signed-off-by: Adam Farley <[email protected]>

Removing excluded tests and adding duplicate finding tool

10e348c

Signed-off-by: Adam Farley <[email protected]>

Unreliables from other non-release issues

8589506

Signed-off-by: Adam Farley <[email protected]>

adamfarley changed the title ~~WIP: Creating unreliables files and adding first set of infrequent fails~~ Creating lists of intermittent tests Aug 11, 2025

adamfarley changed the title ~~Creating lists of intermittent tests~~ Creating lists of intermittently failing tests Aug 11, 2025

smlambert requested changes Aug 11, 2025

adamfarley mentioned this pull request Aug 11, 2025

General Retrospective for July 2025 Release adoptium/temurin#84

Closed

8 tasks
