Making testing requirements clearer by PawelPlesniak · Pull Request #673 · DUNE-DAQ/drunc

PawelPlesniak · 2025-11-11T14:09:30Z

Description

Changes the structure of the PR template to prioritize the testing, introduces the requirements from other repos as a field.

No tests or further checks have been run, as this is a template issue, and does not affect the core code.

Type of change

Documentation (non-breaking change that adds or improves the documentation)
New feature (non-breaking change which adds functionality)
Optimization (non-breaking, back-end change that speeds up the code)
Bug fix (non-breaking change which fixes an issue)
Breaking change (whatever its nature)

Key checklist

All tests pass (eg. python -m pytest)
Pre-commit hooks run successfully (eg. pre-commit run --all-files)

Further checks

Code is commented, particularly in hard-to-understand areas
Tests added or an issue has been opened to tackle that in the future.
(Indicate issue here: # (issue))

PawelPlesniak · 2025-11-20T15:15:37Z

@jamesturner246 @Aurashk @miruuna this is the PR to clarify what tests should be performed for PR merges. This will update the PR template so the testing requiremenst are clearer, this is sometthing that I would value feedback on. The tests that should be performed are:

unit tests with pytest
regression tests with minimal_system_quick_test
integration tests with daqsystemtest_integtest_bundle for anyone with either access to the NP0x cluster or CVMFS

Looking forward to feedback from you

Aurashk · 2025-11-21T16:28:37Z

@jamesturner246 @Aurashk @miruuna this is the PR to clarify what tests should be performed for PR merges. This will update the PR template so the testing requiremenst are clearer, this is sometthing that I would value feedback on. The tests that should be performed are:
* unit tests with `pytest`

* regression tests with `minimal_system_quick_test`

* integration tests with `daqsystemtest_integtest_bundle` for anyone with either access to the NP0x cluster or CVMFS
Looking forward to feedback from you

Thanks Pawel, that's very helpful.
Just wondering about the integration tests, is it enough to daqsystemtest_integtest_bundle on any machine with CVMFS active? So there is no requirement to test on specific hardware?

PawelPlesniak · 2025-11-21T17:31:26Z

There are no hardware restrictions, but this code has been written to run on AlmaLinux9. If I remember correctly, @jamesturner246 got CVMFS set up on his laptop, and should be able to run these tests locally

miruuna · 2025-11-24T13:46:58Z

This looks good and it'll help standardize the workflow. However, running daqsystemtest_integtest_bundle on my machine (Ubuntu 24.04.3 LTS) seems to always take a very long time (>30 min). I am not sure if it's got to do with my machine being slow but this would definitely slow the workflow in my case by quite a bit.

PawelPlesniak · 2025-11-24T13:56:04Z

This looks good and it'll help standardize the workflow. However, running daqsystemtest_integtest_bundle on my machine (Ubuntu 24.04.3 LTS) seems to always take a very long time (>30 min). I am not sure if it's got to do with my machine being slow but this would definitely slow the workflow in my case by quite a bit.

Thanks Miruna. Yes, it is expected that this test takes very long, but this should be only have to be run when major changes are made to the core codebase, prior to merging a PR. I will make this clearer in the testing list requirements

bieryAtFnal · 2025-11-25T03:02:32Z

In case it is useful, I will contribute a couple of notes regarding where we can run the daqsystemtest regression tests.

The system software packages that are needed on a computer in order to run the DAQ generally are mentioned on the "software area instructions" page here in section 1.vi.
As @PawelPlesniak knows, we don't yet have sufficient-computer-resource checks in all of the existing regression tests, including those in daqsystemtest. These checks attempt to ensure that the current computer has enough CPU/memory/free disk to handle a given regression test. If such checking should be a priority, I will work on it - please let me know. (For reference, somewhat recently we added such checking to the regression tests in the dfmodules package. Now, the tests in that package will skip tests that require more resources than the current computer has available. This helps to avoid confusing errors when we run a test on a system that is underpowered for the test.)

Aurashk · 2025-11-25T18:41:44Z

In case it is useful, I will contribute a couple of notes regarding where we can run the daqsystemtest regression tests.

1. The system software packages that are needed on a computer in order to run the DAQ generally are mentioned on the "software area instructions" page [here](https://github.com/DUNE-DAQ/daqconf/wiki/Setting-up-a-fddaq%E2%80%90v5.5.0-development-area) in section 1.vi.

2. As @PawelPlesniak knows, we don't yet have sufficient-computer-resource checks in all of the existing regression tests, including those in `daqsystemtest`.  These checks attempt to ensure that the current computer has enough CPU/memory/free disk to handle a given regression test.  If such checking should be a priority, I will work on it - please let me know.  (For reference, somewhat recently we added such checking to the regression tests in the `dfmodules` package.  Now, the tests in that package will skip tests that require more resources than the current computer has available.  This helps to avoid confusing errors when we run a test on a system that is underpowered for the test.)

Thanks Kurt, that's very helpful, it's possible 2. explains some test failures I was seeing locally last time I tried. Is it totally unpredictable what will happen in the tests without these checks for computational resources or is there a common point of failure? Also one other more general thing that might be useful to know is what makes the tests run long? Is it a timed simulation of everything working meaninfully together or is it doing a lot of computational work?

Aurashk · 2025-11-25T18:47:32Z

Also another thing came to mind. What's the situation with this MSQT test in the CI https://github.com/DUNE-DAQ/drunc/actions/workflows/run_mqst.yml? It seems like it was abandoned some time earlier in the year judging by the actions runs. Is this something we want to get working again?

PawelPlesniak · 2025-11-26T12:28:00Z

In case it is useful, I will contribute a couple of notes regarding where we can run the daqsystemtest regression tests.
1. The system software packages that are needed on a computer in order to run the DAQ generally are mentioned on the "software area instructions" page [here](https://github.com/DUNE-DAQ/daqconf/wiki/Setting-up-a-fddaq%E2%80%90v5.5.0-development-area) in section 1.vi.

2. As @PawelPlesniak knows, we don't yet have sufficient-computer-resource checks in all of the existing regression tests, including those in `daqsystemtest`.  These checks attempt to ensure that the current computer has enough CPU/memory/free disk to handle a given regression test.  If such checking should be a priority, I will work on it - please let me know.  (For reference, somewhat recently we added such checking to the regression tests in the `dfmodules` package.  Now, the tests in that package will skip tests that require more resources than the current computer has available.  This helps to avoid confusing errors when we run a test on a system that is underpowered for the test.)
Thanks Kurt, that's very helpful, it's possible 2. explains some test failures I was seeing locally last time I tried. Is it totally unpredictable what will happen in the tests without these checks for computational resources or is there a common point of failure? Also one other more general thing that might be useful to know is what makes the tests run long? Is it a timed simulation of everything working meaninfully together or is it doing a lot of computational work?

When a session is running on a host with insufficient resources, a session will likley throw errors with the number of missing/empty data products. This takes time as we run a variety of configurations with many runs - there are 9 tests and multiple configurations for some of these tests. Supposing each tests takes 3 mins, this will get you the approximate half hour for running.

PawelPlesniak · 2025-11-26T12:28:37Z

Also another thing came to mind. What's the situation with this MSQT test in the CI https://github.com/DUNE-DAQ/drunc/actions/workflows/run_mqst.yml? It seems like it was abandoned some time earlier in the year judging by the actions runs. Is this something we want to get working again?

This is something held back by the development of the Subprocess process manager, in this PR

bieryAtFnal · 2025-11-26T18:13:14Z

In principle, the time that each regression test takes to run is dominated by the amount of time spent waiting in each FSM state (e.g. trigger-enabled), as much time as the writer of the test chose. Of course, if lots of failures happen and/or a process either stalls or crashes, run control transitions can take longer than usual (e.g. some of the "stop-run" transitions), and those might produce a noticeable extra amount of time.

jamesturner246 · 2025-12-15T12:50:08Z

Hi all. As discussed last meeting, I think a special cluster account just for testing PRs would be invaluable for this workflow.

Something we could perhaps hook into CI -- e.g. manually (or even auto, but maybe too noisy) trigger the full integration test suite on the cluster once the PR is marked ready for review.

PawelPlesniak · 2026-02-05T16:11:59Z

@jamesturner246 @Aurashk @miruuna @bieryAtFnal
I have attempted to make the testing and review policy as clear as possible. If you could all review to make sure we're on the same page, we can make the testing less bottlenecked. If anything is unclear or you would like to suggest a change, please indicate so here. Thanks
The list of role based developers is now on the drunc wiki.

bieryAtFnal · 2026-02-05T19:37:59Z

Hi @PawelPlesniak , the updated template looks reasonable to me. I've made a note to myself to revisit the template once the global bundle script is generally available. When I do that, I will update the template to reference the new script (dunedaq_integtest_bundle.sh).

Aurashk · 2026-02-06T08:42:05Z

Looking nice and very useful, thanks @PawelPlesniak. I have a couple of small suggestions.

Explicitly say here that daqsystemtest_integtest_bundle.sh needs to be run on specific clusters (I think that's the case)
Add two check marks for announcing major changes (possibly with an example of a major change) on the slack channel to be discussed. i.e. 'Does not require discussion with the wider dunedaq' or 'has been discussed and approved with wider dunedaq.' I'd put it just before the final review stage or in the review stage.

emmuhamm · 2026-02-06T09:11:47Z

.github/pull_request_template.md

-_Please include a summary of the change and which issue is fixed (if any). Please also
-include relevant motivation and context. List any dependencies that are required for
-this change._
+Addresses issue # _Fill this in with the relevant issue number so the relevant issue can be closed._


Suggested change

Addresses issue # _Fill this in with the relevant issue number so the relevant issue can be closed._

Fixes issue # _Fill this in with the relevant issue number so the relevant issue can be closed._

I would suggest changing Addresses to Fixes, as I don't think 'Addresses' will close the issue. See:
https://docs.github.com/en/issues/tracking-your-work-with-issues/using-issues/linking-a-pull-request-to-an-issue#linking-a-pull-request-to-an-issue-using-a-keyword

emmuhamm · 2026-02-06T09:18:25Z

.github/pull_request_template.md

+# Reviewer checklist
+_Note - if a reveiwer requests changes and those changes are implemented, this block should be re-checked._

-## Further checks
+- [ ] Pre-commit hooks run successfully if applicable (e.g. `pre-commit run --all-files`)
+- [ ] Unit tests pass (`pytest`) - note please use the broadest marker possible
+- [ ] Suggested manual tests pass as described above
+- [ ] Integration tests pass (`daqsystemtest_integtest_bundle.sh`)


This block here (except the manual tests) seems best placed in a CI workflow rather than a reviewer's manual actions. As we've discussed in the meeting, we should look into this (quite fond of @jamesturner246's suggestion about getting a cluster account to test these PRs).

It should be fine for this PR but something to revisit later on

emmuhamm · 2026-02-06T09:19:49Z

.github/pull_request_template.md

- [ ] Code is commented, particularly in hard-to-understand areas
- [ ] Tests added or an issue has been opened to tackle that in the future.
-  (Indicate issue here: # (issue))
+Once the features are validated and both the unit and integrationm tests pass, the PRs can be merged.


Something that might be useful to make explicit for this repo is who has the responsibility to merge things in develop after it passes review. Is it the author ( / = assignee), or the reviewer?

(noticed in several repos in LHCb / DUNE that this responsibility changes, so would be good to know here)

Making testing requirements clearer

a3d8ba0

PawelPlesniak requested a review from MRiganSUSX November 11, 2025 14:10

Adding the remainding tests

9b0e7e3

Clarifying details on the integration test bundle

3f19a18

Adding supplemental review block

1535e58

PawelPlesniak marked this pull request as draft January 14, 2026 11:02

PawelPlesniak added the enhancement New feature or request label Jan 22, 2026

PawelPlesniak added 4 commits February 5, 2026 13:47

Update prior to review with Kurt

aee4436

Testing checklist finalized

8db5374

Note on integtests

367f869

testing list done

1dfaf6c

PawelPlesniak requested a review from emmuhamm February 5, 2026 16:05

PawelPlesniak added 2 commits February 5, 2026 17:10

Final notes

7fd2217

Should have git added again

0c7dad5

emmuhamm reviewed Feb 6, 2026

View reviewed changes

	Addresses issue # _Fill this in with the relevant issue number so the relevant issue can be closed._
	Fixes issue # _Fill this in with the relevant issue number so the relevant issue can be closed._

Conversation

PawelPlesniak commented Nov 11, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Type of change

Key checklist

Further checks

Uh oh!

PawelPlesniak commented Nov 20, 2025

Uh oh!

Aurashk commented Nov 21, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

PawelPlesniak commented Nov 21, 2025

Uh oh!

miruuna commented Nov 24, 2025

Uh oh!

PawelPlesniak commented Nov 24, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

bieryAtFnal commented Nov 25, 2025

Uh oh!

Aurashk commented Nov 25, 2025

Uh oh!

Aurashk commented Nov 25, 2025

Uh oh!

PawelPlesniak commented Nov 26, 2025

Uh oh!

PawelPlesniak commented Nov 26, 2025

Uh oh!

bieryAtFnal commented Nov 26, 2025

Uh oh!

jamesturner246 commented Dec 15, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

PawelPlesniak commented Feb 5, 2026

Uh oh!

bieryAtFnal commented Feb 5, 2026

Uh oh!

Aurashk commented Feb 6, 2026

Uh oh!

emmuhamm Feb 6, 2026

Choose a reason for hiding this comment

Uh oh!

emmuhamm Feb 6, 2026

Choose a reason for hiding this comment

Uh oh!

emmuhamm Feb 6, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

PawelPlesniak commented Nov 11, 2025 •

edited

Loading

Aurashk commented Nov 21, 2025 •

edited

Loading

PawelPlesniak commented Nov 24, 2025 •

edited

Loading

jamesturner246 commented Dec 15, 2025 •

edited

Loading