added features for runsolver/starexec compatibility #1201

geoffgeoffgeoff3 · 2025-10-28T10:49:09Z

Hi Philipp, I'll happily take feedback rather than a merge. I have done the work only for the --no-container case. I run with ...
bin/runexec --no-container --read-only-dir / --no-output-header --timestamp --add-eof --

geoffgeoffgeoff3 · 2025-10-28T14:41:05Z

Now the stdout and stderr work, interleaved, and I added "-" to allow output to stdout. But the --timelimit and --walltimelimit are not working.

geoffgeoffgeoff3 · 2025-10-28T16:54:39Z

The --timelimit and --walltimelimit are working. I was confused by the amount of wall-clock (WC) time it takes to print a lot of tracing from my test program (recursive Fibonacci).

geoffgeoffgeoff3 · 2025-10-28T16:55:04Z

When you are happy with what I have done give me a push, and I'll work on the container version.

PhilippWendler · 2025-10-29T10:29:25Z

In general this looks quite promising, nice, and thank you! We can still discuss naming a little bit.

I can also somewhat understand the request for configuring output files, though maybe you can also simply redirect the output?

Maybe we should not add the overhead of piping the tool output through BenchExec for everyone and disable the new code if no timestamp is requested? Should be easy.

> When you are happy with what I have done give me a push, and I'll work on the container version.

Would be great!

I just remembered that we have the start of a GSoC attempt on this here: https://github.com/sosy-lab/benchexec/pull/1170/files It is more complicated because it keeps stdout and stderr separate and thus needs to work on two streams in parallel. But actually I think your approach should also work, I don't see any disadvantage right now.

geoffgeoffgeoff3 · 2025-10-29T10:36:26Z

Yes, last night I also thought about disabling the code for timestamping. I'll do it this afternoon. I like the symmetry of having "-" to pipeline both stdin and stdout/stderr ... it's easy for users to remember when it's the same in both directions.
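A minimal sketch of the "no piping unless timestamps are requested" idea discussed above. It is not BenchExec code: `run_tool`, its parameters, and the timestamp format are hypothetical names chosen for illustration. Without timestamps the tool writes straight to the target file; with timestamps the combined stdout/stderr is read line by line and prefixed with the elapsed wall-clock time.

```python
import subprocess
import sys
import time


def run_tool(args, out, timestamp=False):
    """Run args and send combined stdout/stderr to the binary file `out`.

    Hypothetical helper for illustration only, not part of BenchExec.
    """
    if not timestamp:
        # No timestamps requested: the tool writes to the file directly,
        # so its output never passes through this process.
        return subprocess.run(args, stdout=out, stderr=subprocess.STDOUT).returncode

    start = time.monotonic()
    with subprocess.Popen(
        args, stdout=subprocess.PIPE, stderr=subprocess.STDOUT
    ) as proc:
        for line in proc.stdout:
            elapsed = time.monotonic() - start
            # Prefix each line with the elapsed wall-clock time in seconds.
            out.write(f"{elapsed:.2f}\t".encode() + line)
    # Leaving the "with" block waits for the process, so returncode is set.
    return proc.returncode
```

Passing `sys.stdout.buffer` as `out` would correspond to the "-" case mentioned above.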

geoffgeoffgeoff3 · 2025-10-29T12:20:50Z

The container version has got me stumped for now. I might need you (Philipp) to nudge me in the right direction.

geoffgeoffgeoff3 · 2025-10-30T12:02:40Z

I see the code ...

def _set_termination_reason(self, reason):
    if not self._termination_reason:
        self._termination_reason = reason

If I have a soft limit, typically that means the solver will catch the SIGTERM and continue. The code sets the termination reason with self.callback("cputime-soft"). If the solver hits the hard limit while continuing, and that really stops the solver, then the code tries to set the termination reason with self.callback("cputime"). However, as it was set previously the real new reason is not recorded and the output says "terminationreason=cputime-soft". I suggest taking out "if not self._termination_reason:". Whatcha think?

PhilippWendler · 2025-10-30T12:21:09Z

It is deliberate that we record the first reason for termination that occurs, because this is the actual reason why the run was a failure. For example, if the process first reaches the soft time limit and then the memory limit, it is a timeout and not an OOM.
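To make the first-reason-wins behaviour concrete, here is a small standalone sketch. `RunState` is a hypothetical stand-in for whatever object owns `_termination_reason`; only the method body is taken from the code quoted above.

```python
class RunState:
    """Hypothetical stand-in for the object that owns _termination_reason."""

    def __init__(self):
        self._termination_reason = None

    def _set_termination_reason(self, reason):
        # Same logic as the quoted method: only the first reason is kept.
        if not self._termination_reason:
            self._termination_reason = reason


state = RunState()
state._set_termination_reason("cputime-soft")  # soft limit fires first
state._set_termination_reason("cputime")  # hard limit fires later and is ignored
print(state._termination_reason)  # cputime-soft
```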

PhilippWendler · 2025-10-30T12:23:39Z

This PR is accumulating more and more independent features :-) I like them, but it is becoming a little bit hard to review the diff and keep in mind which changes belong together. How much effort would it be to split off orthogonal stuff like soft wall-time limits and output redirections? If it is too much effort we can probably proceed with the current PR, but otherwise it would be helpful (and allow merging already finished features).

PhilippWendler · 2025-10-30T12:39:16Z

> The container version has got me stumped for now. I might need you (Philip) to nudge me in the right direction.

Finally I could really think about this. My hypothesis is that any of the following would work:

1. We implement the timestamping in the "child" process. That process currently calls wait_for_child_and_forward_signals() while the process is running, so we could not do simple blocking I/O but would have to implement something that can handle I/O at the same time as forwarding signals. @larskotthoff reminded me of the possibility to use async I/O in Python, which should make this solution easier to implement, but I have no personal experience with it. A disadvantage of adding code to the "child" process is also that this process runs inside the container and any problems in it are harder to debug (we do not get nice debug output).
2. We send the required file descriptors (stdout/stderr of the "grandchild") to the "parent" via the existing sockets. It is possible to send file descriptors between processes with socket.send_fds(), though I have never done so.
3. We pre-create pipes for stdout/stderr in the "parent" process and let them be copied to the "child" process when it is created. Then we can use them there as parameters for Popen (so instead of letting subprocess create pipes and return them, we supply our own). This should be relatively straightforward; we just need to make sure that we close all copies of these pipes in the appropriate places.

I would tend to say 3. looks like the most promising option. The place to create these new pipes would be the same place where we already create pipes by calling os.pipe() in _start_execution_in_container.

geoffgeoffgeoff3 · 2025-10-30T12:48:06Z

It would be hard to separate the various things I have done:

Time stamps and EOF for non-container
Soft wall time limit
IO and statistics redirection

However, the hacking I have done in containerexecutor.py can be ignored - just replace it with an original containerexecutor.py. I'll do that and add it to the PR, then stop adding to that PR.

I will start a separate branch for the containerexecutor.py work.

geoffgeoffgeoff3 · 2025-10-30T13:03:30Z

Regarding the reason, how about ...

def _set_termination_reason(self, reason):
    if not self._termination_reason:
        self._termination_reason = reason
    else:
        self._termination_reason = self._termination_reason + " then " + reason

geoffgeoffgeoff3 · 2025-10-30T14:14:51Z

I have a cunning scheme for the container case, but my limited Python skills are holding me back. I think I'll need help from you or Marco (he's a Python wizard).

geoffgeoffgeoff3 · 2025-10-30T16:36:33Z

I marked the lines with change with ZZZZ, if that helps. I can remove those lines ATGB.

geoffgeoffgeoff3 · 2025-10-30T18:04:45Z

Strangely, the non-container "solver" runs slower than the container version.

geoffgeoffgeoff3 · 2025-10-31T07:45:05Z

You said we should discuss, at least, the option names. There was already yuckiness in --timelimit, --softtimelimit, and --walltimelimit, and my --softwalltimelimit was a horrible attempt to be compatible. I know you have to keep the old ones for backwards compatibility, but they could be deprecated in favour of these new options ...
--cpu-limit
--soft-cpu-limit
--wc-limit
--soft-wc-limit

--timestamp and --add-eof seem OK to me.

I shortened the stats file one to --statistics-file, which is fine for me.

Any others?
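One way the old and new spellings could coexist is to register both on the same argument. This is only a sketch with argparse, not how runexec defines its options: the parser name, the `dest` names, and the assumption that the limits are plain numbers of seconds are all made up for illustration.

```python
import argparse


def build_parser():
    parser = argparse.ArgumentParser(prog="runexec-sketch")
    # New spelling first, old spelling kept as an alias for compatibility.
    parser.add_argument(
        "--cpu-limit", "--timelimit", dest="cpu_limit", type=float
    )
    parser.add_argument(
        "--soft-cpu-limit", "--softtimelimit", dest="soft_cpu_limit", type=float
    )
    parser.add_argument(
        "--wc-limit", "--walltimelimit", dest="wc_limit", type=float
    )
    parser.add_argument(
        "--soft-wc-limit", "--softwalltimelimit", dest="soft_wc_limit", type=float
    )
    return parser


old = build_parser().parse_args(["--timelimit", "60", "--walltimelimit", "90"])
new = build_parser().parse_args(["--cpu-limit", "60", "--wc-limit", "90"])
print(old == new)  # True
```

Since the first option string is the one shown first in the help text, the new names would be the ones users see, while existing scripts keep working.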

geoffgeoffgeoff3 added 7 commits October 28, 2025 10:45

added features for runsolver/starexec compatibility

55cca82

syntax fix for Ruff

0638db1

syntax fix for Ruff

b5d3563

syntax fix for Ruff

62d38ef

fixed ^C, added option for output to stdout

fa4eb4a

fix for Ruff

2577b8b

fix for Ruff

3ec4025

added flag to send statistics to a file instead of stdout

8771788

fix for Ruff

c0bb2d6

fixed so no piping if no timestamps. working on container version but…

fee787b

… stuck

soft wall clock limit implemented

12fdc16

fixes for Ruff

e8e0dd1

geoffgeoffgeoff3 added 3 commits October 30, 2025 12:51

Final version (I hope) with original containerexecutor.py

b0b4491

Final version fix for Ruff

27d29ea

Final version fix for Ruff

c4dfa83

geoffgeoffgeoff3 added 2 commits October 30, 2025 13:06

Put back original containerexecutor.py

365bff7

branch for working on container timestamps

7ad4a34

geoffgeoffgeoff3 added 2 commits October 30, 2025 16:34

container timestamps

84afb6a

Merge branch 'timestamps_and_more' into runsolver_compatible

cb8b04b

geoffgeoffgeoff3 added 2 commits October 30, 2025 16:42

checked timestamp if add-eof

8da5e01

tidy timestamps

72e17bb

Ruff-ians

c5ab2d6
