mcp: add CI, reduce dependabot false positives, and miscellaneous polishing by sgmenda · Pull Request #169 · jrh13/hol-light

sgmenda · 2026-04-15T17:54:08Z

Add CI tests: Added .github/workflows/mcp.yml to run unit tests and smoke tests on PRs that modify the mcp/** folder.

Reduce dependabot false positives: We only use the stdio transport, but the mcp dependency pulls HTTP transport deps like cryptography. Added .github/dependabot.yml to allow-list our direct deps. This should reduce false positive dependabot pings like #165

Miscellaneous polishing: Replace fragile String.sub JSON slicing in mcp_json_apply_tactics with more robust buffer-based construction. Rename default checkpoint from "noledit" to "base". And more.

h/t @jargh for spotting bugs in an earlier version of this PR.

All 48 unit tests pass.

The old name was a leftover from when the checkpoint was created without ledit. "base" is clearer and matches the README examples.

Runs on PRs/pushes to master when mcp/ files change. Builds HOL Light (OCaml 4.14), installs uv, then runs: - pytest test_server.py (unit tests) - smoke_test.py (MCP integration tests)

We only use the stdio transport (FastMCP server + client). The [cli] extra pulled in typer, rich, shellingham, etc. that we never use. Removes 7 transitive dependencies, reducing dependabot surface.

The mcp SDK pulls ~30 transitive dependencies (uvicorn, starlette, cryptography, pydantic, etc.) that we don't control. Using allow-list so dependabot only opens PRs for our direct deps (mcp, pytest) and GitHub Actions versions.

Docs: - Fix stale test counts in README (38→48 unit, 34→37 smoke) - Add start_recording/stop_recording to README tools table - Add recording tools to TUTORIAL.md workflow summary CI: - Add timeout-minutes: 30 to mcp.yml workflow Code: - mcp_helpers.ml: replace fragile String.sub JSON slicing in mcp_json_apply_tactics with proper buffer-based construction - server.py: make recording append-only with backtrack markers instead of rewriting the entire file on every tactic - server.py: handle nested OCaml comments (* ... *) in _extract_e_tactic paren counting - make_checkpoint.py: deduplicate opam env parsing by importing _opam_env from server.py (with inline fallback) All 48 unit tests pass.

- start_recording: truncate existing file instead of appending - _record_tactics_batch: capture total_goals for last tactic from the batch result instead of always recording 0 - smoke_test: use set-contains for tool list check so adding new tools doesn't break the existing assertion - .gitignore: add __pycache__/ (was only in global gitignore) - hol_restart: reset _recording_flushed so recording stays consistent after a restart All 48 unit tests pass.

- Add argparse with --help, --name, -I flags - Validate prerequisites before launching (ocaml-hol exists, dmtcp_launch on PATH, include dirs exist) - Clear error messages instead of cryptic "EOF before loaded" - Fix LD_LIBRARY_PATH not being set (caused silent dmtcp failures) - Print checkpoint plan and size on completion - s/Bare/Base/ in README checkpoint example

_replay_prefix() was setting _recording_flushed=0 then flushing, which re-appended all replayed entries to the file that already contained them. Fix: set _recording_flushed=len(replayed) since the entries are already on disk. Similarly, hol_restart() was setting _recording_flushed=0 without truncating the file, causing duplication on next flush. Fix: set _recording_flushed=len(_recording) to mark existing entries as already written.

The OCaml side sets "step" to the count of successful tactics (incremented after e(tac) succeeds). The Python code was computing succeeded = step - 1, undercounting by 1. For example, if the first tactic succeeded and the second failed (step=1), zero tactics were recorded instead of one.

jrh13 · 2026-04-16T22:22:48Z

All great, thank you! I will merge.

sgmenda added 10 commits April 15, 2026 17:08

mcp: rename default checkpoint from "noledit" to "base"

a6c869c

The old name was a leftover from when the checkpoint was created without ledit. "base" is clearer and matches the README examples.

mcp: add CI workflow for unit and smoke tests

b0b0235

Runs on PRs/pushes to master when mcp/ files change. Builds HOL Light (OCaml 4.14), installs uv, then runs: - pytest test_server.py (unit tests) - smoke_test.py (MCP integration tests)

mcp: drop unused [cli] extra from mcp dependency

de66983

We only use the stdio transport (FastMCP server + client). The [cli] extra pulled in typer, rich, shellingham, etc. that we never use. Removes 7 transitive dependencies, reducing dependabot surface.

add dependabot config, limit pip alerts to direct deps

59c7f20

The mcp SDK pulls ~30 transitive dependencies (uvicorn, starlette, cryptography, pydantic, etc.) that we don't control. Using allow-list so dependabot only opens PRs for our direct deps (mcp, pytest) and GitHub Actions versions.

fix missing trailing newline in .gitignore

46f694d

jrh13 merged commit 9fa0dff into jrh13:master Apr 16, 2026
4 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

mcp: add CI, reduce dependabot false positives, and miscellaneous polishing#169

mcp: add CI, reduce dependabot false positives, and miscellaneous polishing#169
jrh13 merged 10 commits intojrh13:masterfrom
sgmenda:mcp-ci

sgmenda commented Apr 15, 2026 •

edited

Loading

Uh oh!

jrh13 commented Apr 16, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

sgmenda commented Apr 15, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

jrh13 commented Apr 16, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

sgmenda commented Apr 15, 2026 •

edited

Loading