fix(holocron): public API connection status and deploy gates #97

Merged
th3w1zard1 merged 6 commits into main from feat/holocron-topnav-ci-followup on Jun 4, 2026

@th3w1zard1 (Contributor)

Summary

  • Root cause: trask-worker proxies to openkotor-holocron-trask-http.hf.space, whose HF Space is in the ERROR stage, so /api/trask/* returns 503. Holocron retried silently and stayed on “Dispatching”.
  • Worker: /healthz now probes upstream; failed upstream returns structured JSON on /api/trask/* (upstream URL, status, hint).
  • Holocron UI: TraskBackendStatus banner, research-step Connection error / Retrying lines, fail after 8 attempts with a clear assistant error.
  • Assets: holocron-artifact.png added under public/holocron/ with BASE_URL-aware paths; Spark stub when ENABLE_SPARK=0 (fixes /_spark/loaded 405 noise).
  • CI: scripts/check_trask_public_api.sh gates the Pages deploy and also runs after the worker deploy.
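
The Worker bullet above can be illustrated with a minimal sketch of a structured upstream error. This is not the PR's code: the field names, the `error` code, the hint wording, and the `Probe` type are all assumptions made for illustration. The PR only states that the JSON carries the upstream URL, status, and a hint.

```typescript
// Sketch only: field names and hint wording are hypothetical, not taken from the PR.
interface UpstreamErrorBody {
  error: "upstream_unavailable";
  upstreamUrl: string;
  upstreamStatus: number | null; // null when the request itself failed
  hint: string;
}

// Minimal fetch-like signature so the sketch does not depend on runtime typings.
type Probe = (url: string) => Promise<{ status: number }>;

function buildUpstreamError(
  upstreamUrl: string,
  upstreamStatus: number | null,
): UpstreamErrorBody {
  let hint: string;
  if (upstreamStatus === null) {
    hint = "Upstream could not be reached.";
  } else if (upstreamStatus === 503) {
    hint = "Upstream returned 503; the HF Space may need a restart.";
  } else {
    hint = `Upstream responded with HTTP ${upstreamStatus}.`;
  }
  return { error: "upstream_unavailable", upstreamUrl, upstreamStatus, hint };
}

// Returns null when the upstream looks healthy, otherwise a structured error body
// that a /healthz or /api/trask/* handler could serialize as JSON.
async function probeUpstream(
  upstreamUrl: string,
  probe: Probe,
): Promise<UpstreamErrorBody | null> {
  try {
    const res = await probe(upstreamUrl);
    if (res.status >= 200 && res.status < 300) return null;
    return buildUpstreamError(upstreamUrl, res.status);
  } catch (_err) {
    return buildUpstreamError(upstreamUrl, null);
  }
}
```

Passing the probe function in as a parameter keeps the sketch testable without a network; in a real worker it would presumably wrap the platform's `fetch`.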

Ops to restore production (after merge)

  1. Restart HF Space: Actions → Deploy Trask HTTP (Hugging Face) → workflow_dispatch, repeated until the Space stage is RUNNING.
  2. Deploy worker: merge + Deploy Trask Worker (needs CLOUDFLARE_* secrets + TRASK_RESEARCHWIZARD_BASE_URL).
  3. Pages: Holocron publish runs only when check_trask_public_api.sh passes.
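
The gating rule in step 3 can be sketched as a pure decision function. The actual gate is the shell script check_trask_public_api.sh, whose contents are not shown here; this TypeScript version is a stand-in under assumptions. The `ProbeResult` shape, the `evaluateGate` name, and the "2xx means healthy" rule are hypothetical; the two probed paths come from the commit message in this PR.

```typescript
// Sketch only: a TypeScript stand-in for the shell gate, not the script itself.
interface ProbeResult {
  path: string;
  status: number | null; // null when no response was received
}

// Paths named in the PR's commit message; the 2xx rule below is an assumption.
const REQUIRED_PATHS = ["/healthz", "/api/trask/ask"];

function evaluateGate(results: ProbeResult[]): { pass: boolean; failures: string[] } {
  const failures: string[] = [];
  for (const path of REQUIRED_PATHS) {
    const hit = results.find((r) => r.path === path);
    if (!hit) {
      failures.push(`${path}: not probed`);
    } else if (hit.status === null) {
      failures.push(`${path}: no response`);
    } else if (hit.status < 200 || hit.status >= 300) {
      failures.push(`${path}: HTTP ${hit.status}`);
    }
  }
  // The deploy proceeds only when every required path was probed and healthy.
  return { pass: failures.length === 0, failures };
}
```

A CI step would exit non-zero when `pass` is false, which is what blocks the Pages publish.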

Test plan

  • pnpm check
  • pnpm --dir infra/trask-worker run build
  • pnpm --filter @openkotor/holocron-web build
  • After HF Space healthy: TRASK_API_BASE=https://trask-worker.bocloud.workers.dev bash scripts/check_trask_public_api.sh
  • Spot-check qa-webui after worker + Pages deploy

Harden ci_warm_trask_embed cache probe and attempts validation; grant build-and-test actions: write so Actions cache can save FastEmbed/Chroma.

Use /projects, /faq, and /formats instead of hash anchors to match openkotor/site routing.

Holocron browser MCP 5/5 on :4010 with fresh threads; refresh evidence and mark pass-5 plan completed after PR #96 merge.

…ealth

The qa-webui was stuck on Dispatching because trask-worker proxied to an HF
Space in ERROR (503) with no UI feedback. Add upstream health probing on the
worker, structured API errors, a Holocron backend status banner, bounded retry
failure messages, BASE_URL-safe assets, Spark stub for Pages builds, and CI
scripts that block deploy when /healthz or /api/trask/ask are unhealthy.
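
The bounded retry described above can be modelled as a small state transition. This is a sketch, not the Holocron implementation: the limit of 8 attempts and the "Connection error" / "Retrying" wording come from the PR summary, while the `RetryState` shape, the `recordFailure` name, and the exact line text are assumptions.

```typescript
// Sketch only: state shape and message text are hypothetical.
const MAX_ATTEMPTS = 8; // limit stated in the PR summary

interface RetryState {
  failedAttempts: number;
  status: "dispatching" | "retrying" | "failed";
  line: string; // text shown on the research step
}

const initialState: RetryState = {
  failedAttempts: 0,
  status: "dispatching",
  line: "Dispatching",
};

// Called once per failed request; returns the next state to render.
function recordFailure(prev: RetryState, maxAttempts: number = MAX_ATTEMPTS): RetryState {
  const failedAttempts = prev.failedAttempts + 1;
  if (failedAttempts >= maxAttempts) {
    return {
      failedAttempts,
      status: "failed",
      line: `Connection error: giving up after ${failedAttempts} attempts`,
    };
  }
  return {
    failedAttempts,
    status: "retrying",
    line: `Connection error. Retrying (attempt ${failedAttempts + 1} of ${maxAttempts})`,
  };
}
```

Keeping the transition pure means the UI never stays on "Dispatching" after a failure: every failed request produces either a visible retry line or a terminal error.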

Strip spaced markdown links and images from visible answer text while
preserving [n] markers, move trailing numbered bibliography into the
Sources panel when the model omits a heading, and tighten compose-side
markdown cleanup in grounded-evidence.
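
A minimal sketch of the link and image stripping described in this commit message, assuming "spaced" means whitespace between the `]` and the `(`. The function name and the regular expressions are illustrative assumptions and not the code in grounded-evidence; only http(s) targets are matched, so a citation followed by an ordinary parenthetical is left alone.

```typescript
// Sketch only: name and patterns are hypothetical, not the project's implementation.
function stripMarkdownLinks(text: string): string {
  return text
    // Remove images entirely: ![alt](https://...) or ![alt] (https://...)
    .replace(/!\[[^\]]*\]\s*\(\s*https?:\/\/[^)\s]*\s*\)/g, "")
    // Replace links with their label; keep purely numeric labels as [n] markers.
    .replace(
      /\[([^\]]+)\]\s*\(\s*https?:\/\/[^)\s]*\s*\)/g,
      (_match: string, label: string) => (/^\d+$/.test(label) ? `[${label}]` : label),
    )
    // Collapse the double spaces left behind by removed images.
    .replace(/[ \t]{2,}/g, " ")
    .trim();
}

const cleaned = stripMarkdownLinks(
  "See [docs] (https://example.com) and ![img](https://example.com/a.png) [1].",
);
// cleaned === "See docs and [1]."
```

Bare `[n]` markers are never followed by a URL in parentheses, so they pass through untouched; a marker that was itself wrapped in a link is reduced back to `[n]`.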
th3w1zard1 merged commit 42ee3b2 into main on Jun 4, 2026
8 checks passed
th3w1zard1 deleted the feat/holocron-topnav-ci-followup branch on June 4, 2026, 12:07