nemo_retriever: Add structured harness metrics reports#1699

Open
jioffe502 wants to merge 6 commits into NVIDIA:main from jioffe502:fix/harness_metrics
jioffe502 (Collaborator) commented Mar 23, 2026

Switch the retriever harness from stdout scraping to structured run reports shared across supported run modes. Follow-on commits also harden artifact metadata handling so these reports stay reliable in real container/worktree runs.

  • add shared batch, inprocess, and fused run-report plumbing and wire the harness to consume it
  • persist runtime and detection metrics as structured artifacts instead of parsing console output
  • keep latest_commit populated in harness artifacts by falling back to .git/HEAD refs when git rev-parse fails in container/worktree setups
  • preserve detection counters through LanceDB round-trips by writing metadata as JSON and accepting legacy metadata literals during detection-summary reads
  • validate behavior with real bo20/jp20 harness runs as integration coverage

- route harness runs through shared batch, fused, and inprocess reports
- persist runtime and detection metrics without scraping stdout
- tighten tests around config and LanceDB metadata behavior

Signed-off-by: Jacob Ioffe <jioffe@nvidia.com>
jioffe502 requested a review from a team as a code owner March 23, 2026 21:52
jioffe502 marked this pull request as draft March 23, 2026 22:00
jioffe502 and others added 5 commits March 24, 2026 16:55
Revert non-essential docs/example/utility churn so this branch only carries the structured run report metrics path in harness and mode runners.
Signed-off-by: Jacob Ioffe <jioffe@nvidia.com>
Keep harness run metadata stable across container/worktree setups by falling back to git HEAD refs when rev-parse fails, and persist detection counters as JSON metadata so detection summaries survive LanceDB round-trips.

Signed-off-by: Jacob Ioffe <jioffe@nvidia.com>
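
The JSON metadata round-trip with legacy support might look like the following minimal sketch. It assumes that legacy rows stored metadata as a Python dict literal (for example the output of `str(dict)`); the function names `encode_metadata` and `decode_metadata` are hypothetical and not taken from this PR.

```python
import ast
import json
from typing import Any, Dict, Optional, Union


def encode_metadata(metadata: Dict[str, Any]) -> str:
    # New writes always use JSON so the value survives storage round-trips.
    return json.dumps(metadata)


def decode_metadata(raw: Optional[Union[str, Dict[str, Any]]]) -> Dict[str, Any]:
    """Parse stored metadata, accepting JSON first and Python literals second."""
    if isinstance(raw, dict):
        return raw
    if not raw:
        return {}
    try:
        value = json.loads(raw)
    except (TypeError, ValueError):
        # Assumed legacy format: a Python dict literal such as "{'a': 1}".
        # ast.literal_eval only evaluates literals, never arbitrary code.
        try:
            value = ast.literal_eval(raw)
        except (ValueError, SyntaxError):
            return {}
    return value if isinstance(value, dict) else {}
```

Trying JSON first keeps the common path cheap, and because unparseable values become an empty dict, a detection-summary read never fails on one malformed row.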
jioffe502 marked this pull request as ready for review March 30, 2026 21:17
jioffe502 requested a review from a team as a code owner March 30, 2026 21:17
jioffe502 requested a review from edknv March 30, 2026 21:17