test(#184): add missing no-criteria + inline-criteria tests for JudgeAgent by drewdrewthis · Pull Request #549 · langwatch/scenario

drewdrewthis · 2026-05-25T07:12:33Z

Summary

The judge-agent.test.ts file had comprehensive coverage for trace-size paths but was missing the judgmentRequest enforcement path tested in isolation. Closes #184.

New tests added

"when no criteria provided and judgment requested"

returns failure with 'No criteria' reasoning without calling the LLM — verifies the early-return short-circuit when judgmentRequest != null but criteria list is empty
uses inline criteria from judgmentRequest when provided — verifies inline criteria in judgmentRequest override the agent's empty base list and the LLM is called normally

Test plan

pnpm exec vitest run src/agents/judge/__tests__/judge-agent.test.ts → 20/20 passed

Closes #184

🤖 Generated with Claude Code

github-actions

Approved by automation: PR qualifies as low-risk-change under the documented policy.

…or JudgeAgent Two new tests close the gap noted in issue #184: 1. When judgmentRequest is set but criteria is empty, the agent must short-circuit before calling the LLM and return failure with a 'No criteria' message. 2. When judgmentRequest carries inline criteria, the judge should use those (not the empty base list) and call the LLM normally. All 20 judge-agent tests now pass. Closes #184 Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

github-actions · 2026-05-25T07:15:25Z

Automated low-risk assessment

This PR was evaluated against the repository's Low-Risk Pull Requests procedure.

Scope: Adds two tests in javascript/src/agents/judge/tests/judge-agent.test.ts covering judgmentRequest behavior with empty/inline criteria; updates javascript/tsconfig.json to set compiler target to ES2022.
Exclusions confirmed: no changes to auth, security settings, database schema, business-critical logic, or external integrations.
Classification: low-risk-change under the documented policy.

The diff only adds unit tests for JudgeAgent and a minor TypeScript compiler setting change (target: ES2022). It does not modify authentication/authorization, secrets, database schemas, business‑critical logic, or external integrations, and the tsconfig edit is a reversible build configuration tweak. Therefore it meets the low-risk criteria.

An approving review has been submitted by automation. The PR may merge once required CI checks pass.

github-actions

Approved by automation: PR qualifies as low-risk-change under the documented policy.

drewdrewthis · 2026-05-25T07:20:33Z

[grinder] READY for human review

CI: green (zero failing, zero pending)
Threads: zero unresolved

Verified by:
`command gh pr checks 549` → all pass: ci-checks (24.x) pass 4m7s, javascript-complete pass, python-complete pass
Latest run: https://github.com/langwatch/scenario/actions/runs/26388400232/job/77672539302

drewdrewthis added the grinding Grinder is actively managing this PR label May 25, 2026

github-actions Bot added the low-risk-change PR qualifies as low-risk per policy and can be merged without manual review label May 25, 2026

github-actions Bot previously approved these changes May 25, 2026

View reviewed changes

drewdrewthis dismissed github-actions[bot]’s stale review via 09814f4 May 25, 2026 07:14

drewdrewthis force-pushed the issue184/judge-no-criteria-test branch from b3925cf to 09814f4 Compare May 25, 2026 07:14

github-actions Bot added low-risk-change PR qualifies as low-risk per policy and can be merged without manual review and removed low-risk-change PR qualifies as low-risk per policy and can be merged without manual review labels May 25, 2026

github-actions Bot approved these changes May 25, 2026

View reviewed changes

drewdrewthis added pr-ready and removed low-risk-change PR qualifies as low-risk per policy and can be merged without manual review grinding Grinder is actively managing this PR labels May 25, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

test(#184): add missing no-criteria + inline-criteria tests for JudgeAgent#549

test(#184): add missing no-criteria + inline-criteria tests for JudgeAgent#549
drewdrewthis wants to merge 1 commit into
mainfrom
issue184/judge-no-criteria-test

drewdrewthis commented May 25, 2026

Uh oh!

github-actions Bot left a comment

Uh oh!

github-actions Bot commented May 25, 2026

Uh oh!

github-actions Bot left a comment

Uh oh!

drewdrewthis commented May 25, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

drewdrewthis commented May 25, 2026

Summary

New tests added

Test plan

Uh oh!

github-actions Bot left a comment

Choose a reason for hiding this comment

Uh oh!

github-actions Bot commented May 25, 2026

Uh oh!

github-actions Bot left a comment

Choose a reason for hiding this comment

Uh oh!

drewdrewthis commented May 25, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant