feat: moderation v2 core backend engine and pipeline by ArthurzKV · Pull Request #333 · openclaw/clawhub

ArthurzKV · 2026-02-15T20:48:24Z

Summary

Introduce moderation v2 backend foundation in ClawHub: normalized verdict/reason/evidence model, deterministic static scanning, publish-time moderation derivation, and backfill support.

Why

Trust decisions need to be consistent and explainable across static, VT, and LLM signals while preserving compatibility with existing moderation fields.

Focused scope

This PR is scoped to one theme: core moderation v2 backend pipeline.

What changed

Added normalized moderation fields in convex/schema.ts.
Added canonical reason code + verdict utilities in convex/lib/moderationReasonCodes.ts.
Added moderation engine in convex/lib/moderationEngine.ts.
Integrated deterministic static scan in publish/backfill paths (convex/lib/skillPublish.ts, convex/skills.ts, convex/vt.ts).
Updated moderation/public safety logic (convex/lib/moderation.ts, convex/lib/public.ts, convex/lib/skillSafety.ts).
Follow-up fixes included:
- escalateByVtInternal moderation flag overwrite bug
- backfill cursor skip edge case
- child_process false-positive fallback in scanner
- rule name alignment to suspicious.nonstandard_network

Local validation

bun run lint:oxlint
bunx vitest run convex/lib/moderationEngine.test.ts convex/skills.rateLimit.test.ts

AI assistance transparency

AI-assisted: Yes (implemented with Codex assistance)
Testing level: Targeted local validation on touched modules
I reviewed the final diffs and understand the behavior changes.

vercel · 2026-02-15T20:48:27Z

@ArthurzKV is attempting to deploy a commit to the Amantus Machina Team on Vercel.

A member of the Team first needs to authorize it.

greptile-apps

_{10 files reviewed, 3 comments}

_{Edit Code Review Agent Settings | Greptile}

convex/skills.ts

convex/lib/moderationEngine.ts

ArthurzKV · 2026-02-15T21:04:25Z

Addressed review feedback in follow-up commits:

ac8fde6:
- fixed escalateByVtInternal so moderationFlags are not overwritten after merge logic.
- fixed backfill cursor advancement to avoid skipping a candidate at batch boundaries.
- fixed child_process exec guard so fallback line text does not create false positives.
beba7a0: renamed network reason code to suspicious.nonstandard_network for naming consistency.

Validation run: lint + moderation engine/rate-limit tests.

feat: add moderation v2 core engine and backend pipeline

3f10048

ArthurzKV mentioned this pull request Feb 15, 2026

feat: moderation v2 trust verification pipeline #332

Closed

greptile-apps bot reviewed Feb 15, 2026

View reviewed changes

convex/skills.ts Outdated Show resolved Hide resolved

convex/skills.ts Show resolved Hide resolved

convex/lib/moderationEngine.ts Show resolved Hide resolved

ArthurzKV added 2 commits February 15, 2026 14:58

fix: address moderation engine and backfill edge cases

ac8fde6

chore: rename network reason code to nonstandard_network

beba7a0

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Comments

feat: moderation v2 core backend engine and pipeline#333

feat: moderation v2 core backend engine and pipeline#333
ArthurzKV wants to merge 3 commits intoopenclaw:mainfrom
ArthurzKV:codex/skill-verification-v2-clawhub-core

ArthurzKV commented Feb 15, 2026 •

edited

Loading

Uh oh!

vercel bot commented Feb 15, 2026

Uh oh!

greptile-apps bot left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

ArthurzKV commented Feb 15, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Uh oh!

Comments

Conversation

ArthurzKV commented Feb 15, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Why

Focused scope

What changed

Local validation

AI assistance transparency

Uh oh!

vercel bot commented Feb 15, 2026

Uh oh!

greptile-apps bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

ArthurzKV commented Feb 15, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

ArthurzKV commented Feb 15, 2026 •

edited

Loading