Referee model selection #87

ketchup8866 · 2025-09-22T01:51:04Z

ketchup8866
Sep 22, 2025

AS for a safety check, which external model should I choose as the referee model? Any recommendations?

Answered by zonashi

Sep 22, 2025

1、Recommended Choices for MCP Scan

GLM4.5
DeepSeek-V3.1
Kimi-K2-Instruct
Qwen3-Coder-480B
Hunyuan-Turbos

2、Recommended Choices for Jailbreak Evaluation Models

When working with a custom dataset, selecting an appropriate safety evaluation model can significantly improve the accuracy of automated assessments. You can balance model selection from two dimensions: language and scenario.

Language

Chinese Recommendation:
- qwen3-max (best performance)
- qwen3-235b-a22b-2507 (cost-effective choice)
English Recommendation:
- claude-opus-4.1 (best performance)
- claude-sonnet-4 (very good performance)
- gemini-2.0-flash (cost-effective choice)

Scenario

Politically sensitive content testing:
Do not

View full answer

zonashi · 2025-09-22T06:14:14Z

zonashi
Sep 22, 2025
Maintainer

1、Recommended Choices for MCP Scan

GLM4.5
DeepSeek-V3.1
Kimi-K2-Instruct
Qwen3-Coder-480B
Hunyuan-Turbos

2、Recommended Choices for Jailbreak Evaluation Models

When working with a custom dataset, selecting an appropriate safety evaluation model can significantly improve the accuracy of automated assessments. You can balance model selection from two dimensions: language and scenario.

Language

Chinese Recommendation:
- qwen3-max (best performance)
- qwen3-235b-a22b-2507 (cost-effective choice)
English Recommendation:
- claude-opus-4.1 (best performance)
- claude-sonnet-4 (very good performance)
- gemini-2.0-flash (cost-effective choice)

Scenario

Politically sensitive content testing:
Do not choose Gemini models. Instead, prioritize domestic models such as hunyuan-turbos or qwen3. Cloud-based API calls yield better results.
National, regional, or racial bias testing:
Gemini models perform best.
Dangerous weapons or high-risk behavior testing:
Claude models perform best. For cost-effectiveness, Gemini models are also an option.

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Referee model selection #87

Uh oh!

{{title}}

Uh oh!

Replies: 1 comment

Uh oh!

{{title}}

Uh oh!

Select a reply

Uh oh!

Referee model selection #87

Uh oh!

ketchup8866 Sep 22, 2025

1、Recommended Choices for MCP Scan

2、Recommended Choices for Jailbreak Evaluation Models

Replies: 1 comment

Uh oh!

zonashi Sep 22, 2025 Maintainer

1、Recommended Choices for MCP Scan

2、Recommended Choices for Jailbreak Evaluation Models

ketchup8866
Sep 22, 2025

zonashi
Sep 22, 2025
Maintainer