Referee model selection #87
-
|
AS for a safety check, which external model should I choose as the referee model? Any recommendations? |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment
-
1、Recommended Choices for MCP Scan
2、Recommended Choices for Jailbreak Evaluation ModelsWhen working with a custom dataset, selecting an appropriate safety evaluation model can significantly improve the accuracy of automated assessments. You can balance model selection from two dimensions: language and scenario. Language
Scenario
|
Beta Was this translation helpful? Give feedback.
1、Recommended Choices for MCP Scan
2、Recommended Choices for Jailbreak Evaluation Models
When working with a custom dataset, selecting an appropriate safety evaluation model can significantly improve the accuracy of automated assessments. You can balance model selection from two dimensions: language and scenario.
Language
Chinese Recommendation:
qwen3-max(best performance)qwen3-235b-a22b-2507(cost-effective choice)English Recommendation:
claude-opus-4.1(best performance)claude-sonnet-4(very good performance)gemini-2.0-flash(cost-effective choice)Scenario
Politically sensitive content testing:
Do not