Pinned
-
llm-safety-evaluation-lab (Public)
OWASP-inspired LLM red-teaming framework with adversarial test cases, automated benchmarking, and cross-model safety evaluation.
Python 1
-
rusafetybench (Public)
Open adversarial benchmark for Russian-language LLM safety evaluation.
-
multi-agent-safety-sim (Public)
Experimental Python framework for simulating multi-agent AI safety failure modes.
Python