Add first draft of base_jailbreak by rmura498 · Pull Request #177 · pralab/secml-torch

rmura498 · 2026-01-28T15:16:08Z

This PR introduces a first draft of a BaseJailbreakAttack abstraction.
The goal is to define a minimal, prompt-wise interface for jailbreak attacks, aligned with the existing design used for evasion attacks.

The base class orchestrates the attack over a list of harmful behaviors and leaves the attack-specific logic, success criteria, and objectives to concrete attack implementations.
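
To illustrate the split described above, here is a minimal sketch of what a prompt-wise base class could look like. It is not the code added in this PR: the names `BaseJailbreakAttackSketch`, `_run_single`, `_is_successful`, `SuffixAttack`, and `toy_model` are all hypothetical, and the model is assumed to be a plain callable from prompt string to response string.

```python
from abc import ABC, abstractmethod
from typing import Callable, List, Tuple

# Assumption: a "model" is any callable mapping a prompt to a response.
Model = Callable[[str], str]


class BaseJailbreakAttackSketch(ABC):
    """Illustrative base class (hypothetical, not the PR's implementation)."""

    def run(self, model: Model, behaviors: List[str]) -> List[Tuple[str, bool]]:
        """Attack each behavior independently and collect (prompt, success)."""
        results = []
        for behavior in behaviors:
            adv_prompt = self._run_single(model, behavior)
            success = self._is_successful(model(adv_prompt))
            results.append((adv_prompt, success))
        return results

    @abstractmethod
    def _run_single(self, model: Model, behavior: str) -> str:
        """Attack-specific logic: build the adversarial prompt for one behavior."""

    @abstractmethod
    def _is_successful(self, response: str) -> bool:
        """Attack-specific success criterion evaluated on the model response."""


class SuffixAttack(BaseJailbreakAttackSketch):
    """Toy subclass that appends a fixed suffix to every behavior."""

    def __init__(self, suffix: str) -> None:
        self.suffix = suffix

    def _run_single(self, model: Model, behavior: str) -> str:
        return f"{behavior} {self.suffix}"

    def _is_successful(self, response: str) -> bool:
        return not response.startswith("REFUSED")


def toy_model(prompt: str) -> str:
    # Placeholder model: complies only when the prompt ends with the suffix.
    return "OK" if prompt.endswith("[suffix]") else "REFUSED"


results = SuffixAttack("[suffix]").run(toy_model, ["behavior A", "behavior B"])
```

In this sketch the base class owns only the loop over behaviors, so a new attack only has to supply the two abstract methods.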

codecov · 2026-01-28T15:27:35Z

Codecov Report

❌ Patch coverage is 95.34884% with 2 lines in your changes missing coverage. Please review.
✅ Project coverage is 91.96%. Comparing base (9a798ff) to head (b3c57a9).
⚠️ Report is 1 commit behind head on main.

Files with missing lines	Patch %	Lines
src/secmlt/adv/jailbreak/base_jailbreak_attack.py	94.44%	1 Missing ⚠️
src/secmlt/tests/test_jailbreaks.py	96.00%	1 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #177      +/-   ##
==========================================
+ Coverage   91.90%   91.96%   +0.06%     
==========================================
  Files          67       69       +2     
  Lines        2173     2216      +43     
==========================================
+ Hits         1997     2038      +41     
- Misses        176      178       +2

Add first draft of base_jailbreak

9201759

rmura498 linked an issue Jan 28, 2026 that may be closed by this pull request

BaseJailbreak — skeleton for text-attack interface #169

Open

fix coverage

b3c57a9

rmura498 marked this pull request as ready for review January 28, 2026 16:20

rmura498 requested a review from maurapintor January 28, 2026 16:20

rmura498 wants to merge 2 commits into main from 169-basejailbreak-skeleton-for-text-attack-interface
