Skip to content

fix(text): Filter single-character redactions as false positives#200

Open
mlissner wants to merge 3 commits into
filter-district-of-boilerplate-20260327from
filter-single-char-redactions-20260328
Open

fix(text): Filter single-character redactions as false positives#200
mlissner wants to merge 3 commits into
filter-district-of-boilerplate-20260327from
filter-single-char-redactions-20260328

Conversation

@mlissner
Copy link
Copy Markdown
Member

Summary

  • Add is_single_char() filter to text_utils.py — a single alphanumeric character (possibly with surrounding whitespace/punctuation) is not meaningful redacted content
  • Add pytest to [dependency-groups] dev in pyproject.toml (was missing, causing uv sync --group dev to uninstall it)
  • Fix CLAUDE.md install instructions to use uv sync --group dev instead of uv pip install -e ".[dev]" which didn't install dependency groups
  • Document text filtering step in README

Test plan

🤖 Generated with Claude Code

mlissner and others added 3 commits March 28, 2026 08:27
A single letter or digit — possibly surrounded by whitespace or
punctuation — is not meaningful redacted content. These commonly
appear when a rectangle slightly overlaps an adjacent character.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
…tions

pytest was missing from [dependency-groups] dev, so `uv sync --group dev`
would uninstall it. Also updated CLAUDE.md to use `uv sync --group dev`
instead of the old `uv pip install -e ".[dev]"` which didn't actually
install dependency groups.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
@mlissner mlissner marked this pull request as ready for review March 28, 2026 15:32
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant