Home
Nick edited this page Oct 31, 2025 · 3 revisions
Welcome to the DocStripper documentation wiki! This wiki contains comprehensive guides, tutorials, and reference materials for using and contributing to DocStripper.
- Installation Guide - How to install and set up DocStripper
- Usage Guide - How to use the DocStripper web app and CLI tool
- API Documentation - API reference for developers
- Contributing Guide - How to contribute to DocStripper
- FAQ - Frequently asked questions
Quick links:
- Web Application: https://kiku-jw.github.io/DocStripper/
- GitHub Repository: https://github.com/kiku-jw/DocStripper
- Issues: https://github.com/kiku-jw/DocStripper/issues
- Discussions: https://github.com/kiku-jw/DocStripper/discussions
DocStripper is an AI-powered batch document cleaner that automatically removes noise from text documents:
- Page numbers - Lines with only digits (1, 2, 3...)
- Headers/Footers - Common patterns like "Page X of Y", "Confidential"
- Duplicate lines - Consecutive identical lines
- Empty lines - Whitespace-only lines
- Punctuation lines - Lines with only symbols (---, ***, ===)
Key features:
- 🤖 Smart Clean (Beta) - AI-powered cleaning using an on-device LLM
- ⚡ Fast Clean - Instant rule-based cleaning
- 🔒 100% Private - All processing happens in your browser
- 🌐 Web App - No installation required
- 🖥️ CLI Tool - Command-line interface for batch processing
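The rule-based cleanup described above (page numbers, headers/footers, consecutive duplicates, empty lines, punctuation-only lines) can be illustrated with a short Python sketch. This is not DocStripper's actual code: the function name `clean_lines` and the specific patterns are assumptions made for illustration, and the real rules may be broader or stricter.

```python
import re
import string

# Hypothetical patterns for illustration; DocStripper's real rules may differ.
PAGE_NUMBER = re.compile(r"^\s*\d+\s*$")
HEADER_FOOTER = re.compile(r"^\s*(page \d+ of \d+|confidential)\s*$", re.IGNORECASE)


def is_punctuation_only(line):
    """True for lines made up solely of symbols, such as ---, *** or ===."""
    stripped = line.strip()
    return bool(stripped) and all(ch in string.punctuation for ch in stripped)


def clean_lines(lines):
    """Return the lines that survive the noise rules, in their original order."""
    cleaned = []
    previous = None  # previous raw line, used to detect consecutive duplicates
    for line in lines:
        is_duplicate = line == previous
        previous = line
        if not line.strip():            # empty or whitespace-only line
            continue
        if PAGE_NUMBER.match(line):     # line containing only digits
            continue
        if HEADER_FOOTER.match(line):   # "Page X of Y", "Confidential"
            continue
        if is_punctuation_only(line):   # line containing only symbols
            continue
        if is_duplicate:                # same as the line directly before it
            continue
        cleaned.append(line)
    return cleaned


sample = ["Title", "1", "Page 2 of 10", "Body text", "Body text",
          "", "---", "CONFIDENTIAL", "End"]
print(clean_lines(sample))
```

In this sketch a line counts as a duplicate only when it matches the line immediately before it in the input, so a sentence that legitimately repeats later in the document is kept.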
Getting started:
- Try it online at https://kiku-jw.github.io/DocStripper/
- Install it: see the Installation Guide
- Learn how to use it: see the Usage Guide
- Check out examples: see the Usage Guide
Need help?
- Check the FAQ for common questions
- Join the Discussions
- Open an Issue for bugs or feature requests
Made with ❤️ for clean documents