doc-ocr

This module outputs the text found in images using OCR. It supports not only image data but also images embedded in HTML.

Install

Install the required Python packages:

pip install -r requirements.txt

Set source path to .env

Run the translation script:

python src/main.py

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
data		data
src		src
.env.sample		.env.sample
.gitignore		.gitignore
README.md		README.md
requirements.txt		requirements.txt