Skip to content

ricrowl/doc-ocr

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

doc-ocr

This module outputs the text found in images using OCR. It supports not only image data but also images embedded in HTML.

Install

Install the required Python packages:

pip install -r requirements.txt

Usage

Set source path to .env

  • For image dir: SOURCE_PATH=path/to/images/dir
  • For html path: SOURCE_PATH=path/to/html/path

Run the translation script:

python src/main.py

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors