Add async Vision LLM extraction module (task 05) by zalun · Pull Request #10 · zalun/PaperRoute

zalun · 2026-04-03T10:43:22Z

Summary

Add src/docproc/vision.py — async Vision extraction via DeepFellow OpenAI-compatible API
Convert PDF pages to images locally using PyMuPDF (zero system deps)
Send base64-encoded images to Vision LLM via AsyncOpenAI chat completions
Return VisionResult with extracted markdown content
Retry with exponential backoff on 5xx/connection errors, fail fast on 4xx
27 tests covering validation, conversion, API calls, retries, and integration

Test plan

Closes #9

Implements src/docproc/vision.py that converts PDF pages to images using PyMuPDF locally, sends base64-encoded images to DeepFellow's OpenAI-compatible chat completions endpoint, and returns VisionResult. Closes #9

- Wrap PDF page rendering errors in VisionError (was propagating raw) - Add test for connection error exhausting all retries - Add test for image file read failure (OSError) - Add test for PDF rendering failure with doc.close() guarantee

- Raise VisionError on empty choices list instead of IndexError - Log warning when Vision API returns None/empty content - Add tests for both edge cases

zalun added 3 commits April 3, 2026 12:42

Add async Vision LLM extraction module (task 05)

729f247

Implements src/docproc/vision.py that converts PDF pages to images using PyMuPDF locally, sends base64-encoded images to DeepFellow's OpenAI-compatible chat completions endpoint, and returns VisionResult. Closes #9

Guard empty API choices and log None content from review round 2

ede7fdb

- Raise VisionError on empty choices list instead of IndexError - Log warning when Vision API returns None/empty content - Add tests for both edge cases

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add async Vision LLM extraction module (task 05)#10

Add async Vision LLM extraction module (task 05)#10
zalun wants to merge 3 commits intomainfrom
05-vision-extraction

zalun commented Apr 3, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

zalun commented Apr 3, 2026

Summary

Test plan

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant