This tool, originally built for Sony's Digital Paper System (DPS), is now a general-purpose DjVu-to-PDF converter with a focus on small output size and the ability to preserve ...
Windows-focused fork of Typhoon OCR. Gradio demo for PDF/image OCR to Markdown/HTML with layout & table extraction. Uses OpenAI-compatible API or vLLM via WSL2. A Python utility for merging multiple ...
A scanned file or screenshot that contains text looks fine at first. The problem comes when you need that text in editable form. Typing everything manually takes too much time and ...
I want to share the approach, the trade-offs, and the architectural decisions behind it, in case you're building something similar or exploring how lightweight OCR can fit into a modern distributed ...