navidocs

History

ggq-admin df68e27e26 fix: Complete OCR pipeline with language code mapping - Fix tesseract language code mapping (en -> eng) to match available training data - Switch from Tesseract.js to local system tesseract command for better reliability - Add TESSDATA_PREFIX environment variable for tesseract data path - Create test directory structure to workaround pdf-parse debug mode - OCR now successfully extracting text with 0.85 confidence Tested with NaviDocs test manual - successfully extracted text including: - "Bilge Pump Maintenance" - "Electrical System" - Battery maintenance instructions 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-10-19 05:09:51 +02:00
..
05-versions-space.pdf	fix: Complete OCR pipeline with language code mapping	2025-10-19 05:09:51 +02:00
05-versions-space.pdf.txt	fix: Complete OCR pipeline with language code mapping	2025-10-19 05:09:51 +02:00

ggq-admin df68e27e26 fix: Complete OCR pipeline with language code mapping

- Fix tesseract language code mapping (en -> eng) to match available training data
- Switch from Tesseract.js to local system tesseract command for better reliability
- Add TESSDATA_PREFIX environment variable for tesseract data path
- Create test directory structure to workaround pdf-parse debug mode
- OCR now successfully extracting text with 0.85 confidence

Tested with NaviDocs test manual - successfully extracted text including:
- "Bilge Pump Maintenance"
- "Electrical System"
- Battery maintenance instructions

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude <noreply@anthropic.com>

2025-10-19 05:09:51 +02:00

05-versions-space.pdf

fix: Complete OCR pipeline with language code mapping

2025-10-19 05:09:51 +02:00

05-versions-space.pdf.txt

fix: Complete OCR pipeline with language code mapping

2025-10-19 05:09:51 +02:00