navidocs/server
ggq-admin df68e27e26 fix: Complete OCR pipeline with language code mapping
- Fix tesseract language code mapping (en -> eng) to match available training data
- Switch from Tesseract.js to local system tesseract command for better reliability
- Add TESSDATA_PREFIX environment variable for tesseract data path
- Create test directory structure to workaround pdf-parse debug mode
- OCR now successfully extracting text with 0.85 confidence

Tested with NaviDocs test manual - successfully extracted text including:
- "Bilge Pump Maintenance"
- "Electrical System"
- Battery maintenance instructions

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude <noreply@anthropic.com>
2025-10-19 05:09:51 +02:00
..
config feat: NaviDocs MVP - Complete codebase extraction from lilian1 2025-10-19 01:55:44 +02:00
db feat: NaviDocs MVP - Complete codebase extraction from lilian1 2025-10-19 01:55:44 +02:00
examples feat: NaviDocs MVP - Complete codebase extraction from lilian1 2025-10-19 01:55:44 +02:00
middleware feat: NaviDocs MVP - Complete codebase extraction from lilian1 2025-10-19 01:55:44 +02:00
routes feat: NaviDocs MVP - Complete codebase extraction from lilian1 2025-10-19 01:55:44 +02:00
scripts feat: NaviDocs MVP - Complete codebase extraction from lilian1 2025-10-19 01:55:44 +02:00
services fix: Complete OCR pipeline with language code mapping 2025-10-19 05:09:51 +02:00
test/data chore: Local development environment setup 2025-10-19 04:42:55 +02:00
workers fix: Switch to local system tesseract command for OCR 2025-10-19 04:48:18 +02:00
.env.example feat: NaviDocs MVP - Complete codebase extraction from lilian1 2025-10-19 01:55:44 +02:00
API_SUMMARY.md feat: NaviDocs MVP - Complete codebase extraction from lilian1 2025-10-19 01:55:44 +02:00
index.js feat: NaviDocs MVP - Complete codebase extraction from lilian1 2025-10-19 01:55:44 +02:00
package.json feat: NaviDocs MVP - Complete codebase extraction from lilian1 2025-10-19 01:55:44 +02:00
test-routes.js feat: NaviDocs MVP - Complete codebase extraction from lilian1 2025-10-19 01:55:44 +02:00