No description

Find a file

ggq-admin 09d9f1b601 Implement PDF image extraction with OCR in OCR worker This commit adds comprehensive image extraction and OCR functionality to the OCR worker: Features: - Created image-extractor.js worker module with extractImagesFromPage() function - Uses pdftoppm (with ImageMagick fallback) to convert PDF pages to high-res images - Images saved to /uploads/{documentId}/images/page-{N}-img-{M}.png - Returns image metadata: id, path, position, width, height OCR Worker Integration: - Imports image-extractor module and extractTextFromImage from OCR service - After processing page text, extracts images from each page - Runs Tesseract OCR on extracted images - Stores image data in document_images table with extracted text and confidence - Indexes images in Meilisearch with type='image' for searchability - Updates document.imageCount and sets imagesExtracted flag Database: - Uses existing document_images table from migration 004 - Stores image metadata, OCR text, and confidence scores Dependencies: - Added pdf-img-convert and sharp packages - Uses system tools (pdftoppm/ImageMagick) for reliable PDF conversion Testing: - Created test-image-extraction.js to verify image extraction - Created test-full-pipeline.js to test end-to-end extraction + OCR - Successfully tested with 05-versions-space.pdf test document Error Handling: - Graceful degradation if image extraction fails - Continues OCR processing even if images cannot be extracted - Comprehensive logging for debugging Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>		2025-10-19 19:54:25 +02:00
client	feat: Add image extraction design, database schema, and migration	2025-10-19 19:47:30 +02:00
docs	feat: Add image extraction design, database schema, and migration	2025-10-19 19:47:30 +02:00
scripts	Add StackCP hosting evaluation and deployment guides	2025-10-19 09:35:27 +02:00
server	Implement PDF image extraction with OCR in OCR worker	2025-10-19 19:54:25 +02:00
test/data	fix: Complete OCR pipeline with language code mapping	2025-10-19 05:09:51 +02:00
.gitignore	feat(ui): Meilisearch-style polish (badges, glass, grid, skeleton) + theme color\n\n- Add accessible focus ring and kbd styling\n- Add badge/glass/section/accent-border/bg-grid/skeleton utilities\n- Update theme-color + OG meta\n- Ignore sensitive handover file\n\nSee docs/ui/CHANGELOG_UI.md for details	2025-10-19 16:52:02 +02:00
ANALYSIS_INDEX.md	docs: Add complete NaviDocs handover documentation and StackCP analysis	2025-10-19 13:19:42 +02:00
ARCHITECTURE-SUMMARY.md	docs: Add architecture summary	2025-10-19 01:23:40 +02:00
BRANDING_CREATIVE_BRIEF.md	feat: Complete frontend UI polish with Meilisearch-inspired design	2025-10-19 16:40:48 +02:00
BUILD_COMPLETE.md	feat: Complete frontend UI polish with Meilisearch-inspired design	2025-10-19 16:40:48 +02:00
CLEANUP_COMPLETE.sh	feat: Complete frontend UI polish with Meilisearch-inspired design	2025-10-19 16:40:48 +02:00
DEVELOPMENT.md	feat: Complete frontend UI polish with Meilisearch-inspired design	2025-10-19 16:40:48 +02:00
GITEA_ACCESS.md	docs: Add Gitea access explanation	2025-10-19 13:48:58 +02:00
GOOGLE_DRIVE_OCR_QUICKSTART.md	feat: Complete frontend UI polish with Meilisearch-inspired design	2025-10-19 16:40:48 +02:00
IMPLEMENTATION_COMPLETE.md	feat: NaviDocs MVP - Complete codebase extraction from lilian1	2025-10-19 01:55:44 +02:00
NAVIDOCS_HANDOVER.md	feat: Complete frontend UI polish with Meilisearch-inspired design	2025-10-19 16:40:48 +02:00
navidocs_search_token_test_report.json	feat: Add image extraction design, database schema, and migration	2025-10-19 19:47:30 +02:00
OCR_FINAL_RECOMMENDATION.md	docs: Add final OCR recommendation and comparison summary	2025-10-19 09:09:22 +02:00
OCR_PIPELINE_SETUP.md	feat: NaviDocs MVP - Complete codebase extraction from lilian1	2025-10-19 01:55:44 +02:00
PORT_ALLOCATION.md	feat: Complete frontend UI polish with Meilisearch-inspired design	2025-10-19 16:40:48 +02:00
PORT_MIGRATION_SUMMARY.md	feat: Complete frontend UI polish with Meilisearch-inspired design	2025-10-19 16:40:48 +02:00
QUICK_REFERENCE.md	feat(ui): Meilisearch-style polish (badges, glass, grid, skeleton) + theme color\n\n- Add accessible focus ring and kbd styling\n- Add badge/glass/section/accent-border/bg-grid/skeleton utilities\n- Update theme-color + OG meta\n- Ignore sensitive handover file\n\nSee docs/ui/CHANGELOG_UI.md for details	2025-10-19 16:52:02 +02:00
QUICKSTART.md	feat: NaviDocs MVP - Complete codebase extraction from lilian1	2025-10-19 01:55:44 +02:00
README.md	feat: Complete frontend UI polish with Meilisearch-inspired design	2025-10-19 16:40:48 +02:00
README_NEW.md	feat: Complete frontend UI polish with Meilisearch-inspired design	2025-10-19 16:40:48 +02:00
REORGANIZE_FILES.sh	feat: Complete frontend UI polish with Meilisearch-inspired design	2025-10-19 16:40:48 +02:00
SERVICES_STATUS.md	feat: Complete frontend UI polish with Meilisearch-inspired design	2025-10-19 16:40:48 +02:00
SESSION_STATUS.md	feat: Complete frontend UI polish with Meilisearch-inspired design	2025-10-19 16:40:48 +02:00
STACKCP_ARCHITECTURE_ANALYSIS.md	docs: Add complete NaviDocs handover documentation and StackCP analysis	2025-10-19 13:19:42 +02:00
STACKCP_DEBATE_BRIEF.md	docs: Add complete NaviDocs handover documentation and StackCP analysis	2025-10-19 13:19:42 +02:00
STACKCP_EVALUATION_REPORT.md	feat: Complete frontend UI polish with Meilisearch-inspired design	2025-10-19 16:40:48 +02:00
STACKCP_QUICK_REFERENCE.md	docs: Add complete NaviDocs handover documentation and StackCP analysis	2025-10-19 13:19:42 +02:00
STACKCP_VERIFICATION_SUMMARY.md	Add StackCP deployment verification summary	2025-10-19 09:36:43 +02:00
start-all.sh	feat: Complete frontend UI polish with Meilisearch-inspired design	2025-10-19 16:40:48 +02:00
stop-all.sh	feat: Complete frontend UI polish with Meilisearch-inspired design	2025-10-19 16:40:48 +02:00
test-backend-e2e.js	feat: Add image extraction design, database schema, and migration	2025-10-19 19:47:30 +02:00
test-e2e.js	feat: Add image extraction design, database schema, and migration	2025-10-19 19:47:30 +02:00
test-manual.pdf	fix: Switch to local system tesseract command for OCR	2025-10-19 04:48:18 +02:00
TEST_RESULTS.md	feat: Complete frontend UI polish with Meilisearch-inspired design	2025-10-19 16:40:48 +02:00

README.md

NaviDocs - Professional Boat Manual Management

Production-ready boat manual management platform with OCR and intelligent search

Built with Vue 3, Express, SQLite, and Meilisearch. Extracted from the lilian1 (FRANK-AI) prototype with clean, professional code only.

Features

Upload PDFs - Drag and drop boat manuals
OCR Processing - Automatic text extraction with Tesseract.js
Intelligent Search - Meilisearch with boat terminology synonyms
Offline-First - PWA with service worker caching
Multi-Vertical - Supports boats, marinas, and properties
Secure - Tenant tokens, file validation, rate limiting

Tech Stack

Backend

Node.js 20 - Express 5
SQLite - better-sqlite3 with WAL mode
Meilisearch - Sub-100ms search with synonyms
BullMQ - Background OCR job processing
Tesseract.js - PDF text extraction

Frontend

Vue 3 - Composition API with <script setup>
Vite - Fast builds and HMR
Tailwind CSS - Meilisearch-inspired design
Pinia - State management
PDF.js - Document viewer

Quick Start

Prerequisites

# Required
node >= 20.0.0
npm >= 10.0.0

# For OCR
pdftoppm (from poppler-utils)
tesseract >= 5.0.0

# For search
meilisearch >= 1.0.0

# For queue
redis >= 6.0.0

Installation

# Clone repository
cd ~/navidocs

# Install server dependencies
cd server
npm install
cp .env.example .env
# Edit .env with your configuration

# Initialize database
npm run init-db

# Install client dependencies
cd ../client
npm install

# Start services (each in separate terminal)
meilisearch --master-key=masterKey
redis-server
cd ~/navidocs/server && node workers/ocr-worker.js
cd ~/navidocs/server && npm run dev
cd ~/navidocs/client && npm run dev

Visit http://localhost:8080

Architecture

See docs/architecture/ for complete schema and configuration details.

Ship it. Learn from users. Iterate.