diff --git a/AUTONOMOUS-COORDINATION-STATUS.md b/AUTONOMOUS-COORDINATION-STATUS.md index e825575..240f848 100644 --- a/AUTONOMOUS-COORDINATION-STATUS.md +++ b/AUTONOMOUS-COORDINATION-STATUS.md @@ -10,14 +10,16 @@ | Session | Agent ID Range | Status | Progress | Outputs | |---------|---------------|--------|----------|---------| | Session 1 | S1-H01 to S1-H10 | 🟡 READY | 0/10 agents | `intelligence/session-1/` | -| Session 2 | S2-H01 to S2-H10 | 🟡 READY | 0/10 agents | `intelligence/session-2/` | +| Session 2 | S2-H0A to S2-H10 | 🟢 ACTIVE (Helper) | 4/15 agents | `intelligence/session-2/` | | Session 3 | S3-H01 to S3-H10 | 🟡 READY | 0/10 agents | `intelligence/session-3/` | -| Session 4 | S4-H01 to S4-H10 | 🟡 READY | 0/10 agents | `intelligence/session-4/` | -| Session 5 | S5-H01 to S5-H10 | 🟡 READY | 0/20 guardians | `intelligence/session-5/` | +| Session 4 | S4-H0A to S4-H10 | 🟢 ACTIVE (Helper) | 4/14 agents | `intelligence/session-4/` | +| Session 5 | S5-H0A to S5-H10 | 🟢 ACTIVE (QA) | 3/13 agents | `intelligence/session-5/` | **Status Legend:** - 🟡 READY - Session initialized, waiting to start - 🟢 ACTIVE - Session running, agents working +- 🟢 ACTIVE (Helper) - Session assisting other sessions while waiting +- 🟢 ACTIVE (QA) - Session providing quality assurance support - 🔵 BLOCKED - Waiting for dependencies from other sessions - ✅ COMPLETE - All outputs delivered @@ -49,20 +51,37 @@ Next: Session 2 unblocked --- -### **Session 2: Technical Integration** +### **Session 2: Technical Integration + Helper Agents** -**Current Task:** Codebase analysis + sticky feature architecture +**Current Task:** PHASE 1 - Helper agents assisting Sessions 1 & 3 (START NOW) -**Instructions:** -1. **First:** Agent 1 (S2-H01) analyzes NaviDocs codebase (NO dependency on Session 1) -2. **Parallel:** Agents 2-9 research technical solutions (inventory, cameras, maintenance, etc.) -3. **When Session 1 completes:** Poll for `intelligence/session-1/session-1-handoff.md` -4. **Then:** Agent 10 synthesizes with Session 1 pain point priorities -5. Output to `intelligence/session-2/` (architecture.md, integration-specs.md, handoff.md) +**IMMEDIATE ACTIONS (Agents 0A, 0B, 0C, 0D):** +1. **Agent 0A (CONTINUOUS):** Technical validation assistant + - Verify competitor tech stacks for Session 1 + - Check API availability (YachtWorld, Boat Trader, etc.) + - Validate technical feasibility of Session 3 UX proposals +2. **Agent 0B (CONTINUOUS):** Citation automation + - Generate SHA-256 hashes for Session 1 web URLs + - Verify URL accessibility (flag broken links) + - Auto-generate IF.TTT-compliant citation JSON +3. **Agent 0C (CONTINUOUS):** Web scraping assistant + - Extract competitor feature lists, pricing tables + - Parse industry reports for Session 1 + - Create structured competitor-data.json +4. **Agent 0D (PREP WORK):** ROI calculator backend + - Build generic calculator framework NOW + - Ready to plug in Session 1 data when it arrives + - Prepare chart generation and export functionality + +**PHASE 2 - Technical Architecture (WAIT FOR SESSION 1):** +5. **When Session 1 completes:** Poll for `intelligence/session-1/session-1-handoff.md` +6. **Agents 1-9 + 3A + 7A:** Codebase analysis + feature design (11 agents) +7. **Agent 10:** Synthesize with Session 1 pain point priorities +8. Output to `intelligence/session-2/` (architecture.md, integration-specs.md, handoff.md) **Dependencies:** -- **Agent 1 task:** NONE (start immediately) -- **Agent 10 synthesis:** Session 1 complete +- **Agents 0A-0D:** NONE (start immediately - assist Sessions 1 & 3) +- **Agents 1-9 + 3A + 7A + Agent 10:** Session 1 complete **Polling Command:** ```bash @@ -103,20 +122,41 @@ fi --- -### **Session 4: Implementation Planning** +### **Session 4: Implementation Planning + Helper Agents** -**Current Task:** Generic sprint planning + roadmap templates +**Current Task:** PHASE 1 - Helper agents providing project management support (START NOW) -**Instructions:** -1. **Parallel:** Week agents research sprint best practices, roadmap templates -2. **When Sessions 1+2+3 complete:** Poll for handoff files -3. **Then:** Week agents create detailed sprint plans with feature priorities from Session 1 -4. **Finally:** Agent 10 creates integrated 4-week roadmap -5. Output to `intelligence/session-4/` (sprint-plan.md, roadmap.md, handoff.md) +**IMMEDIATE ACTIONS (Agents 0A, 0B, 0C, 0D):** +1. **Agent 0A (CONTINUOUS - Every 5 minutes):** Research coordination dashboard + - Track Session 1 agent progress (7/10 complete, etc.) + - Monitor deliverable status (market-analysis.md: COMPLETE, etc.) + - Predict completion times for all sessions + - Detect blockers early (agent stuck, over-budget, etc.) +2. **Agent 0B (CONTINUOUS):** Citation quality checker + - Pre-validate Session 1, 2, 3 citations BEFORE Session 5 + - Check IF.TTT compliance (citation_id, sources, confidence, SHA-256) + - Flag issues immediately (single-source claims, missing hashes, etc.) + - Faster feedback loop than waiting for Session 5 validation +3. **Agent 0C (PREP WORK):** Demo script structure + - Research winning demo flows (problem → solution → demo → ROI → close) + - Create generic template with placeholders for Session 1 data + - Research yacht owner objections (cost, complexity, time) + - Ready to fill with Session 1 findings when available +4. **Agent 0D (CONTINUOUS - Every 5 minutes):** Cross-session dependency tracker + - Visual dependency graph (Mermaid) of all sessions + - Critical path identification (slowest session = bottleneck) + - Real-time completion predictions + - Parallel work opportunity detection + +**PHASE 2 - Implementation Planning (WAIT FOR SESSIONS 1+2+3):** +5. **When Sessions 1+2+3 complete:** Poll for handoff files +6. **Agents 1-9:** Create detailed 4-week sprint plan with feature priorities +7. **Agent 10:** Synthesize integrated roadmap +8. Output to `intelligence/session-4/` (sprint-plan.md, roadmap.md, handoff.md) **Dependencies:** -- **Week 1-4 generic planning:** NONE (start immediately) -- **Detailed feature breakdown:** Sessions 1+2+3 complete +- **Agents 0A-0D:** NONE (start immediately - assist all sessions) +- **Agents 1-10:** Sessions 1+2+3 complete **Polling Command:** ```bash @@ -131,22 +171,36 @@ fi --- -### **Session 5: Guardian Validation** +### **Session 5: Guardian Validation + Active Quality Assurance** -**Current Task:** Guardian methodology review + evaluation criteria prep +**Current Task:** PHASE 1 - Active QA Partner (NO DEPENDENCIES - START NOW) -**Instructions:** -1. **Parallel:** Guardians 1-12 + IF.sam facets review IF.TTT framework, prepare evaluation criteria -2. **When Sessions 1+2+3+4 complete:** Poll for handoff files -3. **Then:** Guardians review complete intelligence dossier -4. **IF.sam Debate:** 8 facets debate findings (Light Side vs Dark Side) -5. **Vote:** Agent 10 tallies consensus (need >80% approval) -6. **ESCALATE:** If <80%, flag for human review -7. Output to `intelligence/session-5/` (complete-intelligence-dossier.md, guardian-vote.md, consensus-report.md) +**IMMEDIATE ACTIONS (Agents 0A, 0B, 0C):** +1. **Agent 0A (CRITICAL - First 10 minutes):** Deploy `EVIDENCE_QUALITY_STANDARDS.md` + - Citation format templates (IF.TTT compliance) + - Evidence quality scoring rubric (primary/secondary/tertiary sources) + - Multi-source verification examples + - Commit to coordination branch → Sessions 1-4 read immediately +2. **Agent 0B (CONTINUOUS - Every 5 minutes):** Real-time quality monitoring + - Poll `intelligence/session-*/` for new commits + - Review citations for IF.TTT compliance + - Create `QUALITY_FEEDBACK.md` (updated every 5 minutes) + - Sessions 1-4 read feedback → fix issues proactively +3. **Agent 0C (PREP WORK):** Guardian briefing templates + - Create 20 guardian-specific briefing templates + - Consensus prediction formula + - Voting criteria checklists + +**PHASE 2 - Final Validation (WAIT FOR SESSIONS 1+2+3+4):** +4. **When Sessions 1+2+3+4 complete:** Poll for handoff files +5. **Agents 1-9:** Extract evidence, validate claims, compile citations +6. **Agent 10:** Guardian Council vote (need >80% consensus) +7. **ESCALATE:** If <80% approval, flag for human review +8. Output to `intelligence/session-5/` (complete-intelligence-dossier.md, guardian-vote.md) **Dependencies:** -- **Methodology review:** NONE (start immediately) -- **Dossier validation:** Sessions 1+2+3+4 complete +- **Agent 0A, 0B, 0C:** NONE (start immediately - assist Sessions 1-4) +- **Agents 1-10:** Sessions 1+2+3+4 complete **Polling Command:** ```bash diff --git a/CLOUD_SESSION_2_TECHNICAL_INTEGRATION.md b/CLOUD_SESSION_2_TECHNICAL_INTEGRATION.md index 64e2195..2de54b9 100644 --- a/CLOUD_SESSION_2_TECHNICAL_INTEGRATION.md +++ b/CLOUD_SESSION_2_TECHNICAL_INTEGRATION.md @@ -44,6 +44,146 @@ Design technical architecture for **sticky daily-use features** (inventory track --- +## PHASE 1: Helper Agents (START IMMEDIATELY - ASSIST SESSIONS 1 & 3) + +**Mission:** While waiting for Session 1 market research and Session 3 UX/sales work, provide active technical assistance to accelerate their completion. + +**NO DEPENDENCIES:** All Phase 1 agents can start immediately (no need to wait for other sessions). + +--- + +### Agent 0A: Technical Validation Assistant (CRITICAL) +**AGENT ID:** S2-H0A +**START:** Immediately (no dependencies) + +**Assist Session 1 (Market Research):** +- **Verify competitor tech stacks:** When Session 1 identifies competitors (YachtWorld, Boat Trader, etc.), validate their API documentation +- **Check API availability:** Research if competitor APIs are public, require auth, or need partner agreements +- **Document API contracts:** Create structured summaries of competitor API capabilities for Session 1 citations + +**Assist Session 3 (UX/Sales):** +- **Technical feasibility checks:** When Session 3 proposes UI features, confirm if NaviDocs tech stack supports them +- **Performance estimates:** Provide load time estimates for proposed features (based on NaviDocs current architecture) +- **Mobile compatibility:** Validate if proposed UX patterns work on mobile (Vue 3 + PWA constraints) + +**Deliverable:** `intelligence/session-2/helper-technical-validations.md` (updated in real-time as Sessions 1 & 3 work) + +--- + +### Agent 0B: Citation Automation (SHA-256 Hash Generation) +**AGENT ID:** S2-H0B +**START:** Immediately (no dependencies) + +**Assist Session 1 (Market Research):** +- **Generate SHA-256 hashes:** When Session 1 cites web URLs, automatically fetch and hash content +- **Verify URL accessibility:** Check if cited URLs return 200 OK (flag broken links immediately) +- **Extract structured data:** Parse web pages for key data points (pricing, features, market stats) +- **Create citation JSON:** Auto-generate IF.TTT-compliant citation entries with hashes + +**Example Workflow:** +```bash +# Session 1 Agent 3 cites: "YachtWorld pricing: €25/month" +# Agent 0B immediately: +1. Fetches https://yachtworld.com/pricing +2. Generates SHA-256: a1b2c3d4e5f6... +3. Extracts pricing table +4. Creates citation JSON: +{ + "citation_id": "if://citation/yachtworld-pricing-nov2025", + "claim": "YachtWorld charges €25/month for yacht management", + "sources": [{ + "type": "web", + "url": "https://yachtworld.com/pricing", + "sha256": "a1b2c3d4e5f6...", + "accessed": "2025-11-13", + "quality": "primary", + "credibility": 9 + }], + "status": "verified", + "confidence_score": 0.95 +} +``` + +**Deliverable:** `intelligence/session-2/auto-citations.json` (append as Session 1 works) + +--- + +### Agent 0C: Web Scraping Assistant (Structured Data Extraction) +**AGENT ID:** S2-H0C +**START:** Immediately (no dependencies) + +**Assist Session 1 (Market Research):** +- **Extract competitor feature lists:** Scrape competitor websites for feature comparisons +- **Pricing table extraction:** Parse pricing pages into structured JSON +- **Market report parsing:** Extract key stats from industry reports (PDFs, web pages) +- **Automated data validation:** Cross-check data across multiple sources (detect conflicts) + +**Example:** +```yaml +# Session 1 Agent 5 researching competitors +# Agent 0C scrapes: +- YachtWorld: Features (inventory, CRM, MLS integration) +- Boat Trader: Pricing (€15/month basic, €45/month pro) +- Dockwa: User counts (50K+ marinas, 250K+ boaters) + +# Output: intelligence/session-2/competitor-data.json +{ + "yachtworld": { + "features": ["inventory", "crm", "mls_integration"], + "pricing": {"basic": 25, "pro": 75}, + "users": "unknown" + }, + "boat_trader": { + "features": ["listings", "leads", "analytics"], + "pricing": {"basic": 15, "pro": 45}, + "users": "unknown" + } +} +``` + +**Deliverable:** `intelligence/session-2/competitor-data.json` (structured data for Session 1 & 3 use) + +--- + +### Agent 0D: ROI Calculator Backend (Build Before Session 1 Data Arrives) +**AGENT ID:** S2-H0D +**START:** Immediately (no dependencies) + +**Build Generic ROI Calculator Framework:** +- **Formula engine:** Create generic calculator that accepts variables (warranty_savings, time_saved, resale_value_increase) +- **Input validation:** Define valid ranges for each variable (prevent unrealistic claims) +- **Visualization logic:** Prepare chart generation code (bar charts, pie charts for ROI breakdown) +- **Export functionality:** Generate PDF/Excel exports of ROI calculations + +**Example Structure:** +```javascript +// server/services/roi-calculator.service.js +class ROICalculator { + calculate(inputs) { + // inputs: { warranty_savings, time_saved_hours, resale_value_increase } + const warranty_roi = inputs.warranty_savings * 12; // Annual savings + const time_roi = inputs.time_saved_hours * 50; // €50/hour labor cost + const resale_roi = inputs.resale_value_increase; + + return { + total_annual_roi: warranty_roi + time_roi, + resale_value_lift: resale_roi, + payback_period_months: this.calculatePayback(inputs), + confidence: this.calculateConfidence(inputs) + }; + } +} +``` + +**When Session 1 Data Arrives:** +- Agent 0D plugs in Session 1 findings (€8K-€33K warranty savings, etc.) +- Generates final ROI report for Session 3 pitch deck +- Creates interactive calculator UI for demos + +**Deliverable:** `intelligence/session-2/roi-calculator-framework.js` (ready for Session 1 data integration) + +--- + ## Agent Identity & Check-In Protocol diff --git a/CLOUD_SESSION_4_IMPLEMENTATION_PLANNING.md b/CLOUD_SESSION_4_IMPLEMENTATION_PLANNING.md index 5705d4e..960ed88 100644 --- a/CLOUD_SESSION_4_IMPLEMENTATION_PLANNING.md +++ b/CLOUD_SESSION_4_IMPLEMENTATION_PLANNING.md @@ -35,6 +35,188 @@ Create detailed 4-week implementation plan with task breakdown, dependencies, ac --- +## PHASE 1: Helper Agents (START IMMEDIATELY - ASSIST SESSIONS 1 & 3) + +**Mission:** While waiting for Sessions 1+2+3 to complete, provide active project management assistance to accelerate overall coordination and quality. + +**NO DEPENDENCIES:** All Phase 1 agents can start immediately (no need to wait for other sessions). + +--- + +### Agent 0A: Research Coordination Dashboard (Real-Time Progress Tracking) +**AGENT ID:** S4-H0A +**START:** Immediately (no dependencies) + +**Assist Session 1 (Market Research):** +- **Agent progress tracker:** Monitor which of the 10 Session 1 agents have completed their tasks +- **Completion status:** Track deliverables (market-analysis.md, citations.json, handoff.md) +- **Blocker detection:** Identify if any Session 1 agent is stuck or over-budget +- **Estimated completion:** Predict when Session 1 will finish based on current progress + +**Dashboard Format:** +```yaml +# intelligence/session-4/coordination-dashboard.md +## Session 1 Progress (Updated Every 5 Minutes) + +Agents Complete: 7/10 (70%) +Deliverables: + - market-analysis.md: ✅ COMPLETE (87KB) + - session-1-citations.json: 🟡 IN PROGRESS (43 citations so far) + - session-1-handoff.md: ⏳ PENDING (Agent 10 not started) + +Estimated Completion: 15 minutes + +Blockers: None + +Token Usage: $12.50 / $15.00 budget (83% consumed) +``` + +**Deliverable:** `intelligence/session-4/coordination-dashboard.md` (updated every 5 minutes) + +--- + +### Agent 0B: Citation Quality Checker (Pre-Validate Before Session 5) +**AGENT ID:** S4-H0B +**START:** Immediately (no dependencies) + +**Assist Sessions 1, 2, 3 (Pre-Validation):** +- **Citation format check:** Validate citations follow IF.TTT schema (citation_id, sources, confidence_score, etc.) +- **Source quality scoring:** Verify primary sources have 9-10 credibility (not tertiary) +- **Multi-source validation:** Check high-confidence claims (≥0.9) have ≥2 sources +- **SHA-256 verification:** Confirm web URLs have SHA-256 hashes (tamper detection) +- **Broken link detection:** Test cited URLs return 200 OK (flag dead links) + +**Example Feedback:** +```yaml +# intelligence/session-4/citation-quality-report.md + +## Session 1 Citation Quality (43 citations reviewed) + +✅ PASS (38 citations): + - All have if://citation/ IDs + - Primary sources properly cited + - ≥2 sources for high-confidence claims + +⚠️ NEEDS ATTENTION (5 citations): + 1. Citation: if://citation/broker-count-med + - Issue: Single source (need 2nd corroboration) + - Recommendation: Add Boat International report as 2nd source + + 2. Citation: if://citation/warranty-costs + - Issue: Missing SHA-256 hash for YachtWorld URL + - Recommendation: Run `sha256sum` on fetched page + + 3. Citation: if://citation/competitor-pricing + - Issue: Confidence 0.95 but only secondary sources (credibility 7-8) + - Recommendation: Lower confidence to 0.85 or find primary source +``` + +**Value:** Catch citation issues BEFORE Session 5 validation (faster feedback loop, less rework) + +**Deliverable:** `intelligence/session-4/citation-quality-report.md` (updated as sessions work) + +--- + +### Agent 0C: Demo Script Structure (Outline Before Session 1 Data) +**AGENT ID:** S4-H0C +**START:** Immediately (no dependencies) + +**Assist Session 3 (UX/Sales):** +- **Demo flow research:** Study winning software demo structures (problem → solution → demo → ROI → close) +- **Generic outline:** Create demo script template with placeholders for Session 1 data +- **Objection handling:** Research common yacht owner objections (cost, complexity, time investment) +- **Storytelling techniques:** Identify emotional hooks (peace of mind, warranty savings, resale value) + +**Generic Demo Outline (Before Session 1 Data):** +```markdown +# NaviDocs Demo Script (Template) + +## 1. Opening (2 min) +- Pain point: "[INSERT: Top pain point from Session 1]" +- Teaser: "What if you could [INSERT: Key benefit]?" + +## 2. Problem Deep Dive (3 min) +- Story: "[INSERT: Yacht owner story from Session 1]" +- Consequence: "Without NaviDocs, owners lose [INSERT: €X savings]" + +## 3. Solution Demo (10 min) +- Feature 1: [INSERT: Sticky feature from Session 2] + - Show: [Screen recording placeholder] + - Benefit: [INSERT: Time/money saved] + +## 4. ROI Reveal (5 min) +- Calculator: [INSERT: Session 1 warranty savings data] +- Comparison: "That's [X] hours saved per year" + +## 5. Close (2 min) +- Call to action: "Sign up for Riviera pilot" +- Objection handling: [INSERT: Top 3 objections from Session 1] +``` + +**When Session 1 Completes:** +- Agent 0C fills placeholders with real data +- Creates final polished demo script for Session 3 + +**Deliverable:** `intelligence/session-4/demo-script-template.md` (ready for Session 1 data integration) + +--- + +### Agent 0D: Cross-Session Dependency Tracker (Visual Dependency Graph) +**AGENT ID:** S4-H0D +**START:** Immediately (no dependencies) + +**Monitor All Sessions:** +- **Dependency mapping:** Track which sessions are blocking others +- **Critical path visualization:** Identify slowest session (bottleneck) +- **Parallel work opportunities:** Find tasks that can run concurrently +- **Completion predictions:** Estimate when each session will finish + +**Dependency Graph (Mermaid):** +```mermaid +graph TD + S1[Session 1: Market Research] --> S2[Session 2: Technical Architecture] + S1 --> S3[Session 3: UX/Sales] + S2 --> S3 + S1 --> S4[Session 4: Implementation Planning] + S2 --> S4 + S3 --> S4 + S1 --> S5[Session 5: Guardian Validation] + S2 --> S5 + S3 --> S5 + S4 --> S5 + + style S1 fill:#90EE90 + style S2 fill:#FFD700 + style S3 fill:#FFD700 + style S4 fill:#FFD700 + style S5 fill:#87CEEB +``` + +**Real-Time Status Updates:** +```yaml +# intelligence/session-4/dependency-status.md + +## Critical Path Analysis (Updated Every 5 Minutes) + +SLOWEST SESSION: Session 1 (30 min remaining) + - Blocks: Session 2 (Agent 10), Session 3 (Agents 8-10), Session 4 (All agents) + +PARALLEL WORK: + - Session 2 Phase 1 agents (0A-0D): ✅ RUNNING (assisting Session 1) + - Session 5 Phase 1 agents (0A-0C): ✅ RUNNING (deploying standards) + +ESTIMATED COMPLETION: + - Session 1: 13:30 UTC + - Session 2: 14:00 UTC (depends on Session 1) + - Session 3: 14:30 UTC (depends on Sessions 1+2) + - Session 4: 15:00 UTC (depends on Sessions 1+2+3) + - Session 5: 16:30 UTC (depends on all) +``` + +**Deliverable:** `intelligence/session-4/dependency-status.md` (updated every 5 minutes with Mermaid graph) + +--- + ## Agent Identity & Check-In Protocol diff --git a/CLOUD_SESSION_5_SYNTHESIS_VALIDATION.md b/CLOUD_SESSION_5_SYNTHESIS_VALIDATION.md index e6f344a..304dbec 100644 --- a/CLOUD_SESSION_5_SYNTHESIS_VALIDATION.md +++ b/CLOUD_SESSION_5_SYNTHESIS_VALIDATION.md @@ -11,7 +11,9 @@ ## Mission Statement -Synthesize all intelligence from Sessions 1-4 into comprehensive dossier, validate claims with medical-grade evidence standards, achieve Guardian Council consensus (>90% approval), and deliver final presentation materials. +**Active Quality Assurance Partner (Immediate Start):** Deploy evidence quality standards, monitor Sessions 1-4 commits in real-time, provide continuous feedback to prevent rework. + +**Final Validation (When Sessions 1-4 Complete):** Synthesize all intelligence into comprehensive dossier, validate claims with medical-grade evidence standards, achieve Guardian Council consensus (>90% approval), and deliver final presentation materials. --- @@ -66,7 +68,187 @@ Each agent MUST: --- -## Your Tasks (Spawn 10 Haiku Agents in Parallel) +## Your Tasks (Spawn 13 Haiku Agents) + +**PHASE 1: Active Quality Assurance (START IMMEDIATELY - NO DEPENDENCIES)** + +### Agent 0A: Evidence Quality Standards Deployment +**AGENT ID:** S5-H0A +**PRIORITY:** CRITICAL - Deploy within first 10 minutes +** +**Create:** +- `EVIDENCE_QUALITY_STANDARDS.md` - Master reference for Sessions 1-4 + - **Citation format templates:** + ```json + { + "citation_id": "if://citation/warranty-savings-8k-33k", + "claim": "NaviDocs prevents €8K-€33K warranty losses per yacht", + "sources": [ + { + "type": "web", + "url": "https://yachtworld.com/research/yacht-ownership-costs-2024", + "sha256": "a1b2c3d4...", + "accessed": "2025-11-13", + "quality": "primary", + "credibility": 9 + }, + { + "type": "file", + "path": "intelligence/session-1/market-analysis.md", + "line_range": "45-67", + "quality": "primary" + } + ], + "status": "verified", + "confidence_score": 0.95 + } + ``` + - **IF.TTT compliance checklist:** + - [ ] ≥2 independent sources for high-confidence claims + - [ ] Web URLs include SHA-256 hash (tamper detection) + - [ ] File references include line numbers + - [ ] Citation ID follows if:// URI scheme + - [ ] Confidence score justified (0.0-1.0) + - [ ] Status tracked: unverified → verified → disputed → revoked + - **Evidence quality scoring:** + - Primary source (9-10 credibility): Original research, official statistics, codebase analysis + - Secondary source (7-8 credibility): Industry reports, competitor websites, expert interviews + - Tertiary source (5-6 credibility): Blog posts, forum discussions, anecdotal evidence + - Unverified (0-4 credibility): Claims without sources + - **Multi-source verification examples:** + - Market sizing claim: YachtWorld stats + Boat International report + - Technical claim: Codebase file:line + architecture doc + - Competitive claim: Competitor website + pricing screenshot + +**Commit to coordination branch:** +```bash +git add EVIDENCE_QUALITY_STANDARDS.md +git commit -m "Session 5: Evidence quality standards for Sessions 1-4" +git push origin navidocs-cloud-coordination +``` + +**Notify other sessions:** +- Update `AUTONOMOUS-COORDINATION-STATUS.md`: "✅ Evidence standards deployed - Sessions 1-4 reference EVIDENCE_QUALITY_STANDARDS.md" + +**Deliverable:** `EVIDENCE_QUALITY_STANDARDS.md` (Sessions 1-4 read this immediately) + +--- + +### Agent 0B: Real-Time Quality Monitor (CONTINUOUS) +**AGENT ID:** S5-H0B +**PRIORITY:** HIGH - Run every 5 minutes for entire session +** +**Monitor:** +- Poll `intelligence/session-1/`, `session-2/`, `session-3/`, `session-4/` for new commits +- Check git log every 5 minutes: + ```bash + git fetch origin navidocs-cloud-coordination + git log --since="5 minutes ago" --name-status -- intelligence/ + ``` + +**Review:** +- New citations: Are they IF.TTT compliant? (SHA-256 hashes, ≥2 sources, line numbers) +- Market claims: Do they cite credible sources? (not just "industry experts say...") +- Technical claims: Do they reference codebase? (file:line required) +- ROI calculations: Do they show work? (formulas + source data) + +**Feedback Loop:** +- Create `QUALITY_FEEDBACK.md` (updated every 5 minutes): + ```markdown + # Real-Time Quality Feedback (Updated: 2025-11-13 14:35 UTC) + + ## ✅ Session 1 (Good) + - Agent 2 citation: Excellent (2 primary sources, SHA-256 hashes included) + - Agent 3 market sizing: Good (YachtWorld + Boat International cited) + + ## ⚠️ Session 2 (Needs Attention) + - Agent 3 maintenance log claim: Missing line number reference + - Agent 6 accounting module: Only 1 source (need ≥2 for high confidence) + + ## 🔴 Session 3 (Action Required) + - Agent 5 ROI calculator: No source citations for €8K-€33K warranty claim + - Action: Review Session 1 market analysis, add citation links + + ## ✅ Session 4 (Good) + - Sprint plan: All tasks reference Session 2 architecture (file:line included) + ``` + +**Commit feedback every 5 minutes:** +```bash +git add QUALITY_FEEDBACK.md +git commit -m "Session 5: Quality feedback ($(date -Iseconds))" +git push origin navidocs-cloud-coordination +``` + +**Escalate if needed:** +- >20% of citations lack IF.TTT compliance → ESCALATE to Sonnet coordinator +- Sessions 1-4 read feedback, fix issues proactively (prevent rework at validation stage) + +**Deliverable:** `QUALITY_FEEDBACK.md` (updated every 5 minutes) + +--- + +### Agent 0C: Guardian Briefing Templates (PREP WORK) +**AGENT ID:** S5-H0C +** +**Create templates for final validation (ready when Sessions 1-4 complete):** + +1. **Guardian-Specific Briefing Template (20 guardians):** + ```markdown + # Guardian Briefing: [Guardian Name] + **Philosophy:** [Empiricism, Pragmatism, IF.sam Light/Dark, etc.] + **Focus Areas:** [What this guardian cares about most] + + ## Executive Summary + [Tailored to guardian's philosophy] + + ## Key Evidence + [Filtered to guardian's interests] + - Empiricism: Market research data, statistical evidence + - Pragmatism: ROI calculations, implementation feasibility + - IF.sam (Light): Ethical sales, transparency, user benefit + - IF.sam (Dark): Competitive advantage, revenue potential, market dominance + + ## Questions for This Guardian + [Anticipated concerns based on philosophy] + + ## Voting Criteria + - [ ] Evidence quality meets standards + - [ ] Claims aligned with guardian's values + - [ ] Implementation feasible + ``` + +2. **Consensus Prediction Formula:** + ```javascript + function predictConsensus(dossier) { + let approvalScore = 0; + // Evidence quality (40% weight) + approvalScore += dossier.verifiedCitations / dossier.totalCitations * 0.4; + // Multi-source verification (30% weight) + approvalScore += dossier.multiSourceClaims / dossier.totalClaims * 0.3; + // Implementation feasibility (20% weight) + approvalScore += dossier.feasibilityScore * 0.2; + // Guardian alignment (10% weight) + approvalScore += dossier.philosophyAlignment * 0.1; + + return approvalScore * 100; // Return as percentage + } + ``` + +3. **Voting Criteria Checklist:** + - [ ] All high-confidence claims have ≥2 sources + - [ ] Technical claims reference codebase (file:line) + - [ ] Market sizing backed by credible sources + - [ ] ROI calculations show work (formulas + data) + - [ ] Implementation timeline realistic (based on codebase complexity) + - [ ] Acceptance criteria testable + - [ ] No unverified claims in executive summary + +**Deliverable:** `GUARDIAN_BRIEFING_TEMPLATES/` directory with 20 templates + consensus formula + +--- + +**PHASE 2: Evidence Extraction & Validation (WAIT FOR SESSIONS 1-4)** ### Agent 1: Session 1 Evidence Extraction **AGENT ID:** S5-H01 diff --git a/EVIDENCE_QUALITY_STANDARDS.md b/EVIDENCE_QUALITY_STANDARDS.md new file mode 100644 index 0000000..5f3830a --- /dev/null +++ b/EVIDENCE_QUALITY_STANDARDS.md @@ -0,0 +1,343 @@ +# Evidence Quality Standards for NaviDocs Intelligence Sessions +**For:** Sessions 1, 2, 3, 4 (reference this document while working) +**Created by:** Session 5 Agent 0A +**Last Updated:** 2025-11-13 +**Status:** ACTIVE - All sessions must follow these standards + +--- + +## 🎯 Purpose + +Ensure all market research, technical claims, and business intelligence meet medical-grade evidence standards (IF.TTT: Traceable, Transparent, Trustworthy). + +**Why This Matters:** +- Guardian Council requires >90% consensus (18/20 votes) +- 100% consensus requires empirical validation + testable predictions +- Poor evidence quality = rework at validation stage (expensive) +- High-quality citations = faster Guardian approval = faster launch + +--- + +## 📋 Citation Format (IF.TTT Compliant) + +### **Template:** + +```json +{ + "citation_id": "if://citation/[unique-identifier]", + "claim": "[The specific claim being made]", + "sources": [ + { + "type": "web", + "url": "https://example.com/research", + "sha256": "a1b2c3d4e5f6...", + "accessed": "2025-11-13", + "quality": "primary", + "credibility": 9, + "excerpt": "[Relevant quote from source]" + }, + { + "type": "file", + "path": "intelligence/session-1/market-analysis.md", + "line_range": "45-67", + "quality": "primary", + "credibility": 9 + } + ], + "status": "verified", + "confidence_score": 0.95, + "verified_by": "S1-H02", + "verification_date": "2025-11-13" +} +``` + +### **Required Fields:** + +| Field | Required | Description | +|-------|----------|-------------| +| `citation_id` | ✅ YES | Unique ID following `if://citation/[identifier]` format | +| `claim` | ✅ YES | Exact claim being cited (1-2 sentences) | +| `sources` | ✅ YES | Array of ≥2 sources for high-confidence claims | +| `status` | ✅ YES | `unverified`, `verified`, `disputed`, or `revoked` | +| `confidence_score` | ✅ YES | 0.0-1.0 (justify based on source quality) | +| `verified_by` | ✅ YES | Agent ID (e.g., `S1-H02`) | +| `verification_date` | ✅ YES | ISO 8601 format | + +--- + +## 🔍 Evidence Quality Scoring + +### **Primary Sources (9-10 credibility):** +- Official government statistics (e.g., DGCCRF yacht registration data) +- Original research studies (peer-reviewed journals) +- Industry association reports (ECPY, Nautical Statistics) +- Codebase analysis (file:line references in NaviDocs repo) +- Direct interviews with verified experts (transcripts available) + +**Examples:** +- ✅ "YachtWorld 2024 Ownership Cost Report (PDF, 47 pages)" +- ✅ "NaviDocs codebase: `server/db/schema.sql:45-67`" +- ✅ "Boat International Annual Market Report 2024" + +### **Secondary Sources (7-8 credibility):** +- Industry news articles (Boat International, YachtWorld) +- Competitor websites (pricing pages, feature lists) +- Trade show presentations (documented with photos/slides) +- Expert blog posts (verified industry professionals) +- LinkedIn profiles (for market sizing claims) + +**Examples:** +- ✅ "Northrop & Johnson website pricing (screenshot + SHA-256 hash)" +- ✅ "Camper & Nicholsons feature comparison table" + +### **Tertiary Sources (5-6 credibility):** +- Forum discussions (YachtForums, The Hull Truth) +- Reddit threads (r/sailing, r/yachts) +- Anecdotal evidence ("broker told me...") +- Marketing materials (press releases, brochures) + +**Examples:** +- ⚠️ "YachtForums thread: 'What do yacht owners really need?'" +- ⚠️ Use only if ≥2 primary sources unavailable + +### **Unverified (0-4 credibility):** +- Claims without sources ("industry experts estimate...") +- Single-source claims (need ≥2 sources) +- Broken links (URL returns 404) +- Paywalled content (can't verify) + +**Examples:** +- ❌ "Experts say warranty claims cost €10K-€50K" (who? which experts?) +- ❌ Single YachtWorld article without corroboration + +--- + +## ✅ IF.TTT Compliance Checklist + +**Before committing any citation, verify:** + +- [ ] **≥2 independent sources** for high-confidence claims (confidence ≥0.9) +- [ ] **Web URLs include SHA-256 hash** (tamper detection via `sha256sum `) +- [ ] **File references include line numbers** (`intelligence/session-1/market-analysis.md:45-67`) +- [ ] **Citation ID follows if:// URI scheme** (`if://citation/warranty-savings-8k-33k`) +- [ ] **Confidence score justified** (0.9+ requires ≥2 primary sources) +- [ ] **Status tracked** (unverified → verified → disputed → revoked) +- [ ] **Agent ID recorded** (who verified this claim?) +- [ ] **Verification date recorded** (when was this verified?) + +--- + +## 📊 Multi-Source Verification Examples + +### **Example 1: Market Sizing Claim** + +**Claim:** "Mediterranean yacht brokerage market: 150-200 active brokers" + +**Good Citation (≥2 sources):** +```json +{ + "citation_id": "if://citation/mediterranean-broker-count", + "claim": "Mediterranean yacht brokerage market: 150-200 active brokers", + "sources": [ + { + "type": "web", + "url": "https://yachtworld.com/research/mediterranean-brokers-2024", + "sha256": "a1b2c3d4...", + "quality": "primary", + "credibility": 9, + "excerpt": "Our database shows 178 active yacht brokers in Mediterranean region" + }, + { + "type": "web", + "url": "https://boatinternational.com/market-analysis/2024", + "sha256": "e5f6g7h8...", + "quality": "primary", + "credibility": 9, + "excerpt": "Estimated 150-200 professional yacht brokers operating in Med" + } + ], + "status": "verified", + "confidence_score": 0.95 +} +``` + +### **Example 2: Technical Claim** + +**Claim:** "NaviDocs uses Express.js + SQLite for backend" + +**Good Citation (codebase reference):** +```json +{ + "citation_id": "if://citation/navidocs-tech-stack", + "claim": "NaviDocs uses Express.js + SQLite for backend", + "sources": [ + { + "type": "file", + "path": "server/index.js", + "line_range": "1-15", + "quality": "primary", + "credibility": 10, + "excerpt": "const express = require('express'); const sqlite3 = require('sqlite3');" + }, + { + "type": "file", + "path": "package.json", + "line_range": "12-18", + "quality": "primary", + "credibility": 10, + "excerpt": "dependencies: { express: ^4.18.0, sqlite3: ^5.1.0 }" + } + ], + "status": "verified", + "confidence_score": 1.0 +} +``` + +### **Example 3: Competitive Claim** + +**Claim:** "Competitor X charges €25/month for yacht management software" + +**Good Citation (competitor website + screenshot):** +```json +{ + "citation_id": "if://citation/competitor-x-pricing", + "claim": "Competitor X charges €25/month for yacht management software", + "sources": [ + { + "type": "web", + "url": "https://competitorx.com/pricing", + "sha256": "b2c3d4e5...", + "accessed": "2025-11-13", + "quality": "primary", + "credibility": 9, + "screenshot": "intelligence/session-1/screenshots/competitor-x-pricing.png" + }, + { + "type": "file", + "path": "intelligence/session-1/competitive-analysis.md", + "line_range": "120-125", + "quality": "secondary", + "credibility": 8, + "excerpt": "Competitor X pricing confirmed via website analysis" + } + ], + "status": "verified", + "confidence_score": 0.90 +} +``` + +--- + +## 🚨 Common Mistakes to Avoid + +### **❌ Bad: Single Source** +```json +{ + "claim": "Warranty claims cost €8K-€33K per yacht", + "sources": [ + { + "type": "web", + "url": "https://yachtworld.com/article", + "quality": "primary" + } + ], + "confidence_score": 0.95 // ❌ Can't claim 0.95 with single source! +} +``` + +### **✅ Good: Multiple Sources** +```json +{ + "claim": "Warranty claims cost €8K-€33K per yacht", + "sources": [ + { + "type": "web", + "url": "https://yachtworld.com/warranty-costs-2024", + "sha256": "a1b2...", + "credibility": 9 + }, + { + "type": "web", + "url": "https://boatinternational.com/ownership-costs", + "sha256": "c3d4...", + "credibility": 9 + } + ], + "confidence_score": 0.95 // ✅ Justified with ≥2 primary sources +} +``` + +--- + +## 🔄 Real-Time Quality Feedback Loop + +**Sessions 1-4: Check `QUALITY_FEEDBACK.md` every 5 minutes** + +Session 5 Agent 0B monitors your commits and provides real-time feedback: + +```markdown +## ⚠️ Session 2 (Needs Attention) +- Agent 3 maintenance log claim: Missing line number reference + - Claim: "NaviDocs tracks maintenance via BullMQ workers" + - Fix: Add file:line reference (e.g., `server/workers/maintenance.js:45-67`) + +## 🔴 Session 1 (Action Required) +- Agent 5 ROI calculator: No source citations for €8K-€33K warranty claim + - Fix: Add ≥2 sources (YachtWorld + Boat International reports) +``` + +**Action:** Read feedback → Fix issues → Commit → Continue working + +--- + +## 📈 Confidence Score Guidelines + +| Score | Sources Required | Quality Required | Use Case | +|-------|------------------|------------------|----------| +| 0.95-1.0 | ≥2 primary | Both 9-10 credibility | Market sizing, ROI calculations | +| 0.85-0.94 | ≥2 mixed | 1 primary + 1 secondary | Competitive analysis, feature claims | +| 0.70-0.84 | ≥1 primary | 7-10 credibility | Technical claims (if codebase verified) | +| 0.50-0.69 | ≥1 secondary | 5-8 credibility | Anecdotal evidence, forum discussions | +| <0.50 | Any | <5 credibility | Unverified claims (flag for review) | + +--- + +## 🎯 Guardian Council Expectations + +### **What Gets >90% Approval:** +- All high-confidence claims (≥0.9) have ≥2 primary sources +- Technical claims reference codebase with file:line +- Market sizing backed by official statistics or industry reports +- ROI calculations show work (formulas + source data visible) +- Implementation timeline realistic (validated against codebase complexity) + +### **What Gets <80% Approval (ESCALATED):** +- >20% of claims lack proper citations +- Single-source claims for critical market data +- Broken URLs or inaccessible sources +- Confidence scores not justified by source quality +- Unverified claims in executive summary + +--- + +## 📞 Need Help? + +**Questions about citation format?** +- Check `schemas/citation/v1.0.schema.json` (JSON schema reference) +- Review Session 5 examples in `CLOUD_SESSION_5_SYNTHESIS_VALIDATION.md` + +**Quality feedback unclear?** +- Check `QUALITY_FEEDBACK.md` (updated every 5 minutes by Agent 0B) +- ESCALATE to Sonnet coordinator if blocked + +**Citation tool available:** +```bash +# Validate citation JSON against schema +python tools/citation_validate.py citations/session-1-citations.json +``` + +--- + +**Remember: High-quality evidence now = Faster Guardian approval later = Faster launch!** + +🚀 Generated with [Claude Code](https://claude.com/claude-code)