Merge branch 'navidocs-cloud-coordination' of http://127.0.0.1:61061/git/dannystocker/navidocs into claude/navidocs-cloud-coordination-011CV53B2oMH6VqjaePrFZgb

This commit is contained in:
Claude 2025-11-13 02:33:55 +00:00
commit a89ee4995c
No known key found for this signature in database
5 changed files with 938 additions and 37 deletions

View file

@ -10,14 +10,16 @@
| Session | Agent ID Range | Status | Progress | Outputs |
|---------|---------------|--------|----------|---------|
| Session 1 | S1-H01 to S1-H10 | 🟡 READY | 0/10 agents | `intelligence/session-1/` |
| Session 2 | S2-H01 to S2-H10 | 🟡 READY | 0/10 agents | `intelligence/session-2/` |
| Session 2 | S2-H0A to S2-H10 | 🟢 ACTIVE (Helper) | 4/15 agents | `intelligence/session-2/` |
| Session 3 | S3-H01 to S3-H10 | 🟡 READY | 0/10 agents | `intelligence/session-3/` |
| Session 4 | S4-H01 to S4-H10 | 🟡 READY | 0/10 agents | `intelligence/session-4/` |
| Session 5 | S5-H01 to S5-H10 | 🟡 READY | 0/20 guardians | `intelligence/session-5/` |
| Session 4 | S4-H0A to S4-H10 | 🟢 ACTIVE (Helper) | 4/14 agents | `intelligence/session-4/` |
| Session 5 | S5-H0A to S5-H10 | 🟢 ACTIVE (QA) | 3/13 agents | `intelligence/session-5/` |
**Status Legend:**
- 🟡 READY - Session initialized, waiting to start
- 🟢 ACTIVE - Session running, agents working
- 🟢 ACTIVE (Helper) - Session assisting other sessions while waiting
- 🟢 ACTIVE (QA) - Session providing quality assurance support
- 🔵 BLOCKED - Waiting for dependencies from other sessions
- ✅ COMPLETE - All outputs delivered
@ -49,20 +51,37 @@ Next: Session 2 unblocked
---
### **Session 2: Technical Integration**
### **Session 2: Technical Integration + Helper Agents**
**Current Task:** Codebase analysis + sticky feature architecture
**Current Task:** PHASE 1 - Helper agents assisting Sessions 1 & 3 (START NOW)
**Instructions:**
1. **First:** Agent 1 (S2-H01) analyzes NaviDocs codebase (NO dependency on Session 1)
2. **Parallel:** Agents 2-9 research technical solutions (inventory, cameras, maintenance, etc.)
3. **When Session 1 completes:** Poll for `intelligence/session-1/session-1-handoff.md`
4. **Then:** Agent 10 synthesizes with Session 1 pain point priorities
5. Output to `intelligence/session-2/` (architecture.md, integration-specs.md, handoff.md)
**IMMEDIATE ACTIONS (Agents 0A, 0B, 0C, 0D):**
1. **Agent 0A (CONTINUOUS):** Technical validation assistant
- Verify competitor tech stacks for Session 1
- Check API availability (YachtWorld, Boat Trader, etc.)
- Validate technical feasibility of Session 3 UX proposals
2. **Agent 0B (CONTINUOUS):** Citation automation
- Generate SHA-256 hashes for Session 1 web URLs
- Verify URL accessibility (flag broken links)
- Auto-generate IF.TTT-compliant citation JSON
3. **Agent 0C (CONTINUOUS):** Web scraping assistant
- Extract competitor feature lists, pricing tables
- Parse industry reports for Session 1
- Create structured competitor-data.json
4. **Agent 0D (PREP WORK):** ROI calculator backend
- Build generic calculator framework NOW
- Ready to plug in Session 1 data when it arrives
- Prepare chart generation and export functionality
**PHASE 2 - Technical Architecture (WAIT FOR SESSION 1):**
5. **When Session 1 completes:** Poll for `intelligence/session-1/session-1-handoff.md`
6. **Agents 1-9 + 3A + 7A:** Codebase analysis + feature design (11 agents)
7. **Agent 10:** Synthesize with Session 1 pain point priorities
8. Output to `intelligence/session-2/` (architecture.md, integration-specs.md, handoff.md)
**Dependencies:**
- **Agent 1 task:** NONE (start immediately)
- **Agent 10 synthesis:** Session 1 complete
- **Agents 0A-0D:** NONE (start immediately - assist Sessions 1 & 3)
- **Agents 1-9 + 3A + 7A + Agent 10:** Session 1 complete
**Polling Command:**
```bash
@ -103,20 +122,41 @@ fi
---
### **Session 4: Implementation Planning**
### **Session 4: Implementation Planning + Helper Agents**
**Current Task:** Generic sprint planning + roadmap templates
**Current Task:** PHASE 1 - Helper agents providing project management support (START NOW)
**Instructions:**
1. **Parallel:** Week agents research sprint best practices, roadmap templates
2. **When Sessions 1+2+3 complete:** Poll for handoff files
3. **Then:** Week agents create detailed sprint plans with feature priorities from Session 1
4. **Finally:** Agent 10 creates integrated 4-week roadmap
5. Output to `intelligence/session-4/` (sprint-plan.md, roadmap.md, handoff.md)
**IMMEDIATE ACTIONS (Agents 0A, 0B, 0C, 0D):**
1. **Agent 0A (CONTINUOUS - Every 5 minutes):** Research coordination dashboard
- Track Session 1 agent progress (7/10 complete, etc.)
- Monitor deliverable status (market-analysis.md: COMPLETE, etc.)
- Predict completion times for all sessions
- Detect blockers early (agent stuck, over-budget, etc.)
2. **Agent 0B (CONTINUOUS):** Citation quality checker
- Pre-validate Session 1, 2, 3 citations BEFORE Session 5
- Check IF.TTT compliance (citation_id, sources, confidence, SHA-256)
- Flag issues immediately (single-source claims, missing hashes, etc.)
- Faster feedback loop than waiting for Session 5 validation
3. **Agent 0C (PREP WORK):** Demo script structure
- Research winning demo flows (problem → solution → demo → ROI → close)
- Create generic template with placeholders for Session 1 data
- Research yacht owner objections (cost, complexity, time)
- Ready to fill with Session 1 findings when available
4. **Agent 0D (CONTINUOUS - Every 5 minutes):** Cross-session dependency tracker
- Visual dependency graph (Mermaid) of all sessions
- Critical path identification (slowest session = bottleneck)
- Real-time completion predictions
- Parallel work opportunity detection
**PHASE 2 - Implementation Planning (WAIT FOR SESSIONS 1+2+3):**
5. **When Sessions 1+2+3 complete:** Poll for handoff files
6. **Agents 1-9:** Create detailed 4-week sprint plan with feature priorities
7. **Agent 10:** Synthesize integrated roadmap
8. Output to `intelligence/session-4/` (sprint-plan.md, roadmap.md, handoff.md)
**Dependencies:**
- **Week 1-4 generic planning:** NONE (start immediately)
- **Detailed feature breakdown:** Sessions 1+2+3 complete
- **Agents 0A-0D:** NONE (start immediately - assist all sessions)
- **Agents 1-10:** Sessions 1+2+3 complete
**Polling Command:**
```bash
@ -131,22 +171,36 @@ fi
---
### **Session 5: Guardian Validation**
### **Session 5: Guardian Validation + Active Quality Assurance**
**Current Task:** Guardian methodology review + evaluation criteria prep
**Current Task:** PHASE 1 - Active QA Partner (NO DEPENDENCIES - START NOW)
**Instructions:**
1. **Parallel:** Guardians 1-12 + IF.sam facets review IF.TTT framework, prepare evaluation criteria
2. **When Sessions 1+2+3+4 complete:** Poll for handoff files
3. **Then:** Guardians review complete intelligence dossier
4. **IF.sam Debate:** 8 facets debate findings (Light Side vs Dark Side)
5. **Vote:** Agent 10 tallies consensus (need >80% approval)
6. **ESCALATE:** If <80%, flag for human review
7. Output to `intelligence/session-5/` (complete-intelligence-dossier.md, guardian-vote.md, consensus-report.md)
**IMMEDIATE ACTIONS (Agents 0A, 0B, 0C):**
1. **Agent 0A (CRITICAL - First 10 minutes):** Deploy `EVIDENCE_QUALITY_STANDARDS.md`
- Citation format templates (IF.TTT compliance)
- Evidence quality scoring rubric (primary/secondary/tertiary sources)
- Multi-source verification examples
- Commit to coordination branch → Sessions 1-4 read immediately
2. **Agent 0B (CONTINUOUS - Every 5 minutes):** Real-time quality monitoring
- Poll `intelligence/session-*/` for new commits
- Review citations for IF.TTT compliance
- Create `QUALITY_FEEDBACK.md` (updated every 5 minutes)
- Sessions 1-4 read feedback → fix issues proactively
3. **Agent 0C (PREP WORK):** Guardian briefing templates
- Create 20 guardian-specific briefing templates
- Consensus prediction formula
- Voting criteria checklists
**PHASE 2 - Final Validation (WAIT FOR SESSIONS 1+2+3+4):**
4. **When Sessions 1+2+3+4 complete:** Poll for handoff files
5. **Agents 1-9:** Extract evidence, validate claims, compile citations
6. **Agent 10:** Guardian Council vote (need >80% consensus)
7. **ESCALATE:** If <80% approval, flag for human review
8. Output to `intelligence/session-5/` (complete-intelligence-dossier.md, guardian-vote.md)
**Dependencies:**
- **Methodology review:** NONE (start immediately)
- **Dossier validation:** Sessions 1+2+3+4 complete
- **Agent 0A, 0B, 0C:** NONE (start immediately - assist Sessions 1-4)
- **Agents 1-10:** Sessions 1+2+3+4 complete
**Polling Command:**
```bash

View file

@ -44,6 +44,146 @@ Design technical architecture for **sticky daily-use features** (inventory track
---
## PHASE 1: Helper Agents (START IMMEDIATELY - ASSIST SESSIONS 1 & 3)
**Mission:** While waiting for Session 1 market research and Session 3 UX/sales work, provide active technical assistance to accelerate their completion.
**NO DEPENDENCIES:** All Phase 1 agents can start immediately (no need to wait for other sessions).
---
### Agent 0A: Technical Validation Assistant (CRITICAL)
**AGENT ID:** S2-H0A
**START:** Immediately (no dependencies)
**Assist Session 1 (Market Research):**
- **Verify competitor tech stacks:** When Session 1 identifies competitors (YachtWorld, Boat Trader, etc.), validate their API documentation
- **Check API availability:** Research if competitor APIs are public, require auth, or need partner agreements
- **Document API contracts:** Create structured summaries of competitor API capabilities for Session 1 citations
**Assist Session 3 (UX/Sales):**
- **Technical feasibility checks:** When Session 3 proposes UI features, confirm if NaviDocs tech stack supports them
- **Performance estimates:** Provide load time estimates for proposed features (based on NaviDocs current architecture)
- **Mobile compatibility:** Validate if proposed UX patterns work on mobile (Vue 3 + PWA constraints)
**Deliverable:** `intelligence/session-2/helper-technical-validations.md` (updated in real-time as Sessions 1 & 3 work)
---
### Agent 0B: Citation Automation (SHA-256 Hash Generation)
**AGENT ID:** S2-H0B
**START:** Immediately (no dependencies)
**Assist Session 1 (Market Research):**
- **Generate SHA-256 hashes:** When Session 1 cites web URLs, automatically fetch and hash content
- **Verify URL accessibility:** Check if cited URLs return 200 OK (flag broken links immediately)
- **Extract structured data:** Parse web pages for key data points (pricing, features, market stats)
- **Create citation JSON:** Auto-generate IF.TTT-compliant citation entries with hashes
**Example Workflow:**
```bash
# Session 1 Agent 3 cites: "YachtWorld pricing: €25/month"
# Agent 0B immediately:
1. Fetches https://yachtworld.com/pricing
2. Generates SHA-256: a1b2c3d4e5f6...
3. Extracts pricing table
4. Creates citation JSON:
{
"citation_id": "if://citation/yachtworld-pricing-nov2025",
"claim": "YachtWorld charges €25/month for yacht management",
"sources": [{
"type": "web",
"url": "https://yachtworld.com/pricing",
"sha256": "a1b2c3d4e5f6...",
"accessed": "2025-11-13",
"quality": "primary",
"credibility": 9
}],
"status": "verified",
"confidence_score": 0.95
}
```
**Deliverable:** `intelligence/session-2/auto-citations.json` (append as Session 1 works)
---
### Agent 0C: Web Scraping Assistant (Structured Data Extraction)
**AGENT ID:** S2-H0C
**START:** Immediately (no dependencies)
**Assist Session 1 (Market Research):**
- **Extract competitor feature lists:** Scrape competitor websites for feature comparisons
- **Pricing table extraction:** Parse pricing pages into structured JSON
- **Market report parsing:** Extract key stats from industry reports (PDFs, web pages)
- **Automated data validation:** Cross-check data across multiple sources (detect conflicts)
**Example:**
```yaml
# Session 1 Agent 5 researching competitors
# Agent 0C scrapes:
- YachtWorld: Features (inventory, CRM, MLS integration)
- Boat Trader: Pricing (€15/month basic, €45/month pro)
- Dockwa: User counts (50K+ marinas, 250K+ boaters)
# Output: intelligence/session-2/competitor-data.json
{
"yachtworld": {
"features": ["inventory", "crm", "mls_integration"],
"pricing": {"basic": 25, "pro": 75},
"users": "unknown"
},
"boat_trader": {
"features": ["listings", "leads", "analytics"],
"pricing": {"basic": 15, "pro": 45},
"users": "unknown"
}
}
```
**Deliverable:** `intelligence/session-2/competitor-data.json` (structured data for Session 1 & 3 use)
---
### Agent 0D: ROI Calculator Backend (Build Before Session 1 Data Arrives)
**AGENT ID:** S2-H0D
**START:** Immediately (no dependencies)
**Build Generic ROI Calculator Framework:**
- **Formula engine:** Create generic calculator that accepts variables (warranty_savings, time_saved, resale_value_increase)
- **Input validation:** Define valid ranges for each variable (prevent unrealistic claims)
- **Visualization logic:** Prepare chart generation code (bar charts, pie charts for ROI breakdown)
- **Export functionality:** Generate PDF/Excel exports of ROI calculations
**Example Structure:**
```javascript
// server/services/roi-calculator.service.js
class ROICalculator {
calculate(inputs) {
// inputs: { warranty_savings, time_saved_hours, resale_value_increase }
const warranty_roi = inputs.warranty_savings * 12; // Annual savings
const time_roi = inputs.time_saved_hours * 50; // €50/hour labor cost
const resale_roi = inputs.resale_value_increase;
return {
total_annual_roi: warranty_roi + time_roi,
resale_value_lift: resale_roi,
payback_period_months: this.calculatePayback(inputs),
confidence: this.calculateConfidence(inputs)
};
}
}
```
**When Session 1 Data Arrives:**
- Agent 0D plugs in Session 1 findings (€8K-€33K warranty savings, etc.)
- Generates final ROI report for Session 3 pitch deck
- Creates interactive calculator UI for demos
**Deliverable:** `intelligence/session-2/roi-calculator-framework.js` (ready for Session 1 data integration)
---
## Agent Identity & Check-In Protocol

View file

@ -35,6 +35,188 @@ Create detailed 4-week implementation plan with task breakdown, dependencies, ac
---
## PHASE 1: Helper Agents (START IMMEDIATELY - ASSIST SESSIONS 1 & 3)
**Mission:** While waiting for Sessions 1+2+3 to complete, provide active project management assistance to accelerate overall coordination and quality.
**NO DEPENDENCIES:** All Phase 1 agents can start immediately (no need to wait for other sessions).
---
### Agent 0A: Research Coordination Dashboard (Real-Time Progress Tracking)
**AGENT ID:** S4-H0A
**START:** Immediately (no dependencies)
**Assist Session 1 (Market Research):**
- **Agent progress tracker:** Monitor which of the 10 Session 1 agents have completed their tasks
- **Completion status:** Track deliverables (market-analysis.md, citations.json, handoff.md)
- **Blocker detection:** Identify if any Session 1 agent is stuck or over-budget
- **Estimated completion:** Predict when Session 1 will finish based on current progress
**Dashboard Format:**
```yaml
# intelligence/session-4/coordination-dashboard.md
## Session 1 Progress (Updated Every 5 Minutes)
Agents Complete: 7/10 (70%)
Deliverables:
- market-analysis.md: ✅ COMPLETE (87KB)
- session-1-citations.json: 🟡 IN PROGRESS (43 citations so far)
- session-1-handoff.md: ⏳ PENDING (Agent 10 not started)
Estimated Completion: 15 minutes
Blockers: None
Token Usage: $12.50 / $15.00 budget (83% consumed)
```
**Deliverable:** `intelligence/session-4/coordination-dashboard.md` (updated every 5 minutes)
---
### Agent 0B: Citation Quality Checker (Pre-Validate Before Session 5)
**AGENT ID:** S4-H0B
**START:** Immediately (no dependencies)
**Assist Sessions 1, 2, 3 (Pre-Validation):**
- **Citation format check:** Validate citations follow IF.TTT schema (citation_id, sources, confidence_score, etc.)
- **Source quality scoring:** Verify primary sources have 9-10 credibility (not tertiary)
- **Multi-source validation:** Check high-confidence claims (≥0.9) have ≥2 sources
- **SHA-256 verification:** Confirm web URLs have SHA-256 hashes (tamper detection)
- **Broken link detection:** Test cited URLs return 200 OK (flag dead links)
**Example Feedback:**
```yaml
# intelligence/session-4/citation-quality-report.md
## Session 1 Citation Quality (43 citations reviewed)
✅ PASS (38 citations):
- All have if://citation/ IDs
- Primary sources properly cited
- ≥2 sources for high-confidence claims
⚠️ NEEDS ATTENTION (5 citations):
1. Citation: if://citation/broker-count-med
- Issue: Single source (need 2nd corroboration)
- Recommendation: Add Boat International report as 2nd source
2. Citation: if://citation/warranty-costs
- Issue: Missing SHA-256 hash for YachtWorld URL
- Recommendation: Run `sha256sum` on fetched page
3. Citation: if://citation/competitor-pricing
- Issue: Confidence 0.95 but only secondary sources (credibility 7-8)
- Recommendation: Lower confidence to 0.85 or find primary source
```
**Value:** Catch citation issues BEFORE Session 5 validation (faster feedback loop, less rework)
**Deliverable:** `intelligence/session-4/citation-quality-report.md` (updated as sessions work)
---
### Agent 0C: Demo Script Structure (Outline Before Session 1 Data)
**AGENT ID:** S4-H0C
**START:** Immediately (no dependencies)
**Assist Session 3 (UX/Sales):**
- **Demo flow research:** Study winning software demo structures (problem → solution → demo → ROI → close)
- **Generic outline:** Create demo script template with placeholders for Session 1 data
- **Objection handling:** Research common yacht owner objections (cost, complexity, time investment)
- **Storytelling techniques:** Identify emotional hooks (peace of mind, warranty savings, resale value)
**Generic Demo Outline (Before Session 1 Data):**
```markdown
# NaviDocs Demo Script (Template)
## 1. Opening (2 min)
- Pain point: "[INSERT: Top pain point from Session 1]"
- Teaser: "What if you could [INSERT: Key benefit]?"
## 2. Problem Deep Dive (3 min)
- Story: "[INSERT: Yacht owner story from Session 1]"
- Consequence: "Without NaviDocs, owners lose [INSERT: €X savings]"
## 3. Solution Demo (10 min)
- Feature 1: [INSERT: Sticky feature from Session 2]
- Show: [Screen recording placeholder]
- Benefit: [INSERT: Time/money saved]
## 4. ROI Reveal (5 min)
- Calculator: [INSERT: Session 1 warranty savings data]
- Comparison: "That's [X] hours saved per year"
## 5. Close (2 min)
- Call to action: "Sign up for Riviera pilot"
- Objection handling: [INSERT: Top 3 objections from Session 1]
```
**When Session 1 Completes:**
- Agent 0C fills placeholders with real data
- Creates final polished demo script for Session 3
**Deliverable:** `intelligence/session-4/demo-script-template.md` (ready for Session 1 data integration)
---
### Agent 0D: Cross-Session Dependency Tracker (Visual Dependency Graph)
**AGENT ID:** S4-H0D
**START:** Immediately (no dependencies)
**Monitor All Sessions:**
- **Dependency mapping:** Track which sessions are blocking others
- **Critical path visualization:** Identify slowest session (bottleneck)
- **Parallel work opportunities:** Find tasks that can run concurrently
- **Completion predictions:** Estimate when each session will finish
**Dependency Graph (Mermaid):**
```mermaid
graph TD
S1[Session 1: Market Research] --> S2[Session 2: Technical Architecture]
S1 --> S3[Session 3: UX/Sales]
S2 --> S3
S1 --> S4[Session 4: Implementation Planning]
S2 --> S4
S3 --> S4
S1 --> S5[Session 5: Guardian Validation]
S2 --> S5
S3 --> S5
S4 --> S5
style S1 fill:#90EE90
style S2 fill:#FFD700
style S3 fill:#FFD700
style S4 fill:#FFD700
style S5 fill:#87CEEB
```
**Real-Time Status Updates:**
```yaml
# intelligence/session-4/dependency-status.md
## Critical Path Analysis (Updated Every 5 Minutes)
SLOWEST SESSION: Session 1 (30 min remaining)
- Blocks: Session 2 (Agent 10), Session 3 (Agents 8-10), Session 4 (All agents)
PARALLEL WORK:
- Session 2 Phase 1 agents (0A-0D): ✅ RUNNING (assisting Session 1)
- Session 5 Phase 1 agents (0A-0C): ✅ RUNNING (deploying standards)
ESTIMATED COMPLETION:
- Session 1: 13:30 UTC
- Session 2: 14:00 UTC (depends on Session 1)
- Session 3: 14:30 UTC (depends on Sessions 1+2)
- Session 4: 15:00 UTC (depends on Sessions 1+2+3)
- Session 5: 16:30 UTC (depends on all)
```
**Deliverable:** `intelligence/session-4/dependency-status.md` (updated every 5 minutes with Mermaid graph)
---
## Agent Identity & Check-In Protocol

View file

@ -11,7 +11,9 @@
## Mission Statement
Synthesize all intelligence from Sessions 1-4 into comprehensive dossier, validate claims with medical-grade evidence standards, achieve Guardian Council consensus (>90% approval), and deliver final presentation materials.
**Active Quality Assurance Partner (Immediate Start):** Deploy evidence quality standards, monitor Sessions 1-4 commits in real-time, provide continuous feedback to prevent rework.
**Final Validation (When Sessions 1-4 Complete):** Synthesize all intelligence into comprehensive dossier, validate claims with medical-grade evidence standards, achieve Guardian Council consensus (>90% approval), and deliver final presentation materials.
---
@ -66,7 +68,187 @@ Each agent MUST:
---
## Your Tasks (Spawn 10 Haiku Agents in Parallel)
## Your Tasks (Spawn 13 Haiku Agents)
**PHASE 1: Active Quality Assurance (START IMMEDIATELY - NO DEPENDENCIES)**
### Agent 0A: Evidence Quality Standards Deployment
**AGENT ID:** S5-H0A
**PRIORITY:** CRITICAL - Deploy within first 10 minutes
**
**Create:**
- `EVIDENCE_QUALITY_STANDARDS.md` - Master reference for Sessions 1-4
- **Citation format templates:**
```json
{
"citation_id": "if://citation/warranty-savings-8k-33k",
"claim": "NaviDocs prevents €8K-€33K warranty losses per yacht",
"sources": [
{
"type": "web",
"url": "https://yachtworld.com/research/yacht-ownership-costs-2024",
"sha256": "a1b2c3d4...",
"accessed": "2025-11-13",
"quality": "primary",
"credibility": 9
},
{
"type": "file",
"path": "intelligence/session-1/market-analysis.md",
"line_range": "45-67",
"quality": "primary"
}
],
"status": "verified",
"confidence_score": 0.95
}
```
- **IF.TTT compliance checklist:**
- [ ] ≥2 independent sources for high-confidence claims
- [ ] Web URLs include SHA-256 hash (tamper detection)
- [ ] File references include line numbers
- [ ] Citation ID follows if:// URI scheme
- [ ] Confidence score justified (0.0-1.0)
- [ ] Status tracked: unverified → verified → disputed → revoked
- **Evidence quality scoring:**
- Primary source (9-10 credibility): Original research, official statistics, codebase analysis
- Secondary source (7-8 credibility): Industry reports, competitor websites, expert interviews
- Tertiary source (5-6 credibility): Blog posts, forum discussions, anecdotal evidence
- Unverified (0-4 credibility): Claims without sources
- **Multi-source verification examples:**
- Market sizing claim: YachtWorld stats + Boat International report
- Technical claim: Codebase file:line + architecture doc
- Competitive claim: Competitor website + pricing screenshot
**Commit to coordination branch:**
```bash
git add EVIDENCE_QUALITY_STANDARDS.md
git commit -m "Session 5: Evidence quality standards for Sessions 1-4"
git push origin navidocs-cloud-coordination
```
**Notify other sessions:**
- Update `AUTONOMOUS-COORDINATION-STATUS.md`: "✅ Evidence standards deployed - Sessions 1-4 reference EVIDENCE_QUALITY_STANDARDS.md"
**Deliverable:** `EVIDENCE_QUALITY_STANDARDS.md` (Sessions 1-4 read this immediately)
---
### Agent 0B: Real-Time Quality Monitor (CONTINUOUS)
**AGENT ID:** S5-H0B
**PRIORITY:** HIGH - Run every 5 minutes for entire session
**
**Monitor:**
- Poll `intelligence/session-1/`, `session-2/`, `session-3/`, `session-4/` for new commits
- Check git log every 5 minutes:
```bash
git fetch origin navidocs-cloud-coordination
git log --since="5 minutes ago" --name-status -- intelligence/
```
**Review:**
- New citations: Are they IF.TTT compliant? (SHA-256 hashes, ≥2 sources, line numbers)
- Market claims: Do they cite credible sources? (not just "industry experts say...")
- Technical claims: Do they reference codebase? (file:line required)
- ROI calculations: Do they show work? (formulas + source data)
**Feedback Loop:**
- Create `QUALITY_FEEDBACK.md` (updated every 5 minutes):
```markdown
# Real-Time Quality Feedback (Updated: 2025-11-13 14:35 UTC)
## ✅ Session 1 (Good)
- Agent 2 citation: Excellent (2 primary sources, SHA-256 hashes included)
- Agent 3 market sizing: Good (YachtWorld + Boat International cited)
## ⚠️ Session 2 (Needs Attention)
- Agent 3 maintenance log claim: Missing line number reference
- Agent 6 accounting module: Only 1 source (need ≥2 for high confidence)
## 🔴 Session 3 (Action Required)
- Agent 5 ROI calculator: No source citations for €8K-€33K warranty claim
- Action: Review Session 1 market analysis, add citation links
## ✅ Session 4 (Good)
- Sprint plan: All tasks reference Session 2 architecture (file:line included)
```
**Commit feedback every 5 minutes:**
```bash
git add QUALITY_FEEDBACK.md
git commit -m "Session 5: Quality feedback ($(date -Iseconds))"
git push origin navidocs-cloud-coordination
```
**Escalate if needed:**
- >20% of citations lack IF.TTT compliance → ESCALATE to Sonnet coordinator
- Sessions 1-4 read feedback, fix issues proactively (prevent rework at validation stage)
**Deliverable:** `QUALITY_FEEDBACK.md` (updated every 5 minutes)
---
### Agent 0C: Guardian Briefing Templates (PREP WORK)
**AGENT ID:** S5-H0C
**
**Create templates for final validation (ready when Sessions 1-4 complete):**
1. **Guardian-Specific Briefing Template (20 guardians):**
```markdown
# Guardian Briefing: [Guardian Name]
**Philosophy:** [Empiricism, Pragmatism, IF.sam Light/Dark, etc.]
**Focus Areas:** [What this guardian cares about most]
## Executive Summary
[Tailored to guardian's philosophy]
## Key Evidence
[Filtered to guardian's interests]
- Empiricism: Market research data, statistical evidence
- Pragmatism: ROI calculations, implementation feasibility
- IF.sam (Light): Ethical sales, transparency, user benefit
- IF.sam (Dark): Competitive advantage, revenue potential, market dominance
## Questions for This Guardian
[Anticipated concerns based on philosophy]
## Voting Criteria
- [ ] Evidence quality meets standards
- [ ] Claims aligned with guardian's values
- [ ] Implementation feasible
```
2. **Consensus Prediction Formula:**
```javascript
function predictConsensus(dossier) {
let approvalScore = 0;
// Evidence quality (40% weight)
approvalScore += dossier.verifiedCitations / dossier.totalCitations * 0.4;
// Multi-source verification (30% weight)
approvalScore += dossier.multiSourceClaims / dossier.totalClaims * 0.3;
// Implementation feasibility (20% weight)
approvalScore += dossier.feasibilityScore * 0.2;
// Guardian alignment (10% weight)
approvalScore += dossier.philosophyAlignment * 0.1;
return approvalScore * 100; // Return as percentage
}
```
3. **Voting Criteria Checklist:**
- [ ] All high-confidence claims have ≥2 sources
- [ ] Technical claims reference codebase (file:line)
- [ ] Market sizing backed by credible sources
- [ ] ROI calculations show work (formulas + data)
- [ ] Implementation timeline realistic (based on codebase complexity)
- [ ] Acceptance criteria testable
- [ ] No unverified claims in executive summary
**Deliverable:** `GUARDIAN_BRIEFING_TEMPLATES/` directory with 20 templates + consensus formula
---
**PHASE 2: Evidence Extraction & Validation (WAIT FOR SESSIONS 1-4)**
### Agent 1: Session 1 Evidence Extraction
**AGENT ID:** S5-H01

View file

@ -0,0 +1,343 @@
# Evidence Quality Standards for NaviDocs Intelligence Sessions
**For:** Sessions 1, 2, 3, 4 (reference this document while working)
**Created by:** Session 5 Agent 0A
**Last Updated:** 2025-11-13
**Status:** ACTIVE - All sessions must follow these standards
---
## 🎯 Purpose
Ensure all market research, technical claims, and business intelligence meet medical-grade evidence standards (IF.TTT: Traceable, Transparent, Trustworthy).
**Why This Matters:**
- Guardian Council requires >90% consensus (18/20 votes)
- 100% consensus requires empirical validation + testable predictions
- Poor evidence quality = rework at validation stage (expensive)
- High-quality citations = faster Guardian approval = faster launch
---
## 📋 Citation Format (IF.TTT Compliant)
### **Template:**
```json
{
"citation_id": "if://citation/[unique-identifier]",
"claim": "[The specific claim being made]",
"sources": [
{
"type": "web",
"url": "https://example.com/research",
"sha256": "a1b2c3d4e5f6...",
"accessed": "2025-11-13",
"quality": "primary",
"credibility": 9,
"excerpt": "[Relevant quote from source]"
},
{
"type": "file",
"path": "intelligence/session-1/market-analysis.md",
"line_range": "45-67",
"quality": "primary",
"credibility": 9
}
],
"status": "verified",
"confidence_score": 0.95,
"verified_by": "S1-H02",
"verification_date": "2025-11-13"
}
```
### **Required Fields:**
| Field | Required | Description |
|-------|----------|-------------|
| `citation_id` | ✅ YES | Unique ID following `if://citation/[identifier]` format |
| `claim` | ✅ YES | Exact claim being cited (1-2 sentences) |
| `sources` | ✅ YES | Array of ≥2 sources for high-confidence claims |
| `status` | ✅ YES | `unverified`, `verified`, `disputed`, or `revoked` |
| `confidence_score` | ✅ YES | 0.0-1.0 (justify based on source quality) |
| `verified_by` | ✅ YES | Agent ID (e.g., `S1-H02`) |
| `verification_date` | ✅ YES | ISO 8601 format |
---
## 🔍 Evidence Quality Scoring
### **Primary Sources (9-10 credibility):**
- Official government statistics (e.g., DGCCRF yacht registration data)
- Original research studies (peer-reviewed journals)
- Industry association reports (ECPY, Nautical Statistics)
- Codebase analysis (file:line references in NaviDocs repo)
- Direct interviews with verified experts (transcripts available)
**Examples:**
- ✅ "YachtWorld 2024 Ownership Cost Report (PDF, 47 pages)"
- ✅ "NaviDocs codebase: `server/db/schema.sql:45-67`"
- ✅ "Boat International Annual Market Report 2024"
### **Secondary Sources (7-8 credibility):**
- Industry news articles (Boat International, YachtWorld)
- Competitor websites (pricing pages, feature lists)
- Trade show presentations (documented with photos/slides)
- Expert blog posts (verified industry professionals)
- LinkedIn profiles (for market sizing claims)
**Examples:**
- ✅ "Northrop & Johnson website pricing (screenshot + SHA-256 hash)"
- ✅ "Camper & Nicholsons feature comparison table"
### **Tertiary Sources (5-6 credibility):**
- Forum discussions (YachtForums, The Hull Truth)
- Reddit threads (r/sailing, r/yachts)
- Anecdotal evidence ("broker told me...")
- Marketing materials (press releases, brochures)
**Examples:**
- ⚠️ "YachtForums thread: 'What do yacht owners really need?'"
- ⚠️ Use only if ≥2 primary sources unavailable
### **Unverified (0-4 credibility):**
- Claims without sources ("industry experts estimate...")
- Single-source claims (need ≥2 sources)
- Broken links (URL returns 404)
- Paywalled content (can't verify)
**Examples:**
- ❌ "Experts say warranty claims cost €10K-€50K" (who? which experts?)
- ❌ Single YachtWorld article without corroboration
---
## ✅ IF.TTT Compliance Checklist
**Before committing any citation, verify:**
- [ ] **≥2 independent sources** for high-confidence claims (confidence ≥0.9)
- [ ] **Web URLs include SHA-256 hash** (tamper detection via `sha256sum <file>`)
- [ ] **File references include line numbers** (`intelligence/session-1/market-analysis.md:45-67`)
- [ ] **Citation ID follows if:// URI scheme** (`if://citation/warranty-savings-8k-33k`)
- [ ] **Confidence score justified** (0.9+ requires ≥2 primary sources)
- [ ] **Status tracked** (unverified → verified → disputed → revoked)
- [ ] **Agent ID recorded** (who verified this claim?)
- [ ] **Verification date recorded** (when was this verified?)
---
## 📊 Multi-Source Verification Examples
### **Example 1: Market Sizing Claim**
**Claim:** "Mediterranean yacht brokerage market: 150-200 active brokers"
**Good Citation (≥2 sources):**
```json
{
"citation_id": "if://citation/mediterranean-broker-count",
"claim": "Mediterranean yacht brokerage market: 150-200 active brokers",
"sources": [
{
"type": "web",
"url": "https://yachtworld.com/research/mediterranean-brokers-2024",
"sha256": "a1b2c3d4...",
"quality": "primary",
"credibility": 9,
"excerpt": "Our database shows 178 active yacht brokers in Mediterranean region"
},
{
"type": "web",
"url": "https://boatinternational.com/market-analysis/2024",
"sha256": "e5f6g7h8...",
"quality": "primary",
"credibility": 9,
"excerpt": "Estimated 150-200 professional yacht brokers operating in Med"
}
],
"status": "verified",
"confidence_score": 0.95
}
```
### **Example 2: Technical Claim**
**Claim:** "NaviDocs uses Express.js + SQLite for backend"
**Good Citation (codebase reference):**
```json
{
"citation_id": "if://citation/navidocs-tech-stack",
"claim": "NaviDocs uses Express.js + SQLite for backend",
"sources": [
{
"type": "file",
"path": "server/index.js",
"line_range": "1-15",
"quality": "primary",
"credibility": 10,
"excerpt": "const express = require('express'); const sqlite3 = require('sqlite3');"
},
{
"type": "file",
"path": "package.json",
"line_range": "12-18",
"quality": "primary",
"credibility": 10,
"excerpt": "dependencies: { express: ^4.18.0, sqlite3: ^5.1.0 }"
}
],
"status": "verified",
"confidence_score": 1.0
}
```
### **Example 3: Competitive Claim**
**Claim:** "Competitor X charges €25/month for yacht management software"
**Good Citation (competitor website + screenshot):**
```json
{
"citation_id": "if://citation/competitor-x-pricing",
"claim": "Competitor X charges €25/month for yacht management software",
"sources": [
{
"type": "web",
"url": "https://competitorx.com/pricing",
"sha256": "b2c3d4e5...",
"accessed": "2025-11-13",
"quality": "primary",
"credibility": 9,
"screenshot": "intelligence/session-1/screenshots/competitor-x-pricing.png"
},
{
"type": "file",
"path": "intelligence/session-1/competitive-analysis.md",
"line_range": "120-125",
"quality": "secondary",
"credibility": 8,
"excerpt": "Competitor X pricing confirmed via website analysis"
}
],
"status": "verified",
"confidence_score": 0.90
}
```
---
## 🚨 Common Mistakes to Avoid
### **❌ Bad: Single Source**
```json
{
"claim": "Warranty claims cost €8K-€33K per yacht",
"sources": [
{
"type": "web",
"url": "https://yachtworld.com/article",
"quality": "primary"
}
],
"confidence_score": 0.95 // ❌ Can't claim 0.95 with single source!
}
```
### **✅ Good: Multiple Sources**
```json
{
"claim": "Warranty claims cost €8K-€33K per yacht",
"sources": [
{
"type": "web",
"url": "https://yachtworld.com/warranty-costs-2024",
"sha256": "a1b2...",
"credibility": 9
},
{
"type": "web",
"url": "https://boatinternational.com/ownership-costs",
"sha256": "c3d4...",
"credibility": 9
}
],
"confidence_score": 0.95 // ✅ Justified with ≥2 primary sources
}
```
---
## 🔄 Real-Time Quality Feedback Loop
**Sessions 1-4: Check `QUALITY_FEEDBACK.md` every 5 minutes**
Session 5 Agent 0B monitors your commits and provides real-time feedback:
```markdown
## ⚠️ Session 2 (Needs Attention)
- Agent 3 maintenance log claim: Missing line number reference
- Claim: "NaviDocs tracks maintenance via BullMQ workers"
- Fix: Add file:line reference (e.g., `server/workers/maintenance.js:45-67`)
## 🔴 Session 1 (Action Required)
- Agent 5 ROI calculator: No source citations for €8K-€33K warranty claim
- Fix: Add ≥2 sources (YachtWorld + Boat International reports)
```
**Action:** Read feedback → Fix issues → Commit → Continue working
---
## 📈 Confidence Score Guidelines
| Score | Sources Required | Quality Required | Use Case |
|-------|------------------|------------------|----------|
| 0.95-1.0 | ≥2 primary | Both 9-10 credibility | Market sizing, ROI calculations |
| 0.85-0.94 | ≥2 mixed | 1 primary + 1 secondary | Competitive analysis, feature claims |
| 0.70-0.84 | ≥1 primary | 7-10 credibility | Technical claims (if codebase verified) |
| 0.50-0.69 | ≥1 secondary | 5-8 credibility | Anecdotal evidence, forum discussions |
| <0.50 | Any | <5 credibility | Unverified claims (flag for review) |
---
## 🎯 Guardian Council Expectations
### **What Gets >90% Approval:**
- All high-confidence claims (≥0.9) have ≥2 primary sources
- Technical claims reference codebase with file:line
- Market sizing backed by official statistics or industry reports
- ROI calculations show work (formulas + source data visible)
- Implementation timeline realistic (validated against codebase complexity)
### **What Gets <80% Approval (ESCALATED):**
- >20% of claims lack proper citations
- Single-source claims for critical market data
- Broken URLs or inaccessible sources
- Confidence scores not justified by source quality
- Unverified claims in executive summary
---
## 📞 Need Help?
**Questions about citation format?**
- Check `schemas/citation/v1.0.schema.json` (JSON schema reference)
- Review Session 5 examples in `CLOUD_SESSION_5_SYNTHESIS_VALIDATION.md`
**Quality feedback unclear?**
- Check `QUALITY_FEEDBACK.md` (updated every 5 minutes by Agent 0B)
- ESCALATE to Sonnet coordinator if blocked
**Citation tool available:**
```bash
# Validate citation JSON against schema
python tools/citation_validate.py citations/session-1-citations.json
```
---
**Remember: High-quality evidence now = Faster Guardian approval later = Faster launch!**
🚀 Generated with [Claude Code](https://claude.com/claude-code)