feat(PBI-67): IDEA_REVIEW_PLAN — iterative multi-model plan review (#199)

* feat(ideas): upload-plan knop — short-circuit van Make-Plan AI-flow Voegt een 'Upload plan' knop toe in idea-row-actions (verschijnt in zowel list als idea-detail). Klik → file picker → kies .md → server-side parse + opslaan; idea-status springt naar PLAN_READY. Vandaaruit de bestaande 'Maak PBI' knop voor materialize. Server (uploadPlanMdAction): - Toegestaan vanuit DRAFT, GRILLED, PLAN_FAILED, PLAN_READY - DRAFT → skip-grill: status gaat direct naar PLAN_READY - PLAN_READY overschrijft het bestaande plan (consistent met updatePlanMdAction, geen confirmation) - Geblokkeerd in GRILLING/PLANNING (job loopt), PLANNED (al gematerialiseerd) - Parse-failure → 422 + details (NIET opslaan, zodat een onparseerbaar plan nooit in de DB belandt) - Empty / >100k chars → 422 - Schrijft IdeaLog NOTE met from_status + length - Rate-limit + demo-guard + ownership-check via loadOwnedIdea (zelfde patroon als updatePlanMdAction) UI (idea-row-actions.tsx): - Hidden <input type=file accept=".md,.markdown,text/markdown,text/plain"> - FileReader → text → action - Toast bij success + router.refresh() - Blocked-tooltip in andere statussen Tests: 10 nieuwe in __tests__/actions/ideas-crud.test.ts dekkend voor: happy paths (DRAFT/GRILLED/PLAN_READY-overwrite/PLAN_FAILED), blocks (PLANNED/GRILLING), validation (empty/oversized/parse-fail), 404. Full suite groen: 849/849. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> * Add reviews for Bootstrap-wizard plans v3.2 to v3.4 - Review v3.2: Addressed executor model, fire-and-forget issues, and PAT handling. - Review v3.3: Improved transaction handling, stale recovery, and ID generation. - Review v3.4: Finalized GitHub permissions, catalog versioning, and E2E verification queries. - Updated recommendations for each version to enhance implementation readiness. * docs(plans): M8 bootstrap-wizard upload-variant v1.4 — backtick-paden Upload-variant van het volledige technische plan (docs/plans/M8-bootstrap-wizard.md), bedoeld voor de "Upload plan"-functie. Genereert 1 PBI + 4 Stories + 22 Tasks via materializeIdeaPlanAction. v1.4-aanpassingen tov eerdere generatie-iteratie: - Alle bestandspaden in implementation_plan in backticks (path-extractor matchen) - Expliciete "Bestanden:" blok per task vóór de stappen - Alle tasks op verify_required: ALIGNED_OR_PARTIAL (was deels ALIGNED — te strict voor ADR-stubs en multi-file edits) Fixt forward-only: T-963 cancelled_by_self door DIVERGENT verifier-verdict. Re-upload van dit bestand produceert tasks die door verify_task_against_plan als ALIGNED of PARTIAL geclassificeerd kunnen worden. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> * PBI-67: Add review-plan support to Idea model and job config - Add plan_review_log and reviewed_at fields to Idea model - Add REVIEWING_PLAN, PLAN_REVIEW_FAILED, PLAN_REVIEWED to IdeaStatus enum - Add IDEA_REVIEW_PLAN to ClaudeJobKind enum - Add IDEA_REVIEW_PLAN config to job-config.ts with model=opus, thinking_budget=6000 - Create migration record for schema changes (applied via db push) Co-Authored-By: Claude Haiku 4.5 <noreply@anthropic.com> * PBI-67 Phase 2: Add update-idea-plan-reviewed MCP tool - Create src/tools/update-idea-plan-reviewed.ts: saves review-log and transitions idea status to PLAN_REVIEWED - Add PLAN_REVIEW_RESULT to IdeaLogType enum (both repos) - Register tool in src/index.ts - Update Prisma schemas (both repos): add plan_review_log and reviewed_at fields to Idea model - Add REVIEWING_PLAN, PLAN_REVIEW_FAILED, PLAN_REVIEWED to IdeaStatus enum (MCP schema) - Add IDEA_REVIEW_PLAN to ClaudeJobKind enum (MCP schema) - Tool includes transaction safety and convergence metrics logging Co-Authored-By: Claude Haiku 4.5 <noreply@anthropic.com> * feat(PBI-67): IDEA_REVIEW_PLAN Phases 3-6 — server actions, UI components, prompt & tests - Phase 3: startReviewPlanJobAction, cancelIdeaJobAction, status transitions (REVIEWING_PLAN / PLAN_REVIEWED / PLAN_REVIEW_FAILED), status colors, job-card/jobs-column filters, idea-list status tabs - Phase 4: review-plan-job.md prompt (multi-model orchestration with codex injection + active plan revision via update_idea_plan_md after each round), runbook, 13 unit tests - Phase 5: ReviewLogViewer component (rounds, convergence, approval, issues), idea-detail integration, proper ReviewLog TypeScript types exported from component - Phase 6.1: wait-for-job discriminator wired (IDEA_REVIEW_PLAN), plan-revision step made mandatory in prompt (was previously optional/missing) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
2026-05-14 01:35:02 +00:00 · 2026-05-14 01:35:02 +00:00 · d84cdf664f
commit d84cdf664f
parent b8e22539f6
28 changed files with 4387 additions and 30 deletions
--- a/tests/review-plan-job.test.ts
+++ b/tests/review-plan-job.test.ts
@ -0,0 +1,212 @@
+import { describe, it, expect } from 'vitest'
+
+/**
+ * Review-Plan Job Tests
+ *
+ * Tests for the IDEA_REVIEW_PLAN job kind and review-log schema validation.
+ */
+
+// Sample review-log structure for testing
+const sampleReviewLog = {
+  plan_file: 'I-042',
+  created_at: new Date().toISOString(),
+  rounds: [
+    {
+      round: 0,
+      model: 'claude-3-5-haiku',
+      role: 'Structure Review',
+      focus: 'YAML parsing, format, syntax',
+      plan_before: '---\npbi:\n  title: "Test PBI"\nstories:\n  - title: "Story 1"\n---',
+      plan_after:
+        '---\npbi:\n  title: "Test PBI"\nstories:\n  - title: "Story 1"\n    priority: 2\n---',
+      issues: [
+        {
+          category: 'structure',
+          severity: 'warning',
+          suggestion: 'Add priority field to story',
+        },
+      ],
+      score: 75,
+      plan_diff_lines: 1,
+      converged: false,
+      timestamp: new Date().toISOString(),
+    },
+    {
+      round: 1,
+      model: 'claude-3-5-sonnet',
+      role: 'Logic & Patterns',
+      focus: 'Logic gaps, missing patterns, architecture fit',
+      plan_before: '---\npbi:\n  title: "Test PBI"\nstories:\n  - title: "Story 1"\n---',
+      plan_after: '---\npbi:\n  title: "Test PBI"\nstories:\n  - title: "Story 1"\n---',
+      issues: [
+        {
+          category: 'logic',
+          severity: 'info',
+          suggestion: 'Consider adding acceptance criteria',
+        },
+      ],
+      score: 80,
+      plan_diff_lines: 0,
+      converged: false,
+      timestamp: new Date().toISOString(),
+    },
+    {
+      round: 2,
+      model: 'claude-opus-4-7',
+      role: 'Risk Assessment',
+      focus: 'Risk assessment, edge cases, refactoring',
+      plan_before: '---\npbi:\n  title: "Test PBI"\nstories:\n  - title: "Story 1"\n---',
+      plan_after: '---\npbi:\n  title: "Test PBI"\nstories:\n  - title: "Story 1"\n---',
+      issues: [],
+      score: 85,
+      plan_diff_lines: 0,
+      converged: true,
+      timestamp: new Date().toISOString(),
+    },
+  ],
+  convergence: {
+    stable_at_round: 2,
+    final_diff_pct: 0.5,
+    convergence_metric: 'plan_stability',
+  },
+  approval: {
+    status: 'approved',
+    timestamp: new Date().toISOString(),
+  },
+  summary: 'Plan reviewed across three rounds. Minor structure improvements suggested. Plan approved.',
+}
+
+describe('review-plan-job', () => {
+  describe('ReviewLog Schema', () => {
+    it('should have required top-level fields', () => {
+      expect(sampleReviewLog).toHaveProperty('plan_file')
+      expect(sampleReviewLog).toHaveProperty('created_at')
+      expect(sampleReviewLog).toHaveProperty('rounds')
+      expect(sampleReviewLog).toHaveProperty('convergence')
+      expect(sampleReviewLog).toHaveProperty('approval')
+      expect(sampleReviewLog).toHaveProperty('summary')
+    })
+
+    it('should have valid plan_file format', () => {
+      expect(typeof sampleReviewLog.plan_file).toBe('string')
+      expect(sampleReviewLog.plan_file.length).toBeGreaterThan(0)
+    })
+
+    it('should have valid ISO timestamps', () => {
+      const isoRegex = /^\d{4}-\d{2}-\d{2}T\d{2}:\d{2}:\d{2}/
+      expect(sampleReviewLog.created_at).toMatch(isoRegex)
+      expect(sampleReviewLog.approval.timestamp).toMatch(isoRegex)
+    })
+
+    it('should have at least one round', () => {
+      expect(sampleReviewLog.rounds.length).toBeGreaterThan(0)
+    })
+
+    it('should have valid round structure', () => {
+      for (const round of sampleReviewLog.rounds) {
+        expect(round).toHaveProperty('round')
+        expect(round).toHaveProperty('model')
+        expect(round).toHaveProperty('role')
+        expect(round).toHaveProperty('focus')
+        expect(round).toHaveProperty('plan_before')
+        expect(round).toHaveProperty('plan_after')
+        expect(round).toHaveProperty('issues')
+        expect(round).toHaveProperty('score')
+        expect(round).toHaveProperty('plan_diff_lines')
+        expect(round).toHaveProperty('converged')
+        expect(round).toHaveProperty('timestamp')
+
+        expect(typeof round.round).toBe('number')
+        expect(round.round).toBeGreaterThanOrEqual(0)
+        expect(typeof round.score).toBe('number')
+        expect(round.score).toBeGreaterThanOrEqual(0)
+        expect(round.score).toBeLessThanOrEqual(100)
+        expect(typeof round.plan_diff_lines).toBe('number')
+        expect(round.plan_diff_lines).toBeGreaterThanOrEqual(0)
+      }
+    })
+
+    it('should have valid issue structure per round', () => {
+      for (const round of sampleReviewLog.rounds) {
+        for (const issue of round.issues) {
+          expect(issue).toHaveProperty('category')
+          expect(issue).toHaveProperty('severity')
+          expect(issue).toHaveProperty('suggestion')
+
+          expect(['structure', 'logic', 'risk', 'pattern']).toContain(issue.category)
+          expect(['error', 'warning', 'info']).toContain(issue.severity)
+          expect(typeof issue.suggestion).toBe('string')
+          expect(issue.suggestion.length).toBeGreaterThan(0)
+        }
+      }
+    })
+
+    it('should have valid convergence structure when present', () => {
+      if (sampleReviewLog.convergence) {
+        expect(sampleReviewLog.convergence).toHaveProperty('stable_at_round')
+        expect(sampleReviewLog.convergence).toHaveProperty('final_diff_pct')
+        expect(sampleReviewLog.convergence).toHaveProperty('convergence_metric')
+
+        expect(typeof sampleReviewLog.convergence.stable_at_round).toBe('number')
+        expect(sampleReviewLog.convergence.stable_at_round).toBeGreaterThanOrEqual(0)
+        expect(typeof sampleReviewLog.convergence.final_diff_pct).toBe('number')
+        expect(sampleReviewLog.convergence.final_diff_pct).toBeGreaterThanOrEqual(0)
+        expect(sampleReviewLog.convergence.final_diff_pct).toBeLessThanOrEqual(100)
+      }
+    })
+
+    it('should have valid approval status', () => {
+      expect(['pending', 'approved', 'rejected']).toContain(sampleReviewLog.approval.status)
+      if (sampleReviewLog.approval.status !== 'pending') {
+        expect(sampleReviewLog.approval.timestamp).toBeDefined()
+      }
+    })
+
+    it('should have non-empty summary', () => {
+      expect(typeof sampleReviewLog.summary).toBe('string')
+      expect(sampleReviewLog.summary.length).toBeGreaterThan(0)
+    })
+  })
+
+  describe('Convergence Detection', () => {
+    it('should detect convergence when diff_pct < 5% for two consecutive rounds', () => {
+      // Simulate convergence: round 0 has 1 diff line, rounds 1-2 have 0 diffs
+      const totalLines = 50
+      const diff0 = 1
+      const diff1 = 0
+      const diff2 = 0
+
+      const pct0 = (diff0 / totalLines) * 100 // 2%
+      const pct1 = (diff1 / totalLines) * 100 // 0%
+      const pct2 = (diff2 / totalLines) * 100 // 0%
+
+      expect(pct0).toBeLessThan(5) // Should converge
+      expect(pct1).toBeLessThan(5) // Should converge
+      expect(pct2).toBeLessThan(5) // Should converge
+    })
+
+    it('should not detect convergence when diff_pct >= 5%', () => {
+      const totalLines = 50
+      const diff = 3 // 6% change
+
+      const pct = (diff / totalLines) * 100
+      expect(pct).toBeGreaterThanOrEqual(5)
+    })
+  })
+
+  describe('Status Transitions', () => {
+    it('should transition REVIEWING_PLAN → PLAN_REVIEWED when approved', () => {
+      const log = { ...sampleReviewLog, approval: { status: 'approved', timestamp: new Date().toISOString() } }
+      expect(log.approval.status).toBe('approved')
+      // In actual implementation: update_idea_plan_reviewed({ approval_status: 'approved' })
+      // → idea.status = 'PLAN_REVIEWED'
+    })
+
+    it('should transition REVIEWING_PLAN → PLAN_REVIEW_FAILED when rejected', () => {
+      const log = { ...sampleReviewLog, approval: { status: 'rejected' } }
+      expect(log.approval.status).toBe('rejected')
+      // In actual implementation: update_idea_plan_reviewed({ approval_status: 'rejected' })
+      // → idea.status = 'PLAN_REVIEW_FAILED'
+    })
+  })
+})