Chapter 66: Kaizen Roadmap — R4 Research Tickets (KAIZEN-0017..0044)

In the pmat project, kaizen (改善, “continuous improvement”) refers to a structured, evidence-driven backlog of enhancements derived from external research rather than internal feature planning. Round 4 of this research — tracked in GitHub issue #337 and opened on 2026-04-18 — indexes 28 new tickets (KAIZEN-0017 through KAIZEN-0044). Seventeen tickets originate from newly published arxiv papers in the 2604 series (post-dating the R1/R2/R3 rounds); eleven come from integration gaps exposed by Claude Code 3.7+ changelogs (v2.1.84 through v2.1.113). Each ticket maps a specific research finding or upstream feature to a concrete pmat capability, and this chapter catalogs them so book readers can trace proposed commands and MCP tools back to their scientific or product justification.

Ticket Index (KAIZEN-0017..0044)

#TitleSource (arxiv ID or CC feature)PriorityStatus
KAIZEN-0017AgentOpt UCB-E model selectionarxiv:2604.06296MProposed
KAIZEN-0018Local-Splitter 7 tactics (45-79% token savings)arxiv:2604.12301MProposed
KAIZEN-0019SkillMOO NSGA-II optimizationarxiv:2604.09297MProposed
KAIZEN-0020TSUBASA long-horizon memory distillationarxiv:2604.07894SProposed
KAIZEN-0021ContextCurator RL entropy-reducing policyarxiv:2604.11462MProposed
KAIZEN-0022ClawVM harness-managed typed pagesarxiv:2604.10352MProposed
KAIZEN-0023ClawGuard runtime MCP middlewarearxiv:2604.11790SProposed
KAIZEN-0024MCP-DPT 6-layer defense taxonomyarxiv:2604.07551SProposed
KAIZEN-0025Compiler-LLM cooperation for hot functionsarxiv:2604.04238MProposed
KAIZEN-0026CascadeDebate confidence router (haiku→opus)arxiv:2604.12262MProposed
KAIZEN-0027Scaffold codegen (react / plan-exec / tree)arxiv:2604.03515SProposed
KAIZEN-0028Agentic bug taxonomy auto-labelingarxiv:2604.08906SProposed
KAIZEN-0029Argus SAST multi-agent orchestratorarxiv:2604.06633MProposed
KAIZEN-0030Beyond-Fluency trajectory verification gatesarxiv:2604.04269SProposed
KAIZEN-0031WebXSkill executable skill miningarxiv:2604.13318MProposed
KAIZEN-0032AnyPoC universal PoC generatorarxiv:2604.11950MProposed
KAIZEN-0033CCCE continuous dependency calibrationarxiv:2604.13102MProposed
KAIZEN-0034PreCompact hook for .pmat-work/ snapshotsCC v2.1.105 changelogSProposed
KAIZEN-0035Hardened Bash(pmat *) denyRulesCC v2.1.113 changelogSProposed
KAIZEN-0036sandbox.network.deniedDomains blocklistCC v2.1.113 changelogSProposed
KAIZEN-0037/effort xhigh advisor for pmat tasksCC v2.1.111 changelogSProposed
KAIZEN-0038Session recap (pmat session-recap)CC v2.1.108 changelogSProposed
KAIZEN-0039monitors manifest key for quality monitorCC v2.1.105 changelogMProposed
KAIZEN-0040Skill-chain automation via Skill toolCC v2.1.108 changelogSProposed
KAIZEN-0041forceRemoteSettingsRefresh for team driftCC v2.1.92 changelogMProposed
KAIZEN-0042WorktreeCreate auto-indexes new worktreesCC v2.1.84 changelogSProposed
KAIZEN-0043pmat observe agents subagent observabilityCC v2.1.97 changelogMProposed
KAIZEN-0044Skills paths: glob scoping for autoloadCC v2.1.84 changelogSProposed

Effort labels use the same convention as prior rounds: S ≈ one-sprint ticket, M ≈ multi-sprint with design doc.

Execution Plan

The 28 tickets cluster into six themes. Sequencing below reflects dependency order and the issue-level triage notes: S-effort Claude Code gaps are queued first because they unblock downstream tickets, while larger arxiv-driven capabilities land after the CC integration surface is stable.

1. Claude Code Integration & Hook Ecosystem

  • KAIZEN-0034 — PreCompact hook preserving .pmat-work/ state
  • KAIZEN-0042 — WorktreeCreate hook triggering pmat context build --worktree
  • KAIZEN-0039monitors manifest entry for background quality monitor
  • KAIZEN-0038pmat session-recap writing to .pmat/recap/<date>.md

These are the minimum-viable hook integrations needed so that pmat state survives compaction and worktree creation across a CC session.

2. MCP Conformance & Security

  • KAIZEN-0023 — ClawGuard MCP middleware with allowlist schema
  • KAIZEN-0024 — MCP-DPT audit across pmat’s 6 defense layers
  • KAIZEN-0022 — ClawVM typed-page state abstraction (pmat-state MCP tool)
  • KAIZEN-0035 — Strict Bash(pmat *) denyRules template
  • KAIZEN-0036sandbox.network.deniedDomains recommendations

Security work must land before external-facing cost optimizations (theme 4) because ClawGuard validates adversarial tool returns that the cascading router would otherwise trust.

3. Settings, Skills, and SDK Updates

  • KAIZEN-0044paths: globs on shipped skills for autoload scoping
  • KAIZEN-0040 — Documented skill chains (coverage-gaps → tests → /compact)
  • KAIZEN-0041pmat settings sync --remote for managed team settings
  • KAIZEN-0037/effort xhigh advisor mapping pmat tasks to effort levels

4. Cost-Aware Cascading & MOO Delegation

  • KAIZEN-0017pmat agent-opt UCB-E model selection
  • KAIZEN-0018pmat agent-split --tactics t1,t2,t3 local-route + compress
  • KAIZEN-0019pmat skills-optimize --goals pass,cost,latency NSGA-II
  • KAIZEN-0026pmat ask --cascade haiku→sonnet→opus by confidence

Quantified gains from the source papers: 13-32× cost gap (AgentOpt), 45-79% token savings (Local-Splitter), +131% pass rate (SkillMOO), +26.75% Pareto improvement (CascadeDebate).

5. Memory Fidelity & Context Curation

  • KAIZEN-0020pmat memory-distill --session <id> TSUBASA-style distillation
  • KAIZEN-0021pmat context curate --budget 50000 RL entropy reduction
  • KAIZEN-0043pmat observe agents subagent + MCP-latency tail

6. Trajectory Integrity, Scaffolds & Debugging

  • KAIZEN-0027pmat agent scaffold --pattern react|plan-exec|tree
  • KAIZEN-0030pmat trace verify --gates-per-step contract validation
  • KAIZEN-0028pmat work classify-bug --taxonomy agentic auto-label + Five Whys
  • KAIZEN-0031pmat skills-mine --from-history reusable-sequence mining
  • KAIZEN-0032pmat poc-gen --from-bug-report regression-grade PoC
  • KAIZEN-0029pmat security-scan --agents argus Argus SAST orchestrator
  • KAIZEN-0025pmat optimize --cooperate rustc,clippy,miri
  • KAIZEN-0033pmat deps autopatch --gate auto-validate

This theme is the R4 “shift from endpoint accuracy to trajectory integrity” — every tool call is a verifiable step, mirroring pmat’s existing Five Whys + contract-falsification posture.

Cross-References

  • Round 1–3 tickets (KAIZEN-0001..0016) were indexed in the previous chapter (see Chapter 55 — Autonomous Continuous Improvement for the kaizen execution engine itself).
  • Ticket status updates land on the GitHub issue first, then propagate here on each release of the book.
  • For ClawVM (KAIZEN-0022), check whether the existing trueno-graph O(1) context layer already satisfies the typed-page requirement before reimplementing.
  • For the agentic bug taxonomy (KAIZEN-0028), reconcile the 5-layer taxonomy against the categories already encoded in pmat five-whys.