English | 简体中文 | 日本語 | 한국어 | Tiếng Việt | Português | Español | Deutsch | Français | हिंदी
Built by world-class engineers, for vibecoders at
flowser.ai — AI Agents with computers for GTM

"Total Concentration — Spec Breathing, Tenth Form: the Vibe Flow never breaks."
— Tanjiro Kamado
Drop this into any project. Your AI agent gets a complete plan-first dev process — 7 gated phases, self-healing check loops, and autopilot that runs start to finish without losing its place.
| 📦 One-command install One curl line drops it into any project. It detects new vs. returning users and never overwrites your files. |
🌐 Works everywhere Any tech stack, any language, and any AI coding agent — Claude Code, Codex, Cursor, Windsurf, Copilot, and more. |
| 🧭 RIPER-5 plan-first workflow 7 gated phases (Research → Spec → Innovate → Plan → Validate → Execute → Update-Process) stop the agent from jumping straight to code. |
🚀 Autopilot mode (quick / fast / full) Start a hands-free run at any phase with a single phrase. Three lanes match the ceremony to the risk. |
🎯 /goal — the run-until-done tokenOne copy-pasteable block keeps the agent running phase after phase without stopping — and resumes the run in a fresh session. |
🔁 PVL + EVL self-healing loops Plan-check-fix and test-check-fix loops find gaps, fix them, and re-check on their own — up to 10 cycles each. |
| 🔍 vc-autoresearch A reusable find-gaps → fix → repeat loop you can point at plans, tests, specs, docs, or evals. |
🧪 Feasibility probes Test-before-you-build verdicts (VIABLE / NOT-VIABLE) before the agent commits to any design approach. |
| 🎛️ Smart strategy picker Before each phase it weighs one agent vs. many vs. a coordinated team — with cost estimates — and picks the cheapest that fits. |
🧮 Smart model use The expensive model only writes code; the cheaper model does everything else. Lower cost, same quality. |
| 🤔 Intent clarification When a request is vague, the agent asks a few sharp questions up front instead of guessing and building the wrong thing. |
🛡️ 36 validators Mechanical correctness checks — not opinions — guard the kit's own structure and catch drift before it ships. |
| 🏗️ Phase programs Large projects are split into independent phases with quality gates between them, so big work doesn't fall apart. |
🔀 Programs that reshape themselves As it learns, the agent inserts new phases, reorders work, and skips blocked steps — the plan adapts on the fly. |
| 🧠 Never loses its place Progress notes are written to disk every phase, so a run survives a memory reset and picks up exactly where it left off. |
📚 Self-improving project memory It learns your codebase on setup and keeps its own shared notes current after every feature ships, so docs never go stale. |
| ⚡ Quick Fix + Fast Mode Light lanes for small changes skip the heavy ceremony, so a one-line fix stays a one-line fix. |
🧱 Layered, auto-discovered skills Skills are organized in clear layers and discovered automatically — the agent always finds the right tool for the step. |
| 🤖 15 agents · 33 skills · 10 hooks A full team of specialized agents, reusable skills, and safety hooks, all wired together out of the box. |
🔄 Full kit lifecycle Install, setup, update, and publish are all one command each — keeping every project on the latest kit safely. |
| 📝 SPEC — your plain-language sign-off Before any design, you state what to build in simple user stories — the cheapest place to catch a misunderstanding. |
🎯 Always checks your intent Every later phase measures back against your SPEC: is what we're building actually what you asked for? |
The simplest, most flexible, team-friendly coding kit for
Works across any tech stack, any language, any project
Not just for show. When you run vc-setup, agents scan your codebase,
detect your stack, and build project-specific knowledge groups that every skill reads before it works.
Other harnesses lock agents to one language — rust-review-agent, python-linter — useless elsewhere.
This one adapts to any combination above and builds up knowledge as you ship.
Prerequisites: Node.js ≥ 22, git, bash (macOS / Linux / WSL; on Alpine:
apk add bash).
There is only one command, and it works for everyone. Run it inside your project folder. It detects whether you are a new or returning user, installs safely without overwriting your files, and then tells you the exact next thing to say.
curl -fsSL https://raw.githubusercontent.com/withkynam/vibecode-pro-max-kit/main/install.sh | bashWhen it finishes, it prints one of two messages — read the bottom of the output and do exactly what it says:
Installer detects no harness and prints:
Next: Run: claude → Say: "Run vc-setup"
→ Open your agent and say Run vc-setup
vc-setup detects your tech stack, creates the process/ folder, scans your codebase, and fills in your real architecture, conventions, and test commands — a conversation, not a checklist.
|
Installer detects a prior install and prints:
Next (upgrade detected): Run: claude → Say: "Run vc-update"
→ Open your agent and say Run vc-update
vc-update pulls the latest version and, if it finds old-format plans or folders, gives you a ready-to-paste prompt to finish the move with zero data loss. Your process/ is never touched.
|
💡 You never have to guess the command.
install.shroutes you: fresh →vc-setup, upgrade →vc-update. Re-running install is always safe — it never breaks things. Codex users: run/vc-setup(or/vc-update) instead of saying it in chat.
📦 What install puts on disk (non-destructive)
your-project/
├── .claude/
│ ├── agents/ # 🤖 15 agent definitions (.md)
│ ├── skills/ # ⚡ 33 skills (each a dir with SKILL.md)
│ └── hooks/ # 🪝 10 lifecycle hooks (.cjs / .mjs)
├── .codex/agents/ # 🔄 Mirrored agents for Codex
├── .agents/skills → # 🔗 Symlink to .claude/skills (Codex discovery)
├── CLAUDE.md # 📋 Orchestrator + routing rules
├── AGENTS.md # 📖 Agent + skill registry (cross-tool)
└── process/
└── development-protocols/ # 📜 22 shared workflow docs (seeded by install)
# context/, plans, features → built by vc-setup
- Non-destructive. Your existing
.claude/skills/,.claude/agents/,process/, andsettings.jsonare never wiped. Only kit-owned files are written or updated. - Existing config? Backed up to
.vibecode-backup/; yoursettings.jsonis restored afterward. - Existing
CLAUDE.md? Backed up asCLAUDE.md.pre-vibecode. - Existing
process/? Never touched by install —vc-setup/vc-updatemigrate it interactively, showing you the diff first.
One-time first-install caveat: if you have custom skills/agents whose names start with
vc-(the reserved kit namespace) and have never run install before, the stale-removal step may flag them. After install, runls .claude/skills/ .claude/agents/to confirm. Use a non-vc-prefix (my-,team-,proj-) for your own additions to avoid this entirely.
🤖 Prefer to drive setup from your agent? (full prompt)
Open Claude Code or Codex with your project folder as the working directory, then paste:
First, install the vibecode-pro-max-kit agent harness by running this command:
curl -fsSL https://raw.githubusercontent.com/withkynam/vibecode-pro-max-kit/main/install.sh | bash
After install completes, run vc-setup and follow the full interactive flow:
1. DETECT — Read package.json (or go.mod, Cargo.toml, pyproject.toml, etc.), detect my
stack: framework, package manager, monorepo structure, test framework, database, auth.
Also check for any existing .claude/, process/, or context files.
2. SHOW ME WHAT YOU FOUND — Summarize detection and wait for me to confirm. If this is an
existing project, tell me what looks good vs what could be improved.
3. ASK ME ABOUT THE PROJECT — Have a real conversation. Ask follow-ups, probe anything
vague, keep going until you genuinely understand it. Summarize back and confirm.
4. SCAFFOLD — Create the process/ directory. If process/ already exists, show me the plan
and wait for approval. Never silently move or delete my files.
5. STUDY — Deep-scan and populate process/context/all-context.md with REAL content: repo
structure, stack + versions, patterns, import aliases, env vars, routes, schema, tests.
No placeholder text.
6. VALIDATE — Run all validation checks to confirm everything is wired correctly.
Rules: read and preserve good existing context; show me a summary before each major change
and wait for my OK; never create empty placeholder files; ask before reorganizing.
Table of Contents
- At a Glance · The Problem · The Fix
- Vibe Coding Revolution · Who Is This For · How This Compares · What Makes This Different
- How It Works: The Coordinator · The RIPER-5 Lifecycle · Intent Clarification
- The Two Quality Loops (PVL + EVL) · Strategy Compare + Model Policy · Autopilot Mode · Feasibility Probes + Validators
- Built-in Safety Systems · Pre-Implementation Intelligence · Quality Pipeline
- Plan Lifecycle · Phase Programs · Context Groups · Feature Folders · Skill Layers · Self-Improving Memory
- What's Inside · Quick Fix + Fast Mode · Kit Lifecycle · Contributing
| Agents One per phase + 6 specialist agents |
Skills 20 workflow + 13 helper, matched by keyword |
Hooks Safety rails + automatic context loading |
Protocols Shared rules every agent follows |
| Validators Automated checks that catch errors before they ship |
Tools Claude Code · Codex · Cursor · Windsurf · Antigravity · OpenCode · Copilot |
Languages EN · 中文 · 日本語 · 한국어 · VI · PT · DE · FR · ES · हिन्दी |
Install One command, then your agent guides the rest |
| Autopilot 3 lanes (quick / fast / full) — start at any phase, runs start to finish without stopping |
/goal blocks Short copy-pasteable texts that resume hands-free runs across sessions after a reset |
vc-autoresearch Find-gaps → fix → repeat loop (shared tool for plans, tests, and evals) |
Feasibility probes Test-before-you-build verdicts (VIABLE / NOT-VIABLE) before locking in a design |
You ask Claude to "add webhook support." It immediately starts writing code. No questions about your architecture. No check on existing patterns. No plan. You get 400 lines that don't fit your codebase, and you spend an hour fixing it.
But that's just the surface. The deeper problems hit harder:
|
Context dies every session Your agent forgets everything it learned. Same mistakes, same questions, every time. No memory, no compounding knowledge. |
Docs go stale instantly You wrote great context docs last week. They're already outdated. Nothing auto-updates them as the codebase evolves. |
|
Big tasks collapse mid-way The context window fills, state is lost, the agent starts hallucinating. You restart from scratch on hour 3. |
No specs, no review, no collaboration Your PM can't review what the agent is about to build. There is no written plan to share, discuss, or approve before code is written. |
|
Architecture decisions are hallucinated The agent invents patterns instead of researching how other codebases solved the same problem. |
Nothing verifies "done" The agent says "all tests pass" — but it never independently re-ran them. You find out in production. |
Your agent has intelligence but no process, no memory, and no way to collaborate with your team. Whether you're a developer, a PM, or a CEO who just started vibe coding — this hits everyone the same way, and the fix is the same: give your agent a real development process.
This kit installs a complete development system into your project — not just a CLAUDE.md, but 15 specialized agents, 33 skills, 10 hooks, and 22 protocols — with a step-locked workflow that makes your agent understand before it builds, and prove before it ships.
| Plan-first approach PMs and devs review the same written plan before any code is written |
Self-improving knowledge Updates itself every time a feature ships — docs never go stale |
| Hands-free execution Survives session resets — runs for hours, not minutes |
Architecture research Studies real codebases before making design decisions |
| Two quality checks Plans are checked before coding; tests are re-run independently after |
Smart knowledge routing Loads only what is relevant — not your whole knowledge base every time |
%%{init: {'theme': 'base', 'themeVariables': {'fontSize': '14px', 'lineColor': '#8888AA'}} }%%
flowchart TD
R["RESEARCH\nread-only facts"]
S["SPEC\nrequirements doc"]
I["INNOVATE\n2-3 approaches"]
P["PLAN\ndetailed checklist"]
V["VALIDATE\nplan → contract\n(PVL loop)"]
E["EXECUTE\nimplement\n(EVL loop)"]
U["UPDATE PROCESS\ncapture + archive"]
R -->|"go"| S
S -->|"go"| I
I -->|"go"| P
P -->|"ENTER VALIDATE"| V
V -->|"Gate: PASS"| E
E -->|"gates green"| U
style R fill:#1565C0,stroke:#0D47A1,color:#FFFFFF
style S fill:#0277BD,stroke:#01579B,color:#FFFFFF
style I fill:#E65100,stroke:#BF360C,color:#FFFFFF
style P fill:#2E7D32,stroke:#1B5E20,color:#FFFFFF
style V fill:#558B2F,stroke:#33691E,color:#FFFFFF
style E fill:#C62828,stroke:#B71C1C,color:#FFFFFF
style U fill:#00695C,stroke:#004D40,color:#FFFFFF
In interactive mode, each phase waits for your "go" before it moves on — you stay in the loop at every step. In autopilot or /goal mode, you give approval once up front, then the system drives itself all the way to done. It stops only for three specific hard stops listed below. VALIDATE and the post-EXECUTE re-test are not optional — they are hard gates that block bad work from shipping — and they run automatically in both modes.
Vibe coding changed who can build software. Plan-first development changes what they can ship.
| of vibe coding users are NOT developers | citizen developers worldwide (38% YoY growth) |
| vibe coding market growing 38% annually |
of YC W25 startups had 95%+ AI-generated codebases |
Most tools help you start a project. This kit helps you finish it — with plans your team can review, knowledge that never goes stale, and safety checks that catch mistakes before they ship.
|
CEO / Founder "Build me a SaaS with auth, billing, and a landing page" The agent researches your stack, writes an architecture plan you can review, implements with tests, and captures every decision for your technical co-founder to audit later. |
Product Manager "Create a dashboard showing MRR, churn, and growth metrics" It generates a PRD-style SPEC, gets your approval before writing code, implements to spec, and archives the plan as searchable project history. |
|
Designer "Match this Figma screenshot pixel-perfect" The design-aware agent analyzes your mockup, implements component-by-component with your design tokens, and spawns visual comparison checks. |
Engineer "Refactor the auth module to support RBAC with zero downtime" It researches your current auth code and how other codebases solved RBAC, writes a migration plan that maps which files could be affected, then builds it safely with rollback notes. |
| Feature | vibecode-pro-max-kit | Superpowers | GSD | gstack |
|---|---|---|---|---|
| Plan-first lifecycle | Full RIPER-5 (research → spec → innovate → plan → validate → execute → update) | Mandatory workflows | Context-rot fix | Partial |
| Step-locked safety | Agent tools are restricted per phase (read-only research, no writing in innovate) | Skill-based constraints | Phase separation | None |
| Quality check loops | Two — PVL (check the plan) + EVL (independently re-run tests) | Per-skill | None automatic | None |
| Multi-tool support | 7 tools via AGENTS.md + SKILL.md open standards |
Claude Code plugin | 14 runtimes | 1 tool |
| Auto-improving knowledge | Topic-grouped knowledge, updated after every feature | Plugin memory | Disk-persisted state | Manual |
| Team collaboration | Shared plans, specs, and review files | Solo | Solo | Solo |
| Skills system | 33 auto-discovered, keyword-matched at every prompt | 86 composable skills | Meta-prompting | 23 role tools |
| Large multi-phase projects | Umbrella plans + per-phase inner loop with regression checks | Single task | Single task | Single task |
| Hands-free mode | Autopilot (3 lanes) + standing /goal consent |
Manual | Manual | Manual |
| Installation | 30s curl + auto-routed setup |
Plugin marketplace | npx one-liner | git clone |
On runtime breadth: GSD supports 14 runtimes. We support 7 deeply — with full agent harnesses, skill discovery, and lifecycle hooks on every platform. Breadth vs. depth: your choice.
|
Step-Locked Tool Restrictions Your agent literally cannot write code during research. RESEARCH is read-only, INNOVATE has no Write, PLAN/VALIDATE write only to process/. Real capability limits, not just suggestions.
|
The Lead Agent Never Touches Code The coordinator routes, monitors, and drives loops — it never edits source files or runs tests itself. Every edit and every test run happens inside a dedicated sub-agent. No hidden work. |
|
Automatic Skill Discovery Before handling any request, it scans 33 skills and matches keywords. Say "add webhook support" and vc-security + vc-scenario are pulled in automatically.
|
Survives Session Resets Plans, reports, knowledge docs, and learnings all live on disk. The startup hook restores approval gates after a session reset. Nothing is lost. |
|
Self-Policing Step Guard When the agent is about to skip a required step, it stops itself: "PHASE JUMPING PREVENTED." A built-in guard against shortcuts. |
Works Across 7 AI Coding Tools Two open standards — AGENTS.md and SKILL.md — mean zero adapters, zero plugins. Start in Claude Code, switch to Cursor, continue in Codex.
|
Your main session is a coordinator (called the orchestrator), not a worker. It does four things and nothing else:
Your request
→ Step 0: Skill Discovery (scan 33 skills, match keywords, attach candidates)
→ Detect intent (feature / bug / question / refactor / UI) + score ambiguity
→ Route to the right agent in a fresh context window
→ Monitor: step compliance, status codes, loop driving
|
It delegates, never implements Research → vc-research-agent. Plan → vc-plan-agent. Code → vc-execute-agent. The coordinator hands off the right context and waits — it never does the actual work itself.
|
No hidden execution — ever The moment a plan with an agreed checklist exists, "ENTER EXECUTE MODE" always launches vc-execute-agent. Even a one-line fix goes through it. Tests run only inside a dedicated vc-tester. This holds regardless of change size.
|
|
Clear status codes, not vague signals Every sub-agent ends with one of: DONE · DONE_WITH_CONCERNS · BLOCKED · NEEDS_CONTEXT. The coordinator never ignores a blocker and never retries the same blocked approach three times.
|
It drives the fix loops Sub-agents run once, report a result, and stop. Only the coordinator re-launches them. It drives both the PVL (plan-check-fix) and EVL (test-check-fix) loops and owns all tracking. |
%%{init: {'theme': 'base', 'themeVariables': {'fontSize': '14px', 'lineColor': '#8888AA'}} }%%
flowchart TD
REQ["User request"]
SD["Step 0: Skill Discovery\nscan 33 skills, match keywords"]
INT{"Detect intent\n+ score ambiguity"}
RES["vc-research-agent\n(feature / question)"]
PLN["vc-plan-agent\n(after INNOVATE)"]
VAL["vc-validate-agent\n(PVL loop)"]
EXE["vc-execute-agent\n(EXECUTE)"]
TST["vc-tester\n(EVL loop)"]
UPD["vc-update-process-agent\n(closeout)"]
MON["Monitor\nstatus codes · loop driving\nno inline execution ever"]
REQ --> SD --> INT
INT -->|"feature / research"| RES
INT -->|"plan phase"| PLN
INT -->|"validate phase"| VAL
INT -->|"execute phase"| EXE
EXE -.->|"EVL"| TST
INT -->|"update phase"| UPD
RES --> MON
PLN --> MON
VAL --> MON
TST --> MON
UPD --> MON
style REQ fill:#1565C0,stroke:#0D47A1,color:#FFFFFF
style SD fill:#0277BD,stroke:#01579B,color:#FFFFFF
style INT fill:#E65100,stroke:#BF360C,color:#FFFFFF
style RES fill:#2E7D32,stroke:#1B5E20,color:#FFFFFF
style PLN fill:#558B2F,stroke:#33691E,color:#FFFFFF
style VAL fill:#558B2F,stroke:#33691E,color:#FFFFFF
style EXE fill:#C62828,stroke:#B71C1C,color:#FFFFFF
style TST fill:#6A1B9A,stroke:#4A148C,color:#FFFFFF
style UPD fill:#00695C,stroke:#004D40,color:#FFFFFF
style MON fill:#37474F,stroke:#263238,color:#FFFFFF
Why this matters: an agent that can both decide and secretly edit will find ways to skip the plan. By separating the coordinator from the workers (sub-agents), the process becomes structurally honest — the only way to write code is to go through the required steps.
| Phase | What happens | Agent | You say |
|---|---|---|---|
| 🔍 RESEARCH | Read-only fact gathering — codebase + web. Never modifies files. | vc-research-agent |
(auto on feature requests) |
| 📝 SPEC | Product-discovery requirements doc — user stories, acceptance criteria, out-of-scope — for your review before any design. | vc-spec-agent |
go / ENTER SPEC MODE |
| 💡 INNOVATE | Explore 2-3 approaches with trade-offs. Decision summary (chosen + rejected + why). | vc-innovate-agent |
go |
| 📋 PLAN | Write the detailed spec: touchpoints, public contracts, which files it can touch, verification evidence, resume handoff. | vc-plan-agent |
go |
| ✅ VALIDATE | Turn the plan into an agreed checklist (V1–V7 checkpoints). Verdict: PASS / CONDITIONAL / BLOCKED. Runs the PVL loop. | vc-validate-agent |
ENTER VALIDATE MODE |
| ⚡ EXECUTE | Implement exactly the plan. Progress notes to the phase report, deviation protocol, self-review. Then the EVL loop re-runs the checkpoints. | vc-execute-agent |
ENTER EXECUTE MODE |
| 🧠 UPDATE PROCESS | Capture learnings, update context, archive plan, write closeout packet. | vc-update-process-agent |
(recommended after non-trivial work) |
📝 Why SPEC is its own phase: most harnesses jump from "understand" to "design." Inserting a product-discovery SPEC step means you (or your PM) sign off on what is being built — in plain user stories and acceptance criteria — before the agent debates how. It is the cheapest possible place to catch a misunderstanding. (Inside a phase program's inner loop, SPEC is skipped — the umbrella SPEC governs all phases.)
The SPEC is the measuring stick. It states the expected behavior in simple terms you can scan in a minute. Every phase after it — Innovate, Plan, Validate, Execute — checks back against it and asks the same question: is what we are building actually what you asked for? When the work starts to drift, the SPEC is what catches it.
flowchart TD
U["You: what I really want<br>(plain words)"] --> S["📝 SPEC<br>expected behavior + acceptance<br>criteria — you approve"]
S --> I["💡 INNOVATE"]
S --> P["📋 PLAN"]
S --> V["✅ VALIDATE"]
S --> E["⚡ EXECUTE"]
I -.->|"check back"| Q{"Is this what<br>you asked for?"}
P -.->|"check back"| Q
V -.->|"check back"| Q
E -.->|"check back"| Q
Q -->|yes| GO["keep building"]
Q -->|no| S
# 🆕 Feature request
You: "add webhook support to the API"
→ Skill discovery surfaces: vc-scenario, vc-security
→ research-agent gathers context (read-only, can't touch code)
→ "go" → spec-agent writes requirements doc → you approve
→ "go" → innovate-agent compares approaches → decision summary
→ "go" → plan-agent writes the plan, listing which files it will touch
→ "ENTER VALIDATE MODE" → validate-agent gates the plan (PVL loop) → Gate: PASS
→ "ENTER EXECUTE MODE" → execute-agent implements → tester re-runs gates (EVL) → reviewer → git-manager
→ Closeout packet: what changed, what's verified, recommended next step
# 🐛 Bug fix
You: "login redirect is broken"
→ Routes to vc-debugger → gathers evidence FIRST → 2-3 competing hypotheses
→ Systematically eliminates each → root cause with proof chain
→ execute-agent implements the fix → EVL re-test → quality pipeline
# ⏩ Fast mode
You: "ENTER FAST MODE - add rate limiting middleware"
→ Compressed RESEARCH + SPEC + INNOVATE + PLAN + VALIDATE in one pass
→ Mandatory safety pause after VALIDATE → you review → "ENTER EXECUTE MODE"
# 🤖 Autopilot (hands-free)
You: "autopilot full: build a notifications system"
→ ONE consolidated clarification round → provisional /goal block (standing consent)
→ Drives the full RIPER-5 sequence autonomously, pausing only on hard stops
# 🏗️ Large program
You: "build a full testing platform"
→ Umbrella plan + phase plans in a feature folder
→ Each phase inner loop: research → innovate → plan → PVL → execute → EVL → update
→ Progress survives context compaction — durable reports on disk
Before routing, the lead agent scores your request's ambiguity on 4 binary signals (0–4) and picks a tier. It asks questions only when they would actually change what it does.
| Tier | When | Behavior |
|---|---|---|
| Tier 0 — silent auto-route | Score 0–1, or you said "go" / "just do it", or resuming a plan | Routes immediately, no questions |
| Tier 1 — inline summary | Score 2 | States its understanding + chosen route in one line, then proceeds |
| Tier 2 — questions | Score 3+ | Asks focused multiple-choice questions before routing |
🧠 Two rounds max. If still unclear after Tier 2, it asks one final plain question, then defaults to research with the narrowest reasonable scope. It never loops clarification forever. After RESEARCH, it re-checks intent — if research shows the request was different from what was assumed, it re-presents; if confirmed, it proceeds without re-asking.
%%{init: {'theme': 'base', 'themeVariables': {'fontSize': '14px', 'lineColor': '#8888AA'}} }%%
flowchart TD
REQ["User request"]
SCORE{"ambiguity score\n0–4 binary signals"}
AUTO["Auto-skip conditions\n(go / continue / mid-phase\n/ trivial / explicit mode\n/ resuming plan / pure info)"]
T0["Tier 0\nsilent auto-route\n(score 0–1 OR auto-skip)"]
T1["Tier 1\ninline summary\n(score 2)"]
T2["Tier 2\nask focused questions\n(score 3+)"]
ROUTE["Route to matching agent\n(research / plan / execute / …)"]
STILL{"still unclear\nafter Tier 2?"}
FINAL["One final plain question\n(max 2 clarification rounds)"]
NARROW["Default: vc-research-agent\nnarrowest reasonable scope"]
REQ --> AUTO
AUTO -->|"auto-skip matches"| T0
AUTO -->|"no auto-skip"| SCORE
SCORE -->|"0–1"| T0
SCORE -->|"2"| T1
SCORE -->|"3+"| T2
T0 --> ROUTE
T1 --> ROUTE
T2 --> STILL
STILL -->|"resolved"| ROUTE
STILL -->|"still unclear"| FINAL --> NARROW
style REQ fill:#1565C0,stroke:#0D47A1,color:#FFFFFF
style AUTO fill:#0277BD,stroke:#01579B,color:#FFFFFF
style SCORE fill:#E65100,stroke:#BF360C,color:#FFFFFF
style T0 fill:#2E7D32,stroke:#1B5E20,color:#FFFFFF
style T1 fill:#558B2F,stroke:#33691E,color:#FFFFFF
style T2 fill:#6A1B9A,stroke:#4A148C,color:#FFFFFF
style ROUTE fill:#C62828,stroke:#B71C1C,color:#FFFFFF
style STILL fill:#F57F17,stroke:#F9A825,color:#000000
style FINAL fill:#AD1457,stroke:#880E4F,color:#FFFFFF
style NARROW fill:#00695C,stroke:#004D40,color:#FFFFFF
Most harnesses check once, if at all. This one wraps EXECUTE in two independent loops — one before code is written, one after.
%%{init: {'theme': 'base', 'themeVariables': {'fontSize': '15px', 'lineColor': '#8888AA'}} }%%
flowchart TD
PLAN["📋 PLAN written"]
PVL{"✅ PVL\nValidate the plan\nV1–V7 gates"}
PASS["Gate: PASS"]
SUPP["📝 plan-agent supplements\n(addresses gaps)"]
EXEC["⚡ EXECUTE\nimplement the plan"]
EVL{"🧪 EVL\ntester re-runs the\nvalidate-contract gates"}
FIX["⚡ execute-agent\nsupplement (fix gate)"]
DONE["🧠 UPDATE PROCESS"]
PLAN --> PVL
PVL -->|"CONDITIONAL / BLOCKED"| SUPP
SUPP -->|"re-run from V1"| PVL
PVL -->|"PASS"| PASS
PASS -->|"ENTER EXECUTE"| EXEC
EXEC --> EVL
EVL -->|"gate fails"| FIX
FIX -->|"re-run"| EVL
EVL -->|"all gates green"| DONE
style PLAN fill:#2E7D32,stroke:#1B5E20,color:#FFFFFF
style PVL fill:#558B2F,stroke:#33691E,color:#FFFFFF
style PASS fill:#00695C,stroke:#004D40,color:#FFFFFF
style SUPP fill:#E65100,stroke:#BF360C,color:#FFFFFF
style EXEC fill:#C62828,stroke:#B71C1C,color:#FFFFFF
style EVL fill:#6A1B9A,stroke:#4A148C,color:#FFFFFF
style FIX fill:#AD1457,stroke:#880E4F,color:#FFFFFF
style DONE fill:#00695C,stroke:#004D40,color:#FFFFFF
Before EXECUTE, vc-validate-agent runs the plan through V1–V7 checkpoints — splitting the work across several agents to cover infra, test coverage, breaking changes, security, and per-section feasibility. A first-pass CONDITIONAL or BLOCKED is never the end — it routes back to vc-plan-agent to update the plan, then re-checks from V1.
Tracked by vc-autoresearch (domain: plan) — a find-gaps-and-fix loop. 10-cycle cap. Plateau detection. Only Gate: PASS (or a CONDITIONAL you explicitly accept) unlocks EXECUTE.
|
After EXECUTE reports done — even when it claims all checkpoints are green — the lead agent always spawns vc-tester to independently re-run the exact agreed-checklist test commands. A failing checkpoint routes to a scoped vc-execute-agent fix, then re-tests.
Tracked by vc-autoresearch (domain: tests). 10-cycle cap. The execute-agent's own internal "iterate until green" loop never substitutes for this independent confirmation.
|
💎 The verdict ladder: PASS → proceed · CONDITIONAL → fixable gaps; the loop fires (or you accept them on record) · BLOCKED → a deeper problem; returns to PLAN (under autopilot: the gap goes to a backlog and the run continues).
Both PVL and EVL use the same tracking layer: vc-autoresearch — a find-gaps → fix → repeat loop. The lead agent drives the loop — it owns the round counter, per-round reports, TSV log, and plateau/cap/regression checks. Worker agents are fire-and-forget: they return a result and stop. No agent re-spawns itself or spawns another phase agent.
The same engine can run on its own: "harden this spec", "fix all lint", "improve test coverage", "improve these docs" — any repeated find-gaps-and-fix task across 6 domains (spec · tests · ux · docs · plan · errors).
%%{init: {'theme': 'base', 'themeVariables': {'fontSize': '14px', 'lineColor': '#8888AA'}} }%%
flowchart TD
START["Start loop\n(orchestrator init results.tsv\nheader + baseline row)"]
FIND["Find gaps\n(validate-agent / tester\nreturns gap list)"]
RPT["Write iteration report\n{slug}-iteration-{NNN}_REPORT_{dd-mm-yy}.md"]
TSV["Append results.tsv row\n(cycle N, gap count)"]
FIX["Fix gaps\n(plan-agent supplement\nOR execute-agent supplement)"]
CHK["Guard checks\nplateau? cap hit? regression?"]
RECHECK["Re-check\n(re-run validate / tester)"]
SUCC["SUCCESS\nall-clear 2 consecutive rounds"]
HALT["HALT\nplateau / 10-cycle cap / regression"]
START --> FIND --> RPT --> TSV --> FIX --> CHK
CHK -->|"safe to continue"| RECHECK
RECHECK --> FIND
CHK -->|"plateau / cap / regression"| HALT
FIND -->|"no gaps found (×2)"| SUCC
style START fill:#1565C0,stroke:#0D47A1,color:#FFFFFF
style FIND fill:#E65100,stroke:#BF360C,color:#FFFFFF
style RPT fill:#558B2F,stroke:#33691E,color:#FFFFFF
style TSV fill:#2E7D32,stroke:#1B5E20,color:#FFFFFF
style FIX fill:#C62828,stroke:#B71C1C,color:#FFFFFF
style CHK fill:#6A1B9A,stroke:#4A148C,color:#FFFFFF
style RECHECK fill:#0277BD,stroke:#01579B,color:#FFFFFF
style SUCC fill:#00695C,stroke:#004D40,color:#FFFFFF
style HALT fill:#B71C1C,stroke:#7F0000,color:#FFFFFF
| Mode | Does | Stops when |
|---|---|---|
vc-autoresearch (core) |
find gaps → fix → repeat | no gaps found OR metric goal hit |
vc-autoresearch:probe |
8 personas interrogate the corpus until saturation | no new constraints for 3 rounds |
vc-autoresearch:reason |
adversarial debate with blind judges | judges converge or iteration cap |
vc-autoresearch:evals |
analyze TSV results — trends, plateaus, recommendations | analysis only |
Stop conditions: SUCCESS (all-clear two rounds in a row) · HALT_PLATEAU (no progress for 3 rounds) · HALT_CAP (10-round hard limit) · HALT_REGRESSION (a check that was passing now fails).
At every phase transition, the lead agent invokes vc-agent-strategy-compare to recommend how to run the next phase — with cost estimates.
| Strategy | When | Coordination |
|---|---|---|
| Sequential | Work depends on prior output | One agent at a time |
| Parallel subagents | Independent dimensions, fire-and-forget | None — lead agent collects + combines results |
| Workflow | Predictable splitting of work across a list | Scripted steps |
| Agent team | Agents must talk to each other mid-run (e.g. each touches separate files across 3+ phase plans) | TeamCreate + shared task list + SendMessage |
%%{init: {'theme': 'base', 'themeVariables': {'fontSize': '14px', 'lineColor': '#8888AA'}} }%%
flowchart TD
SS["vc-agent-strategy-compare\n(every phase transition)"]
SC{"signal score\n0–7"}
SEQ["Sequential\none agent at a time\n(output feeds next)"]
PAR["Parallel subagents\nfire-and-forget\n(independent dimensions)"]
WF["Workflow\nscripted steps\nacross a list"]
TEAM["Agent team\nTeamCreate + TaskCreate\n+ SendMessage\n(must coordinate mid-run)"]
MC{"which phase?"}
OPUS["🔴 opus\n(EXECUTE only)"]
SONNET["🔵 sonnet\n(every other phase)"]
SS --> SC
SC -->|"low / dependent"| SEQ
SC -->|"mid / independent"| PAR
SC -->|"predictable split"| WF
SC -->|"high / must coordinate\nor 3+ phase plans"| TEAM
SS --> MC
MC -->|"EXECUTE / fast-mode\nquick-fix (real code)"| OPUS
MC -->|"Research / Spec\nInnovate / Plan\nValidate / Update\nall reviewers"| SONNET
style SS fill:#1565C0,stroke:#0D47A1,color:#FFFFFF
style SC fill:#E65100,stroke:#BF360C,color:#FFFFFF
style SEQ fill:#2E7D32,stroke:#1B5E20,color:#FFFFFF
style PAR fill:#558B2F,stroke:#33691E,color:#FFFFFF
style WF fill:#00695C,stroke:#004D40,color:#FFFFFF
style TEAM fill:#6A1B9A,stroke:#4A148C,color:#FFFFFF
style MC fill:#0277BD,stroke:#01579B,color:#FFFFFF
style OPUS fill:#C62828,stroke:#B71C1C,color:#FFFFFF
style SONNET fill:#37474F,stroke:#263238,color:#FFFFFF
⚠️ "Agent team" means the real machinery — named teammates, a shared task list, and inter-agent messaging — not bare parallel agents called a "team." It is required (not optional) for creating 3+ phase plans and for multi-file edits where agents must each stay in their own files. Only a true team can communicate while running.
| Phase | Model | Why |
|---|---|---|
| EXECUTE (+ fast-mode, quick-fix doing real code) | 🔴 opus | Real source edits, builds, migrations |
| Research · Spec · Innovate · Plan · Validate · Update · all reviewers/researchers | 🔵 sonnet | Planning and analysis — cheaper, plenty capable |
When work is split across several agents, only the coding agent uses opus. Every reviewer, researcher, validator, and planner uses sonnet. The lead agent names the model each time it spawns a worker.
Say autopilot [task] (or run autopilot, autonomous mode, ENTER AUTOPILOT MODE) and the agent runs the entire remaining RIPER-5 sequence with one clarification round up front — then no more pauses until it is done.
Trigger anywhere: autopilot can start at the beginning of a session or at any point mid-session. On trigger, the lead agent reads the saved files on disk to figure out which RIPER-5 phase you are already in, then picks up from there and drives the rest on its own.
| On-disk state | Entry phase |
|---|---|
| No SPEC file | Start at RESEARCH |
| SPEC file present | Skip to post-SPEC (INNOVATE) |
| Plan file present | Skip to post-PLAN (VALIDATE) |
| Validate-contract with PASS/CONDITIONAL | Skip to EXECUTE |
%%{init: {'theme': 'base', 'themeVariables': {'fontSize': '14px', 'lineColor': '#8888AA'}} }%%
flowchart TD
TRG["autopilot trigger phrase\n(anywhere in session)"]
DISK["Read saved files on disk\ndetect current phase"]
CLR["ONE consolidated\nclarification round"]
GOAL["Emit provisional /goal block\n(≤4000 chars, copy-pasteable\nstanding EXECUTE consent)"]
ACT["AUTOPILOT_ACTIVATED\nautonomous run begins"]
PHASES["Drive remaining phases\nRESEARCH → … → UPDATE PROCESS\n(no user pauses)"]
HS1["🛑 Irreversible / outward action\nnot pre-approved"]
HS2["⛔ Cascade BLOCKED\n(several phases stuck)"]
HS3["💸 Live-provider billed probe\n(double opt-in required)"]
DONE["Run complete\nautopilot deactivates"]
TRG --> DISK --> CLR --> GOAL --> ACT --> PHASES
PHASES -->|"hard stop 1"| HS1
PHASES -->|"hard stop 2"| HS2
PHASES -->|"hard stop 3"| HS3
PHASES -->|"all phases done"| DONE
style TRG fill:#1565C0,stroke:#0D47A1,color:#FFFFFF
style DISK fill:#0277BD,stroke:#01579B,color:#FFFFFF
style CLR fill:#E65100,stroke:#BF360C,color:#FFFFFF
style GOAL fill:#2E7D32,stroke:#1B5E20,color:#FFFFFF
style ACT fill:#558B2F,stroke:#33691E,color:#FFFFFF
style PHASES fill:#C62828,stroke:#B71C1C,color:#FFFFFF
style HS1 fill:#B71C1C,stroke:#7F0000,color:#FFFFFF
style HS2 fill:#B71C1C,stroke:#7F0000,color:#FFFFFF
style HS3 fill:#B71C1C,stroke:#7F0000,color:#FFFFFF
style DONE fill:#00695C,stroke:#004D40,color:#FFFFFF
You: "autopilot full: add team invitations with email + role management"
→ Reads saved files → detects current phase → enters there
→ ONE consolidated clarification round (scope, hard stops, autonomy boundaries, first-phase strategy)
→ Provisional /goal block emitted (≤4000 chars, copy-pasteable, standing EXECUTE consent)
→ AUTOPILOT_ACTIVATED → drives remaining phases on its own
→ Stops ONLY for hard stops
| Lane | Trigger | Flow |
|---|---|---|
| 🟢 quick | autopilot quick: [task] |
Scout → edit → scoped check. No plan, no contract, no EVL. |
| 🟡 fast | autopilot fast: [task] |
Compressed R→S→I→P→V → EXECUTE + EVL. |
| 🔴 full | autopilot [task] / autopilot full: |
Complete RIPER-5 (default). |
Say autopilot full: [task] — or paste a /goal block — and the following all happen with zero human input:
- Plan-check-and-fix loop — finds gaps in the plan, fixes them, and re-checks. Up to 10 rounds on its own.
- Build-test-and-fix loop — writes code, runs tests, fixes failures, re-runs. Up to 10 rounds on its own. It never trusts its own "all green" — a separate checker (vc-tester) independently re-runs every test to confirm.
- Phase-to-phase advancement — moves from research to plan to code to done without waiting for you.
- Picks up after a memory reset — plans, reports, and progress all live as files on disk. After compaction (when the AI's short-term memory clears), the next session reads those files and continues exactly where it left off.
- Stuck feature? Set it aside, keep going — if one phase can't be resolved, the agent writes a backlog note and moves on to the next feature. You can run many features in parallel without one blocker stopping everything.
- Teams of agents for parallel features — multiple agents can build separate features at the same time, each locked to its own files so they never collide. A stuck feature is parked, not a blocker for the rest.
These are the only three times it stops and asks you:
- 🛑 Anything it cannot undo, or that reaches the outside world and was not pre-approved (going live, sending real messages, charging money)
- ⛔ Several phases in a row get stuck with no progress — a real dead-end worth your eyes
- 💸 A test that would spend real money on a paid outside service — it asks before running
Required, not decoration: after every VALIDATE phase completes, the lead agent must emit a copy-pasteable /goal block before EXECUTE starts. This is a required handoff file — not optional commentary.
Format constraints:
| Block type | Required fields | Hard limit |
|---|---|---|
| Post-VALIDATE block | SESSION GOAL · Charter+umbrella plan · Autonomy · Hard stop conditions · Next phase · Validate contract · Execute start | ≤ 4000 chars |
| Provisional (autopilot) block | SESSION GOAL · ENTRY PHASE · REMAINING PHASES · CLARIFICATIONS LOCKED · EXECUTE CONSENT · DECISION POLICY · HARD STOPS · TEST GATES · START (+ optional LANE) | ≤ 4000 chars |
The /goal command rejects blocks longer than 4000 characters. Keep it short — use the required fields as the structure, not a prose essay.
Standalone /goal mode: paste a /goal block into a new session and the run picks up from the phase named in START. Clarifications and decision rules are already set — no new clarification round. Under a standing /goal, the agent decides on its own at every reversible step, sends BLOCKED items to a backlog, and writes its own reports — but worker agent delegation stays mandatory. Autopilot removes approval pauses only, never the no-inline-execution rule.
Validated by validate-autopilot-goal-block.mjs.
When SPEC, INNOVATE, or VALIDATE hits a key assumption it cannot confirm by reading alone, it emits VC-FEASIBILITY-PROBE-NEEDED and stops. The lead agent spawns vc-debugger to run a real test and write a VERDICT:
| Verdict | Meaning |
|---|---|
| ✅ VIABLE | Assumption holds — design may rely on it |
| ❌ NOT-VIABLE | Assumption is false — that approach is forbidden |
| ❓ INCONCLUSIVE | Couldn't prove it — carried forward as a known-gap |
Each verdict comes with a 3-part design note: what the result allows · what it rules out · what is still uncertain — fed word-for-word back into the paused phase. Probes are cost-classed (cheap-local / needs-container / needs-live-provider → double opt-in / needs-browser / needs-cf) so a billed or shared-resource probe never runs silently.
The kit ships 36 validator scripts that turn "did the agent follow the rules?" into a clear pass/fail result. They run after any phase that touches harness files, and as required checkpoints in UPDATE PROCESS:
| Validator family | Checks |
|---|---|
vc-audit-vc |
Agent parity (Claude/Codex), skill registry, kit portability, agent frontmatter |
vc-audit-context |
Context routing, discovery frontmatter, skill keywords |
vc-audit-plans |
Plan inventory, umbrella state, phase completeness, phase reports, backlog notes |
| 14 VC-system behavior validators | Each owns a pass/fail fixture pair — strategy-compare output, closeout, intent-clarify, feasibility verdict, autoresearch log, and more |
These are not guidelines — they are hard rules built into every agent.
|
Progress Notes, Not Mid-Run Pauses During coding the agent writes progress notes to the phase report file as it works. No mid-run pause, no "continue or return?" prompt. If it hits a problem that needs a plan change, it stops and returns to PLAN. Otherwise it keeps going. |
Never Quietly Deviate If coding hits a problem that needs a plan change, the agent stops immediately, explains, and returns to PLAN. No silent improvising. |
|
Privacy Guardrails Hook The agent is blocked from reading .env, credentials, SSH keys, and .pem files without explicit approval.
|
High-Risk Evidence Packs For auth, billing, schema migrations, or public-API changes, the system requires a formal 5-file evidence pack before calling work "done" — always manual, never auto-bypassed. |
|
Status-Code Discipline Worker agents must close with DONE / DONE_WITH_CONCERNS / BLOCKED / NEEDS_CONTEXT. Blockers are never ignored; correctness concerns become action items.
|
Closeout + Drift Scoring After coding, a closeout packet scores urgency: LOW (light touch) → MEDIUM (significant) → HIGH (harness/protocol files touched), and recommends the next safe step. |
Before a single line of code is written, three specialist skills can catch issues:
5-Persona Debate — vc-predictArchitect, Security, Performance, UX, and Devil's Advocate debate your plan. Produces a GO / CAUTION / STOP verdict before you write a line. |
12-Dimension Edge Cases — vc-scenarioDecomposes a feature across 12 dimensions (user types, input extremes, timing, scale, state, env, errors, auth, data, integrations, compliance, business logic). Output doubles as test specs. |
STRIDE + OWASP Audit — vc-securityDual-methodology security audit with dependency auditing, secret detection, and an auto-fix mode that sorts by severity and fixes Critical first with regression guards. |
Evidence-First Debugging — vc-debuggerGathers evidence → forms 2-3 competing hypotheses → tests each → documents the elimination path. Never guesses — proves. |
Tests first, then code. The agreed checklist (written before any code is touched) defines the exact tests that must pass. The execute-agent writes code until those tests go green. Then a separate checker — vc-tester — re-runs every test on its own to confirm. The execute-agent's own "all green" is never taken at face value. At the very end, the reviewer checks that the whole project still works together, not just the new piece.
The execute-agent does not just write code and call it done. It moves through a quality pipeline automatically:
%%{init: {'theme': 'base', 'themeVariables': {'fontSize': '16px', 'lineColor': '#8888AA'}} }%%
flowchart TD
E["⚡ Execute-Agent\nImplements the plan"]
SR["🔎 Self-Review\nLine-by-line check\nagainst plan"]
T["🧪 Tester (EVL)\nDiff-aware — re-runs\nthe contract gates"]
CR["🔍 Code Reviewer\nEdge case scout\n+ adversarial review"]
CS["✨ Code Simplifier\nClarity refactoring"]
GM["📦 Git Manager\nLogical commit splitting\nfrom touched_files"]
E --> SR --> T --> CR --> CS --> GM
style E fill:#C62828,stroke:#B71C1C,color:#FFFFFF
style SR fill:#AD1457,stroke:#880E4F,color:#FFFFFF
style T fill:#6A1B9A,stroke:#4A148C,color:#FFFFFF
style CR fill:#283593,stroke:#1A237E,color:#FFFFFF
style CS fill:#00695C,stroke:#004D40,color:#FFFFFF
style GM fill:#37474F,stroke:#263238,color:#FFFFFF
| Step | What it does |
|---|---|
| 🔎 Self-review | Checks every checklist item against the plan, records any deviation |
| 🧪 Tester (EVL) | Re-runs the agreed-checklist tests independently; maps changed files → test files, escalates to the full suite when >70% mapped |
| 🔍 Code reviewer | Sends an edge-case scout before review; checks N+1 queries, auth paths, data leaks |
| ✨ Simplifier | Tidies the code for clarity after review — no behavior changes |
| 📦 Git manager | Receives touched_files, splits into logical conventional commits, refuses unknown files |
Every non-trivial feature follows a plan lifecycle — a written spec that is created, reviewed, built against, and then archived as permanent project history.
%%{init: {'theme': 'base', 'themeVariables': {'fontSize': '16px', 'lineColor': '#8888AA'}} }%%
flowchart TD
A["🆕 Feature Request"]
B["📝 Plan Created\nin active/{slug}_{date}/"]
C{"👀 User Reviews\nthe Plan"}
D["✅ VALIDATE\n(PVL gates)"]
E["⚡ Execute + EVL"]
F["📦 Plan Archived\nto completed/"]
G["🧠 Learnings →\nall-context.md"]
H["🔄 Next Feature\nStarts Smarter"]
A --> B --> C
C -->|"✏️ Needs Changes"| B
C -->|"✅ Approved"| D --> E --> F --> G --> H
H -.->|"context compounds"| A
style A fill:#1565C0,stroke:#0D47A1,color:#FFFFFF
style B fill:#E65100,stroke:#BF360C,color:#FFFFFF
style C fill:#F57F17,stroke:#F9A825,color:#000000
style D fill:#558B2F,stroke:#33691E,color:#FFFFFF
style E fill:#C62828,stroke:#B71C1C,color:#FFFFFF
style F fill:#6A1B9A,stroke:#4A148C,color:#FFFFFF
style G fill:#00695C,stroke:#004D40,color:#FFFFFF
style H fill:#1565C0,stroke:#0D47A1,color:#FFFFFF
💡 Six months from now, when someone asks "why did we build auth this way?", the answer is in
completed/. Not lost in a Slack thread.
Where plans live — task-folder convention:
process/
├── general-plans/
│ ├── active/
│ │ └── webhooks_28-05-26/ # 📋 Task folder: plan + colocated reports/refs
│ │ └── webhooks_PLAN_28-05-26.md
│ ├── completed/ # ✅ Archived (searchable history)
│ └── backlog/ # 📌 Deferred work
└── features/
└── billing/ # 🏷️ Feature-scoped (5+ artifacts)
├── active/{slug}_{date}/
├── completed/
└── backlog/
%%{init: {'theme': 'base', 'themeVariables': {'fontSize': '14px', 'lineColor': '#8888AA'}} }%%
flowchart TD
PC["vc-plan-agent creates task folder\nprocess/general-plans/active/{slug}_{date}/\nOR features/{feature}/active/{slug}_{date}/"]
PLAN["{slug}_PLAN_{dd-mm-yy}.md\n— plan file\n— Validate Contract appended here"]
REF["{slug}_REF_{dd-mm-yy}.md\n— optional references"]
RPT["{slug}-iteration-{NNN}_REPORT_{dd-mm-yy}.md\n— per PVL/EVL cycle report"]
TSV["results.tsv\n— rolling loop log\n(header + baseline + cycle rows)"]
PP["PHASE PROGRAM extras"]
UMB["umbrella_PLAN_{dd-mm-yy}.md\n— Program Goal Charter\n— /goal block"]
PHN["phase-N_PLAN_{dd-mm-yy}.md\n— one file per phase"]
REG["phase-blast-radius-registry.md\n— per-phase file ownership"]
PC --> PLAN
PC --> REF
PC --> RPT
PC --> TSV
PC --> PP
PP --> UMB
PP --> PHN
PP --> REG
style PC fill:#1565C0,stroke:#0D47A1,color:#FFFFFF
style PLAN fill:#2E7D32,stroke:#1B5E20,color:#FFFFFF
style REF fill:#558B2F,stroke:#33691E,color:#FFFFFF
style RPT fill:#E65100,stroke:#BF360C,color:#FFFFFF
style TSV fill:#C62828,stroke:#B71C1C,color:#FFFFFF
style PP fill:#6A1B9A,stroke:#4A148C,color:#FFFFFF
style UMB fill:#00695C,stroke:#004D40,color:#FFFFFF
style PHN fill:#00695C,stroke:#004D40,color:#FFFFFF
style REG fill:#37474F,stroke:#263238,color:#FFFFFF
Every plan carries: 📍 touchpoints (files created/modified) · 📜 public contracts · 💥 which files it can touch (what could break, what to test) · ✅ verification evidence · 🔄 resume handoff.
vc-plan-discoveryfinds the right plan to resume; thepost-write-plan-checkhook checks plan structure on every plan write.
Normal features use one plan. Large multi-phase projects use a phase program — an umbrella plan plus per-phase plans, each running a full 7-step inner loop with its own checkpoints and a saved report.
%%{init: {'theme': 'base', 'themeVariables': {'fontSize': '14px', 'lineColor': '#8888AA'}} }%%
flowchart TD
UP["🎯 Umbrella Plan\nProgram Goal Charter\n(north star · done · scope tiers)"]
P1["📋 Phase 1"]
P2["📋 Phase 2 ..."]
subgraph LOOP["🔁 Per-phase inner loop (skips SPEC)"]
direction TB
R["🔍 Research"]
I["💡 Innovate"]
PL["📋 Plan-supplement"]
PVL["✅ PVL"]
EX["⚡ Execute"]
EVL["🧪 EVL"]
UPD["🧠 Update"]
R --> I --> PL --> PVL --> EX --> EVL --> UPD
end
UP --> P1 --> LOOP
LOOP -.->|"learnings feed\nnext phase"| P2
style UP fill:#1565C0,stroke:#0D47A1,color:#FFFFFF
style P1 fill:#E65100,stroke:#BF360C,color:#FFFFFF
style P2 fill:#E65100,stroke:#BF360C,color:#FFFFFF
style R fill:#1565C0,stroke:#0D47A1,color:#FFFFFF
style I fill:#E65100,stroke:#BF360C,color:#FFFFFF
style PL fill:#2E7D32,stroke:#1B5E20,color:#FFFFFF
style PVL fill:#558B2F,stroke:#33691E,color:#FFFFFF
style EX fill:#C62828,stroke:#B71C1C,color:#FFFFFF
style EVL fill:#6A1B9A,stroke:#4A148C,color:#FFFFFF
style UPD fill:#00695C,stroke:#004D40,color:#FFFFFF
| Feature | Why it matters | |
|---|---|---|
| 🔄 | Re-research at every phase | Checks for code drift, reads latest reports, refreshes assumptions |
| ✅ | Checkpoints per phase | A phase is not done until evidence proves it. Honest status: PLANNED → CODE DONE → TESTING → VERIFIED or BLOCKED |
| 📄 | Saved reports | Every phase writes results to disk — progress survives a memory reset |
| 🧠 | Learnings feed forward | Phase 1 discoveries update Phase 2's plan before coding starts |
| 🏗️ | Foundation vs expansion | Explicitly separates "prove the architecture" from "implement everything" |
| 🚧 | Honest blocker handling | Stuck phases stay BLOCKED with evidence. No faking a green status |
The plan you write at the start is a rough map, not a fixed contract. As the program runs, it adjusts — so you do not have to predict every step in advance.
It can add a new phase in the middle of a run.
While working, the agent may discover a missing step — something that must happen before the next phase can proceed. When that happens, it inserts a new phase right there, renumbers the rest, and carries on. No human needed. (Internal signal: MID_PROGRAM_PLAN_CREATED — the new plan is written to disk and added to the registry automatically.)
It can reorder phases.
Research sometimes shows the planned order is wrong — for example, Phase 3 depends on something only Phase 4 produces. The agent rearranges the remaining phases and records why. (Internal signal: PHASE_RESTRUCTURE_NOTICE — saved in the phase report as an audit trail, not a blocker.)
It updates each phase's own plan right before coding it. Before any phase starts coding, a quick research pass reviews what the program has learned so far. It then updates that phase's checklist with new findings. This is called a plan-supplement step. Plans are never frozen — they absorb fresh facts from earlier phases.
It skips work that cannot start yet. If a phase depends on something not yet ready — a service not yet built, a decision not yet made — the agent marks that phase as dependency-blocked, sets it aside, and moves on to the next one. The whole program does not stall because one phase is waiting.
It knows when to stop and ask. A single stuck phase just gets parked in a backlog and the program continues. But if several phases in a row hit a wall with no progress, the agent treats that as a real dead-end — a cascade stop — and pauses to show you what happened. One stuck phase is normal. Several in a row signals something structural is wrong.
It keeps a live scoreboard. Every program has a one-page status section in the umbrella plan showing which phase is current, whether it is done, and where the report lives. Anyone — or the agent itself after a memory reset — can read it and know exactly where things stand. It also keeps a simple file registry so two phases working at the same time never edit the same files.
One big final check. At the end of the whole program, the agent runs an end-to-end test that the entire project still works together — not just each piece on its own. Individual phase checkpoints prove each part works; this final check proves the parts work as a whole.
Long jobs finish correctly — even when the AI's memory resets mid-way. The plan, the progress, and the proof all live in files on disk, not only in the agent's head.
AI agents have a limited working memory. On a long job that memory fills up and gets squeezed down — details can blur. When a new session starts (or memory is cleared), the agent does not guess where it left off. It reads the files.
Here is exactly how that works:
1. It writes a short report after every phase. When a phase finishes, a report file is written to disk. Progress lives in your project folder, not just in the agent's head. A memory squeeze cannot erase a file.
2. It keeps a checklist of which steps are done. Each phase plan holds a Phase Loop Progress list — tick-boxes for every step (research, plan-check, build, test, capture learnings). After a reset, the agent reads those boxes and knows the exact next step. No need to catch it up.
3. A brief "envelope" at the start of every phase. Every worker agent (a focused helper that does one phase of work) opens by emitting a Context Envelope — a 10-field note: which feature, which phase, which branch, which plan file, which tests to run. It takes seconds to read. The agent is ready before it does anything.
4. It trusts the files over its own memory. On resume, the agent checks what is actually in the code and git history versus what the plan says. The real state wins. A plan that went stale cannot mislead the agent into repeating work or skipping steps.
5. A running scoreboard and per-round reports.
Every fix loop (the plan-check loop and the build-test loop) keeps a results.tsv scoreboard file — one row per round, tracking how many issues remain. When a session ends mid-loop, the next session reads the count, picks up at the right round, and continues. No rounds are lost.
6. It re-injects a reminder on resume. When memory is squeezed, the system automatically reloads the latest status note into the new session. If any approval was pending — say, a checkpoint that needed a "yes" before moving on — the reminder flags it. Nothing is silently skipped.
💡 In short: you can start an autopilot run, close your laptop, and come back hours later. The agent will be exactly where it should be — or will pick up from the last saved checkpoint, with evidence on disk to prove it.
Project knowledge is organized into context groups — stable knowledge areas, each with an all-{group}.md router file that tells agents what to read and when. Agents follow the router, loading only what is relevant — not the whole knowledge base every time.
process/context/
├── all-context.md # 🧭 Root router — architecture, stack, patterns, conventions
├── tests/all-tests.md # 🧪 Test runners, commands, debugging procedures
├── container/all-container.md # 🐳 Docker, deployment, infra procedures
├── uxui/all-uxui.md # 🎨 Components, design tokens, patterns
├── infra/all-infra.md # 🖥️ Server infrastructure, deployment
└── {your-domain}/all-{domain}.md # 📚 Any domain with 3+ durable docs (auto-promoted)
| How it works | |
|---|---|
| 🧭 Router pattern | Agents read only what is relevant to their task |
| 📏 Auto-promotion | Topics with 3+ docs (or a single file that gets too large) get their own group |
| 🔄 Always current | Updated by vc-update-process-agent after every non-trivial feature |
| 🧪 Auditable | vc-audit-context checks routing, discovery frontmatter, and consistency |
| 📨 Context Envelope | Every inner-loop agent emits a 10-field note at start (feature → phase → session-goal → branch → worktree → context-group → blast-radius-packages → active-plan → test-runner → validate-contract) so a fresh worker agent knows exactly where it stands |
The kit ships only the protocol seed — your context groups are built for your project by
vc-setup, scanning your real code. They are a pattern, not a fixed list.
When a topic builds up 5 or more files, it gets its own feature folder — a complete lifecycle container.
process/features/{feature}/
├── active/{slug}_{date}/ # 📋 Plans being worked on (reports/refs colocated)
├── completed/ # ✅ Archived plans (searchable decision history)
└── backlog/ # 📌 Deferred work (agents check before duplicating)
| What happens | |
|---|---|
| 🆕 | New work starts in active/ → reports accumulate → plan archives to completed/ |
| 📌 | Deferred work goes to backlog/ — agents check it before creating duplicate plans |
| 📦 | Feature promotion happens automatically when general artifacts hit 5+ |
| 🔍 | Every feature has complete, self-contained history — plans, decisions, reports, research |
The 33 skills fall into three layers. Every SKILL.md declares its layer + trigger_keywords in frontmatter, and a generated catalog keeps discovery fast.
|
Actor agents Own a phase or role. Live in .claude/agents/ — these are the 15 agents, not skills.
|
Contract skills (20) Each one produces a specific file or agreed output — vc-generate-plan, vc-validate-findings, vc-autopilot, the audits. Results can be checked.
|
Helper skills (13) Improve how agents work, produce no file of their own — vc-scout, vc-sequential-thinking, vc-problem-solving, vc-docs-seeker.
|
Every completed feature feeds learnings back into the context system — the knowledge builds up, it does not reset.
Most AI-assisted codebases have the opposite property: every new session starts cold. The agent re-reads the same files, re-discovers the same patterns, and re-makes the same decisions — because the last session's insight lived only in a chat window. The kit's answer is not a prompt trick. It is a durable context-file system (process/context/) that every agent reads at session start, every validator protects, and every completed feature enriches.
Six months and many memory resets later, the agent still knows why your auth works the way it does — because that knowledge is on disk, routed, and auditable, not trapped in a dead session.
%%{init: {'theme': 'base', 'themeVariables': {'fontSize': '14px', 'lineColor': '#8888AA'}} }%%
flowchart TD
FEAT["Feature ships\n(EXECUTE + EVL complete)"]
UP["UPDATE PROCESS phase\nvc-update-process-agent"]
CTX["process/context/ updated\nsmallest relevant file\n+ all-context.md router"]
AGENT["Next agent spawned\nreads context router\n→ routes to correct group file"]
BETTER["Better next feature\nno re-discovery, no stale patterns"]
FEAT --> UP --> CTX --> AGENT --> BETTER
BETTER -.->|"compounds each feature"| FEAT
style FEAT fill:#C62828,stroke:#B71C1C,color:#FFFFFF
style UP fill:#00695C,stroke:#004D40,color:#FFFFFF
style CTX fill:#2E7D32,stroke:#1B5E20,color:#FFFFFF
style AGENT fill:#1565C0,stroke:#0D47A1,color:#FFFFFF
style BETTER fill:#E65100,stroke:#BF360C,color:#FFFFFF
process/context/ holds structured knowledge organized into topic groups — architecture decisions, coding conventions, deployment steps, test patterns, infrastructure facts. Unlike a chat history, this knowledge:
- travels into every worker agent —
vc-context-discoveryroutes each spawned agent to the rightall-{group}.mdrouter for its task, then to the smallest relevant deep file. A research agent, a plan agent, and a coding agent all start with the same shared understanding - survives a memory reset — it is on disk, not in a context window; a squeezed session loses none of it
- is readable by both Claude and Codex —
.agents/skillsis a shortcut link to.claude/skills/, so the same context system serves both agents without duplication
The root router (all-context.md) points to group routers (all-{group}.md), which route to the smallest relevant deep file. Agents follow the router — they never hard-code file paths. This means renames and group splits require only router edits, not a codebase-wide search.
process/context/
├── all-context.md ← root router (architecture, stack, patterns)
├── tests/all-tests.md ← test runners, debugging, commands
├── container/all-container.md ← Docker, deployment, infra procedures
├── uxui/all-uxui.md ← components, design tokens, visual conventions
└── {domain}/all-{domain}.md ← any domain with 3+ durable docs (auto-promoted)
The phrase "living docs" usually means "docs we intend to keep up to date but mostly forget." This system enforces the intention mechanically.
The UPDATE PROCESS phase requires a per-file context review before it can close. vc-update-process-agent cannot finish a phase until every potentially-affected context file has been reviewed with a concrete reason per file. "No updates needed" is allowed — but it must name each reviewed file and explain why. Vague reasons are rejected. The checkpoint is binary: record the review, or the phase does not close.
The full feedback loop per completed feature:
| Step | Owner | What happens |
|---|---|---|
| 1. Git diff analysis | vc-scout |
Maps changed files → affected context areas |
| 2. Per-file review | vc-update-process-agent |
Names each context file, states the update or an explicit "no change + reason" |
| 3. Updates applied | parallel worker agents | Each area's context file is updated with new patterns, decisions, learnings |
| 4. Routing verified | validate-context-discovery.mjs |
Confirms every doc is indexed and routers are consistent |
| 5. Discovery confirmed | validate-all-context.mjs |
Confirms all-context.md and group routers match the current files on disk |
Your 100th feature benefits from everything learned in the first 99 — not as an aspiration, but as a mechanical guarantee.
Every phase report carries a ## Forward Preview section written for the next phase's agent. It gives the exact commands to keep green, dependency changes, and file-scope changes found mid-phase. The agent picking up Phase 3 does not have to re-read Phase 2's output and guess what matters. It is handed a focused brief.
This is different from context docs: context docs carry lasting knowledge (decisions that stay true across features); Forward Preview carries temporary handoff state (what the next work session needs to know right now).
Lasting knowledge goes stale when nobody checks it. The kit ships validators that run as part of every phase closeout:
| Validator | What it catches |
|---|---|
validate-context-discovery.mjs |
Docs not indexed by any router; broken links; missing frontmatter |
validate-all-context.mjs |
all-context.md out of sync with actual files on disk |
validate-skill-keywords.mjs |
Skills missing trigger_keywords or layer fields (breaks routing Step 0) |
validate-protocol-discovery.mjs |
Protocol files in process/development-protocols/ missing discovery frontmatter |
These run like automated checks — a stale or orphaned doc fails. The system polices its own health.
Groups are created automatically when a topic reaches 3+ docs or a single file goes past ~800 lines. Agents follow routers and never hard-code paths — so adding a new group (e.g. process/context/billing/all-billing.md) requires only a router update, not changes to every agent that mentions billing. The router is the stable reference; the files behind it can reorganize freely.
The kit seeds context groups from your real codebase (via
vc-setup). The groups are not a fixed list — they are a pattern. Your auth area, your infra area, your payments area each become first-class routable knowledge as the project grows.
Click to expand the agent roster
Core workflow agents — one per RIPER-5 phase (R → SPEC → I → P → V → E → UP):
| Agent | Model | Role |
|---|---|---|
🔍 vc-research-agent |
sonnet | Codebase + web research, read-only. Contradiction tracking built in |
📝 vc-spec-agent |
sonnet | Product-discovery requirements doc before INNOVATE. Produces *_SPEC_*.md |
💡 vc-innovate-agent |
sonnet | Compare 2-3 approaches. Decision summary (chosen + rejected) before PLAN |
📋 vc-plan-agent |
sonnet | Write the plan with anti-shortcut guards. "I already know how" is not a plan |
✅ vc-validate-agent |
sonnet | Turn plan → agreed checklist (V1–V7). Checkpoint: PASS/CONDITIONAL/BLOCKED |
⚡ vc-execute-agent |
opus | Implement per plan. Progress notes to phase report, deviation protocol, self-review |
⏩ vc-fast-mode-agent |
opus | Compressed R→S→I→P→V with a required safety pause before EXECUTE |
🔧 vc-quick-fix-agent |
opus | QUICK FIX lane: one small low-risk edit + scoped check, no plan/validate |
🧠 vc-update-process-agent |
sonnet | 7-phase closeout: archive, update context, stale-artifact scan, learnings |
Specialist agents — called during EXECUTE or standalone:
| Agent | Role |
|---|---|
🐛 vc-debugger |
Gathers evidence before forming a hypothesis. Competing hypotheses, elimination chains, feasibility probes |
🧪 vc-tester |
Change-aware. Re-runs agreed-checklist tests (EVL). Auto-escalates on config changes |
🔎 vc-code-reviewer |
Sends an edge-case scout BEFORE review. N+1 detection, auth-path checking |
✨ vc-code-simplifier |
Tidies code for clarity without changing behavior |
🎨 vc-ui-ux-designer |
Design-aware frontend. Can spawn a research worker mid-build |
📦 vc-git-manager |
Splits into logical commits from touched_files. Refuses unknown files |
Click to expand the skill list (20 contract + 13 helper)
📜 Contract skills (20) — own an artifact: vc-generate-plan · vc-generate-context · vc-generate-spec · vc-generate-closeout · vc-generate-phase-program · vc-audit-context · vc-audit-plans · vc-audit-vc · vc-update · vc-publish · vc-feasibility-test · vc-risk-evidence-pack · vc-test-coverage-plan · vc-validate-findings · vc-autoresearch · vc-intent-clarify · vc-autopilot · vc-agent-strategy-compare · vc-plan-discovery · vc-context-discovery
🛠️ Helper skills (13) — improve how agents work: vc-review-situation · vc-sequential-thinking · vc-problem-solving · vc-scout · vc-debug · vc-docs-seeker · vc-frontend-design · vc-agent-browser · vc-web-testing · vc-setup · vc-predict · vc-scenario · vc-security
⚠️ Naming rule: Do NOT use thevc-prefix for your own skills or agents — that namespace is reserved for kit-shipped files, and the stale-removal guard treats anyvc-*path under.claude/skills/and.claude/agents/as kit-owned. Usemy-,team-, orproj-instead.
| Hook | What it does |
|---|---|
🔐 privacy-block.cjs |
Blocks reading .env, credentials, SSH keys. Requires explicit approval |
🚫 scout-block.cjs |
Prevents wandering into node_modules/, dist/. Gitignore-syntax .ckignore |
🧠 session-init.cjs |
Detects stack, injects env, recovers approval gates after compaction |
💉 subagent-init.cjs |
Injects a compact context block into every subagent |
✨ post-edit-simplify-reminder.cjs |
After 5+ edits, nudges to run the simplifier (non-blocking, throttled) |
📛 descriptive-name.cjs |
Language-aware file-naming conventions on every Write |
📊 session-state.cjs |
Session metrics + token awareness |
📋 post-write-plan-check.mjs |
Validates plan-artifact structure on every Write to a *_PLAN_*.md |
🧹 post-commit-lint.mjs |
Checks conventional-commits prefix on every git commit |
🔍 stop-validator-sweep.cjs |
Runs core harness validators when the session stops |
Where everything lives:
your-project/
├── .claude/{agents,skills,hooks}/ # 🤖 15 agents · ⚡ 33 skills · 🪝 10 hooks
├── .codex/agents/ # 🔄 Mirrored for Codex
├── .agents/skills -> .claude/skills # 🔗 Symlink for Codex discovery
├── CLAUDE.md · AGENTS.md # 📋 Orchestrator config + cross-tool registry
└── process/
├── context/ # 🧠 Auto-routed knowledge domains
├── general-plans/ # 📋 Cross-cutting plans + task folders
├── features/ # 🏷️ Feature-scoped lifecycle folders
└── development-protocols/ # 📜 22 shared workflow docs
Two lighter options for when the full RIPER-5 process is more than the job needs:
Quick Fix — "quick fix: …"Bigger than a trivial one-liner, smaller than "needs a plan." The lead agent scouts read-only → one-line confirm → spawns vc-quick-fix-agent for the edit + a scoped check on touched files only. No plan, no agreed checklist, no EVL.
Cancelled immediately if the change touches schema, auth, API, billing, or migration surfaces — then it routes to full RESEARCH. |
Fast Mode — "ENTER FAST MODE - …"Squeezes RESEARCH + SPEC + INNOVATE + PLAN + VALIDATE into one pass — but still writes a plan, writes an agreed checklist, and pauses before EXECUTE. In plain Fast Mode, there is a post-VALIDATE pause — you review, then say "ENTER EXECUTE MODE." Use autopilot fast: [task] to remove that pause and run all the way through without stopping.
|
| Command | What it does | When |
|---|---|---|
curl … install.sh | bash |
Syncs kit files without overwriting yours; auto-detects fresh vs upgrade and routes you | First install + every upgrade |
| Run vc-setup | Detects stack, scaffolds process/, deep-scans codebase, populates real context |
After a fresh install |
| Run vc-update | Computes a precise diff, shows what will change, waits for your OK; migrates old-format plans/folders with zero data loss | On every upgrade |
| Run vc-publish (maintainers) | Publishes harness changes back out to the kit repo | Contributing to the kit itself |
%%{init: {'theme': 'base', 'themeVariables': {'fontSize': '14px', 'lineColor': '#8888AA'}} }%%
flowchart TD
TRG["EXECUTE complete\nENTER UPDATE PROCESS MODE"]
AGT["spawn vc-update-process-agent"]
A["(a) Archive plan\nactive/ → completed/"]
B["(b) Update process/context/\nsmallest relevant file\n+ all-context.md router"]
C["(c) Tier-1 audits\n(change-type gated)"]
AVC["vc-audit-vc\nharness/agent edits"]
ACX["vc-audit-context\ncontext-doc edits"]
APL["vc-audit-plans\nplan/program edits"]
D["(d) Capture learnings\nto memory"]
E["(e) Write closeout packet\nvc-generate-closeout"]
F["(f) Conventional commit\nvc-git-manager"]
TRG --> AGT --> A --> B --> C
C --> AVC
C --> ACX
C --> APL
AVC --> D
ACX --> D
APL --> D
D --> E --> F
style TRG fill:#1565C0,stroke:#0D47A1,color:#FFFFFF
style AGT fill:#0277BD,stroke:#01579B,color:#FFFFFF
style A fill:#6A1B9A,stroke:#4A148C,color:#FFFFFF
style B fill:#00695C,stroke:#004D40,color:#FFFFFF
style C fill:#E65100,stroke:#BF360C,color:#FFFFFF
style AVC fill:#2E7D32,stroke:#1B5E20,color:#FFFFFF
style ACX fill:#2E7D32,stroke:#1B5E20,color:#FFFFFF
style APL fill:#2E7D32,stroke:#1B5E20,color:#FFFFFF
style D fill:#558B2F,stroke:#33691E,color:#FFFFFF
style E fill:#C62828,stroke:#B71C1C,color:#FFFFFF
style F fill:#37474F,stroke:#263238,color:#FFFFFF
💡
vc-updateshows a preview diff and waits for your OK. Yourprocess/directory and project-specific content are never silently changed. Re-running install is safe to run twice.
Many small, smart defaults add up to less babysitting and lower cost.
-
Each role only gets the tools it needs. During planning, the agent literally cannot edit code — those tools are turned off. This stops the agent from jumping ahead and changing things before the plan is approved. The system simply does not allow it.
-
It uses the premium AI model only where it matters. Code-writing uses the top model. Planning, research, review, and checking all use a cheaper, faster model. The result: roughly 60–70% lower cost compared to running the top model for everything — with no quality loss on the work that counts.
-
It tests risky guesses before building on them. When the agent is not sure something will work — a specific API behavior, a library feature, an infrastructure assumption — it runs a tiny real experiment first. The result is clear: works, does not work, or unclear. That verdict and a plain-English note get fed straight into the plan. The agent does not spend hours building on a wrong assumption.
-
Tidy, meaningful save points. Changes are committed in clean, logical chunks with clear messages — automatically. The history is easy to read and easy to undo one piece at a time.
-
Helpful automatic reminders. Small built-in helpers nudge for things like running the right checks on changed files, keeping code simple, and writing a proper commit message. Quality stays high without you having to police it.
-
You can run the self-improving loop on its own. The same "find problems, fix them, repeat" engine that drives plan-checking and test-fixing also works as a standalone tool on any messy area — a spec, the docs, the tests, an error list. You do not need a full feature build to use it.
-
Built-in proof the workflow rules actually work. The kit ships with its own test suite: a set of checks with known-good and known-bad examples that prove the workflow rules behave correctly. The system checks itself. You do not have to trust that the guardrails are on — you can run the checks and see.
We welcome contributions! See CONTRIBUTING.md for guidelines.
Quick links:
vibecode-pro-max-kit focuses on the spec-driven development framework and self-improving context organization, without bloating you with 80+ skills. Fewer tools, more structure.
MIT