Summary
The v2.6.1 audit (scripts/audit_skills.py) reveals 263 of 298 legacy SKILL.md files (88%) exceed Matt Pocock's 100-line ceiling — the dominant pattern violation across the repo.
Per v2.6.1's engineering/write-a-skill/skills/write-a-skill/references/quality_gates_for_skills.md, this rule is now formally advisory for legacy skills (binding only for new skills post-v2.6.0). But the long-term goal is to grow the PASS rate (currently 9/298 = 3%) by refactoring the highest-impact skills.
This issue tracks the planning + execution sweep for v2.7.
Background
| Audit metric |
v2.6.0 baseline |
v2.6.1 |
| PASS (6/6) |
4 (1%) |
9 (3%) |
| WARN (5/6) |
111 (37%) |
137 (46%) |
| FAIL (≤4/6) |
183 (61%) |
152 (51%) |
| "SKILL.md ≤ 100 lines" failures |
263 |
263 (unchanged) |
The other 5 of Matt's 6 rules have improved (description triggers, terminology, examples, time-sensitivity, references depth). Only the 100-line ceiling has resisted because it requires structural refactoring — splitting SKILL.md content into references/<topic>.md files per Matt's progressive disclosure pattern.
Scope (proposed)
Phase 1 — Prioritize (1 PR, low effort):
- Identify top-20 high-traffic skills (most-installed via marketplace metrics OR most-mentioned in domain CLAUDE.md files)
- Refactor only these 20 to ≤100 lines via
references/ splits
- Expected impact: PASS rate jumps from 9/298 (3%) to ~25-30/298 (8-10%)
Phase 2 — Domain sweep (3-4 PRs, medium effort):
- Address engineering-advanced tier first (40 skills, most cs-* commands depend on these)
- Then engineering-team tier (32 skills)
- Then C-level + product + marketing in parallel
- Expected impact: PASS rate reaches ~50%
Phase 3 — Long tail (ongoing):
- Remaining skills addressed when touched for other reasons (Boy Scout rule)
Per-skill refactoring pattern
For each over-100-line SKILL.md:
- Identify the largest section (often "Workflows" or "Use Cases")
- Move to
references/<topic>.md
- Replace inline content in SKILL.md with:
See [references/topic.md](references/topic.md) for X.
- Validate:
python3 engineering/write-a-skill/skills/write-a-skill/scripts/skill_structure_validator.py <skill-folder>
Acceptance criteria for v2.7 release
Open questions
- Should we batch all 20 in one PR or one-per-skill PRs?
- How do we determine "high-traffic"? Marketplace install metrics aren't exposed. Could use ClawHub downloads if available, or domain-CLAUDE-md prominence as a proxy.
- Does the 100-line ceiling apply to the whole SKILL.md including frontmatter, or just the body?
Surfaced by
v2.6.1 ecosystem verification — Agent 5 (regression + quality evaluator), SC9.
Related
🤖 Filed via Claude Code session
Summary
The v2.6.1 audit (
scripts/audit_skills.py) reveals 263 of 298 legacy SKILL.md files (88%) exceed Matt Pocock's 100-line ceiling — the dominant pattern violation across the repo.Per v2.6.1's
engineering/write-a-skill/skills/write-a-skill/references/quality_gates_for_skills.md, this rule is now formally advisory for legacy skills (binding only for new skills post-v2.6.0). But the long-term goal is to grow the PASS rate (currently 9/298 = 3%) by refactoring the highest-impact skills.This issue tracks the planning + execution sweep for v2.7.
Background
The other 5 of Matt's 6 rules have improved (description triggers, terminology, examples, time-sensitivity, references depth). Only the 100-line ceiling has resisted because it requires structural refactoring — splitting SKILL.md content into
references/<topic>.mdfiles per Matt's progressive disclosure pattern.Scope (proposed)
Phase 1 — Prioritize (1 PR, low effort):
references/splitsPhase 2 — Domain sweep (3-4 PRs, medium effort):
Phase 3 — Long tail (ongoing):
Per-skill refactoring pattern
For each over-100-line SKILL.md:
references/<topic>.mdSee [references/topic.md](references/topic.md) for X.python3 engineering/write-a-skill/skills/write-a-skill/scripts/skill_structure_validator.py <skill-folder>Acceptance criteria for v2.7 release
engineering/write-a-skill/Open questions
Surfaced by
v2.6.1 ecosystem verification — Agent 5 (regression + quality evaluator), SC9.
Related
🤖 Filed via Claude Code session