Add guarded-surface safety rules by eggrollofchaos · Pull Request #140 · DietrichGebert/ponytail

eggrollofchaos · 2026-06-17T21:35:08Z

Summary

Add guarded-surface rules for auth, persistence, money, privacy, concurrency, security, and public APIs.
Preserve critical controls when Ponytail minimizes code: authorization, validation, idempotence, migrations, auditability, rate limits, privacy boundaries, concurrency safety, tests, and public API compatibility.
Treat /ponytail-review as a complexity pass only, not approval to remove safety controls.
Strengthen copied-rule invariants so the skill, compact agent rules, and hook fallback cannot drift silently.
Make the correctness benchmark more portable by resolving Python/pandas through explicit interpreter and uv fallback paths instead of assuming base python3 has pandas.

Motivation

Ponytail already says "lazy, not negligent." Public feedback points to correctness-by-context: a shorter answer can be right for one prompt while quietly dropping validation, readability, maintainability, or future-change affordances.

GitHub issue #112 asks whether readability belongs on the never-simplify-away list, using the concrete failure mode of a dense one-liner that cuts lines but gets harder to change.
GitHub issue #100 raises the broader question of whether always-on minimality constraints trade off model performance or reasoning quality, and recommends stronger eval coverage for quality regressions alongside LOC.
The Reddit launch discussion includes exactly the validation critique this PR is meant to address: one line can be smaller without preserving the same validation level.
The Hacker News discussion makes the context point directly: a native control can be the correct lazy answer in one scenario and the wrong one in another.
Ponytail's own v4.4.0 release notes describe a real field test as a net win while also saying the laziness needed a tighter leash around failsafes, validation, and auth checks.

This PR makes that leash explicit for the surfaces where "less code" can otherwise become "less responsibility."

Verification

node scripts/check-rule-copies.js
node --test tests/*.test.js pi-extension/test/*.test.js

Latest local run: 12 rule invariants, 10 hook invariants, 47/47 tests passing.

Suggestion for default mode change

Consider making lite the default Ponytail mode. The tool makes potentially broad changes to everyday agent behavior; users will likely want to test it in lite first, watch it work on their own codebase, and then decide whether to run full broadly across their agents.

GitHub issue #100 explicitly mentions adoption hesitation around model-degradation worries from always-on constraints.
The Hacker News discussion shows the same adoption tension: strong interest in reducing over-engineering, paired with caution that senior judgment depends on context.

Teach Ponytail to keep a lite posture on guarded surfaces like auth, persistence, money, privacy, concurrency, security, and public APIs. Preserve authorization, validation, idempotence, migrations, auditability, rate limits, privacy boundaries, concurrency safety, tests, and public API compatibility when minimizing code. Strengthen copied-rule checks so the safety text does not drift. Also make the correctness benchmark runner portable by using explicit Python interpreter resolution and uv fallback for pandas. The default Ponytail mode remains full.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add guarded-surface safety rules#140

Add guarded-surface safety rules#140
eggrollofchaos wants to merge 1 commit into
DietrichGebert:mainfrom
eggrollofchaos:codex/guarded-surfaces

eggrollofchaos commented Jun 17, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Uh oh!

Conversation

eggrollofchaos commented Jun 17, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Motivation

Verification

Suggestion for default mode change

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

eggrollofchaos commented Jun 17, 2026 •

edited

Loading