Operating Modes
This playbook expands the compact mode routing in SKILL.md.
Scaffold
Use when:
- the skill is exploratory
- the workflow is personal or short-lived
- eval and packaging cost would exceed reuse value
Default deliverables:
- SKILL.md
- agents/interface.yaml
- references/ only when a small amount of deferred reading is clearly helpful
Avoid:
- automatic scripts/, evals/, or manifest.json
- packaging targets the user did not ask for
Production
Use when:
- the skill will be reused by a team
- routing mistakes would waste time
- a small amount of deterministic automation improves reliability
Default deliverables:
- lean SKILL.md
- agents/interface.yaml
- references/ for policies, checklists, or examples
- scripts/ only when deterministic logic is real
- evals/ when trigger or output quality should be checked
- manifest.json when lifecycle metadata matters
Minimum gates:
- resource_boundary_check.py
- validate_skill.py
- trigger_eval.py when route confusion is plausible
Library
Use when:
- the skill is organizationally important
- the package will be shared broadly
- maintenance and portability matter
- the skill itself shapes how other skills are created or governed
Default deliverables:
- trigger positives, negatives, and near neighbors
- packaging expectations
- maintenance metadata
- visible regression evidence
- governance review readiness
Minimum gates:
- resource_boundary_check.py
- governance_check.py
- trigger_eval.py
- cross_packager.py for requested targets
Governed
Use when:
- the skill affects incident, release, compliance, security, or organizational standards
- external distribution, public claims, or high-permission scripts require reviewable evidence
- wrong output or wrong activation can cause operational, legal, trust, or reputational harm
Default deliverables:
- everything required for Library
- explicit owner, lifecycle, review cadence, and expiry-aware approvals
- trust/security reports for scripts, dependencies, permissions, secrets, and package hash
- output eval evidence with blind review status and reviewer-visible boundaries
- world-class or public-claim evidence ledger when public readiness is claimed
Minimum gates:
- Library gates
- trust_check.py
- runtime permission probes for packaged adapters
- review waiver ledger for accepted warning-level risk
- Review Studio before release
- claim guard before public world-class language
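The minimum gates for each mode can be collected in one lookup table. The sketch below is an illustration only: the dictionary layout and the `gates_for` helper are hypothetical names, not part of the skill's tooling, and the empty Scaffold entry is an assumption based on this playbook listing no minimum gates for that mode. The gate names themselves are copied from the mode descriptions above.

```python
# Gate lists transcribed from the mode descriptions; the data layout is an assumption.
LIBRARY_GATES = [
    "resource_boundary_check.py",
    "governance_check.py",
    "trigger_eval.py",
    "cross_packager.py (for requested targets)",
]

MINIMUM_GATES = {
    # Assumption: Scaffold lists no minimum gates, so it is left empty here.
    "Scaffold": [],
    "Production": [
        "resource_boundary_check.py",
        "validate_skill.py",
        "trigger_eval.py (when route confusion is plausible)",
    ],
    "Library": LIBRARY_GATES,
    # Governed starts from the Library gates and adds its own.
    "Governed": LIBRARY_GATES + [
        "trust_check.py",
        "runtime permission probes for packaged adapters",
        "review waiver ledger for accepted warning-level risk",
        "Review Studio before release",
        "claim guard before public world-class language",
    ],
}


def gates_for(mode: str) -> list[str]:
    """Return a copy of the minimum gates for a mode (KeyError if the mode is unknown)."""
    return list(MINIMUM_GATES[mode])
```

Building the Governed list from `LIBRARY_GATES` keeps the "Library gates" inheritance explicit, so a change to the Library list carries over without a second edit.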
Escalation Rules
- stay in Scaffold unless reuse is clearly real
- move to Production when team reuse or route confusion matters
- move to Library when the skill becomes shared infrastructure
- move to Governed when the skill needs explicit risk ownership, high-permission review, or public-claim evidence
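The escalation rules read as a small decision procedure. The following is a minimal sketch under stated assumptions: `SkillProfile`, its field names, and `choose_mode` are hypothetical and only paraphrase the rules above. Checking the strictest mode first is also an assumption about how overlapping conditions should resolve.

```python
from dataclasses import dataclass


@dataclass
class SkillProfile:
    # Hypothetical flags; each paraphrases one condition from the escalation rules.
    team_reuse: bool = False
    route_confusion_matters: bool = False
    shared_infrastructure: bool = False
    needs_risk_ownership: bool = False
    high_permission_review: bool = False
    public_claim_evidence: bool = False


def choose_mode(profile: SkillProfile) -> str:
    """Pick an operating mode, testing the most demanding mode first."""
    if (
        profile.needs_risk_ownership
        or profile.high_permission_review
        or profile.public_claim_evidence
    ):
        return "Governed"
    if profile.shared_infrastructure:
        return "Library"
    if profile.team_reuse or profile.route_confusion_matters:
        return "Production"
    # Default: stay in Scaffold unless reuse is clearly real.
    return "Scaffold"
```

For example, `choose_mode(SkillProfile(team_reuse=True))` returns `"Production"`, while a profile with no flags set stays in `"Scaffold"`.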
Context Discipline
- a mode upgrade does not justify a larger SKILL.md
- higher rigor should mostly add better references, reports, evals, and metadata
- if a mode upgrade bloats the initial load, move detail out before adding more checks