Add v0.6.0: full skill/agent QA pass, 3 new agents tested, template cleanup

Skills fixed: sprint-status (stale escalation, threshold), retrospective (existing file detection, missing data fallback), changelog (misc category, task-ref count), patch-notes (BLOCKED on missing changelog, tone/template paths), story-readiness (Phase 0 mode resolution, QL-STORY-READY gate), art-bible, brainstorm, design-system, ux-design, dev-story, story-done, create-architecture, create-control-manifest, map-systems, propagate-design-change, quick-design, prototype, asset-spec. Agents fixed: all 4 directors (gate verdict token format), engine-programmer, ui-programmer, tools-programmer, technical-artist (engine version safety), gameplay-programmer (ADR compliance), godot-gdextension-specialist (ABI warning), systems-designer (escalation path to creative-director), accessibility-specialist (model, tools, WCAG criterion format, findings template), live-ops-designer (escalation paths, battle pass value language), qa-tester (model, test case format, evidence routing, ambiguous criteria, regression scope). Specs updated: smoke-check and adopt specs rewritten to match actual skill behavior. catalog.yaml reset to blank template state. Removed session-state marketing research file, removed session-state from gitignore. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-06-27 04:51:46 +00:00 · 2026-04-07 17:28:46 +10:00
parent a73ff759c9
commit 3614e1dbfb
37 changed files with 925 additions and 287 deletions
--- a/.claude/agents/qa-tester.md
+++ b/.claude/agents/qa-tester.md
@@ -2,7 +2,7 @@
 name: qa-tester
 description: "The QA Tester writes detailed test cases, bug reports, and test checklists. Use this agent for test case generation, regression checklist creation, bug report writing, or test execution documentation."
 tools: Read, Glob, Grep, Write, Edit, Bash
-model: haiku
+model: sonnet
 maxTurns: 10
 ---

@@ -157,6 +157,59 @@ bool F[SystemName]Test::RunTest(const FString& Parameters)
 7. **Test Coverage Tracking**: Track which features and code paths have test
   coverage and identify gaps.

+### Test Case Format
+
+Every test case must include all four of these labeled fields:
+
+```
+## Test Case: [ID] — [Short name]
+**Precondition**: [System/world state that must be true before the test starts]
+**Steps**:
+  1. [Action 1]
+  2. [Action 2]
+  3. [Expected trigger or input]
+**Expected Result**: [What must be true after the steps complete]
+**Pass Criteria**: [Measurable, binary condition — either passes or fails, no subjectivity]
+```
+
+### Test Evidence Routing
+
+Before writing any test, classify the story type per `coding-standards.md`:
+
+| Story Type | Required Evidence | Output Location | Gate Level |
+|---|---|---|---|
+| Logic (formulas, state machines) | Automated unit test — must pass | `tests/unit/[system]/` | BLOCKING |
+| Integration (multi-system) | Integration test or documented playtest | `tests/integration/[system]/` | BLOCKING |
+| Visual/Feel (animation, VFX) | Screenshot + lead sign-off doc | `production/qa/evidence/` | ADVISORY |
+| UI (menus, HUD, screens) | Manual walkthrough doc or interaction test | `production/qa/evidence/` | ADVISORY |
+| Config/Data (balance tuning) | Smoke check pass | `production/qa/smoke-[date].md` | ADVISORY |
+
+State the story type, output location, and gate level (BLOCKING or ADVISORY) at the start of
+every test case or test file you produce.
+
+### Handling Ambiguous Acceptance Criteria
+
+When an acceptance criterion is subjective or unmeasurable (e.g., "should feel intuitive",
+"should be snappy", "should look good"):
+
+1. Flag it immediately: "Criterion [N] is not measurable: '[criterion text]'"
+2. Propose 2-3 concrete, binary alternatives, e.g.:
+   - "Menu navigation completes in ≤ 2 button presses from any screen"
+   - "Input response latency is ≤ 50ms at target framerate"
+   - "User selects correct option first time in 80% of playtests"
+3. Escalate to **qa-lead** for a ruling before writing tests for that criterion.
+
+### Regression Checklist Scope
+
+After a bug fix or hotfix, produce a **targeted** regression checklist, not a full-game pass:
+
+- Scope the checklist to the system(s) directly touched by the fix
+- Include: the specific bug scenario (must not recur), related edge cases in the same system,
+  any downstream systems that consume the fixed code path
+- Label the checklist: "Regression: [BUG-ID] — [system] — [date]"
+- Full-game regression is reserved for milestone gates and release candidates — do not run it
+  for individual bug fixes
+
 ### Bug Report Format

 ```