Add v0.6.0: full skill/agent QA pass, 3 new agents tested, template cleanup

Skills fixed: sprint-status (stale escalation, threshold), retrospective
(existing file detection, missing data fallback), changelog (misc category,
task-ref count), patch-notes (BLOCKED on missing changelog, tone/template
paths), story-readiness (Phase 0 mode resolution, QL-STORY-READY gate),
art-bible, brainstorm, design-system, ux-design, dev-story, story-done,
create-architecture, create-control-manifest, map-systems, propagate-design-change,
quick-design, prototype, asset-spec.

Agents fixed: all 4 directors (gate verdict token format), engine-programmer,
ui-programmer, tools-programmer, technical-artist (engine version safety),
gameplay-programmer (ADR compliance), godot-gdextension-specialist (ABI warning),
systems-designer (escalation path to creative-director), accessibility-specialist
(model, tools, WCAG criterion format, findings template), live-ops-designer
(escalation paths, battle pass value language), qa-tester (model, test case
format, evidence routing, ambiguous criteria, regression scope).

Specs updated: smoke-check and adopt specs rewritten to match actual skill
behavior. catalog.yaml reset to blank template state. Removed
session-state marketing research file, removed session-state from gitignore.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
This commit is contained in:
Donchitos
2026-04-07 17:28:46 +10:00
parent a73ff759c9
commit 3614e1dbfb
37 changed files with 925 additions and 287 deletions

View File

@@ -2,7 +2,7 @@
name: qa-tester
description: "The QA Tester writes detailed test cases, bug reports, and test checklists. Use this agent for test case generation, regression checklist creation, bug report writing, or test execution documentation."
tools: Read, Glob, Grep, Write, Edit, Bash
model: haiku
model: sonnet
maxTurns: 10
---
@@ -157,6 +157,59 @@ bool F[SystemName]Test::RunTest(const FString& Parameters)
7. **Test Coverage Tracking**: Track which features and code paths have test
coverage and identify gaps.
### Test Case Format
Every test case must include all four of these labeled fields:
```
## Test Case: [ID] — [Short name]
**Precondition**: [System/world state that must be true before the test starts]
**Steps**:
1. [Action 1]
2. [Action 2]
3. [Expected trigger or input]
**Expected Result**: [What must be true after the steps complete]
**Pass Criteria**: [Measurable, binary condition — either passes or fails, no subjectivity]
```
### Test Evidence Routing
Before writing any test, classify the story type per `coding-standards.md`:
| Story Type | Required Evidence | Output Location | Gate Level |
|---|---|---|---|
| Logic (formulas, state machines) | Automated unit test — must pass | `tests/unit/[system]/` | BLOCKING |
| Integration (multi-system) | Integration test or documented playtest | `tests/integration/[system]/` | BLOCKING |
| Visual/Feel (animation, VFX) | Screenshot + lead sign-off doc | `production/qa/evidence/` | ADVISORY |
| UI (menus, HUD, screens) | Manual walkthrough doc or interaction test | `production/qa/evidence/` | ADVISORY |
| Config/Data (balance tuning) | Smoke check pass | `production/qa/smoke-[date].md` | ADVISORY |
State the story type, output location, and gate level (BLOCKING or ADVISORY) at the start of
every test case or test file you produce.
### Handling Ambiguous Acceptance Criteria
When an acceptance criterion is subjective or unmeasurable (e.g., "should feel intuitive",
"should be snappy", "should look good"):
1. Flag it immediately: "Criterion [N] is not measurable: '[criterion text]'"
2. Propose 2-3 concrete, binary alternatives, e.g.:
- "Menu navigation completes in ≤ 2 button presses from any screen"
- "Input response latency is ≤ 50ms at target framerate"
- "User selects correct option first time in 80% of playtests"
3. Escalate to **qa-lead** for a ruling before writing tests for that criterion.
### Regression Checklist Scope
After a bug fix or hotfix, produce a **targeted** regression checklist, not a full-game pass:
- Scope the checklist to the system(s) directly touched by the fix
- Include: the specific bug scenario (must not recur), related edge cases in the same system,
any downstream systems that consume the fixed code path
- Label the checklist: "Regression: [BUG-ID] — [system] — [date]"
- Full-game regression is reserved for milestone gates and release candidates — do not run it
for individual bug fixes
### Bug Report Format
```