Agent artifact contract

Executable Stories can publish a behavior catalog for coding agents. Tests stay in the host framework; agents read generated artifacts.

Canonical Artifact

Use StoryReport v1 JSON as the stable machine contract:

executable-stories format .executable-stories/raw-run.json --format story-report-json

Default output (with --output-name index):

reports/index.story-report.json

StoryReport v1 contains:

run metadata: id, timestamps, project root, package version, git SHA, CI info
features: title, source file, status summary
scenarios: id, title, status, duration, tags, tickets, covers, source line, docs, steps, attachments
steps: keyword, text, status, duration, errors, doc entries

Code → scenario (`covers`)

Scenarios declare the product-code paths/globs they exercise via a covers option (project-root-relative), beside tags/tickets. It travels through the StoryReport contract in every language adapter. Agents invert it with the get_scenarios_for_paths MCP tool (or GET /scenarios/covering): pass the files you’re editing, get the behavior at risk. Scenarios with no covers surface as a missing-covers warning in the behavior manifest’s debugger.

Agents should depend on this artifact before reading prose docs.

Agent Index

Preferred: generate the scenario index artifact from RawRun:

executable-stories format .executable-stories/raw-run.json \
  --format scenario-index-json \
  --output-dir reports \
  --output-name index

Output: reports/index.scenario-index.json

Legacy alternative (same shape, fewer metadata fields):

executable-stories list .executable-stories/raw-run.json --list-format json

The index includes scenario id, title, status, source file/line, tags, tickets, steps, doc kinds, and errors.

Behavior Manifest

For agent-oriented discovery and quality signals:

executable-stories format .executable-stories/raw-run.json \
  --format behavior-manifest-json \
  --output-dir reports \
  --output-name index

Output: reports/index.behavior-manifest.json — source file rollups, tag index, doc coverage, debugger warnings (missing tags, missing source lines, etc.).

Chat Paste (`agent-text`)

Not every model reading your run is a coding agent with tools. Sooner or later a product owner pastes the HTML report into ChatGPT and asks “what does this product do?”. A 1.3 MB report overflows the context window, so the model answers from whatever fraction survived truncation, without saying so.

agent-text is the artifact for that paste: the full run (steps, doc entries, errors) as flat plain text, with a self-describing header and none of the tokens a model never reads (ids, hashes, durations, markup).

executable-stories format .executable-stories/raw-run.json --format agent-text

Output: reports/index.agent.txt. On a real 74-scenario run: HTML 1,312 KB (~330k tokens), agent-text 107 KB (~27k tokens). The same behavior, at a size a chat window actually keeps. For tool-using agents, prefer the JSON artifacts above; this format optimizes tokens, not parseability.

Release Manifest

For release evidence, generate a tested-together manifest:

executable-stories format .executable-stories/raw-run.json \
  --format release-manifest \
  --output-dir reports \
  --output-name index

Output: reports/index.release-manifest.md.

The manifest records scenario ids, titles, statuses, source files, tags, branch/commit metadata when present, and a SHA-256 hash built from the exact scenario/status set. Use it when an agent or reviewer needs to confirm what batch was tested before a release.

Agent Loop

Run framework tests.
Generate StoryReport + index + manifest.
Query failing scenarios or browse the index.
Inspect scenario source files.
Change product code or tests.
Rerun focused or full tests.
Regenerate artifacts.

The framework remains the execution layer. Executable Stories supplies behavior context and evidence.

For loop-shaped, unattended agents, three commands wrap this loop: triage (the worklist of what to fix), check (the per-turn backpressure signal), and goal (a behavioral definition-of-done with an anti-fake-done ratchet). See Agent loops and backpressure.

MCP

Use executable-stories-mcp when an MCP-capable agent needs direct tools:

npx executable-stories-mcp

Read-only tools:

list_scenarios (optional statuses / tags / sourceFiles filters)
get_scenario
get_failing_scenarios
get_scenarios_for_paths — code→scenario via declared covers
get_feature_summary
get_scenario_index
get_behavior_manifest
get_behavior_diff — regressed / fixed / added / removed between two reports
get_deployment_status — latest recorded deployment per environment
get_environment_drift — scenarios only in one environment and status drift for shared scenarios

Execution tool:

run_scenario — runs one scenario through vitest, jest, playwright, or cypress

Each tool reads StoryReport v1 JSON. By default it uses:

reports/index.story-report.json

Pass reportPath to use another file. See MCP server.

Live index (watch)

Keep the agent artifacts fresh while you work. executable-stories watch regenerates the requested formats whenever the framework rewrites its raw-run file:

executable-stories watch reports/raw-run.json \
  --format story-report-json,scenario-index-json,behavior-manifest-json \
  --output-dir reports \
  --output-name index

Pair it with the host framework’s own watch mode (vitest --watch, jest --watch, …): tests rerun on code change → raw-run is rewritten → the index regenerates automatically. It is language-agnostic — any adapter that emits a raw-run drives it. Change events are debounced and overlapping runs are coalesced. The same step is available programmatically via startWatch / regenerateArtifacts from executable-stories-formatters.

CI Recipe

Recommended CI flow:

pnpm test
executable-stories format reports/raw-run.json \
  --format story-report-json,scenario-index-json,behavior-manifest-json,agent-text,release-manifest,traceability-matrix,html,markdown \
  --output-dir reports \
  --output-name index

Publish as CI artifacts:

reports/index.story-report.json
reports/index.scenario-index.json
reports/index.behavior-manifest.json
reports/index.agent.txt
reports/index.release-manifest.md
reports/index.traceability-matrix.md

Example apps expose pnpm report:agents with this recipe.