Skip to content

feat(results): restructure AgentV run artifacts#1513

Merged
christso merged 9 commits into
mainfrom
strict-vercel-results
Jun 25, 2026
Merged

feat(results): restructure AgentV run artifacts#1513
christso merged 9 commits into
mainfrom
strict-vercel-results

Conversation

@christso

@christso christso commented Jun 25, 2026

Copy link
Copy Markdown
Collaborator

Summary

AgentV run bundles now use index.jsonl as the discovery anchor and root summary.json as the run-level aggregate. The old root benchmark.json file is no longer written or read; local discovery, git-backed remote discovery, CI gating, exports, comparison/reporting, Dashboard loading, and WIP resume metadata now use the summary/index contract.

The layout keeps AgentV-specific value where it belongs: per-case summary.json remains, each run-N/ keeps result.json, grading.json, metrics, timing, transcripts, and outputs, and repeat-run rows remain inspectable in the Dashboard. The Dashboard table also preserves the intended Target, Suite, Score order and renders repeat-run detail rows.

Results publishing now treats existing checkout remotes as user-owned state. AgentV writes result refs through the configured remote name, defaulting to origin, but does not add or rewrite remotes in an existing checkout.

Validation

  • bun test packages/core/test/evaluation/results-repo.test.ts
  • bun test apps/cli/test/commands/results/serve.test.ts
  • bun test apps/dashboard/src/lib/result-table.test.ts
  • bun --filter @agentv/core typecheck
  • bun --filter agentv typecheck
  • bun run lint
  • cd apps/dashboard && bun run build
  • Dashboard browser UAT against the local run fixture confirmed result headers Status, Expand, Test ID, Target, Suite, Score and repeat detail content for run-1/run-2.

Compound Engineering
Codex

@cloudflare-workers-and-pages

cloudflare-workers-and-pages Bot commented Jun 25, 2026

Copy link
Copy Markdown

Deploying agentv with  Cloudflare Pages  Cloudflare Pages

Latest commit: c799beb
Status: ✅  Deploy successful!
Preview URL: https://62dff0da.agentv.pages.dev
Branch Preview URL: https://strict-vercel-results.agentv.pages.dev

View logs

@christso christso changed the title feat(results): adopt strict Vercel run layout (drop benchmark.json) feat(results): restructure AgentV run artifacts Jun 25, 2026
@christso christso merged commit e2f865a into main Jun 25, 2026
8 checks passed
@christso christso deleted the strict-vercel-results branch June 25, 2026 21:38
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant