feat(results): restructure AgentV run artifacts by christso · Pull Request #1513 · EntityProcess/agentv

christso · 2026-06-25T12:18:35Z

Summary

AgentV run bundles now use index.jsonl as the discovery anchor and root summary.json as the run-level aggregate. The old root benchmark.json file is no longer written or read; local discovery, git-backed remote discovery, CI gating, exports, comparison/reporting, Dashboard loading, and WIP resume metadata now use the summary/index contract.

The layout keeps AgentV-specific value where it belongs: per-case summary.json remains, each run-N/ keeps result.json, grading.json, metrics, timing, transcripts, and outputs, and repeat-run rows remain inspectable in the Dashboard. The Dashboard table also preserves the intended Target, Suite, Score order and renders repeat-run detail rows.

Results publishing now treats existing checkout remotes as user-owned state. AgentV writes result refs through the configured remote name, defaulting to origin, but does not add or rewrite remotes in an existing checkout.

Validation

bun test packages/core/test/evaluation/results-repo.test.ts
bun test apps/cli/test/commands/results/serve.test.ts
bun test apps/dashboard/src/lib/result-table.test.ts
bun --filter @agentv/core typecheck
bun --filter agentv typecheck
bun run lint
cd apps/dashboard && bun run build
Dashboard browser UAT against the local run fixture confirmed result headers Status, Expand, Test ID, Target, Suite, Score and repeat detail content for run-1/run-2.

cloudflare-workers-and-pages · 2026-06-25T12:19:03Z

Deploying agentv with Cloudflare Pages

Latest commit:	`c799beb`
Status:	✅ Deploy successful!
Preview URL:	https://62dff0da.agentv.pages.dev
Branch Preview URL:	https://strict-vercel-results.agentv.pages.dev

View logs

christso added 3 commits June 25, 2026 14:17

feat(results): adopt summary/index artifact contract

b6297f3

feat(dashboard): render strict-layout repeat runs

93a213d

docs(results): document strict Vercel layout

5846113

christso added 3 commits June 25, 2026 15:03

fix(results): preserve existing git remotes

c5cf843

fix(dashboard): keep suite column after target

ab8cb9b

test(results): format artifact branch changes

0d24d97

christso changed the title ~~feat(results): adopt strict Vercel run layout (drop benchmark.json)~~ feat(results): restructure AgentV run artifacts Jun 25, 2026

christso added 3 commits June 25, 2026 23:22

test(results): align raw provider log artifact expectation

cea9b20

fix(core): avoid Bun stdin hangs for exec helpers

f0fcf1f

test(results): align CLI fixtures with run summary layout

c799beb

christso merged commit e2f865a into main Jun 25, 2026
8 checks passed

christso deleted the strict-vercel-results branch June 25, 2026 21:38

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat(results): restructure AgentV run artifacts#1513

feat(results): restructure AgentV run artifacts#1513
christso merged 9 commits into
mainfrom
strict-vercel-results

christso commented Jun 25, 2026 •

edited

Loading

Uh oh!

cloudflare-workers-and-pages Bot commented Jun 25, 2026 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Uh oh!

Conversation

christso commented Jun 25, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Validation

Uh oh!

cloudflare-workers-and-pages Bot commented Jun 25, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Deploying agentv with Cloudflare Pages

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

christso commented Jun 25, 2026 •

edited

Loading

cloudflare-workers-and-pages Bot commented Jun 25, 2026 •

edited

Loading