Add benchmarkoor-dashboard: ethrex EL benchmark dashboard by edg-l · Pull Request #9 · lambdaclass/ethrex-tooling

edg-l · 2026-06-11T09:28:14Z

New subfolder tool: a FastAPI + Jinja + HTMX + Plotly dashboard over the ethPandaOps Benchmarkoor API, focused on the ethrex EL client.

Sync (app/sync.py) pulls all suites and results into a local SQLite snapshot. The API intermittently returns 5xx under load, so client.py wraps every request in a tenacity retry with exponential backoff. Only the newest suite per name is indexed, which drops stale regenerated duplicates. Run selection picks the latest full run per (suite, instance) so that partial or truncated runs do not skew aggregates.
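
The run-selection rule can be sketched as follows. This is an illustration only, not the code in app/sync.py: the `Run` fields, the `latest_full_runs` helper, and the definition of "full" (a run that reports at least the expected number of tests for its suite) are all assumptions made for the example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Run:
    # Hypothetical shape of a run record; the real schema may differ.
    run_id: str
    suite: str
    instance: str
    started_at: int  # unix seconds
    tests_run: int


def latest_full_runs(runs, expected_tests):
    """Pick the newest run per (suite, instance) that covers the whole suite."""
    best = {}
    for run in runs:
        # Assumption: "full" means the run reports every test in the suite.
        if run.tests_run < expected_tests[run.suite]:
            continue  # partial or truncated run, ignore it
        key = (run.suite, run.instance)
        if key not in best or run.started_at > best[key].started_at:
            best[key] = run
    return best


runs = [
    Run("r1", "suite-a", "ethrex", 100, 50),
    Run("r2", "suite-a", "ethrex", 200, 12),  # newer but truncated
    Run("r3", "suite-a", "geth", 150, 50),
]
selected = latest_full_runs(runs, {"suite-a": 50})
print({key: run.run_id for key, run in selected.items()})
```

In this sample the truncated run `r2` is skipped even though it is newer, so the ethrex aggregate is still computed from `r1`.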

Pages: Overview, Leaderboard (ranked by gas-weighted aggregate Mgas/s = Σgas/Σtime; median/mean/wins/gas-won as secondary columns), Coverage gaps, Compare (per-test ethrex-vs-clients matrix), Trends, Test detail.
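
A small sketch of why the leaderboard ranks by Σgas/Σtime rather than by the mean of per-test rates. The function names and the `(gas, seconds)` tuple layout are assumptions for illustration, and the numbers are made up.

```python
def aggregate_mgas_per_s(results):
    """Gas-weighted aggregate: total gas divided by total time."""
    total_gas = sum(gas for gas, _ in results)
    total_time_s = sum(seconds for _, seconds in results)
    return total_gas / total_time_s / 1e6


def mean_mgas_per_s(results):
    """Unweighted mean of per-test rates (a secondary column)."""
    rates = [gas / seconds / 1e6 for gas, seconds in results]
    return sum(rates) / len(rates)


# (gas, seconds) per test: one large fast test and one small slow test.
results = [(100_000_000, 1.0), (1_000_000, 0.5)]

print(aggregate_mgas_per_s(results))
print(mean_mgas_per_s(results))
```

With these inputs the aggregate is about 67.3 Mgas/s while the unweighted mean is 51 Mgas/s: the small slow test pulls the mean down far more than its share of the gas would justify.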

Agent API: /agent.md (alias /llm.md) is a self-contained Markdown brief for LLM agents. JSON endpoints: /api/targets, /api/targets/by_file, /api/leaderboard, /api/coverage, /api/suites. Optimization targets are ranked by time_lost_ms (ethrex time minus the fastest competitor's time, summed per file/opcode), which weights targets by recoverable wall-clock time rather than by Mgas/s ratio.
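
The time_lost_ms ranking could look roughly like this when grouped per file. The row layout, the file names, and the `time_lost_by_file` helper are hypothetical. Clamping each test's contribution at zero is an assumption based on a later commit in this PR, which notes that file time_lost counts only deficits.

```python
from collections import defaultdict


def time_lost_by_file(rows):
    """Sum ethrex's deficit against the fastest competitor, per file.

    rows: iterable of (file, test, ethrex_ms, fastest_other_ms).
    """
    lost = defaultdict(float)
    for file, _test, ethrex_ms, fastest_other_ms in rows:
        # Assumption: tests where ethrex is already fastest contribute 0,
        # so wins do not cancel out deficits within the same file.
        lost[file] += max(0.0, ethrex_ms - fastest_other_ms)
    # Largest recoverable wall-clock time first.
    return sorted(lost.items(), key=lambda item: item[1], reverse=True)


rows = [
    ("sstore.py", "t1", 120.0, 100.0),
    ("sstore.py", "t2", 90.0, 95.0),  # ethrex is ahead, contributes 0
    ("keccak.py", "t3", 400.0, 350.0),
]
ranking = time_lost_by_file(rows)
print(ranking)
```

Ranking by absolute milliseconds puts `keccak.py` first here, even though its ethrex-to-best ratio (400/350) is smaller than that of the first `sstore.py` test (120/100).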

API key is read from a gitignored .env (see .env.example). Run: cp .env.example .env, set BENCHMARKOOR_API_KEY, uv sync, uv run python -m app.sync, uv run uvicorn app.main:app.

…threx phase ingestion + fkv

- map ethrex runs to bal-devnet-7 commit (gh history, by time); show in api/agent.md/leaderboard/trends
- lazy /run/{id} block-logs viewer + /api/runs/{id}/block_logs (live, no storage)
- stream benchmarkoor.log per ethrex run: per-test exec/merkle/store phase split + fkv catch-up summary; raw logs discarded
- targets now carry phase bottleneck (exec/merkle/store) + merkle overlap and fkv state in /agent.md + /api/{targets,fkv}

- load Inter globally (UI + Plotly chart font), inline svg favicon
- desaturate client palette; leaderboard bars = ethrex pink vs muted slate (others)
- /run: one clean bar per operation (worst block), fixes segmented-bar collision when ops repeat
- verified all pages via headless browser at 2K

- headroom (Mgas/s if ethrex matched best per test) + deficit portfolio by phase/resource
- staleness: store newest-run ts, header colors by snapshot age
- gas-scaling page (/op): Mgas/s vs gas per op, ethrex vs others
- merkle parallelism page (/merkle): serial-merkle ranking from overlap data
- per-commit op regression: phase_history table appended each sync
- failing-tests surfacing from run aggregate
- regression detection /api/regressions + /api/freshness live check
- unauthenticated GitHub REST fallback for commit fetch (incremental cache) when gh absent
- JSON: /api/{headroom,merkle,failures,regressions,freshness}; nav + overview surfaces
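
One way to read the headroom figure from the first bullet: recompute the gas-weighted aggregate after replacing ethrex's time on each test with the best time any client achieved. The sketch below assumes that reading. The `headroom_mgas_per_s` helper and the `(gas, ethrex_s, best_other_s)` layout are hypothetical, and the numbers are invented.

```python
def headroom_mgas_per_s(rows):
    """Return (current, matched) gas-weighted Mgas/s for ethrex.

    rows: iterable of (gas, ethrex_s, best_other_s) per test.
    """
    rows = list(rows)
    total_gas = sum(gas for gas, _, _ in rows)
    current_time = sum(ethrex_s for _, ethrex_s, _ in rows)
    # Assumption: "matching best" never makes ethrex slower, so take the
    # minimum of its own time and the fastest competitor's time.
    matched_time = sum(min(ethrex_s, best_s) for _, ethrex_s, best_s in rows)
    return total_gas / current_time / 1e6, total_gas / matched_time / 1e6


rows = [
    (30_000_000, 0.5, 0.25),  # ethrex is 2x slower than the best client
    (30_000_000, 0.25, 0.5),  # ethrex is already the fastest
]
current, matched = headroom_mgas_per_s(rows)
print(current, matched)
```

Here the current aggregate is 80 Mgas/s and the matched aggregate is 120 Mgas/s, so the headroom is the 40 Mgas/s gap between them.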

edg-l added 2 commits June 11, 2026 11:05

Add benchmarkoor-dashboard: ethrex EL benchmark dashboard

4dc5c71

benchmarkoor-dashboard: agent.md brief + JSON target endpoints, header tooltips

5561ee0

edg-l marked this pull request as ready for review June 11, 2026 09:31

edg-l added 14 commits June 11, 2026 11:40

benchmarkoor-dashboard: enrich targets with bottleneck (cpu/io/mem), op, scaling; file time_lost counts only deficits

2edeb1f

benchmarkoor-dashboard: associate runs to ethrex commit (gh branch history, time map); expose in api/agent.md/leaderboard/trends

60a10e8

benchmarkoor-dashboard: document phase/fkv ingestion + /api/fkv, /run endpoints

816a67d

benchmarkoor-dashboard: /run shows parsed exec/merkle/store phase split (stacked), API fallback otherwise

9e15d87

benchmarkoor-dashboard: softer phase-chart palette to match dark theme

7b76810

benchmarkoor-dashboard: design polish — thinner phase bars, table zebra/hover, calmer section headers, lighter muted, readable prose width, drop dead badge css

cb1075f

benchmarkoor-dashboard: scaling page shows all ops as a heatmap (ratio vs best per gas), drop op dropdown; keep suite picker, op drill-down via ?op=

d484698

benchmarkoor-dashboard: wrap overview stat-row so headroom stat doesn't overflow narrow cards

e7a1a18

benchmarkoor-dashboard: trends chart zooms to ethrex lifetime by default + full-range toggle

140b8ae

benchmarkoor-dashboard: lower default paginate page to 250 (API 500s on heavy tables at limit>=500)

a4fd12f

benchmarkoor-dashboard: order runs paginate by id not timestamp (unindexed sort + offset 500s on large run sets)

634d682

edg-l wants to merge 16 commits into main from benchmarkoor-dashboard
