Proven · Live demonstration

One epoch.
A hundred submissions.
One proof.

Watch a single round resolve through the Verification Funnel, the way a validator sees it. Submissions enter as noise; the funnel rules out the cheap failures first, then scores what survives by the faults it kills.

Run a round How Proven works

Faithful simulation · the mechanism is real, the data is generated on the spot, nothing touches a live subnet

miner·7f2a↳ submitted

01# test_e2e.py · synthesized from the spec alone02from playwright.sync_api import Page, expect03 04def test_negative_quantity_guard(page: Page):05    page.goto("http://target-app:8080/checkout")06    page.locator("#quantity-input").fill("-5")07    checkout = page.locator("#checkout-btn")08    expect(checkout).to_be_disabled()

This round’s spec · Synthetic Spec Engine

SaaS dashboard · SPEC-1427

Given a viewer-role account, when the user opens the billing panel, then every 'Upgrade plan' control must be non-interactive.

Miners receive this specification and nothing else. The application’s source code never leaves the validator. It is the only input to every suite in the field below.

Input · from validator via Synapse

{
  "spec_type": "user_story",
  "content": "Given a viewer-role account, when the user opens the billing panel, then every 'Upgrade plan' control must be non-interactive.",
  "target_url": "http://target-app:8080/billing",
  "timeout_ms": 9000
}

The Verification Funnel

Epoch 1427

ready

01cost · very low
Static Gate
Lints the script. Broken or malformed code is ruled out at near-zero cost.
—passed
02cost · medium
Reference Gate
Runs the suite on one clean Reference app. It must pass 100%, or Pclean = 0. Catches false positives, over-strict assertions, and DOM-crawl timeouts before any mutants spin up.
—passed
03cost · high
Mutant Horde
Spins up 20+ mutated builds. The score is the kill ratio, scaled by efficiency.
—scored · 20 mutants

Submission inspector

Select any cell in the field, or a leaderboard row, to read the submitted Playwright suite and how the funnel judged it.

Epoch resolution · winner takes all

Scoring against the Mutant Horde…

How a score is computed

S_i=P_clean·α ·K_iN_mut·E_i

A score only exists once a suite clears both gates. After that, it is the kill ratio, scaled by how cleanly it ran. Winner takes the epoch.

P_clean: Binary switch. 1 if the suite passes the Reference Implementation 100%, else 0. One false positive ends the round.
K_i / N_mut: Kill ratio. Mutants killed over mutants generated. The efficacy of the suite, and the heart of the score.
E_i: Efficiency decay. Bloated or DOM-crawling suites that exceed the execution window are penalized here.

Discouraging gamification

Assertion Roulette

A suite blind-asserts a failure to rack up kills. It must pass the clean Reference Implementation first; any false positive flips Pclean to 0 and zeroes the score.

Hard-coded answers

A miner memorizes a known build. Mutants are generated dynamically at runtime, so yesterday's answers never generalize to this epoch's horde.

Plagiarism

A miner copies a top suite. Validators track AST and semantic similarity; under first-to-chain advantage the original submitter keeps the whole score.

Probing

A suite crawls the DOM to map the environment instead of testing the spec. Containers are isolated and timed; excess crawling blows the execution window and the efficiency decay E cuts the score.

That was one simulated epoch. The real network runs them without end.

Request testnet access Back to Proven

One epoch.A hundred submissions.One proof.

Given a viewer-role account, when the user opens the billing panel, then every 'Upgrade plan' control must be non-interactive.

Epoch 1427

Static Gate

Reference Gate

Mutant Horde