This experiment modifies the executable governance kernel and evaluates B6 against B5 on held-out tasks.

| run_id | status | actions |
|---|---|---|
| pending-human-review | pre_review | human-gated |

No Evidence Docket, no empirical SOTA claim. Autonomous evidence production is allowed; autonomous claim promotion is not.
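
The gating rule above can be sketched as a small check. This is a minimal illustration, not the governance kernel's actual code: the `Run` record, its `evidence_docket` and `human_approved` fields, and both function names are hypothetical, introduced only to show how evidence production and claim promotion might be gated differently.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Run:
    """Hypothetical run record; field names are assumptions for illustration."""
    run_id: str
    status: str                            # e.g. "pre_review"
    evidence_docket: Optional[str] = None  # assumed docket reference, None if absent
    human_approved: bool = False           # assumed flag set only by a human reviewer


def may_produce_evidence(run: Run) -> bool:
    # Evidence production is allowed autonomously, so no gate applies here.
    return True


def may_promote_claim(run: Run) -> bool:
    # Claim promotion needs both an Evidence Docket and human sign-off.
    return run.evidence_docket is not None and run.human_approved


# The run from the table: still in pre_review, no docket, not yet approved.
pending = Run(run_id="pending-human-review", status="pre_review")
assert may_produce_evidence(pending)
assert not may_promote_claim(pending)
```

Under these assumptions, a run becomes eligible for promotion only after a docket is attached and a human reviewer sets the approval flag; attaching a docket alone is not enough.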