Empirical Prediction Architecture

Predictions stated before the substrate. Tested by the substrate.

8/8
v13.1.0 predictions held
0
Pre-specified predictions falsified
v13.0.0
Prior version cleanly falsified, motivating v13.1.0 corrective iteration

External AI governance measurement operates in a pre-paradigmatic domain. There is no panel of human raters that could be the ground truth, no historical record of governance failures whose magnitude is independently quantified, no regulatory benchmark whose calibration could be borrowed. Criterion validity in the classical sense is structurally unavailable. The substrate validates instead by construct validity: it produces values that satisfy a network of theoretically derived expectations stated before the substrate is sealed. Architectural envelope predictions follow by construction from methodology choices and must hold. Pre-specified predictions are quantitative empirical claims registered before sealing and tested against the sealed snapshot. Post-validation observations surface patterns after sealing but cannot be confirmed or falsified because they were not pre-registered. For the v13.1.0 substrate, all architectural envelope predictions held and all pre-specified predictions confirmed.

Figure 1 · v13.1.0 Pre-Registered Prediction Results
v13.1.0 SEALED 29 APRIL 2026 · ALL PREDICTIONS HELD ARCHITECTURAL ENVELOPE Three-score family monotonic ordering HELD 543/543 Edges-per-agent stability HELD 6.41-6.53 Floor membership identical across views HELD every institution PRE-SPECIFIED Distribution moments within range HELD mean 18.27, median 17.10 Sector gradient ordering HELD pension < SWF < ... < banks Evidence composition O+D majority HELD 63.0% combined Inferred bounded HELD 37.0%, within target No regulatory dimension >50% unknown HELD max 46.1% A prediction is only as defensible as the moment it was made. PILLAR VALUES SHOWN UNDER METHODOLOGY EXEMPTION FOR FALSIFIABILITY MAR®500.com

The sealed v13.1.0-production-day-zero-full snapshot was preceded by eight pre-registered predictions across two classes. Architectural envelope predictions follow by construction from methodology choices: failure of an envelope prediction constitutes methodology failure. Pre-specified predictions are quantitative empirical claims registered before sealing: the methodology stakes its credibility on producing values within stated ranges. All eight predictions held against the sealed substrate. The v13.0.0 scanner that preceded this iteration produced a clean falsification on a different prediction class, which motivated the v13.1.0 corrective interventions documented in Stationary Sea Part 1, Section 5.4. A falsifying experiment produces more information than a confirming one.

Source · Stationary Sea Part 1, Section 9 (EPA Evidence Summary) · v13.1.0-production-day-zero-full sealed 29 April 2026 Methodology · Stationary Sea Part 1 (Zenodo)
543 institutions · 12 sectors · 95,876 agents · 636,854 governance edges · substrate v13.1.0
Methodology grounded in The Stationary Sea (Part 1: Substrate Construction), Section 7 (The Empirical Prediction Architecture).