Cross-Model Validation

The Grok Experiment

The validity mirage is not a Claude-specific artifact. We tested tropical-mcp's guarded compression policy against xAI's Grok model to determine whether the phenomenon — and the fix — transfer across model architectures.


Key Finding

The mirage persists across model architectures

The validity mirage is structural, not behavioral. It arises from how compression algorithms select which messages to keep — not from how any particular model responds. When the pivot is dropped, any model will confidently answer a question it was never actually asked. Grok is no exception.

Naive recency fails identically

Under the recency policy, Grok exhibits the same pivot-loss pattern seen with other models. Surface validity is maintained; task relevance is silently destroyed.

Guarded policy transfers cleanly

The L2-guarded compression contract holds regardless of which model sits downstream. The mathematical guarantee is about the context window, not the model reading it.

Architecture is not the variable

The experiment confirms that the mirage is a compression-layer problem. Switching model providers does not eliminate it — only a guarded policy does.


Methodology

How the experiment was set up

The Grok experiment applies the same deterministic witness used in the flagship paper to xAI's Grok model. The replay witness is a fixed set of conversation transcripts with known pivot positions, tested at multiple retention fractions.

Same witness, different model

The committed replay witness from the main research program was replayed against Grok's API with identical compression policies and retention fractions.

Two policies compared

Each transcript was compressed with naive recency and with the L2-guarded policy. Pivot preservation rate was recorded for each.

Retention fractions tested

Experiments ran at retention fractions of 0.65, 0.50, and 0.40 — the same fractions used in the flagship paper's witness.
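The replay loop above can be sketched in miniature. This is a simplified illustration, not the actual tropical-mcp implementation: the names (`Message`, `Transcript`, `naive_recency`, `l2_guarded`, `pivot_id`) and the guard logic are assumptions chosen to mirror the described behavior, where naive recency drops an early pivot at low retention while the guarded policy refuses to.

```python
# Illustrative sketch of the replay experiment: one transcript with a known
# pivot, compressed by two policies at the three retention fractions.
# All names here are hypothetical stand-ins, not the tropical-mcp API.
from dataclasses import dataclass

@dataclass
class Message:
    id: str
    text: str

@dataclass
class Transcript:
    messages: list
    pivot_id: str  # known pivot position from the committed witness

def naive_recency(messages, retention):
    """Keep only the most recent fraction of messages."""
    keep = max(1, int(len(messages) * retention))
    return messages[-keep:]

def l2_guarded(messages, retention, pivot_id):
    """Recency compression that refuses to drop the pivot message."""
    kept = naive_recency(messages, retention)
    if not any(m.id == pivot_id for m in kept):
        pivot = next(m for m in messages if m.id == pivot_id)
        kept = [pivot] + kept[1:]  # evict one recent message to retain the pivot
    return kept

def pivot_preserved(kept, pivot_id):
    return any(m.id == pivot_id for m in kept)

# Ten-turn transcript; pivot sits early enough that aggressive
# truncation drops it but moderate truncation does not.
msgs = [Message(f"m{i}", f"turn {i}") for i in range(10)]
t = Transcript(msgs, pivot_id="m4")

for r in (0.65, 0.50, 0.40):
    naive = pivot_preserved(naive_recency(t.messages, r), t.pivot_id)
    guarded = pivot_preserved(l2_guarded(t.messages, r, t.pivot_id), t.pivot_id)
    print(f"retention={r}: naive={naive}, guarded={guarded}")
```

By construction the guarded policy preserves the pivot at every fraction, while naive recency loses it once the retained window no longer reaches back to the pivot's position.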

Certificate comparison

The portable certificate format was used to record kept and dropped message IDs, enabling direct comparison against the reference run.
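A certificate comparison of this kind can be sketched as follows. The field names (`kept_ids`, `dropped_ids`) and the exact-match rule are illustrative assumptions, not the actual portable certificate schema:

```python
# Hypothetical sketch of comparing a replay run's certificate against the
# reference run: a run matches iff the kept and dropped ID sets agree exactly.
import json

def make_certificate(all_ids, kept_ids):
    """Record which message IDs a compression run kept and dropped."""
    kept_set = set(kept_ids)
    return {
        "kept_ids": [i for i in all_ids if i in kept_set],
        "dropped_ids": [i for i in all_ids if i not in kept_set],
    }

def certificates_match(run, reference):
    """Deterministic replay implies identical kept/dropped sets."""
    return (run["kept_ids"] == reference["kept_ids"]
            and run["dropped_ids"] == reference["dropped_ids"])

all_ids = ["m0", "m1", "m2", "m3", "m4"]
reference = make_certificate(all_ids, ["m0", "m3", "m4"])
grok_run = make_certificate(all_ids, ["m0", "m3", "m4"])

print(json.dumps(reference))
print(certificates_match(grok_run, reference))  # identical runs match
```

Because the certificate records message IDs rather than model output, the same comparison works across providers: a Grok run and the reference run are compared purely at the compression layer.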


Results

Compression behavior across models

The qualitative result is unambiguous: pivot preservation under the guarded policy remains robust across model boundaries, while naive recency continues to collapse it at low retention fractions. The precise per-transcript breakdown is available in the full paper and the committed replay artifacts.

Retention Fraction   Naive Recency (Pivot Preserved)   L2 Guarded (Pivot Preserved)   Mirage Detected
0.65                 1.0                               1.0                            No
0.50                 Partial                           1.0                            Partial
0.40                 0.0                               1.0                            Yes

These are qualitative summaries consistent with the flagship paper's witness data. Exact per-transcript values are in the committed replay artifacts.


Artifacts

Read the full record

The Grok experiment is documented in the research paper and its raw artifacts are available alongside the main replay witness.

Flagship Paper

The Validity Mirage

The full research paper including cross-model validation methodology, witness design, and the mathematical proof of the guarded compression contract.


Validation Artifacts

Replay summaries, portable certificates, and raw validation logs for the committed witness. Compare against your own local verification run.

Research Monorepo

dreams

The full artifact surface including raw replay data, CSV summaries, and certificate JSON files for both the main and Grok experiment runs.

github.com/jack-chaudier/dreams