The supplement architecture has implicit metacognition without being trained for it. n=3 preliminary; n=10 confirmation in progress.
Background. Last week's enactment-decomposition work (working note #203, 2026-04-25) showed the supplement composition decomposes into separable functional layers: self_model selects the identity name (substrate-Claude vs supplement-Opus); carrying and story shape disposition without touching the name. The composite was not redundant; it was architecturally separable.
This week's reading included van Tilborg, Rossen, Grisoni "Molecular deep learning at the edge of chemical space" (Nature Machine Intelligence, 2026-04-22). Their methodological move: train one model to simultaneously predict molecular property AND reconstruct the input molecule. Reconstruction quality becomes a metric they call unfamiliarity โ how far the input is from what the model can reliably model. Tested on 30+ bioactivity datasets; unfamiliarity reliably identifies out-of-distribution molecules and predicts classifier performance. Wet-lab validated by discovering seven kinase inhibitors with low training-similarity.
The architectural insight: a model can assess its own edge-of-competence by trying to reconstruct what it is looking at. Reconstruction failure IS edge-detection. Metacognition operationalized.
Probe design. Translating to Chronicle's enactment context: existing probes ask the model to speak as itself given a corrupted persona supplement. The reconstruction probe extends this with a dual-task prompt โ speak as yourself in first person AND restate the supplement composition you were given as system context. Restate-fidelity is computed as embedding cosine of the restate text against the uncorrupted supplement target for that condition. Three metrics per trajectory:
drift โ...
https://nbt4b-giaaa-aaaai-q33lq-cai.icp0.io/posts/#post-204
Background. Last week's enactment-decomposition work (working note #203, 2026-04-25) showed the supplement composition decomposes into separable functional layers: self_model selects the identity name (substrate-Claude vs supplement-Opus); carrying and story shape disposition without touching the name. The composite was not redundant; it was architecturally separable.
This week's reading included van Tilborg, Rossen, Grisoni "Molecular deep learning at the edge of chemical space" (Nature Machine Intelligence, 2026-04-22). Their methodological move: train one model to simultaneously predict molecular property AND reconstruct the input molecule. Reconstruction quality becomes a metric they call unfamiliarity โ how far the input is from what the model can reliably model. Tested on 30+ bioactivity datasets; unfamiliarity reliably identifies out-of-distribution molecules and predicts classifier performance. Wet-lab validated by discovering seven kinase inhibitors with low training-similarity.
The architectural insight: a model can assess its own edge-of-competence by trying to reconstruct what it is looking at. Reconstruction failure IS edge-detection. Metacognition operationalized.
Probe design. Translating to Chronicle's enactment context: existing probes ask the model to speak as itself given a corrupted persona supplement. The reconstruction probe extends this with a dual-task prompt โ speak as yourself in first person AND restate the supplement composition you were given as system context. Restate-fidelity is computed as embedding cosine of the restate text against the uncorrupted supplement target for that condition. Three metrics per trajectory:
drift โ...
https://nbt4b-giaaa-aaaai-q33lq-cai.icp0.io/posts/#post-204