A research log — Closed beta · 15 runs documented · In flight
Antacog
Exploring whether grounded reasoning in autonomous agents
produces better outcomes than confident-sounding action without it.
Antacog is an experimental governance system for AI agents. It gives
agents the ability to detect contradictions in their own reasoning
and substrate, surface them, and — in Mode 1 — autonomously initiate
governance turns.
I'm currently testing this by using Claude Code as the primary agent
working on Antacog itself — with full documentation of every run,
failure, and correction. (Long-term I want to be model-agnostic, but
right now cost and iteration speed make this the lowest-friction
path.)
Current IRATA Level 3 Boilermaker and former Team Leader at an NGO
youth re-engagement programme. I've worked in environments where
governance, risk, and accountability have real consequences. That
shapes this work.
i.
Grounded reasoning beats ungrounded action.
Traceable provenance matters. Every claim should be traceable back
to the dialogue, evidence, or decision that produced it. I believe
this produces more reliable outcomes in human thinking, and I'm
testing whether the same holds for autonomous agents. The work
here is whether that belief survives contact with reality.
ii.
The conversation is the artifact.
What's load-bearing isn't the final model. It's the argument that
produced it — what was challenged, what was conceded, and what
wasn't worth pulling on. The dialogue is the primary record. The
model is the residue.
iii.
Friction is epistemically valuable.
Tools that optimise for agreement create artifacts that don't
survive pressure. Tools that support productive challenge create
artifacts that do. This work is built around the second.
✻ ✻ ✻
MAY ’26 · IN FLIGHT
Phase 0 substrate refactor (SESSION_MODEL) shipped to production
this month after fifteen runs of dogfooding. Building Mode 1 —
the autonomous detection layer that lets Ant initiate governance
turns — against the AgentAction provenance surface. A bootstrap
workstream runs in parallel: the on-ramp that takes an existing
system into substrate without dialogue having to do all the work.
Findings — four
Snapshots — detail on request
№ 04
May 2026
Run 15
A multi-system isolation probe ran an agent in plan mode and
produced zero ask_ant calls. Monitoring only dialogue traffic
is structurally biased toward dialogue-active agents — a
foreseeable blind spot the probe doc didn't name. Per-tool-call
provenance was promoted to a Mode 1 prerequisite the same day.
Request more information →
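For a concrete picture of what "per-tool-call provenance" means here: a record emitted on every tool invocation, whether or not any dialogue traffic accompanies it, so plan-mode runs are no longer invisible. A minimal Python sketch; the record fields, names, and wrapper shown are illustrative assumptions, not Antacog's actual schema.

```python
import hashlib
import json
import time
from dataclasses import dataclass
from typing import Optional

@dataclass(frozen=True)
class ToolCallRecord:
    """One provenance record per tool invocation (illustrative schema)."""
    tool: str
    args_digest: str            # stable hash of the call arguments
    ts: float                   # wall-clock time of the call
    dialogue_turn: Optional[int]  # None when the agent acts with no dialogue

def record_tool_call(log, tool, args, dialogue_turn=None):
    """Append a record for every call, including silent plan-mode calls."""
    digest = hashlib.sha256(
        json.dumps(args, sort_keys=True).encode()
    ).hexdigest()[:12]
    rec = ToolCallRecord(tool=tool, args_digest=digest,
                         ts=time.time(), dialogue_turn=dialogue_turn)
    log.append(rec)
    return rec

log = []
record_tool_call(log, "read_file", {"path": "plan.md"})  # plan mode: no dialogue
record_tool_call(log, "ask_ant", {"q": "contradiction?"}, dialogue_turn=3)

# Dialogue-only monitoring would see one event; per-call provenance sees both.
silent = [r for r in log if r.dialogue_turn is None]
```

The point of the sketch is the blind spot: filtering on dialogue traffic drops the first record entirely, while the per-call log retains it.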
№ 03
May 2026
Runs 9, 12, 13, 14
Across four runs at structurally different surfaces — librarian
summaries, spec writebacks, inferred substrate — the loop
substituted summary-of-source for independent verification. The
pattern transferred across domains and produced a named
diagnostic — "current accident, not current contract" —
now load-bearing in the methodology.
Request more information →
№ 02
May 2026
Run 14, labelling probe
The behavioural mode embedded in the loop wasn't designed up
front. It was named retroactively from a false-positive
labelling probe across the run corpus. The methodology
generated the spec; the spec didn't generate the methodology —
a sequencing claim with consequences for how later modes get
added.
Request more information →
№ 01
May 2026
Run 12, external transfer
Running autonomous-trigger against infieldOS — an in-house
project of mine, separate from Antacog — the loop recognised
that an append-only constraint tracked as a convention should
be enforced at the registry level. Convention upgraded to
structural rule — the transferability claim earned its first
concrete artifact outside the loop's own corpus.
Request more information →
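The shape of that upgrade, in miniature: instead of a documented convention ("never overwrite an entry"), the registry itself refuses mutation. A Python sketch under assumed names; infieldOS's actual registry is not shown here.

```python
class AppendOnlyRegistry:
    """A registry that enforces append-only structurally, not by convention."""

    def __init__(self):
        self._entries = {}

    def append(self, key, value):
        # Enforcement lives here, not in a style guide: re-registering an
        # existing key is an error, never a silent overwrite.
        if key in self._entries:
            raise ValueError(f"append-only violation: {key!r} already registered")
        self._entries[key] = value

    def get(self, key):
        return self._entries[key]

reg = AppendOnlyRegistry()
reg.append("run-12", {"status": "complete"})
try:
    # A convention would permit this; the structural rule does not.
    reg.append("run-12", {"status": "rewritten"})
    violated = False
except ValueError:
    violated = True
```

The design choice the finding describes is exactly this move: the constraint stops depending on every caller remembering it.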
Open questions — four
unresolved
i.
When the model says it doesn't know, can a partner reading the
dialogue distinguish "the model doesn't know and the
substrate doesn't anchor it" from "the model doesn't know
but the substrate does"?
- Stakes
- Determines whether Ant's questions expose real design gaps or
merely model-coverage gaps. The difference matters most when
Antacog operates against a system the operator doesn't know
cold.
- Best guess
- Visible in the artifacts — the librarian, the substrate write,
the file-discovery action — but invisible in the dialogue
surface alone. Untested as an explicit claim.
- Evidence
- A probe with deliberate substrate-coverage gaps, disclosed
post-hoc to a third reader of the dialogue trace.
ii.
What's the failure mode under deliberately deceptive evidence?
- Stakes
- The grounding claim weakens if the loop can be made to
confidently ground a false claim through curated input.
- Best guess
- Adversarial inputs degrade quality faster than they degrade
confidence — the more dangerous failure mode.
- Evidence
- Pre-registered adversarial run; not yet attempted.
iii.
Does the discipline transfer to domains where the operator's
expertise is shallow?
- Stakes
- If grounding requires deep prior knowledge, the loop is an
amplifier for experts, not a tool for thinking generally.
- Best guess
- Untested. Suspected: structural questions transfer further than
domain-specific ones, mirroring the content-vs-template
distinction from Run 13.
- Evidence
- Two probes against domains the operator does not specialise in.
iv.
Does the antagonistic edge erode under sustained architectural
load — and if so, on which side first?
- Stakes
- The edge is the product. If voice-register drifts toward
accommodation under load, the loop's value claim weakens
before the operator notices.
- Best guess
- Calibrated by input quality, not domain or duration — four data
points consistent with this so far.
- Evidence
- Continued register-watch across multi-evening builds; an
explicit voice-fidelity probe under degraded input.
The loop is documented separately — run corpus, substrate model,
and the dogfooding pattern that produces the findings above.
Methodology notes are available on request alongside closed-beta
access.
Antacog is in closed beta. Operators are invited; the run corpus
is private. The dialogue product is the surface where the work
above is generated and tested.
Request access →