Essay 6.1 — The Markov Phasic Brain, Part 1 of 10. Essay 6 opens here; Parts 2 through 10 follow.


Now we open the cycle.

Essay 5 introduced two pieces of the seed agent’s foundation, side by side. The always-on plugins are the agent’s reflexes — each one owns a concern that fires regardless of which phase the agent is in or whether a phase is even active: plugin edit safety, context window discipline, job lifecycle, interaction legibility, structured questioning. They run continuously, each in its own lane, with its own state. The CLAUDE.md layer is the substrate the agent’s structured cognitive work writes into — a hierarchy of CLAUDE.md files, a knowledge directory, a memory layer that survives across phases and sessions.

This essay opens the system that does the structured work — the phasic system — and how it uses the CLAUDE.md layer to think ahead, gather experiential data, and process that data into the agent’s longer-term memory forms.

The seed agent’s cognitive work happens in phases. A phase is a temporary mode of operation, scoped to one job, with a strictly defined purpose, a strictly defined set of allowed tools, and a strictly defined kind of output. One phase is active at a time. Phases progress in a fixed order. The agent cannot skip a phase, cannot blend two phases, cannot stay inside a phase indefinitely.

The current prototype runs the OPEVC cycle (currently five phases in the prototype: observe, plan, execute, verify, and condense). OPEVC is the acronym, the name the brain calls its own cycle. The final phase, CONDENSE, plays a different role from the others. The earlier phases do work on the project; CONDENSE does work on the brain. We call CONDENSE the cycle’s cognitive organ for that reason. Still, it sits inside the OPEVC ring, not outside it. The architecture supports adding more. A custom seed could introduce a research phase between observe and plan, or split execute into execute and integrate. The current phase count is the prototype’s answer; the discipline of compartmentalized phasing is the architecture.

Two states travelled with us across the previous essay without being named: idle and gmode. We left them out of Essay 5 to keep the substrate description clean. They come back now, because the full Markov brain does not run without them.

What we are about to open is what Essay 3.1 called the agent’s cognitive metabolism — the rhythm of breathing in context, working on it, breathing out memory. The phasic layer is that metabolism made mechanical.

This essay opens that discipline compartment by compartment.

The journey ahead

Essay 6 unfolds across short sub-essays:

Essay 6.2 maps the discipline; Essays 6.3 through 6.7 deep-dive each phase, one per essay. Essays 6.8 through 6.10 are for the architects in the audience — the mechanisms that let the cycle stay honest across long horizons.


From Action Space to Markov Brain

In Essay 1 we drew the seed agent’s starting condition as an action space — the set of moves a CLI agent can pick at any given moment. Use deep reasoning. Use tools — read, write, edit, run. Ask for permission. Delegate to another agent. Stop. Each step the LLM samples from those moves probabilistically. The next step samples again. The result, without further structure, is what that essay called a random walk — a Markov chain where every path is equally likely, and the same prompt twice can produce two different journeys.

Then we added the first layer of structure: hooks. Hooks fire on every tool call, every stop, every prompt. They can block an action, modify it, or trigger another. In Essay 1's framing, hooks turn the probabilistic chain into a deterministic pipeline. The LLM still proposes the next move; the hooks decide whether the move is allowed.

The phasic layer takes that idea one fractal step further.

A phase is a flavor of the action space. Inside OBSERVE, the action space is narrowed — write tools to project files are gone, only the read-and-synthesize moves are available. Inside EXECUTE, the action space widens for the altered list and tightens elsewhere. Each phase is a customized Markov chain — its own restricted action space, its own hook-enforced rules, its own bias toward the kind of move that phase wants to encourage.

The cycle on top is itself a Markov chain — but its "actions" are not individual tool calls. Its actions are the phases. The agent moves between phases. Each move from one phase to the next obeys gates and edges declared by the orchestrator plugin. A Markov brain whose moves are themselves Markov chains. Fractal, by design.

This is what the title points at. The phasic brain is Markov on two levels. Inside any phase, the agent’s tool calls are a constrained action-space chain. Across the cycle, the phases themselves are a state machine with declared edges and no hidden continuation.

The OPEVC cycle. Forward edges (white) advance automatically. Backward edges (brown) are explicit choices. Idle holds the agent between cycles; condense returns to idle.
The OPEVC cycle. Forward edges (white) advance automatically. Backward edges (brown) are explicit choices. Idle holds the agent between cycles; condense returns to idle.

Why Phases

The naive version of agent cognition is: "read the prompt, do the thing." A user asks for a feature; the agent goes off and implements it. Whatever observation, planning, building, and checking happens, happens in one undifferentiated stream of tool calls.

This breaks for the same reason a one-line safety script breaks. There are several different kinds of cognitive work, each with different needs, and mixing them produces sloppy work in all of them.

Observation needs breadth. Planning needs alternatives. Execution needs speed. Verification needs independence from execution. Each one favors a different mental posture, and each one favors a different set of tools. The default agent runs all four kinds of work through the same mode, and the result is what every operator who has tried to drive an agent through a non-trivial task has seen: the agent jumps to implementation before it has read enough; it improvises mid-execute and rationalizes the improvisation as the plan; it self-verifies, sees the work as correct because it was the one who built it, and ships a regression.

Phases force separation. The kind of cognition the agent is doing is announced. The tools it has access to match the kind. The output it produces is tagged with which phase it came from. When the agent transitions, the system commits the prior phase’s work as an episodic memory before unlocking the next phase. There is no quietly drifting from one mode into another.

The mechanism is mechanical. The justification is cognitive: separated kinds of thinking produce better thinking.

That justification is the load-bearing claim of the phasic layer.

A handful of mechanisms inside the cycle deserve a brief mention here before the next sub-essays open them. Every phase entry locks every tool until the agent sets a multiplier — an inverse-direction scalar capturing the agent’s forecast of how scope-heavy this phase will be for this job. A small multiplier declares "this is going to be heavy"; a large one declares "this should be quick." The choice is permanent for the cycle.

Behind the scenes the multiplier scales action-points against a point floor that must clear before advance becomes permissible — but the agent never sees points in any voice or injection; coaching stays qualitative to keep the gate from inviting point-chasing behavior.

Separately, every phase carries its own direct-action budget, an independent meter that tracks the ratio of direct file work versus subagent delegation and uses that ratio to inject voices or block actions.

CONDENSE carries a fix-in-cycle gate that refuses to close the cycle until bugs introduced in this very cycle are repaired first.

Job-graph mutations follow a lifecycle-symmetry rule — the phase that adds graph state is rarely the phase that removes it, because the right context for each operation lives in a different cognitive posture. We name each mechanism here so the sub-essays can use the labels; their machinery opens one by one.

What you would customize

The phasic layer is the most opinionated piece of the prototype, and almost every dimension of it is a customization surface. The architecture is the shape; the specific dials are the prototype’s answers.

The architect would tune the phase count. The prototype’s phase set — currently observe, plan, execute, verify, condense — covers the work of designing the seed agent itself. A seed working on long literature reviews might want a research phase between observe and plan, where deep external reading happens with a different budget arithmetic than observe’s broad sweep. A seed working in regulated drafting might split execute into execute and integrate. The phase count is a knob; the discipline of compartmentalization is the floor.

The architect would tune the acronym itself. OPEVC is the prototype’s name for its cycle. A custom seed could call it RUNS, OPERATE, or any other word that catches the kinds of cognitive work it values. The name shapes how the architect talks to their own seed; the talk shapes how the seed sees the work.

The architect would tune the tool-restriction granularity. The prototype publishes one guard per phase, each one with its own allow-and-block list of tools. The same architecture supports finer grain — per-phase block-lists scoped by subagent type, allow-lists conditioned on the focused job’s form, time-of-day rules for long-running research seeds. The guards are code; the granularity is what each architect’s work demands.

What the architect would not customize is the principle that each phase publishes its restrictions ahead of time and the guard enforces them. The principle is the floor: a phase that doesn’t fence the agent in is not a phase, it is a label.

The shape lifts cleanly off this prototype. A research lab’s seed could run literature-review jobs through read-source, extract-claim, cross-check, and synthesize phases — each with its own tool fence, writes forbidden during read-source, new sources forbidden during synthesize. A consulting seed could split client engagements into intake, match, scope, draft-deliverable, and review, with the drafting phase locked out of the client-source-data tools so it cannot improvise new facts mid-prose. The phasic cycle is the architecture; the phase names and the tool fences are yours. The honest limit is that the guards stop wrong-tool calls; they cannot stop a creative operator from working around the spirit of a phase in their prose. The discipline rests on the architect reading the injected voice and choosing to obey it. Gmode is the documented escape hatch when working around the discipline is the right move.


The foundation is in place: a Markov brain whose moves are themselves Markov chains, phases as the structural answer to mixed-mode cognition. The next sub-essay maps every edge of the cycle — the full transition graph, the discipline the per-phase tool restrictions enforce, and a quick map of what each phase produces before we open the compartments one at a time.


Essay 6.1 — The Markov Phasic Brain, Part 1 of 10.

Previous: Essay 5.9 — The Customization Guardrail — the gate that decides when substrate edits are admitted. Next: Essay 6.2 — The Discipline and the Map — the full transition graph and the tool-restriction pedagogy.