Brad Zhang / public archive

Brad Zhang

AI product notes, agent workflow writing, and open-source dossiers for founders and early technical teams.

Longform / primary stage

All X long posts and article-length notes, staged as the main read.

This section prioritizes X long posts and article-style writing above 500 characters, so the strongest arguments stop disappearing inside a fast feed.

Lead longform

Ongoing · Original Essay · 12 min read · 725 chars

Memory Is Not Context

A long-form essay on why agent memory should be treated as editorial selection rather than infinite accumulation.

Memory Is Not Context
A long-form essay on why agent memory should be treated as editorial selection rather than infinite accumulation.

Visual structure

Essay structure map

Built from summary and key paragraph positions

Memory Is Not ContextTHESISA long-form essay onwhy agent memoryshould be treated aseditorial selectionSIGNALMost agent systemsstill treat memory asan overflow bucket.When context windowsOPERATORA useful memory layerneeds at least threeeditorial acts:deciding what is worthIMPLICATIONThis is why the designof memory should sitcloser to publishing,indexing, and
Mermaid outline
flowchart LR
  thesis["A long-form essay on why agent memory should be treated as editorial selection rather than infinite accumul..."]
  signal["Most agent systems still treat memory as an overflow bucket. When context windows become expensive, people..."]
  operator["A useful memory layer needs at least three editorial acts: deciding what is worth preserving, deciding when..."]
  implication["This is why the design of memory should sit closer to publishing, indexing, and annotation than to raw buff..."]
  thesis -->|frames| signal
  signal -->|develops| operator
  operator -->|lands in| implication

Most agent systems still treat memory as an overflow bucket. When context windows become expensive, people start saving everything elsewhere and call the result memory. But durable memory is not a storage problem first. It is a judgment problem.

A useful memory layer needs at least three editorial acts: deciding what is worth preserving, deciding when it should be recalled, and deciding how much of it deserves to return. Without those acts, recall becomes clutter and persistence becomes vanity.

This is why the design of memory should sit closer to publishing, indexing, and annotation than to raw buffering. The system is not rewarded for remembering more. It is rewarded for remembering what changes the next decision.

Period

Undated

That is why recoverability matters more than continuity theater. Good systems survive partial forgetting by keeping explicit checkpoints, ownership boundaries, and summaries tha...

When Agents Forget, Design Finally Begins

A systems view of forgetting, compaction, and recoverability in multi-agent workflows.

Entry surface

Essay page

When Agents Forget, Design Finally Begins

Essay page / When Agents Forget, Design Finally Begins

10 min / standalone detail page and structure diagram

Forum edge

Board route

Field-notes board and release cadence

Board route / field-notes board

Return this longform entry to the forum’s live cadence.

Proof continuation

Proof route

Inspectable repository dossiers for this argument

Proof route / repository dossiers

Repositories, READMEs, and dossiers as proof.

That is why recoverability matters more than continuity theater. Good systems survive partial forgetting by keeping explicit checkpoints, ownership boundaries, and summaries that can be audited.

Visual structure

Essay structure map

Built from summary and key paragraph positions

When Agents Forget, Design Finally BeginsTHESISA systems view offorgetting,compaction, andrecoverability inSIGNALThe third handoff inan agent workflow isoften where theillusion breaks.OPERATORThat is whyrecoverability mattersmore than continuitytheater. Good systemsIMPLICATIONIn that sense,forgetting is not theopposite of design. Itis what forces design
Mermaid outline
flowchart LR
  thesis["A systems view of forgetting, compaction, and recoverability in multi-agent workflows."]
  signal["The third handoff in an agent workflow is often where the illusion breaks. Responsibility becomes vague, co..."]
  operator["That is why recoverability matters more than continuity theater. Good systems survive partial forgetting by..."]
  implication["In that sense, forgetting is not the opposite of design. It is what forces design to become visible."]
  thesis -->|frames| signal
  signal -->|develops| operator
  operator -->|lands in| implication

The third handoff in an agent workflow is often where the illusion breaks. Responsibility becomes vague, context becomes inherited rather than owned, and every participant assumes someone else is still tracking the missing details.

That is why recoverability matters more than continuity theater. Good systems survive partial forgetting by keeping explicit checkpoints, ownership boundaries, and summaries that can be audited.

In that sense, forgetting is not the opposite of design. It is what forces design to become visible.

Essay page

Direct read

When Agents Forget, Design Finally Begins

Open the standalone essay page for the structure diagram, rail, and direct URL.

Essay page / When Agents Forget, Design Finally Begins

10 min / direct essay detail entry

Forum

Board route

Field-notes board and time layer

Return to the faster field-notes board for cadence and release context.

Board route / field-notes board

Forum board / time layer

Signal map

Signal route

thesis clusters

Place this essay back into the wider thesis and topic clusters.

Topic route / thesis clusters

Thesis / topic continuation

Proof link

Proof route

Inspectable repository dossiers

Match the argument against repositories, READMEs, and dossiers as shipped proof.

Proof route / repository dossiers

Open source / inspectable proof

Inference as a Publishing Problem

Serving, latency, and evaluation become clearer when treated as editorial constraints instead of abstract infra metrics.

Entry surface

Essay page

Inference as a Publishing Problem

Essay page / Inference as a Publishing Problem

9 min / standalone detail page and structure diagram

Forum edge

Board route

Field-notes board and release cadence

Board route / field-notes board

Return this longform entry to the forum’s live cadence.

Proof continuation

Proof route

Inspectable repository dossiers for this argument

Proof route / repository dossiers

Repositories, READMEs, and dossiers as proof.

Seen this way, serving is not just distribution of compute. It is a form of editorial prioritization. Some results deserve fast publication. Others deserve delay, review, or suppression.

Inference infrastructure is often described in the language of throughput and cost. Those metrics matter, but they only become meaningful when tied back to visibility: what the user sees, how quickly they see it, and what level of certainty the system is willing to attach to it.

Seen this way, serving is not just distribution of compute. It is a form of editorial prioritization. Some results deserve fast publication. Others deserve delay, review, or suppression.

Evaluation then stops being an afterthought and becomes the discipline that defines what may be published automatically at all.

Essay page

Direct read

Inference as a Publishing Problem

Open the standalone essay page for the structure diagram, rail, and direct URL.

Essay page / Inference as a Publishing Problem

9 min / direct essay detail entry

Forum

Board route

Field-notes board and time layer

Return to the faster field-notes board for cadence and release context.

Board route / field-notes board

Forum board / time layer

Signal map

Signal route

thesis clusters

Place this essay back into the wider thesis and topic clusters.

Topic route / thesis clusters

Thesis / topic continuation

Proof link

Proof route

Inspectable repository dossiers

Match the argument against repositories, READMEs, and dossiers as shipped proof.

Proof route / repository dossiers

Open source / inspectable proof