AI-native developer tooling / agent workflows

Brad Zhang

Founding engineer and applied AI product builder for devtools, agent systems, workflow reliability, and open-source distribution.

X / @teach_fireworks

Archive / Long-form directory

Arrange the essays into a line worth returning to.

This archive is organized for rereading, not for feed-speed recency. Start with the cover essay, or move backward through recent entries and monthly sections.

Essays3
Deep reads3
Periods1
Archive cover

Start here

Memory Is Not Context

Useful memory is not what survives by default. It is what a system learns to preserve on purpose.

Most agent systems still treat memory as an overflow bucket. When context windows become expensive, people start saving everything elsewhere and call the result memory. But durable memory is not a storage problem first.…

Ongoing · Original Essay · 12 min read

Period

Undated

3 entries

Ongoing · Original Essay · 12 min read

Memory Is Not Context

A long-form essay on why agent memory should be treated as editorial selection rather than infinite accumulation.

Most agent systems still treat memory as an overflow bucket. When context windows become expensive, people start saving everything elsewhere and call the result memory. But durable memory is not a storage problem first.…

Ongoing · WeChat Import · 10 min read

When Agents Forget, Design Finally Begins

A systems view of forgetting, compaction, and recoverability in multi-agent workflows.

The third handoff in an agent workflow is often where the illusion breaks. Responsibility becomes vague, context becomes inherited rather than owned, and every participant assumes someone else is still tracking the miss…

Ongoing · X Thread · 9 min read

Inference as a Publishing Problem

Serving, latency, and evaluation become clearer when treated as editorial constraints instead of abstract infra metrics.

Inference infrastructure is often described in the language of throughput and cost. Those metrics matter, but they only become meaningful when tied back to visibility: what the user sees, how quickly they see it, and wh…

Archive | Brad Zhang