2 minute read

A great demo is not engineering.

Agentic systems can look convincing right up until they meet real operating conditions.

The layoff reversals from Part 1 made the problem visible: companies automated visible tasks before they engineered the workflow around them.

The field can produce impressive one-off results. What it still lacks, in most cases, is the operating discipline that makes those results repeatable.

Craft works until it doesn’t

Early in any field, systems are built through skill, intuition, and iteration.

They rely on:

  • Individual expertise
  • Informal workflows
  • Trial and error

That is where agentic systems still are.

Success often depends on someone who knows how to phrase prompts, recover from failures, and manually steer execution back on course. That is not yet a robust operating model. It is a skilled operator compensating for missing structure.

That is what companies that removed humans early ran into: they mistook operator effort for system design. The hidden coordination work was still there. It only became visible once the system started failing.

The signs of an artisanal system

You can recognize this stage quickly.

The system works, but:

  • Results vary depending on who operates it
  • Knowledge is implicit instead of encoded
  • Recovery is manual instead of built into the workflow
  • Scaling requires more intervention instead of more reuse

In engineering terms, the system is not repeatable.

Anthropic describes the operational root of this problem directly: context is a finite resource, so systems have to decide what to carry forward and what to compress. If that decision still lives in one operator’s head, the workflow has not matured.

The guidance is already shifting

By 2025 and 2026, the official guidance had already moved away from prompt craft as the main design surface.

The shift is from improvisation to workflow engineering:

  • Anthropic calls the problem context engineering: deciding what information the system should carry forward, in what form, and for which next step.
  • OpenAI recommends starting with the simplest agent that can work and only adding more specialization when prompt complexity, tool overload, or task divergence genuinely require it.

The point is not to eliminate prompts. It is to stop treating prompt skill as the system.

From craft to engineering

This transition is familiar because every engineering field goes through it.

Artisanal phase Engineered phase
Operator memory Explicit artifacts
Prompt heroes Defined roles
Manual recovery Validation and repeatable handoffs

Agentic systems are still early enough that outcomes often depend more on how the workflow is operated than on how it is designed.

That is not unusual. It is what immaturity looks like.

But maturity does not remove the next problem.

The next pressure

Moving from craft to structure makes a system more repeatable, but it also makes its limits easier to see.

Once work has to move across multiple stages, throughput is no longer set by individual brilliance. It is set by sequencing, coordination, and the slowest point in the flow.

👉 Part 3: Agentic systems are bound by the same fundamental limits