Benque Max AI LabJune 2026

AI is not a tool. In just a few months, it works with you.

You're probably enabling Claude Cowork wrong.

What do you choose?

“You take the blue pill — you stay in the past. Humans are the bottleneck. You will be stuck at 10x. You get out only if Anthropic lets you.
You take the red pill— you step into the future. Humans aren't the bottleneck anymore. You augment the whole organisation.”

Blue pill

AI for human augmentation

Make every individual faster. Tools the human reaches for — Claude Cowork on a laptop, Claude Code in the terminal, MCP-connected apps. Productivity gains are immediate, the friction of implementation is low, and the shape of the work mostly stays the same.

Path A · Today

Red pill

AI for community augmentation

Change the shape of the organisation itself. Persistent coworkers in the cloud — Claude Managed Agents, OpenClaw, signal-driven workers — owning chunks of work end-to-end across the whole team. They wake on signals, deliver, log, and stop. The org's capacity grows, not just any one person's.

Path B · What's coming

The trick is that you swallow both. But which pill first decides what kind of company you'll be in eighteen months.

The question for the coming months

Do we enable humans, or do we build autonomous systems?

Strip the metaphor and that's where every team running AI work seriously is going to land this year. The answer is both — but the order matters, and the defaults don't help.

The frontier is moving in this direction. Anthropic is shifting its centre of gravity from Claude Code toward Claude Managed Agents; OpenAI is doing the same with OpenClaw. The roadmap of the labs is the second path, not the first.

But the labs are slower than the operating companies can (and need to) be. And the foundation a company builds for path A doesn't automatically support path B. By default, three things break:

Default MCPEnabling an MCP server for a human's AI client is one-off access on one laptop. The autonomous worker running tomorrow in the cloud doesn't inherit it. To scale that setup you'd technically have to leave the laptop running 24/7.
Default OAuthThe standard OAuth flow authenticates a human. A persistent background worker needs machine-to-machine credentials with their own permission scopes — and those aren't what gets set up when an employee clicks “allow.”
Default dataConversations, tool calls, and overrides aren't captured anywhere the organisation can learn from. No feedback loop, no signal for what to automate next, no recursive self-improvement.

A foundation that supports both paths can be built. By default it isn't.

Let me show you

Here's how human augmentation with AI works.

Easiest way to make this concrete is to drop you inside a working day. An agent builds the scene around you — your company, your persona, your boss — from one or two lines you give it. Then we run a real moment of work.

Tell it a B2B SaaS company you could imagine working at and a sentence on what it does. Or click Use a sample. Anything obviously gibberish gets quietly replaced by something plausible.

Company you work at

Ramp

What it does

Corporate cards and finance automation. Replaces expense reports, manual reconciliation, and old-school T&E.

Hit play and watch an agent build your whole working world around Ramp — persona, boss, pipeline — tool call by tool call.

At its core this is a fully functioning agentic system — real agents, a CRM, Claude Cowork, the works. What you're watching is a recording of one real run.

Step 1 of 5

There's still more to the demo

Type a B2B SaaS company above (or click Use a sample) and hit Build to get going.

Contents

§1What we just saw
§2Why the hard part isn't the code

§1 · Recap of what we just saw

Augment humans now — and build the architecture ready for what's coming.

Two architectures, running the same underlying agentic loop. Path A is the tool the human reaches for — immediate productivity gains, the shape of the work mostly unchanged. Path B is the persistent coworker that wakes on a signal, reads its memory files, decides whether to act, and reports back. Capacity grows for the team rather than for one person at a keyboard.

The point is that the loop in the middle is the same. What makes one a tool and the other a coworker is what gets built around it — the activation surface, the persistence layer, the scoped permissions, the data the system can learn from. Building for Path A in a way that doesn't support Path B six months later is the trap that's easy to fall into by default.

So the move is to enable humans first and to use that work to lay the foundation Path B will rest on. Not two stacks; one foundation that supports both surfaces, with the human-facing surface shipped first because that's where value lands fastest and where the organisation learns what is worth automating next.

§2 · Why the hard part isn't the code

The architecture is the easy part.

The pieces in this demo — an agent loop, scoped tools, memory files, a signal-driven wake — are tractable. A small team can build the technical side in weeks. What decides whether an effort like this compounds or stalls is almost never technical. It's whether everyone stays honest about what's actually changing.

The comfortable story is “these are just tools to make people faster.” It holds for about a quarter. Then a coworker is quietly owning a chunk of work end-to-end, and the distance between the story and the reality starts costing trust. The version that lasts names the shift out loud — what's moving onto AI coworkers, what stays with people, on what timeline — said early, by someone with the standing to mean it.

And the systems that pull ahead are the ones that learn from what they can see. Every conversation, tool call, and human override is signal: for what to automate next, and for how to do it better. The earliest data is the most valuable — it's what teaches a system the shape of the work. Captured deliberately, with scoped access and audit logs, that feedback loop is what turns a handful of isolated agents into something with organisational intuition over time.

Optional · extra reading

Extra reading.

A Benque Max AI Lab publication

About the author

Markus Hav — Lead Researcher, Agents at Benque Max AI Lab, an AI research lab in Finland working on the agent layer above LLMs — self-emerging agentic patterns and autonomous behaviours.

Markus also works as Head of AI Automation at Hoxhunt, and brings years of self-emerging agent research alongside a background in finance and software development. This demo is one of the lab's explorations of what the agent layer makes possible.

Connect with Markus on LinkedIn →