§ Agentic Cloud · Cloudflare-native

Production AI agents on Cloudflare. Durable, observable, shipped.

A demo agent answers a prompt. A production agent remembers, retries, acts on tools, waits for a human, and survives a restart. We build the second kind — on the agentic cloud, with the primitives that handle the hard parts natively.

Scope an agent build →See the stack ↓

Verified against Agents Week 2026 (Apr 13–20) · re-checked monthly

§ 01 / The thesis

The cloud was built for one app serving many users. Agents break that model.

If even a fraction of knowledge workers each run several agents in parallel, you need compute for tens of millions of simultaneous sessions. Stateless functions have no GPU, no persistent per-instance memory, and a hard timeout — so you bolt on a database, a queue, a scheduler, and a session store. Cloudflare's stateful-serverless model gives each agent its own durable micro-server, with no idle cost.

Stateless function + bolt-ons

Lambda-style request/response. No GPU. No per-instance state. 900-second cap. You assemble a database, a queue, a cron service, a session store, and a vector DB — then operate all of it. The agent is the glue between five systems you maintain.

Agent = Durable Object

Each agent instance is a stateful micro-server with its own embedded SQL, WebSockets, and scheduling. It hibernates when idle (costs nothing), wakes on an event, and resumes exactly where it left off. The data tier, queue, and scheduler collapse into the agent itself.

§ 02 / The agentic-cloud stack

One request, traced through the production stack.

Every model call an agent makes flows through one named, observable path. Watch a single request travel the layers — each is a first-partyCloudflare primitive, not a system you stand up and operate.

// agent request lifecycle tracing

A request enters through identity, routes through the gateway, infers, calls tools, acts in a sandbox, and is wrapped in durable execution end to end. Tap any layer.

§ 03 / What production agents actually require

Seven things separate a demo from a deployment.

Most agent projects don't fail on model quality. They fail on reliability, state, and scope. Cloudflare's stack maps to each requirement natively — that mapping is the whole argument.

01
Durable state
Survives restarts, deploys, and failures. Per-agent SQL inside a Durable Object.
Agents SDK · DO
02
Error recovery & retries
Automatic, idempotent, step-level. Resume from the last successful checkpoint.
Workflows V2
03
Observability
Logs, traces, token + cost analytics on every inference, in one place.
AI Gateway
04
Cost control
Scale-to-zero, caching, rate limits, budget caps. No idle GPU spend.
Workers AI · Gateway
05
Human-in-the-loop
Approval gates for high-stakes actions — wait hours or days for a person.
step.waitForEvent
06
Tool security
Scoped OAuth, least privilege, egress control. The agent never sees the token.
Mesh · OAuth · egress proxy
07
Guardrails + bounded scope
Content moderation and a defined task boundary. An agent that can do anything is an agent you can't ship. Bounded scope is the difference between a tool and a liability.
Gateway Guardrails

§ 04 / Nine building blocks

The first-party primitives we build on.

01
Agents SDK
Agent = Durable Object. Per-agent state, WebSockets, scheduling, and now long-running sessions without eviction.
Durable state
02
Workflows V2
Durable execution — 50K concurrency, 300 creates/sec, step checkpoints, sleep to one year, human-in-the-loop.
Error recovery + HITL
03
Workers AI
Serverless GPU. Unified env.AI.run() binding — 70+ models across 12+ providers, multimodal, one line to switch.
Low-latency inference
04
AI Gateway
The inference layer for agents — observability, caching, unified billing, cross-provider failover, streaming resilience.
Observability + cost
05
MCP hosting
McpAgent on Durable Objects, workers-oauth-provider, MCP Server Portals, Code Mode for token efficiency.
Tool layer + security
06
Browser Run
The web action layer — 120 concurrent browsers, Live View, human-in-the-loop, CDP, WebMCP, session recordings.
Act · web
07
Sandboxes + Dynamic Workers
Isolated Linux computer per agent, or a millisecond isolate for snippets. Per-app DBs via DO Facets.
Act · code
08
Agent Memory + AI Search
Managed persistent memory that gets smarter over time, plus hybrid-retrieval RAG created dynamically per agent.
Memory + grounding
09
Security & identity
Cloudflare Mesh + Workers VPC, Managed OAuth (RFC 9728), resource-scoped permissions, sandbox egress proxy.
Guardrails + scope

// Supporting: Durable Objects · Vectorize · R2 · D1 · KV · Queues. Multi-channel add-ons: Voice (experimental), Email (beta). Preview/beta status verified against Agents Week 2026 — re-checked at each engagement.

§ 05 / Built for the workload shape

"The one-app-serves-many-users model the cloud was built on doesn't work for tens of millions of simultaneous agent sessions."
— Cloudflare CTO Dane Knecht & VP Product Rita Kozlov, Agents Week 2026. Each lit cell is an agent waking, acting, and hibernating independently — one Durable Object apiece, scale-to-zero.

§ 06 / The honest risk layer

Most agent projects fail. Here's the data — and why it argues for building them right.

40%+
of agentic AI projects will be canceled by end of 2027 — escalating costs, unclear value, inadequate risk controls.
Gartner · Jun 2025 · n=3,412
95%
of integrated GenAI pilots show no measurable P&L impact. The cause is the integration gap, not model quality.
MIT NANDA · 2025 · directional
$1.4B
Agentforce + Data 360 ARR, up 114% Y/Y, 9,500+ paid deals. Demand is real and monetizing.
Salesforce · Q3 FY26 · audited

The market data isn't an argument against agents. It's an argument for building them bounded, durable, and observable — which is exactly what the production stack provides, and exactly what a flashy demo that can't survive a restart never will.

§ 07 / Build vs. buy vs. us

Three ways to get an agent. One ships in 90 days.

There are three honest paths to production AI agent development. Here's how they compare — and why the one that ships in 90 days wins. Need broader custom AI software, not just an agent? That's AI development.

DIY on a framework

LangGraph is battle-tested orchestration — but you bring your own hosting, state store, scheduler, scaling, and ops
You own the 40% cancellation risk
Months to production-harden the parts the demo skipped

A generic AI agency

Senior pitch, junior delivery, offshore handoff
You get a demo that impresses in the room and breaks on the first restart
Open-ended retainer, no fixed scope

Truvisory®

Senior-engineer-led, Cloudflare-native, no offshore handoffs
Bounded, durable, observable — the seven requirements built in from day one
Fixed-scope, 90-day delivery — the person who scopes it builds it
Portable app logic and data model, even on a single-vendor runtime

// The honest caveat: Cloudflare is vendor concentration — Durable Objects have no drop-in equivalent on AWS or GCP today. For most commercial workloads that isn't a real constraint; where it is, we keep the application logic and data model portable and say so up front.

§ 08 / Voice & SMS agents

A production agent you can talk to right now.

The same agentic-cloud primitives power real-time voice and SMS. Our telephony agent answers calls 24/7 — it qualifies a project, answers pricing, routes to a human, or books a call. Real-time STT + TTS over WebSockets on the Agents SDK, with durable per-call state. Call it and hear a bounded, durable agent in production.

▸ Incoming +1 (303) 495-5859

Live · 24/7

"Hi, I'm Truvisory's voice agent. Tell me about your project — I'll route you, schedule a call, or answer pricing questions right now."

TRUVISORY VOICEVOICE · SMS · 24/7

Agents SDK real-time voice over WebSockets — continuous STT + TTS
Durable Object per call — transcript, intent, and routing state survive the session
Workflows for follow-up — book the call, send the SMS, hand to a human
AI Gateway guardrails + observability on every spoken turn

Call the agent now · +1 (303) 495-5859 →

§ Field notes

Agent engineering field notes

The full Cloudflare agent stack, layer by layer — durable state, execution, tools, memory, and the honest trade-offs.

// Pillar guideMay 2026

Building Production AI Agents on Cloudflare: The Complete Stack

How to build and host production AI agents on Cloudflare — durable state, durable execution, tools, and inference, mapped to a nine-part stack.

Start with the complete-stack guide, then go deep on each layer.

10 min readRead →

May 2026

The Cloudflare Agent Is a Durable Object: How Per-Agent State Works

9 min readRead →

May 2026

How to Build and Host a Remote MCP Server on Cloudflare (McpAgent, OAuth, Portals, Code Mode)

10 min readRead →

May 2026

Cloudflare Agents vs. LangGraph (and Where AWS Bedrock AgentCore Fits): Framework vs. Runtime

9 min readRead →

§ 09 / FAQ

What teams ask before scoping an agent build.

What is MCP, and why does it matter for us?

MCP — the Model Context Protocol, originated by Anthropic — is the standard way an agent connects to your tools and data through one authenticated surface instead of bespoke glue per integration. It matters because it makes the agent-to-system link governable: scoped access, least privilege, and one place to audit what the agent can reach.

Why the Agents SDK over LangGraph?

It's not framework versus framework — it's hosting. LangGraph is solid orchestration, but you still bring your own runtime, durable state, scheduler, scaling, and ops. The Agents SDK makes each agent a Durable Object with state and lifecycle built in, so the production hardening LangGraph leaves to you is already handled.

What makes an agent production-ready rather than a demo?

Seven things: durable state, error recovery and retries, observability, cost control, human-in-the-loop gates, tool security, and bounded scope. A demo answers a prompt; a production agent remembers, retries, acts on tools, waits for a person, and survives a restart. We build those in from day one, not after the demo impresses.

Can agents integrate with our existing systems?

Yes — that is what the MCP tool layer is for. The agent calls your CRM, database, internal APIs, or third-party services through scoped, authenticated tool surfaces, with credentials held at the edge so the agent never sees a token. You keep your systems; the agent gets least-privilege access to exactly what its task requires.

How do you handle agent security and auth?

Managed OAuth (RFC 9728) authenticates the agent on the user’s behalf — no insecure service accounts. Tools run behind scoped, least-privilege permissions; code executes in egress-proxied sandboxes; and every inference and tool call is logged through AI Gateway. An agent that can do anything is a liability, so bounded scope is part of the design.

What does an AI agent development company do?

An AI agent development company designs, builds, and ships production AI agents — not demos. In practice that means engineering the seven production requirements (durable state, retries, observability, cost control, human-in-the-loop gates, tool security via MCP, and bounded scope), wiring the integrations the agent needs, and deploying to a real runtime. We do AI agent development Cloudflare-native: senior-engineer-led, fixed-scope, in production in 90 days.

§ 10 / Scope a bounded agent build

Tell us the one workflow you want an agent to own.

A working call, not a discovery call. You bring one process. We come with a working hypothesis on the architecture, the primitives it maps to, and a fixed-scope, 90-day ballpark. No SDR. No drip campaign.

30 min · Tony directly
Calendar booking — single click, no form
We name the agent's bounded scope on the call
Post-call: 24-hour written architecture sketch, no obligation

// Not ready to talk? Take the 2-min AI Readiness Scorecard →

30 min with Tony directly — you leave with a working hypothesis, a stack pick, and a fixed-scope ballpark. Free, no pitch, no obligation.

July 2026

Loading open times…

No slot that works? Leave your email — Tony sends 3 times that do.

Production AI agents on Cloudflare. Durable, observable, shipped.

The cloud was built for one app serving many users. Agents break that model.

Stateless function + bolt-ons

Agent = Durable Object

One request, traced through the production stack.

Seven things separate a demo from a deployment.

Durable state

Error recovery & retries

Observability

Cost control

Human-in-the-loop

Tool security

Guardrails + bounded scope

The first-party primitives we build on.

Agents SDK

Workflows V2

Workers AI

AI Gateway

MCP hosting

Browser Run

Sandboxes + Dynamic Workers

Agent Memory + AI Search

Security & identity

Most agent projects fail. Here's the data — and why it argues for building them right.

Three ways to get an agent. One ships in 90 days.

DIY on a framework

A generic AI agency

Truvisory®

A production agent you can talk to right now.

Agent engineering field notes

Building Production AI Agents on Cloudflare: The Complete Stack

The Cloudflare Agent Is a Durable Object: How Per-Agent State Works

How to Build and Host a Remote MCP Server on Cloudflare (McpAgent, OAuth, Portals, Code Mode)

Cloudflare Agents vs. LangGraph (and Where AWS Bedrock AgentCore Fits): Framework vs. Runtime

What teams ask before scoping an agent build.

Tell us the one workflow you want an agent to own.