Skip to main content
Truvisory
§ Agentic Cloud · Cloudflare-native

Production AI agents on Cloudflare. Durable, observable, shipped.

A demo agent answers a prompt. A production agent remembers, retries, acts on tools, waits for a human, and survives a restart. We build the second kind — on the agentic cloud, with the primitives that handle the hard parts natively.

Verified against Agents Week 2026 (Apr 13–20) · re-checked monthly
§ 01 / The thesis

The cloud was built for one app serving many users. Agents break that model.

If even a fraction of knowledge workers each run several agents in parallel, you need compute for tens of millions of simultaneous sessions. Stateless functions have no GPU, no persistent per-instance memory, and a hard timeout — so you bolt on a database, a queue, a scheduler, and a session store. Cloudflare's stateful-serverless model gives each agent its own durable micro-server, with no idle cost.

Stateless function + bolt-ons

Lambda-style request/response. No GPU. No per-instance state. 900-second cap. You assemble a database, a queue, a cron service, a session store, and a vector DB — then operate all of it. The agent is the glue between five systems you maintain.

Agent = Durable Object

Each agent instance is a stateful micro-server with its own embedded SQL, WebSockets, and scheduling. It hibernates when idle (costs nothing), wakes on an event, and resumes exactly where it left off. The data tier, queue, and scheduler collapse into the agent itself.

§ 02 / The agentic-cloud stack

One request, traced through the production stack.

Every model call an agent makes flows through one named, observable path. Watch a single request travel the layers — each is a first-party Cloudflare primitive, not a system you stand up and operate.

// agent request lifecycle tracing

A request enters through identity, routes through the gateway, infers, calls tools, acts in a sandbox, and is wrapped in durable execution end to end. Tap any layer.

§ 03 / What production agents actually require

Seven things separate a demo from a deployment.

Most agent projects don't fail on model quality. They fail on reliability, state, and scope. Cloudflare's stack maps to each requirement natively — that mapping is the whole argument.

  1. 01

    Durable state

    Survives restarts, deploys, and failures. Per-agent SQL inside a Durable Object.

    Agents SDK · DO
  2. 02

    Error recovery & retries

    Automatic, idempotent, step-level. Resume from the last successful checkpoint.

    Workflows V2
  3. 03

    Observability

    Logs, traces, token + cost analytics on every inference, in one place.

    AI Gateway
  4. 04

    Cost control

    Scale-to-zero, caching, rate limits, budget caps. No idle GPU spend.

    Workers AI · Gateway
  5. 05

    Human-in-the-loop

    Approval gates for high-stakes actions — wait hours or days for a person.

    step.waitForEvent
  6. 06

    Tool security

    Scoped OAuth, least privilege, egress control. The agent never sees the token.

    Mesh · OAuth · egress proxy
  7. 07

    Guardrails + bounded scope

    Content moderation and a defined task boundary. An agent that can do anything is an agent you can't ship. Bounded scope is the difference between a tool and a liability.

    Gateway Guardrails
§ 04 / Nine building blocks

The first-party primitives we build on.

  1. 01

    Agents SDK

    Agent = Durable Object. Per-agent state, WebSockets, scheduling, and now long-running sessions without eviction.

    Durable state
  2. 02

    Workflows V2

    Durable execution — 50K concurrency, 300 creates/sec, step checkpoints, sleep to one year, human-in-the-loop.

    Error recovery + HITL
  3. 03

    Workers AI

    Serverless GPU. Unified env.AI.run() binding — 70+ models across 12+ providers, multimodal, one line to switch.

    Low-latency inference
  4. 04

    AI Gateway

    The inference layer for agents — observability, caching, unified billing, cross-provider failover, streaming resilience.

    Observability + cost
  5. 05

    MCP hosting

    McpAgent on Durable Objects, workers-oauth-provider, MCP Server Portals, Code Mode for token efficiency.

    Tool layer + security
  6. 06

    Browser Run

    The web action layer — 120 concurrent browsers, Live View, human-in-the-loop, CDP, WebMCP, session recordings.

    Act · web
  7. 07

    Sandboxes + Dynamic Workers

    Isolated Linux computer per agent, or a millisecond isolate for snippets. Per-app DBs via DO Facets.

    Act · code
  8. 08

    Agent Memory + AI Search

    Managed persistent memory that gets smarter over time, plus hybrid-retrieval RAG created dynamically per agent.

    Memory + grounding
  9. 09

    Security & identity

    Cloudflare Mesh + Workers VPC, Managed OAuth (RFC 9728), resource-scoped permissions, sandbox egress proxy.

    Guardrails + scope

// Supporting: Durable Objects · Vectorize · R2 · D1 · KV · Queues. Multi-channel add-ons: Voice (experimental), Email (beta). Preview/beta status verified against Agents Week 2026 — re-checked at each engagement.

§ 05 / Built for the workload shape

"The one-app-serves-many-users model the cloud was built on doesn't work for tens of millions of simultaneous agent sessions."

— Cloudflare CTO Dane Knecht & VP Product Rita Kozlov, Agents Week 2026. Each lit cell is an agent waking, acting, and hibernating independently — one Durable Object apiece, scale-to-zero.
§ 06 / The honest risk layer

Most agent projects fail. Here's the data — and why it argues for building them right.

  1. 40%+
    of agentic AI projects will be canceled by end of 2027 — escalating costs, unclear value, inadequate risk controls.
    Gartner · Jun 2025 · n=3,412
  2. 95%
    of integrated GenAI pilots show no measurable P&L impact. The cause is the integration gap, not model quality.
    MIT NANDA · 2025 · directional
  3. $1.4B
    Agentforce + Data 360 ARR, up 114% Y/Y, 9,500+ paid deals. Demand is real and monetizing.
    Salesforce · Q3 FY26 · audited

The market data isn't an argument against agents. It's an argument for building them bounded, durable, and observable — which is exactly what the production stack provides, and exactly what a flashy demo that can't survive a restart never will.

§ 07 / Build vs. buy vs. us

Three ways to get an agent. One ships in 90 days.

DIY on a framework

  • LangGraph is battle-tested orchestration — but you bring your own hosting, state store, scheduler, scaling, and ops
  • You own the 40% cancellation risk
  • Months to production-harden the parts the demo skipped

A generic AI agency

  • Senior pitch, junior delivery, offshore handoff
  • You get a demo that impresses in the room and breaks on the first restart
  • Open-ended retainer, no fixed scope

Truvisory®

  • Senior-engineer-led, Cloudflare-native, no offshore handoffs
  • Bounded, durable, observable — the seven requirements built in from day one
  • Fixed-scope, 90-day delivery — the person who scopes it builds it
  • Portable app logic and data model, even on a single-vendor runtime

// The honest caveat: Cloudflare is vendor concentration — Durable Objects have no drop-in equivalent on AWS or GCP today. For most commercial workloads that isn't a real constraint; where it is, we keep the application logic and data model portable and say so up front.

§ 08 / Voice & SMS agents

A production agent you can talk to right now.

The same agentic-cloud primitives power real-time voice and SMS. Our telephony agent answers calls 24/7 — it qualifies a project, answers pricing, routes to a human, or books a call. Real-time STT + TTS over WebSockets on the Agents SDK, with durable per-call state. Call it and hear a bounded, durable agent in production.

▸ Incoming +1 (303) 495-5859
Live · 24/7
"Hi, I'm Truvisory's voice agent. Tell me about your project — I'll route you, schedule a call, or answer pricing questions right now."
TRUVISORY VOICEVOICE · SMS · 24/7
  • Agents SDK real-time voice over WebSockets — continuous STT + TTS
  • Durable Object per call — transcript, intent, and routing state survive the session
  • Workflows for follow-up — book the call, send the SMS, hand to a human
  • AI Gateway guardrails + observability on every spoken turn
Call the agent now · +1 (303) 495-5859
§ 09 / Scope a bounded agent build

Tell us the one workflow you want an agent to own.

A working call, not a discovery call. You bring one process. We come with a working hypothesis on the architecture, the primitives it maps to, and a fixed-scope, 90-day ballpark. No SDR. No drip campaign.

  • 30 min · Tony directly
  • Calendar booking — single click, no form
  • We name the agent's bounded scope on the call
  • Post-call: 24-hour written architecture sketch, no obligation
May 2026
M
T
W
T
F
S
S
Loading open times…