On the Q4 earnings call, Vahe said FY27 is about "bringing the Agentic Operating System for the trades to life" and doubling the capacity of Max. That platform needs three primitives: stateful agents, durable orchestration, and edge-grade inference. We ship all three.
ServiceTitan · NASDAQ: TTAN · FY26 results (year ended Jan 31, 2026)
The retrieval primitive behind a tool-using agent like Atlas. A technician or office user asks in plain language; Workers AI embeds the query; Vectorize searches the tenant's pricebook, service-agreement, and dispatch corpus; results come back in under 100 ms at the edge.
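The embed-then-search flow above can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the Workers AI or Vectorize API: `embed` is a toy word-count embedding, the index is a plain in-memory array, and names such as `Doc`, `search`, and `makeDoc` are hypothetical. The point it shows is that tenant filtering happens before any similarity scoring.

```typescript
// Hypothetical in-memory stand-in for a per-tenant vector index.
// Not the Vectorize API; it only illustrates tenant-scoped nearest-neighbor search.
type Doc = { id: string; tenantId: string; text: string; vector: number[] };

// Toy embedding (assumption): counts occurrences of a few fixed vocabulary words.
// A real deployment would call an embedding model instead.
const VOCAB = ["filter", "furnace", "agreement", "dispatch", "price"];

function embed(text: string): number[] {
  const words = text.toLowerCase().split(/\W+/);
  return VOCAB.map((term) => words.filter((w) => w === term).length);
}

// Cosine similarity; returns 0 when either vector is all zeros.
function cosine(a: number[], b: number[]): number {
  let dot = 0;
  let na = 0;
  let nb = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    na += a[i] * a[i];
    nb += b[i] * b[i];
  }
  return na === 0 || nb === 0 ? 0 : dot / Math.sqrt(na * nb);
}

function search(index: Doc[], tenantId: string, query: string, topK: number): Doc[] {
  const q = embed(query);
  return index
    .filter((d) => d.tenantId === tenantId) // tenant isolation before scoring
    .map((d) => ({ doc: d, score: cosine(q, d.vector) }))
    .sort((x, y) => y.score - x.score)
    .slice(0, topK)
    .map((r) => r.doc);
}

function makeDoc(id: string, tenantId: string, text: string): Doc {
  return { id, tenantId, text, vector: embed(text) };
}

// Sample corpus (made up for illustration).
const index: Doc[] = [
  makeDoc("a1", "tenant-a", "Furnace filter price list"),
  makeDoc("a2", "tenant-a", "Service agreement renewal terms"),
  makeDoc("b1", "tenant-b", "Furnace filter price list"),
];
```

Calling `search(index, "tenant-a", "what is the price of a furnace filter", 1)` ranks the tenant's own pricebook entry first and never considers `tenant-b` documents, even though one of them has identical text.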
The search above is a preview. The Live Sandbox is the per-tenant inference + retrieval layer that sits behind Atlas — upload sample tenant content (pricebook page, service-agreement PDF, technician photo), watch Workers AI extract structure, see Vectorize index it, then query with natural language. Same primitive, every tenant, isolated by Workers for Platforms.
Each artifact lives in a per-tenant R2 bucket, gets embedded by Workers AI, and is queryable from any Atlas conversation through its Durable Object — sub-100ms, regardless of where the user is.
Atlas is a tool-using agent that lives per user, per tenant. The hard problem isn't the LLM call — it's the state, the orchestration, and the retrieval around it. Cloudflare gives you two composable primitives.
One DO per active Atlas conversation (or per Voice Agent call). Strongly consistent, single-threaded, edge-resident. Holds the agent's working memory, tool-call history, and session tokens — without Redis, sticky load balancing, or a separate session service.
Drop tenant documents (pricebooks, service agreements, technician manuals, regional regs) into R2. AI Search auto-generates and maintains a per-tenant Vectorize index. No embedding pipeline to build, no ETL to maintain.
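The "one stateful object per conversation" model described above can be sketched as follows. This is a hedged illustration, not the Durable Objects API: a plain `Map` plays the role of the routing layer, and `ConversationAgent`, `getAgent`, and `ToolCall` are hypothetical names. It shows the addressing idea only: the same tenant and conversation pair always resolves to the same instance, which holds the working memory and tool-call history.

```typescript
// Hypothetical stand-in for "one stateful object per conversation".
// Not the Durable Objects API: a Map routes each key to a single instance.
type ToolCall = { tool: string; args: Record<string, unknown>; result: string };

class ConversationAgent {
  readonly key: string;
  private messages: string[] = [];
  private toolCalls: ToolCall[] = [];

  constructor(key: string) {
    this.key = key;
  }

  addMessage(text: string): void {
    this.messages.push(text);
  }

  recordToolCall(call: ToolCall): void {
    this.toolCalls.push(call);
  }

  // Returns copies so callers cannot mutate the agent's internal state.
  snapshot(): { key: string; messages: string[]; toolCalls: ToolCall[] } {
    return { key: this.key, messages: [...this.messages], toolCalls: [...this.toolCalls] };
  }
}

const agents = new Map<string, ConversationAgent>();

// Same (tenant, conversation) pair always yields the same instance.
function getAgent(tenantId: string, conversationId: string): ConversationAgent {
  const key = `${tenantId}:${conversationId}`;
  let agent = agents.get(key);
  if (!agent) {
    agent = new ConversationAgent(key);
    agents.set(key, agent);
  }
  return agent;
}
```

Because the key includes the tenant, two tenants that happen to reuse a conversation ID still get separate state, with no shared session store in between.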
Every Titan Intelligence workload mapped to a Cloudflare primitive. The three ringed in orange are the agentic platform foundation.
Opinionated framework for Atlas: tool-use, streaming, scheduling, hooks, memory. Your team builds trade-specific tools; we own the orchestration plumbing.
One DO per user / tenant / conversation. Strongly consistent, sticky, edge-resident. The memory and concurrency model Atlas and the Voice Agents need.
Durable, retryable, long-running. A Voice Agent call that books a job, charges a card, dispatches a tech, and texts the homeowner shouldn't break on a partial failure.
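Why a partial failure need not break the call can be sketched with memoized, retryable steps. This is a minimal sketch under stated assumptions, not the Workflows API: steps are synchronous to keep it short, the step log is an in-memory `Map` rather than durable storage, and `runStep`, `handleCall`, and the step names are hypothetical. Completed steps are recorded, so a re-run skips them instead of, for example, charging the card twice.

```typescript
// Hypothetical sketch of durable steps. Not the Workflows API.
// Each named step is retried up to maxAttempts times; finished results are
// stored in the log so a later re-run skips work that already succeeded.
type StepLog = Map<string, unknown>;

function runStep<T>(log: StepLog, name: string, maxAttempts: number, fn: () => T): T {
  if (log.has(name)) return log.get(name) as T; // completed in an earlier run
  let lastError: unknown;
  for (let attempt = 1; attempt <= maxAttempts; attempt++) {
    try {
      const result = fn();
      log.set(name, result);
      return result;
    } catch (err) {
      lastError = err;
    }
  }
  throw lastError;
}

// Counters stand in for side effects on external systems (illustrative only).
let chargeCalls = 0;
let dispatchCalls = 0;

function handleCall(log: StepLog): string {
  const jobId = runStep(log, "book-job", 3, () => "job-1");
  runStep(log, "charge-card", 3, () => {
    chargeCalls++;
    return "charged";
  });
  const tech = runStep(log, "dispatch-tech", 2, () => {
    dispatchCalls++;
    // Simulated outage: the first two dispatch attempts fail.
    if (dispatchCalls <= 2) throw new Error("dispatch service unavailable");
    return "tech-7";
  });
  runStep(log, "text-homeowner", 3, () => "sent");
  return `${jobId}:${tech}`;
}
```

In this sketch the first `handleCall` run exhausts its two dispatch attempts and throws; running it again with the same log skips booking and charging, then completes dispatch and the text message.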
Sidecar work on the voice path: intent routing, PII redaction, transcript summarization. Llama / Mistral / Whisper / BGE — pay per use, no GPUs to run.
Cost per Titan Intelligence feature, prompt caching for high-overlap Atlas system prompts, provider fallback when OpenAI has an outage, full audit. Margin lever for FY27.
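The gateway behaviors listed above (per-feature cost, caching, provider fallback, audit) can be illustrated together. This is a sketch under loud assumptions, not the AI Gateway API: `Gateway`, `Provider`, and the cost units are hypothetical, providers are synchronous stubs, and the cache is a simplified exact-match response cache rather than prompt-prefix caching.

```typescript
// Hypothetical sketch of a gateway layer. Not the AI Gateway API.
// costPerCall is in made-up integer units; complete() is a synchronous stub.
type Provider = { name: string; costPerCall: number; complete: (prompt: string) => string };

class Gateway {
  private providers: Provider[];
  private cache = new Map<string, string>(); // exact-match cache (simplification)
  costByFeature = new Map<string, number>();
  auditLog: string[] = [];

  constructor(providers: Provider[]) {
    this.providers = providers;
  }

  complete(feature: string, prompt: string): string {
    const cached = this.cache.get(prompt);
    if (cached !== undefined) {
      this.auditLog.push(`${feature} cache-hit`);
      return cached; // no provider call, no added cost
    }
    // Try providers in order; fall through to the next one on failure.
    for (const p of this.providers) {
      try {
        const out = p.complete(prompt);
        this.cache.set(prompt, out);
        this.costByFeature.set(feature, (this.costByFeature.get(feature) ?? 0) + p.costPerCall);
        this.auditLog.push(`${feature} ${p.name} ok`);
        return out;
      } catch {
        this.auditLog.push(`${feature} ${p.name} failed`);
      }
    }
    throw new Error("all providers failed");
  }
}
```

With a failing primary and a working fallback, the first request is served by the fallback and billed to its feature; an identical second request is served from the cache at no extra cost, and the audit log records all three events.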
For Karen, Lucy, Piper, Jenny. Low-latency audio + server-side processing at the edge. Pairs with Workers AI for the sub-800ms voice-turn budget.
Per-tenant vector indexes over pricebooks, service agreements, technician manuals. AI Search handles ingest → chunk → embed → retrieve so engineering ships features, not pipelines.
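For a sense of what the chunk stage in ingest → chunk → embed → retrieve involves, here is a minimal sketch. It assumes fixed-size word windows with overlap, which is one common approach and not necessarily how AI Search chunks documents; `chunkText` and its parameters are hypothetical.

```typescript
// Hypothetical chunker: fixed-size word windows with overlap.
// An assumption for illustration; AI Search's own chunking may differ.
function chunkText(text: string, size: number, overlap: number): string[] {
  if (size <= 0 || overlap < 0 || overlap >= size) {
    throw new Error("require size > 0 and 0 <= overlap < size");
  }
  const words = text.split(/\s+/).filter((w) => w.length > 0);
  const chunks: string[] = [];
  // Each window starts (size - overlap) words after the previous one.
  for (let start = 0; start < words.length; start += size - overlap) {
    chunks.push(words.slice(start, start + size).join(" "));
    if (start + size >= words.length) break; // last window reached the end
  }
  return chunks;
}
```

The overlap repeats the tail of one chunk at the head of the next, so a clause that straddles a boundary (a price and its unit, say) still appears intact in at least one chunk.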
D1 for agent-local operational data (transcripts, tool-call traces, session config). Hyperdrive pools connections to your existing Postgres / MySQL system of record.
Per the 4/22 TAR: proxy api.servicetitan.com, move WAF rules out of Log mode on top paths, rate-limit auth. API Shield is the natural next step once endpoints are proxied.
FY27 guidance: revenue growth 24% → ~16%, operating margin expanding meaningfully. Efficiency matters. AI cost goes from a line item to a P&L driver. These three primitives put Cloudflare directly on the levers ServiceTitan is being measured against publicly.
Workers AI sidecar work + Realtime WebRTC at the edge keeps Karen / Lucy / Piper / Jenny inside the natural-conversation latency window as you scale tenants.
AI Gateway in front of every Titan Intelligence call. Cost per feature, cost per tenant, prompt caching, provider fallback. Direct margin lever for the FY27 efficiency narrative.
Stateful, per-user, per-tenant agent infra at the edge. Atlas grounded in pricebook + service-agreement + dispatch context — at $1B+ run-rate scale.
Proxy api.servicetitan.com, exit Log-only WAF mode on top paths, rate-limit auth. Value extraction before any new contract.
A 60-minute working session: your Atlas and Voice Agent technical leads with our Agents SDK and AI Gateway PMs. No slides, just a whiteboard. Concrete numbers, not architecture diagrams.
Built with Cloudflare Pages, Agents SDK, Durable Objects, Workflows, Workers AI, AI Gateway, Realtime, Vectorize, AI Search, D1, Hyperdrive, R2, and the security stack you already pay for.
Deployed globally to 330+ cities. Your DB of record and provider relationships unchanged.