
Field Notes - Dec 04, '25
Executive Signals
- Determinism is the new autonomy: freeze agent flows to code; spend AI on fuzz
- Evals over vibes: prompts as compiled artifacts; releases gated by measured lift
- Soak before scale: progressive rollout with error budgets and evidence retention
- BYOK or bust: enterprise trust comes from keys, logs, and server guardrails
- AEO is the new SEO: machine-readable feeds outrank pixel soup for answer engines
CEO
Center of Excellence, Then Cascade
A small AI CoE de-risks primitives (streaming, routing, evals, safety, RAG) in 15–20 days, then propagates patterns through template repos and playbooks. Pair-led spikes teach the org while keeping standards consistent.
- Charter a 3–5 person CoE with a batteries‑included template repo
- Run pair-led hackathons in phase one; cascade ownership in phase two
- Publish short playbooks and require reuse across teams
Guardrails for Business Commitments
Models should never finalize prices, inventory, or contracts. Gate binding actions behind server policies and human approvals. Scope tools/data per tenant and watch for anomalies to protect margin and trust.
- Require server-side approvals for binding actions
- Enforce per-tenant scopes, rate plans, and abuse thresholds
- Alert on suspicious discounts and unusual token bursts
Reversible Bets, Not One-Way Doors
Frameworks and models shift quarterly. Keep a slim interface around LLM calls, maintain 90‑day swap plans, and avoid deep bets. Prototype fast with a single vendor, then diversify and run production in your cloud.
- Abstract prompts/orchestration; keep two runner‑up models warm
- Pilot hosted; operate production in your cloud with canary failover
- Treat multi-model and BYOK as table stakes at scale
Budget the Last 20% Like It’s 80%
AI automations become expensive in integration, compliance, and edge-case handling. Progress should be measured by error budgets and auditability, not demo flash.
- Name a single DRI with a responsiveness SLA
- Gate go-live on error budgets and audit/compliance checks
- Track demo‑to‑production time and edge‑case burn‑down
Marketing
Build a Generative‑Web Layer Beside Your Site
Agents and LLM crawlers prefer dense, machine-readable facts. Ship an ai‑feed (Markdown/JSON) that mirrors human pages with specs, FAQs, pricing context, and citations. Track LLM citation share as a first-class metric.
- Add a sitewide ai‑feed and link via header or rel=alternate
- Log GPT/Perplexity/Gemini hits; monitor citation share weekly
- Refresh only on material