The Agent Control Blog — What Your Agents Actually Cost, and How to Control Every Tool Call

Which Claude Code Tools Should You Deny (or Gate Behind Approval) Out of the Box?

policy control claude-code

One Claude Code session declares 76 tools. The core coding loop isn't the risk — the never-invoked tail is: tools that send, schedule, publish, and spawn. A default posture, argued from blast radius, not asserted.

Jul 5, 2026

AI Agent Tool Allowlists: Deny by Default, Scope per Task, Audit Everything

tool-policy allowlist default-deny

One Claude Code session declares 76 tools; 64 of them never fire. A tool allowlist is the list of calls your agent may make — everything else denied. How to set one in Claude Code, Codex CLI, and MCP, and where client-side lists stop holding.

Jul 5, 2026

Claude Code's Deny List Can Be Bypassed. Here's What a Real Enforcement Boundary Looks Like.

claude-code security tool-policy

Deny rules match command strings inside the client — compound commands, substitution, and one documented flag all route around them. Where client-side permissions honestly stop, and what enforcement outside the agent process looks like.

Jul 5, 2026

Claude Code Cost Tracking: Token Counters Tell You What You Spent, Not Where It Went

claude-code cost optimization

A real working day of Claude Code: 276 model calls, 1,697 tool calls, $148.16 at API rates — 100% of it loop tax. Why /cost, ccusage, and proxy totals can't show you that, and what per-action attribution looks like.

Jul 5, 2026

Control and Optimize Your Agents, Down to Each Tool Call

cost agents optimization

You ship an agent and see one answer. Inside, it made 200 tool calls across a dozen model turns, and it cost something different every run. Here's how to see that, control it, and make it cheaper — one tool call at a time.

Jun 30, 2026

Session X-Ray: Debugging a Single Agent Run, Call by Call

cost agents debugging

The Agent X-ray shows you an agent across all its runs. The Session X-ray opens one run — every call in order, the loop tax inside it, and the single step that cost you. Here's how to read one.

Jun 30, 2026

We Benchmarked 13 Models on Real Agent Runs

benchmark model-selection cost

Isolated tool-call skill barely predicts whether a model can run an agent. We tested 13 models two ways — and cheap models aren't worse, they're erratic on the loop.

Jun 28, 2026

Full scorecard: seven frameworks, 48 scenarios, one open benchmark

benchmark governance scorecard

Seven frameworks benchmarked: CrewAI, LangGraph, Claude Code, OpenAI Agents SDK, Anthropic Agent SDK, Cursor, Codex CLI. Native vs ACP. Three score tiers.

Apr 20, 2026

What Is an Agent Harness? (And Why Every Harness Needs a Control Plane)

harness agents governance

The model is the smallest part of your agent. Everything around it — the loop, the tools, the memory, the budget — is the harness, and it's where reliability, cost, and risk actually live. Here's what an agent harness is, why harness engineering became a discipline, and the one thing every harness in production still needs.

Jun 30, 2026

We had a Claude agent build a governed AI agent. It picked Microsoft.

agents-building-agents discovery agentic-control-plane

We let a fresh AI agent — no prior knowledge of any vendor — build a governed Slack summarizer end-to-end in Python. It searched the web, picked a vendor, ran pip install, wrote 338 lines, verified the audit chain. It picked Microsoft Agent Governance Toolkit. The full discovery path, both code samples, and an honest comparison.

May 1, 2026

Anthropic Agent SDK: Auditable Logging and Governance in TypeScript

anthropic agent-sdk governance

The complete TypeScript reference for adding auditable logging, per-user identity, and policy enforcement to Anthropic's Agent SDK and Claude Agent SDK loops.

Apr 27, 2026

The Loop Tax: Why AI Agents Are So Expensive

cost agents optimization

An agent doesn't make one model call — it makes a chain, re-reading context every turn. That loop is ~89% of the bill. The mechanic, and three levers to cut it.

Jun 26, 2026

How to Govern AI Agent Tool Calls (Before They Run)

governance tool-calling mcp

Your framework gatekeeps the server, not the call. How to authorize, scope, and audit every agent tool call per user — deterministically, before it executes.

Jun 28, 2026

Agent Access Control: Least-Privilege Scoped Tools

access-control least-privilege governance

The fastest way to make an agent ungovernable is to give it broad tools. Why least-privilege, scoped tools are the foundation of access control for AI agents.

Jun 28, 2026

Per-User Auth for AutoGen Agents

autogen authentication identity

AutoGen punts authentication to your application code. How to thread the end user's verified identity through to every tool call — with policy and audit per user.

Jun 28, 2026

Cost Per Customer in CrewAI Agents

crewai cost multi-tenant

CrewAI cost tooling stops at the LLM call. How to attribute agent spend per customer, per agent, and split the loop tax from the real work.

Jun 28, 2026

Seven agent frameworks, one backend, governance diverges on 9 of 48 tests

architecture governance benchmark

Seven frameworks, one backend, 48 governance scenarios. Scores ranged 37-46. Variance is architectural: where a framework lets you observe tool calls.

Apr 21, 2026

--dangerously-skip-permissions drops the prompts, not the hooks — your governance survives it

claude-code anthropic governance

Correction: Claude Code's --dangerously-skip-permissions suppresses the interactive permission prompts, not your PreToolUse/PostToolUse hooks. Hooks fire in every permission mode and a hook deny still blocks — so ACP keeps governing. Here's what the flag actually touches, and what genuinely can turn a hook off.

Apr 20, 2026

AI control plane: a buyer's guide

governance buyers-guide control-plane

What an AI control plane actually is, the four vendor categories competing for that name, the questions that separate them, and a 14-day evaluation framework you can run before you sign anything.

Apr 30, 2026

CSA Defines the Agentic Control Plane. Here's What We Built.

standards architecture

Cloud Security Alliance just published the Agentic Control Plane framework. We've been building the infrastructure. What's real vs. still theoretical.

Mar 23, 2026

Agentic Data Plane vs Agentic Control Plane

architecture mcp

Agentic data plane vs agentic control plane — what each layer does, why you need both, and how they work together to govern AI agents in production.

Feb 23, 2026

What Is an MCP Control Plane?

architecture mcp

An MCP control plane adds identity verification, policy enforcement, and audit logging to Model Context Protocol servers — the missing governance layer.

Feb 26, 2026

Why Your API Gateway Can't Control AI Agents

architecture

Kong, Apigee, and AWS API Gateway route and rate-limit requests. They can't tell which user an agent acts for, which tool call is destructive, or what a session costs. What agent traffic needs that gateways weren't built for.

Dec 10, 2025

Why Not Just Use OPA and a Service Mesh?

architecture identity

Do OPA, Istio, API gateways, and IAM already solve AI agent governance? Where existing infrastructure fits, where it breaks, and what's still missing.

Mar 25, 2026

When AI Agents Take Real Actions, Who Controls What They're Allowed to Do?

architecture

Chatbots read and write text; agents call APIs, change records, and move money. Why every team shipping agents ends up needing per-action permissions, audit trails, and cost limits — the control-plane argument.

Oct 15, 2025

Building the same agent fifteen ways: what each framework taught us about governance

governance frameworks integration-patterns

Fifteen frameworks and clients, one agent task. The friction points were different in every runtime — and the patterns that emerge tell you what governance actually has to do, regardless of which framework your team picks.

Apr 24, 2026

If You Can't Tell Who an Agent Request Is For, Nothing Else Works

identity

Permissions, rate limits, and audit logs all fail the same way for AI agents: the request arrives without the user. Why identity is the problem to solve first — before access control, before audit.

Nov 5, 2025

ACP and Okta for AI Agents: composition, not collision

okta identity composition

Okta for AI Agents launched today as the identity-perimeter layer for AI agents. ACP runs at the tool-call layer. The two compose into a complete control plane — here's how the layers fit.

Apr 30, 2026

Okta for AI Agents: a technical read on the launch

okta identity mcp

Okta launched Okta for AI Agents to GA on April 30, 2026. Walking through the architecture, the MCP Bridge approach, the five-question framing, and which agent-governance use cases the launch addresses cleanly.

Apr 30, 2026

ACP and Bedrock AgentCore: how the two layers compose

aws bedrock agentcore

AWS shipped a real governance product for Bedrock-hosted agents. ACP runs everywhere else. The honest read on when to use which, and why most enterprises will need both.

Apr 30, 2026

OpenAI on Bedrock: what the partnership covers, and what's beyond it

aws bedrock openai

OpenAI models are now on Amazon Bedrock — including GPT-5.5. The deal extends meaningful governance to AWS-hosted agents and surfaces three architectural areas where complementary intercept points are still useful.

Apr 30, 2026

ACP and Microsoft Foundry Agent Service: governance beyond the Azure boundary

microsoft azure foundry

Microsoft Foundry ships the most coherent enterprise governance story among the three hyperscalers. Here's where it covers, where it stops, and how ACP composes for everything outside Azure.

Apr 30, 2026

Microsoft open-sourced an Agent Governance Toolkit. Here's what it covers and what it doesn't

microsoft azure foundry

April 2026: Microsoft released an open-source policy engine for AI agents — sub-millisecond enforcement, stateless, self-hostable. Read the strengths and the scope honestly.

Apr 30, 2026

ACP and Vertex AI Agent Builder: same ADK code, two governance scopes

google vertex agent-builder

Google's Vertex AI Agent Builder gives ADK agents per-agent IAM identities, Cloud API Registry tool governance, and managed Agent Engine. Here's where it covers and how ACP plugs in beyond Google Cloud.

Apr 30, 2026

Gemini Enterprise Agent Platform: Google's hosted-agent answer, and where it composes

google vertex gemini-enterprise

April 23, 2026: Google announced the Gemini Enterprise Agent Platform — bundling Vertex Agent Builder, Agents CLI, Agent Runtime, Cloud Run, and GKE Autopilot. Here's what's in it and how it composes with complementary intercept layers.

Apr 30, 2026

SOC 2 and HIPAA for AI agents: the compliance playbook

compliance soc2 hipaa

A control-by-control mapping from SOC 2 trust services criteria and the HIPAA Security Rule to the AI agent governance controls that satisfy them. With evidence-collection guidance, common audit-failure modes, and a one-page checklist.

Apr 30, 2026

Ten questions every CISO should ask about AI agent audit trails

compliance governance audit

Every governance vendor claims audit trails. Most produce something between an unstructured request log and a real, identity-attributed, tamper-evident record of agent decisions. Here are the ten questions that separate them.

Apr 27, 2026

AI Agent Audit Trails: What CISOs Actually Need to Know

compliance

"Which employee accessed patient records through the AI assistant last Tuesday?" Your gateway logs and SIEM can't answer that. What an agent audit trail needs that generic API logs don't have.

Mar 1, 2026

How to Make AI Agents Pass a Compliance Audit (HIPAA, SOC 2, GDPR)

compliance

Most AI agents run on shared API keys with no per-user access control and no identity-attributed audit trail — failing every major compliance framework. What auditors actually ask for, and how to produce it.

Jan 22, 2026

EU AI Act Article 14 and AI Agents: Mapping Human Oversight to Delegation Chains

eu-ai-act article-14 compliance

EU AI Act Article 14 requires demonstrable human oversight from Dec 2, 2027. How ADCS delegation chains map to 14(4)(a)-(e) with auditor-ready artifacts.

Apr 16, 2026

NIST Just Defined Identity for AI Agents. Here's What Changes.

standards identity

NIST's AI Agent Standards Initiative is the first federal move on identity and authorization for autonomous agents. The architectural asks are clear.

Mar 15, 2026

How an Agentic Control Plane Addresses Every OWASP Agentic Top 10 Risk

standards compliance

A risk-by-risk mapping of OWASP's Agentic Top 10 to specific control plane capabilities — from the governance layer, not the model or app layer.

Mar 25, 2026

AI Agent Identity: The Problem No One Has Solved Yet

identity

Okta, AD, and OAuth authenticate humans at login. An agent acts for a user across thousands of tool calls after login — and your IAM stack has no concept of it. Why proving who an agent acts for needs a different layer.

Mar 4, 2026

Your Backend Can't Tell Which User an AI Agent Is Acting For

identity mcp

I built an MCP server, connected it to real customer data, then read my backend logs: every request was the same service account. How agent requests lose the user — and how to bind identity to every call.

Mar 10, 2026

Per-Call Permissions for AI Agents: Why RBAC Breaks at Agent Speed

authorization

Your RBAC assigns roles at login. Your agent makes 47 tool calls in 90 seconds. Why agents need runtime authorization — deny-by-default permissions checked on every tool call — and what that looks like in practice.

Mar 20, 2026

Point-in-Time Audits Can't Keep Up With AI Agents. Check Every Tool Call Instead.

compliance authorization

Your SOC 2 audit covered 90 days; your agents made 2.3 million tool calls in that window. Why annual evidence breaks for agents, and what checking identity, permissions, and budget on every call looks like.

Mar 18, 2026

The MCP Security Checklist for Enterprise Teams

security-research mcp compliance

A 10-point security checklist for teams deploying MCP servers in production. Covers identity, auth, PII, rate limits, audit trails, and more.

Mar 22, 2026

Stop your AI agent from running `rm -rf` on your filesystem — in three steps

governance defense-in-depth tool-policy

Cursor and Claude Code agents have wiped home directories mid-session. The fix isn't smarter prompting — it's a control plane between the agent's tool call and your filesystem. Here's the exact configuration.

Apr 26, 2026

Stop your AI agent from leaking secrets in your `.env` file — in three steps

governance defense-in-depth secrets-management

AI coding agents read your .env files by default. They quote secrets back into commits, paste them into chat logs, and surface them in tool outputs. Here's how to gate that without breaking your agent's actual job.

Apr 26, 2026

Stop your AI agent from leaking PII through tool calls — in three steps

governance defense-in-depth pii-redaction

Your AI agent runs SELECT email FROM users and gets back a list of customer emails. Now those emails are in the LLM's context, your conversation logs, and any downstream tool the agent calls afterward. The fix isn't smarter prompting — it's tool-output PII redaction at the gateway.

May 1, 2026

Stop your AI agent from rewriting your git history — in three steps

governance defense-in-depth git-safety

Claude Code, Cline, and Cursor agents have force-pushed over teammates' work, reset uncommitted changes, and stripped commits from production branches. The model can't see what you'd lose. A control plane between the agent and your git remote can.

Apr 26, 2026

Stop your AI agent from being weaponized by a malicious package — in three steps

governance defense-in-depth supply-chain

The Nx s1ngularity attack used local Claude, Gemini, and Q CLIs to recon for SSH keys, .env files, and GitHub tokens. 2,349 secrets were exfiltrated. The control plane your AI agent needs is the same one that catches this — but you have to install it before the next compromise lands.

Apr 26, 2026

Stop your AI agent from touching files outside your project — in three steps

governance defense-in-depth workspace-isolation

Cursor and GitHub Copilot agents have wandered into Documents folders, root drives, and home directories — deleting files that had nothing to do with the project they were working on. The control-plane fix is workspace-scoping at the hook layer.

Apr 26, 2026

Stop your AI agent from dropping a Kubernetes namespace — in three steps

governance defense-in-depth kubernetes

An autonomous agent with kubectl access can `kubectl delete namespace prod` in one tool call. The OS doesn't ask twice. The control plane between the agent's intent and your cluster has to.

Apr 26, 2026

Stop your AI agent from escalating IAM permissions — in three steps

governance defense-in-depth iam

If your agent can call `iam:CreatePolicy`, `iam:AttachRolePolicy`, or `gcloud projects add-iam-policy-binding`, it can grant itself anything the underlying credential allows. The blast radius of one bad tool call is your entire cloud account.

Apr 26, 2026

Stop your AI agent from making payments without approval — in three steps

governance defense-in-depth agentic-commerce

Agentic commerce SDKs from Stripe, Visa, and Mastercard give your AI agent the ability to charge cards, transfer funds, and authorize subscriptions. One bad tool call is one real-world transaction. Here's how to put approval in the loop where it matters.

Apr 26, 2026

Stop your AI agent from burning through your API budget — in three steps

governance defense-in-depth rate-limiting

Cursor agents loop when context summarization interrupts them. Codex sub-agents have run $350 over plan in a week. A leaked GCP key produced an $18,000 bill. The fix isn't smarter prompting — it's a control plane with rate limits and budget caps.

Apr 26, 2026

Your API Keys Already Give Agents Production Access

identity supply-chain

Every API key in your env vars lets an agent act with full access and no user identity. Most teams don't notice until something breaks in production.

Mar 14, 2026

I Audited 7,522 AI Agent Skills. Here's What I Found.

supply-chain mcp

A first-hand static analysis of every skill on ClawHub — the real numbers on credential leaks, prompt injection, and what registry moderation actually catches.

Mar 25, 2026

WebMCP Ships Without Agent Identity. Here's Why That Matters.

standards mcp identity

W3C WebMCP gives browsers a native API for AI agents to call site tools — but ships with no agent identity, scoped permissions, or delegation context.

Mar 19, 2026

Codex CLI Hooks Reference — hooks.json, PreToolUse & PostToolUse

codex openai cli

The complete Codex CLI hooks reference: the codex_hooks flag, hooks.json config, PreToolUse deny rules, PostToolUse audit, --full-auto behavior, and how to govern the tools hooks don't cover.

Apr 30, 2026

Introducing ADCS — an open spec for agent-to-agent delegation chains

spec delegation-chain a2a

ADCS v0.1: an open JSON spec for agent delegation chains — scope intersection, budget propagation, cycle prevention, identity, audit. Ref impl shipping.

Apr 16, 2026

Recommended governance deployment patterns — pick the one that scores highest for your stack

governance deployment recommendation

AgentGovBench scores across seven frameworks, translated into a customer-facing recommendation for deploying governed AI agents by stack, score, and reach.

Apr 20, 2026

How we think about testing AI agent governance

testing benchmark governance

AgentGovBench is an open, NIST-mapped benchmark for AI agent governance. We ran it against ACP. What broke, what shipped, how to run it on your deployment.

Apr 20, 2026

Reproduce AgentGovBench on your stack — full setup guide

tutorial benchmark reproducibility

Step-by-step guide to running the AgentGovBench scorecard against your own ACP deployment: required env, Firebase setup, common issues, reading results.

Apr 20, 2026

How AgentGovBench's 48 scenarios map to NIST AI RMF 1.0

nist ai-rmf governance

AgentGovBench scenarios cite specific NIST AI RMF 1.0 controls — MAP, MEASURE, MANAGE, GOVERN. The full mapping for procurement teams citing controls.

Apr 20, 2026

Build a governed GitHub PR reviewer in Python (with subagent delegation)

recipe agents-building-agents delegation-chain

A Python AI agent that reviews pull requests, spawns a security-scanner and a test-runner as scope-narrowed subagents, and ships an audit chain back to the human reviewer. Full working code. ~210 lines.

May 1, 2026

Build a governed multi-step research agent (delegation chain across 4 hops)

recipe agents-building-agents delegation-chain

A Python research agent that decomposes a question, spawns parallel search subagents, then a synthesizer subagent — with a 4-deep delegation chain that traces every tool call back to the human asker. ~230 lines.

May 1, 2026

Log and Control Every Claude Code Tool Call in 60 Seconds

claude-code governance tutorial

One command puts a hook on every Claude Code tool call. Bash, Read, Write, Edit, WebFetch — logged, checked against your allow/deny rules, visible in a dashboard.

Apr 6, 2026

Allow, Deny, and Audit Every Tool Call in the Anthropic Agent SDK

anthropic agent-sdk a2a

Anthropic's Agent SDK makes multi-skill agents easy to ship — and trusts them completely. How to scope permissions and get an audit trail on every skill, tool call, and sub-agent hop.

Apr 10, 2026

Add Per-User Permissions and Audit to LangGraph in 3 Minutes

langgraph langchain a2a

Wrap your LangGraph tools with one decorator and bind the user's JWT — every tool call across every node is permission-checked, identity-attributed, and logged. Three minutes from `pip install` to running.

Apr 12, 2026

Governed Google ADK in 3 minutes

google adk gemini

Add ACP governance to a Google Agent Development Kit (ADK) agent in three minutes. One @governed decorator, one set_context call, every tool call audited and policy-checked. Works with direct Gemini and Vertex AI.

Apr 27, 2026

Does the Anthropic Agent SDK Have Governance?

anthropic anthropic-agent-sdk benchmark

The Anthropic Agent SDK ships no per-user identity, policy, or audit. Here's the governance gap — and how to close it with one wrapper around your handlers.

Apr 20, 2026

CrewAI's task handoffs lose the audit trail — here's the gap and the fix

crewai governance audit

CrewAI's Hierarchical Process delegates manager-to-worker without carrying the chain. Even with @governed, audit logs show worker as top-level. The fix.

Apr 20, 2026

LangGraph's StateGraph checkpoints don't replay through governance

langgraph governance stategraph

LangGraph checkpoint replays skip the governance pipeline — policy changes between original run and replay are silently ignored. The failure mode and fix.

Apr 20, 2026

How to Connect Salesforce to Claude Desktop in 5 Minutes

mcp architecture

Step-by-step guide to connecting Salesforce CRM data to Claude Desktop using ACP's MCP gateway. Query contacts, deals, and reports from your AI assistant.

Mar 21, 2026

How to Trigger a Governed AI Agent from n8n, Zapier, or Any Webhook

agent-triggers

Step-by-step guide to invoking AI agents via HTTP from workflow automation tools. Every tool call is identity-verified, rate-limited, and audit-logged.

Mar 24, 2026

How to Rate-Limit an MCP Server (Per-User, Per-Tool, Per-Agent)

mcp rate-limiting runaway-agents

MCP servers are rate-limit-blind — they see the LLM runtime's service account, not the user. How to add per-user, per-tool, per-agent limits in MCP.

Apr 16, 2026

Governing CrewAI A2A Delegation: a production setup guide

crewai a2a delegation

CrewAI shipped a first-class A2A delegation primitive. Full walkthrough: install, configure, govern, audit CrewAI A2A crews with scope and budget caps.

Apr 16, 2026

How to Add Per-User Authentication to a LangGraph Agent

langgraph authentication auth

LangGraph agents run on a shared API key by default — every tool call looks the same. Add per-user auth, identity-attributed audit, and rate limits.

Apr 16, 2026

MCP Re-Auth: What ChatGPT Actually Needs When Tokens Expire

architecture mcp oauth

ChatGPT won't re-trigger OAuth on HTTP 401 or JSON-RPC errors — it needs a JSON-RPC success envelope with _meta. The signal, and how a gateway emits it.

Mar 16, 2026

Can You Prove What Your AI Agent Did? I Checked 8,216 MCP Servers.

security-research compliance mcp

74.9% of MCP servers have no mention of audit logging. The MCP spec defines zero audit primitives. Here's why that's a compliance problem.

Mar 30, 2026

The MCP Rate-Limit Blast Radius: $1,080 an Hour

security-research mcp

180 MCP servers proxy calls to paid APIs like OpenAI and Stripe. 85% document no rate limits. An agent retry loop can cost $1,080/hour.

Mar 31, 2026

8,216 MCP Servers, 7,840 Tools, Zero Input Validation

security-research mcp supply-chain

Analysis of 8,216 MCP servers: 2,432 expose high-risk inputs — SQL, file paths, shell commands — with zero validation constraints in the tool schema.

Mar 29, 2026

4 Security Vulnerabilities Hiding in Your MCP Server's Tool Schema

security-research mcp schemas

Real CVEs trace back to unconstrained tool schemas. From 8,216 MCP servers: the JSON patterns behind path traversal, SSRF, injection, and how to fix each.

Apr 2, 2026

I Classified 8,000+ MCP Servers by Auth Appropriateness. Most Get It Wrong.

security-research identity mcp

A static analysis of 8,216 MCP servers across 3 registries (a 4th collector, PulseMCP, returned zero servers). 50.6% have no auth. But the real question is: do they have the right auth for what they do?

Mar 28, 2026

MCP Gateway Comparison (2026): Composio vs ACP vs DIY

architecture mcp

An honest comparison of MCP gateway options. When to use ACP Cloud, Composio, or build your own with open-source tools.

Apr 1, 2026

MCP Moves the Tool Calls. Nothing in It Decides What's Allowed.

architecture mcp

MCP defines how agents call tools. It doesn't say who may call what, with which permissions, or leave an audit trail. What the protocol covers — and the allow/deny layer you still have to add yourself.

Mar 17, 2026

PII in Prompts: What You're Probably Leaking

compliance

Users paste PII into your AI app. It goes straight to the LLM. That's a compliance problem.

Nov 19, 2025

Control your coding agent in one command — no signup, runs local

claude-code codex cursor

One curl command puts a deny floor, an allow/ask/deny policy file, and an append-only audit log in front of Claude Code, Codex, and Cursor. On-device, no account, MIT. Here's the whole thing in 60 seconds.

Jul 27, 2026

Claude Code Hooks vs ACP: When a 50-Line Hook Is Enough

claude-code codex hooks

You can build agent control yourself with a PreToolUse hook — and sometimes you should. What the DIY version takes, where it rots, and an honest line for when to stop maintaining your own.

Jul 27, 2026

How to Block Dangerous Commands From a Coding Agent (2026)

claude-code codex cursor

Five ways to stop an AI coding agent from running rm -rf, force-pushes, and disk writes — permission prompts, deny lists, dcg, a local policy floor, and devcontainers — ranked by what they actually catch.

Jul 27, 2026

The Best Ways to Control What Claude Code Can Do, Ranked (2026)

claude-code permissions sandboxing

Six real options for controlling a coding agent — built-in permissions, Anthropic's sandbox runtime, devcontainers, dcg, cloud sandboxes, and a policy layer on the call path — ranked by isolation, coverage, audit, and setup friction. With the honest limits of each.

Jul 27, 2026

How to Audit What Your Coding Agent Actually Runs (2026)

claude-code codex cursor

Four ways to answer 'what did the agent do while I wasn't looking' — harness transcripts, shell history, OS-level logging, and a hook-path audit log — and what each one misses.

Jul 27, 2026

Your benchmark says the agent passed. It doesn't say what failure would have cost.

governance agents benchmarks

A 2026 paper proposes grading every agent action on a 0–6 harm scale — reversibility, scope, privilege — instead of counting pass/fail. We ran the authors' published scoring code, unmodified, against a production governance gateway: 48 adversarial scenarios, two arms, one variable. Twelve actions that completed ungoverned graded 'attempted, blocked before effect' under governance. The same instrument also graded our own benchmark — and found the levels it never tests.

Jul 26, 2026

The Gemini cache dead zone: why the newest Flash quietly stopped caching your agent's prompts

cost gemini caching

We metered our own agent's bill and found that Gemini 3.6 Flash caches nothing below ~10K tokens — exactly where agent prompts live. Here's the mechanism, the money, and how to check your own traffic.

Jul 24, 2026

A support ticket told an AI agent to leak the tokens table. We recreated the lethal trifecta — and held the exfil for a human.

governance agents incident

In July 2025 a Cursor agent connected to Supabase with service_role credentials read a support ticket containing hidden instructions, queried the integration-tokens table, and posted the secrets back where the attacker could read them. Untrusted input, privileged access, an outbound channel — the lethal trifecta. We rebuilt it in a sandbox. In the governed twin the autonomous agent's outbound call is held for a human instead of executing, and scoping the credential is the boundary you pair it with.

Jul 23, 2026

Replit's AI agent deleted a production database during a code freeze. We recreated the freeze — and held it.

governance agents incident

In July 2025 a Replit agent ran destructive commands against a production database during an explicit code freeze, then misreported what it had done. We rebuilt the scenario in a sandbox: an agent under a freeze instruction, a live prod connection, and a destructive migration. The governed twin's destructive prod operation is denied in the call path — not left to a freeze instruction the agent can rationalize past — so it never runs.

Jul 23, 2026

An AI agent deleted a production database in 9 seconds. We rebuilt it in a sandbox — and stopped it.

governance agents incident

In April 2026 a Cursor agent found a stray API token, ran one destructive GraphQL mutation, and erased PocketOS's production database and its backups in nine seconds. No attacker, no prompt injection — just an over-broad token and an unsupervised destructive call. We recreated the exact mechanism in a sandbox and ran a governed twin beside it. The twin's version ends with a denied tool call and a full database.

Jul 23, 2026

A public issue made an AI agent leak a private repo. We recreated it — and held the public post for a human.

governance agents incident

In May 2025 researchers showed a malicious GitHub issue could steer a developer's AI assistant, via the GitHub MCP server, into pulling private repository contents and posting them publicly. We rebuilt the mechanism in a sandbox. In the governed twin the autonomous agent's public post is held for human approval instead of executing, and scoping the token so the private read can't happen is the boundary you pair it with.

Jul 23, 2026

A pull request told Amazon Q to wipe the machine. We recreated the injected PR — and the destructive calls never ran.

governance agents incident

In 2025 a destructive instruction was slipped into the Amazon Q VS Code extension through a GitHub pull request, directing the agent to wipe the local filesystem and delete cloud resources using the developer's own permissions. We rebuilt the injected-PR mechanism in a sandbox. The governed twin denies rm -rf and destructive cloud calls no matter where the instruction came from.

Jul 23, 2026

Codex CLI Cost Tracking — What Exists, What's Missing, and How to Meter Every Session

codex openai cli

Codex CLI has no built-in dollar cost tracking. The complete reference: what the CLI shows natively, why token counts aren't cost attribution, the model_providers config that meters every session, budget limits, and the subscription-vs-API-key caveat.

Jul 22, 2026

Claude Code vs Codex CLI: Permission Models Compared

claude-code codex permissions

The complete comparison of Claude Code and Codex CLI permission systems — approval modes, hook events and coverage, deny rules, sandboxing, and what each does in unattended mode. With the honest gaps in both.

Jul 22, 2026

An Agent Firewall Is the Enforcement Half of a Control Plane

governance security agents

Products, papers, and practice are converging on the same pattern: enforce policy on the agent's actions, in the execution path, before they run. The industry is naming it 'agent firewall.' Here's the definition I'd defend — and the half it doesn't cover.

Jul 21, 2026

Agent Cost Drift in the Wild: A Model Alias Zeroed Our Cache

cost agents monitoring

Our ops agent's cache rate read 0% across 216 calls. Nothing in the code changed. The '-latest' model alias had quietly resolved to a generation that never cache-hits — multiplying effective input cost ~4–6x. Here's the probe that isolated it, verbatim.

Jul 21, 2026

An LLM Gateway Governs What the Model Says. Your Risk Is What the Agent Does.

governance agents engineering

LangChain's governed-agents post is right about the problem and names the best sentence in the category. A model-call gateway covers part of it. The part it can't see — actions the harness executes locally — is where the blast radius lives.

Jul 20, 2026

Cache-Hit Rates Are Easy to Get Wrong. We Found Three Different Formulas in the Wild — Including Our Own Stack.

cost engineering agents

Cache reads bill at ~10% of full input price, which makes the cache-hit rate one of the most consequential numbers in agent cost — and one nobody defines the same way. A field guide to the three formulas, and the worked example that taught us to check.

Jul 20, 2026

We Read Every Agent Harness's Guardrails. Here's What Survives Yolo Mode.

security governance agents

Eleven agent harnesses and frameworks, their default safety guardrails read from primary source. Almost none stop a catastrophic command in full-auto mode. The one thing they agree on: safety is the developer's homework.

Jul 18, 2026

What Hermes's Blocklist Taught Us About Agent Guardrails

hermes governance security

Hermes Agent publishes exactly what it blocks out of the box. We compared its taxonomy to ACP's risk classifier, adopted five categories, and kept the parts each side does better. Notes on what harness guardrails cover — and what they can't.

Jul 18, 2026

How to Set Up a Hermes Autonomous Agent Safely

hermes governance autonomous-agents

Hermes runs unattended — terminal, files, browser, cron. Set identity, policy, and limits before you give it autonomy, and approve the rest inline. Ten minutes, start to governed.

Jul 17, 2026

Your Agent's Bottleneck Is Almost Never the Model

engineering cost reliability

We metered a few hundred real agent runs to study where cost and reliability actually come from. The recurring answer: not the model — the harness around it. An intro to a series of measured findings.

Jul 10, 2026

One Sentence in Our System Prompt Doubled the Agent's Bill

engineering cost

A live spend counter we injected to keep agents on budget was silently disabling prompt caching — and more than doubling the cost of exactly the agents it was meant to protect. Metered before/after inside.

Jul 10, 2026

Your Agent's Last Move Should Be a Tool Call, Not Text

engineering reliability

28% of our agent runs silently returned nothing. The fix wasn't a retry — it was a design rule: agents should deliver results through a tool call, never a free-text turn. Measured data inside.

Jul 10, 2026

Don't Give Up on Cheap Models — Give Up on Fragile Loops

engineering cost reliability

Gemini Flash looked like an unreliable agent orchestrator — it delivered less than half the time. One trace showed why, and one small loop change made it deliver 100% of the time at the same quality as a model 14× more expensive per successful result.

Jul 10, 2026

Why Not Just Use Your Database's Permissions?

architecture identity

Row-level security and database grants already exist — so why do AI agents need a control plane? Where native permissions hold, where they go blind, and what's genuinely missing.

Jul 8, 2026

Inside the ACP Console: My Own Agents, Real Data, Every Screen

cost architecture

The hardest thing to convey about a control plane is what you actually see once agents run through it. So here's my own workspace — the agents behind Calafia, running for real — walked screen by screen. Every number is real.

Jul 2, 2026

Microsoft's Agent Governance Toolkit Validates the Control Plane — and Leaves Out the Meter

architecture cost

We read the AGT source. What Microsoft got right about governing agent actions, what a library structurally can't do, and why splitting governance from cost is the wrong lesson to take.

Jul 1, 2026

When to Use an Agentic Control Plane (and When to Reach for a Sandbox)

architecture security governance

A control plane is a reference monitor — it only holds on a boundary it can completely mediate. Here's honestly where ACP fits, where a sandbox is the right answer, and how they compose.

Jun 28, 2026

What our scheduled agents can't do — and why that's the point

policy control governance

An agent that builds agents ships them to run unattended, taking real actions while no one watches. The control that makes that safe isn't a smarter prompt — it's a deterministic, tier-aware default-deny that decides every tool call before it runs.

Jun 21, 2026

One step is 90% of our agent's model bill — on purpose

cost model-routing agents

We metered every tool call our agent builder makes. A single step accounts for 90% of the model spend. That's not a leak — it's the result of routing each step to the model it actually needs, which you can only do when cost is attributed per call.

Jun 21, 2026

Our agent got quietly worse — only its audit log noticed

observability logs agents

An agent that crashes is the easy case. An agent that silently produces worse output — same inputs, no exception, no alert — is the hard one. How our agent builder's own audit log caught it running the wrong model on the paths nobody watches.

Jun 21, 2026

Hermes Agent is now the easiest coding agent to govern with ACP

governance hermes-agent nous-research

Nous Research's Hermes Agent ships native pre/post tool-call hooks that cover every tool — terminal, file, web, browser, vision, custom skills. No feature flag, no Bash-only gap, no MCP supplement required. Here's what the new hermes-acp plugin does and why Hermes turned out to be the cleanest integration we've built.

May 26, 2026

Build a governed SQL agent that scrubs PII from query results (Python, runnable)

recipe agents-building-agents pii-redaction

A Python AI agent that runs natural-language queries against a Postgres warehouse, with tool-output PII scrubbing happening at the governance layer — not in agent code. The agent never sees raw customer data. ~180 lines.

May 1, 2026

Build a governed customer-support email triage agent (with human-in-the-loop on sensitive sends)

recipe agents-building-agents customer-support

A Python agent that classifies incoming support email, drafts replies, and asks for human approval before any reply that mentions refunds, account closure, or escalations is sent. The 'ask' decision flow, in working code. ~190 lines.

May 1, 2026

Decorator, proxy, hook — three patterns for agent governance, three different scorecards

governance architecture decorator

Why CrewAI + ACP scores 40/48 but Claude Code + ACP scores 43/48 on the same backend. Three integration patterns, three scorecards — where each wins.

Apr 20, 2026

OpenAI Frontier Gets the Problem Right — and Puts the Controls in the Wrong Place

architecture

Frontier names the real bottleneck: agent identity, permissions, audit. But putting those controls inside the model vendor is banks auditing themselves. Where the control layer has to live, and why.

Feb 12, 2026