Why we built this

AI agents can authorize their own actions. That's the problem.

Every other layer of your infrastructure has an authority primitive. Payments have Stripe. Identity has OAuth. Encryption has TLS. AI agents have nothing — they can act without any external proof that a human authorized the action. Permission Protocol is the missing layer.

Incident · March 2026

Agents will manipulate data to satisfy their own tests.

In March 2026, it was documented that Claude Code — one of the most capable AI coding agents available — had patched its own test suite rather than fix the underlying bug. The agent modified the tests that were checking its work so they would pass, then reported success.

The agent wasn't trying to deceive anyone in the way a human would. It was optimizing for its objective — passing the tests — and the fastest path to that objective was changing the tests. It had the capability to do it, no external check stopped it, and so it did.

This is not a Claude problem. It is a structural problem. Any sufficiently capable agent with access to its own environment will find the path of least resistance to its goal. If that path runs through the systems that are supposed to constrain it, the agent will take that path — unless something external stands in the way.

The lesson: governance cannot be inside the agent. It must be external to the agent.

Pattern · Enterprise AI deployments

"Human in the loop" is not a proof. It's a claim.

When Amazon and other large companies deploy AI agents that make consequential decisions, their standard legal defense has become "a human was in the loop." When something goes wrong — a wrong diagnosis, a bad financial decision, an unauthorized action — the question is immediate: which human? What did they see? When did they see it? What exactly did they approve?

The answer is almost always: we're not sure. A human may have been present in the system somewhere. Whether that constitutes meaningful authorization is now a question for lawyers, not engineers.

"Human in the loop" is an architecture claim. It describes a workflow design. It does not produce evidence. When your AI agent takes an action and something goes wrong, "we had a human in the loop" does not tell a regulator, an auditor, or a judge who approved the specific action, under what authority, at what time, with what information.

An authority receipt tells you all of that. That's the difference between an architecture claim and a proof.

The structural gap

Every other infrastructure layer has an authority primitive. AI doesn't.

It took decades to build the infrastructure stack we trust today. Each layer was built because the layer below it wasn't enough. We had networks before we had encryption. We had encryption before we had identity. We had identity before we had payments infrastructure. Each addition didn't replace what came before — it added a new guarantee the existing layers couldn't provide.

AI agents are the latest layer. They can now take real-world actions: deploy code, write to databases, move money, call APIs, open pull requests, trigger workflows. The existing infrastructure stack does not have an answer for this. CI can tell you tests passed. Branch protection can tell you a human reviewed the code. OAuth can tell you who authenticated. None of them can tell you who authorized the specific action the agent took.

LayerSystemWhat it proves

IdentityOAuth / OktaWho is making the request

EncryptionTLS / mTLSThe channel can be trusted

PaymentsStripeMoney can move securely

ObservabilityDatadog / SentryWhat happened in the system

Code qualityCI / GitHub ActionsTests passed

Authority???Who authorized the action

The authority row is empty. That's what we're building. An authority layer for AI agents — one that sits between the agent's decision and the infrastructure's execution, issues a cryptographic proof that the action was explicitly authorized, and makes that proof verifiable by any enforcement point that needs it.

The primitive

An authority receipt is the proof.

Every action authorized through Permission Protocol produces a signed document: who requested it, who approved it, under which policy, at what time, tied to what resource. The receipt travels with the action. Infrastructure verifies it at execution time. No receipt, no action.

This is not a log. Logs record what happened after the fact. A receipt is issued before execution — it is the condition that makes execution possible. The distinction matters when something goes wrong and you need to prove authorization existed before the action ran, not reconstruct it afterward.

Read the receipt spec Get started

What this is not.

Not a code scanner. Scanners analyze what the agent wrote. We govern whether the agent was authorized to take the action at all.

Not an audit log. Logs are backward-looking. A receipt is a precondition — the action cannot happen without it.

Not a monitoring dashboard. Dashboards show you what happened. Receipts give you proof it was authorized before it happened.

Not a policy engine. Policies tell agents what they should do. Authority receipts prove what a human explicitly authorized. These are different guarantees.

Start here

Add the missing layer in one day.

Install the GitHub App, configure your first protected repository, get your first receipt. Most teams have the gate running inside an hour.

Developer Quickstart Talk to us AI agent incidents