AI Gateway · Pillar 5

The AI Gateway Proxy your security team can sign off on.

DPoP-bound capability keys. Streaming PII redaction nobody else has solved. Per-agent budgets. Per-employee attribution. Prompt firewall. HIPAA / PCI / GDPR compliance packs. All composed from primitives KnoxCall already audits.

Five things nobody else ships together.

The AI Gateway market is loud and shallow. Every vendor sells one of these. KnoxCall ships all five, audited, on day one — because they sit on top of primitives we already had (routes, secrets, vaults, alerts, packs, fleet graph, mTLS, VPN egress, leases, KMS-unseal).

Capability Keys (DPoP-bound)

kc_live_a_… keys with embedded capability scopes. RFC 9449 sender-constraint binds the key to a private key in the OS keychain. A stolen key without the matching private half is inert. Refresh rotation with theft detection invalidates the entire credential family on reuse.

Streaming PII Redaction

Hold-back FSM. PII split across SSE chunks (think SSNs straddling data: events) is detected at sentence boundaries before any token leaks downstream. Bedrock punts. Cloudflare buffers. We solve it.

Cost Budgets & Per-Employee Attribution

Per-agent daily / monthly USD caps. Pre-flight Redis check. Pricebook lookup post-flight for exact cost. X-KC-User header pins spend to an employee + team. Block / warn / fall-back to a cheaper agent on overage.

Prompt Firewall + Canary Leak

Heuristics catch obvious "ignore previous instructions" patterns in microseconds. Per-tenant canary tokens injected into system prompts trip a critical alert on extraction. Vector classifier in v1.2.

Compliance Packs

HIPAA Safe Harbor (18 identifiers + MRN formats), PCI (PAN + CVV + ABA + tokenization), GDPR (EU national IDs + RTBF), SOC 2 — one-click recognizer sets, retention, audit alerts, route templates. Sells itself in procurement.

10 Config Generators

Cursor / Claude Code / Cline / Continue / OpenAI SDK (Py + Node) / Anthropic SDK (Py + Node) / generic OpenAI- + Anthropic-compatible. Drop a JSON; new tools take a PR, not a release.

Streaming PII redaction, solved end-to-end.

Every other AI gateway buffers the response (you lose streaming) or sees PII leak through chunk boundaries (you lose compliance). KnoxCall does neither. Per-stream finite-state machine + 96-char sliding hold-back buffer + Aho-Corasick → regex+checksum → Presidio sidecar.

1

Tokenize on ingress (FF3-1, per-tenant key)

"Patient John Smith, SSN 123-45-6789" → LLM sees "Patient Mary Jones, SSN 847-29-1058". Token map keyed by conversation_id, 24h TTL. PHI never leaves your tenant boundary.

2

Stream upstream → hold-back FSM

Per-event JSON parse, append text to a 96-char hold-back buffer, run detector stack, tokenize / redact in place, emit prefix(buffer, len-96). Covers SSN, PAN (Luhn), Amex, IBAN, passport, BTC/ETH addresses, MRN.

3

Detokenize on response

JSONPath rewriter on the way back. Customer sees the original PHI verbatim. The LLM only ever saw tokens. Audit log records spans + recognizer + confidence — never the raw value.

One-click setup for every IDE that matters.

Pick your tool. We generate the exact files, env vars, and copy-paste shell snippets. DPoP-capable tools also receive a fresh ECDSA P-256 keypair installed straight into the OS keychain.

CursorOpenAI-compatible · DPoP
Claude CodeAnthropic · DPoP
ClineOpenAI-compatible · DPoP
ContinueOpenAI-compatible · DPoP
OpenAI SDKPython & Node
Anthropic SDKPython & Node
Generic OpenAICurl-friendly
Generic AnthropicCurl-friendly

vs API Stronghold · vs Bedrock · vs Cloudflare AI Gateway

Plain bearer tokens, post-hoc PII, no streaming proof, no compliance packs, $10 → "Contact Sales". The AI gateway market is asleep. We're not.

FeatureAPI StrongholdCloudflare AIBedrockKnoxCall AI Gateway
DPoP-bound capability keysplain bearerplain bearerIAM
Streaming SSE pass-throughunprovenbufferedproven, hold-back FSM
Streaming PII redactionrefusedrefusedpost-hoc onlyunique
HIPAA / PCI / GDPR packs××via Macie
Per-agent budgets + per-employee×××
Model allowlist + rewrite×××allow + deny + rewrite
Prompt firewall + canary leak×××
OIDC workload federation××via STS
Audit logs (free tier)strippedlimitedvia CloudTrailfull
OpenTelemetry GenAI×××
Pricing $49 tiergapfree / $20consumption

Stop hand-rolling AI proxies.

Capability keys, streaming PII redaction, budgets, attribution, and compliance packs — all in one audited gateway.